Query lcl|Aclame:protein:vir:104439|NCBI_annot:putative virion structural protein|genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Match_columns 404 No_of_seqs 72 out of 89 Neff 5.7 Searched_HMMs 1612 Date Mon Dec 2 01:04:32 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_79 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_79_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:819 Length: 404 # 100.0 3E-185 2E-188 1032.5 36.6 404 1-404 1-404 (404) 2 protein:vir:104439 Length: 404 100.0 3E-185 2E-188 1032.5 36.6 404 1-404 1-404 (404) 3 protein:vir:10123 Length: 404 100.0 3E-185 2E-188 1032.5 36.6 404 1-404 1-404 (404) 4 protein:vir:3298 Length: 404 # 100.0 3E-185 2E-188 1032.5 36.6 404 1-404 1-404 (404) 5 protein:vir:105610 Length: 430 100.0 4E-161 3E-164 899.7 31.8 400 1-404 1-425 (430) 6 protein:vir:93696 Length: 364 100.0 1E-146 6E-150 821.0 30.7 352 1-404 5-362 (364) 7 protein:vir:2770 Length: 318 # 100.0 5E-138 3E-141 773.1 29.7 318 1-318 1-318 (318) 8 protein:vir:95875 Length: 401 99.4 3.7E-14 2.3E-17 94.3 16.4 352 1-404 1-401 (401) 9 protein:vir:80213 Length: 334 99.1 3.2E-11 2E-14 78.2 18.3 322 1-403 1-334 (334) 10 protein:vir:8885 Length: 347 # 99.1 4.4E-11 2.7E-14 77.4 18.6 326 1-402 1-347 (347) 11 protein:vir:10450 Length: 344 99.0 2.4E-11 1.5E-14 78.8 15.5 331 1-401 1-344 (344) 12 protein:vir:94576 Length: 347 99.0 2.3E-11 1.4E-14 78.9 14.9 328 1-402 3-347 (347) 13 protein:vir:3364 Length: 347 # 99.0 6.1E-11 3.8E-14 76.6 16.9 335 1-403 3-347 (347) 14 protein:vir:80180 Length: 381 99.0 1.8E-11 1.1E-14 79.5 13.5 316 1-404 1-344 (381) 15 protein:vir:2201 Length: 345 # 99.0 1.3E-10 8.2E-14 74.7 18.2 333 1-403 1-345 (345) 16 protein:vir:94711 Length: 347 99.0 1.3E-10 7.8E-14 74.9 17.8 331 1-403 3-347 (347) 17 protein:vir:3613 Length: 272 # 99.0 4.6E-11 2.9E-14 77.3 14.6 262 1-404 1-267 (272) 18 protein:vir:78739 Length: 332 99.0 1.2E-10 7.6E-14 74.9 16.7 315 1-401 1-332 (332) 19 protein:vir:96123 Length: 274 98.9 8.2E-11 5.1E-14 75.9 15.0 267 1-404 1-273 (274) 20 protein:vir:105334 Length: 276 98.9 1E-10 6.3E-14 75.4 13.5 261 1-404 1-265 (276) 21 protein:vir:6324 Length: 335 # 98.9 1.8E-09 1.1E-12 68.5 19.8 327 1-404 1-330 (335) 22 protein:vir:94494 Length: 274 98.9 4.9E-10 3E-13 71.6 16.2 261 1-404 1-272 (274) 23 protein:vir:97433 Length: 274 98.9 4.9E-10 3E-13 71.6 16.2 261 1-404 1-272 (274) 24 protein:vir:96262 Length: 274 98.8 2.5E-10 1.5E-13 73.3 14.5 262 1-404 1-265 (274) 25 protein:vir:95898 Length: 274 98.8 2.5E-10 1.5E-13 73.3 14.5 262 1-404 1-265 (274) 26 protein:vir:739 Length: 231 # 98.8 1.7E-10 1E-13 74.2 13.2 227 62-402 1-231 (231) 27 protein:vir:1541 Length: 347 # 98.8 5.5E-10 3.4E-13 71.3 16.1 334 1-403 3-347 (347) 28 protein:vir:78935 Length: 335 98.8 2.7E-09 1.7E-12 67.6 19.5 325 1-404 1-330 (335) 29 protein:vir:93742 Length: 274 98.8 5.8E-10 3.6E-13 71.2 15.6 258 1-404 5-265 (274) 30 protein:vir:94622 Length: 341 98.8 9.1E-09 5.6E-12 64.7 21.4 313 1-404 5-340 (341) 31 protein:vir:1239 Length: 274 # 98.8 7.8E-10 4.8E-13 70.5 15.4 262 1-404 1-265 (274) 32 protein:vir:99675 Length: 324 98.8 3.9E-09 2.4E-12 66.7 18.4 286 56-404 1-299 (324) 33 protein:vir:95107 Length: 270 98.8 1.2E-09 7.7E-13 69.4 15.5 255 1-404 1-260 (270) 34 protein:vir:80930 Length: 278 98.7 1.4E-09 8.5E-13 69.2 15.1 265 1-404 5-272 (278) 35 protein:vir:7990 Length: 273 # 98.7 7.6E-09 4.7E-12 65.1 19.1 265 22-389 1-273 (273) 36 protein:vir:9820 Length: 272 # 98.7 3.6E-09 2.2E-12 66.9 15.5 271 1-404 1-272 (272) 37 protein:vir:3033 Length: 272 # 98.7 3.6E-09 2.2E-12 66.9 15.5 271 1-404 1-272 (272) 38 protein:vir:96833 Length: 275 98.6 1.6E-08 1E-11 63.3 16.5 268 1-395 3-275 (275) 39 protein:vir:97031 Length: 402 98.6 2.7E-08 1.7E-11 62.1 17.5 332 1-404 1-357 (402) 40 protein:vir:105822 Length: 273 98.4 1.9E-07 1.2E-10 57.4 18.1 270 22-403 1-273 (273) 41 protein:vir:102605 Length: 273 98.4 1.9E-07 1.2E-10 57.4 18.1 270 22-403 1-273 (273) 42 protein:vir:100057 Length: 375 98.4 6.9E-08 4.3E-11 59.8 14.8 331 1-404 1-373 (375) 43 protein:vir:103323 Length: 364 98.3 9.2E-08 5.7E-11 59.2 15.5 336 1-404 1-357 (364) 44 protein:vir:5974 Length: 324 # 98.3 1.3E-07 8.3E-11 58.3 16.0 296 18-404 1-324 (324) 45 protein:vir:79008 Length: 299 98.3 8.3E-07 5.2E-10 53.9 19.5 283 22-404 1-289 (299) 46 protein:vir:1583 Length: 351 # 98.3 9E-08 5.6E-11 59.2 13.7 295 1-404 1-330 (351) 47 protein:vir:102944 Length: 330 98.2 1.7E-07 1.1E-10 57.7 13.0 294 1-404 1-328 (330) 48 protein:vir:105645 Length: 400 97.7 2.3E-05 1.4E-08 46.0 18.7 336 1-404 1-366 (400) 49 protein:vir:99075 Length: 392 97.6 2.9E-05 1.8E-08 45.5 17.2 296 22-404 1-308 (392) 50 protein:vir:80446 Length: 367 97.6 3.2E-06 2E-09 50.7 11.1 317 1-404 1-360 (367) 51 protein:vir:7019 Length: 401 # 97.4 5.7E-05 3.6E-08 43.8 16.6 337 1-404 1-358 (401) 52 protein:vir:108303 Length: 418 97.4 5.7E-05 3.5E-08 43.9 16.6 277 20-404 1-292 (418) 53 protein:vir:102655 Length: 322 97.4 6.8E-05 4.2E-08 43.4 18.4 309 1-404 7-322 (322) 54 protein:vir:97331 Length: 319 96.5 0.00057 3.6E-07 38.3 16.5 278 1-404 1-287 (319) 55 protein:vir:94800 Length: 319 96.5 0.00057 3.6E-07 38.3 16.5 278 1-404 1-287 (319) 56 protein:vir:78387 Length: 349 96.3 0.00061 3.8E-07 38.2 13.7 305 1-404 1-340 (349) 57 protein:vir:94989 Length: 349 95.9 0.0012 7.6E-07 36.5 14.1 306 1-404 1-340 (349) 58 protein:vir:78920 Length: 290 95.8 0.0014 8.4E-07 36.3 17.0 277 13-404 1-283 (290) 59 protein:vir:107120 Length: 329 94.1 0.0054 3.3E-06 33.0 17.6 275 1-404 6-298 (329) 60 protein:vir:1781 Length: 221 # 92.8 0.0099 6.1E-06 31.6 12.9 209 106-387 1-221 (221) 61 protein:vir:102335 Length: 312 92.2 0.013 7.8E-06 31.0 19.7 277 22-404 1-300 (312) 62 protein:vir:79712 Length: 285 91.8 0.014 8.5E-06 30.8 10.8 266 1-404 1-276 (285) 63 protein:vir:105464 Length: 346 88.4 0.032 2E-05 28.8 17.9 280 1-404 1-290 (346) 64 protein:vir:8102 Length: 543 # 81.1 0.088 5.5E-05 26.4 14.2 300 1-402 229-543 (543) 65 protein:vir:3525 Length: 423 # 74.4 0.16 9.9E-05 25.0 18.1 289 22-404 1-313 (423) 66 protein:vir:95131 Length: 325 72.6 0.18 0.00011 24.7 13.6 291 15-404 1-301 (325) 67 protein:vir:105522 Length: 423 72.6 0.18 0.00011 24.7 17.1 295 22-404 1-313 (423) 68 protein:vir:9759 Length: 303 # 69.9 0.22 0.00013 24.2 16.4 291 15-404 1-303 (303) 69 protein:vir:105374 Length: 423 69.7 0.22 0.00014 24.2 18.3 294 22-404 1-313 (423) 70 protein:vir:99523 Length: 311 68.9 0.23 0.00014 24.1 11.6 287 3-404 1-303 (311) 71 protein:vir:191 Length: 385 # 62.5 0.33 0.00021 23.2 14.6 295 1-402 67-385 (385) 72 protein:vir:1886 Length: 385 # 62.5 0.33 0.00021 23.2 14.6 295 1-402 67-385 (385) 73 protein:vir:108211 Length: 318 59.0 0.4 0.00025 22.8 9.7 299 1-404 1-312 (318) 74 protein:vir:79928 Length: 393 52.3 0.56 0.00035 22.0 14.2 311 1-404 10-376 (393) 75 protein:vir:102119 Length: 404 47.4 0.7 0.00044 21.4 14.4 296 1-404 88-403 (404) 76 protein:vir:100135 Length: 418 41.5 0.92 0.00057 20.8 13.1 290 1-404 94-410 (418) 77 protein:vir:3136 Length: 322 # 39.7 1 0.00062 20.6 8.9 292 19-404 1-320 (322) 78 protein:vir:41 Length: 299 # N 30.5 1.6 0.00097 19.5 15.4 274 21-404 1-293 (299) 79 protein:vir:4511 Length: 409 # 30.0 1.6 0.001 19.4 13.9 289 1-404 99-400 (409) 80 protein:vir:174 Length: 423 # 29.0 1.7 0.0011 19.3 17.2 291 22-404 1-313 (423) 81 protein:vir:9574 Length: 300 # 28.9 1.7 0.0011 19.3 15.3 286 1-404 1-295 (300) 82 protein:vir:1383 Length: 421 # 27.0 1.9 0.0012 19.1 16.4 296 1-404 83-394 (421) 83 protein:vir:94771 Length: 298 25.7 2 0.0013 18.9 16.4 286 1-404 1-294 (298) 84 protein:vir:81160 Length: 371 24.0 2.2 0.0014 18.7 14.8 293 1-401 62-371 (371) 85 protein:vir:4830 Length: 397 # 22.1 2.5 0.0015 18.4 13.3 291 1-398 79-397 (397) No 1 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=100.00 E-value=2.6e-185 Score=1032.53 Aligned_cols=404 Identities=100% Similarity=1.433 Sum_probs=402.1 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |||||+|+||++|+|||||++++|+|++|+||++++.++++++++++++|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:81 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~ 160 (404) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||++|++||++++||++|+||||+||++. T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:81 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCC Q lcl|Aclame:pro 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~ 240 (404) |++|++|+.+|+.|+++|.|+|+|||++||||++++|++++|+++|+||+++||+++++++++++||+||+++|++++++ T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~ 240 (404) T protein:vir:81 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCccc Q lcl|Aclame:pro 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) Q Consensus 241 ~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~ 320 (404) +|+|||||||+|++|||+|+++++|+|+||+|++|+||++||||+|++||||||+|||||++|||||+++++++|+|+.+ T Consensus 241 ~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~ 320 (404) T protein:vir:81 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) T ss_pred cceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEee Q lcl|Aclame:pro 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) Q Consensus 321 a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idt 400 (404) +++++++++.+|+|+||||||||++|||+++|+||+|+||.+||||+.||++++|+|+||+||++++|++||||||+||| T Consensus 321 a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idt 400 (404) T protein:vir:81 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) T ss_pred cccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecC Q lcl|Aclame:pro 401 AVKL 404 (404) Q Consensus 401 a~~~ 404 (404) |||| T Consensus 401 a~~~ 404 (404) T protein:vir:81 401 AVKL 404 (404) T ss_pred cccC Confidence 9999 No 2 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=100.00 E-value=2.6e-185 Score=1032.53 Aligned_cols=404 Identities=100% Similarity=1.433 Sum_probs=402.1 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |||||+|+||++|+|||||++++|+|++|+||++++.++++++++++++|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~ 160 (404) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||++|++||++++||++|+||||+||++. T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:10 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCC Q lcl|Aclame:pro 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~ 240 (404) |++|++|+.+|+.|+++|.|+|+|||++||||++++|++++|+++|+||+++||+++++++++++||+||+++|++++++ T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~ 240 (404) T protein:vir:10 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCccc Q lcl|Aclame:pro 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) Q Consensus 241 ~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~ 320 (404) +|+|||||||+|++|||+|+++++|+|+||+|++|+||++||||+|++||||||+|||||++|||||+++++++|+|+.+ T Consensus 241 ~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~ 320 (404) T protein:vir:10 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) T ss_pred cceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEee Q lcl|Aclame:pro 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) Q Consensus 321 a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idt 400 (404) +++++++++.+|+|+||||||||++|||+++|+||+|+||.+||||+.||++++|+|+||+||++++|++||||||+||| T Consensus 321 a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idt 400 (404) T protein:vir:10 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) T ss_pred cccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecC Q lcl|Aclame:pro 401 AVKL 404 (404) Q Consensus 401 a~~~ 404 (404) |||| T Consensus 401 a~~~ 404 (404) T protein:vir:10 401 AVKL 404 (404) T ss_pred cccC Confidence 9999 No 3 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=100.00 E-value=2.6e-185 Score=1032.53 Aligned_cols=404 Identities=100% Similarity=1.433 Sum_probs=402.1 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |||||+|+||++|+|||||++++|+|++|+||++++.++++++++++++|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~ 160 (404) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||++|++||++++||++|+||||+||++. T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:10 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCC Q lcl|Aclame:pro 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~ 240 (404) |++|++|+.+|+.|+++|.|+|+|||++||||++++|++++|+++|+||+++||+++++++++++||+||+++|++++++ T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~ 240 (404) T protein:vir:10 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCccc Q lcl|Aclame:pro 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) Q Consensus 241 ~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~ 320 (404) +|+|||||||+|++|||+|+++++|+|+||+|++|+||++||||+|++||||||+|||||++|||||+++++++|+|+.+ T Consensus 241 ~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~ 320 (404) T protein:vir:10 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) T ss_pred cceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEee Q lcl|Aclame:pro 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) Q Consensus 321 a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idt 400 (404) +++++++++.+|+|+||||||||++|||+++|+||+|+||.+||||+.||++++|+|+||+||++++|++||||||+||| T Consensus 321 a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idt 400 (404) T protein:vir:10 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) T ss_pred cccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecC Q lcl|Aclame:pro 401 AVKL 404 (404) Q Consensus 401 a~~~ 404 (404) |||| T Consensus 401 a~~~ 404 (404) T protein:vir:10 401 AVKL 404 (404) T ss_pred cccC Confidence 9999 No 4 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=100.00 E-value=2.6e-185 Score=1032.53 Aligned_cols=404 Identities=100% Similarity=1.433 Sum_probs=402.1 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |||||+|+||++|+|||||++++|+|++|+||++++.++++++++++++|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:32 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~ 160 (404) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||++|++||++++||++|+||||+||++. T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~ 160 (404) T protein:vir:32 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCC Q lcl|Aclame:pro 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~ 240 (404) |++|++|+.+|+.|+++|.|+|+|||++||||++++|++++|+++|+||+++||+++++++++++||+||+++|++++++ T Consensus 161 n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~ 240 (404) T protein:vir:32 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) T ss_pred cccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCccc Q lcl|Aclame:pro 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) Q Consensus 241 ~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~ 320 (404) +|+|||||||+|++|||+|+++++|+|+||+|++|+||++||||+|++||||||+|||||++|||||+++++++|+|+.+ T Consensus 241 ~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~ 320 (404) T protein:vir:32 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLT 320 (404) T ss_pred cceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEee Q lcl|Aclame:pro 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) Q Consensus 321 a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idt 400 (404) +++++++++.+|+|+||||||||++|||+++|+||+|+||.+||||+.||++++|+|+||+||++++|++||||||+||| T Consensus 321 a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idt 400 (404) T protein:vir:32 321 ATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDT 400 (404) T ss_pred cccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecC Q lcl|Aclame:pro 401 AVKL 404 (404) Q Consensus 401 a~~~ 404 (404) |||| T Consensus 401 a~~~ 404 (404) T protein:vir:32 401 AVKL 404 (404) T ss_pred cccC Confidence 9999 No 5 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=100.00 E-value=4.4e-161 Score=899.70 Aligned_cols=400 Identities=33% Similarity=0.552 Sum_probs=371.4 Q ss_pred CC------CcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEE Q lcl|Aclame:pro 1 MT------TVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFS 74 (404) Q Consensus 1 ~~------~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~ 74 (404) || ++|||+|+++||++||+.+.+++++.+++.++.+.....+ +.....+++.++||||++||+|++||+|+|+ T Consensus 1 ~~~a~T~~~~~~p~a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~-~~~~~~~ts~~~pI~r~~dL~K~~GD~Vtf~ 79 (430) T protein:vir:10 1 MTASKTTMRYGDPNAMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDA-EKKTKGQSSLELPIVQAQDLGRNKGDEVRFH 79 (430) T ss_pred CcceeeecccCChhHHHHHHHHHHHHHhhhhhhHHHhhccccccccch-hhhccCCCCCCccEEEeccCCCCCccEEEEe Confidence 65 5799999999999999999999999999999887776655 3456778999999999999999999999999 Q ss_pred EeeccccCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 75 IMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAG 154 (404) Q Consensus 75 L~~~L~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG 154 (404) |++||+|+||+||++||||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||+.|++||++++||++|+|||| T Consensus 80 L~~~L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~R~~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laG 159 (430) T protein:vir:10 80 FVQPANAFPIMGSEYAEGKGTGLKIGSDQLRVNQARFPVDLGDVMSQIRNPYDLRRLGRPKAKWFMDAYLDQSMLVHLAG 159 (430) T ss_pred EeeccccCceecCceeeccccceEEEeeEEEEeeeccccccCCchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccceeeccccccccccccCccCCCCCCceEeccCCc-c-------ccccccccccCHHHHHHHHHHHHhcCCC Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDAT-S-------FEQIEAADIFSIGLVDNLSLFIDEMAHP 226 (404) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at-~-------~~~i~a~D~~s~~~Id~a~~~a~~~~~p 226 (404) +||++.|++|++|+.+|+.|..++.|+|+|||+||||++++.+ + +.+|+++|+||+++||+|+++|+++++| T Consensus 160 arg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~id~a~~~a~~~~~~ 239 (430) T protein:vir:10 160 ARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVVDSIATYMDQIELP 239 (430) T ss_pred hhcccccccccccccCCcchhhhhccccCCCCCceeEeecccccccccccccccchhhhcccCHHHHHHHHHHHHhhCCC Confidence 9999999999999999999999999999999999999966643 3 4579999999999999999999999999 Q ss_pred CCceEecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeee Q lcl|Aclame:pro 227 LQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRF 306 (404) Q Consensus 227 i~Pv~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf 306 (404) |+||+|+|+++++++|+|||||||+|++|||+|+++++| ++|+.|.+ ++|++||||+|++||||||+||||| +|||| T Consensus 240 i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~w-q~~~~a~a-~~g~~nPlF~G~~gm~ngvii~~~~-~virf 316 (430) T protein:vir:10 240 PPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSW-QAAALARA-SNAKQHPIFRVDAGLWSNTLIIKMP-KPIRF 316 (430) T ss_pred CcceEeecccccCCccEEEEEechHHHHHHhhCcchHHH-HHHHHHhh-cccccCCceecceeeecCeEEecCC-ceeee Confidence 999999999999999999999999999999999999999 67887655 4689999999999999999999999 57999 Q ss_pred ccceeEEeecC--ccc----ccccccccccchhhheeeccceeEEEeeec--CCCCcceeecccccCchhHHHHHHHhch Q lcl|Aclame:pro 307 YQGSKVLVSEN--NLT----ATTKEVAAATNIDRAMLLGAQALANAYGQK--AGGHFNMVEKKTDMDNRTEIAISWINGL 378 (404) Q Consensus 307 ~~~~~~~~~~~--~~~----a~~~~~aa~~~v~ralLlGaQAl~~A~g~~--~g~r~~w~Ee~~D~g~~~~i~i~~i~G~ 378 (404) |+++..++|+. +.. +.+..++++++|+|+|||||||+++|||+. +|+||+|+||.+||||+.||++++|+|+ T Consensus 317 ~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~D~g~~~~i~~~~i~G~ 396 (430) T protein:vir:10 317 YAGDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDMDHGDKLELLIGAILGC 396 (430) T ss_pred cCCCccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeeccccCchhhhhhhHHhcc Confidence 99999888774 222 233445667899999999999999999984 7899999999999999999999999999 Q ss_pred hhccccCCCCC---ceEEEEEEEeeeecC Q lcl|Aclame:pro 379 KKIRFPEKSGK---MQDHGVIAVDTAVKL 404 (404) Q Consensus 379 ~K~rF~~~~g~---~~DfGvi~idta~~~ 404 (404) ||+||+++++. ++|||||+||||||| T Consensus 397 kK~rF~~~~~~~~~~~DfGvi~idtaa~~ 425 (430) T protein:vir:10 397 SKIRFAVEATNGLEYTDHGVMAIDTAVKI 425 (430) T ss_pred ceeeecCCCCCCceeeeeEEEEhhhhhhh Confidence 99999987663 689999999999999 No 6 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=100.00 E-value=9.9e-147 Score=821.02 Aligned_cols=352 Identities=35% Similarity=0.503 Sum_probs=326.3 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) -|++|||+|+|+||++||+.+.+++|+.+ .++|+++++|||+++||+|++||+|+|+|++||+ T Consensus 5 ~~~~~~p~a~~~ws~~l~~~~~~~s~f~~-----------------~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 67 (364) T protein:vir:93 5 VIPFGDPKAVKRWSADLAVDVRKKSYFEQ-----------------RFIGTSENAVIQRKTELESDAGDRITFDLSVHLR 67 (364) T ss_pred ccCcCCHHHHHHHHHHHHHHHHhhCcccc-----------------ccccCCCCCcEEEeeecCCCCCceEEeeeeeecc Confidence 56888899999999999977777555322 2679999999999999999999999999999999 Q ss_pred cCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~ 160 (404) |+||+||++||||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||++|++||++++|+++|+||||+||. T Consensus 68 g~gv~Gd~~leGnee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~-- 145 (364) T protein:vir:93 68 GKPTYGDARVEGKEESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGI-- 145 (364) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-- Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999994 Q ss_pred cccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhc------CCCCCceEecC Q lcl|Aclame:pro 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEM------AHPLQPVRLSG 234 (404) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~------~~pi~Pv~~~g 234 (404) .+|...++.|..+++|+|+|||++||||++++|++++|+++|+||+++||+|+++|+++ ++||+||+++| T Consensus 146 ----~~~~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g 221 (364) T protein:vir:93 146 ----NLDFIETPDFTGYAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDG 221 (364) T ss_pred ----ccccccccCcccccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecC Confidence 36677899999999999999999999999999999999999999999999999999998 46799999999 Q ss_pred ccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEe Q lcl|Aclame:pro 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) Q Consensus 235 ~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~ 314 (404) ++ +|||||||+|++|||++++ ++|+|+||+|. .++|++||||+|++||||||+||||+++ |||+.. T Consensus 222 ~~------~yV~~l~p~q~~~Lr~~t~-~~w~d~qk~A~-~~~g~~nPlF~G~~gm~ngvii~~~~~v-i~~~~~----- 287 (364) T protein:vir:93 222 DD------HYVCVMSEYQATDMRTAAG-GTWIDFQKAAA-AAEGRNNPIFKGGLGMINNVVLHKHRNV-IRFNDY----- 287 (364) T ss_pred cc------eeEEEEcchhhhhhhhcCC-HHHHHHHHHhh-hcccccCCceecCeeeEcCeEEeccCCc-cccccc----- Confidence 87 7999999999999999876 67999999984 4689999999999999999999999986 888532 Q ss_pred ecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEE Q lcl|Aclame:pro 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHG 394 (404) Q Consensus 315 ~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfG 394 (404) ++ ++.++|+|+|||||||+++|||+++|+||+|+||.+||||+.||++++|+|+||+||+++ ||| T Consensus 288 -----~~-----~~~v~~~ralllGaQA~~~a~g~~~g~~~~w~Ee~~D~gn~~~i~~~~i~G~kK~rF~~~-----DfG 352 (364) T protein:vir:93 288 -----GA-----GANVEAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAIAAGFIAGMKKARFNNK-----DFG 352 (364) T ss_pred -----cc-----CccccchhhheecceeeEEEeecCCCCCceeeecccCCCCchhhhhhhHhhhhhcccCCc-----cce Confidence 22 234678999999999999999999999999999999999999999999999999999764 999 Q ss_pred EEEEeeeecC Q lcl|Aclame:pro 395 VIAVDTAVKL 404 (404) Q Consensus 395 vi~idta~~~ 404 (404) ||+||||||+ T Consensus 353 vi~idtaa~~ 362 (364) T protein:vir:93 353 VISIDTAAKK 362 (364) T ss_pred EEEecccccc Confidence 9999999999 No 7 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=100.00 E-value=5.4e-138 Score=773.15 Aligned_cols=318 Identities=97% Similarity=1.425 Sum_probs=312.0 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||.+...+++++|++||||++++|+|+||+|+++++.+++++++|++++|+++++|||+++||+|++||+|+|+|++||+ T Consensus 1 mt~~~~~~~~~~~~~~~ft~~~~~~~~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf~L~~~L~ 80 (318) T protein:vir:27 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) T ss_pred CCccCCCChHHHHHHHHHHHHhcCChHHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEEeEeeccc Confidence 98877777777899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (404) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~ 160 (404) |+||+||+++|||||+|+|++|+|+|||.||+|+++|+|+||||+||||++||++|++||++++||++|+||||+||+|+ T Consensus 81 g~gv~Gd~~lEGnee~L~~~~d~l~IDq~r~~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laGarg~~~ 160 (318) T protein:vir:27 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFV 160 (318) T ss_pred cCccccCceeeccccceEEEeeEEEEeeeccccccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCC Q lcl|Aclame:pro 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (404) Q Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~ 240 (404) |++|++|+.+|++|+++|+|+++|||++||||+|++|++++|+++|+||+++||+++++++++++||+||+++|++++++ T Consensus 161 n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~lid~~~~~~~~~a~pi~PV~v~g~~~~~~ 240 (318) T protein:vir:27 161 ADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE 240 (318) T ss_pred cccceEecccCccchhhhhcccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceeeccccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCc Q lcl|Aclame:pro 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENN 318 (404) Q Consensus 241 ~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~ 318 (404) +|+|||||||+|++|||+|+++++|++|||+|++|++|++||||+|++||||||+|||||+||||||+|++++++.-. T Consensus 241 ~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIrf~~G~~v~~~~~~ 318 (318) T protein:vir:27 241 DPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGQRFWYQRIT 318 (318) T ss_pred cceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCccEEEcCCCeeeeeecC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999887655 No 8 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=99.38 E-value=3.7e-14 Score=94.25 Aligned_cols=352 Identities=12% Similarity=0.142 Sum_probs=188.6 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHH--HHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeec Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNI--LTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK 78 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~ 78 (404) |--|-.|.--. +=+-.+-++|.++. |-+++....++.-.+..+ -...++-|+.|-+|.|.-..+ T Consensus 1 ~~~~~a~~~~~-----~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~f---------A~~~piPkn~GkTIk~r~y~p 66 (401) T protein:vir:95 1 MLNYNAPTDGQ-----KSSIDGANSDQMQTFFWLKKAIITARKEQYFMPL---------ASVTNMPKHYGKTIKVYEYVP 66 (401) T ss_pred CCccCCCcccc-----cccccccccceeeehhhHHHHHhhhhhhhhhhhc---------ccccccccccCCeEEEEeccc Confidence 55554443211 11233445554443 556655555554333222 234566689999999988887 Q ss_pred cccC--ceecCceeee------ehh------------hhhhcccEEEEeeeccc-cccCcchhhhhhhhhHHHHHHHHHH Q lcl|Aclame:pro 79 LSKR--PTMGDERVEG------RGE------------DLSHADFSLKINQGRHL-VDAGGRMSQQRTKFNLASSARTLLG 137 (404) Q Consensus 79 L~G~--gv~Gd~~leG------nee------------~L~~~sd~v~Idq~R~~-V~~~gkms~qrs~~dlr~~ar~~L~ 137 (404) |.-. |....-..+| |.= =+......=+||+..+. ++..|++.|-=--..|-.+.-+--. T Consensus 67 l~~~~~pl~eGv~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~ 146 (401) T protein:vir:95 67 LLDDRNINDQGIDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDS 146 (401) T ss_pred ccccccchhcCCCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhc Confidence 7532 2222222222 100 01111122234444332 1233333321000111111100000 Q ss_pred H-HHHHHHHHHHHHHhhhcccccccccceeeccccccccccccCccCCCCCCceEeccCCcccccc----ccccccCHHH Q lcl|Aclame:pro 138 T-YFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQI----EAADIFSIGL 212 (404) Q Consensus 138 ~-w~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i----~a~D~~s~~~ 212 (404) | =+.+-+++++ |.|..+..- +. . .+++.+ ...-++|++.+++.+++ .+...++++. T Consensus 147 D~~l~~h~s~el---l~g~~~~t~--d~--------i-----~~dll~-ag~~viyAg~ats~At~~~~~~~~t~vt~~~ 207 (401) T protein:vir:95 147 DDGLMEHLSREL---MNGATQITE--AV--------L-----QKDLLA-AAGTVLYAGAATSDATITGEGSTPSVVSYKN 207 (401) T ss_pred chHHHHHHHHHH---hhhhhhhHH--HH--------H-----HHHHHh-hcCeeecCCccceeeeccccccccceechhH Confidence 0 0000011111 122211000 00 0 011110 01126777777777755 4678999999 Q ss_pred HHHHHHHHHhcCCCCCceEecCccccC---CccEEEEEEch------HHHHHHHhCcchHHHHHHHHHhhhccccccCcc Q lcl|Aclame:pro 213 VDNLSLFIDEMAHPLQPVRLSGDELHG---EDPYYVLYVTP------RQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPL 283 (404) Q Consensus 213 Id~a~~~a~~~~~pi~Pv~~~g~~~~~---~~~~yV~~l~P------~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPl 283 (404) |.++...+..-+.|.+-..+.|-.+.+ ..+.||.|||| +.++||..||+ |.+.+|||.+ .++ T Consensus 208 l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~---fi~v~kYa~~------~~i 278 (401) T protein:vir:95 208 LMRLDQILTENRTPTQTTIITGSRMIDTKVIGATRVMYVGSELVPELKAMKDLFGNKA---FIETQHYADA------GTI 278 (401) T ss_pred HHHHHHHHHhcccccchhhhhhhhccCccccccceEEEEecCchhHHHHHHHhcCCCC---ceehhhcCCc------ccc Confidence 999999998877776555555443333 45789999999 88888889995 8899999744 679 Q ss_pred eeCCeEEEcCEEEEecCceeeeec-cceeEEeecCcccccccccccccchhhheeeccceeEEEeeecCCC--Cccee-- Q lcl|Aclame:pro 284 FKGECAMWRNILVRKYAGMPIRFY-QGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGG--HFNMV-- 358 (404) Q Consensus 284 F~G~~gm~ngvii~~~~~~~irf~-~~~~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~--r~~w~-- 358 (404) |.||+|.++|+.+...|.+ ..|. +|.-..-.+..+...........+|--.|.||.+|.+..-=+.+|+ .|... T Consensus 279 ~~gEiG~i~~vR~i~~p~~-~~w~~ag~~a~~~~~~y~~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk 357 (401) T protein:vir:95 279 MNGEVGSIDKFRIIQVPEM-LHWAGAGAQATGANPGYRTSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTK 357 (401) T ss_pred ccccccccCceeEEecccc-eeecCCcccccccccccccccccCCCcceeeeeeEEccccceecccccCCccccceeEee Confidence 9999999999999988875 3343 2221222222222222333445678889999999977664333332 12222 Q ss_pred -------ecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEeeeecC Q lcl|Aclame:pro 359 -------EKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 359 -------Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idta~~~ 404 (404) +...-||..--+++++.++...++ +-|.+ .|-|++|| T Consensus 358 ~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~--------~e~m~-~ies~a~~ 401 (401) T protein:vir:95 358 MPGKETADRNDPYGETGFSSIKWYYGILVKR--------PERLA-LIKTVAPL 401 (401) T ss_pred cCCcCCCCCCCcccceehhhhhhhhhhheec--------cceeE-EEEeecCC Confidence 223446777789999999988886 24554 89999999 No 9 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.09 E-value=3.2e-11 Score=78.16 Aligned_cols=322 Identities=14% Similarity=0.069 Sum_probs=178.4 Q ss_pred CCCc-CchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTV-TSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~~~-~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) |+.. +.-+-...+++ .....+-++|.|++.+....+.++.|.. -+++.++ ..|+++.|+-+... T Consensus 1 m~~~~~~~~t~~~~~~----~~~~~~l~le~~~geV~~af~~~s~~~~---------~~~~r~i--~~G~s~~~~~iG~~ 65 (334) T protein:vir:80 1 MTYPAANTHTRPGWGG----ANSDVSLHIEEHLGLVDASFMYSSKFAS---------WMNVRSL--RGTNQLRVDRVGAS 65 (334) T ss_pred CCCCcCCCcccccccc----ccchheehhhhhhhHHHHHHHHhhhhhc---------cceeeec--cccceEEEeeecce Confidence 6554 32222222221 1112222589999998777776655431 2222344 34999999988888 Q ss_pred ccCceecCceeeeehhhhhhcccEEEEeee---ccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhh-hc Q lcl|Aclame:pro 80 SKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA-GA 155 (404) Q Consensus 80 ~G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~la-G~ 155 (404) +-...+=++.+.++ .+.....+|.||+. ||.|+ .+++-...+|+|.+.-.....=+++..|+.+|..|. ++ T Consensus 66 ~~~~~~~g~~l~~~--~~~~~~~~l~ID~~l~~~~~Vd---diD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa 140 (334) T protein:vir:80 66 TIAGRKAGEELVVQ--KNVSDKLNLTVDTVLYARHFFD---KFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCG 140 (334) T ss_pred eeeeecCCCCCCCC--CcccCceEEEEeeeeehhhhHh---hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 76666667777754 57778889999995 45554 478888899999999999999999999999998865 44 Q ss_pred ccccccccceeeccccccccccccCccCCCCCCceEeccCCcccccc--cccc-ccCHHHHH----HHHHHHHhcCCCCC Q lcl|Aclame:pro 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQI--EAAD-IFSIGLVD----NLSLFIDEMAHPLQ 228 (404) Q Consensus 156 rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i--~a~D-~~s~~~Id----~a~~~a~~~~~pi~ 228 (404) +. ..|....+.|. .+......+ ++++ .-+.+.|- .|.+.+.+..-|-. T Consensus 141 ~~-------~~~~~~~~~~~------------------~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~ 195 (334) T protein:vir:80 141 DF-------LAPAHLKPAFH------------------DGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQ 195 (334) T ss_pred hh-------ccccccccccc------------------CCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCC Confidence 42 11111111110 010000000 1111 11233333 34444444333311 Q ss_pred ceEecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeecc Q lcl|Aclame:pro 229 PVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQ 308 (404) Q Consensus 229 Pv~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~ 308 (404) |. . + ++++|.|.|++.|..++.+ .+ +.-...+..+++=.|.++.|+|+.|.+-+++|.. . T Consensus 196 ~~----~------~-R~~vv~P~~y~~Ll~~~r~---~n----~d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~--~ 255 (334) T protein:vir:80 196 LM----S------E-GVTLLDPVIFSFLLEHDRL---MN----VEFGAKEGGNSFVGGRIAMLNGVRVVETPRFPQS--A 255 (334) T ss_pred cC----C------c-eEEEeChHHHHHHhccccc---cc----ceeccccccccccceeEEEEeceEEEeecCCCCc--c Confidence 10 1 1 7999999999999999843 11 1111123357888899999999999998887731 1 Q ss_pred ceeEEeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCC Q lcl|Aclame:pro 309 GSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSG 388 (404) Q Consensus 309 ~~~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g 388 (404) + .++..+...+.++....-.-+++....|++.+-...--+...+.++.+ -.-|-....+|.+=+|= T Consensus 256 ~-----t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~----~d~i~~~~a~G~g~lRP----- 321 (334) T protein:vir:80 256 I-----TANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDF----GHYLDTFQSYNIGQRRP----- 321 (334) T ss_pred c-----cccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechhhH----HHHHHHHHHcCCceecc----- Confidence 1 111112222222222222234666777777665442122333332221 11344445667666663 Q ss_pred CceEEEEEEEeeeec Q lcl|Aclame:pro 389 KMQDHGVIAVDTAVK 403 (404) Q Consensus 389 ~~~DfGvi~idta~~ 403 (404) +=.+|+-|+.--| T Consensus 322 --eaa~vv~~~~~~~ 334 (334) T protein:vir:80 322 --DAVAVHDITVTNP 334 (334) T ss_pred --ceEEEEEEeeecC Confidence 2345555554444 No 10 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.08 E-value=4.4e-11 Score=77.37 Aligned_cols=326 Identities=10% Similarity=0.063 Sum_probs=175.6 Q ss_pred CCCcCchHHHHHHHHHHHHHhhcc-------chHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEE Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRN-------RSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTF 73 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f 73 (404) |- +...+.-|=|..+.. +-+++.|.+.+...-.+.+.+.. .++..++ ..|.+|.| T Consensus 1 ~a-------~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~---------~~~~r~i--~~G~sv~~ 62 (347) T protein:vir:88 1 MA-------NATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMD---------KHMVRTI--QNGKSASF 62 (347) T ss_pred CC-------CcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhh---------ccccccc--cCcceEEE Confidence 21 111111111222222 11589999988766665544322 2222334 35999999 Q ss_pred EEeeccccCceecCceeeeehhhhhhcccEEEEeee---ccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 74 SIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIV 150 (404) Q Consensus 74 ~L~~~L~G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~ 150 (404) +-+...+....+-.+.+.+..+++.....+|.||+. +|.|+ ..++-...+|+|++.......=|++..|+.+|. T Consensus 63 ~~iG~~~~~~~~~g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vd---d~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~ 139 (347) T protein:vir:88 63 PVMGRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIY---DIEDAMNHYDVRAEYSAQLGEALAIAADGAVLA 139 (347) T ss_pred eeecceeeeeeccccCCCCCCCCCccceEEEEEechhhhhhhhh---hHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHH Confidence 999999888877778888888889999999999997 66776 578888999999999999999999999999999 Q ss_pred HhhhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccC---HHHHHHHHHHHHhcCCCC Q lcl|Aclame:pro 151 HLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFS---IGLVDNLSLFIDEMAHPL 227 (404) Q Consensus 151 ~laG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s---~~~Id~a~~~a~~~~~pi 227 (404) +++.+...... .++..+ +-..-+..+.+..++ ++...... ++.|-.|.+..++..-| T Consensus 140 ~l~~~a~~~~~--------~~~~~~---------g~~~~~~~~~~~~~~--~~~~~~~~~~~~~~i~~a~~~Lde~~VP- 199 (347) T protein:vir:88 140 EMAKLCNLPAA--------SNENIA---------GLGQAVVLNIGAAAD--LVDVEARGKAILKGLTLARARLTKNYVP- 199 (347) T ss_pred HHHHhhccccc--------cccccC---------Ccccccccccccccc--ccchhhhHHHHHHHHHHHHHHHhhcCCC- Confidence 98654321100 011111 000000000011111 11111111 34555566666654433 Q ss_pred CceEecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeec Q lcl|Aclame:pro 228 QPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFY 307 (404) Q Consensus 228 ~Pv~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~ 307 (404) .+ . ++++|.|.|+.+|.+++.+.. ... .....+-.|.+|.++|+-|.+.+++|..-. T Consensus 200 ------~~------g-R~~vv~P~~y~~Ll~~~~~~~--------~~~--~~~~~~~~G~vg~i~G~~V~~s~nlp~~~~ 256 (347) T protein:vir:88 200 ------AG------D-RRFYCAPEDYSAILSALMPNA--------ANY--AALIDPETGNIRNVMGFEVIEVPHLTVGGA 256 (347) T ss_pred ------CC------C-CEEEeCHHHHHHHhcchhhhh--------hhh--ccccchhcceeeeeccceEEEeeccccccc Confidence 11 1 678899999999999874321 111 123457789999999999999999884211 Q ss_pred cceeEEeecCc----ccccc---cccccccchhhheeeccceeEEEeeecCCCCcceeecccccCch-hHHHHHHHhchh Q lcl|Aclame:pro 308 QGSKVLVSENN----LTATT---KEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLK 379 (404) Q Consensus 308 ~~~~~~~~~~~----~~a~~---~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~-~~i~i~~i~G~~ 379 (404) .........+. ..... ..+.....-.-+|++-..|++.+ + ...-+.|-..|-.++ ..|-....+|.+ T Consensus 257 ~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v--~---~~d~~~e~~r~~~~~~d~i~~~~~~G~~ 331 (347) T protein:vir:88 257 GDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTV--K---LKDMALERARRPEFQADQIIGKYAMGHG 331 (347) T ss_pred ccccccccccccccccccccccccccccccCcEEEEEechhhhhhe--e---cccceeeeeechhhHHHHhhhhhhhcCc Confidence 00000000000 00000 00000000001122222222221 1 111234444444332 245566777777 Q ss_pred hccccCCCCCceEEEEEEEeeee Q lcl|Aclame:pro 380 KIRFPEKSGKMQDHGVIAVDTAV 402 (404) Q Consensus 380 K~rF~~~~g~~~DfGvi~idta~ 402 (404) =+|- +=-++|.+..+| T Consensus 332 ~~rP-------e~a~~~~~~~a~ 347 (347) T protein:vir:88 332 GLRP-------EAAGALVFTPAA 347 (347) T ss_pred eecc-------ceEEEEEeCCCC Confidence 6664 224566666666 No 11 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.04 E-value=2.4e-11 Score=78.78 Aligned_cols=331 Identities=11% Similarity=0.079 Sum_probs=177.0 Q ss_pred CC-CcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MT-TVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~-~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) |. ..|-++++-.+-.+-.......+-++|.|.+.+..--...+-+. .-+++.+++ .|.++.|+-+... T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~---------~~~~~r~i~--~g~s~~~~~iG~~ 69 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTT---------SRHMVRSIS--SGKSAQFPVLGRT 69 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhc---------ccceeeeec--ccceEEEEeecee Confidence 44 33444444444443333333444468999999877777665543 222233454 4999999999888 Q ss_pred ccCceecCceeeeehhhhhhcccEEEEeee---ccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 80 SKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGAR 156 (404) Q Consensus 80 ~G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~r 156 (404) +-...+-.+.+.|.-+++.-...+|.||+. |+.|+ .+++..+.+|+|.+.-.....=+++..|+.++.+++.+. T Consensus 70 ~~~~~~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~Vd---DiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a 146 (344) T protein:vir:10 70 QAAYLAPGENLDDIRKDIKHTEKVITIDGLLTADVLIY---DIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLC 146 (344) T ss_pred EEEeeecCCCCCCCCCCcccceEEEEEcchhhhhhhhh---hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 766777777788887788888899999995 46666 578999999999999999999999999999999986544 Q ss_pred cccccccceeeccccccccccccCccCCCCCCceEeccCCcccccccccccc-----CHHHHHHHHHHHHhcCCCCCceE Q lcl|Aclame:pro 157 GDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIF-----SIGLVDNLSLFIDEMAHPLQPVR 231 (404) Q Consensus 157 g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~-----s~~~Id~a~~~a~~~~~pi~Pv~ 231 (404) .. .-|....+.+. ++. ...... ...... ++.. -++.|..|.+.+++..-| T Consensus 147 ~~------~~~~~~~~~g~---------~~~--~~~~~~--~~~~~~-t~~~~~~~~~~~~i~~a~~~Lde~~VP----- 201 (344) T protein:vir:10 147 NV------ESQYNENITGL---------GTA--TVIETT--QDKTTL-TDQVALGKEIIAALTKARAALTKNYVP----- 201 (344) T ss_pred cc------ccccccccccc---------ccc--ceeecc--cccccc-cchhhhHHHHHHHHHHHHHHHhhcCCC----- Confidence 31 11111111110 000 000000 000011 1111 134566666777665433 Q ss_pred ecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeecccee Q lcl|Aclame:pro 232 LSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSK 311 (404) Q Consensus 232 ~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~ 311 (404) .+ . +++++.|.|+..|+.++.+- .. .-+..+.+=+|.+|.++|+.|.+.++.|.....+.. T Consensus 202 --~~------g-R~~vv~P~~y~~Ll~~~~~~------~~----~~~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~ 262 (344) T protein:vir:10 202 --SS------D-RVFYCDPDSYSAILAALMPN------AA----NYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSR 262 (344) T ss_pred --cc------C-CEEEeChHHHHHHhhccccc------cc----ccccccceeeeEEEEEeceEEEeccccccccCCccc Confidence 11 1 56889999999999997431 11 113456677999999999999999987743211100 Q ss_pred EEeecCcccccccccccccchh----hheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCC Q lcl|Aclame:pro 312 VLVSENNLTATTKEVAAATNID----RAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKS 387 (404) Q Consensus 312 ~~~~~~~~~a~~~~~aa~~~v~----ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~ 387 (404) .-...+.+ +..........++ .+|+|=.-|++.+=-...-+...|.|+. .+ ..|-....+|.+=+|- T Consensus 263 ~~~tg~~~-~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~--~~--d~i~g~~~~G~~vlRP---- 333 (344) T protein:vir:10 263 EGTTGQKH-AFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRANF--QA--DQIIAKYAMGHGGLRP---- 333 (344) T ss_pred ccccCccc-cccCCcccceeeecceeEEEeechhhhhhhhhccceeecccchhH--HH--HHHHHHhhcccceecc---- Confidence 00000000 0000000001111 1222222222211111101111121111 11 1344455666665553 Q ss_pred CCceEEEEEEEeee Q lcl|Aclame:pro 388 GKMQDHGVIAVDTA 401 (404) Q Consensus 388 g~~~DfGvi~idta 401 (404) +=-|+|-+=|. T Consensus 334 ---e~a~~v~~~~~ 344 (344) T protein:vir:10 334 ---EAAGAVVFKTK 344 (344) T ss_pred ---cceEEEEeecC Confidence 12234333333 No 12 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.02 E-value=2.3e-11 Score=78.91 Aligned_cols=328 Identities=11% Similarity=0.071 Sum_probs=176.9 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) =|+.|.. ...++..+= +......-++|.|++.+...-...+.+.... +..+++ .|+++.|+-+...+ T Consensus 3 ~~~~~~~-~~t~~g~~~-~~~d~~al~ie~~~geV~~~f~~~s~~~~~~---------~~rti~--~G~sv~~~~iG~~~ 69 (347) T protein:vir:94 3 NMNGGQQ-MGKDQGKGM-SAGDKLALFLKVFGGEVLTAFTRTSVTMNKH---------LVRSIQ--SGKSAQFPVLGRTK 69 (347) T ss_pred ccccccc-cccccccCC-cccchHHHHHHHHhHHHHHHHHHHHhhhhhh---------hheecc--ccceEEeeecccee Confidence 0111211 112221110 0000011158999998876666665553222 222343 49999999999988 Q ss_pred cCceecCceeeeehhhhhhcccEEEEeee---ccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg 157 (404) -...+-.+.+.+.-+++.....+|.||+. ||.|+ .+++-...+|+|.+.-.....=|++..|+.+|.+|+-+.. T Consensus 70 ~~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~Vd---diD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~ 146 (347) T protein:vir:94 70 AAYLQPGENLDDKRKDMKHTEKTINIDGLLTADVLIY---DIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCN 146 (347) T ss_pred EeeeecCcCCCCCcCCccccceEEEEcchhhhhhhhh---hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 87777778888777889999999999996 45565 5788888999999999999999999999999988754332 Q ss_pred ccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccC----HHHHHHHHHHHHhcCCCCCceEec Q lcl|Aclame:pro 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFS----IGLVDNLSLFIDEMAHPLQPVRLS 233 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s----~~~Id~a~~~a~~~~~pi~Pv~~~ 233 (404) . ......|..- ..... .+.+ +. .+.++.+..-+ ++.|.+|...+++..-| T Consensus 147 ~--~~~~~~~~~g--~~~~~-~v~i----------~~----~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP------- 200 (347) T protein:vir:94 147 L--PTANNENIAG--LGKAH-VLEV----------GD----QATLQGDQVKLGQAIIAQLTLARAKLTGNYVP------- 200 (347) T ss_pred c--cccccccccc--CCcce-eEee----------ec----cccccccccccHHHHHHHHHHHHHHhhhcCCC------- Confidence 1 0000000000 00000 0000 00 01112111111 34455555666554432 Q ss_pred CccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEE Q lcl|Aclame:pro 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVL 313 (404) Q Consensus 234 g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~ 313 (404) . .+ +++++.|.|+..|.+..... . ..++..+.+=+|.++.++|+.|.+.+++|+.-. + ... T Consensus 201 ~------~~-R~~vv~P~~y~~LLk~~~~~--------~--~~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~-~-~~~ 261 (347) T protein:vir:94 201 S------SD-RVFYTTPDNYSAILAALMPN--------A--ANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGA-G-DNR 261 (347) T ss_pred C------CC-CEEEeChHHHHHHHHhhccc--------c--cccccccccccceeEEeeceEEEEcCccccccC-c-ccc Confidence 1 12 78999999999999754211 1 112334566689999999999999999875211 1 100 Q ss_pred eecCccccc---------ccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchh-HHHHHHHhchhhccc Q lcl|Aclame:pro 314 VSENNLTAT---------TKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRT-EIAISWINGLKKIRF 383 (404) Q Consensus 314 ~~~~~~~a~---------~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~-~i~i~~i~G~~K~rF 383 (404) .+.....++ .+.+.....-..+|++-..|++.+=....-+.. .+|..++. .|-....+|..=.|- T Consensus 262 ~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~-----~~~~~~~~~~i~~~~a~G~g~~rP 336 (347) T protein:vir:94 262 AEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALER-----ARRANFQADQIIAKYAMGHGGLRP 336 (347) T ss_pred cccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceee-----eechhhhhhhhhhhhhhcCccccc Confidence 000000000 011111111122566655555544222111112 23332222 455566777776665 Q ss_pred cCCCCCceEEEEEEEeeee Q lcl|Aclame:pro 384 PEKSGKMQDHGVIAVDTAV 402 (404) Q Consensus 384 ~~~~g~~~DfGvi~idta~ 402 (404) |..+..+.++| T Consensus 337 --------e~a~~i~~~~a 347 (347) T protein:vir:94 337 --------EACGALVFKKA 347 (347) T ss_pred --------ceeEEEEecCC Confidence 67766666666 No 13 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.02 E-value=6.1e-11 Score=76.57 Aligned_cols=335 Identities=11% Similarity=0.081 Sum_probs=178.8 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccc-hHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNR-SMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) =|+.| .++.-++..+ -..+-.. -+++.|++.+...-.+.+.+.... +..++ ..|++|.|+-+... T Consensus 3 ~~~~~-~~~~t~~g~~--~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v---------~~r~~--~~G~sv~i~~iG~~ 68 (347) T protein:vir:33 3 NIQGG-QQIGTNQGKG--QSAADKLALFLKVFGGEVLTAFARTSVTMPRH---------MLRSI--ASGKSAQFPVIGRT 68 (347) T ss_pred CCccC-cccccccccC--CcccchHHHHHHHHHHHHHHHHHHHHhhhhhh---------ccccc--cccceeEeeeccce Confidence 12222 1121222222 0011111 258999998877777665543222 22233 24999999999999 Q ss_pred ccCceecCceeeeehhhhhhcccEEEEeeec---cccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 80 SKRPTMGDERVEGRGEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGAR 156 (404) Q Consensus 80 ~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R---~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~r 156 (404) +-...+..+.+.|+.++......+|.||+.. +.|+ .+++-...+|+|.+.-.....=+++..|+.++.+++.+. T Consensus 69 t~~~~~~g~~l~~~~~~~~~~e~~ltiD~~~y~~~~Vd---diD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~ 145 (347) T protein:vir:33 69 KAAYLKPGENLDDKRKDIKHTEKVIHIDGLLTADVLIY---DIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLV 145 (347) T ss_pred eeeeecCCCCCCCCCCCCccceEEEEechhhhhhHHHh---hHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 9888888889999988999999999999885 5665 467888899999999999999999999999999987665 Q ss_pred cccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCcc Q lcl|Aclame:pro 157 GDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDE 236 (404) Q Consensus 157 g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~ 236 (404) +........ .+.+. +..-.|.. .++..+.......++ .-++.|..|.+.+.+..-| T Consensus 146 ~~~~~~~~~-----~~~~~----~~~~~~~~----~~~tg~~~d~~~~a~-~i~~~i~~a~~~Lde~~VP---------- 201 (347) T protein:vir:33 146 NLPDGSNEN-----IEGLG----KPTVLTLV----KPTTGSLTDPVELGK-AIIAQLTIARASLTKNYVP---------- 201 (347) T ss_pred hhhcccccc-----ccccc----cccccccc----ccccccccchhhhHH-HHHHHHHHHHHHHhhcCCC---------- Confidence 422111111 01111 10000000 000000000011111 1145566666777665433 Q ss_pred ccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeec Q lcl|Aclame:pro 237 LHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSE 316 (404) Q Consensus 237 ~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~ 316 (404) . +. ++++++|.|+..|..++.+. + +.. +....+-+|.+|.|+|+-|.+.+++|... .......+ T Consensus 202 --~-~g-R~~vv~P~~y~~Ll~~~~~~---~----~d~---~~~~~~~~G~V~~i~G~~V~~Sn~lp~~~--~~~~~~~~ 265 (347) T protein:vir:33 202 --A-AD-RTFYTTPDNYSAILAALMPN---A----ANY---QALLDPERGTIRNVMGFEVVEVPHLTAGG--AGDTREDA 265 (347) T ss_pred --c-cC-cEEEeCHHHHHHHhcccccc---c----ccc---ccccccccceeEEEeceeEEEecccccCc--cccccccc Confidence 1 11 57889999999999998532 1 111 12345788999999999999999887421 11110000 Q ss_pred -----Cccccc-ccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCc Q lcl|Aclame:pro 317 -----NNLTAT-TKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKM 390 (404) Q Consensus 317 -----~~~~a~-~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~ 390 (404) +.+.+. +......+...-+|++-..|++.+=...--+.-.|.++. + -..|-....+|.+=+| +. T Consensus 266 ~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~~--~--~d~i~~~~~~G~~vlr-P~----- 335 (347) T protein:vir:33 266 PADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANY--Q--ADQIIAKYAMGHGGLR-PE----- 335 (347) T ss_pred cccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchhh--h--hHhhhhhhhcCCceec-cc----- Confidence 000000 001111122233567777777655433111112232211 1 1233344445555444 11 Q ss_pred eEEEEEEEeeeec Q lcl|Aclame:pro 391 QDHGVIAVDTAVK 403 (404) Q Consensus 391 ~DfGvi~idta~~ 403 (404) =-++|.+.-... T Consensus 336 -~av~i~~~~~~~ 347 (347) T protein:vir:33 336 -AAGAIVLPKVSE 347 (347) T ss_pred -ceEEEecCCCCC Confidence 112222211111 No 14 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.00 E-value=1.8e-11 Score=79.49 Aligned_cols=316 Identities=12% Similarity=0.079 Sum_probs=160.4 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccch----HH-HHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEE Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRS----MV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~----~~-~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L 75 (404) |-.+ |.-+=|-..+.+.+ ++ ++|++.+...-.+..-+ ..+....+++-..||+|+|+- T Consensus 1 ~~~~--------~~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~---------~~l~~~~~~~~~~GdTV~ip~ 63 (381) T protein:vir:80 1 MATI--------QGTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAA---------LEATKKIPFEGKKGDLIHIPN 63 (381) T ss_pred Ccee--------cccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhh---------hhccccccceeecCceEEeec Confidence 2111 12233322223322 22 56666554333222221 122334566667899999987 Q ss_pred eeccccCceecCceeeeehhhhhhcccEEEEeeeccc-cccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 76 MHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHL-VDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAG 154 (404) Q Consensus 76 ~~~L~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~-V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG 154 (404) ....+-.....+..+. .+++...+.+|.||+.+.. +.+. .+++..+..|+|.+....+...+++..|+.++-.++. T Consensus 64 ~g~~~a~d~~~g~~i~--~~~~~~~~~~itID~~~~~~~~Id-d~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~ 140 (381) T protein:vir:80 64 ISRAAVYDKQPQTPVN--LQARTDSEFTFTVTKYKESSFMIE-DIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAV 140 (381) T ss_pred cCcceeeeecCCCccc--ccccCCceEEEEEeeeeecceeec-hHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 6655444444444444 4567778889999998754 5453 6788899999999999999999999999999877655 Q ss_pred cccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccc-cccccCHHHHHHHHHHHHhcCCCCCceEec Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIE-AADIFSIGLVDNLSLFIDEMAHPLQPVRLS 233 (404) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~-a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~ 233 (404) ........ ..+.++. -.+++....++ .+..++++.|..|++.+++..-| T Consensus 141 ~~~~~~~~------------------~~t~~~~-----i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP------- 190 (381) T protein:vir:80 141 INAFPSQR------------------IYSYDTT-----LGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVP------- 190 (381) T ss_pred cccccccc------------------ccccccc-----ccccccccccccchhhHHHHHHHHHHHHHhhcCCC------- Confidence 44311111 0111110 00111222233 23556778888888888876533 Q ss_pred CccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEE Q lcl|Aclame:pro 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVL 313 (404) Q Consensus 234 g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~ 313 (404) .+. ++++++|.++.+|++++. |.+.. -+..+.|..|.+|+|.|+.|++.+++|... ..... T Consensus 191 ~eg-------R~lvv~P~~~~~Ll~~~~---~~~ad-------~~~~~~l~~G~Ig~i~G~~Vv~Sn~lp~~~--~t~~~ 251 (381) T protein:vir:80 191 QEG-------RIVMVSPAQYIDLLSINQ---FISVD-------FSQVKPVTSGVVGTILGMEVIVTTQIGINS--LTGYV 251 (381) T ss_pred cCC-------cEEEeCHHHHHHHhhchh---hhhhh-------hccchhhhceeeeEEcceEEEeeccccccc--cccee Confidence 111 578899999999999974 44322 133467999999999999999998877421 11111 Q ss_pred eecCcccccccccccccchhhheeeccc---eeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCc Q lcl|Aclame:pro 314 VSENNLTATTKEVAAATNIDRAMLLGAQ---ALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKM 390 (404) Q Consensus 314 ~~~~~~~a~~~~~aa~~~v~ralLlGaQ---Al~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~ 390 (404) .... ..+... ..+.-.-..|.+ |.++. |. .+|+-+.+....-+.-..++.+.-..+.. T Consensus 252 ~~ag-ap~~~~-----~~~~~~~~~g~~s~~a~av~----------~~---k~yd~~~~~~~~~~~~~~g~~~~~~~~~~ 312 (381) T protein:vir:80 252 NGQG-APTQPT-----PGVLGSPYLPDQAGTANVVN----------TG---SASDLAVSLSYFGLPVFSGAGATAADGGQ 312 (381) T ss_pred eecc-cccccc-----ccccccccccccccceeeee----------ee---eeeceeeeeeeccceeeecceeeecCCCc Confidence 1000 000000 000011122211 11111 11 33333333322222222222221111100 Q ss_pred ------------------eEEEEEEEeeeecC Q lcl|Aclame:pro 391 ------------------QDHGVIAVDTAVKL 404 (404) Q Consensus 391 ------------------~DfGvi~idta~~~ 404 (404) -||-.++.--+++- T Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 344 (381) T protein:vir:80 313 TLGSFGGANRWATAVVCHPDWLAVGVQQNVKS 344 (381) T ss_pred eeeeehhhhhhhhhcccccccccccceeEeec Confidence 01111111111111 No 15 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.00 E-value=1.3e-10 Score=74.75 Aligned_cols=333 Identities=11% Similarity=0.087 Sum_probs=178.6 Q ss_pred CCCcCc-hHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTS-AQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~~~~~-~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) ||...- .++......+-...-...+-++|.|.+.+...-++.+-+. +-+++.+++ .|.++.|+-+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~---------~~~~~r~i~--~gks~~~~~iG~~ 69 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTT---------SRHMVRSIS--SGKSAQFPVLGRT 69 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhc---------ccceeeecc--ccceEEEeeecce Confidence 544322 2222222222222112223368999998877777665543 223334554 4899999999888 Q ss_pred ccCceecCceeeeehhhhhhcccEEEEeeec---cccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 80 SKRPTMGDERVEGRGEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGAR 156 (404) Q Consensus 80 ~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R---~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~r 156 (404) +-...+-.+.+.+..++.+....+|.||+.. |.|+ .+++....+|+|.+.-..+..=+++..|+.++.+|..+. T Consensus 70 ~~~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~Vd---diD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a 146 (345) T protein:vir:22 70 QAAYLAPGENLDDKRKDIKHTEKVITIDGLLTADVLIY---DIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLC 146 (345) T ss_pred EEEeeecCCCCCCCCCCcccceEEEEecchhhhhhhHh---hHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 8777777788998888888888999999976 4454 578899999999999999999999999999999886543 Q ss_pred cccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccC---HHHHHHHHHHHHhcCCCCCceEec Q lcl|Aclame:pro 157 GDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFS---IGLVDNLSLFIDEMAHPLQPVRLS 233 (404) Q Consensus 157 g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s---~~~Id~a~~~a~~~~~pi~Pv~~~ 233 (404) .. .+.....|. +.+.++... ++ .+...++..-+.. ++.|..|...+++..-|. T Consensus 147 ~~-~~~~~~~~~---~~~~~~~~~-~~-------------~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~------ 202 (345) T protein:vir:22 147 NV-ESKYNENIE---GLGTATVIE-TT-------------QNKAALTDQVALGKEIIAALTKARAALTKNYVPA------ 202 (345) T ss_pred cc-ccccccccc---ccccccccc-cc-------------cccccccccccCHHHHHHHHHHHHHHhhhcCCCc------ Confidence 21 111001110 111111000 00 0011111111111 344445555555544331 Q ss_pred CccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeE- Q lcl|Aclame:pro 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKV- 312 (404) Q Consensus 234 g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~- 312 (404) ++ +++++.|.|+..|+.++.+- . + .-+..+.+=+|.++.++|+.|.+.++.|......... T Consensus 203 -~~-------R~~vv~P~~y~~Ll~~~~~~------~-~---~~~~~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~ 264 (345) T protein:vir:22 203 -AD-------RVFYCDPDSYSAILAALMPN------A-A---NYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAREG 264 (345) T ss_pred -cC-------CEEEeChHHHHHHhcccccc------c-c---ccccccccccceEEEEeceEEEecccccccccCccccC Confidence 11 57999999999999998532 1 1 1133455668999999999999998877432111000 Q ss_pred Ee-ecCcccccc---cccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCC Q lcl|Aclame:pro 313 LV-SENNLTATT---KEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSG 388 (404) Q Consensus 313 ~~-~~~~~~a~~---~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g 388 (404) .. ..+.+.... +...+..+ ..++++-..|++.+=...--+...|.|+ ..+ ..|-....+|.+=+|- T Consensus 265 ~~~~~~~~~~~~g~~~~~~~~~~-~~~l~~h~~A~~~v~~~~~~~e~~r~~~--~~~--d~I~~~~a~G~~vlRP----- 334 (345) T protein:vir:22 265 TTGQKHVFPANKGEGNVKVAKDN-VIGLFMHRSAVGTVKLRDLALERARRAN--FQA--DQIIAKYAMGHGGLRP----- 334 (345) T ss_pred cccccccccccccceeeeeccCc-eEEEEEehhheeeeeeecceeeeeechh--HHH--HHHHHHHhcCCccccc----- Confidence 00 000000000 00000011 1355665555443322211112222221 111 2455556677666663 Q ss_pred CceEEEEEEEeeeec Q lcl|Aclame:pro 389 KMQDHGVIAVDTAVK 403 (404) Q Consensus 389 ~~~DfGvi~idta~~ 403 (404) +..+ +|..-++ T Consensus 335 ---eaa~-~i~~~~~ 345 (345) T protein:vir:22 335 ---EAAG-AVVFKVE 345 (345) T ss_pred ---ceeE-EEEEeeC Confidence 2222 3333333 No 16 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.00 E-value=1.3e-10 Score=74.86 Aligned_cols=331 Identities=11% Similarity=0.084 Sum_probs=171.4 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||+.......-.|+.+ ....-+-++|.|.+.++..-+..+-+.. .+++.++ ..|++|.|+-+...+ T Consensus 3 ~~~~~~~~t~~g~~~~---~~d~~al~ik~f~~eV~~~f~~~s~~~~---------~~~~r~i--~~G~sv~i~~iG~~t 68 (347) T protein:vir:94 3 NVPGQKIGTDQGKGKS---SSDALALFLKVFAGEVLTAFTRRSVTAD---------KHIVRTI--QNGKSAQFPVMGRTS 68 (347) T ss_pred CCCccccccccccCCc---cccHHHHHHHHHhHHHHHHHHHHHhhhc---------ccccccc--cccceEEEeccccee Confidence 4443222222222200 0001112367777777666555543321 2233344 359999999999998 Q ss_pred cCceecCceeeeehhhhhhcccEEEEeee---ccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg 157 (404) -...+-++.+.|+-+++.-...+|.||+. |+.|+ .+++....+|+|++.-.....=+++..|+.++.+++.+.+ T Consensus 69 v~~~t~G~~l~~~~~~~~~~e~~itID~~~~~~~~Vd---diD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa 145 (347) T protein:vir:94 69 GVYLAPGERLSDKRKGIKHTEKVITIDGLLTADVMIF---DIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCN 145 (347) T ss_pred eeeecCCCCcCCCCCCCCcceEEEEecchhhhhHHhh---hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 88888889998888888888888999998 56666 4688889999999999999999999999999988765443 Q ss_pred ccccccceeeccccccccccccCccCCCCCCceEeccCCccc-cccccccccCHHHHHHHHHHHHhcCCCCCceEecCcc Q lcl|Aclame:pro 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSF-EQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDE 236 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~-~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~ 236 (404) ........ .+.+ .+++. +-.+.++.. ......+.+ ++.|..|.+.+++..-| .+ T Consensus 146 ~~~~~~~~-----~~g~--------~~~s~---~~~~~~~~~~~~~~~~~~~-~~~i~~a~~~Lde~~VP-------~~- 200 (347) T protein:vir:94 146 LPAASNEN-----IAGL--------GTASV---LEVGKKADLDTPAKLGEAI-IGQLTIARAKLTSNYVP-------AG- 200 (347) T ss_pred cccccccc-----cCCC--------cccce---eeccccccccchhhhHHHH-HHHHHHHHHHHhhcCCC-------CC- Confidence 21110000 0011 01110 000000000 000001111 34555666666655433 11 Q ss_pred ccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeec Q lcl|Aclame:pro 237 LHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSE 316 (404) Q Consensus 237 ~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~ 316 (404) . ++++++|+++..|..++.+. . + . . ..+..+=.|.+|.++|+.|.+.+++|.. ..+....-.. T Consensus 201 -----~-R~~vv~P~~~~~Ll~~~~~~---~----~-~-~-~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~-~~t~~~~~~~ 263 (347) T protein:vir:94 201 -----D-RYFYTTPDNYSAILAALMPN---A----A-N-Y-AALIDPETGNIRNVMGFVVVEVPHLVQG-GAGETRGDDG 263 (347) T ss_pred -----C-cEEEeCHHHHHHHhccchhh---h----h-h-c-cccccccccceEEEeceEEEecCccccc-ccccccccCc Confidence 2 67789999999999987532 1 1 0 0 1123356799999999999999998742 1110000000 Q ss_pred Ccccccc---------cccccccchhhheeeccceeEEEeeecCCCCcceeecccccCch-hHHHHHHHhchhhccccCC Q lcl|Aclame:pro 317 NNLTATT---------KEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRFPEK 386 (404) Q Consensus 317 ~~~~a~~---------~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~-~~i~i~~i~G~~K~rF~~~ 386 (404) -...++. ..+...+.-..+|++=.-|++ .++.-. +. .|-..|-.++ ..|-....+|.+=+|- T Consensus 264 ~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~--~v~~~~--~~-~e~~r~~~~~~d~i~~~~~~G~~~~rP--- 335 (347) T protein:vir:94 264 ITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVG--TVKLRD--LA-LERDRDVDAQGDLIVGKYAMGHGGLRP--- 335 (347) T ss_pred ceecCcccccccccchhhhcccccceeEEEeehhhhh--hhhccc--cc-ccchhchhhHHHHhhhhhhhcCccccc--- Confidence 0000000 000000111123333222322 222111 00 1212222111 2455556677666664 Q ss_pred CCCceEEEEEEEeeeec Q lcl|Aclame:pro 387 SGKMQDHGVIAVDTAVK 403 (404) Q Consensus 387 ~g~~~DfGvi~idta~~ 403 (404) |..+....++|. T Consensus 336 -----~~a~~~~~~~A~ 347 (347) T protein:vir:94 336 -----EAAGALVFSPAE 347 (347) T ss_pred -----ceeEEEEecCCC Confidence 444444444555 No 17 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=98.97 E-value=4.6e-11 Score=77.25 Aligned_cols=262 Identities=12% Similarity=0.082 Sum_probs=152.6 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchH-HHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) |.. +.+...+-. =+.|+..+.....++..+ . .+.....+|+-.+|++|+|+....+ T Consensus 1 ma~---------------~~T~~~d~iiPev~~~~v~~~~~~~~~~-~-------~~~~~~~~l~g~~G~ti~iP~~~~~ 57 (272) T protein:vir:36 1 MSK---------------QKTTLADLVNPEVLAPIVSYELNKALRF-A-------PLAQVDTTLQGQPGNTLKFPAFTYI 57 (272) T ss_pred CCC---------------cceehhhhhchHHHHHHHHHHHHhhhhh-c-------cccccccccccCCCCEEEEeeeccC Confidence 211 011111111 244544432222222111 1 1123346788889999999998766 Q ss_pred ccCc--eecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 80 SKRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 80 ~G~g--v~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg 157 (404) |+. +..++.+ ..+.|...++++.|.+...++.... ++...+.-|+..++...++.+|++..|..++-.|.|... T Consensus 58 -gda~~~~eg~~i--~~~~lt~~~~~~~i~~~~k~~~vtD-~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~ 133 (272) T protein:vir:36 58 -GDAADVAEGGEI--SLDKIGTTTKSVTIKKAAKGTEITD-EAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQ 133 (272) T ss_pred -ccccccCCCCcc--ChhhcCCcceeEeeehhhccccccH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 432 3334444 3788999999999999988888765 566678899999999999999999999999866655321 Q ss_pred ccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccc Q lcl|Aclame:pro 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~ 237 (404) . .+-..+.+.|..|........ T Consensus 134 -----------------------~----------------------~~~~~~~d~i~~A~~~lgd~~------------- 155 (272) T protein:vir:36 134 -----------------------T----------------------VSTKANVDGVQAALDIFNDED------------- 155 (272) T ss_pred -----------------------c----------------------ccccccHHHHHHHHHHhhhcC------------- Confidence 0 011234556666666554322 Q ss_pred cCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecC Q lcl|Aclame:pro 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) Q Consensus 238 ~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~ 317 (404) .+-++++|||.++..|++|+. +.... ..+.++++++|.+|.|.|+.|..-.++|. T Consensus 156 ---~~~~~ivv~p~~~~~L~k~~~---~~~~~------~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~------------- 210 (272) T protein:vir:36 156 ---AQAYVLIVNPKDAAKIRKDAN---AKNIG------SEVGANALINGTYADVLGAQIVRSKKLAE------------- 210 (272) T ss_pred ---CCceEEEEcHHHHHHHhcccc---ccccc------ccccccceeeeccceecCeeEEEeCCCCC------------- Confidence 123689999999999999985 32221 12345789999999999999988776541 Q ss_pred cccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEE Q lcl|Aclame:pro 318 NLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIA 397 (404) Q Consensus 318 ~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~ 397 (404) +..+.-.++.|..|++.. ...+. . +|...|-.. ..+.|.+. +=||+-+ T Consensus 211 -----------~~~~~~~~~~~~gA~~~~--~~~~~--~-vE~~R~~~~----~~d~i~~~------------~~y~~~v 258 (272) T protein:vir:36 211 -----------GSALMFKIVSNSPALKLV--LKRGV--Q-VETDRDIVT----KTTVITAD------------EHYAAYL 258 (272) T ss_pred -----------CceeEEEEEecccceeee--ecCCc--c-cccccchhh----cCcEEEEE------------EEEEEEE Confidence 001123456666665543 22221 1 332222221 11222221 2255544 Q ss_pred Eeee--ecC Q lcl|Aclame:pro 398 VDTA--VKL 404 (404) Q Consensus 398 idta--~~~ 404 (404) ++-. |+| T Consensus 259 ~~~~~vv~~ 267 (272) T protein:vir:36 259 YDLTKVVNI 267 (272) T ss_pred EcCccEEEE Confidence 4422 444 No 18 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.96 E-value=1.2e-10 Score=74.92 Aligned_cols=315 Identities=14% Similarity=0.116 Sum_probs=176.1 Q ss_pred CCCcC---chHHHHHHHHHHHHHhhcc-chHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEe Q lcl|Aclame:pro 1 MTTVT---SAQANKLYQVALFTAANRN-RSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIM 76 (404) Q Consensus 1 ~~~~~---~~~a~~~~~~~lft~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~ 76 (404) ||..- +|+=-...+++ .....+ .-++++|.+.+...-.+.+.+.. .+++.++. .|++|.|+-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~--~~~d~~~al~le~~~geV~~~f~~~s~~~~---------~~~~r~i~--~G~tv~i~~i 67 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARN--ADYDVRYATALKLFSGEVFTAFNNASIFKG---------LVRSYDLR--GGKSKQFMFT 67 (332) T ss_pred CcccccccCCccccCCccc--cccccchhhhhhhhhhhHHHHHHHHhhhhh---------cccccccc--ccceEEEEec Confidence 55432 12110000000 011112 12478999988777777766532 22223342 5999999999 Q ss_pred eccccCceecCceeeeehhhhhhcccEEEEeeec---cccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 77 HKLSKRPTMGDERVEGRGEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA 153 (404) Q Consensus 77 ~~L~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R---~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~la 153 (404) ...+-...+.++.+.++ +++.-...+|.||+.. +.|+ .+++..+++|||.+.-.....=+++..|+.++.++. T Consensus 68 g~~~~~~~~~g~~l~~~-~~~~~~~~~l~ID~~ky~~~~Vd---diD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~ 143 (332) T protein:vir:78 68 GKLSAGYHTPGTPIVGD-AGIKANEKTLVMDDLLVSSQFVY---SLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLA 143 (332) T ss_pred cceeEeeecCCCCCCCC-CCCCCceEEEEEehhhhhHHHHH---hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88887777777788876 4588888899999954 5564 578889999999999999999999999999998875 Q ss_pred hcccccccccceeeccccccccccccCccCCCCCCceEeccCCcccccccccccc----CHHHHHHHHHHHHhcCCCCCc Q lcl|Aclame:pro 154 GARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIF----SIGLVDNLSLFIDEMAHPLQP 229 (404) Q Consensus 154 G~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~----s~~~Id~a~~~a~~~~~pi~P 229 (404) .+... ..|....+.+. ...++++... -++.|..|.+.+++..-| T Consensus 144 ~aa~~------~~~~~~~~g~~-----------------------~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP--- 191 (332) T protein:vir:78 144 KASAE------ASPVTGEPGGF-----------------------HVNIGAGNTNDAQAIVDGFFEAAAVLDERSAP--- 191 (332) T ss_pred hhhcc------cCccccccccc-----------------------ccccCCccccCHHHHHHHHHHHHHHHhhcCCC--- Confidence 43321 00110111111 0112222222 234555666666655432 Q ss_pred eEecCccccCCccEEEEEEchHHHHHHHh--CcchHHHHHHHHHhhhccccccCcceeCC-eEEEcCEEEEecCceeeee Q lcl|Aclame:pro 230 VRLSGDELHGEDPYYVLYVTPRQWNDWYT--STSGKDWNQMMVRAVNRAKGFNHPLFKGE-CAMWRNILVRKYAGMPIRF 306 (404) Q Consensus 230 v~~~g~~~~~~~~~yV~~l~P~q~~dLr~--d~~~~~w~~~qk~A~ar~~g~~nPlF~G~-~gm~ngvii~~~~~~~irf 306 (404) .++ +++++.|+++..|.+ |+.+ .++ ...+.+-.+..|. ++.|+|+.|.+.++.|.-. T Consensus 192 ----~~g-------R~~vv~P~~y~~Ll~~~d~~~-------~n~--~~~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~ 251 (332) T protein:vir:78 192 ----QEG-------RVAVLSPRQYYSLISSVDTNI-------LNR--EIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLY 251 (332) T ss_pred ----ccC-------CEEEeCHHHHHHHHhhcCcee-------eee--eccccccceecceeeeEEeeeEEEecCccccCc Confidence 111 577899999999987 5421 001 1113344577775 8999999999999877321 Q ss_pred ccceeEEeecCcccccccccccccchhhheeeccceeEEEeeecC---CCCcceeecccccCchhHHHHHHHhchhhccc Q lcl|Aclame:pro 307 YQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKA---GGHFNMVEKKTDMDNRTEIAISWINGLKKIRF 383 (404) Q Consensus 307 ~~~~~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~---g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF 383 (404) +......+ ..-..+.++....-.-++++...|++.+=.+.. -.+-.|.|+.+ ...|-....+|.+=+| T Consensus 252 --g~~~~~~~--~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~----~d~i~~~~~~G~~v~r- 322 (332) T protein:vir:78 252 --GQDLSSAA--VTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQ----GDLIVGKLAMGCGSLR- 322 (332) T ss_pred --cccccccc--ccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhh----HhhhhhhhhhcCceec- Confidence 11110000 000112222223333467888887766644321 11223444432 2355555567764444 Q ss_pred cCCCCCceEEEEEEEeee Q lcl|Aclame:pro 384 PEKSGKMQDHGVIAVDTA 401 (404) Q Consensus 384 ~~~~g~~~DfGvi~idta 401 (404) +. ++++|-+| T Consensus 323 Pe--------~~v~l~~a 332 (332) T protein:vir:78 323 TS--------VAGSFQAA 332 (332) T ss_pred cc--------ceEEEeeC Confidence 22 23344444 No 19 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=98.94 E-value=8.2e-11 Score=75.88 Aligned_cols=267 Identities=15% Similarity=0.079 Sum_probs=152.3 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccch-HHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) |..-. | ...+= .-+.|+..+.....++ -.+ ..+....++|.-.+|++|+|+....+ T Consensus 1 ma~~~-------------T--~~~d~i~Pev~s~~v~~~~~~~-~~~-------~~~~~~~~~l~g~~G~tv~ip~~~~~ 57 (274) T protein:vir:96 1 MAQGT-------------T--KVSNLIVPEVLAPMMQAELDKK-LRF-------AQFADIDSTLVGQPGDTLTFPAFTYS 57 (274) T ss_pred CCccc-------------c--chhhhhhhHHHHHHHHHHHHhh-hhh-------cccccccccccCCCCCEEEEEeeccC Confidence 21111 1 00011 1234554443322222 111 12233456777789999999998643 Q ss_pred ccC--ceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 80 SKR--PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 80 ~G~--gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg 157 (404) |+ -+..++.+. .+.+...++++.|++...++.... .+...+..|+..++...++.+|++..|..++-.|.|+.. T Consensus 58 -g~~~~~~~g~~i~--~~~it~~~~~~~i~~~~~~~~i~D-~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~ 133 (274) T protein:vir:96 58 -GDAQVIAEGEKIP--VDQIGTSKREAKVRKIGKGTELTD-EAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL 133 (274) T ss_pred -CCccccCCCCcCc--hhhcccceeEEEEEeeeceeeecH-HHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 33 233334443 788999999999999888888775 355668889999999999999999999999866644220 Q ss_pred ccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccc Q lcl|Aclame:pro 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~ 237 (404) -..++.++.+.|..|..++.... T Consensus 134 --------------------------------------------~~~~~~~~~d~i~dA~~~l~d~~------------- 156 (274) T protein:vir:96 134 --------------------------------------------TVEADITKLDGLQTAIDKFNDED------------- 156 (274) T ss_pred --------------------------------------------CcCcccccHHHHHHHHHHhcccC------------- Confidence 01234456777777777664321 Q ss_pred cCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecC Q lcl|Aclame:pro 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) Q Consensus 238 ~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~ 317 (404) .+..+++|||.++..|+++... +|..- ..+-++.+.+|.+|.|+|+.|..-.++|. T Consensus 157 ---~~~~~ivv~p~~~~~L~k~~~~-~f~~~-------~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~------------- 212 (274) T protein:vir:96 157 ---LEPMVLFVNPLDAGGLRTSASD-NFTRP-------TQLGDNIIVKGAFGEALGAVIVRSNKLNK------------- 212 (274) T ss_pred ---CCceEEEeCHHHHHHHHhcccc-ccccc-------ccccccceeecccceecCeeEEEcCCCCc------------- Confidence 1236899999999999999642 23321 11224788999999999998877665541 Q ss_pred cccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCch-hHHHHHHHhchhhccccCCCCCceEEEEE Q lcl|Aclame:pro 318 NLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRFPEKSGKMQDHGVI 396 (404) Q Consensus 318 ~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~-~~i~i~~i~G~~K~rF~~~~g~~~DfGvi 396 (404) ..++|+|..|++.+-++ + .. +|...|-..+ -.|-....+|++-++ .=+++ T Consensus 213 ---------------~t~~l~~~gA~~~~~~~--~--~~-vE~~Rd~~~~~d~i~~~~~yg~~~~~---------~~~vv 263 (274) T protein:vir:96 213 ---------------GEALLAKKGAVKLITKR--D--FF-LEKDRDASRKSTALYSDKHYVAYLYD---------ESKVV 263 (274) T ss_pred ---------------ceEEEEeCcceeeeecC--C--cc-cccccchhhcccEEEEeeEEEEEEEc---------CccEE Confidence 12588898887765332 1 11 2332222211 111111222222221 11222 Q ss_pred EEeee--ecC Q lcl|Aclame:pro 397 AVDTA--VKL 404 (404) Q Consensus 397 ~idta--~~~ 404 (404) +|-++ =+. T Consensus 264 ~~t~~~~~~~ 273 (274) T protein:vir:96 264 KITKGAGDEV 273 (274) T ss_pred EEEcCccccc Confidence 22211 111 No 20 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=98.88 E-value=1e-10 Score=75.36 Aligned_cols=261 Identities=14% Similarity=0.080 Sum_probs=156.7 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccch-HHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) |..- .+...+= .=+.|+..+.....++..+. ......++|.-.+|++|+|+....+ T Consensus 1 Ma~~---------------~T~l~d~i~Pev~~~~v~~~~~~~~~~~--------~~~~~~~~l~g~~G~ti~iP~~~~i 57 (276) T protein:vir:10 1 MAQG---------------TTTKSTQIVPEVLAPMMQAELDKKLRFA--------QFADIDSTLVGQPGDTLTFPAFVYS 57 (276) T ss_pred CCcc---------------eeehhhhhchHHHHHHHHHHHHhhhhhc--------ccceecccccCCCCCEEEeeeecCC Confidence 2110 1111111 12455555444443332221 1223356788889999999999877 Q ss_pred ccC-ceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 80 SKR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (404) Q Consensus 80 ~G~-gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~ 158 (404) +.. .+..++.+. .+.|+..++++.|.+...++.... .+...+.-|+..++-+.++.+|++.+|..++-.|.+.... T Consensus 58 gda~~~~eg~~i~--~~~lt~~~~~a~i~~~~k~~~~tD-~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~ 134 (276) T protein:vir:10 58 GDATVVPEGQKIP--VDKIETNRREAKIHKIGKGTDITD-EALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLT 134 (276) T ss_pred CccccccCCCccC--ccccccceeeEEeehccccccccH-HHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 322 244444444 788999999999999888888764 4666677899999999999999999999998666442210 Q ss_pred cccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCcccc Q lcl|Aclame:pro 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (404) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~ 238 (404) .+++.++.+.|..|..++.... T Consensus 135 --------------------------------------------~~~~~~t~d~i~~A~~~lgd~~-------------- 156 (276) T protein:vir:10 135 --------------------------------------------VSADIGTLAGLEAAIDTFDDED-------------- 156 (276) T ss_pred --------------------------------------------ccccccCHHHHHHHHHHhcccc-------------- Confidence 0234566777877777664321 Q ss_pred CCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCc Q lcl|Aclame:pro 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENN 318 (404) Q Consensus 239 ~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~ 318 (404) ...++++|||.++..|+++... +|.+. ..+..+.+..|.+|.+.|+.|..-..+| T Consensus 157 --~~~~~ivv~p~~~~~L~k~~~~-~f~~~-------s~~g~~~~~~G~ig~~~G~~Vi~s~~~p--------------- 211 (276) T protein:vir:10 157 --LEPMVLFINPKDAGKLRSSASD-NFTRA-------TELGDNIIVKGAFGEALGAVIVRSKKLD--------------- 211 (276) T ss_pred --CcccEEEEcHHHHHHHHHhccc-ccccc-------ccccccceeccccceecceeEEEcCCCC--------------- Confidence 1247999999999999987432 34421 2234678999999999999887755543 Q ss_pred ccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEE Q lcl|Aclame:pro 319 LTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAV 398 (404) Q Consensus 319 ~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~i 398 (404) ...++|+|..|+.+.-.+ +. . +|...|-..+ .+.|.+. +=|||.++ T Consensus 212 -------------~~t~~l~~~gAi~~~~~~--~~--~-vE~dRd~~~~----~d~i~~~------------~~y~~~~~ 257 (276) T protein:vir:10 212 -------------EGEAILAKRGAVKLITKR--DF--F-LETDRDPSTK----TTALYSD------------KHYVAYLY 257 (276) T ss_pred -------------cceEEEEeccceeeeecC--Cc--e-eecccchhhc----ccEEEEe------------eEEEEEEE Confidence 113578888877654322 21 1 4444443322 1222221 11333333 Q ss_pred eee--ecC Q lcl|Aclame:pro 399 DTA--VKL 404 (404) Q Consensus 399 dta--~~~ 404 (404) +-. +++ T Consensus 258 ~~~~vv~~ 265 (276) T protein:vir:10 258 DESKAVKV 265 (276) T ss_pred cCcceEEE Confidence 221 111 No 21 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.87 E-value=1.8e-09 Score=68.48 Aligned_cols=327 Identities=12% Similarity=0.068 Sum_probs=170.6 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||.-- ..-.-.|.+ .....+-++|.|.+.+...-+..+.+... ..++ .+ ..|+++.|+-+...+ T Consensus 1 ms~~~-~~tr~~~~~----s~~d~al~le~f~geV~~af~~~s~~~~~------~~~r---ti--~~g~s~~~~~iG~~~ 64 (335) T protein:vir:63 1 MSFLN-DLTRPNYAG----KNADVDIHLEEHLGIVDKHFAYTSKFAPL------MNIR---DL--RGSNVVRLDRLGNVE 64 (335) T ss_pred CCCcc-cchhhhccc----ccchhheehhhhhhhHHHHHHhhhhhccc------ccee---ee--ccceeEEEeeeeeee Confidence 76541 111111111 01111124789999877666666554322 2222 33 449999999998888 Q ss_pred cCceecCceeeeehhhhhhcccEEEEee---eccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQ---GRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq---~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg 157 (404) -...+=++.+.|+- -......|.||. .||.|+- +++-..++|+|++.-..+..=+++..|+.+|.+++=+.. T Consensus 65 ~~~~~pG~~l~~~~--~~~~k~~itVD~ll~a~~~I~d---lDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~ 139 (335) T protein:vir:63 65 AKGRRAGEELERSR--VVNDKWNLTVDTLLYLRHQFDH---QDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAA 139 (335) T ss_pred eecccCCcCcCCCC--ccccceEEEecceeechhhhhh---HHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 77676777777764 344567899999 6777753 677888999999999999999999999999988753322 Q ss_pred ccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccc Q lcl|Aclame:pro 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~ 237 (404) . ......+...+++. ..+ . + .+...+.+..|.+.- .+..|.....+..-|-.|+ T Consensus 140 ~--~a~~~~~~~~~~G~---~~~-----~---~-----~tg~~~~~~~~~l~~-a~~~a~~~L~e~dVP~~~~------- 193 (335) T protein:vir:63 140 M--DAPVDLEDAFSPGV---LEK-----L---D-----LTGLTAKQAADKIVR-MHRRVVETFIDRDLGDAVY------- 193 (335) T ss_pred c--cCccccCCCcCCCc---cee-----e---e-----eccCcccccHHHHHH-HHHHHHHHHHhccCCCccc------- Confidence 1 00000000000110 000 0 0 011111222222211 2222333333322221100 Q ss_pred cCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecC Q lcl|Aclame:pro 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) Q Consensus 238 ~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~ 317 (404) ++ ++++|+|.||+.|..++.+ .+ ..-...+..++.-.|.++.++||.|.+-+++|-. +++.... T Consensus 194 ---~d-r~~vv~P~~y~~Ll~~~~l---~n----~~~~~s~~~~~~~~g~v~~v~Gv~V~~sn~lP~~--~~t~~~l--- 257 (335) T protein:vir:63 194 ---SE-GLTPMSPRVFSLLLEHDKL---MN----VEYQATGATNDYVKSRVAILNGVKVLETPRFATK--AIAAHPL--- 257 (335) T ss_pred ---Cc-eEEEeChHHHHHHhccccc---cc----cccccccccccccCceeEEeeceEEEeeccCCCC--Ccccccc--- Confidence 11 7899999999999999742 21 1001112346778899999999999999988742 2221111 Q ss_pred cccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEE Q lcl|Aclame:pro 318 NLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIA 397 (404) Q Consensus 318 ~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~ 397 (404) +...+.++.-..=.-++++-.-|++.+=...-.+...|.++.+ -.-|-....+|..=.|- |..+.. T Consensus 258 --g~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~----~~~i~~~~a~G~g~lRP--------e~a~~i 323 (335) T protein:vir:63 258 --GRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKF----SWVLDTFQMYNIGARRP--------DTAGAI 323 (335) T ss_pred --cccCCccccccceeEEEEEecceEEEEEEeecccceeeccchh----hHHhHHHHHcCCccccc--------ceEEEE Confidence 1111111111111235666666665554332223333332222 13445555677666664 333322 Q ss_pred EeeeecC Q lcl|Aclame:pro 398 VDTAVKL 404 (404) Q Consensus 398 idta~~~ 404 (404) --|-+.- T Consensus 324 ~~tg~~~ 330 (335) T protein:vir:63 324 ELKGIGA 330 (335) T ss_pred EEcCCCc Confidence 2233322 No 22 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=98.85 E-value=4.9e-10 Score=71.63 Aligned_cols=261 Identities=14% Similarity=0.071 Sum_probs=154.9 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |..-... ....=.| +.|+..+.....++..+ ..+..+.++|+-.+|++|+|+....+ T Consensus 1 ma~~~T~------------~~d~iiP--ev~~~~v~~~~~~~l~~--------~~~~~~d~~l~g~~G~tv~iP~~~~~- 57 (274) T protein:vir:94 1 MPQGLTK------------TSDQIIP--EVLAPMMQAQLEKKLRF--------ASFAEVDSTLQGQPGDTLTFPAFVYS- 57 (274) T ss_pred CCcccee------------hhheech--HHHHHHHHHhhhhhhhh--------cccceecccccCCCCCEEEEeeecCC- Confidence 2110000 0001112 45655543332222111 12334456787789999999998755 Q ss_pred cC--ceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 81 KR--PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (404) Q Consensus 81 G~--gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~ 158 (404) |+ -+..++.+. .+.|...++++.|++...++..... +...+.-|+..++.+.++.+|++..|..++-+|.++.. T Consensus 58 g~a~~~~~g~~i~--~~~lt~~~~~~~i~~~~~~~~i~D~-~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~- 133 (274) T protein:vir:94 58 GDAQVVAEGEKIP--TDILETKKREAKIRKIAKGTSITDE-ALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL- 133 (274) T ss_pred CccccccCCCccc--ccccccceeEEEeeeecceecccHH-HHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc- Confidence 33 233334443 7788999999999998888877753 55557789999999999999999999999877644221 Q ss_pred cccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCcccc Q lcl|Aclame:pro 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (404) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~ 238 (404) . ..++.++.+.|..|..++.... T Consensus 134 ----------------------~---------------------~~~~~~~~d~i~dA~~~l~d~~-------------- 156 (274) T protein:vir:94 134 ----------------------T---------------------VNADITKLNGLQSAIDKFNDED-------------- 156 (274) T ss_pred ----------------------c---------------------ccccccCHHHHHHHHHHhhccC-------------- Confidence 0 0124456777777777664321 Q ss_pred CCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCc Q lcl|Aclame:pro 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENN 318 (404) Q Consensus 239 ~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~ 318 (404) ...++++|||.++..|++|+.. +|.. ...+-++.+.+|.+|.|.|+.|..-.++|. T Consensus 157 --~~~~~ivv~p~~~~~L~k~~~~-~f~~-------~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~-------------- 212 (274) T protein:vir:94 157 --LEPMVLFVNPLDAGKLRGDAST-NFTR-------ATELGDDIIVKGAFGEALGAIIVRTNKLEA-------------- 212 (274) T ss_pred --CCceEEEeCHHHHHHHHhhhhh-hccc-------cCcccccceeccccceecCeeEEEcCCCCc-------------- Confidence 1237899999999999999742 2332 122335789999999999999987666541 Q ss_pred ccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEE Q lcl|Aclame:pro 319 LTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAV 398 (404) Q Consensus 319 ~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~i 398 (404) ..++|+|..|+.+.-.+ + .. +|...|-..+ .+.|.+. +=|||-++ T Consensus 213 --------------~t~~l~~~gA~~~~~~~--~--~~-vE~~Rd~~~~----~d~i~~~------------~~y~~~~~ 257 (274) T protein:vir:94 213 --------------GTAILAKKGAVKLILKR--D--FF-LEVARDASTK----TTALYSD------------KHYVAYLY 257 (274) T ss_pred --------------ceEEEEeCcceEeeecC--C--ce-eccccchhhc----ccEEEEE------------EEEEEEEE Confidence 13578888877754322 2 12 4444443221 1222221 12344333 Q ss_pred ee---------eecC Q lcl|Aclame:pro 399 DT---------AVKL 404 (404) Q Consensus 399 dt---------a~~~ 404 (404) +- .+-| T Consensus 258 ~~~~vv~~t~~~~~~ 272 (274) T protein:vir:94 258 DESKAVKITKGSGSL 272 (274) T ss_pred cCCceEEEecCcccc Confidence 32 1222 No 23 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=98.85 E-value=4.9e-10 Score=71.63 Aligned_cols=261 Identities=14% Similarity=0.071 Sum_probs=154.9 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |..-... ....=.| +.|+..+.....++..+ ..+..+.++|+-.+|++|+|+....+ T Consensus 1 ma~~~T~------------~~d~iiP--ev~~~~v~~~~~~~l~~--------~~~~~~d~~l~g~~G~tv~iP~~~~~- 57 (274) T protein:vir:97 1 MPQGLTK------------TSDQIIP--EVLAPMMQAQLEKKLRF--------ASFAEVDSTLQGQPGDTLTFPAFVYS- 57 (274) T ss_pred CCcccee------------hhheech--HHHHHHHHHhhhhhhhh--------cccceecccccCCCCCEEEEeeecCC- Confidence 2110000 0001112 45655543332222111 12334456787789999999998755 Q ss_pred cC--ceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 81 KR--PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (404) Q Consensus 81 G~--gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~ 158 (404) |+ -+..++.+. .+.|...++++.|++...++..... +...+.-|+..++.+.++.+|++..|..++-+|.++.. T Consensus 58 g~a~~~~~g~~i~--~~~lt~~~~~~~i~~~~~~~~i~D~-~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~- 133 (274) T protein:vir:97 58 GDAQVVAEGEKIP--TDILETKKREAKIRKIAKGTSITDE-ALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL- 133 (274) T ss_pred CccccccCCCccc--ccccccceeEEEeeeecceecccHH-HHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc- Confidence 33 233334443 7788999999999998888877753 55557789999999999999999999999877644221 Q ss_pred cccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCcccc Q lcl|Aclame:pro 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (404) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~ 238 (404) . ..++.++.+.|..|..++.... T Consensus 134 ----------------------~---------------------~~~~~~~~d~i~dA~~~l~d~~-------------- 156 (274) T protein:vir:97 134 ----------------------T---------------------VNADITKLNGLQSAIDKFNDED-------------- 156 (274) T ss_pred ----------------------c---------------------ccccccCHHHHHHHHHHhhccC-------------- Confidence 0 0124456777777777664321 Q ss_pred CCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCc Q lcl|Aclame:pro 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENN 318 (404) Q Consensus 239 ~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~ 318 (404) ...++++|||.++..|++|+.. +|.. ...+-++.+.+|.+|.|.|+.|..-.++|. T Consensus 157 --~~~~~ivv~p~~~~~L~k~~~~-~f~~-------~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~-------------- 212 (274) T protein:vir:97 157 --LEPMVLFVNPLDAGKLRGDAST-NFTR-------ATELGDDIIVKGAFGEALGAIIVRTNKLEA-------------- 212 (274) T ss_pred --CCceEEEeCHHHHHHHHhhhhh-hccc-------cCcccccceeccccceecCeeEEEcCCCCc-------------- Confidence 1237899999999999999742 2332 122335789999999999999987666541 Q ss_pred ccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEE Q lcl|Aclame:pro 319 LTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAV 398 (404) Q Consensus 319 ~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~i 398 (404) ..++|+|..|+.+.-.+ + .. +|...|-..+ .+.|.+. +=|||-++ T Consensus 213 --------------~t~~l~~~gA~~~~~~~--~--~~-vE~~Rd~~~~----~d~i~~~------------~~y~~~~~ 257 (274) T protein:vir:97 213 --------------GTAILAKKGAVKLILKR--D--FF-LEVARDASTK----TTALYSD------------KHYVAYLY 257 (274) T ss_pred --------------ceEEEEeCcceEeeecC--C--ce-eccccchhhc----ccEEEEE------------EEEEEEEE Confidence 13578888877754322 2 12 4444443221 1222221 12344333 Q ss_pred ee---------eecC Q lcl|Aclame:pro 399 DT---------AVKL 404 (404) Q Consensus 399 dt---------a~~~ 404 (404) +- .+-| T Consensus 258 ~~~~vv~~t~~~~~~ 272 (274) T protein:vir:97 258 DESKAVKITKGSGSL 272 (274) T ss_pred cCCceEEEecCcccc Confidence 32 1222 No 24 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=98.85 E-value=2.5e-10 Score=73.25 Aligned_cols=262 Identities=14% Similarity=0.072 Sum_probs=154.0 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |..-... ....=.| +.|+..+.....++..+ ..+....++|+-++||+|+|+....++ T Consensus 1 m~~~~T~------------l~d~i~P--ev~~~~v~~~~~~~l~~--------~~~~~~~~~l~g~~G~tv~iP~~~~ig 58 (274) T protein:vir:96 1 MAQGMTK------------LTNQIVP--EVLAPMMQAELEKKLRF--------ASFAEIDNTLVGQPGDTLTFPAFIYSG 58 (274) T ss_pred CCcceee------------hhheech--HHHHHHHHHHHHhhhhc--------cccceecccccCCCCCEEEeeeecCCC Confidence 2111100 0011112 45665554333222222 122233567887899999999987663 Q ss_pred cCc-eecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 81 KRP-TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) Q Consensus 81 G~g-v~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~ 159 (404) ..- +..++.++ .+.|...++++.|++..+++.... .+...+.-|+..++.+.++.+|++..|..++-.+.++... T Consensus 59 ~a~~~~~g~~i~--~~~lt~~~~~~~i~~~~~a~~i~D-~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~- 134 (274) T protein:vir:96 59 DAKVVAEGEKIP--TDILETKKREAKIRKIAKGTSISD-EALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT- 134 (274) T ss_pred ccccccCCCccc--hhhcccceeEEEeeeeecceeehH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc- Confidence 222 33334444 678999999999999888888765 4556667899999999999999999999998666443210 Q ss_pred ccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccC Q lcl|Aclame:pro 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~ 239 (404) .+++.++.+.|..|..++.... T Consensus 135 -------------------------------------------~~~~~~~~d~i~~A~~~lgd~~--------------- 156 (274) T protein:vir:96 135 -------------------------------------------VEADITKLTGLQTAIDKFNDED--------------- 156 (274) T ss_pred -------------------------------------------ccccccCHHHHHHHHHHhcccc--------------- Confidence 0123456777777776664321 Q ss_pred CccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcc Q lcl|Aclame:pro 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) Q Consensus 240 ~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~ 319 (404) ..-++++|||.++..|++|+.. +|.. ...+..+.+..|.+|.|.|+.|..-..+| T Consensus 157 -~~~~~ivv~p~~~~~L~k~~~~-~f~~-------~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~---------------- 211 (274) T protein:vir:96 157 -LEPMVLFISPLDAGKLRGDATT-NFTR-------ATELGDDVIVKGAFGEALGAVIVRSNKLE---------------- 211 (274) T ss_pred -ccccEEEeCHHHHHHHHhhccc-cccc-------cccccccceeccccceecCeEEEEeCCCC---------------- Confidence 1236899999999999999732 2331 12234588999999999999887644332 Q ss_pred cccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEe Q lcl|Aclame:pro 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVD 399 (404) Q Consensus 320 ~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~id 399 (404) ...++|+|-.|++.. ...+ .. +|-..|-.. ..+.|.+- +=||+-+++ T Consensus 212 ------------~~t~~l~~~gA~~~~--~~~~--~~-vE~~Rd~~~----~~d~i~~~------------~~y~~~~~~ 258 (274) T protein:vir:96 212 ------------AGTAILAKKGAVKLI--TKRD--FF-LETDRDPST----KTTALYSD------------KHYVAYLYD 258 (274) T ss_pred ------------CceEEEEeccceeee--ecCC--cc-ccccccccc----ccCEEEEe------------EEEEEEEEc Confidence 113578887776653 2211 12 444444332 11222221 124443333 Q ss_pred e--eecC Q lcl|Aclame:pro 400 T--AVKL 404 (404) Q Consensus 400 t--a~~~ 404 (404) - .|+| T Consensus 259 ~~~~v~~ 265 (274) T protein:vir:96 259 ESKAVKI 265 (274) T ss_pred CCcEEEE Confidence 2 2233 No 25 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=98.85 E-value=2.5e-10 Score=73.25 Aligned_cols=262 Identities=14% Similarity=0.072 Sum_probs=154.0 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |..-... ....=.| +.|+..+.....++..+ ..+....++|+-++||+|+|+....++ T Consensus 1 m~~~~T~------------l~d~i~P--ev~~~~v~~~~~~~l~~--------~~~~~~~~~l~g~~G~tv~iP~~~~ig 58 (274) T protein:vir:95 1 MAQGMTK------------LTNQIVP--EVLAPMMQAELEKKLRF--------ASFAEIDNTLVGQPGDTLTFPAFIYSG 58 (274) T ss_pred CCcceee------------hhheech--HHHHHHHHHHHHhhhhc--------cccceecccccCCCCCEEEeeeecCCC Confidence 2111100 0011112 45665554333222222 122233567887899999999987663 Q ss_pred cCc-eecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 81 KRP-TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) Q Consensus 81 G~g-v~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~ 159 (404) ..- +..++.++ .+.|...++++.|++..+++.... .+...+.-|+..++.+.++.+|++..|..++-.+.++... T Consensus 59 ~a~~~~~g~~i~--~~~lt~~~~~~~i~~~~~a~~i~D-~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~- 134 (274) T protein:vir:95 59 DAKVVAEGEKIP--TDILETKKREAKIRKIAKGTSISD-EALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT- 134 (274) T ss_pred ccccccCCCccc--hhhcccceeEEEeeeeecceeehH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc- Confidence 222 33334444 678999999999999888888765 4556667899999999999999999999998666443210 Q ss_pred ccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccC Q lcl|Aclame:pro 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~ 239 (404) .+++.++.+.|..|..++.... T Consensus 135 -------------------------------------------~~~~~~~~d~i~~A~~~lgd~~--------------- 156 (274) T protein:vir:95 135 -------------------------------------------VEADITKLTGLQTAIDKFNDED--------------- 156 (274) T ss_pred -------------------------------------------ccccccCHHHHHHHHHHhcccc--------------- Confidence 0123456777777776664321 Q ss_pred CccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcc Q lcl|Aclame:pro 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) Q Consensus 240 ~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~ 319 (404) ..-++++|||.++..|++|+.. +|.. ...+..+.+..|.+|.|.|+.|..-..+| T Consensus 157 -~~~~~ivv~p~~~~~L~k~~~~-~f~~-------~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~---------------- 211 (274) T protein:vir:95 157 -LEPMVLFISPLDAGKLRGDATT-NFTR-------ATELGDDVIVKGAFGEALGAVIVRSNKLE---------------- 211 (274) T ss_pred -ccccEEEeCHHHHHHHHhhccc-cccc-------cccccccceeccccceecCeEEEEeCCCC---------------- Confidence 1236899999999999999732 2331 12234588999999999999887644332 Q ss_pred cccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEe Q lcl|Aclame:pro 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVD 399 (404) Q Consensus 320 ~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~id 399 (404) ...++|+|-.|++.. ...+ .. +|-..|-.. ..+.|.+- +=||+-+++ T Consensus 212 ------------~~t~~l~~~gA~~~~--~~~~--~~-vE~~Rd~~~----~~d~i~~~------------~~y~~~~~~ 258 (274) T protein:vir:95 212 ------------AGTAILAKKGAVKLI--TKRD--FF-LETDRDPST----KTTALYSD------------KHYVAYLYD 258 (274) T ss_pred ------------CceEEEEeccceeee--ecCC--cc-ccccccccc----ccCEEEEe------------EEEEEEEEc Confidence 113578887776653 2211 12 444444332 11222221 124443333 Q ss_pred e--eecC Q lcl|Aclame:pro 400 T--AVKL 404 (404) Q Consensus 400 t--a~~~ 404 (404) - .|+| T Consensus 259 ~~~~v~~ 265 (274) T protein:vir:95 259 ESKAVKI 265 (274) T ss_pred CCcEEEE Confidence 2 2233 No 26 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=98.84 E-value=1.7e-10 Score=74.21 Aligned_cols=227 Identities=12% Similarity=0.087 Sum_probs=136.5 Q ss_pred cccCCCCcEEEEEEeeccccC--ceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHH Q lcl|Aclame:pro 62 DLNKQAGDEVTFSIMHKLSKR--PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTY 139 (404) Q Consensus 62 dL~k~~Gd~v~f~L~~~L~G~--gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w 139 (404) |=.-+.||+|+|+- ..|+ .+..++.+. .|.|+..+++..|.+...++.+.. +......-|+..++...|+.- T Consensus 1 ~~~~~~Gdtit~P~---~iGda~~v~eG~~i~--~~~l~~t~~~atIk~~gk~~~itD-~a~l~~~gDp~~ea~~Q~~~~ 74 (231) T protein:vir:73 1 ENGINLANLCEYPN---DIGDAADVAEGGEIS--LDKIGTTTKSVTIKKAAKGTEITD-EAALSGYGDPIGESNKQLGLS 74 (231) T ss_pred CccccCCceEEecc---cccchhhhcCCCcCC--hhhccccceeeeEeeeccceeeeH-HHHhhccCchHHHHHHHHHHH Confidence 45568899999993 3454 333444444 788999999999999999998875 355556779999999999999 Q ss_pred HHHHHHHHHHHHhhhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHH Q lcl|Aclame:pro 140 FNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLF 219 (404) Q Consensus 140 ~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~ 219 (404) +++..|..++-.+.+++ ++.+=.++.+.|.+|..+ T Consensus 75 iA~kvD~di~~~~~~a~---------------------------------------------l~~~~~~t~d~i~~A~~~ 109 (231) T protein:vir:73 75 LANKVDDDLLKAAKTTS---------------------------------------------QTVSTKANVDGVQAALDI 109 (231) T ss_pred HHHhhhHHHHHhhcccc---------------------------------------------ccccccccHHHHHHHHHH Confidence 99999999875443222 111112578888888776 Q ss_pred HHhcCCCCCceEecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEec Q lcl|Aclame:pro 220 IDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKY 299 (404) Q Consensus 220 a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~ 299 (404) ... + +++-+|++|||.++.+||+++.+ .... ..+-++.+++|.+|++.||.|..- T Consensus 110 fgd-------------e---~~~~~vivv~p~~~~~Lrk~~~~---~~~~------~~~g~~i~~~G~iG~i~G~~Vi~S 164 (231) T protein:vir:73 110 FND-------------E---DAQAYVLIVNPKDAAKIRKDANA---KNIG------SEVGANALINGTYADVLGAQIVRS 164 (231) T ss_pred hcc-------------c---cccceEEEEcchHHHhhhhccch---hhhh------hhhccceeeecccceEcceEEEEc Confidence 632 1 22337899999999999999853 2211 224468899999999999998776 Q ss_pred CceeeeeccceeEEeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHH-HHHHHhch Q lcl|Aclame:pro 300 AGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEI-AISWINGL 378 (404) Q Consensus 300 ~~~~irf~~~~~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i-~i~~i~G~ 378 (404) +++|. ++. +.--++....|+.+.- +.+ +. +|...|-..+.-. ....++++ T Consensus 165 ~~~~~----~~~--------------------~~~~~i~~~gAl~~~~--k~~--~~-vEtdRd~~~k~~~i~~~~~y~v 215 (231) T protein:vir:73 165 KKLAE----GSA--------------------LMFKIVSNSPALKLVL--KRG--VQ-VETDRDIVTKTTVITADEHYAA 215 (231) T ss_pred CCCCC----Cce--------------------eeeeEEeeccceeeee--ccc--ce-eeccccccccccEEEEeEEEEE Confidence 66541 000 0011333344444332 211 12 5544444333221 11111111 Q ss_pred hhccccCCCCCceEEEEEEE-eeee Q lcl|Aclame:pro 379 KKIRFPEKSGKMQDHGVIAV-DTAV 402 (404) Q Consensus 379 ~K~rF~~~~g~~~DfGvi~i-dta~ 402 (404) +=. .|=+|+.| +..+ T Consensus 216 ~l~---------~~~~vv~~t~~g~ 231 (231) T protein:vir:73 216 YLY---------DLTKVVNITFTGV 231 (231) T ss_pred EEE---------cCccEEEEEeecC Confidence 100 01122222 1122 No 27 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.84 E-value=5.5e-10 Score=71.34 Aligned_cols=334 Identities=12% Similarity=0.095 Sum_probs=178.6 Q ss_pred CCCcCchH-HHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSAQ-ANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~~~~~~~-a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) =|..|... -.-.|+. ....+-+.++++|++.+...-++.+.+. ..++..++. .|++|.|+-+... T Consensus 3 ~~~~~~~~~t~~~~~~---~~~~~~a~~ie~f~g~V~~~f~~~s~~~---------~~~~~~~~~--~G~sv~i~~ig~~ 68 (347) T protein:vir:15 3 NIQGGQQIGTNQGKGQ---SAADKLALFLKVFGGEVLTAFARTSVTM---------PRHMLRSIA--SGKSAQFPVIGRT 68 (347) T ss_pred ccccCCccccccccCC---CcchHHHHHHHHHHHHHHHHHHHhhhhh---------hcccccccc--ccceeEeeeccce Confidence 11222211 1112221 1222233478999999887777665542 222333443 4999999999999 Q ss_pred ccCceecCceeeeehhhhhhcccEEEEeee---ccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 80 SKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGAR 156 (404) Q Consensus 80 ~G~gv~Gd~~leGnee~L~~~sd~v~Idq~---R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~r 156 (404) +....+..+.+.++.++.+....+|.||+. ++.|+ .+++-.+++|+|.+.-.....=+++..|+.++.+|.++. T Consensus 69 t~~~~~~g~~l~~~~~~~~~~e~~ltID~~~~~~~~Vd---dlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~ 145 (347) T protein:vir:15 69 KAAYLKPGENLDDKRKDIKHTEKVIHIDGLLTADVLIY---DIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLV 145 (347) T ss_pred eeeeeccCCCCCCCCCCCccceEEEEechhhhhhHHhh---hHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 988888888899998889999999999987 45664 578888999999999999999999999999999997765 Q ss_pred cccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCcc Q lcl|Aclame:pro 157 GDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDE 236 (404) Q Consensus 157 g~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~ 236 (404) ..... ...+. ..+.. ..+.+- .............+.+.+ ++.|..|.+.+.+..-| .+ T Consensus 146 ~~~~~--~~~~~-~~~g~-----~~~~~~-----~~~~~~~~~~~~~~~~~i-~d~~~~a~~~Lde~~VP-------~~- 203 (347) T protein:vir:15 146 NLPDA--SNENI-EGLGK-----PTVLTL-----VKPTTGDLTDPVELGKAI-IAQLTIARASLTKNYVP-------AA- 203 (347) T ss_pred hcccc--ccccc-cccCc-----cccccc-----cccccccchhhhhHHHHH-HHHHHHHHHHHhhcCCC-------cc- Confidence 31000 00000 00000 000000 000000000011111222 55666666677665433 11 Q ss_pred ccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEee- Q lcl|Aclame:pro 237 LHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVS- 315 (404) Q Consensus 237 ~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~- 315 (404) . ++++++|.++..|..++.+- . + . -+....+-+|.+|.|+|+.|.+.+++|. ..+...+.. T Consensus 204 -----g-R~~vv~P~~y~~LL~~~~~~------~-~--d-~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~--~~~t~~~~~~ 265 (347) T protein:vir:15 204 -----D-RTFYTTPDNYSAILAALMPN------A-A--N-YQALIDHERGTIRNVMGFEVVEVPHLTA--GGAGDTREDA 265 (347) T ss_pred -----C-CEEEeCHHHHHHHhcccccc------c-c--c-ccccccccceEEEEEeceEEEecccccc--cccccccccc Confidence 1 57999999999999998532 1 1 1 1123457789999999999999988773 222111110 Q ss_pred ----cCccccccc-ccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCc Q lcl|Aclame:pro 316 ----ENNLTATTK-EVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKM 390 (404) Q Consensus 316 ----~~~~~a~~~-~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~ 390 (404) .+...+... ..+..+...-+|++-..|++.+=.+.--+.-.|.++. ++ ..|-....+|.+=+| + T Consensus 266 ~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~~--~~--d~i~~~~~~G~~vlr-P------ 334 (347) T protein:vir:15 266 PADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANY--QA--DQIIAKYAMGHGGLR-P------ 334 (347) T ss_pred cccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccchh--hh--hhhehhhhcCCceec-c------ Confidence 001111110 0111122233566666666655433211111232211 11 223333344554444 1 Q ss_pred eEEE-EEEEeeeec Q lcl|Aclame:pro 391 QDHG-VIAVDTAVK 403 (404) Q Consensus 391 ~DfG-vi~idta~~ 403 (404) |.. +|.+.-... T Consensus 335 -~~av~~~~~~~~~ 347 (347) T protein:vir:15 335 -EAAGAIVLPKVSE 347 (347) T ss_pred -ccEEEEecCCCCC Confidence 221 111111111 No 28 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.83 E-value=2.7e-09 Score=67.55 Aligned_cols=325 Identities=12% Similarity=0.077 Sum_probs=168.5 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||.-- ..-.-.|+++ ..-.+-++|.|.+.+...-+.++.+... +++.++ ..|.++.|+-+.... T Consensus 1 ms~~~-~~t~~~~~~s----~~d~al~le~f~geV~~af~~~s~~~~~---------~~~rti--~~g~s~~~~~iG~~~ 64 (335) T protein:vir:78 1 MSFLN-DLTRPNYAGK----NADVDIHLEEHLGIVDKHFAYTSKFAPL---------MNIRDL--RGSNVVRLDRLGNVE 64 (335) T ss_pred CCccc-cccccccccc----cchhhhhhhhhhhHHHHHHHHhhhhccc---------cceeee--ccceeEEEeeeeeee Confidence 66541 1001111100 0000114899999887777766555322 222234 449999999988887 Q ss_pred cCceecCceeeeehhhhhhcccEEEEee---eccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQ---GRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq---~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg 157 (404) -...+=.+.+.|+ ........|.||+ .||.|+- +++-.+.+|+|++.-..+..=+++..||.+|.+++=+.. T Consensus 65 ~~~~~pG~~l~~~--~~~~~k~~itID~ll~a~~~Vdd---lDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~ 139 (335) T protein:vir:78 65 AKGRRAGEELERS--RVVNDKWNLTVDTLLYLRHQFDH---QDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAA 139 (335) T ss_pred ecccccCcccCCC--CcccCCeEEEecceeechhhHhh---HHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 6666666777665 4566777899999 6777753 677888999999999999999999999999988753321 Q ss_pred ccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccc-cCHHHHHHHHHHHHhcCCCCCceEecCcc Q lcl|Aclame:pro 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADI-FSIGLVDNLSLFIDEMAHPLQPVRLSGDE 236 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~-~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~ 236 (404) .......+ +.| ..+.+....|+..+. -....+..|.+.|... +...+ T Consensus 140 --~~a~~~~~----~~~------------------~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~--------l~ekd 187 (335) T protein:vir:78 140 --MDAPVDLE----DAF------------------SPGVLEKLDLTGLTAKEAAEKIVRMHRRVVET--------FIERD 187 (335) T ss_pred --cccccccC----CCc------------------CCCcceeeeeccccccccHHHHHHHHHHHHHH--------HHhcc Confidence 11111111 011 011111111111110 0122333444433221 11111 Q ss_pred ccCCcc-EEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEee Q lcl|Aclame:pro 237 LHGEDP-YYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVS 315 (404) Q Consensus 237 ~~~~~~-~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~ 315 (404) +....+ -.|++|+|.||+.|..++.+ .+ ..-...+..+++=.|.++..+||.|.+-+++|-. .++.. T Consensus 188 vP~~~~~~rv~vv~P~~y~~Ll~~~~l---~n----~~~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~--~~t~~--- 255 (335) T protein:vir:78 188 LGDAVYSEGLTPMSPRVFSLLLEHDKL---MS----VEYQATGATNDYVKSRVAILNGVKVLETPRFATK--AISAH--- 255 (335) T ss_pred CCCCCCCccEEEeChHHHHHHhccccc---cc----ccccccccccccccceeEEeeceEEEeeccCCCC--CCccc--- Confidence 211111 27999999999999999743 11 1001112346778899999999999999988732 11111 Q ss_pred cCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEE Q lcl|Aclame:pro 316 ENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGV 395 (404) Q Consensus 316 ~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGv 395 (404) ..+...+..+....=.-++++=..|++.+=-..=.++..|.+..+ -.-|-....+|..=.|- |..+ T Consensus 256 --~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~----~~~i~~~~a~G~g~lRP--------e~a~ 321 (335) T protein:vir:78 256 --PLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQF----SWVLDTFQMYNIGARRP--------DTAG 321 (335) T ss_pred --cccccCCcccccccceEEEEEecceEEEEEEEecccceeeccchh----hHhhhHHHHcCCcccCc--------ceEE Confidence 111111111111111123443344444332221123333322221 23455556677776664 4444 Q ss_pred EEEeeeecC Q lcl|Aclame:pro 396 IAVDTAVKL 404 (404) Q Consensus 396 i~idta~~~ 404 (404) ..--|-+.- T Consensus 322 ~i~~tg~~~ 330 (335) T protein:vir:78 322 AIELKGIEA 330 (335) T ss_pred EEEecCCCc Confidence 333333332 No 29 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=98.82 E-value=5.8e-10 Score=71.23 Aligned_cols=258 Identities=14% Similarity=0.084 Sum_probs=152.5 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) +|...+-. - =+.|+..+.....++.-+. .+..+..+|.-.+|++|+|+....+. T Consensus 5 ~T~~~~~i----------------i--Pev~~~~v~~~~~~~~~~~--------~~~~~~~~l~g~~G~tv~ip~~~~~g 58 (274) T protein:vir:93 5 ITKTSNQI----------------I--PEVLAPMMQAQLEKKLRFA--------SFAEVDSTLQGQPGDTLTFPAFVYSG 58 (274) T ss_pred ceehhhee----------------c--hHHHHHHHHHHHHhhhhhc--------ccccccccccCCCCCEEEEEeeccCC Confidence 11111100 1 1445555433322221111 12233467777899999999987663 Q ss_pred cC-ceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) Q Consensus 81 G~-gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~ 159 (404) .. .+..++.+. .+.+...++++.|++...++..... +...+..|+..++...+++.|++..|..++-.+.++.. T Consensus 59 ~~~~~~eg~~i~--~~~it~~~~~~~i~~~~~~~~i~D~-~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~-- 133 (274) T protein:vir:93 59 DAQVVAEGEKIP--TDILETKKREAKIRKIAKGTSITDE-ALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-- 133 (274) T ss_pred CcccccCCCccc--ccccccceeEEEeeeecccccccHH-HHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-- Confidence 22 233334443 7889999999999998888877653 55557789999999999999999999999866644321 Q ss_pred ccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccC Q lcl|Aclame:pro 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~ 239 (404) -..++.++.+.|..|..++.... T Consensus 134 ------------------------------------------~~~~~~~~~d~i~dA~~~l~d~~--------------- 156 (274) T protein:vir:93 134 ------------------------------------------TVNADITKLNGLQSAIDKFNDED--------------- 156 (274) T ss_pred ------------------------------------------cccccccCHHHHHHHHHHhhhcc--------------- Confidence 00134456777777776664321 Q ss_pred CccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcc Q lcl|Aclame:pro 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) Q Consensus 240 ~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~ 319 (404) ....+++|||..+..|++|+.. +|.. . ...-++.+..|.+|.|.|+.|..-..+|. T Consensus 157 -~~~~~ivv~p~~~~~L~k~~~~-~f~~---~----s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~--------------- 212 (274) T protein:vir:93 157 -LEPMVLFINPLDAGKLRGDAST-NFTR---A----TELGDDIIVKGAFGEALGAIIVRTNKLEA--------------- 212 (274) T ss_pred -CCccEEEeCHHHHHHHHhhhhh-cccc---c----ccccccceeecccceecCeeEEEcCCCCc--------------- Confidence 1236899999999999999742 2331 1 12234789999999999999887665541 Q ss_pred cccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEe Q lcl|Aclame:pro 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVD 399 (404) Q Consensus 320 ~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~id 399 (404) .-++|+|..|++.+-.+ + .. +|...|-..+ .+.|.|. +=||+-+++ T Consensus 213 -------------~t~~l~~~gai~~~~~~--~--~~-vE~~Rd~~~~----~d~i~~~------------~~y~~~~~~ 258 (274) T protein:vir:93 213 -------------GTAILAKKGAVKLILKR--D--FF-LEVARDASTK----TTALYSD------------KHYVAYLYD 258 (274) T ss_pred -------------ceEEEEeCCeEEEEecC--C--cc-cccccchhhc----ccEEEEE------------EEEEEEEEc Confidence 12578888877765332 1 12 3443432211 1222221 113332222 Q ss_pred e--eecC Q lcl|Aclame:pro 400 T--AVKL 404 (404) Q Consensus 400 t--a~~~ 404 (404) - .+++ T Consensus 259 ~~~~v~~ 265 (274) T protein:vir:93 259 ESKAVKI 265 (274) T ss_pred CCceEEE Confidence 2 1222 No 30 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=98.79 E-value=9.1e-09 Score=64.67 Aligned_cols=313 Identities=13% Similarity=0.103 Sum_probs=153.7 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) =|--|++- +...+.+---++|++.+...-.++.-+. ...+..+.+-.+||+|+|+.....+ T Consensus 5 ~~~~~~~~----------~t~~v~~fipei~s~~i~~~l~~~~v~~---------~~~~d~~~~~~~Gdtv~ip~~g~~~ 65 (341) T protein:vir:94 5 NTITGPSI----------NTQRGQQFIPEQWLSEVQMFRKAKMLDT---------SVVKTWGAQVKKGDTFHVPRISELG 65 (341) T ss_pred hhhccccc----------cchhHHHHHHHHHHHHHHHHHHhhcchh---------hccccccccccCCceEEEeccCcce Confidence 01111110 0001111112555555432222221111 1122222333459999999765544 Q ss_pred cCceecCceeeeehhhhhhcccEEEEeeec-cccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~R-~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~ 159 (404) -.....+..+. .+++.-.+.+|.||+.+ .++.+. .+++..+..|+|.+........+++..|+.++-.++++.+.. T Consensus 66 ~~d~~~~~~i~--~~~~~~~~~~itiD~~~~~~~~i~-d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~ 142 (341) T protein:vir:94 66 VEDKATDVPVG--VQPVNDTDFVITVDTDRTTAVALD-DLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTA 142 (341) T ss_pred eeeecCCCccc--cccccCceEEEEEeeeeecceeec-hHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 33333334443 56777889999999986 555554 568888899999999999999999999999887665544210 Q ss_pred ccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccC Q lcl|Aclame:pro 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~ 239 (404) . .+.+..++. ......+.++.+.|..+.+.+++..-| .+. T Consensus 143 ~------------------~~~~~~~~~------------~~t~~~~~~~~~~i~~a~~~Lde~~VP-------~~g--- 182 (341) T protein:vir:94 143 S------------------QNVFSSSNG------------AITGNGQAFSFAVFLAARRLLLEADVP-------EEK--- 182 (341) T ss_pred c------------------CccccCccc------------cccCchhhhhHHHHHHHHHHHhhcCCC-------ccC--- Confidence 0 011111111 011234557778888888888876533 111 Q ss_pred CccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcc Q lcl|Aclame:pro 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) Q Consensus 240 ~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~ 319 (404) ++++|+|.++.+|++|+. |..... +.++.|-+|.+|.|.|+-|.+.+++|. ........... T Consensus 183 ----R~lvv~P~~~~~Ll~~~~---~~~~~~-------~g~~~l~~G~ig~i~G~~V~~Sn~lp~--~~~~~~~~~~~-- 244 (341) T protein:vir:94 183 ----IVLLISPGQESALFTIPQ---FISKDF-------INNAPIAQGQIGSLMGVRVIRTSLIGN--NSATGWRNGAP-- 244 (341) T ss_pred ----CEEEeCHHHHHHHhhchh---hhhhhc-------cccchhheeeeeeEeceEEEEeccccc--ccccccccccc-- Confidence 567899999999999984 433221 124568899999999999999988763 22111111000 Q ss_pred cccccccccccchhhheeecccee------EEEeee-c----CCCCccee--------ecccccCch---hHHHHHHHhc Q lcl|Aclame:pro 320 TATTKEVAAATNIDRAMLLGAQAL------ANAYGQ-K----AGGHFNMV--------EKKTDMDNR---TEIAISWING 377 (404) Q Consensus 320 ~a~~~~~aa~~~v~ralLlGaQAl------~~A~g~-~----~g~r~~w~--------Ee~~D~g~~---~~i~i~~i~G 377 (404) .+.... ....|.-..-+|.+.. +++|-+ + -.+++.|- .-..+|.-+ ..|-....+| T Consensus 245 ~~~~~~--~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~G 322 (341) T protein:vir:94 245 TIAPAE--ATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMVGRQAYG 322 (341) T ss_pred ceeccc--ccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhhhhhhhhc Confidence 000000 0000111111111110 111100 0 00111110 001122211 1222444555 Q ss_pred hhhccccCCCCCceEEEEEEEeeeecC Q lcl|Aclame:pro 378 LKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 378 ~~K~rF~~~~g~~~DfGvi~idta~~~ 404 (404) .+=+|= +..| -|-+.+.- T Consensus 323 ~~~lrp--------~~~v-~~~~~~~~ 340 (341) T protein:vir:94 323 ARLYRP--------LHAV-NIHTTGDT 340 (341) T ss_pred ccccCc--------ceeE-EEecCcCC Confidence 555541 3333 33222222 No 31 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=98.79 E-value=7.8e-10 Score=70.52 Aligned_cols=262 Identities=15% Similarity=0.088 Sum_probs=153.4 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |..-.... ...=.| +.|+..+.....++..+ ..+..+.++|+-.+|++|+|+....++ T Consensus 1 ma~~~T~l------------~d~iiP--ev~~~~v~~~~~~~l~~--------~~~~~~d~~l~g~~G~tv~iP~~~~ig 58 (274) T protein:vir:12 1 MAQGLTKT------------SNQIIP--EVLAPMMQAQLEKKLRF--------ASFAEVDSTLQGQPGDTLTFPAFVYSG 58 (274) T ss_pred CCcceeeh------------hhhhch--HHHHHHHHHHHHhhhhh--------cccceecccccCCCCCEEEEeeecCCC Confidence 21110000 000011 45555543322222111 123344567877899999999987553 Q ss_pred c-CceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 81 K-RPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) Q Consensus 81 G-~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~ 159 (404) . .-+..++.++ .+.|+..++++.|++...++.... .+..-+.-|+..++.+.++.+|++..|..++..+.++... T Consensus 59 ~a~~~~~g~~i~--~~~lt~~~~~~~i~~~~~~~~i~D-~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~- 134 (274) T protein:vir:12 59 DAQVVAEGEKIP--TDILETKKREAKIRKIAKGTSITD-EALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT- 134 (274) T ss_pred ccccccCCCccc--hhhcccceeeEEeeeecceeeecH-HHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc- Confidence 2 1233334443 778999999999999888887765 3555566899999999999999999999998666432210 Q ss_pred ccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccC Q lcl|Aclame:pro 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~ 239 (404) .+++.++.+.|..|..++.... T Consensus 135 -------------------------------------------~~~~a~~~d~i~dA~~~lgd~~--------------- 156 (274) T protein:vir:12 135 -------------------------------------------VNADITKLNGLQSAIDKFNDED--------------- 156 (274) T ss_pred -------------------------------------------ccccccCHHHHHHHHHHhcccc--------------- Confidence 0123467787877776654321 Q ss_pred CccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcc Q lcl|Aclame:pro 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) Q Consensus 240 ~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~ 319 (404) ...++++|||.++..|++|+.. +|.. ...+..+.+.+|.+|.|.|+.|..-..+|. T Consensus 157 -~~~~~ivv~p~~~~~L~k~~~~-~fv~-------~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~--------------- 212 (274) T protein:vir:12 157 -LEPMVLFINPLDAGKLRGDAST-NFTR-------ATELGDDIIVKGAFGEALGAIIVRSNKLEA--------------- 212 (274) T ss_pred -ccccEEEeCHHHHHHHHhhhhh-hccc-------cccccccceecccceeecCeeEEEeCCCCc--------------- Confidence 1236899999999999999732 2332 122345788999999999999877665431 Q ss_pred cccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEe Q lcl|Aclame:pro 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVD 399 (404) Q Consensus 320 ~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~id 399 (404) .-++|+|..|++..-. .+ .. +|-..|-..+. +.|.+- +=|||-+++ T Consensus 213 -------------~t~~l~~~gA~~~~~~--~~--~~-vE~~Rd~~~~~----d~i~~~------------~~y~~~~~~ 258 (274) T protein:vir:12 213 -------------GTAILAKKGAVKLILK--RD--FF-LEVARDASTKT----TALYSD------------KHYVAYLYD 258 (274) T ss_pred -------------ceEEEEeccceeeeec--CC--ce-eccccchhhcc----cEEEee------------eEEEEEEEc Confidence 1247888877665422 22 12 44444332211 222221 124444433 Q ss_pred e--eecC Q lcl|Aclame:pro 400 T--AVKL 404 (404) Q Consensus 400 t--a~~~ 404 (404) - .|++ T Consensus 259 ~~~vv~~ 265 (274) T protein:vir:12 259 ESKAVKI 265 (274) T ss_pred CCceEEE Confidence 2 1222 No 32 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.76 E-value=3.9e-09 Score=66.70 Aligned_cols=286 Identities=10% Similarity=0.013 Sum_probs=156.6 Q ss_pred cEEEEecccCCCCcEEEEEEeeccccCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHH Q lcl|Aclame:pro 56 PVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTL 135 (404) Q Consensus 56 ~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~ 135 (404) .|+ .++ .|.++.|+-+-..+-...+=.+.+.|+-+++.-...+|.||+.--.=-.=..+++...++|+|.+.-.. T Consensus 1 ~vr---~i~--~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~ 75 (324) T protein:vir:99 1 MTR---TIT--SGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQ 75 (324) T ss_pred Cee---eee--cCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHHH Confidence 333 343 399999999988876677777788888788888888899999764311112467888899999999999 Q ss_pred HHHHHHHHHHHHHHHHhhhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccC----HH Q lcl|Aclame:pro 136 LGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFS----IG 211 (404) Q Consensus 136 L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s----~~ 211 (404) ...=|++..|+.+|.++++...... |....+.. .+.... ++..+ ..+.++.-+ ++ T Consensus 76 ~G~aLA~~~Dq~i~~~~a~~~~~~a------~~~~~~~~----~~g~~~-----~~~~~------~~~~~~~~~~~~~~d 134 (324) T protein:vir:99 76 MGEALAMAADVANYAEMAKLVNSRK------ETTNENIE----GLGAAS-----LVKIT------GKKEDPAKYGTQVIQ 134 (324) T ss_pred HHHHHHHHHHHHHHHHHHHhhhccc------ccccCCcc----cCCccc-----eeccc------ccccccccCHHHHHH Confidence 9999999999999999875442110 11000000 000000 00000 011112222 34 Q ss_pred HHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEE Q lcl|Aclame:pro 212 LVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMW 291 (404) Q Consensus 212 ~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ 291 (404) .|..|.+.+++..-|. + . ++++|.|.|+..|+.++... ....+..+.+=+|.+|.+ T Consensus 135 ai~~a~~~Lde~~VP~-------~------g-R~~vv~P~~y~~Ll~~~~~~----------~~~~~~~~~~~~G~V~~i 190 (324) T protein:vir:99 135 ALTYARAAFAKKYIPA-------G------D-RTFYTDPDTYSAILAALMPN----------AANYAALIDPETGNIRNV 190 (324) T ss_pred HHHHHHHHHhhcCCCC-------C------C-CEEEeChHHHHHHhhccccc----------ccccccccceecceEEEE Confidence 5556666666654331 1 1 57999999999999776321 111233467888999999 Q ss_pred cCEEEEecCceeeeeccce-eEEeecCccccccc--------ccccccchhhheeeccceeEEEeeecCCCCcceeeccc Q lcl|Aclame:pro 292 RNILVRKYAGMPIRFYQGS-KVLVSENNLTATTK--------EVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKT 362 (404) Q Consensus 292 ngvii~~~~~~~irf~~~~-~~~~~~~~~~a~~~--------~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~ 362 (404) +|+.|.+-+++|.. .+. .....+....+.++ .+.....-.++|++=.+|++..=...--+.-+|.|+. T Consensus 191 ~Gf~V~~Sn~lp~~--~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~- 267 (324) T protein:vir:99 191 MGFEVVETPHMTAQ--MVTNPTDAFDGTGHIFPATGDSTTTGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPEY- 267 (324) T ss_pred eceEEEecCCcccc--ccccccccccccccccccccccccccccccccCceeEEEEehhheEEEeeecceecceechhh- Confidence 99999999988742 110 11111111111111 1111111224566666655444332212233343321 Q ss_pred ccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEeeeecC Q lcl|Aclame:pro 363 DMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 363 D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idta~~~ 404 (404) .-.-|-....+|.+=+|- +=-++|.+..-+.= T Consensus 268 ---~~d~i~~~~a~G~~~lRP-------e~a~~v~l~~~~~~ 299 (324) T protein:vir:99 268 ---QADQIIAKYAMGHGGLRP-------EAVGAIIFEDGETP 299 (324) T ss_pred ---HHHhhhhhhhhcCccccc-------ceEEEEEEccCccc Confidence 123344455566665553 22344443332210 No 33 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=98.76 E-value=1.2e-09 Score=69.41 Aligned_cols=255 Identities=17% Similarity=0.114 Sum_probs=148.9 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |+--- +... =.| +.|+..+.....++..|. . .....++|.-.+|++|+|+... +. T Consensus 1 Ma~T~------------~~d~--I~P--ev~~~~V~e~~~~~~~~~-~-------~~~~d~~L~g~~G~ti~~P~~~-~i 55 (270) T protein:vir:95 1 MTQTK------------KANL--INP--EVLANVVSAQMQNAIRFT-P-------YAVTDDTLVGQPGDTITRPKYA-YI 55 (270) T ss_pred CCcee------------hhhh--cch--HHHHHHHHHHHHhHHhhc-c-------ccccccccCCCCCCEEEeeeec-CC Confidence 22110 0000 012 334444322222222221 1 1122467888899999999987 55 Q ss_pred cC--ceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 81 KR--PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (404) Q Consensus 81 G~--gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~ 158 (404) |+ .+..++.++ .+.|+...++.+|-+...++..... +..-+.-|...++...++.+|++..|..++-.|.|+... T Consensus 56 gdae~~~eg~~i~--~~~lt~~~~~a~i~~~gk~~~itD~-a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~ 132 (270) T protein:vir:95 56 GAAEDLQEGVAMD--TTQMSMTTTKVTVKETGKAVEVTQT-AIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQT 132 (270) T ss_pred CccccccCCCccc--hhhcccchheeeeehhhCcceecHH-HHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 54 344445555 7799999999999998888887653 444444589999999999999999999998777665421 Q ss_pred cccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCcccc Q lcl|Aclame:pro 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELH 238 (404) Q Consensus 159 ~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~ 238 (404) .+..++.+.+..|..+.- |+ T Consensus 133 ---------------------------------------------~~~~~t~~~~~dA~~~lg-------------d~-- 152 (270) T protein:vir:95 133 ---------------------------------------------ATVSADATGILDAIEVFN-------------SE-- 152 (270) T ss_pred ---------------------------------------------cccccCHHHHHHHHHHhc-------------cc-- Confidence 111234454545544432 11 Q ss_pred CCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEE-ecCceeeeeccceeEEeecC Q lcl|Aclame:pro 239 GEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVR-KYAGMPIRFYQGSKVLVSEN 317 (404) Q Consensus 239 ~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~-~~~~~~irf~~~~~~~~~~~ 317 (404) .+...+++|||.++..||++. |.+ ..++-++.+.+|.+|.|.|+.+. .-. + T Consensus 153 -~~~~~~i~vhs~~~~~Lrk~~----~~~-------~~~~~~~~~~~G~ig~~~G~~Viv~s~-~--------------- 204 (270) T protein:vir:95 153 -NDEDYVLYVNPKDYNKLVKSL----FKV-------GGNVQDRAISKGDLVEIVGVSDIVKSK-R--------------- 204 (270) T ss_pred -cCCCcEEEEcHHHHHHHHhhh----ccc-------ccccccchhcccccceecceeEEEeCC-C--------------- Confidence 222478999999999999986 432 12345678999999999998542 211 0 Q ss_pred cccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEE Q lcl|Aclame:pro 318 NLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIA 397 (404) Q Consensus 318 ~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~ 397 (404) +.. ..++|.+..|+++. ...+ +. +|...|-..+. ..|.+ -+-|+|-. T Consensus 205 -----~~~-------~~~~l~~~gAi~~~--~~~~--~~-vEtdRd~~~~~----d~i~~------------~~~y~v~~ 251 (270) T protein:vir:95 205 -----VSE-------NTAFLQRYGAMEIV--NKKK--PE-AYTDFDILKRT----HLLST------------NYHYSVNL 251 (270) T ss_pred -----CCc-------eeEEEEeccceeee--ecCC--ce-eeeccchhhcc----cEEEe------------eeEEEEEE Confidence 011 13578887776654 3322 22 45444432211 11111 13566666 Q ss_pred Eeee--ecC Q lcl|Aclame:pro 398 VDTA--VKL 404 (404) Q Consensus 398 idta--~~~ 404 (404) ++.. |+| T Consensus 252 ~~~skvv~~ 260 (270) T protein:vir:95 252 KDETGVVKV 260 (270) T ss_pred EccceEEEE Confidence 6643 444 No 34 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=98.74 E-value=1.4e-09 Score=69.18 Aligned_cols=265 Identities=15% Similarity=0.081 Sum_probs=145.8 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) +|..++ +| .-+.|+..+.....++.-+ + .......+|+-++|++|+|+....++ T Consensus 5 ~T~~~~----------~i--------iPev~s~~v~~~~~~~~v~----~----~~~~~~~~l~g~~G~tv~ip~~~~~g 58 (278) T protein:vir:80 5 TTKLAN----------LI--------DPEVMGPMISAKLPKAIKF----G----KIAPIDNSLEGQPGSEITVPKYKYIG 58 (278) T ss_pred ceehhh----------ee--------cHHHHHHHHHHHHHHhhhh----c----ccceecccccCCCCCEEEEeeeccCC Confidence 111111 11 1244555543222211111 1 12233567777889999999987664 Q ss_pred cC-ceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) Q Consensus 81 G~-gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~ 159 (404) .. -+..++.+. .+.|+..++++.|++...++.... .+..-+..|+..++...++.+|++..|..++-+|.|+.... T Consensus 59 ~a~~~~~g~~i~--~~~lt~~~~~~~i~~~~~a~~v~D-~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~ 135 (278) T protein:vir:80 59 DAQDVAEGAAID--YSALETESVKHGIKKAGKGVKLTD-ESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEV 135 (278) T ss_pred cceeecCCCcCc--ccccccceeeEeeehhhccccccH-HHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 22 243344443 678999999999999888887765 46666788999999999999999999999998887654210 Q ss_pred ccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccC Q lcl|Aclame:pro 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~ 239 (404) -.+++. ...| -.++.+-.+..+....+.| T Consensus 136 ----------------------~~~~t~---------------~~~~-~~~~~~~da~~~l~~~~~~------------- 164 (278) T protein:vir:80 136 ----------------------KGAINI---------------GLID-KIENTFTDAPDAIEDESIT------------- 164 (278) T ss_pred ----------------------cccccc---------------chhh-hHHHHHHHHHHhhcccCCC------------- Confidence 000110 0000 0123333333333222111 Q ss_pred CccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcc Q lcl|Aclame:pro 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) Q Consensus 240 ~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~ 319 (404) ...+++|||.++..|++++.. +|.. + ...-++.+.+|.+|.|.|+.|....++|. T Consensus 165 --~~~~ivv~p~~~~~L~k~~~~-~~~~----~---~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~--------------- 219 (278) T protein:vir:80 165 --TTGVLFLNYKDTAKLREEAAG-SWTK----A---SQLGDDLLVKGAFGELLGWEIVRTKKLAD--------------- 219 (278) T ss_pred --cccEEEECHHHHHHHHhhhhh-hccc----c---ccccccceeeccceeecceeEEEcCCCCc--------------- Confidence 124689999999999999742 1221 1 11224567889999999999988776641 Q ss_pred cccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEe Q lcl|Aclame:pro 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVD 399 (404) Q Consensus 320 ~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~id 399 (404) ..++|++..|++..-.+ +. . +|-..|-.. ..+.|.+. +=||+-+++ T Consensus 220 -------------~t~~l~~~gAi~~~~~~--~~--~-vE~~Rd~~~----~~d~i~~~------------~~yg~~v~~ 265 (278) T protein:vir:80 220 -------------GNALAVKAGALKTFLKR--NL--L-AESGRDMDH----KLTKFNAD------------QHYAVALVD 265 (278) T ss_pred -------------ceEEEEeccceeeeecC--Cc--c-cccccchhh----ccceeeee------------eEEEEEEEc Confidence 12477887775543222 21 1 332222211 12222221 113333331 Q ss_pred --eeecC Q lcl|Aclame:pro 400 --TAVKL 404 (404) Q Consensus 400 --ta~~~ 404 (404) -.|+| T Consensus 266 ~~~~v~i 272 (278) T protein:vir:80 266 ETKAVKV 272 (278) T ss_pred CcceEEE Confidence 12222 No 35 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=98.73 E-value=7.6e-09 Score=65.10 Aligned_cols=265 Identities=13% Similarity=0.030 Sum_probs=140.3 Q ss_pred hccchH-HHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccccCceecCceeeeehhhhhhc Q lcl|Aclame:pro 22 NRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHA 100 (404) Q Consensus 22 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGnee~L~~~ 100 (404) +.++.+ -++|++.+...-.++..+... +.+--++....||+|+|+.....+-.-..+... ....+++... T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~~~l--------~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~-~~~~~~~~~~ 71 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVFANL--------VNREYEGIASKGNVVHIAGVVAPTVKDYKAAGR-QTSADAISDT 71 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccchhh--------hhccccccccCCcEEEEeecCcccccccccCCC-ccCccccccc Confidence 444443 367777665554444333222 222234455689999999876554322221111 2457889999 Q ss_pred ccEEEEeeec-cccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccceeecccccccccccc Q lcl|Aclame:pro 101 DFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMI 179 (404) Q Consensus 101 sd~v~Idq~R-~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~~~~~~~~~ 179 (404) +.++.||+.+ .++... .+++..+..||++..+. +..=+++..|+.++-.++++... T Consensus 72 ~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~ala~~vD~~i~~~~~~a~~~--------------------- 128 (273) T protein:vir:79 72 GVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGTA--------------------- 128 (273) T ss_pred eEEEEEeeecccceeec-cHHHHhhcccHHHHHHH-HHHHHHHHHHHHHHHHHhhcccc--------------------- Confidence 9999999964 567665 45667788899986665 45568899999887666543310 Q ss_pred CccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEchHHHHHHHhC Q lcl|Aclame:pro 180 NDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTS 259 (404) Q Consensus 180 N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~dLr~d 259 (404) |...+ .++... .++.|..|...+++..-|- +. ++++++|.++..|+++ T Consensus 129 ~~~~~----------------~~~~~~--~~~~i~~a~~~ld~~~vP~-------~~-------R~lvv~p~~~~~Ll~~ 176 (273) T protein:vir:79 129 LTGSA----------------PSDADD--AFDLIASALKELTKANVPN-------VG-------RVVVVNAEMAFWLRSS 176 (273) T ss_pred ccccc----------------ccchhh--HHHHHHHHHHHhhhccCCc-------cC-------cEEEECHHHHHHHhhc Confidence 11111 111111 2566777877777765431 11 5789999999999998 Q ss_pred cchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcccccccccccccchhhheeec Q lcl|Aclame:pro 260 TSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLG 339 (404) Q Consensus 260 ~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~~~~aa~~~v~ralLlG 339 (404) +.+ +.+... .|..+.|-+|.+|.|.|+.|.+..++|.. .+.+.... = T Consensus 177 ~~~--~~~~~~------~~~~~~l~~G~ig~~~G~~i~~s~~lp~~--~~~~~~a~-----------------------~ 223 (273) T protein:vir:79 177 GSK--LTSADT------SGDAAGLRAGTIGNLLGARIVESNNLRDT--DDEQFVAF-----------------------H 223 (273) T ss_pred hhh--hhhhhh------cccccceeeeEeeEEeceEEEeccccccc--CceEEEEE-----------------------e Confidence 641 332211 24567899999999999999998877631 11000000 0 Q ss_pred cceeEEEeeec-CCCCcceeecccccCchhHHHHHHHhchhhccc-----cCCCCC Q lcl|Aclame:pro 340 AQALANAYGQK-AGGHFNMVEKKTDMDNRTEIAISWINGLKKIRF-----PEKSGK 389 (404) Q Consensus 340 aQAl~~A~g~~-~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF-----~~~~g~ 389 (404) ..|+ ++.+. ..+...+.++. ++.. |-....+|.+=+|= -.+.|. T Consensus 224 ~~A~--~~a~~~~~~e~~r~~~~--~~~~--v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 224 PSAA--AYVSQIDTVEALRDQDS--FSDR--IRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ccce--eeeeehhhhhcccCccc--ceee--eeeeeeeeeEEecCceEEEEeccCC Confidence 0011 11110 00000111111 0110 11122233222221 001122 No 36 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=98.67 E-value=3.6e-09 Score=66.86 Aligned_cols=271 Identities=13% Similarity=0.083 Sum_probs=150.6 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |...+... +.++ .=+.|+..+.....++.-+. ....+..+|+..+|++|+++....+. T Consensus 1 MA~~~T~~------~~~~--------iPev~s~~v~~~~~~~~~~~--------~~~~~~~~~~g~~G~tv~iP~~~~~~ 58 (272) T protein:vir:98 1 MAVGTTKM------AQML--------DPEVLADMIDAEVGKAIRFA--------PLAEVDTTLEGQPGTTLTVPKWDYIG 58 (272) T ss_pred CCCccccc------hhee--------chHHHHHHHHHHHHHHhhhh--------ccccccccccCCCCCEEEEEEecCCC Confidence 43221111 1111 11345444333222222111 11222345777899999998876553 Q ss_pred cC-ceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) Q Consensus 81 G~-gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~ 159 (404) .. .+..++.+. .+.+.+.+.++.|.+..+.+..... ...++..|+.....+.|.+.|++..|..+|-.+.|+.. T Consensus 59 ~a~~v~eg~~i~--~~~~~~~~~~~~~~~~~~~~~itd~-~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~-- 133 (272) T protein:vir:98 59 DAEDVAEGEAIP--MTQLGFKKTTMTIKKAGKGVEITDE-AILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQ-- 133 (272) T ss_pred CcccccCCCccc--ccccccceEEEEeeeeeeeeeecHH-HHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-- Confidence 32 343344444 6789999999999998888877654 45668889999999999999999999999866644321 Q ss_pred ccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccC Q lcl|Aclame:pro 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~ 239 (404) .+ +...+++.|..|...+...+. T Consensus 134 ---------------------~~----------------------~~~~t~d~i~da~~~l~~~~~-------------- 156 (272) T protein:vir:98 134 ---------------------TV----------------------EATATVDGVSKALDIFNDEDD-------------- 156 (272) T ss_pred ---------------------cc----------------------ccccCHHHHHHHHHHHhccCC-------------- Confidence 01 111235556666665543321 Q ss_pred CccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcc Q lcl|Aclame:pro 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) Q Consensus 240 ~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~ 319 (404) +..+++|||..+..|+++... +|. ++ .....+.+.+|.+|.|.|+.+..-+.+|- T Consensus 157 --~~~~~vv~p~~~~~L~k~~~~-~~~---~~----~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~--------------- 211 (272) T protein:vir:98 157 --AETVIVMNPADASTLRLDAAK-EWL---GA----TEVGANRVVSGVYGEVLGVQIVRSRKCPK--------------- 211 (272) T ss_pred --CccEEEEcHHHHHHHHHhccc-ccc---cc----ccccccccccccchhhcCeeEEEcCCCCc--------------- Confidence 125799999999999988532 122 11 12334678899999999999988776541 Q ss_pred cccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEe Q lcl|Aclame:pro 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVD 399 (404) Q Consensus 320 ~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~id 399 (404) + -++++|..|++++-.+ +. . +|...|-... ...|.+.. ||.-.--+.+=+-++.+. T Consensus 212 ------~-------t~~~~~~~a~~~~~~~--~~--~-ve~~r~~~~~----~~~i~~~~--~~~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:98 212 ------G-------TAYMVRKGALRIMLKR--NT--M-VETDRDITKA----INQIVANK--HYGVYLYKAEKAVKITLK 267 (272) T ss_pred ------c-------eEEEEcCCeEEEEecC--Cc--e-eeeccccccc----eeEEEEEE--EEEEEEEcCCceEEEEec Confidence 0 1477777776665332 21 1 3333332211 12222211 110000011123333344 Q ss_pred eeecC Q lcl|Aclame:pro 400 TAVKL 404 (404) Q Consensus 400 ta~~~ 404 (404) .|.|- T Consensus 268 ~a~~~ 272 (272) T protein:vir:98 268 DAAKK 272 (272) T ss_pred ccccC Confidence 44444 No 37 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=98.67 E-value=3.6e-09 Score=66.86 Aligned_cols=271 Identities=13% Similarity=0.083 Sum_probs=150.6 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |...+... +.++ .=+.|+..+.....++.-+. ....+..+|+..+|++|+++....+. T Consensus 1 MA~~~T~~------~~~~--------iPev~s~~v~~~~~~~~~~~--------~~~~~~~~~~g~~G~tv~iP~~~~~~ 58 (272) T protein:vir:30 1 MAVGTTKM------AQML--------DPEVLADMIDAEVGKAIRFA--------PLAEVDTTLEGQPGTTLTVPKWDYIG 58 (272) T ss_pred CCCccccc------hhee--------chHHHHHHHHHHHHHHhhhh--------ccccccccccCCCCCEEEEEEecCCC Confidence 43221111 1111 11345444333222222111 11222345777899999998876553 Q ss_pred cC-ceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) Q Consensus 81 G~-gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~ 159 (404) .. .+..++.+. .+.+.+.+.++.|.+..+.+..... ...++..|+.....+.|.+.|++..|..+|-.+.|+.. T Consensus 59 ~a~~v~eg~~i~--~~~~~~~~~~~~~~~~~~~~~itd~-~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~-- 133 (272) T protein:vir:30 59 DAEDVAEGEAIP--MTQLGFKKTTMTIKKAGKGVEITDE-AILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQ-- 133 (272) T ss_pred CcccccCCCccc--ccccccceEEEEeeeeeeeeeecHH-HHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-- Confidence 32 343344444 6789999999999998888877654 45668889999999999999999999999866644321 Q ss_pred ccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccC Q lcl|Aclame:pro 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~ 239 (404) .+ +...+++.|..|...+...+. T Consensus 134 ---------------------~~----------------------~~~~t~d~i~da~~~l~~~~~-------------- 156 (272) T protein:vir:30 134 ---------------------TV----------------------EATATVDGVSKALDIFNDEDD-------------- 156 (272) T ss_pred ---------------------cc----------------------ccccCHHHHHHHHHHHhccCC-------------- Confidence 01 111235556666665543321 Q ss_pred CccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcc Q lcl|Aclame:pro 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) Q Consensus 240 ~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~ 319 (404) +..+++|||..+..|+++... +|. ++ .....+.+.+|.+|.|.|+.+..-+.+|- T Consensus 157 --~~~~~vv~p~~~~~L~k~~~~-~~~---~~----~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~--------------- 211 (272) T protein:vir:30 157 --AETVIVMNPADASTLRLDAAK-EWL---GA----TEVGANRVVSGVYGEVLGVQIVRSRKCPK--------------- 211 (272) T ss_pred --CccEEEEcHHHHHHHHHhccc-ccc---cc----ccccccccccccchhhcCeeEEEcCCCCc--------------- Confidence 125799999999999988532 122 11 12334678899999999999988776541 Q ss_pred cccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEe Q lcl|Aclame:pro 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVD 399 (404) Q Consensus 320 ~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~id 399 (404) + -++++|..|++++-.+ +. . +|...|-... ...|.+.. ||.-.--+.+=+-++.+. T Consensus 212 ------~-------t~~~~~~~a~~~~~~~--~~--~-ve~~r~~~~~----~~~i~~~~--~~~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:30 212 ------G-------TAYMVRKGALRIMLKR--NT--M-VETDRDITKA----INQIVANK--HYGVYLYKAEKAVKITLK 267 (272) T ss_pred ------c-------eEEEEcCCeEEEEecC--Cc--e-eeeccccccc----eeEEEEEE--EEEEEEEcCCceEEEEec Confidence 0 1477777776665332 21 1 3333332211 12222211 110000011123333344 Q ss_pred eeecC Q lcl|Aclame:pro 400 TAVKL 404 (404) Q Consensus 400 ta~~~ 404 (404) .|.|- T Consensus 268 ~a~~~ 272 (272) T protein:vir:30 268 DAAKK 272 (272) T ss_pred ccccC Confidence 44444 No 38 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=98.57 E-value=1.6e-08 Score=63.31 Aligned_cols=268 Identities=13% Similarity=0.031 Sum_probs=152.1 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |.+.+.-..+. . =+.|+..+.....++..| . ......++|+-.+|++|+|+....++ T Consensus 3 ~~~~T~l~d~i-------------~--PEv~~~~v~~~~~~~~~~-~-------~~~~~~~~l~g~~G~tv~iP~~~~ig 59 (275) T protein:vir:96 3 LENMTKLANMV-------------N--PEVLAPMMQAELDKKLKF-A-------QFADIDNTLVGQPGNTITFPAFVYSG 59 (275) T ss_pred Ccccchhhhhh-------------c--hHHHHHHHHHHHHHhhhh-c-------ccceecccccCCCCCEEEeeeeccCC Confidence 33222111111 1 244555443333322222 1 12233567888899999999987663 Q ss_pred cC-ceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 81 KR-PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDF 159 (404) Q Consensus 81 G~-gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~ 159 (404) .. -+..++.+. .+.|+..++++.|.+..+++.... .+...+.-|+..++.+.++..|++..|..++..|.++.. T Consensus 60 ~a~~~~~g~~i~--~~~lt~~~~~~~i~~~~~~~~i~D-~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~-- 134 (275) T protein:vir:96 60 DAKVVPEGEEIP--IDLIETKKRQATIRKIGKGTVLTD-EALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATL-- 134 (275) T ss_pred ccccccCCCCcc--hhhcccceeeEEeehhcccccccH-HHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-- Confidence 22 233334443 788999999999999989988765 355555679999999999999999999999866644221 Q ss_pred ccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccC Q lcl|Aclame:pro 160 VADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239 (404) Q Consensus 160 ~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~ 239 (404) . .+++.++.+.|..|..++.... T Consensus 135 ---------------------~---------------------~~~~~~~~d~i~dA~~~lgd~~--------------- 157 (275) T protein:vir:96 135 ---------------------K---------------------VEADITKLAGLQTAIDKFNDED--------------- 157 (275) T ss_pred ---------------------c---------------------ccccccCHHHHHHHHHHhcccc--------------- Confidence 0 0134467777777776664211 Q ss_pred CccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcc Q lcl|Aclame:pro 240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNL 319 (404) Q Consensus 240 ~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~ 319 (404) ...++++|||.++..|++++.. +|.. + ..+-.+.+..|.+|.|.|+.|..-.++|. + T Consensus 158 -~~~~~ivv~p~~~~~L~k~~~~-~f~~----~---~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~----~---------- 214 (275) T protein:vir:96 158 -LEPMVLFVNPLDAGKLRASATD-NFTR----A---TLLGDNVIVKGAFGEALGAIIVRSNKIKE----G---------- 214 (275) T ss_pred -CCccEEEeCHHHHHHHHhcccc-cccc----c---ccccccceeccccceecCeeEEEeCCCCc----c---------- Confidence 1237899999999999999742 2432 1 12234678899999999999987665541 1 Q ss_pred cccccccccccchhhheeeccceeEEEeeecCC---CCcceeecccccCchh-HHHHHHHhchhhccccCCCCCceEEEE Q lcl|Aclame:pro 320 TATTKEVAAATNIDRAMLLGAQALANAYGQKAG---GHFNMVEKKTDMDNRT-EIAISWINGLKKIRFPEKSGKMQDHGV 395 (404) Q Consensus 320 ~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g---~r~~w~Ee~~D~g~~~-~i~i~~i~G~~K~rF~~~~g~~~DfGv 395 (404) .++|+|-.|++..-..... .|--..-.+.=+++++ ++.+-.=-++.|++|... =.|| T Consensus 215 --------------t~~i~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~-----~~~~ 275 (275) T protein:vir:96 215 --------------EAILAKRGAVKLITKRDFFLETERHASHKSTALFSDKHYVAYLYDESKVVKITKSAS-----GLGV 275 (275) T ss_pred --------------eEEEEeccceeeeecCCcccccccchhhcCcEEEEeEEEEEEEEcCccEEEEEeccc-----ccCC Confidence 2345555555554332100 1111000000012221 122222234455555332 1122 No 39 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.56 E-value=2.7e-08 Score=62.08 Aligned_cols=332 Identities=13% Similarity=0.079 Sum_probs=164.5 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||-.... -.-.|++ .-...+-++|.|.+.+...-+..+.+. +-+++.+++ .|.++.|+-+...+ T Consensus 1 Ms~~n~~-t~~~~~~----s~~~~al~le~f~geV~taF~~~si~~---------~~~~vrti~--~GkS~qf~~iG~~~ 64 (402) T protein:vir:97 1 MSTPNTL-TNVAVSA----SGEVDSLLIEKFNGKVNEQYLKGENIL---------SYFDVQTVT--GTNTVSNKYLGETE 64 (402) T ss_pred CCCcccc-ccccccc----ccchhhhhhhhhhhhHHHHHHHHHhhc---------Ccceeeeec--ccceEEEEEEeeeE Confidence 5533111 1111111 001111246777777665555544432 112233443 78899999997777 Q ss_pred cCceecCceeeeehhhhhhcccEEEEee---eccccccCcchhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHH--hhh Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQ---GRHLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAIVH--LAG 154 (404) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq---~R~~V~~~gkms~qrs~~d-lr~~ar~~L~~w~~~~~D~~~~~~--laG 154 (404) -...+-.+.+.| +.+......|.||. .||.|+ .+++-...+| +|++--..+..=+++..||.+|-. +++ T Consensus 65 a~y~~~G~~ldg--~~~~~~k~~ItID~lL~a~~~V~---diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa 139 (402) T protein:vir:97 65 LQVLAPGQSPNA--TPTQADKNQLVIDTTVIARNTVA---HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGG 139 (402) T ss_pred EeeeccccccCC--CCcccccEEEEeCceeechhhhh---hHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 666665566765 46776777799998 456664 3677788899 899999999999999999988633 344 Q ss_pred cccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccc-cCHH----HHHHHHHHHHhcCCCCCc Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADI-FSIG----LVDNLSLFIDEMAHPLQP 229 (404) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~-~s~~----~Id~a~~~a~~~~~pi~P 229 (404) .+. ..... ..+.. .+.+....-..+.++. -+.. .|-.+...+++..-| T Consensus 140 ~a~-t~~~~------~~~~~-----------------~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP--- 192 (402) T protein:vir:97 140 IAN-TKAER------NKPRV-----------------KGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVD--- 192 (402) T ss_pred ccc-ccccc------ccCcc-----------------cccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCC--- Confidence 432 11110 00011 0111000000111111 1222 222333444443333 Q ss_pred eEecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccc Q lcl|Aclame:pro 230 VRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQG 309 (404) Q Consensus 230 v~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~ 309 (404) .++ ++++|+|.||.-|..++.+ .+-. ... .....+=.|.+++++||.|.+-++.|-.-... T Consensus 193 ---------~~d--Rv~vv~P~~y~~Ll~~~rl---~n~d---~~~--~~~g~~~~G~v~~v~Gv~Vv~SnnlP~~a~~i 253 (402) T protein:vir:97 193 ---------ISD--VAIMMPWKFFNALRDADRI---VDKT---YTI--SQSGATINGFVLSSYNCPVIPSNRFPTFAQDQ 253 (402) T ss_pred ---------ccc--cEEEeChHHHHHHhhcccc---cchh---hcc--ccCCccccceeEEEeceEEEecCccccccccc Confidence 111 6999999999999999742 2211 000 11244568999999999999999887321112 Q ss_pred eeEEeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccc------ Q lcl|Aclame:pro 310 SKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRF------ 383 (404) Q Consensus 310 ~~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF------ 383 (404) +....++.+.+..-+ +.+...-.++++.=..|++.+=...--++++|.++.+ -.-|-....+|..=.|- T Consensus 254 t~~~ls~a~~G~~y~-~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~----~~~id~~~a~G~g~~RPeaa~vv 328 (402) T protein:vir:97 254 AHHLLSNEDNGYRYD-PIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEK----TYYIDTFMAEGAIPDRWEAVSVV 328 (402) T ss_pred cccccccCCCCccCC-cCcccceeEEEEEecceEEEEEeeccccchhhchhHH----HHHHHHHHHhCCcccCccceEEE Confidence 222222222221111 1233344456666666666653332223343322211 11255566677766553 Q ss_pred --cCC--CC----CceEEEEEEEeeeecC Q lcl|Aclame:pro 384 --PEK--SG----KMQDHGVIAVDTAVKL 404 (404) Q Consensus 384 --~~~--~g----~~~DfGvi~idta~~~ 404 (404) ... .+ -.+|+..+.--.--|. T Consensus 329 ~~~~~~t~~~~~~~~~~~~~~~~~~~~~~ 357 (402) T protein:vir:97 329 TTKRDATTGDAGGPGDDHATVLARAQRKA 357 (402) T ss_pred EEecccccccCCccccchhhhhcccccce Confidence 110 00 1233332211110000 No 40 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=98.39 E-value=1.9e-07 Score=57.40 Aligned_cols=270 Identities=13% Similarity=0.038 Sum_probs=137.1 Q ss_pred hccchH-HHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccccCceecCceeeeehhhhhhc Q lcl|Aclame:pro 22 NRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHA 100 (404) Q Consensus 22 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGnee~L~~~ 100 (404) +.++.+ -++|++.+...-.+.+-+... +.+-.+.+-..||+|+|+....++-..-.+... ....+++... T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l--------~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~-~~~~~~~~~~ 71 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANL--------VNREYEGTASKGNVVHIAGVVAPTVKDYKAAGR-QTSADAISDT 71 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchh--------hccccccccccCceEEEeecccccccccccCCC-ccCccccccc Confidence 555554 367777665544444333221 112123333569999999876654222111111 2346788899 Q ss_pred ccEEEEeeec-cccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccceeecccccccccccc Q lcl|Aclame:pro 101 DFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMI 179 (404) Q Consensus 101 sd~v~Idq~R-~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~~~~~~~~~ 179 (404) +.++.||+.+ .++.+. .+++.-...|+++..|. +..=+++..|+.++-.++++-.. T Consensus 72 ~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~alA~~vD~~i~~~~~~a~~~--------------------- 128 (273) T protein:vir:10 72 GVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGTA--------------------- 128 (273) T ss_pred eEEEEEeeeeecceEee-cHHHhhhhccHHHHHHH-HHHHHHHHHHHHHHHHHhccccc--------------------- Confidence 9999999975 445554 34666677888876554 45668888999988777553210 Q ss_pred CccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEchHHHHHHHhC Q lcl|Aclame:pro 180 NDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTS 259 (404) Q Consensus 180 N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~dLr~d 259 (404) |.. ...++... .++.|..|.+.+++..-| .+ . ++++++|.++..|+++ T Consensus 129 ~~~----------------~~~~~~~~--~~~~i~~a~~~ld~~~vP-------~~------~-R~lvv~p~~~~~L~~~ 176 (273) T protein:vir:10 129 LTG----------------SAPTDADD--AFDLIAKALKELTKANVP-------NV------G-RVVVVNAEMAFWLRSS 176 (273) T ss_pred ccc----------------ccccchhH--HHHHHHHHHHHhhhcCCC-------cC------C-CEEEECHHHHHHHhcc Confidence 100 11122222 256677787777776543 11 1 5789999999999998 Q ss_pred cchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcccccccccccccchhhheeec Q lcl|Aclame:pro 260 TSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLG 339 (404) Q Consensus 260 ~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~~~~aa~~~v~ralLlG 339 (404) +.+ +.+... .|..+.|=.|.+|.+.|+-|.+..++|.. ++. . ++..= T Consensus 177 ~~~--~~~~~~------~~~~~~l~~G~ig~i~G~~v~~s~~lp~~-----------~~~-------~-------~~~~~ 223 (273) T protein:vir:10 177 GSK--LTSADT------SGDAAGLRAGTIGNLLGARIVESNNLRDT-----------DDE-------Q-------FVAFH 223 (273) T ss_pred hhh--hhhhhc------cccccceeeeeeeEEeceEEEEecccccC-----------Ccc-------E-------EEEEe Confidence 642 322211 13456677899999999999998776521 000 0 01111 Q ss_pred cceeEEEeeec-CCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEeeeec Q lcl|Aclame:pro 340 AQALANAYGQK-AGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVK 403 (404) Q Consensus 340 aQAl~~A~g~~-~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idta~~ 403 (404) ..|++ +.+. ..+-..+.++.+ +.. |-...++|.+=+|= + |+++|=...- T Consensus 224 ~~A~~--~a~q~~~~e~~r~~~~~--~~~--v~~~~~yg~~v~~~--------~-~~~~l~~~g~ 273 (273) T protein:vir:10 224 PSAAA--YVSQIDTVEALRDQDSF--SDR--IRALHVYGGKVVRP--------T-GVVVFNKTGS 273 (273) T ss_pred cccee--eeeeeehhhcccCCCcc--eee--eeeeeeeeeeEecc--------c-eEEEEeccCC Confidence 12222 2221 000011111110 100 11112222221110 0 2222211111 No 41 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=98.39 E-value=1.9e-07 Score=57.40 Aligned_cols=270 Identities=13% Similarity=0.038 Sum_probs=137.1 Q ss_pred hccchH-HHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccccCceecCceeeeehhhhhhc Q lcl|Aclame:pro 22 NRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHA 100 (404) Q Consensus 22 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGnee~L~~~ 100 (404) +.++.+ -++|++.+...-.+.+-+... +.+-.+.+-..||+|+|+....++-..-.+... ....+++... T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l--------~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~-~~~~~~~~~~ 71 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANL--------VNREYEGTASKGNVVHIAGVVAPTVKDYKAAGR-QTSADAISDT 71 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchh--------hccccccccccCceEEEeecccccccccccCCC-ccCccccccc Confidence 555554 367777665544444333221 112123333569999999876654222111111 2346788899 Q ss_pred ccEEEEeeec-cccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccceeecccccccccccc Q lcl|Aclame:pro 101 DFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMI 179 (404) Q Consensus 101 sd~v~Idq~R-~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~~~~~~~~~ 179 (404) +.++.||+.+ .++.+. .+++.-...|+++..|. +..=+++..|+.++-.++++-.. T Consensus 72 ~~~~tid~~~~~~~~i~-d~d~~~~~~~~~~~~~~-~~~alA~~vD~~i~~~~~~a~~~--------------------- 128 (273) T protein:vir:10 72 GVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRA-GATALATDTDKFIADMLVDNGTA--------------------- 128 (273) T ss_pred eEEEEEeeeeecceEee-cHHHhhhhccHHHHHHH-HHHHHHHHHHHHHHHHHhccccc--------------------- Confidence 9999999975 445554 34666677888876554 45668888999988777553210 Q ss_pred CccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEchHHHHHHHhC Q lcl|Aclame:pro 180 NDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTS 259 (404) Q Consensus 180 N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~dLr~d 259 (404) |.. ...++... .++.|..|.+.+++..-| .+ . ++++++|.++..|+++ T Consensus 129 ~~~----------------~~~~~~~~--~~~~i~~a~~~ld~~~vP-------~~------~-R~lvv~p~~~~~L~~~ 176 (273) T protein:vir:10 129 LTG----------------SAPTDADD--AFDLIAKALKELTKANVP-------NV------G-RVVVVNAEMAFWLRSS 176 (273) T ss_pred ccc----------------ccccchhH--HHHHHHHHHHHhhhcCCC-------cC------C-CEEEECHHHHHHHhcc Confidence 100 11122222 256677787777776543 11 1 5789999999999998 Q ss_pred cchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcccccccccccccchhhheeec Q lcl|Aclame:pro 260 TSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLG 339 (404) Q Consensus 260 ~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~~~~aa~~~v~ralLlG 339 (404) +.+ +.+... .|..+.|=.|.+|.+.|+-|.+..++|.. ++. . ++..= T Consensus 177 ~~~--~~~~~~------~~~~~~l~~G~ig~i~G~~v~~s~~lp~~-----------~~~-------~-------~~~~~ 223 (273) T protein:vir:10 177 GSK--LTSADT------SGDAAGLRAGTIGNLLGARIVESNNLRDT-----------DDE-------Q-------FVAFH 223 (273) T ss_pred hhh--hhhhhc------cccccceeeeeeeEEeceEEEEecccccC-----------Ccc-------E-------EEEEe Confidence 642 322211 13456677899999999999998776521 000 0 01111 Q ss_pred cceeEEEeeec-CCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEeeeec Q lcl|Aclame:pro 340 AQALANAYGQK-AGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVK 403 (404) Q Consensus 340 aQAl~~A~g~~-~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idta~~ 403 (404) ..|++ +.+. ..+-..+.++.+ +.. |-...++|.+=+|= + |+++|=...- T Consensus 224 ~~A~~--~a~q~~~~e~~r~~~~~--~~~--v~~~~~yg~~v~~~--------~-~~~~l~~~g~ 273 (273) T protein:vir:10 224 PSAAA--YVSQIDTVEALRDQDSF--SDR--IRALHVYGGKVVRP--------T-GVVVFNKTGS 273 (273) T ss_pred cccee--eeeeeehhhcccCCCcc--eee--eeeeeeeeeeEecc--------c-eEEEEeccCC Confidence 12222 2221 000011111110 100 11112222221110 0 2222211111 No 42 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.35 E-value=6.9e-08 Score=59.84 Aligned_cols=331 Identities=11% Similarity=0.077 Sum_probs=153.6 Q ss_pred CCCc-----CchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEE Q lcl|Aclame:pro 1 MTTV-----TSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (404) Q Consensus 1 ~~~~-----~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L 75 (404) |+-. |-++.-.....+ ......+-++|.|.+.+...-.+.+-+. ..+++.+++ .|.+|.|+- T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~--~~~~~~al~le~f~geV~~~f~~~si~~---------~~~~~rti~--~Gksv~f~~ 67 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYG--GATDKYALYLKLFSGEMFKGFQHETIAR---------DLVTKRTLK--NGKSLQFIY 67 (375) T ss_pred CccccccccCccccCCccccc--cccchHHHHHHHHhHHHHHHHHHHHhhh---------ccccccccc--cCceEEEEe Confidence 2211 111100000000 0011112247788887766666555442 233334443 599999999 Q ss_pred eeccccCceecCceeeee-hhhhhhcccEEEEeee---ccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 76 MHKLSKRPTMGDERVEGR-GEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVH 151 (404) Q Consensus 76 ~~~L~G~gv~Gd~~leGn-ee~L~~~sd~v~Idq~---R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~ 151 (404) +...+-...+..+.+.|+ .++..-.+.+|.||+. ++.|+ .+++-...+|||++.-.....=|++..|+.++.+ T Consensus 68 iG~~t~~~~t~G~~i~~~~~~d~~~te~~l~ID~~~y~~~~Vd---DiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~ 144 (375) T protein:vir:10 68 TGRMTSSFHTPGTPILGNADKAPPVAEKTIVMDDLLISSAFVY---DLDETLAHYELRGEISKKIGYALAEKYDRLIFRS 144 (375) T ss_pred eeeeEEeeecCCcCcCCccccCCCCCceEEEecchhhhhhhHh---hHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 988887788888888887 4567778889999998 45665 5788889999999999999999999999999988 Q ss_pred hh-hcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCce Q lcl|Aclame:pro 152 LA-GARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPV 230 (404) Q Consensus 152 la-G~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv 230 (404) +. +++...- ...+-...+... -+-.+.+.++...+++. --++.|..+.+.+++..-| T Consensus 145 l~kaa~~~~p---~~~~~~~~~Gg~-------------~i~~~sg~~~~~~~ta~--~~~~ai~~a~~~Lde~~VP---- 202 (375) T protein:vir:10 145 ITRGARSASP---VSATNFVEPGGT-------------QIRVGSGTNESDAFTAS--ALVNAFYDAAAAMDEKGVS---- 202 (375) T ss_pred HHHhhhhccc---cccccccccCcc-------------eeeeccccccccccCHH--HHHHHHHHHHHHHhhcCCC---- Confidence 75 4442100 000000000100 01111111111111111 1134455555666554433 Q ss_pred EecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccce Q lcl|Aclame:pro 231 RLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGS 310 (404) Q Consensus 231 ~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~ 310 (404) .+ . ++++++|.|+.-|..+-+..... ++.-+.+.-.=.|.++.++|+.|.+-.+.|. ..+. T Consensus 203 ---~~------~-R~~vv~P~~y~~Ll~~~d~~~~~-------n~d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~--~~~~ 263 (375) T protein:vir:10 203 ---SQ------G-RCAVLNPRQYYALIQDIGSNGLV-------NRDVQGSALQSGNGVIEIAGIHIYKSMNIPF--LGKY 263 (375) T ss_pred ---CC------C-CEEEeChHHHHHHHhcCCcccee-------eecccccceeccceEEEEeceEEEEeccccc--cccc Confidence 11 1 56889999999998752110111 1111112223357789999999999777662 1111 Q ss_pred eEEeecCcccccccccccccchhhheeeccc--------------------eeEEEeee-------cCCCCcceeecccc Q lcl|Aclame:pro 311 KVLVSENNLTATTKEVAAATNIDRAMLLGAQ--------------------ALANAYGQ-------KAGGHFNMVEKKTD 363 (404) Q Consensus 311 ~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQ--------------------Al~~A~g~-------~~g~r~~w~Ee~~D 363 (404) .+.. ++..++.+....-++.+..+.. .+++.|-+ .-+++..-.+ -| T Consensus 264 ~~~~-----g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~A~g~v~~~~~~~~~~~--~~ 336 (375) T protein:vir:10 264 GVKY-----GGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKEAAGVVEAIGPQVQVTN--GD 336 (375) T ss_pred cccc-----cccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEchhheeeeeeecccccccc--ch Confidence 1000 0011111111111122222222 22222210 0000000000 01 Q ss_pred cCch---hHHHHHHHhchhhccccCCCCCceEEEEEEEeee--ecC Q lcl|Aclame:pro 364 MDNR---TEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTA--VKL 404 (404) Q Consensus 364 ~g~~---~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idta--~~~ 404 (404) |..+ ..|-..+.+|..=.|- .+ ++.|.+. +++ T Consensus 337 ~~~~~q~~~i~~~~a~G~~~lrp--------~~-av~l~~~~~~~~ 373 (375) T protein:vir:10 337 VSVIYQGDVILGRMAMGADYLNP--------AA-AVELYIGATAPS 373 (375) T ss_pred hhheeeeeeeeeeeeeccCccCc--------ee-EEEEecCcCccc Confidence 1100 0111122222222221 22 2233332 222 No 43 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.35 E-value=9.2e-08 Score=59.16 Aligned_cols=336 Identities=10% Similarity=0.043 Sum_probs=165.3 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||-.... -.-.|++ .-...+-++|.|.+.+...-+..+.+. +-+++.++ ..|.++.|+-+...+ T Consensus 1 ms~~n~~-t~~~~~~----~~~~~al~le~f~geV~taf~~~s~~~---------~~~~~rti--~~gkS~q~~~iG~~~ 64 (364) T protein:vir:10 1 MSNPNVL-TQPAVSA----SGEVDSLLIEKFNNRVHEQYLKGENLL---------QWFDVQEV--VGTNSVSNKYIGETE 64 (364) T ss_pred CCCcccc-ccccccc----ccchhhhhhhhhhhhHHHHHHHHHhhc---------Ccceeeee--cccceEEeeeeeeeE Confidence 5533111 1111111 001111246777777665555544432 11222344 378899999997777 Q ss_pred cCceecCceeeeehhhhhhcccEEEEee---eccccccCcchhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHHh-hhc Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQ---GRHLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAIVHL-AGA 155 (404) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq---~R~~V~~~gkms~qrs~~d-lr~~ar~~L~~w~~~~~D~~~~~~l-aG~ 155 (404) -...+-.+.+.| +.+.....+|.||+ .||.|+ .+++-...+| +|++--..+..=+++..|+.++..+ +++ T Consensus 65 ~~~~~~G~~ld~--~~~~~~k~~itID~ll~a~~~V~---diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa 139 (364) T protein:vir:10 65 LQVLSPGKSPDA--SPTEFDKNRLVVDTTVIARNTVA---HFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGG 139 (364) T ss_pred EeeeccCcccCC--CCcccCcEEEEecceeeechhhh---hHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 666665556654 57777778999998 566664 3677778899 8999988899999999999887554 222 Q ss_pred ccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCc Q lcl|Aclame:pro 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (404) Q Consensus 156 rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~ 235 (404) .. | ..+.+. +++..|-...+ -.++.++. ..+..+.+ ++.|..+...+++..-| T Consensus 140 ~a---~--------~~~~~~----~~~~~~~g~~i-~~~~~a~~-~~~~~~~l-~~ai~~a~~~LdEkdVP--------- 192 (364) T protein:vir:10 140 IS---N--------TEAIRK----NPRVAGHGFSI-HIVGLASS-FLTSPQYM-MAAIEMAMEQQTEQEVD--------- 192 (364) T ss_pred hh---c--------cccccc----CCcccCCccee-eecccCcc-hhhhHHHH-HHHHHHHHHHHhhcCCC--------- Confidence 11 1 011110 11111100000 00000000 11111111 12222344444444333 Q ss_pred cccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEe- Q lcl|Aclame:pro 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLV- 314 (404) Q Consensus 236 ~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~- 314 (404) .+ -++++|.|.||..|..++.+ .+.. ....+ .+..-+|.+++++||.|.+-+++|.- .+..... T Consensus 193 ---~~--~R~~vv~P~~y~~Ll~~~~l---vn~d----~~~~~-~~~~~~G~v~~v~Gv~Vv~Sn~lP~~--~~~~~~t~ 257 (364) T protein:vir:10 193 ---TS--ELCGLMPWTAFNCLRDADRI---VDKS----YTIAA-SDNTVDGFVLKSWNTPIVPSNRFPKL--SDNTEGTG 257 (364) T ss_pred ---cc--ccEEEeChHHHHHHhcCCcc---cccc----ccccC-CCccccceeEEEeceEEEeccccccc--cccccccc Confidence 11 17999999999999998742 2110 01111 34466899999999999999988742 1111000 Q ss_pred --ecCcc-ccc-ccccc--cccchhhheeeccceeEEEeeecCCCCcceee-cccccCchhHHHHHHHhchhhccc---- Q lcl|Aclame:pro 315 --SENNL-TAT-TKEVA--AATNIDRAMLLGAQALANAYGQKAGGHFNMVE-KKTDMDNRTEIAISWINGLKKIRF---- 383 (404) Q Consensus 315 --~~~~~-~a~-~~~~a--a~~~v~ralLlGaQAl~~A~g~~~g~r~~w~E-e~~D~g~~~~i~i~~i~G~~K~rF---- 383 (404) +.... .++ ++.+. +..+-.+++++=.-|++.+=...--++..|.+ +..|+.+ ....+|..=.|- T Consensus 258 ~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~~~~~id-----a~~a~G~g~lRPeaa~ 332 (364) T protein:vir:10 258 NTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKEKTWYID-----TFLAEGAIPDRWEAVA 332 (364) T ss_pred cccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccceeeeeee-----eehcccCcccCccceE Confidence 00000 000 11111 22233456666555555443322122332322 2223322 245566666653 Q ss_pred ----cCCCCCceEEEEEEEeeeecC Q lcl|Aclame:pro 384 ----PEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 384 ----~~~~g~~~DfGvi~idta~~~ 404 (404) ....+...||..|.--.--|. T Consensus 333 ~i~~~~~~~~~~~~~~~~~~~~~~~ 357 (364) T protein:vir:10 333 VVTAADTAELATDHNAILARANRKV 357 (364) T ss_pred EEEecCCCCCccchhhhhhhccccE Confidence 222344566665432211111 No 44 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.33 E-value=1.3e-07 Score=58.27 Aligned_cols=296 Identities=13% Similarity=0.124 Sum_probs=161.9 Q ss_pred HHHhhccchH-HHHHHhhhhhhhhhccccccccCCCCCccEEEEecccC--CCCcEEEEEEeeccccCc--eecCceeee Q lcl|Aclame:pro 18 FTAANRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNK--QAGDEVTFSIMHKLSKRP--TMGDERVEG 92 (404) Q Consensus 18 ft~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k--~~Gd~v~f~L~~~L~G~g--v~Gd~~leG 92 (404) -+.+-..+-. -+.|...+.....+ ...+.+++--.+...+.++-. .+|+.|+++....|.|++ +.+++.+. T Consensus 1 MA~T~lsd~i~peVf~~yv~~~~~~---~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~- 76 (324) T protein:vir:59 1 MAYTKISDVIVPELFNPYVINTTTQ---LSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLV- 76 (324) T ss_pred CCceeeeceechhHHHHHHHhhhHH---HHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccc- Confidence 1111111111 13333333222222 233455554444444555432 579999999999998774 44455554 Q ss_pred ehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccceeeccccc Q lcl|Aclame:pro 93 RGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHP 172 (404) Q Consensus 93 nee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~~ 172 (404) -+.|...++.-+|-....++.... .++..+.-|...++...|++||++..+..+|-.|.|+.+...... T Consensus 77 -~~~l~t~~~~a~i~~~~k~~~~tD-~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~--------- 145 (324) T protein:vir:59 77 -PQKINAGQDKAVLILRGNAWSSHD-LAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKD--------- 145 (324) T ss_pred -hhhcccceeeEEEEeecCceeehh-hhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc--------- Confidence 688999999999999889887664 467778889999999999999999999999988888765211000 Q ss_pred cccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEchHH Q lcl|Aclame:pro 173 EFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQ 252 (404) Q Consensus 173 ~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q 252 (404) ..-+++ -+++-+||.+.+.+|..+. ||+ .+...+++|||.. T Consensus 146 -----~~~dvs------------------a~~~~~~s~~~l~~A~~~~-------------GD~---~~~~~~ivmhS~v 186 (324) T protein:vir:59 146 -----NKLDIS------------------GTADGIYSAETFVDASYKL-------------GDH---ESLLTAIGMHSAT 186 (324) T ss_pred -----ceeeee------------------ccccceecHHHHHHHHHHh-------------CCc---ccCcEEEEEchHH Confidence 000111 1123357888777776664 222 2346899999999 Q ss_pred HHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcccccccccccccch Q lcl|Aclame:pro 253 WNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNI 332 (404) Q Consensus 253 ~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~~~~aa~~~v 332 (404) +++|+++- ..+|. ++.. + .+.+|.|+|+.|..--.+|.- .++....+ T Consensus 187 ~~~L~~~~-li~~~---~~s~----~------~~~i~~~~G~~VivdD~~p~~-------------------~~~~~~~~ 233 (324) T protein:vir:59 187 MASAVKQD-LIEFV---KDSQ----S------GIRFPTYMNKRVIVDDSMPVE-------------------TLEDGTKV 233 (324) T ss_pred HHHHHHhh-hhhhc---cccc----c------CceeeeecccEEEEeCCCCcc-------------------ccCCCCce Confidence 99999983 22332 2221 1 135788999888765444421 11122234 Q ss_pred hhheeeccceeEEEeeecCCCCcceeecccccCchhHHHH-H--HHhchhhccccCCC--C-Cc--eE------------ Q lcl|Aclame:pro 333 DRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAI-S--WINGLKKIRFPEKS--G-KM--QD------------ 392 (404) Q Consensus 333 ~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i-~--~i~G~~K~rF~~~~--g-~~--~D------------ 392 (404) -.++++|..|+...-++. +.=+|...|-..+....+ + .+++++=..|..+. + +. .| T Consensus 234 y~s~l~~~GAi~~~~~~~----~v~vE~dRd~~~g~~~l~~r~~~~~~p~G~s~~~~~~~~~sPt~~~L~~~~NW~~v~~ 309 (324) T protein:vir:59 234 FTSYLFGAGALGYAEGQP----EVPTETARNALGSQDILINRKHFVLHPRGVKFTENAMAGTTPTDEELANGANWQRVYD 309 (324) T ss_pred EEEEEEecCeEEEeecCC----CcceecccCccccceEEEEeeEEEeEeeeEEecccccCCCCCChhhhcCCcccccccC Confidence 467999988877665442 222455444322221111 0 12222223342110 0 00 00 Q ss_pred ---EEEEEEeeeecC Q lcl|Aclame:pro 393 ---HGVIAVDTAVKL 404 (404) Q Consensus 393 ---fGvi~idta~~~ 404 (404) -..+.+=|-..- T Consensus 310 ~k~i~i~~~~~~~~~ 324 (324) T protein:vir:59 310 PKKIRIVQFKHRLQA 324 (324) T ss_pred ccccceEEEEeeccC Confidence 000000000000 No 45 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=98.29 E-value=8.3e-07 Score=53.91 Aligned_cols=283 Identities=8% Similarity=-0.034 Sum_probs=144.0 Q ss_pred hccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccccCceecCceee--e-ehhhhh Q lcl|Aclame:pro 22 NRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVE--G-RGEDLS 98 (404) Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~le--G-nee~L~ 98 (404) +..-...++|+..|.....+.+.+...-+...+.-|. -..|++|.++-+ +..++ +|-... | +.++++ T Consensus 1 MA~~n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~------~~gg~tVkI~~i---~~~gl-~DY~R~~~g~~~g~~~ 70 (299) T protein:vir:79 1 MAALNYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYR------WTGSKTIEIPTI---STTGR-VDSNRDTIAVAQRNYD 70 (299) T ss_pred CccchhHHHHHHHHHHHHHhhceeeeeccCcccceee------ecCCCEEEEecc---ccccc-cccccCCCcccccccC Confidence 2211134677777766666555433222222121111 134899998744 33333 444332 2 344788 Q ss_pred hcccEEEEeeeccccccCcchhhhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHhh-hcccccccccceeecccccccc Q lcl|Aclame:pro 99 HADFSLKINQGRHLVDAGGRMSQQRTKFNL--ASSARTLLGTYFNDLQDQCAIVHLA-GARGDFVADDTILPTAEHPEFK 175 (404) Q Consensus 99 ~~sd~v~Idq~R~~V~~~gkms~qrs~~dl--r~~ar~~L~~w~~~~~D~~~~~~la-G~rg~~~n~~~~~p~~~~~~~~ 175 (404) ....++.+||.|----.=..|+...+...+ -...+....+...-.+|.-.|-.|+ ++.+. T Consensus 71 ~~~~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~----------------- 133 (299) T protein:vir:79 71 NAWEPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTAL----------------- 133 (299) T ss_pred cceeEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhc----------------- Confidence 889999999999321111234433332222 2223333334444455555554442 22110 Q ss_pred ccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEchHHHHH Q lcl|Aclame:pro 176 KIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWND 255 (404) Q Consensus 176 ~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~d 255 (404) +..++...++++.+ ++.|+.+.+++++.+-| .+. +||+|+|..+.- T Consensus 134 ------------------g~~~~~~~~T~~n~--y~~i~~~~~~lde~~vP-------~~~-------rvl~vtp~~~~~ 179 (299) T protein:vir:79 134 ------------------GNTADTTVLTTTNV--LEVFDKLMEKMTEARVP-------ENG-------RILYVTPVVNTL 179 (299) T ss_pred ------------------CCcccccccCHHHH--HHHHHHHHHHHHhcCCC-------CCC-------eEEEeCHHHHHH Confidence 00112223555544 68899999999886644 111 799999999999 Q ss_pred HHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcccccccccccccchhhh Q lcl|Aclame:pro 256 WYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRA 335 (404) Q Consensus 256 Lr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~~~~aa~~~v~ra 335 (404) |+.++.| ++.-.. +..+....|.+|.++|+.|.+.|. .||+..-. ..+ +..+.+. +-... T Consensus 180 L~~~~~f------~k~~~~---~~~~~~~~g~Vg~idG~~Ii~Vps--~r~~t~~~---~~~--G~~~~~~----ak~in 239 (299) T protein:vir:79 180 IKNAKEI------QRTVNI---KDAGTSLNRQTTDIDTVKIIKVPS--NLMKTAYD---FTT--GWKVGAG----AKQIF 239 (299) T ss_pred Hhhchhh------hccccc---ccccceeeeeeeeecceEEEEech--hhcCccce---ecc--CccccCc----ccccc Confidence 9999843 232111 234568899999999999999886 47763211 111 1111111 22367 Q ss_pred eeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEeeeecC Q lcl|Aclame:pro 336 MLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 336 lLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idta~~~ 404 (404) +||....+.++.-|+...+.+ +=|....- +++. +=|. =|.+++++...+- T Consensus 240 ~ii~~~~a~~~~~K~~~~~~~------~P~~~~~~--~~~~---~~r~--------y~d~~v~~nk~~~ 289 (299) T protein:vir:79 240 MSLVHPSAIITPVSYQFSKLD------EPTAVTEG--KYFY---FEES--------FEDVFILNKKADA 289 (299) T ss_pred eEEEcCCeeeeeEeeeeEEee------cCCCCCcc--ceee---eeee--------eeeeeeeccccCe Confidence 899988888887764333221 11100000 1111 0111 1233444443333 No 46 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.26 E-value=9e-08 Score=59.22 Aligned_cols=295 Identities=9% Similarity=0.092 Sum_probs=159.9 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchH-HHHHHhhhhhhhhhccccccccCCCCCccEEEE---ecccCCCCcEEEEEEe Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRI---TDLNKQAGDEVTFSIM 76 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~---~dL~k~~Gd~v~f~L~ 76 (404) |.. +...+-. -+.|...+.....++ ..++++ .+|+.. ..+-.++|+.|+|++. T Consensus 1 MA~-----------------T~lsd~i~PEvf~~yv~~~~~~~---~~l~qS---G~i~~~~~l~~~~~~~G~~it~P~~ 57 (351) T protein:vir:15 1 MAE-----------------THLSDLIVPEVFGNYVVNQIIKT---NRFVQS---GILTPDPDLGPHLLEAGTRITVPFL 57 (351) T ss_pred CCc-----------------eeeeeeechhHHHHHHhhhhHHh---hhHhhc---ccccccHHHHHHhhcCCCEEEeccc Confidence 321 1111110 122222221211112 223333 455543 4444479999999999 Q ss_pred eccccCc--eecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 77 HKLSKRP--TMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAG 154 (404) Q Consensus 77 ~~L~G~g--v~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG 154 (404) ..|+|++ +.|+..++ .+.|...++.-+|=....++.... ++...+.-|...++...|++||++..+..+|-.|.| T Consensus 58 ~~l~Gd~~~~~~~~~i~--~~kitt~~~~a~i~~~~kg~~~tD-~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~g 134 (351) T protein:vir:15 58 NDLTGDPDNWTDSDDID--VNNLTSGKQQGIKFYQTKAYGYTD-LGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKG 134 (351) T ss_pred ccCCCcccccCCCcccc--hheecccceeEEEEeeccceehhh-hhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9998874 44555555 688999999999999889988764 567778889999999999999999999999988988 Q ss_pred cccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecC Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (404) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g 234 (404) +.+... +.|. |.+ +.+ ..-.++-+||.+.+-+|..++- T Consensus 135 v~~~~~-----------------~~~~--------~~~--d~t--~~~~~~~~is~~~l~~A~~~~G------------- 172 (351) T protein:vir:15 135 VMGVTK-----------------IANS--------KVY--DQT--KVSPSEPMFGAKGFTGAIGLMG------------- 172 (351) T ss_pred Hhhchh-----------------hccc--------cee--ccc--cccccccccCHHHHHHHHHHhc------------- Confidence 765211 0010 111 011 0112345688888777766652 Q ss_pred ccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEe Q lcl|Aclame:pro 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) Q Consensus 235 ~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~ 314 (404) |+ .++...+++|||..+++|+++- .++..++.. + .+.+|.|+|+.|..-..+|.-. T Consensus 173 D~--~~~~~~~ivmhS~v~~~L~~~~----li~~~~~s~----~------~~~i~t~~G~~VivdD~~p~~~-------- 228 (351) T protein:vir:15 173 DL--QDTAFGAIAVNSATYSLMKVQG----LIETIQPQN----G------ATPFEAYNGLRIVLDDDIEIDL-------- 228 (351) T ss_pred cc--cccceEEEEEChHHHHHHHhhh----hhhhccccc----c------CcccceecceEEEEcCCCcccc-------- Confidence 11 1223689999999999999973 444444321 1 1347899999887665554210 Q ss_pred ecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchh--HHH---HHHHhchhhccccCC--- Q lcl|Aclame:pro 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRT--EIA---ISWINGLKKIRFPEK--- 386 (404) Q Consensus 315 ~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~--~i~---i~~i~G~~K~rF~~~--- 386 (404) .++...+-.++|+|..|+++ ++. .++ +|-..|..... +.. ...+++..=..|... T Consensus 229 -----------~~~~~~~ytsyl~~~GAi~~--~~~--~~~--ve~~rd~~~~~g~d~l~~r~~~~~hp~G~s~~~~~~~ 291 (351) T protein:vir:15 229 -----------TDKTKPVSTSYIFAPGAVRY--STN--MRS--TETKYDPLINGGQDVIVQKRVGTIHVAGTSIKASFSP 291 (351) T ss_pred -----------CCCCCceeEEEEEecceeee--ecC--CcC--cceeecccCCCCceEEEEeeeeeeeeeeeeecccccc Confidence 01111234678888888664 431 111 23233321110 000 012222233333211 Q ss_pred CC----------C-----------ceEEEEEEEeeeecC Q lcl|Aclame:pro 387 SG----------K-----------MQDHGVIAVDTAVKL 404 (404) Q Consensus 387 ~g----------~-----------~~DfGvi~idta~~~ 404 (404) .+ + .+--+.+.+=|-..+ T Consensus 292 ~~~~sPt~~~L~~~~NW~~v~~~d~k~I~iv~~~~~~~~ 330 (351) T protein:vir:15 292 SKASFPTIDELAKSSTWEVVDGIDVRSIGVVAYTAQLDP 330 (351) T ss_pred cCcCCcChHHhcCCcccccccCCCccccceEEEEEecCc Confidence 00 0 011222222222211 No 47 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.15 E-value=1.7e-07 Score=57.69 Aligned_cols=294 Identities=13% Similarity=0.124 Sum_probs=152.8 Q ss_pred CCCcCchHHHHHHHHHHHHHhhcc---chHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecc---cCCCCcEEEEE Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRN---RSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDL---NKQAGDEVTFS 74 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL---~k~~Gd~v~f~ 74 (404) |.-- .+... .| +.|...+.....++ ..+.++ .+|+...+| -.++|+.|+++ T Consensus 1 Ma~~---------------~T~l~d~i~p--evf~~yv~~~~~~~---~~l~qS---G~i~~~~~i~~~~~~~G~~i~~P 57 (330) T protein:vir:10 1 MANE---------------LTKILDTITP--QQYNAYMQQYTAAK---SAFVQS---GIAVSDERVSKNITSGGLLVNMP 57 (330) T ss_pred CCCC---------------ceEeeeeech--hHHHHHHHHHhHHh---hhhhhc---ccccccHHHHHHhhcCCCEEEec Confidence 1100 00000 11 12222222222222 223443 345553333 34799999999 Q ss_pred EeeccccCc-ee--cCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 75 IMHKLSKRP-TM--GDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVH 151 (404) Q Consensus 75 L~~~L~G~g-v~--Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~ 151 (404) +...|+|+. +. |++.++ -+.|...++..+|=....++.... ++..-+--|...++...|++||++..+..++-. T Consensus 58 ~~~~l~G~~~~~~dg~~~i~--~~ki~t~~~~a~i~~~~k~~~~tD-~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~ 134 (330) T protein:vir:10 58 FWNDLTGDSEVLGNGDKALE--TGKITAGADIACVLYRGRGWAANE-LTGVVAGSDPVRAILNRIGAYWLREDQKALIAT 134 (330) T ss_pred ccccCCCcccccCCCccccc--hhhcccceeEEEEEeecceeeehh-hhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHH Confidence 999998874 33 233454 578999999999999999887764 356667889999999999999999999999988 Q ss_pred hhhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceE Q lcl|Aclame:pro 152 LAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVR 231 (404) Q Consensus 152 laG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~ 231 (404) |.|..++....... .+ -.|.... .-+++-.|+.+.+-+|..+.- T Consensus 135 l~gvf~~~~~~~~~-------~~---~~~~~~~----------------~~~~~a~~s~~~l~~A~~~~G---------- 178 (330) T protein:vir:10 135 LNGIFATGTAGEKG-------AL---EETHVSD----------------QSKASTGIDAGMVLDAKQLLG---------- 178 (330) T ss_pred HHhhhhhhhcccch-------hh---hhhheec----------------ccccccccCHHHHHHHHHHhc---------- Confidence 98887642221100 00 0011111 012344577776666644432 Q ss_pred ecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeecccee Q lcl|Aclame:pro 232 LSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSK 311 (404) Q Consensus 232 ~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~ 311 (404) |+ .+...+++|||.++++|+++- ..+..++.. ..+.+|.|+|+.|..--.+|. T Consensus 179 ---D~---~~~~~~ivmhS~v~~~L~~~~----li~~~~~s~----------~~~~i~~~~G~~VivdD~~p~------- 231 (330) T protein:vir:10 179 ---DS---ADQVTAIAMHSAVYTKLQKDN----LIQYIQPTT----------ATINIPTYLGYRVIIDDGIAP------- 231 (330) T ss_pred ---cc---cccceEEEEcHHHHHHHHHhh----hhhhhcccc----------cCcccccccceEEEEeCCCCC------- Confidence 22 123689999999999999963 333333221 125678999998865443321 Q ss_pred EEeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHH---HHhchhhccccCC-- Q lcl|Aclame:pro 312 VLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAIS---WINGLKKIRFPEK-- 386 (404) Q Consensus 312 ~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~---~i~G~~K~rF~~~-- 386 (404) .. .+-.++++|..|+.+.-+... ++-.+|-..|-..+.+..+. .++...=..|..+ T Consensus 232 ----------~~-------~~yt~yl~~~GAi~~~~~~~~--~~v~~EtdRd~~~g~~~l~~r~~~~~hp~G~s~~~~~~ 292 (330) T protein:vir:10 232 ----------TG-------DIYTSYLFRTGSIGLNTGNPS--GLTTFETSREAAKGNDMIYTRRALVMHPYGVKWTGAEV 292 (330) T ss_pred ----------CC-------CceeEEEEecCceeeecccCC--ccccccccCCccccceEEEEeeEEEeeeeeeeeccccc Confidence 01 123568889888766644322 22334433332111111000 1111111222211 Q ss_pred -C-C---------CceEE---------EEEEEeeeecC Q lcl|Aclame:pro 387 -S-G---------KMQDH---------GVIAVDTAVKL 404 (404) Q Consensus 387 -~-g---------~~~Df---------Gvi~idta~~~ 404 (404) . + +..-| ..+.+= .|| T Consensus 293 ~~~~~sPt~~~L~~~~NW~~v~~~k~i~iv~~~--~~~ 328 (330) T protein:vir:10 293 DAGNITPSNADLAKFKNWKRVYEPKNIGIIALK--HKI 328 (330) T ss_pred ccCcCCcChHHhcCCcCcccccChhhcceEEEE--Eec Confidence 0 0 01111 111110 111 No 48 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=97.74 E-value=2.3e-05 Score=46.00 Aligned_cols=336 Identities=13% Similarity=0.037 Sum_probs=164.6 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||....-. .-.|++ .-..++-++|.|.+.+...-+..+.+... +.++ .+ ..|.++.|+-.-..+ T Consensus 1 Ms~~n~~t-~p~~~g----sg~~~aL~Le~f~GeV~taF~~~si~~~~------~~vR---tI--~~gkS~qf~~lG~s~ 64 (400) T protein:vir:10 1 MSTPNNLT-NVAVSA----SGEVDSLLIEKFNGKVNEQYLKGENIMSY------FDVQ---TV--TGTNTVSNKYLGETE 64 (400) T ss_pred CCCCcccc-cccccc----ccchhhhHHhHhcchHHHHHHHHhhhccc------ceee---ee--cccceEEEEEeeeeE Confidence 77653221 111221 00111124888888887777666554311 2333 23 567899999998887 Q ss_pred cCceecCceeeeehhhhhhcccEEEEeeec---cccccCcchhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHH--hhh Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAIVH--LAG 154 (404) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~R---~~V~~~gkms~qrs~~d-lr~~ar~~L~~w~~~~~D~~~~~~--laG 154 (404) -...+-.+.+.|+ ........|.||... |.|. .+++-...+| +|.+--..+..=+++..||.+|-+ +++ T Consensus 65 a~y~~pG~~ldg~--~~~~dk~~ItIDtLL~a~~~V~---dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~ 139 (400) T protein:vir:10 65 LQVLAPGQSPAAT--STQADKNQLVIDATVIARNTVA---HLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGG 139 (400) T ss_pred EeeecCCCCcCCC--CcccCcEEEEeCceeeecchhh---hHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 6677777778866 467777779999865 4443 4677788899 899999999999999999988744 444 Q ss_pred cccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecC Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (404) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g 234 (404) ... +..|. .+.+...+.... .+.+ ++ .+.+...+.+.-.+.+.+ ....+..-| T Consensus 140 ~a~------t~~~~----~~~~g~~~g~s~-----~v~~--~~-~~~~~~~~~l~~A~~~A~-~~LdEkdVP-------- 192 (400) T protein:vir:10 140 IAN------TQAKR----TNPRVKGHGFSV-----NVEV--NE-GEALVNPQYVMAAVEFAL-EQQLEQEVD-------- 192 (400) T ss_pred ccc------ccccc----ccCCccccccce-----eecc--cc-cccccCHHHHHHHHHHHH-HHHHhcCCC-------- Confidence 211 01111 111100011100 0100 11 111112222222222222 223332222 Q ss_pred ccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEe Q lcl|Aclame:pro 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) Q Consensus 235 ~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~ 314 (404) .+ -+|+|+.|..|.-|+..+- .. +..-... ..+..=+|.+++++||.|.|-++.|-.--..+.... T Consensus 193 -----~~-d~vvl~pp~~Ys~Ll~~dk---Lv----nrdf~~s-~~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~l 258 (400) T protein:vir:10 193 -----IS-DVAILMPWRYFNVLRDADR---IV----DKSYTIS-QSGATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLL 258 (400) T ss_pred -----cc-ceEEEcCHHHHHHHHhCCc---cc----chhcccc-CCCccccceEEEEeceEEEeeCcCCcccCccccccc Confidence 11 2677777777767765431 11 0111101 135567799999999999999988732111222223 Q ss_pred ecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCc-hhHHHHHHHhchhhccccC-------- Q lcl|Aclame:pro 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDN-RTEIAISWINGLKKIRFPE-------- 385 (404) Q Consensus 315 ~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~-~~~i~i~~i~G~~K~rF~~-------- 385 (404) ++++.+..-+ +.+...-.+++++=..|++.+=...--+++ |.| ..+ -.-|-....+|+.-.|-.- T Consensus 259 S~a~~G~~y~-~t~d~s~~~av~F~~sAv~tvk~~~lt~~~-~~d----~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~ 332 (400) T protein:vir:10 259 SNEDNGYRYD-PIAEMNGAIAVLFTADALLVGRSIDVIGDI-FYE----KKEKTYYIDTFMSEGAIPDRWEAVSVVTTKR 332 (400) T ss_pred ccCCCCccCC-ccccccceeEEEEehhheEEEEeecccccc-ccc----hhhHHHHHHHHHHhCCcccchhheEEEEecC Confidence 3332222111 112333446666666666664222212223 222 211 2234556677777666410 Q ss_pred ------CCCCceEEEEE---------EEeeeecC Q lcl|Aclame:pro 386 ------KSGKMQDHGVI---------AVDTAVKL 404 (404) Q Consensus 386 ------~~g~~~DfGvi---------~idta~~~ 404 (404) ..|++.||..| -+-++.+- T Consensus 333 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 (400) T protein:vir:10 333 QSTGAVDSGNAAQHTQVLNRAQRKAVYVKNAAPA 366 (400) T ss_pred CcccccccCcchhHHHHHhhcccceEEEeccccc Confidence 01112222211 01111111 No 49 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=97.64 E-value=2.9e-05 Score=45.50 Aligned_cols=296 Identities=11% Similarity=0.019 Sum_probs=135.4 Q ss_pred hccch-HHHHHHhhhhhhhhhccccccccCCCCCccEEE-E-ecccCCCCcEEEEEEeeccccCceecCceee---eehh Q lcl|Aclame:pro 22 NRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVR-I-TDLNKQAGDEVTFSIMHKLSKRPTMGDERVE---GRGE 95 (404) Q Consensus 22 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~-~-~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~le---Gnee 95 (404) +-|.- .-++|+..+...-.+..-+.. .+.| + .|+.-..||+|+++...........-....+ ...+ T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~--------lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTN--------LVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVS 72 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchh--------hhccccccccccCCCCeEEEeecccccceeeeccccccCCccccc Confidence 33322 134566543332222211111 1111 1 2444467999999877666544433222222 2345 Q ss_pred hhhhcccEEEEeeecc-ccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccceeeccccccc Q lcl|Aclame:pro 96 DLSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEF 174 (404) Q Consensus 96 ~L~~~sd~v~Idq~R~-~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~~~~ 174 (404) ++.-...++.||+.++ ++...+ .+.-....|++++.-+....=+++..|+.++..++++... T Consensus 73 ~~~~~~~~~~id~~k~~~~~i~d-~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~---------------- 135 (392) T protein:vir:99 73 DFTEDSFPVTLTDVAYHLGVLTD-EELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE---------------- 135 (392) T ss_pred ccccceEEEEEeeeeecceeech-HHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------- Confidence 7777888999988875 455553 3555678889888777777778888888887666654321 Q ss_pred cccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEchHHHH Q lcl|Aclame:pro 175 KKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWN 254 (404) Q Consensus 175 ~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~ 254 (404) +. .. ...++.+ ..++.|-.|...+++..-| +| ++++++|..+. T Consensus 136 -----~~-~~--------------~~~~~~~--~~~~~i~~a~~~L~~~~vP------~~---------R~~vv~p~~~~ 178 (392) T protein:vir:99 136 -----AA-GA--------------VHEVAPD--EFFKGVNGARRALNELYIP------QG---------RVLVVGTAVTE 178 (392) T ss_pred -----cc-cc--------------ccccChh--hhHHHHHHHHHHHhhcCCC------CC---------CEEEEcHHHHH Confidence 00 00 0011111 1345566677777765533 11 46778999999 Q ss_pred HHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcccccccccccccchhh Q lcl|Aclame:pro 255 DWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDR 334 (404) Q Consensus 255 dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~~~~aa~~~v~r 334 (404) .|++|+.+..+..... .....|-.|.+|.+.|+-+.+..+.|.. .+ +.+..... .....+. T Consensus 179 ~l~~~~~~~~~~~~g~-------~~~~~l~~G~vg~i~G~~v~~s~~~~~~--t~--~a~~~~a~--~~at~a~------ 239 (392) T protein:vir:99 179 QILNDDRFIKYESQGQ-------SAVSALQEARLGRIYGYEIVESTLIPHG--DA--YLYHPTAF--IMATRAP------ 239 (392) T ss_pred HHhcccceeecccccc-------hhhhhhhcceeeeeeeeEEEeecccccc--cc--eeeecccc--ccccccc------ Confidence 9999986432221111 0124577899999999999887765421 10 00000000 0000000 Q ss_pred heeeccceeEEEeeecCCCCcceeecccccCchh---HHHHHHHhchhhccccCCCCC--ceEEEEEEEeeeecC Q lcl|Aclame:pro 335 AMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRT---EIAISWINGLKKIRFPEKSGK--MQDHGVIAVDTAVKL 404 (404) Q Consensus 335 alLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~---~i~i~~i~G~~K~rF~~~~g~--~~DfGvi~idta~~~ 404 (404) ....|+.... ++.........|. .||.... ...+....|...+. ...+. .....+.+...-+.+ T Consensus 240 v~~~~~~~~~-s~s~~~~v~~~~~---~~~~~t~~s~~~~v~~~~g~~~v~--~~~~~~~~~~~~~~~~~~~v~v 308 (392) T protein:vir:99 240 APPMGAVRST-AISGDQRIAMRWL---VDYDSTITSNRSLIDTYFGLKVVE--DPNGVGFVRARKIHLIPGSIEV 308 (392) T ss_pred ccccccccee-EEecccceeccee---ecccceeeccccccceeEEEEEEe--eccccceeeeeeeeeecceeee Confidence 1111211111 1111011111222 2332211 11122223322111 00000 011111111100111 No 50 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=97.56 E-value=3.2e-06 Score=50.71 Aligned_cols=317 Identities=14% Similarity=0.120 Sum_probs=158.8 Q ss_pred CCCcCchHHHHH----HHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEeccc---CCCCcEEEE Q lcl|Aclame:pro 1 MTTVTSAQANKL----YQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLN---KQAGDEVTF 73 (404) Q Consensus 1 ~~~~~~~~a~~~----~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~---k~~Gd~v~f 73 (404) |--+ .+.-+ .--=+|+..-.+.+ . ....+.+ +.+|+.-.+|. +..|+.|++ T Consensus 1 M~~~---~~~T~l~Dii~pEvF~~Yv~~~~-------------~---e~~~l~q---SGiv~~d~~l~~~~~~gG~~v~i 58 (367) T protein:vir:80 1 MPDF---NNQVRLVDAVIPEVYTSYTAIDR-------------P---ELTAFFL---SGAVASNDFLSQFLSAPGRLINI 58 (367) T ss_pred Ccch---hhhhhhhhccchhhhhHHHhhhh-------------h---hhhhhhh---cceeecCHHHHHHhhcCCCEEEe Confidence 2100 00000 01112222222111 1 1122333 34666666665 488999999 Q ss_pred EEeeccccCc-ee-cCce-eeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 74 SIMHKLSKRP-TM-GDER-VEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIV 150 (404) Q Consensus 74 ~L~~~L~G~g-v~-Gd~~-leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~ 150 (404) ++...|.|+. .. +|+. .+---..+.-.+|.-+|=....+..... +++--+--|..+....++++||++..-..+|- T Consensus 59 Pf~~~L~g~~~n~~~d~~~~~~t~~kittg~~~a~v~~r~kaw~~~D-la~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla 137 (367) T protein:vir:80 59 PFWRDLDSLEPNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMD-LTAELAGSNPMTRIRNRFGVYWTRQWQRRIIA 137 (367) T ss_pred eeeccCCCCccccCCCCCcccccccccccchheeeeehhcccchhhh-HHHHhhCchHHHHHHHHHHHHhhhhhHHHHHH Confidence 9999998853 22 3222 1222356777777777777667765543 45556667999999999999999988888888 Q ss_pred HhhhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCce Q lcl|Aclame:pro 151 HLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPV 230 (404) Q Consensus 151 ~laG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv 230 (404) .|.|..+.....+..... ......-+.+....+|++=-.+.+. ++.-+||.+.+-.|...+- T Consensus 138 ~L~Gvf~~~~a~~~~~~~-----~~~~~~a~~~~~~~~~~~Dis~~t~----~~~~~~s~~~~~~A~~~lG--------- 199 (367) T protein:vir:80 138 MAVGVYKSNLAGNFATIK-----TRGRVPAEVLGTAGDMVIDISGQTN----PADAVFNREAFVDAAFTMG--------- 199 (367) T ss_pred HHHHhhccccccchhhhh-----hhhccccccccccCceeeeeeccCC----CccceecHHHHHHHHHHhc--------- Confidence 899988764443332110 0000001112222222222111111 1234688887666644332 Q ss_pred EecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccce Q lcl|Aclame:pro 231 RLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGS 310 (404) Q Consensus 231 ~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~ 310 (404) |. .+.+=+++|||..+..|++. +.++.-++. .+ ...++.|+|..|..--.||.-- T Consensus 200 ----D~---~~~l~~i~mHS~V~~~L~~~----~li~~i~~s----d~------~~~i~ty~G~~VIvDD~~Pv~~---- 254 (367) T protein:vir:80 200 ----DH---VGSIAAIAVHSMVYKRMTNN----DEIEFIPDS----KG------QLTIPTYMGKVVIVDDGMPVFG---- 254 (367) T ss_pred ----cc---cccccEEEEchHHHHHHHhc----cccccccCC----CC------ccccceecceeEEEeCCCcccc---- Confidence 11 11246999999999999997 345544432 11 2458999999888766665310 Q ss_pred eEEeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeeccccc--Cchh--HHHH---HHHhchhhccc Q lcl|Aclame:pro 311 KVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDM--DNRT--EIAI---SWINGLKKIRF 383 (404) Q Consensus 311 ~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~--g~~~--~i~i---~~i~G~~K~rF 383 (404) ++ +..+-.+.|+|..|.. |+.-+...+ +|-..|- |+.- ++.+ ..++...=+.| T Consensus 255 --------------~~--a~~~yttYlfg~GAi~--~~~~~~~~~--~E~~Rd~~~~~~gG~d~L~~Rr~~~~hP~G~s~ 314 (367) T protein:vir:80 255 --------------TG--ADKTYLSILFGGAAFG--YADGAPQVP--VAVGRRELRGNGSGLEYILERKEWIVHPGGFNW 314 (367) T ss_pred --------------cC--CCceEEEEEEecceee--ecccCCccc--eecccchhhhcCCceEEEEeeeeEEeecceeee Confidence 11 1123467999998855 443222122 3433333 1111 1122 12333333444 Q ss_pred cCCCC-------------------------CceEEEEEEEee-eecC Q lcl|Aclame:pro 384 PEKSG-------------------------KMQDHGVIAVDT-AVKL 404 (404) Q Consensus 384 ~~~~g-------------------------~~~DfGvi~idt-a~~~ 404 (404) .+... +..-+-. +.|. .++| T Consensus 315 ~~~~v~~~~~~~~~~~~~~~~~sPt~~eLa~~~NW~~-v~d~K~I~i 360 (367) T protein:vir:80 315 LDADVTIPDNTGSPSGITSGPPAITLANLANPDNWER-VTYRKNVPM 360 (367) T ss_pred cccccccccccccccccccccCCCChHHhcCCccccc-ccchhhcce Confidence 32210 0011111 1121 1112 No 51 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=97.45 E-value=5.7e-05 Score=43.84 Aligned_cols=337 Identities=13% Similarity=0.050 Sum_probs=160.5 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) ||....-. ...|+++ -...+-++|.|.+.+...-+..+.+... ..++ .+ ..|.++.|+-.-..+ T Consensus 1 Ms~~n~~t-~~~~~~s----g~~~al~Le~f~GeV~taF~~~si~~~~------~~vR---ti--~~gkS~qf~~~G~s~ 64 (401) T protein:vir:70 1 MSTPNNLT-NVAVSAS----GEVDSLLIEKFNGKVNEQYLKGENIMSY------FDVQ---TV--TGTNTVSNKYLGETE 64 (401) T ss_pred CCCCcccc-ccccccc----cchhHhHHhHhcchHHHHHHHHhhhccc------ceee---ee--cccceEEEEEeeeeE Confidence 77653322 1222210 0112225888888887777666554311 2333 23 567899999997777 Q ss_pred cCceecCceeeeehhhhhhcccEEEEeeec---cccccCcchhhhhhhhh-HHHHHHHHHHHHHHHHHHHHHHHH--hhh Q lcl|Aclame:pro 81 KRPTMGDERVEGRGEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAIVH--LAG 154 (404) Q Consensus 81 G~gv~Gd~~leGnee~L~~~sd~v~Idq~R---~~V~~~gkms~qrs~~d-lr~~ar~~L~~w~~~~~D~~~~~~--laG 154 (404) -...+-.+.+.| +........|.||... |.|. .+++-.+.+| +|.+--..+..=+++..||.++-. +|| T Consensus 65 ~~~~~pG~~ld~--~~~~~dK~~ItID~lL~a~~~V~---dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa 139 (401) T protein:vir:70 65 LQVLAPGQSPAA--TSTQADKNQLVIDATVIARNTVA---HLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGG 139 (401) T ss_pred eeeecCCCCcCC--CCcccccEEEEeCceeehhhhhh---hHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 666666666775 4566667779999875 4443 4677788899 899999999999999999987443 445 Q ss_pred cccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecC Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (404) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g 234 (404) ... ..|....|.. -+- ...+-.++++....++... +- +.+..|....++..-| T Consensus 140 ~an-------a~~~~~~p~~---------~~~-G~~i~v~~~~~~~~~~~~~-l~-~ai~dA~~~LdEkdVP-------- 192 (401) T protein:vir:70 140 IAN-------TQAKRTNPRV---------KGH-GFSINVEVAEGEALVNPQY-VM-AAVEFALEQQLEQEVD-------- 192 (401) T ss_pred ccc-------ccccccCCCc---------CCC-ceEEeccccccccccCHHH-HH-HHHHHHHHHHHhcCCC-------- Confidence 332 0000000000 000 0011111111111111111 11 1122233333333322 Q ss_pred ccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEe Q lcl|Aclame:pro 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) Q Consensus 235 ~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~ 314 (404) .+ -||+|+.|..|+-|..-+. ..+ +.... ...+..=+|.+++++||.|.+-++.|-.-........ T Consensus 193 -----~~-r~vvl~pp~~Ys~Ll~~d~---L~n----rd~~~-s~~g~~~~G~v~~vaGv~Vv~SnnlP~~a~~it~~~l 258 (401) T protein:vir:70 193 -----IS-DVAILMPWRYFNVLRDADR---IVD----KTYTI-SQSGATIQGFTLSSYNCPVIPSNRFPKYSQGQTHHLL 258 (401) T ss_pred -----cc-ceEEEcCHHHHHHHHhcCc---ccc----hhhcc-ccCCccccceEEEEeceEEEeeccccccccccccccc Confidence 11 2788888888877776542 111 11110 1135566788999999999999987731100111222 Q ss_pred ecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccc----------c Q lcl|Aclame:pro 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRF----------P 384 (404) Q Consensus 315 ~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF----------~ 384 (404) ++.+.+.--+ +.+...-.+++++=..|++.+=...--+++ |.|+.. .-.-|-....+|+.-.|- . T Consensus 259 s~a~~G~~y~-~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~-~~d~r~---~~~~id~~~a~g~g~~RPeaa~vv~~k~~ 333 (401) T protein:vir:70 259 SNEDNGYRYD-PLPAMNGAIAVLFTADALLVGRSIDVTGDI-FYEKKE---KTYYIDTFMAEGAIPDRWEAVSVVTTKRN 333 (401) T ss_pred cccCCCccCC-CCccccceeEEEEehhheEEEEeeccccch-hhhhhh---hHHHHHHHHHhCCcccchhheEEEeecCc Confidence 2222211111 123334446666666666664222111222 222210 112233455566555553 1 Q ss_pred CCCC-----CceEEEEEEEeeeecC Q lcl|Aclame:pro 385 EKSG-----KMQDHGVIAVDTAVKL 404 (404) Q Consensus 385 ~~~g-----~~~DfGvi~idta~~~ 404 (404) -..+ .+.||-.+..--+-|- T Consensus 334 ~~~~~~~~~~~~~~~~~~~~~~~~~ 358 (401) T protein:vir:70 334 TTTGAVEGTDGAQHTIVKNRAQRKA 358 (401) T ss_pred ccccccccCCcchhhhhhhhcccee Confidence 1000 0111222111111111 No 52 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=97.45 E-value=5.7e-05 Score=43.86 Aligned_cols=277 Identities=9% Similarity=0.056 Sum_probs=134.9 Q ss_pred HhhccchHH--HHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccccCceecCceeeeehhhh Q lcl|Aclame:pro 20 AANRNRSMV--NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDL 97 (404) Q Consensus 20 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGnee~L 97 (404) ..-+++.++ ++|+..+...-.+..-+....-++- -.|. +++||+|+++....+.-.- ...+. .+++ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y------~~e~-~~~GDTV~I~vp~~~~v~d---g~~~~--~~~~ 68 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNY------EKTF-GKVGDTIRLKLPYRVKSAS---GRTLV--KQPM 68 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCC------chHH-hhCCCEEEEeeCCceeecc---cCCcc--cccc Confidence 333444443 3776665444433332222111111 1122 3579999998766553110 11122 4566 Q ss_pred hhcccEEEEeeecc-ccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccceeeccccccccc Q lcl|Aclame:pro 98 SHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKK 176 (404) Q Consensus 98 ~~~sd~v~Idq~R~-~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~~~~~~ 176 (404) .-.+-.|.||+..+ ++...++ ++-.+.-||+++.-+....=+++..|+.++-.+.++.. T Consensus 69 te~~v~l~id~~k~~~~~itD~-e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~------------------- 128 (418) T protein:vir:10 69 VDQTIPFKIAYQEHVGLEYTVK-DKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFH------------------- 128 (418) T ss_pred ccceEEEEEecccccceeechH-HHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc------------------- Confidence 66777899988874 4566543 33445668877666666666677777776644433221 Q ss_pred cccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEchHHHHHH Q lcl|Aclame:pro 177 IMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDW 256 (404) Q Consensus 177 ~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~dL 256 (404) .+..+. + . .+ .++.|-.+..++.+.+-| -+|+ +.++++|..+..| T Consensus 129 ----~~gt~g----------t-----~-~~--~~~~i~~a~~~Ld~~~VP-----~~G~--------R~lVv~P~~~~~L 173 (418) T protein:vir:10 129 ----SSGTPG----------V-----R-PG--AFIDFANAGAKQTTYAVP-----QDGM--------RHAVLDPFTCASL 173 (418) T ss_pred ----ccccCC----------c-----C-cc--hHHHHHHHHHHHHhcCCC-----CCCc--------eEEEeCHHHHHHH Confidence 000000 0 0 01 255555677777766544 1121 5777999999999 Q ss_pred HhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcccccccccccccchhhhe Q lcl|Aclame:pro 257 YTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAM 336 (404) Q Consensus 257 r~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~~~~aa~~~v~ral 336 (404) ..|..+. + ... +....|-.|.+|.+.|+-|.+..++|.. ..| ...+ -.. T Consensus 174 ~~~~~~~-~--------~~~-~~~~~lr~G~IG~i~GF~V~~S~nip~~-tag-------------~~~~-------t~~ 222 (418) T protein:vir:10 174 SDEVTKL-F--------KES-MVEQAYKMGYRGNVAAYEVYESQNLPKH-TVG-------------DHGG-------TPL 222 (418) T ss_pred hhhcccc-c--------ccc-ccchhhheeeeeeeeceEEEEecCCCcc-ccc-------------cccc-------cee Confidence 9886421 1 111 3345688999999999999998877621 111 0000 011 Q ss_pred eeccce--eEEEeeecCCCCccee--ecccccCchhHH-HHHHHhchhhccccCCCCCceEEEEEEE-e------eeecC Q lcl|Aclame:pro 337 LLGAQA--LANAYGQKAGGHFNMV--EKKTDMDNRTEI-AISWINGLKKIRFPEKSGKMQDHGVIAV-D------TAVKL 404 (404) Q Consensus 337 LlGaQA--l~~A~g~~~g~r~~w~--Ee~~D~g~~~~i-~i~~i~G~~K~rF~~~~g~~~DfGvi~i-d------ta~~~ 404 (404) ..|+++ ..++. ...|. +-..--|+...| ++..+.++.|-. .+..+-|-|..- + +.++| T Consensus 223 v~ga~~~~~~~~~------~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~----~~~~~~f~V~~~~~~~~~~~~tv~i 292 (418) T protein:vir:10 223 VNGTVVNGDTVGF------DGGTASTTGFLKAGDVITFGGVFGVNPQNYET----TGLLQEFVVLEDVDTDAGGAGSIKI 292 (418) T ss_pred eecccccceeEEE------eecceeeccceeeccEEEECceeecccccccc----cccceEEEEEeeccccccCcceeEe Confidence 122211 11111 00110 011112222222 223333333332 345566744322 1 35566 No 53 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=97.43 E-value=6.8e-05 Score=43.43 Aligned_cols=309 Identities=11% Similarity=0.012 Sum_probs=155.5 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeec-- Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK-- 78 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~-- 78 (404) ||- = ...|. ..-.-++++|.+.+....+.+..+ +-++ ++..+ .-+.++.+..-.... T Consensus 7 ~~~--~----~~Ms~------~i~~~fv~qy~~~v~~~~qq~~s~--L~~t-----V~~~~--~~~~~~~~~~~~~~~~~ 65 (322) T protein:vir:10 7 MSM--L----PLIAG------DIDQAFVQTYETTLRILSQQKSAK--LKQY-----CQHKN--ESSESHNWETLASMDPD 65 (322) T ss_pred eee--e----eeeec------hhhhHHHHHHHHHHHHHHHHhhhh--hhcc-----ccccc--ccccccceeeccccccc Confidence 110 0 00000 122225788888776655544221 1111 11111 111222222111111 Q ss_pred cccCceecCceeeee----hhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 79 LSKRPTMGDERVEGR----GEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAG 154 (404) Q Consensus 79 L~G~gv~Gd~~leGn----ee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG 154 (404) .-+..-.+.....+. ..+.....-.+.+++...++.+. +++.-|..+|+|...-..++.=+++..|+.++- + T Consensus 66 ~~~~~~~~~~~~d~~~dtp~~~~~~~~r~~~~~d~~~~~~VD-d~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~---a 141 (322) T protein:vir:10 66 AVKRKRSRQQSADGTYPTPVNNKPFAKRRTNVDTYDTGHVVE-QEDISQMLLDPNSALITSQAYAMARKTDDLIIA---G 141 (322) T ss_pred ccccccccccccCcccCCCccccccceEEEeecccccceecc-hHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHh---h Confidence 111111111111111 01223333345555555555443 678888999999999999999999999998863 3 Q ss_pred cccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecC Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (404) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g 234 (404) +.+.-.. +. ..-++..|+++-+ ..++=.++.+.|..|+.+.++.. T Consensus 142 ~~g~a~~-----------~~---~gt~v~~~ss~~i-----------~~g~~g~t~~kl~~a~~~l~~~d---------- 186 (322) T protein:vir:10 142 AWKPASI-----------KG---TGQPVEFLATQEI-----------GDGTKPISFDYVTEITERFLENE---------- 186 (322) T ss_pred hhccccc-----------cc---cccccccCCCccc-----------ccCccchhHHHHHHHHHHHHhcC---------- Confidence 3331100 00 0112333332210 11222567777888888877654 Q ss_pred ccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCccee-CCeEEEcCEEEEecCceeeeeccceeEE Q lcl|Aclame:pro 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK-GECAMWRNILVRKYAGMPIRFYQGSKVL 313 (404) Q Consensus 235 ~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~-G~~gm~ngvii~~~~~~~irf~~~~~~~ 313 (404) +.++.+ .+|++.|.|+.+|.+|+.+. +..- ....+|+. |.+|.|-|+-+..+.++|. +.++ T Consensus 187 --vp~d~~-R~~vv~p~~~~~LL~d~~~t---s~D~-------~~~~~l~~~G~ig~~lGf~~i~s~~lp~--~~~t--- 248 (322) T protein:vir:10 187 --IEPEVS-KVIVIGPTQARKLLQITEAT---SADY-------TSAMDLQSKGIITNWMGYTWIVSTRLDK--FDPT--- 248 (322) T ss_pred --CCCCCC-eEEEeCHHHHHHHhcchhhh---hhhc-------ccchhhhhcCeeeeeeeEEEEEeccCCc--cccc--- Confidence 222222 46899999999999998543 1111 12467875 8899999999988887762 1100 Q ss_pred eecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEE Q lcl|Aclame:pro 314 VSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDH 393 (404) Q Consensus 314 ~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~Df 393 (404) -........++..++.++.-=.+|++.|=++.-.++. .|.-|..+-.-|-..+.+|-.-++ += T Consensus 249 -----~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i---~~~~~~~~a~~I~~~~~~Ga~ri~---------~~ 311 (322) T protein:vir:10 249 -----QWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKV---AEDPSASFAWRIYSAFTADCVRVE---------DE 311 (322) T ss_pred -----cccccccCCCCccceeEEEEecCceeEEEeeeeeEEe---eccCCcchhhhhhhhhhhCceEec---------cC Confidence 0001112223344555565555666666444211222 122334444555556777766653 56 Q ss_pred EEEEEeeeecC Q lcl|Aclame:pro 394 GVIAVDTAVKL 404 (404) Q Consensus 394 Gvi~idta~~~ 404 (404) ||+.|+--=-| T Consensus 312 gVv~i~~~e~~ 322 (322) T protein:vir:10 312 HIFKLRLKNSL 322 (322) T ss_pred cEEEEEEeccC Confidence 88888875555 No 54 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=96.49 E-value=0.00057 Score=38.35 Aligned_cols=278 Identities=12% Similarity=0.055 Sum_probs=141.8 Q ss_pred CC-CcCchHHHHHHHHHHHHH--hhccch-HHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEe Q lcl|Aclame:pro 1 MT-TVTSAQANKLYQVALFTA--ANRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIM 76 (404) Q Consensus 1 ~~-~~~~~~a~~~~~~~lft~--~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~ 76 (404) |- .+-.+.-+-+.----|+. ..-|+. +.++|+..|..-....++-.+.. .|.-+ +-..|++|.++-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~---~N~~~------e~~gg~tVkIp~i 71 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPAL---ISNDA------IFMEGRSFTVMKG 71 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcc---cCcce------EeccCcEEEEeee Confidence 21 111111111111112332 333344 35667776654444333321111 11112 2246999998766 Q ss_pred eccccCceecCcee--eeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 77 HKLSKRPTMGDERV--EGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNL--ASSARTLLGTYFNDLQDQCAIVHL 152 (404) Q Consensus 77 ~~L~G~gv~Gd~~l--eGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dl--r~~ar~~L~~w~~~~~D~~~~~~l 152 (404) .- .++ +|-.. -.+.++++....++.+||.|----.=..|+..-+..++ -........+.+...+|.-.|-.| T Consensus 72 ~~---~gl-~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skl 147 (319) T protein:vir:97 72 DT---TEL-KDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATL 147 (319) T ss_pred cc---ccc-ccccCCCCcccCCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHH Confidence 54 223 23221 13456789999999999998331111345655555544 334455555556666676666555 Q ss_pred hhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEe Q lcl|Aclame:pro 153 AGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRL 232 (404) Q Consensus 153 aG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~ 232 (404) ++.-+. ..+..++++. .++.|+.+.+++++.+-| T Consensus 148 a~~a~~--------------------------------------~~~~~~t~~n--~y~~i~~a~~~Lde~~VP------ 181 (319) T protein:vir:97 148 ARNKAK--------------------------------------HLTVGTGSDA--QYDAVLDVSVELDEIKAP------ 181 (319) T ss_pred Hhhccc--------------------------------------ccccccCHHH--HHHHHHHHHHHHHhcCCC------ Confidence 432210 0111233222 378888999999886532 Q ss_pred cCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeE Q lcl|Aclame:pro 233 SGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKV 312 (404) Q Consensus 233 ~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~ 312 (404) +. +||+|+|..+.-|++++.+ ++.-. .....+..|.+|.+|||.|.+.|+. |+. T Consensus 182 --~~-------Rvl~Vtp~~~~~L~~~~~f------~~~~~----~~~~~~~~g~Vg~idG~~Vi~vps~--~~k----- 235 (319) T protein:vir:97 182 --EN-------RVLFVSPTFYKGIKKFVIA------LPQGD----TRQQVLGKGVQGELDGFVIVKVPTK--LLQ----- 235 (319) T ss_pred --CC-------cEEEeCHHHHHHHHhhhhh------hcccc----ccccceeeeeceeecCeEEEEeccc--ccc----- Confidence 11 6899999999999999753 22211 1246789999999999999987742 220 Q ss_pred EeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcce-eecccccCchhHHHHHHHhchhhccccCCCCCce Q lcl|Aclame:pro 313 LVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNM-VEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQ 391 (404) Q Consensus 313 ~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w-~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~ 391 (404) ...+++|......+--+...++.+- .+.. . .+.+.| ..| T Consensus 236 --------------------~in~i~~h~~A~~~~~k~~~~~~~~p~~~~------~---a~~v~g---r~y-------- 275 (319) T protein:vir:97 236 --------------------GLQAIAVVGEVLASPIQADLAKTNSNIPGM------F---GTLAEQ---LLY-------- 275 (319) T ss_pred --------------------cceEEEEcCCeeeeeeeeeeeeccCCCccc------c---ceeeee---eee-------- Confidence 1357888776666655533222211 1111 1 112222 222 Q ss_pred EEEEEEEeeeecC Q lcl|Aclame:pro 392 DHGVIAVDTAVKL 404 (404) Q Consensus 392 DfGvi~idta~~~ 404 (404) ||+.+++...+- T Consensus 276 -~d~~V~~~k~~~ 287 (319) T protein:vir:97 276 -TGAFVPEHLQKY 287 (319) T ss_pred -eeeEEeccccce Confidence 333333332222 No 55 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=96.49 E-value=0.00057 Score=38.35 Aligned_cols=278 Identities=12% Similarity=0.055 Sum_probs=141.8 Q ss_pred CC-CcCchHHHHHHHHHHHHH--hhccch-HHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEe Q lcl|Aclame:pro 1 MT-TVTSAQANKLYQVALFTA--ANRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIM 76 (404) Q Consensus 1 ~~-~~~~~~a~~~~~~~lft~--~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~ 76 (404) |- .+-.+.-+-+.----|+. ..-|+. +.++|+..|..-....++-.+.. .|.-+ +-..|++|.++-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~---~N~~~------e~~gg~tVkIp~i 71 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPAL---ISNDA------IFMEGRSFTVMKG 71 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcc---cCcce------EeccCcEEEEeee Confidence 21 111111111111112332 333344 35667776654444333321111 11112 2246999998766 Q ss_pred eccccCceecCcee--eeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 77 HKLSKRPTMGDERV--EGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNL--ASSARTLLGTYFNDLQDQCAIVHL 152 (404) Q Consensus 77 ~~L~G~gv~Gd~~l--eGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dl--r~~ar~~L~~w~~~~~D~~~~~~l 152 (404) .- .++ +|-.. -.+.++++....++.+||.|----.=..|+..-+..++ -........+.+...+|.-.|-.| T Consensus 72 ~~---~gl-~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skl 147 (319) T protein:vir:94 72 DT---TEL-KDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATL 147 (319) T ss_pred cc---ccc-ccccCCCCcccCCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHH Confidence 54 223 23221 13456789999999999998331111345655555544 334455555556666676666555 Q ss_pred hhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEe Q lcl|Aclame:pro 153 AGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRL 232 (404) Q Consensus 153 aG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~ 232 (404) ++.-+. ..+..++++. .++.|+.+.+++++.+-| T Consensus 148 a~~a~~--------------------------------------~~~~~~t~~n--~y~~i~~a~~~Lde~~VP------ 181 (319) T protein:vir:94 148 ARNKAK--------------------------------------HLTVGTGSDA--QYDAVLDVSVELDEIKAP------ 181 (319) T ss_pred Hhhccc--------------------------------------ccccccCHHH--HHHHHHHHHHHHHhcCCC------ Confidence 432210 0111233222 378888999999886532 Q ss_pred cCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeE Q lcl|Aclame:pro 233 SGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKV 312 (404) Q Consensus 233 ~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~ 312 (404) +. +||+|+|..+.-|++++.+ ++.-. .....+..|.+|.+|||.|.+.|+. |+. T Consensus 182 --~~-------Rvl~Vtp~~~~~L~~~~~f------~~~~~----~~~~~~~~g~Vg~idG~~Vi~vps~--~~k----- 235 (319) T protein:vir:94 182 --EN-------RVLFVSPTFYKGIKKFVIA------LPQGD----TRQQVLGKGVQGELDGFVIVKVPTK--LLQ----- 235 (319) T ss_pred --CC-------cEEEeCHHHHHHHHhhhhh------hcccc----ccccceeeeeceeecCeEEEEeccc--ccc----- Confidence 11 6899999999999999753 22211 1246789999999999999987742 220 Q ss_pred EeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcce-eecccccCchhHHHHHHHhchhhccccCCCCCce Q lcl|Aclame:pro 313 LVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNM-VEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQ 391 (404) Q Consensus 313 ~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w-~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~ 391 (404) ...+++|......+--+...++.+- .+.. . .+.+.| ..| T Consensus 236 --------------------~in~i~~h~~A~~~~~k~~~~~~~~p~~~~------~---a~~v~g---r~y-------- 275 (319) T protein:vir:94 236 --------------------GLQAIAVVGEVLASPIQADLAKTNSNIPGM------F---GTLAEQ---LLY-------- 275 (319) T ss_pred --------------------cceEEEEcCCeeeeeeeeeeeeccCCCccc------c---ceeeee---eee-------- Confidence 1357888776666655533222211 1111 1 112222 222 Q ss_pred EEEEEEEeeeecC Q lcl|Aclame:pro 392 DHGVIAVDTAVKL 404 (404) Q Consensus 392 DfGvi~idta~~~ 404 (404) ||+.+++...+- T Consensus 276 -~d~~V~~~k~~~ 287 (319) T protein:vir:94 276 -TGAFVPEHLQKY 287 (319) T ss_pred -eeeEEeccccce Confidence 333333332222 No 56 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=96.33 E-value=0.00061 Score=38.22 Aligned_cols=305 Identities=12% Similarity=0.136 Sum_probs=150.2 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEeccc---CCCCcEEEEEEee Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLN---KQAGDEVTFSIMH 77 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~---k~~Gd~v~f~L~~ 77 (404) |+ .+.-.-++....-+|+..-.+.|. +. ..+.+ +.+|+.-.+|. .+.|+.|++++.. T Consensus 1 Ma-~T~l~D~iipe~~vf~~Yv~~~~~-------------e~---~~l~q---SGii~~d~~l~~~~~~gG~~~~iPf~~ 60 (349) T protein:vir:78 1 MA-ITTIGDIVTGNIPVLASYMTEDPV-------------EK---TAFFD---SGILTSTPYAAEIANGPSNIANLPFWK 60 (349) T ss_pred CC-ceEEeeeeccCHHHHHHHHHHhhH-------------Hh---hhhhh---ccceeccHHHHHHhhcCCCEEEeeeee Confidence 33 111111222233344443333331 11 22333 35666666665 4789999999999 Q ss_pred ccccC--c-eecCceeeee--hhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 78 KLSKR--P-TMGDERVEGR--GEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHL 152 (404) Q Consensus 78 ~L~G~--g-v~Gd~~leGn--ee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l 152 (404) +|.|+ + |.+|. -++. -+.+...++.-++=...++..... ++..-|--|..+....++++||.+.....+|-.| T Consensus 61 ~L~g~~e~nv~~D~-~~~~~t~~kitt~~~~a~~~~r~kaw~~~D-la~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L 138 (349) T protein:vir:78 61 AIDTSIEPNYSNDV-YQDIATPRAIQTGEMMARVAYLNEGFGQAD-LTVELTSQNPLQSVASRLDNFWQRQAQRRLIATA 138 (349) T ss_pred cCCCCcccccCCCC-cccccccccccccceeeeeeeeccccchhH-HHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHH Confidence 99984 3 43332 2222 345666777666666666654432 3444455688999999999999999888888889 Q ss_pred hhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHH-HHHHHHHHHHhcCCCCCceE Q lcl|Aclame:pro 153 AGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIG-LVDNLSLFIDEMAHPLQPVR 231 (404) Q Consensus 153 aG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~-~Id~a~~~a~~~~~pi~Pv~ 231 (404) .|+-+...... ....+. ++. + .++.++..++.+ +++....+.+.. . T Consensus 139 ~Gvf~~~~~a~------~~~~~~----~~~---t-------------~d~s~~a~~~~~~~~dA~~~lgda~-~------ 185 (349) T protein:vir:78 139 LGLYNDNVSAT------DAYHEQ----NDM---V-------------VDVSATLGFDAGAFIDATQTMGDAL-M------ 185 (349) T ss_pred HHhhccccccc------chhhhc----ccc---e-------------eeeccccCCChhhhhhhHHHHHHHh-c------ Confidence 88876321100 000011 100 0 012223334554 344444444431 0 Q ss_pred ecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeecccee Q lcl|Aclame:pro 232 LSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSK 311 (404) Q Consensus 232 ~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~ 311 (404) |++ .+.+=++.||+..+..|++.- -++..+. +++ ...++.|+|..+..--.+|. T Consensus 186 --Gd~---~~~lt~i~mHS~v~~~L~~~~----li~~i~~----s~~------~~~i~ty~G~~VivDD~~Pv------- 239 (349) T protein:vir:78 186 --GNG---GEVLGAIAMHSFVYAQARKAQ----LIDFIRD----AEN------NTMFATYQGYRVIVDDSMTV------- 239 (349) T ss_pred --ccc---ccceeEEEEchHHHHHHHhhh----hhhhccC----ccc------CcccceecCeEEEEeCCCcc------- Confidence 111 123569999999999999873 4443332 111 12367888887766555543 Q ss_pred EEeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeeccccc--Cchh--HHHH---HHHhchhhcccc Q lcl|Aclame:pro 312 VLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDM--DNRT--EIAI---SWINGLKKIRFP 384 (404) Q Consensus 312 ~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~--g~~~--~i~i---~~i~G~~K~rF~ 384 (404) ... +...+-..+|+|..|.+..-+. +..-+|-..|- |+.. +..+ ..+++.+=+.|. T Consensus 240 -----------~~~--g~~~~yttylfg~GAi~~~~~~----~~~~~et~rd~~~g~~~G~d~l~~R~~~~~hp~G~s~~ 302 (349) T protein:vir:78 240 -----------VGQ--GAQRKFISIIFGQGAIGYGEGN----PVMPLEYEREASRANGGGVETLWTRKTWLLHPFGYRFT 302 (349) T ss_pred -----------ccC--CCCceEEEEEeecceEEEccCC----CccceeeecccccCCcceeEEEEEeeEEEeeeeeeeec Confidence 111 1123456799998775554332 22223433443 2211 1111 122333344443 Q ss_pred CCCC------------------CceEEEEEEEee-eecC Q lcl|Aclame:pro 385 EKSG------------------KMQDHGVIAVDT-AVKL 404 (404) Q Consensus 385 ~~~g------------------~~~DfGvi~idt-a~~~ 404 (404) .... +..-+-.+ +|. .++| T Consensus 303 ~a~v~~~~~~~~~~sPt~aeLa~~~NW~~v-~~~K~I~i 340 (349) T protein:vir:78 303 SAVITGNGTETIARSASWQDLANATNWNRV-VDRKHVPI 340 (349) T ss_pred cccccCCccccccCCCChHHhcCCcCcccc-cChhhcce Confidence 2210 01111111 111 1111 No 57 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=95.94 E-value=0.0012 Score=36.55 Aligned_cols=306 Identities=11% Similarity=0.130 Sum_probs=149.9 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEeccc---CCCCcEEEEEEee Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLN---KQAGDEVTFSIMH 77 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~---k~~Gd~v~f~L~~ 77 (404) |+ .+.-.-++..-.-+|+..-.+.|. +. ..+.+ +.+|+.-.+|. ++.|+.|++++.. T Consensus 1 Ma-~T~l~D~iipe~~vf~~Yv~~~~~-------------e~---~~l~q---SGii~~d~~l~~~~~~gG~~~~iPf~~ 60 (349) T protein:vir:94 1 MA-ITTIGNIVTGNIPVLASYMTEDPV-------------EK---TAFFN---SGILTPTPYAAEIARGPSNIANLPFWK 60 (349) T ss_pred CC-ceEEeeeeccChHHHHHHHHHhHH-------------Hh---hhhhh---ccceeccHHHHHHHhcCCCEEEeeeee Confidence 33 111111233333344444433331 11 22343 35677666665 4789999999999 Q ss_pred ccccC--c-eecCceee-eehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 78 KLSKR--P-TMGDERVE-GRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA 153 (404) Q Consensus 78 ~L~G~--g-v~Gd~~le-Gnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~la 153 (404) +|.|+ + +.||...+ .--..+...+|.-++=...++.... -+++.-|--|..+....++++||.+.....+|-.|. T Consensus 61 ~l~g~~e~n~~~dt~~~~~t~~kit~~~~~a~~~~r~kaw~~~-Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~ 139 (349) T protein:vir:94 61 AIDTSIEPNYSNDVYQDIATPRAIQTGEMMARVAYLNEGFGQA-DLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATAL 139 (349) T ss_pred cCCCCcccccCCCCcccccccccccccceeeeeeeeccccchh-HHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHH Confidence 99885 3 44444321 1234566666666665555554443 245555556889999999999999988888888898 Q ss_pred hcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHH-HHHHHHHHHHhcCCCCCceEe Q lcl|Aclame:pro 154 GARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIG-LVDNLSLFIDEMAHPLQPVRL 232 (404) Q Consensus 154 G~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~-~Id~a~~~a~~~~~pi~Pv~~ 232 (404) |+-+...... ....+. + + -...+.++..++.+ +++....+.+.. . T Consensus 140 Gvf~~~~~~~------~~~~~~----~-------~---------~~~d~~~~a~~~~~~~~~A~~~~Gdaa-~------- 185 (349) T protein:vir:94 140 GLYNDNVSAT------DAYHEQ----N-------D---------MVVDVSATSGFDAGAFIDATQTMGDAL-M------- 185 (349) T ss_pred hhhccccccc------cccccc----C-------c---------eeEEecccCCCChhhHHHHHHHHHHHh-c------- Confidence 8876311100 000010 1 0 01123334445655 444444444431 1 Q ss_pred cCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeE Q lcl|Aclame:pro 233 SGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKV 312 (404) Q Consensus 233 ~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~ 312 (404) |+ ..+.+=++.||+..+..|++.- -++.-+. +++ + ..++.|+|..+..--.+|.- T Consensus 186 -Gd---~~~~lt~i~mHS~v~~~L~~~~----li~~i~~----s~~--~----~~i~ty~G~~VivDD~~Pv~------- 240 (349) T protein:vir:94 186 -GN---GGEVLGAIAMHSFVYAQARKAQ----LIDFIRD----AEN--N----TMFATYQGYRVIVDDSMTVV------- 240 (349) T ss_pred -cc---cccceeEEEEchHHHHHHHhcc----hhhhccC----ccc--C----cccceecCcEEEEeCCCccc------- Confidence 11 1123568999999999999973 3443221 111 1 13577888777665555531 Q ss_pred EeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchh----HHHH---HHHhchhhccccC Q lcl|Aclame:pro 313 LVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRT----EIAI---SWINGLKKIRFPE 385 (404) Q Consensus 313 ~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~----~i~i---~~i~G~~K~rF~~ 385 (404) ......+-..+|+|..|....-+. +..-+|-..|--.+. +..+ ..+++.+=+.|.. T Consensus 241 -------------~~g~~~~yttylfg~GAi~~~~~~----~~~~~E~~rd~~~g~~~G~d~L~~R~~~~~hp~G~s~~~ 303 (349) T protein:vir:94 241 -------------GQDTSRKFISIIFGQGAIGYGEGN----PEMPLEYEREASRANGGGVETLWTRKTWLLHPFGYSFTS 303 (349) T ss_pred -------------cCCCCceEEEEEeecceEEeecCC----CCcceeeecccccCCcceeEEEEEeeEEEeeeeeeeecc Confidence 011122456799998775555332 112233333332111 1111 1223333344432 Q ss_pred CCC------------------CceEEEEEEEee-eecC Q lcl|Aclame:pro 386 KSG------------------KMQDHGVIAVDT-AVKL 404 (404) Q Consensus 386 ~~g------------------~~~DfGvi~idt-a~~~ 404 (404) ... +..-+-.+ +|. .++| T Consensus 304 a~v~~~~~~~~~~sPt~aeLa~~~NW~~v-~~~K~I~i 340 (349) T protein:vir:94 304 AVITGNGTETIARSASWQDLANAANWNRV-VDRKHVPI 340 (349) T ss_pred cccCCCccccccCCCChHHhcCCcCcccc-cChhhcce Confidence 210 01111111 111 1111 No 58 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=95.85 E-value=0.0014 Score=36.30 Aligned_cols=277 Identities=13% Similarity=0.112 Sum_probs=138.7 Q ss_pred HHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccccCceecCceeee Q lcl|Aclame:pro 13 YQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEG 92 (404) Q Consensus 13 ~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leG 92 (404) ++..+ .++|+..|.....+.+..... . +. +..=.-|++|.++=+. -.++.-=++-.| T Consensus 1 Main~----------a~~~~~~Ld~~~~~~~~t~~l-~---~~------~~~~~ggktVkI~~i~---~~gl~DY~R~~g 57 (290) T protein:vir:78 1 MAINY----------VDKYGKELDQKLVFGTYTNEL-E---TP------NLLWLDAKTFKIQTIT---TTGLKAHTRNKG 57 (290) T ss_pred CchhH----------HHHHHHHHHHHHHhhheeeec-c---cc------ceeeccCCEEEEeeec---cCcccccccCCC Confidence 12222 256666665555555443222 1 11 1112358999987444 222211111122 Q ss_pred e-hhhhhhcccEEEEeeec---cccccCcchhhhhh--hhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccee Q lcl|Aclame:pro 93 R-GEDLSHADFSLKINQGR---HLVDAGGRMSQQRT--KFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTIL 166 (404) Q Consensus 93 n-ee~L~~~sd~v~Idq~R---~~V~~~gkms~qrs--~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~ 166 (404) . ..+.+....++.+||.| +.|| .|+...+ ...+-........+.....+|.-.|-.|++.-+.. T Consensus 58 ~~~g~v~~~~et~tl~qdR~~~F~vD---~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~------- 127 (290) T protein:vir:78 58 YNEGSASNTNKSYTIDFDRDVEFFVD---VMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTN------- 127 (290) T ss_pred cccCccccceeeEEeeccccceeecc---ccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhcc------- Confidence 2 23456678889999988 3444 3443333 34455556666666666777777776664333200 Q ss_pred eccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEE Q lcl|Aclame:pro 167 PTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVL 246 (404) Q Consensus 167 p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~ 246 (404) | ......++++. -++.|+.+...+++ +..+. +|| T Consensus 128 -------------~---------------~~~~~t~t~~n--~~~~i~~~~~~lde---------vp~~~-------rvl 161 (290) T protein:vir:78 128 -------------S---------------NSVAEEITKDN--VFTKLKAAIRKVKK---------YGTQN-------LVM 161 (290) T ss_pred -------------C---------------cccccccCHHH--HHHHHHHHHHHHHh---------cCCCC-------eEE Confidence 0 00011122222 24566666666654 11122 899 Q ss_pred EEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCccccccccc Q lcl|Aclame:pro 247 YVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEV 326 (404) Q Consensus 247 ~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~~~~ 326 (404) +|+|.-+.-|+.++.| ++..... ....-...|.+|.+||+.|.|.|.. -||+.- -+...+.... T Consensus 162 ~vtp~~~~lL~~~~~f------~r~~~~~--~~~~~~i~~~V~~idG~~ii~vps~-~r~~t~-------~~f~~G~~~~ 225 (290) T protein:vir:78 162 YVSPDVMAALELSDDF------VRAINVQ--NIGPSSIETRITAIDGTRIVEVEAE-DRFYDT-------FDFTDGYKPA 225 (290) T ss_pred EECHHHHHHHhhChhh------hcccccc--ccccccccceeeeecCcEEEEeccc-chhhhh-------hhhccccccc Confidence 9999999999999853 3322111 1123356999999999999998742 376521 0111111111 Q ss_pred ccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEEeeeecC Q lcl|Aclame:pro 327 AAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 327 aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~idta~~~ 404 (404) ..+-..++||-...+.+|.-|+.-++.+ +=|.... +=++.+.- | .=|.+++++...+- T Consensus 226 --~~ak~in~ii~~~~a~i~~~K~~~~~~~------~P~~~~~-~d~~~~~~---r--------~y~d~~v~~nk~~~ 283 (290) T protein:vir:78 226 --AGAKKLNFLLVNKGSVVGGAKHASIYLH------APGSVGQ-GDGWLYQY---R--------VYHDIFVLDQQKDG 283 (290) T ss_pred --CCccceeEEEEcCCceeeeeeeeEEEee------CCCCCcC-cceeeeee---e--------eeeeeeeeccccCe Confidence 1133468999999999998775333222 1000000 00000000 0 12334444444444 No 59 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=94.11 E-value=0.0054 Score=33.04 Aligned_cols=275 Identities=13% Similarity=0.071 Sum_probs=139.6 Q ss_pred CC-----------CcCchHHHHHHHHHHHHHh--hccch-HHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCC Q lcl|Aclame:pro 1 MT-----------TVTSAQANKLYQVALFTAA--NRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQ 66 (404) Q Consensus 1 ~~-----------~~~~~~a~~~~~~~lft~~--~~~~~-~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~ 66 (404) +| +.|.-+-+ ---|++- .-|+- ..+++...|.......+.- ++...-++.+.. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s---------~~~~~N~~~e~~ 72 (329) T protein:vir:10 6 ITGVKTMNKEIKNATGKLKLN----LQHFANKSVEPGDTLLKNKHVGILEKVTAANSYS---------APAVISNDAIFM 72 (329) T ss_pred EechhhhhhhhhcccceeEEe----hhhhcCCccCCchhHHHHHHHHHHHHHHHhhcee---------eeeecccceeec Confidence 11 11111100 0113222 22222 3566666665444433321 111112334466 Q ss_pred CCcEEEEEEeeccccCceecCce-eee-ehhhhhhcccEEEEeeeccccccCcchhhhhhhhhH--HHHHHHHHHHHHHH Q lcl|Aclame:pro 67 AGDEVTFSIMHKLSKRPTMGDER-VEG-RGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNL--ASSARTLLGTYFND 142 (404) Q Consensus 67 ~Gd~v~f~L~~~L~G~gv~Gd~~-leG-nee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dl--r~~ar~~L~~w~~~ 142 (404) .|++|.++-+.-. ++ +|-. -.| +.++++....++.+||.|----.=..|+..-+...+ -........+.+.. T Consensus 73 ~g~tVkIp~i~~~---gl-~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~p 148 (329) T protein:vir:10 73 QGRSFTVIKGDVT---EL-KDYKRNATNEFDHPQIQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAP 148 (329) T ss_pred cCcEEEEeeeccc---cc-ccccCCCCccccccccceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhh Confidence 8999998876542 22 2222 122 345788899999999998321111235544444443 34455555666666 Q ss_pred HHHHHHHHHhhhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHh Q lcl|Aclame:pro 143 LQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDE 222 (404) Q Consensus 143 ~~D~~~~~~laG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~ 222 (404) .+|.-.|-.|++.-+ +..+..++++. .++.|+.+.+++++ T Consensus 149 EiDay~~skla~~a~--------------------------------------~~~~~~~t~~n--ay~~i~~a~~~Lde 188 (329) T protein:vir:10 149 YLDNLRFATLARNKA--------------------------------------KHLTVGSGADA--QYDAVLDVSVELDE 188 (329) T ss_pred HHHHHHHHHHHhhcc--------------------------------------cccccccCHHH--HHHHHHHHHHHHHh Confidence 677666655543221 00111233322 37888899999887 Q ss_pred cCCCCCceEecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCce Q lcl|Aclame:pro 223 MAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGM 302 (404) Q Consensus 223 ~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~ 302 (404) .+.| + . +||+++|..+.-|++++.+ .+. ..+....++.|.+|.+||+.|.+.|+. T Consensus 189 ~~vp------~--~-------Rvl~VtP~~~~~Lk~~~~f------~~~----~~~~~~~~~~g~Vg~idG~~Ii~vps~ 243 (329) T protein:vir:10 189 IGAG------A--S-------RILFVTPKFYKGIKKFVIE------LPQ----GDNRQQVLGKGVQGELDGFTIVKVPSK 243 (329) T ss_pred cCCC------C--C-------cEEEeCHHHHHHHHhhhhh------hcc----ccccccceeeeeeeeecCeEEEEecCC Confidence 6533 1 1 6899999999999998743 111 123456899999999999999987742 Q ss_pred eeeeccceeEEeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhcc Q lcl|Aclame:pro 303 PIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIR 382 (404) Q Consensus 303 ~irf~~~~~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~r 382 (404) |+ . ...+|+|......+--+...++.+- +-.+. ..+.+.| .. T Consensus 244 --~~---------------------k----~in~ii~~~~A~~~~~K~~~~~~~~-----p~~~~---~a~~v~g---r~ 285 (329) T protein:vir:10 244 --ML---------------------Q----GVEAMAVIGEVMASPIQANEAKLNS-----NVPGM---FGTLAEQ---ML 285 (329) T ss_pred --cc---------------------c----ceeEEEEcCCceeeeeeeeeeeeeC-----CCCcc---chheeee---ee Confidence 22 0 1356777766555544433222111 00111 1122222 11 Q ss_pred ccCCCCCceEEEEEEEeeeecC Q lcl|Aclame:pro 383 FPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 383 F~~~~g~~~DfGvi~idta~~~ 404 (404) | ||+.+++..++. T Consensus 286 y---------yd~~V~~~k~~~ 298 (329) T protein:vir:10 286 Y---------TGAFVPEHLQKY 298 (329) T ss_pred e---------eeeEEEccccCE Confidence 1 333333322222 No 60 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=92.81 E-value=0.0099 Score=31.57 Aligned_cols=209 Identities=13% Similarity=0.075 Sum_probs=99.3 Q ss_pred EeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhh-hcccccccccceeeccccccccccccCccCC Q lcl|Aclame:pro 106 INQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA-GARGDFVADDTILPTAEHPEFKKIMINDVLP 184 (404) Q Consensus 106 Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~la-G~rg~~~n~~~~~p~~~~~~~~~~~~N~v~a 184 (404) ||....+=-.=..+++..+..|+|.+.-.....=+++-.|+.++.++. +++.. .|....+.. T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~-------~p~~~~~~g---------- 63 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAA-------APVTGQDGG---------- 63 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc-------CcccccccC---------- Confidence 776654411113578889999999999999999999999999998876 33320 011000000 Q ss_pred CCCCceEeccCCccccccccccccC----HHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEchHHHHHHHh-- Q lcl|Aclame:pro 185 PTHDRHFFGGDATSFEQIEAADIFS----IGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYT-- 258 (404) Q Consensus 185 pt~~r~~~~~~at~~~~i~a~D~~s----~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~dLr~-- 258 (404) .+..+.++..-+ ++.|-.+...+++..-| .++ .++++.|.|++.|-+ T Consensus 64 -------------~~~~~~a~~t~~~~~l~dai~~a~~~LdekdVP-------~~g-------R~~vv~P~~y~~LL~~~ 116 (221) T protein:vir:17 64 -------------FSVNIGAGNTNNAQAIVDGFFEAAAVLDERSAP-------MDG-------RVAVLSPRQYYSLISSV 116 (221) T ss_pred -------------cceeccccccCCHHHHHHHHHHHHHHHhhcCCC-------CCC-------CEEEeCcHHHHHHHHhc Confidence 011122222122 34455555555554433 122 688999999999975 Q ss_pred CcchHHHHHHHHHhhhccccccCcceeC-CeEEEcCEEEEecCceeeeeccceeEEeecCcccccc----cccccccchh Q lcl|Aclame:pro 259 STSGKDWNQMMVRAVNRAKGFNHPLFKG-ECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATT----KEVAAATNID 333 (404) Q Consensus 259 d~~~~~w~~~qk~A~ar~~g~~nPlF~G-~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~----~~~aa~~~v~ 333 (404) |+-. .++... +..--+..| ++++++|+-|.+-+++|.. .++.+. ++....+.. +.+.....=. T Consensus 117 d~~~-------~n~d~~--~s~g~~~~g~~i~~v~G~~V~~SnnlP~~--~gt~~~-~~ag~~~~~~~~~~~yr~~fs~~ 184 (221) T protein:vir:17 117 DTNI-------LNREIG--NTQGDMNTGKGLYVNAGIRIYKSNVLASL--YGTNLV-TDPGDATTSGENNGSYRPAITDR 184 (221) T ss_pred Ccce-------eeeecc--cccccccccceeeeecCcEEEEeccCCcc--cccccc-cCCccccccccccccccccccce Confidence 4421 111111 112225556 6999999999999998742 222111 111111000 0111111111 Q ss_pred hheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCC Q lcl|Aclame:pro 334 RAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKS 387 (404) Q Consensus 334 ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~ 387 (404) -+|+.=..|++..=---+..|+-.+-..|-- |-+... T Consensus 185 ~glv~~~~Avgtvkl~~~~~~~~~~~~~~~~-----------------~~~~~~ 221 (221) T protein:vir:17 185 AGLVFHKEAADTVEVLLPPSRPPLVISMFSI-----------------RRPDRR 221 (221) T ss_pred EEEEEcchheeeeeeecCCCCCceeeeeeec-----------------cCCCCC Confidence 1344444444333110011222222111110 001100 No 61 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=92.18 E-value=0.013 Score=31.02 Aligned_cols=277 Identities=14% Similarity=0.070 Sum_probs=139.1 Q ss_pred hccch-HHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccccCceecCcee-ee---ehhh Q lcl|Aclame:pro 22 NRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERV-EG---RGED 96 (404) Q Consensus 22 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~l-eG---nee~ 96 (404) +.|.= ...+|...|.......+... .+ +..+.-|+ . .-|.+|.++= ++-.|. +|-.. .| +..+ T Consensus 1 Mantl~ya~~~~~~LD~~~~~~~~s~-~l-~~~~~~v~-~-----~ggktVkIp~---i~~~gl-~DY~R~~g~~~~~g~ 68 (312) T protein:vir:10 1 MANTLAYGQVLQQGLDKQATQELLTG-WM-DSNAKQIK-Y-----EGGKEVKIGK---LSTDGL-GDYSRGSANAYVGGD 68 (312) T ss_pred CCcchhHHHHHHHHHHHHHHhhhccc-cc-cCCCceEE-E-----ecCcEEEEEe---eecccc-cccccccCCcccccc Confidence 33222 34667777655555555433 22 22222232 2 3588999873 333444 33333 34 3346 Q ss_pred hhhcccEEEEeeec---cccccCcchhhhhh--hhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccceeecccc Q lcl|Aclame:pro 97 LSHADFSLKINQGR---HLVDAGGRMSQQRT--KFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEH 171 (404) Q Consensus 97 L~~~sd~v~Idq~R---~~V~~~gkms~qrs--~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~ 171 (404) ++....+..++|-| +.|| +|+..-| ...+-...+.-..+...-.+|.-.|-.|+..- T Consensus 69 v~~~~et~tl~qDR~~~F~vD---~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a--------------- 130 (312) T protein:vir:10 69 VKFEYETKTMTQDRGRKFTLD---AMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIA--------------- 130 (312) T ss_pred ccccceeEEeeecccceeecc---ccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhh--------------- Confidence 88888899999988 3444 3443322 22333333444444444445555554443111 Q ss_pred ccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEchH Q lcl|Aclame:pro 172 PEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPR 251 (404) Q Consensus 172 ~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~ 251 (404) ...... +..+...+++++.+ ++.|+.+.+++++.+-| + -+||+|+|. T Consensus 131 ----------~~~~~~------~~~~~~~~~T~~ni--~~~i~~~~~~lde~~vp-------~--------~rvl~vTp~ 177 (312) T protein:vir:10 131 ----------IGIKGD------TNVEYSYSVNSSTI--INKIKTGIKIIRENGYN-------G--------PLVCHLTYD 177 (312) T ss_pred ----------hccccc------cccccccccCHHHH--HHHHHHHHHHHHHccCC-------C--------ceEEEeChH Confidence 000000 00011112333332 45677778888875532 1 179999999 Q ss_pred HHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCccccc---cccccc Q lcl|Aclame:pro 252 QWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTAT---TKEVAA 328 (404) Q Consensus 252 q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~---~~~~aa 328 (404) -+.-|+++..+ .+. .. ...+....+.++.+|||.|++.|. -||+.--. ..+...++ +....+ T Consensus 178 ~~~lLk~~~~~-~~~-----~~----~~~~~~i~~~V~~iDgv~Ii~VPs--~r~~t~~~---f~dG~t~~~~~gg~~~~ 242 (312) T protein:vir:10 178 SMFAIEEKVLE-KLT-----AV----TFAQGGIQTQVPSIDGCALIKTPQ--NRMYSSIL---LNDGTTSNQTAGGYLKG 242 (312) T ss_pred HHHHHhhhhhc-eec-----cc----ccccceeeeeeeeecccEEEEchh--hhccceee---eccCcccccccCceeec Confidence 99888875321 111 11 123446799999999999999986 47752211 11111000 111112 Q ss_pred ccchhhheeeccceeEEEeeecCCCCcc----------eeecccccCchhHHHHHHHhchhhccccCCCCCceEEEEEEE Q lcl|Aclame:pro 329 ATNIDRAMLLGAQALANAYGQKAGGHFN----------MVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAV 398 (404) Q Consensus 329 ~~~v~ralLlGaQAl~~A~g~~~g~r~~----------w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~DfGvi~i 398 (404) ..+-..++||=...+.+|.-|+.-++.+ |-=+..- =|.++++ T Consensus 243 ~~ak~INfiiv~~~a~i~~~K~~~~~if~P~~~~~~d~~~~~~R~----------------------------Y~D~fv~ 294 (312) T protein:vir:10 243 TKALDTNFIIAPVDVPLAITKQDKMRIFDPETNQTANAWSMDYRR----------------------------YHDLWVT 294 (312) T ss_pred CcccccceEEeCCceeeceeeeeeeeeeCCCCCCCcceeeeeeee----------------------------eeeeeee Confidence 2345678999999999998875433332 2111111 1334444 Q ss_pred eeeecC Q lcl|Aclame:pro 399 DTAVKL 404 (404) Q Consensus 399 dta~~~ 404 (404) +...+. T Consensus 295 ~nk~~~ 300 (312) T protein:vir:10 295 DNKANS 300 (312) T ss_pred ccccCe Confidence 444443 No 62 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=91.83 E-value=0.014 Score=30.80 Aligned_cols=266 Identities=11% Similarity=0.116 Sum_probs=120.7 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~ 80 (404) |+-. . ..+|++.|+-..... +...... ++.+.-.++. .-|.+|.++=+.-.. T Consensus 1 Main---~-~~k~~~~ld~~~~~~------------------~~~~~l~-~~~n~~~~~~-----~gak~VkIp~ist~~ 52 (285) T protein:vir:79 1 MTVV---L-DSKDLARIDEEYKAD------------------SQVWSYL-TGGNGVTQRF-----RGHNEVRINKLSGFV 52 (285) T ss_pred Ccch---h-hHHHHHHHHHHHHHh------------------hhhhhhc-ccCCcceeEe-----cCCCEEEEeeecccc Confidence 4421 1 234455555333332 2211111 1111111111 236777776543222 Q ss_pred cCceecCceeee-ehhhhhhcccEEEEeeec---cccccCcchhhhhh-hhhHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 81 KRPTMGDERVEG-RGEDLSHADFSLKINQGR---HLVDAGGRMSQQRT-KFNLASSARTLLGTYFNDLQDQCAIVHLAGA 155 (404) Q Consensus 81 G~gv~Gd~~leG-nee~L~~~sd~v~Idq~R---~~V~~~gkms~qrs-~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~ 155 (404) |+.--++-.| +..+++....++.++|-| +-|| .|+..-+ ...+-...+....+ T Consensus 53 --gl~dY~R~~g~~~g~v~~~~et~tl~~DR~~~f~iD---~mDvdEn~~~~~~ni~~ef~~~----------------- 110 (285) T protein:vir:79 53 --DATAYKRGQDNARKTISVGKETVKLTHEDWFGYDLD---QFDMDENGAYTVENVVREHNKM----------------- 110 (285) T ss_pred --cccccccccCccccccceeeeEEEeeccccceeccc---ccchhhhhhhhHHHHHHHHHhh----------------- Confidence 2222222223 566677788888888888 3344 2221100 11111111111111 Q ss_pred ccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCc Q lcl|Aclame:pro 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (404) Q Consensus 156 rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~ 235 (404) -.+|.-++..|+..+.+. +.....+|+++.+ ++.|+.+.+++++.+-| T Consensus 111 --------~vvPEiDayrfskla~~a-------------~~~~~~~~T~~nv--~~~i~~~~~~lde~~vp--------- 158 (285) T protein:vir:79 111 --------ITIPHRDKVAVQKLFDSA-------------AKKATDSITKDNA--LDAYDTAEAYMFDNEVP--------- 158 (285) T ss_pred --------hhcchhhHHHHHHHHhhc-------------ccccccccCHHHH--HHHHHHHHHHHHHcCCC--------- Confidence 233444444444443221 1122334665553 78888898988886533 Q ss_pred cccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcc----eeCCeEEEcC-EEEEecCceeeeeccce Q lcl|Aclame:pro 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPL----FKGECAMWRN-ILVRKYAGMPIRFYQGS 310 (404) Q Consensus 236 ~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPl----F~G~~gm~ng-vii~~~~~~~irf~~~~ 310 (404) .+ +||+|+|.-+.-|+.++.+. +.- .-+-.. +.+.++.+|| |.|.+.|.- ||... T Consensus 159 -----~~-rvl~vTp~~~~~Lk~s~~~~------r~~-----~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~--r~kt~- 218 (285) T protein:vir:79 159 -----GG-FVMFVSSAYYTALKQSAAVT------RTF-----STDGTMVINGIDRRVAQLDGGVPIVRVSSD--RLKGL- 218 (285) T ss_pred -----Cc-eEEEEChHHHHHHHhhhhhh------eec-----ccccceeccceeeeeccccceeEEEEcchh--hccCc- Confidence 11 79999999999999987542 210 001112 4455799998 999998853 55211 Q ss_pred eEEeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCc Q lcl|Aclame:pro 311 KVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKM 390 (404) Q Consensus 311 ~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~ 390 (404) ..+. ..++||-...++++.-|+.-++.+ |=+...+.-. +.+.- | T Consensus 219 --------------~~~k----~Infiiv~~~a~i~~~K~~~~~~f------~P~~~~~~d~-~~~~~---R-------- 262 (285) T protein:vir:79 219 --------------GITN----HVNFILTPLSAIAPIVKYDSVSVI------DPSTDRSGNR-WTIKG---L-------- 262 (285) T ss_pred --------------Ccch----hccEEEecCceeccceeeeeeEeE------CCCCCCCcce-eeeee---e-------- Confidence 0011 367888888888887764222211 1111100000 00000 0 Q ss_pred eEEEEEEEeeeecC Q lcl|Aclame:pro 391 QDHGVIAVDTAVKL 404 (404) Q Consensus 391 ~DfGvi~idta~~~ 404 (404) .=|.++++|...+- T Consensus 263 ~Y~d~fv~~nk~~~ 276 (285) T protein:vir:79 263 SYYDAIVLDNAKKG 276 (285) T ss_pred eeeeeeehhhccce Confidence 01223333333332 No 63 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=88.44 E-value=0.032 Score=28.76 Aligned_cols=280 Identities=14% Similarity=0.064 Sum_probs=128.9 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhcc-ccccccCCCCCccEEEEecccCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSP-DKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) || .+. .++|+..|........ ......++..+.-+. . ..|.+|.++=+.-- T Consensus 1 Ma---------------------iny-a~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~-~-----~ggktVkIp~is~t 52 (346) T protein:vir:10 1 MT---------------------INY-AEKYQAAVQQAFYDGHLYSAELWNSPSNSIIK-F-----DGAKHIKVPRLEIT 52 (346) T ss_pred Cc---------------------chh-HHHHHHHHHHHHHhhhccchhhcccccccceE-e-----cCCCEEEEEEeeee Confidence 22 111 2344444433222221 111111221121111 1 24788887655311 Q ss_pred ccCceecCc-eeeee--hhhhhhcccEEEEeeecc---ccccCcchhhhhh--hhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 80 SKRPTMGDE-RVEGR--GEDLSHADFSLKINQGRH---LVDAGGRMSQQRT--KFNLASSARTLLGTYFNDLQDQCAIVH 151 (404) Q Consensus 80 ~G~gv~Gd~-~leGn--ee~L~~~sd~v~Idq~R~---~V~~~gkms~qrs--~~dlr~~ar~~L~~w~~~~~D~~~~~~ 151 (404) +| . +|- +-.|- ..+++....++.++|-|- .|| .|+..-| ...+-...+....+...-.+|.-.|-. T Consensus 53 sG--l-~DY~R~~g~~~~g~v~~~~et~tl~qDR~~~F~vD---~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfsk 126 (346) T protein:vir:10 53 SG--R-KDRQRRTITTPVANYSNDWDSYELKNERYWSTLVD---PSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSH 126 (346) T ss_pred cc--c-ccccccCCcccccccccceeEEEeeccccceeccc---ccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHH Confidence 22 1 122 11222 246788888889999883 343 2332111 112222222222223333344444444 Q ss_pred hhh-cccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCce Q lcl|Aclame:pro 152 LAG-ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPV 230 (404) Q Consensus 152 laG-~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv 230 (404) |+. +.+ .+ ++.....+++++. -++.|+.+.+++++.+-| T Consensus 127 La~~a~~---------------------~~-------------~~~~~~~a~T~~n--i~~~i~~~~~~lde~~vp---- 166 (346) T protein:vir:10 127 LYSGKEA---------------------AH-------------DGGITTNTLDEKN--ILPAFDNMMLDFDEARIP---- 166 (346) T ss_pred HHHhhhh---------------------hc-------------cccccccccCHHH--HHHHHHHHHHHHHHccCC---- Confidence 431 111 00 0111122243332 256777788888775533 Q ss_pred EecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccce Q lcl|Aclame:pro 231 RLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGS 310 (404) Q Consensus 231 ~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~ 310 (404) .+. +||+|+|.-+.-|+.++.+ ++.-... ..+ ...|.+|.+|||.|.+.|. -||+.- T Consensus 167 ---~~~-------rvl~vTp~~~~lLk~s~~f------~k~~~v~---~~~-~i~~~V~siDGv~Ii~VPs--~r~~t~- 223 (346) T protein:vir:10 167 ---STN-------RILYVTPKTNAILKRAEAM------NRALTLK---DPN-NIQRTVYSLDDVTIRVVPS--DLMQTA- 223 (346) T ss_pred ---CCC-------eEEEECHHHHHHHhhchhh------eeccccc---ccc-ccceeeeeecCeEEEEcch--hhcccc- Confidence 111 7999999999999998854 3322111 122 3599999999999999886 487621 Q ss_pred eEEeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCc Q lcl|Aclame:pro 311 KVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKM 390 (404) Q Consensus 311 ~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~ 390 (404) ..+.+ +... +..+-..++||-...+.+|.-|+..++.+ +=++. .-+ .+.+.- |. T Consensus 224 --~~f~~----G~~~--~t~ak~INfiiv~~~A~ia~~K~~~~~if------~P~~~-~~g-~~l~~~---R~------- 277 (346) T protein:vir:10 224 --YDFSD----GSKI--IDTAKQIEMFLIYNGVQIAPEKYSFVGFD------QPSAA-TSG-NYLYYE---QS------- 277 (346) T ss_pred --hhhcc----Cccc--cCCccceeEEEECCceeeeeeeeeeeEee------CCCCC-ccc-ceeeee---ee------- Confidence 11111 1111 11233568999999999998875444332 11110 000 111110 11 Q ss_pred eEEEEEEEeeeecC Q lcl|Aclame:pro 391 QDHGVIAVDTAVKL 404 (404) Q Consensus 391 ~DfGvi~idta~~~ 404 (404) =|.+++++...+- T Consensus 278 -Y~D~fv~~nk~~~ 290 (346) T protein:vir:10 278 -YDDVLLLNTKTKG 290 (346) T ss_pred -eeeeeeeccccce Confidence 1334444444443 No 64 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=81.13 E-value=0.088 Score=26.36 Aligned_cols=300 Identities=10% Similarity=0.025 Sum_probs=121.1 Q ss_pred CC-CcCc---hHHHHHHHHHHH-HHhhccch-HH-HHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEE Q lcl|Aclame:pro 1 MT-TVTS---AQANKLYQVALF-TAANRNRS-MV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTF 73 (404) Q Consensus 1 ~~-~~~~---~~a~~~~~~~lf-t~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f 73 (404) +. ..+. +.-.+....+.. +.+..... ++ ..+....... .-...+||..+-......|+...+ T Consensus 229 ~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~-----------~~~~~~~l~~~~~~~~~~g~~~~~ 297 (543) T protein:vir:81 229 ARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIIT-----------SNGSLNDIRRFARQVVATGDVWHG 297 (543) T ss_pred HHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHH-----------HHhhhchhhhhcccccCCcceEEE Confidence 00 0000 000000000000 00000000 00 0111110000 001112232222222233432222 Q ss_pred EEeeccccCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 74 SIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA 153 (404) Q Consensus 74 ~L~~~L~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~la 153 (404) -....-...+|...... .+..++|..-++.+.....-|.+...+-+ -+ .||-..-...|.+-+....|+.+| . T Consensus 298 ~~~~~~~a~~v~Eg~~~--~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~-~~~~~~i~~~l~~~~~~~~d~ail---~ 370 (543) T protein:vir:81 298 VSSAAVQWSWDAEFEEV--SDDSPEFGQPEIPVKKAQGFVPISIEALQ-DE-ANVTETVALLFAEGKDELEAVTLT---T 370 (543) T ss_pred EecCCcceeecccCccc--cccccccceeeeeeeeeEeeehhhHHHHh-cc-HHHHHHHHHHHHHHHHHHHHHHHh---c Confidence 21111122344333333 25567777777777777777776665553 34 699999999999999999999886 2 Q ss_pred hcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEec Q lcl|Aclame:pro 154 GARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLS 233 (404) Q Consensus 154 G~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~ 233 (404) | +.++ ....+++.+.. +.+....-.+++.++++.+.++........ T Consensus 371 G---~Gt~----------~~p~Gi~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~--------- 416 (543) T protein:vir:81 371 G---TGQG----------NQPTGIVTALA------------GTAAEIAPVTAETFALADVYAVYEQLAARH--------- 416 (543) T ss_pred c---CCCC----------cccccchhhcc------------cccccccccccccccHHHHHHHHHhhhccc--------- Confidence 2 1111 11222221100 000010111334556666555544432110 Q ss_pred CccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcce----eCCeEEEcCEEEEecCceeeeeccc Q lcl|Aclame:pro 234 GDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLF----KGECAMWRNILVRKYAGMPIRFYQG 309 (404) Q Consensus 234 g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF----~G~~gm~ngvii~~~~~~~irf~~~ 309 (404) ... -+++|||.-+..|++=.+ +..+||| .|..+++.|.+++....+|..- T Consensus 417 ~~~-------~~~v~n~~~~~~l~~lkd----------------~~G~~l~~~~~~g~~~~l~G~pv~~~~~~~~~~--- 470 (543) T protein:vir:81 417 RRQ-------GAWLANNLIYNKIRQFDT----------------QGGAGLWTTIGNGEPSQLLGRPVGEAEAMDANW--- 470 (543) T ss_pred cCC-------cEEEEcHHHHHHHHHhhc----------------CCCceeccCcCCCCCccccceeeEEeccccccc--- Confidence 011 368899988888875211 1124454 4667788888887766554210 Q ss_pred eeEEeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeeccc-ccC---chhHHHHHHHhchhhccccC Q lcl|Aclame:pro 310 SKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKT-DMD---NRTEIAISWINGLKKIRFPE 385 (404) Q Consensus 310 ~~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~-D~g---~~~~i~i~~i~G~~K~rF~~ 385 (404) ....+++ .-.+++|-=.. +.++-..++...+..+.+ ++. +.+.+-+...+|++. T Consensus 471 -------------~~~~~~~---~~~i~~gd~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v----- 528 (543) T protein:vir:81 471 -------------NTSASAD---NFVLLYGNFQN-YVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADV----- 528 (543) T ss_pred -------------cccccCC---cceEEEeeccc-eeEEeecccEEEEeccccccchhhcCceEEEEEEeeccEe----- Confidence 0001111 11355554221 222322344444443321 111 111222222222211 Q ss_pred CCCCceEEEEEEEeeee Q lcl|Aclame:pro 386 KSGKMQDHGVIAVDTAV 402 (404) Q Consensus 386 ~~g~~~DfGvi~idta~ 402 (404) -+.+=|-++.+-|+| T Consensus 529 --~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 529 --VNPNAFRLLNVETAS 543 (543) T ss_pred --ecccceEEEEecccC Confidence 122346666666666 No 65 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=74.39 E-value=0.16 Score=24.96 Aligned_cols=289 Identities=15% Similarity=0.041 Sum_probs=125.9 Q ss_pred hccch--H-HHHHHhhhhhhhhhccccccccCCCCCccEEEEe--cc-cCCCCcEEEEEEeeccccCceecCceeeeehh Q lcl|Aclame:pro 22 NRNRS--M-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRIT--DL-NKQAGDEVTFSIMHKLSKRPTMGDERVEGRGE 95 (404) Q Consensus 22 ~~~~~--~-~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~--dL-~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGnee 95 (404) +.|+= . -++|+..+...-.+..-+... +.|-- |. ..+.||+|++.......-.-....+...-+.+ T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~l--------V~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~ 72 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKT--------VDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKN 72 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchh--------cccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCcccc Confidence 33331 1 255655543332222211111 11111 11 24679999999876653221111111122346 Q ss_pred hhhhcccEEEEeeecc-ccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccceeeccccccc Q lcl|Aclame:pro 96 DLSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEF 174 (404) Q Consensus 96 ~L~~~sd~v~Idq~R~-~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~~~~ 174 (404) .+.-.+-.|.||+..+ ++...++ ++--..-||.+..++.... +++..|+.++..+..... T Consensus 73 ~~~e~~v~l~id~~k~~a~~v~d~-e~~l~i~~~~~~l~~a~~a-la~~vd~~l~~~l~~~a~----------------- 133 (423) T protein:vir:35 73 GLFSAKATGKVGKYITVAVEWTQI-EEALKLNQLDQILSPIHER-MVTDLETELAHFMMNNGA----------------- 133 (423) T ss_pred ccccceeeEEeccceeccceeCHH-HHHhhHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccc----------------- Confidence 6666677899999987 6676653 2222555676666666543 556677777644421110 Q ss_pred cccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEchHHHH Q lcl|Aclame:pro 175 KKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWN 254 (404) Q Consensus 175 ~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~ 254 (404) |.+-.|. +..-.++.|-.+...+.+.+-|- .+ +.+++.|+-+. T Consensus 134 -----~~vgt~~------------------t~~~~~~~i~~a~~~Ld~~~vP~-------~~-------R~~Vv~p~~~a 176 (423) T protein:vir:35 134 -----LSLGSPN------------------TAIKKWADVAQTASFIKDIGIKT-------GE-------NYAIMDPWSAQ 176 (423) T ss_pred -----ccccccc------------------CCcchHHHHHHHHHHHHHhcCCc-------CC-------CEEEeCHHHHH Confidence 1000010 00112677778888888776552 11 58899999998 Q ss_pred HHHhCcchHHHHHHHHHhhhccccccCcceeCCe-EEEcCEEEEecCceeeeeccceeEEeecCcccccccccccccchh Q lcl|Aclame:pro 255 DWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGEC-AMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNID 333 (404) Q Consensus 255 dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~-gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~~~~aa~~~v~ 333 (404) .|..+... .. + +. .+...-|=.|.+ |.+.|+-+.+..++|..- ++.-... . ....+..+. T Consensus 177 ~Ll~~~~~--~~---~-~~---~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T-~gt~~~~--------~-~v~~a~~v~ 237 (423) T protein:vir:35 177 RLADAQSG--LH---A-AD---QLVRTAWENAQISGNFGGIRALMSNGLASRK-QGDFDGA--------I-TVKTAPNVD 237 (423) T ss_pred HHhccccc--ee---c-cc---cchhHHHhhccceeeecceEEEEcCCCcccc-ccccccc--------e-eeccccccc Confidence 88865421 11 1 11 122344677776 999999999988876421 1100000 0 000000000 Q ss_pred hheeeccceeEEEeeecCCCCcceeec--ccccCchhHHHHHHHhchh------hccccC-CCCCceEEEEEE------- Q lcl|Aclame:pro 334 RAMLLGAQALANAYGQKAGGHFNMVEK--KTDMDNRTEIAISWINGLK------KIRFPE-KSGKMQDHGVIA------- 397 (404) Q Consensus 334 ralLlGaQAl~~A~g~~~g~r~~w~Ee--~~D~g~~~~i~i~~i~G~~------K~rF~~-~~g~~~DfGvi~------- 397 (404) -.-.-|.++-.. +..-.|..- ..--|+ +-.|-|++ |-++.+ +.+..+-|-|.. T Consensus 238 ~~a~~~~~~~~~------~~~~~~~~~~g~l~~GD-----~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~ 306 (423) T protein:vir:35 238 YLSVKDSYQFTV------ALTGATPSKTGFLKAGD-----QLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTAS 306 (423) T ss_pred ccccccccccee------eeeeeeeccCCcEEecc-----eEEeeeeeeccccccceeecccCCceeEEEEecccccccc Confidence 000001110000 001112111 011122 11333432 222211 111222333320 Q ss_pred EeeeecC Q lcl|Aclame:pro 398 VDTAVKL 404 (404) Q Consensus 398 idta~~~ 404 (404) =.+.++| T Consensus 307 g~~~v~i 313 (423) T protein:vir:35 307 GDVTVKL 313 (423) T ss_pred CceeEEc Confidence 0223333 No 66 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=72.61 E-value=0.18 Score=24.66 Aligned_cols=291 Identities=13% Similarity=0.051 Sum_probs=117.1 Q ss_pred HHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccccC-----ceecCce Q lcl|Aclame:pro 15 VALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKR-----PTMGDER 89 (404) Q Consensus 15 ~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~-----gv~Gd~~ 89 (404) -+||-..=-|..+...+.+++.-.. ..++ ..++..|+.-+++. .||-|.+++..+|.|. .+.++.. T Consensus 1 m~lsD~~vfN~~~~~a~~e~~~q~~------~~fn-~as~gai~l~~~~~--~Gd~~~~pf~~~l~g~~~~~~~~~~~~~ 71 (325) T protein:vir:95 1 MALSDLAVYSEYAYSAFSETLRQQV------DLFN-TATGGAIMLQSAAH--QGDFSDVAFFAKVTGGLVRRRNAYGSGT 71 (325) T ss_pred CchhhhhhhhhhhhhhhhhhhhhhH------hhhh-hcccceeEeccccc--cCceeeccccccccccccccccCCCCce Confidence 2232222112222222222211111 1122 23445666555554 3999999999999773 3444444 Q ss_pred eeeehhhhhhcccEEEE-eeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccceeec Q lcl|Aclame:pro 90 VEGRGEDLSHADFSLKI-NQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPT 168 (404) Q Consensus 90 leGnee~L~~~sd~v~I-dq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~ 168 (404) ++. ..|....+.-++ -..+-++.. ..++...-.|-...+...+++.++++..+..+.++-|....-.. T Consensus 72 vt~--~kitt~~~~av~~~r~~g~~~~--d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~------- 140 (325) T protein:vir:95 72 VAE--KVLKHLVDTSVKVAAGTPPVRL--DPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALS------- 140 (325) T ss_pred ecc--ceeccccceeeEEecccCcccc--cHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc------- Confidence 442 334433332222 111111111 11221222222334444566666666655555554322210000 Q ss_pred cccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEE Q lcl|Aclame:pro 169 AEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYV 248 (404) Q Consensus 169 ~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l 248 (404) ....+++++++- +. .++..+|.+.+-+|..++= |. .+.+=.++| T Consensus 141 -----~~~~~v~dis~~-----------~~----~~~~~~s~~~l~~A~~klG-------------D~---~~~l~~~~M 184 (325) T protein:vir:95 141 -----QVSDVVYDATAN-----------TD----AADKLPTWNNLNNGQAKFG-------------DQ---SSQIAAWIM 184 (325) T ss_pred -----ccccceeeeecc-----------cC----cccccccHHHHHHHHHHhc-------------cc---ccceeEEEE Confidence 000112222220 00 1234568877766665541 11 123468999 Q ss_pred chHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecCccccccccccc Q lcl|Aclame:pro 249 TPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAA 328 (404) Q Consensus 249 ~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~~~~aa 328 (404) |+.-+++|+++.-.. +..+..+ .... .=|-|.|-..++++. +|. ...++ T Consensus 185 HS~v~~~L~~~~L~~-~~~~~~~--~g~~--~i~t~~G~~VIVdD~-------~p~------------------~~~g~- 233 (325) T protein:vir:95 185 HSTPMHKLYGSNLTN-GERLFTY--GTVN--VVRDPFGKLLVMTDS-------PNL------------------FAAGT- 233 (325) T ss_pred chHHHHHHHHhhccc-ccccccc--CCcc--cccccCCcEEEEeCC-------CCC------------------CCccC- Confidence 999999999864221 1111111 0100 012233333333321 111 11111 Q ss_pred ccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHH----HHhchhhccccCCCCCceEEEEEEEeeeecC Q lcl|Aclame:pro 329 ATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAIS----WINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 329 ~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~----~i~G~~K~rF~~~~g~~~DfGvi~idta~~~ 404 (404) ..+-+.++||..|.++..+++.+. ...+.+-..+.+.... .+++.+=.+|....+.. . +|-+.| T Consensus 234 -~~~ytty~lg~GAi~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~tf~lhp~G~sw~~s~~g~------s-Pt~aeL 301 (325) T protein:vir:95 234 -PNVYHILGLVPGGVLIGQNNDFDA----NEETKNGDENIIRTYQAEWSYNIGVKGFAWDKANGGK------S-PTDAAL 301 (325) T ss_pred -ceeEEEEEEecCeEEecCCCCccc----cccccCcccceeeeeeeeeeEEeecceeeeecccccC------C-cChHhh Confidence 124467999988866554443221 1112222222332222 33455555563332110 0 222333 No 67 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=72.58 E-value=0.18 Score=24.65 Aligned_cols=295 Identities=14% Similarity=0.080 Sum_probs=121.7 Q ss_pred hccch---HHHHHHhhhhhhhhhccccccccCCCCCccEEEEecc-cCCCCcEEEEEEeeccccCceecCceeeee-hhh Q lcl|Aclame:pro 22 NRNRS---MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDL-NKQAGDEVTFSIMHKLSKRPTMGDERVEGR-GED 96 (404) Q Consensus 22 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL-~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGn-ee~ 96 (404) +-|+= .-++|+..+...-.+..-+....-++.. .|. ....||+|+++.-....-.-..+. .+.++ .++ T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~------~ef~~ak~GDTV~I~~P~~~~~~d~~~~-~~t~~~~~~ 73 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLL------AGEINSSTGDSVSFKRPHQFKSERTMDG-DITGKSKNS 73 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCC------ccccccccCCEEEEeeCCceeeecccCc-ccCcccccc Confidence 22111 1345554432222222111111100000 011 135799999977665532221111 12232 245 Q ss_pred hhhcccEEEEeeecc-ccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccceeecccccccc Q lcl|Aclame:pro 97 LSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFK 175 (404) Q Consensus 97 L~~~sd~v~Idq~R~-~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~~~~~ 175 (404) |.-.+-.|.|||..+ ++....+ +..-...||-+ .-+....=+++..|+.+...+..... T Consensus 74 l~e~~v~l~id~~k~~a~~v~d~-E~~l~i~~~~~-~l~~A~~aLA~~vd~~ia~~~~~~~~------------------ 133 (423) T protein:vir:10 74 LISAKATGEVGNYITVAVEYRQI-EEALKLNQLDQ-ILVPINERMVTDLETELALFMMKHGA------------------ 133 (423) T ss_pred cccceEEEEecceeeeeeeeChH-HHhcChhHHHH-HHHHHHHHHHHHHHHHHHHHhhhccc------------------ Confidence 555667899999987 5666542 22234555633 33333445666677766544433221 Q ss_pred ccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEchHHHHH Q lcl|Aclame:pro 176 KIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWND 255 (404) Q Consensus 176 ~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~d 255 (404) |.+..|.. .+ + .++.+-.+..++.+.+-|- + + +.+++.|+-+.. T Consensus 134 ----~~vgt~~t-------------~~---~--a~~~~a~a~~~L~~~~vP~------~-~-------R~~Vv~p~~~a~ 177 (423) T protein:vir:10 134 ----LSLGSPNT-------------PI---K--KWSDVAQTASFLKDLGINS------G-E-------NYAVMDPWAAQR 177 (423) T ss_pred ----cccccccc-------------cc---c--cHHHHHHHHHHHhhccCCc------C-C-------CEEEeCHHHHHH Confidence 11111110 01 1 2456667777777766551 1 1 577999999999 Q ss_pred HHh-CcchHHHHHHHHHhhhccccccCcceeCCe-EEEcCEEEEecCceeeeeccceeEEeecCcccccccccccc--cc Q lcl|Aclame:pro 256 WYT-STSGKDWNQMMVRAVNRAKGFNHPLFKGEC-AMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAA--TN 331 (404) Q Consensus 256 Lr~-d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~-gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~~~~aa~--~~ 331 (404) |.. ++.+. ++.+ +..-.|=.|.+ |.+.|+-+.+..++|..- +++...... ..+.....+++. .. T Consensus 178 Ll~~~~~~~-------~~~~---~~~~alr~~~i~G~~~GFdi~~Sn~vp~~T-~g~~~ga~~-~~~~~~vt~a~~~~~~ 245 (423) T protein:vir:10 178 LADAQSGLH-------VSEQ---LVRTAWENAQISGNFGGIRALMSNGLASRT-QGAFGGKLT-VKGTPEVNYDSVKDSY 245 (423) T ss_pred Hhhhhhhhc-------cccc---cchHHHHhcccceeecceEEEEecCCcccc-cccccceee-eeeeeEEEeccccccc Confidence 865 43211 1111 12244656665 899999998877765321 110000000 000000000000 11 Q ss_pred hhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccc-cCCCCCceEEEEEE-------Eeeeec Q lcl|Aclame:pro 332 IDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRF-PEKSGKMQDHGVIA-------VDTAVK 403 (404) Q Consensus 332 v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF-~~~~g~~~DfGvi~-------idta~~ 403 (404) +-++-.+++-+-.-++-+.+- .|.+- ++.++.=+.|-++ +...+..+-|-|.. =++.++ T Consensus 246 ~~~~~~~~~T~s~~g~l~~GD-~~t~a------------Gv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~~a~~~~tv~ 312 (423) T protein:vir:10 246 AFTATLTGATASKKGFLKVGD-QLQFD------------DTHWLNQQSKQTLYNGASALSFTATVMEDANAHSSGDVTVK 312 (423) T ss_pred ccccceeeccceeceeEEecc-eEeec------------ceeeecccccceeecccCCcceEEEEEecccccccCceEEE Confidence 223333443332222222111 01100 0011112223222 22233344454421 022233 Q ss_pred C Q lcl|Aclame:pro 404 L 404 (404) Q Consensus 404 ~ 404 (404) | T Consensus 313 i 313 (423) T protein:vir:10 313 I 313 (423) T ss_pred e Confidence 3 No 68 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=69.85 E-value=0.22 Score=24.22 Aligned_cols=291 Identities=8% Similarity=-0.025 Sum_probs=118.7 Q ss_pred HHHHHHhhccch--HHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEe-eccccCceecCceee Q lcl|Aclame:pro 15 VALFTAANRNRS--MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIM-HKLSKRPTMGDERVE 91 (404) Q Consensus 15 ~~lft~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~-~~L~G~gv~Gd~~le 91 (404) -|..+..+.-=| ....+.+.+.. . ++|..+-....-.+.++++... ......+|-+.+.. T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~----~------------s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~- 63 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKG----H------------SSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGKK- 63 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHh----h------------chhhhhcceeecCCCceEEEEEecCcceEEeecCccc- Confidence 112222211111 22222222111 1 1111111111112223444332 12223344333332 Q ss_pred eehhhhhhcccEEEEeeeccccccCcchhhh--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccceeecc Q lcl|Aclame:pro 92 GRGEDLSHADFSLKINQGRHLVDAGGRMSQQ--RTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTA 169 (404) Q Consensus 92 Gnee~L~~~sd~v~Idq~R~~V~~~gkms~q--rs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~~ 169 (404) .+..++|.+-++.+-....-+....++=+| -+.++|.+..++.|++-+.+.+|+.+|.-.-...+ .+ T Consensus 64 -~~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g--~~-------- 132 (303) T protein:vir:97 64 -THGGLSLEPVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTK--KA-------- 132 (303) T ss_pred -cccccceeeEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCc--cc-------- Confidence 255677766666666666655544332221 45788999999999999999999998722100111 00 Q ss_pred ccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEc Q lcl|Aclame:pro 170 EHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVT 249 (404) Q Consensus 170 ~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~ 249 (404) ..+... . .+.+..+.....++++. +.+.|.++........ .+. =.++|| T Consensus 133 ~~~~~~-----------~---~~~~~~~~~~~~~~~~~-~~~~i~~~~~~~~~~~---------~~~-------~~~vmn 181 (303) T protein:vir:97 133 SDVIGT-----------N---HFDSKVTQVVKFTESED-ADANIEAAVNLIQGAE---------GVV-------TGLAMD 181 (303) T ss_pred cccccc-----------c---ccccccccccccccccc-hHHHHHHHHHHHhhcC---------CCc-------cEEEEc Confidence 000000 0 00111111111222222 3455555554443211 011 148889 Q ss_pred hHHHHHHHhCcchHHHHHHHHHhhhccccccCccee------CCeEEEcCEEEEecCceeeeeccceeEEeecCcccccc Q lcl|Aclame:pro 250 PRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK------GECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATT 323 (404) Q Consensus 250 P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~------G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~ 323 (404) |..+..|++=.+ ...+|||. +..+.+.|.+++.-..+|- . T Consensus 182 ~~~~~~L~~lkd----------------~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~------------------~ 227 (303) T protein:vir:97 182 TEFSTALAKVTN----------------GEMGPKMYPELAWGANPDSINGLKSSVNTTVGA------------------G 227 (303) T ss_pred HHHHHHHHHhhc----------------cCCCeEEecCccCCCCCceecceeeEEecccCC------------------c Confidence 999998875211 11255553 3345777887766554431 0 Q ss_pred cccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCC-CCCceEEEEEEEeeee Q lcl|Aclame:pro 324 KEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEK-SGKMQDHGVIAVDTAV 402 (404) Q Consensus 324 ~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~-~g~~~DfGvi~idta~ 402 (404) ...+.. .-.+++|-=.-++.|+-..+....+.++..+-+. ++.. +..++--.|.... +...-+-..|+.-+-+ T Consensus 228 ~~~~~~---~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~--~~~~-~~~n~~~~r~~~r~~~~v~~p~af~~l~~~ 301 (303) T protein:vir:97 228 ADEAES---KDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNS--GKDL-KGYNQIYLRAEAYIGWGILDAKSFARVTKG 301 (303) T ss_pred cccCCC---ccEEEEeeccccEEEEEecCcEEEEeeccCCCCc--chhh-hhcCcEEEEEEEEeccEeecccceEEeeCC Confidence 000000 0236777644455565545555555543222111 1111 1222222222111 0111122233344444 Q ss_pred cC Q lcl|Aclame:pro 403 KL 404 (404) Q Consensus 403 ~~ 404 (404) |+ T Consensus 302 ~~ 303 (303) T protein:vir:97 302 EV 303 (303) T ss_pred CC Confidence 44 No 69 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=69.69 E-value=0.22 Score=24.20 Aligned_cols=294 Identities=13% Similarity=0.040 Sum_probs=126.7 Q ss_pred hccch-H--HHHHHhhhhhhhhhccccccccCCCCCccEEEEe--ccc-CCCCcEEEEEEeeccccCceecCceeeeehh Q lcl|Aclame:pro 22 NRNRS-M--VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRIT--DLN-KQAGDEVTFSIMHKLSKRPTMGDERVEGRGE 95 (404) Q Consensus 22 ~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~--dL~-k~~Gd~v~f~L~~~L~G~gv~Gd~~leGnee 95 (404) +-|+= . -++|+..+...-.+..-+... |.|-- |.. ...||+|++.......-.-..+..--.-+-+ T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~l--------Vnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~ 72 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKT--------VDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKN 72 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchh--------hcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccC Confidence 33331 1 255655443322222221111 11101 111 2479999998777654333322211112467 Q ss_pred hhhhcccEEEEeeecc-ccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccceeeccccccc Q lcl|Aclame:pro 96 DLSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEF 174 (404) Q Consensus 96 ~L~~~sd~v~Idq~R~-~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~~~~ 174 (404) +|.-.+-.|.||+..| ++....+ +.....-||-+..+ ....=+++..|+.++..+.+... T Consensus 73 dl~e~~v~l~id~~k~va~~v~d~-E~~~~i~~~~~~l~-~A~~aLA~~vd~~ia~~~~~~~~----------------- 133 (423) T protein:vir:10 73 NLISGKATGRVGNYITVAVEYQQL-EEAIKLNQLEEILA-PVRQRIVTDLETELAHFMMNNGA----------------- 133 (423) T ss_pred ccccceeEEEeeceeeeeeeechH-HHhcChhhHHHHHH-HHHHHHHHHHHHHHHHHHhhccc----------------- Confidence 8888888999999987 5666542 22223344522222 22344666677766543322211 Q ss_pred cccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEchHHHH Q lcl|Aclame:pro 175 KKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWN 254 (404) Q Consensus 175 ~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~ 254 (404) |.+..|. + .. -.++.+-.+..++.+.+-|- .+ +.+++.|+-+. T Consensus 134 -----~~~gt~~----------t---~~-----~a~~~i~~a~~~Ld~~~vP~-------~~-------R~~Vv~p~~~a 176 (423) T protein:vir:10 134 -----LSLGSPN----------T---PI-----TKWSDVAQTASFLKDLGVNE-------GE-------NYAVMDPWSAQ 176 (423) T ss_pred -----cccccCC----------c---cc-----chHHHHHHHHHHHHhccCCc-------CC-------CEEEeChHHHH Confidence 0000000 0 00 12566667778887766551 11 56799999999 Q ss_pred HHHhCcchHHHHHHHHHhhhccccccCcceeCCe-EEEcCEEEEecCceeeeeccceeEEeecCcccccccccccccchh Q lcl|Aclame:pro 255 DWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGEC-AMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNID 333 (404) Q Consensus 255 dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~-gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~~~~aa~~~v~ 333 (404) .|..+... .. ++. .+...-|=.|.+ |.+.|+-+.+..++|-. ..+.... +.....+..|. T Consensus 177 ~Ll~~~~~--~~----~~~---~~~~~alr~g~i~G~i~GFdv~~Snnip~~-T~gt~~~---------t~~~~~~~~v~ 237 (423) T protein:vir:10 177 RLADAQTG--LH----ASD---QLVRTAWENAQIPTNFGGIRALMSNGLASR-TQGAFGG---------TLTVKTQPTVT 237 (423) T ss_pred HHhccccc--ee----ccc---ccchhhhhhccceeeecceEEEEeCCCccc-ccccccc---------ceeeeecceec Confidence 98876421 11 111 122344666776 89999999998877631 1111000 00000011111 Q ss_pred hheeeccceeEEEeeecCCCCcceee--cccccCchhHH-HHHHHhchhhccc-cCCCCCceEEEEEEE-------eeee Q lcl|Aclame:pro 334 RAMLLGAQALANAYGQKAGGHFNMVE--KKTDMDNRTEI-AISWINGLKKIRF-PEKSGKMQDHGVIAV-------DTAV 402 (404) Q Consensus 334 ralLlGaQAl~~A~g~~~g~r~~w~E--e~~D~g~~~~i-~i~~i~G~~K~rF-~~~~g~~~DfGvi~i-------dta~ 402 (404) -+.--|++..-+... -.|.. -..--|+..-+ ++.++.=+.|-.+ +.+.+..+-|-|.+- ++.+ T Consensus 238 ~~a~~~a~~~~~~~~------~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g~~tv 311 (423) T protein:vir:10 238 YNAVKDSYQFTVTLT------GATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSGGDVTV 311 (423) T ss_pred cccccccceeeeeee------eccccccCceeecceEEecceeeecccccccccccccCcceEEEEEeeeeeccCCceee Confidence 111111111111100 01110 00111221100 1111112223222 223344566666542 2345 Q ss_pred cC Q lcl|Aclame:pro 403 KL 404 (404) Q Consensus 403 ~~ 404 (404) +| T Consensus 312 ~i 313 (423) T protein:vir:10 312 TL 313 (423) T ss_pred ec Confidence 44 No 70 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=68.94 E-value=0.23 Score=24.08 Aligned_cols=287 Identities=10% Similarity=0.023 Sum_probs=126.5 Q ss_pred CcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccccC Q lcl|Aclame:pro 3 TVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKR 82 (404) Q Consensus 3 ~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~ 82 (404) --+|+.-|...-|-.| ...|.......+. ...+ ...+.-+ + ..|.+|.++=+. ..|. T Consensus 1 ~~~~an~mAlnya~~~-------------~~~Ld~~~~~~~~-t~~l-~~~~~~~--~-----~Gak~VkIp~i~-~~gl 57 (311) T protein:vir:99 1 MPTDAETRGFNYVTKD-------------GNLLDQKITAGLF-TAAL-GTPEVDL--V-----NGGRSFTLKTIS-TSGL 57 (311) T ss_pred CCCcchhhHHHHHHHH-------------HHHHHHHHHhhhc-ccce-ecCchhe--e-----ecCCEEEEEeee-eccc Confidence 1234444444334444 3333222222221 1112 1122111 1 347788876555 2322 Q ss_pred --ceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhh--hhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcc-c Q lcl|Aclame:pro 83 --PTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRT--KFNLASSARTLLGTYFNDLQDQCAIVHLAGAR-G 157 (404) Q Consensus 83 --gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs--~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~r-g 157 (404) -.. +. -.+..+++....+..++|-|----.=.+|+..-| .+.+-...+.-..+...=.+|.-.|-.|+..- + T Consensus 58 ~dY~R-~~--g~~~g~v~~~~et~tl~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~ 134 (311) T protein:vir:99 58 KDHTR-GK--GFNSGTISDEKTIYTMGQDRDVEFYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDN 134 (311) T ss_pred ccccc-cc--CccccceeeeeeEEEeeeccceeeecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhc Confidence 111 11 1234566777888888888732111122332211 12222233333333333345555555554211 1 Q ss_pred ccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccc Q lcl|Aclame:pro 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~ 237 (404) .... +....++ .......+.|+++.++ +.|+.+...++.- | .+ T Consensus 135 ~~~~------------------~~~~~~~------~~~~~~~~~lt~~nvl--~~l~~~~~~~~~v--~-------~~-- 177 (311) T protein:vir:99 135 LDGT------------------DTEGTLL------AKTHKTEETLDETNAY--SQLKTGIGKVRKY--G-------TQ-- 177 (311) T ss_pred cccc------------------ccchhhh------ccccccccccCHHHHH--HHHHHHHHHHHhc--C-------CC-- Confidence 0000 0000000 0011223345555443 4455555555541 1 11 Q ss_pred cCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEec-CceeeeeccceeEEeec Q lcl|Aclame:pro 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKY-AGMPIRFYQGSKVLVSE 316 (404) Q Consensus 238 ~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~-~~~~irf~~~~~~~~~~ 316 (404) + +||||+|.-+.-|+.++.+. +..... .....-+.+.++.+|||.|.|. |. -||+..-. +. T Consensus 178 ----~-rvl~vTp~~~~lLk~~~~~~------r~~~~~--~~~~~~i~~~V~~lDgv~Ii~V~ps--~r~~t~~~---ft 239 (311) T protein:vir:99 178 ----N-LVGYVSSEVMDALERSKEFT------RNITNQ--NVGTTALESRITSIDGVQLIEVYES--NRFMTKYD---FT 239 (311) T ss_pred ----C-eEEEEChHHHHHHhhchhhh------eeeecc--cccccccccccceecCeEEEEecCc--hhhcchhh---hc Confidence 1 89999999999999887543 211111 1112246888999999999998 75 47652200 11 Q ss_pred CcccccccccccccchhhheeeccceeEEEeeecCCCCcc----------eeecccccCchhHHHHHHHhchhhccccCC Q lcl|Aclame:pro 317 NNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFN----------MVEKKTDMDNRTEIAISWINGLKKIRFPEK 386 (404) Q Consensus 317 ~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~----------w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~ 386 (404) + +..+.+ .+-..++||=...+.++.-|+.-++.+ |-=+..-| T Consensus 240 ~--G~~~~~----~ak~INfiiv~~~a~i~~~K~~~v~~f~P~~~~~gd~~l~~~R~Y---------------------- 291 (311) T protein:vir:99 240 D--GAKPTE----DAKAINFLVVAKPAVISIVKENAVFLFAPGQHTDGDGYLYQNRLY---------------------- 291 (311) T ss_pred C--CccccC----cccccceEEeCCCeeeeeeeeeeeeeeCCCCCCCcceeeeeeeee---------------------- Confidence 1 111111 123468888888888888774333222 21111111 Q ss_pred CCCceEEEEEEEeeeecC Q lcl|Aclame:pro 387 SGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 387 ~g~~~DfGvi~idta~~~ 404 (404) |.+++++...+. T Consensus 292 ------~D~fv~~nk~~~ 303 (311) T protein:vir:99 292 ------HDLFIKKHKRDG 303 (311) T ss_pred ------eeeeeeccccCe Confidence 233344433333 No 71 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=62.49 E-value=0.33 Score=23.19 Aligned_cols=295 Identities=8% Similarity=0.014 Sum_probs=116.5 Q ss_pred CC-CcCchHHHHHHHHHHHHH----hhccchHHHHHHhhhhhhh---------hhccccccccCCCCCccEEEEecccCC Q lcl|Aclame:pro 1 MT-TVTSAQANKLYQVALFTA----ANRNRSMVNILTEQQEAPK---------AVSPDKKSTKQTSAGAPVVRITDLNKQ 66 (404) Q Consensus 1 ~~-~~~~~~a~~~~~~~lft~----~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~gt~~~~~I~~~~dL~k~ 66 (404) .. ..+........+..-|-. ...+... ......+.... .....+... - ...++|...-....- T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~i~~~~~~~ii~~-~-~~~~~l~~~~~~~~~ 143 (385) T protein:vir:19 67 GAENPGEKKSFSERAAEELIKSWDGKQGTFGA-KTFNKSLGSDADSAGSLIQPMQIPGIIMP-G-LRRLTIRDLLAQGRT 143 (385) T ss_pred cccccchhhhhHHHHHHHHHHHHHHhhccchh-hHHHhhhccccccCCceecchhhhHHHHH-h-hhccchhhhcceecc Confidence 11 111111111111111111 1111110 00000000000 000000000 0 111222211111112 Q ss_pred CCcEEEEEEeecc--ccCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 AGDEVTFSIMHKL--SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQ 144 (404) Q Consensus 67 ~Gd~v~f~L~~~L--~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~ 144 (404) .|..+++.....- ...+|...+. -.+...+|..-++.+......+.+...+-+ ...+|-..-++.|++-+...+ T Consensus 144 ~~~~~~~~~~~~~~~~a~~v~E~~~--~~~~~~~~~~~~~~~~k~~~~~~is~ell~--d~~~l~~~i~~~la~a~~~~~ 219 (385) T protein:vir:19 144 SSNALEYVREEVFTNNADVVAEKAL--KPESDITFSKQTANVKTIAHWVQASRQVMD--DAPMLQSYINNRLMYGLALKE 219 (385) T ss_pred cCcceEEEEEecCCcceeeeccCcc--ccccccceeEEEEeeeeEEEeehhhHHHHh--hHHHHHHHHHHHHHHHHHHHH Confidence 2344555443221 2223322222 224456666666666666666555444322 335688888999999999999 Q ss_pred HHHHHHHhhhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcC Q lcl|Aclame:pro 145 DQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMA 224 (404) Q Consensus 145 D~~~~~~laG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~ 224 (404) |+.+| .|.- .+. ...++... ........+++...+++.|..+...+.... T Consensus 220 d~~~l---~G~g---~~~----------~~~Gi~~~--------------~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~ 269 (385) T protein:vir:19 220 EGQLL---NGDG---TGD----------NLEGLNKV--------------ATAYDTSLNATGDTRADIIAHAIYQVTESE 269 (385) T ss_pred HHHHH---hccC---CCC----------cccccccc--------------cccccccccccccchHHHHHHHHHhhcccc Confidence 98886 3321 111 11122110 001111233344445666666655553211 Q ss_pred CCCCceEecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcce----eCCeEEEcCEEEEecC Q lcl|Aclame:pro 225 HPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLF----KGECAMWRNILVRKYA 300 (404) Q Consensus 225 ~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF----~G~~gm~ngvii~~~~ 300 (404) ... =+++|||..+..|++=.+ . ..+||| .|.-+.+.|++++..+ T Consensus 270 ---------~~~-------~~~~~~~~~~~~l~~lkd--------------~--~G~~l~~~~~~~~~~~l~G~pV~~~~ 317 (385) T protein:vir:19 270 ---------FSA-------SGIVLNPRDWHNIALLKD--------------N--EGRYIFGGPQAFTSNIMWGLPVVPTK 317 (385) T ss_pred ---------CCC-------CEEEEcHHHHHHHHHhhc--------------C--CCceeccCcccCCCceecceeeEEcC Confidence 001 278999999888875211 1 124454 5777888898887766 Q ss_pred ceeeeeccceeEEeecCcccccccccccccchhhheeecc--ceeEEEeeecCCCCcceeeccccc--CchhHHHHHHHh Q lcl|Aclame:pro 301 GMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGA--QALANAYGQKAGGHFNMVEKKTDM--DNRTEIAISWIN 376 (404) Q Consensus 301 ~~~irf~~~~~~~~~~~~~~a~~~~~aa~~~v~ralLlGa--QAl~~A~g~~~g~r~~w~Ee~~D~--g~~~~i~i~~i~ 376 (404) .+|- + .+++|- +++.++- ..++...|..+..|+ -+.+.+-+.+-+ T Consensus 318 ~~p~------------------------~-----~~~~gd~~~~~~~~~--~~~~~v~~~~~~~~~~~~~~~~~~~~~r~ 366 (385) T protein:vir:19 318 AQAA------------------------G-----TFTVGGFDMASQVWD--RMDATVEVSREDRDNFVKNMLTILCEERL 366 (385) T ss_pred cCCC------------------------C-----cEEEeecccEEEEEE--ecceEEEEeccccchhhcCcEEEEEEEee Confidence 5541 0 134443 3333321 123444454444333 111111111112 Q ss_pred chhhccccCCCCCceEEEEEEEeeee Q lcl|Aclame:pro 377 GLKKIRFPEKSGKMQDHGVIAVDTAV 402 (404) Q Consensus 377 G~~K~rF~~~~g~~~DfGvi~idta~ 402 (404) |.+-. +.+=|-++.+=+|+ T Consensus 367 ~~~v~-------~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 367 ALAHY-------RPTAIIKGTFSSGS 385 (385) T ss_pred ccEEe-------cccceEEEEeccCC Confidence 21111 12234444444444 No 72 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=62.49 E-value=0.33 Score=23.19 Aligned_cols=295 Identities=8% Similarity=0.014 Sum_probs=116.5 Q ss_pred CC-CcCchHHHHHHHHHHHHH----hhccchHHHHHHhhhhhhh---------hhccccccccCCCCCccEEEEecccCC Q lcl|Aclame:pro 1 MT-TVTSAQANKLYQVALFTA----ANRNRSMVNILTEQQEAPK---------AVSPDKKSTKQTSAGAPVVRITDLNKQ 66 (404) Q Consensus 1 ~~-~~~~~~a~~~~~~~lft~----~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~gt~~~~~I~~~~dL~k~ 66 (404) .. ..+........+..-|-. ...+... ......+.... .....+... - ...++|...-....- T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~i~~~~~~~ii~~-~-~~~~~l~~~~~~~~~ 143 (385) T protein:vir:18 67 GAENPGEKKSFSERAAEELIKSWDGKQGTFGA-KTFNKSLGSDADSAGSLIQPMQIPGIIMP-G-LRRLTIRDLLAQGRT 143 (385) T ss_pred cccccchhhhhHHHHHHHHHHHHHHhhccchh-hHHHhhhccccccCCceecchhhhHHHHH-h-hhccchhhhcceecc Confidence 11 111111111111111111 1111110 00000000000 000000000 0 111222211111112 Q ss_pred CCcEEEEEEeecc--ccCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 AGDEVTFSIMHKL--SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQ 144 (404) Q Consensus 67 ~Gd~v~f~L~~~L--~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~ 144 (404) .|..+++.....- ...+|...+. -.+...+|..-++.+......+.+...+-+ ...+|-..-++.|++-+...+ T Consensus 144 ~~~~~~~~~~~~~~~~a~~v~E~~~--~~~~~~~~~~~~~~~~k~~~~~~is~ell~--d~~~l~~~i~~~la~a~~~~~ 219 (385) T protein:vir:18 144 SSNALEYVREEVFTNNADVVAEKAL--KPESDITFSKQTANVKTIAHWVQASRQVMD--DAPMLQSYINNRLMYGLALKE 219 (385) T ss_pred cCcceEEEEEecCCcceeeeccCcc--ccccccceeEEEEeeeeEEEeehhhHHHHh--hHHHHHHHHHHHHHHHHHHHH Confidence 2344555443221 2223322222 224456666666666666666555444322 335688888999999999999 Q ss_pred HHHHHHHhhhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcC Q lcl|Aclame:pro 145 DQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMA 224 (404) Q Consensus 145 D~~~~~~laG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~ 224 (404) |+.+| .|.- .+. ...++... ........+++...+++.|..+...+.... T Consensus 220 d~~~l---~G~g---~~~----------~~~Gi~~~--------------~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~ 269 (385) T protein:vir:18 220 EGQLL---NGDG---TGD----------NLEGLNKV--------------ATAYDTSLNATGDTRADIIAHAIYQVTESE 269 (385) T ss_pred HHHHH---hccC---CCC----------cccccccc--------------cccccccccccccchHHHHHHHHHhhcccc Confidence 98886 3321 111 11122110 001111233344445666666655553211 Q ss_pred CCCCceEecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcce----eCCeEEEcCEEEEecC Q lcl|Aclame:pro 225 HPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLF----KGECAMWRNILVRKYA 300 (404) Q Consensus 225 ~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF----~G~~gm~ngvii~~~~ 300 (404) ... =+++|||..+..|++=.+ . ..+||| .|.-+.+.|++++..+ T Consensus 270 ---------~~~-------~~~~~~~~~~~~l~~lkd--------------~--~G~~l~~~~~~~~~~~l~G~pV~~~~ 317 (385) T protein:vir:18 270 ---------FSA-------SGIVLNPRDWHNIALLKD--------------N--EGRYIFGGPQAFTSNIMWGLPVVPTK 317 (385) T ss_pred ---------CCC-------CEEEEcHHHHHHHHHhhc--------------C--CCceeccCcccCCCceecceeeEEcC Confidence 001 278999999888875211 1 124454 5777888898887766 Q ss_pred ceeeeeccceeEEeecCcccccccccccccchhhheeecc--ceeEEEeeecCCCCcceeeccccc--CchhHHHHHHHh Q lcl|Aclame:pro 301 GMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGA--QALANAYGQKAGGHFNMVEKKTDM--DNRTEIAISWIN 376 (404) Q Consensus 301 ~~~irf~~~~~~~~~~~~~~a~~~~~aa~~~v~ralLlGa--QAl~~A~g~~~g~r~~w~Ee~~D~--g~~~~i~i~~i~ 376 (404) .+|- + .+++|- +++.++- ..++...|..+..|+ -+.+.+-+.+-+ T Consensus 318 ~~p~------------------------~-----~~~~gd~~~~~~~~~--~~~~~v~~~~~~~~~~~~~~~~~~~~~r~ 366 (385) T protein:vir:18 318 AQAA------------------------G-----TFTVGGFDMASQVWD--RMDATVEVSREDRDNFVKNMLTILCEERL 366 (385) T ss_pred cCCC------------------------C-----cEEEeecccEEEEEE--ecceEEEEeccccchhhcCcEEEEEEEee Confidence 5541 0 134443 3333321 123444454444333 111111111112 Q ss_pred chhhccccCCCCCceEEEEEEEeeee Q lcl|Aclame:pro 377 GLKKIRFPEKSGKMQDHGVIAVDTAV 402 (404) Q Consensus 377 G~~K~rF~~~~g~~~DfGvi~idta~ 402 (404) |.+-. +.+=|-++.+=+|+ T Consensus 367 ~~~v~-------~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 367 ALAHY-------RPTAIIKGTFSSGS 385 (385) T ss_pred ccEEe-------cccceEEEEeccCC Confidence 21111 12234444444444 No 73 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=59.03 E-value=0.4 Score=22.76 Aligned_cols=299 Identities=10% Similarity=-0.045 Sum_probs=109.2 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccch-HHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeecc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 79 (404) ||+- ++..-.+-...|=-.-..++| .+-.-..++....-.+..++. .+...++. .|.|.-..+. T Consensus 1 ~~~~-~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~-~~~a~~~~-------------~v~f~~~~p~ 65 (318) T protein:vir:10 1 MTAP-TGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFR-NGGANPNG-------------VVAYNEGNPS 65 (318) T ss_pred CCCC-CcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhh-cccccccc-------------eeEEEecccc Confidence 6543 111111111111112223444 221111111111111111221 11121222 2222222222 Q ss_pred ccCceecCceeeeehhh---hhhcccEEE-EeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhh-h Q lcl|Aclame:pro 80 SKRPTMGDERVEGRGED---LSHADFSLK-INQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA-G 154 (404) Q Consensus 80 ~G~gv~Gd~~leGnee~---L~~~sd~v~-Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~la-G 154 (404) -..+- =.+..||-|=. ...-..+|. +.-..-.+... .-.+.|..+|.-..+=..|..=+.++.|..++..|. + T Consensus 66 ~~~~d-~e~VaEggEiP~~~~~~G~~~ia~~~K~G~~~~vS-~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa 143 (318) T protein:vir:10 66 FLEDD-VADVAEFGEIPVSAGARGLPRTAFAVKKALGVRVS-KEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSP 143 (318) T ss_pred cccCc-HhhccCcccccccCCCCCchhhhhhehhccceecc-HHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 11110 01112222211 122222221 11222223332 224567788888888889999999999999987762 2 Q ss_pred cccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecC Q lcl|Aclame:pro 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (404) Q Consensus 155 ~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g 234 (404) .+ +.+.++..= ...+....|++. |+..+.+..+-+-|-.... T Consensus 144 ~t-----------------------~~~~~s~~w---------~~~~~~~~d~~~------A~e~v~~a~~~~~~a~~~~ 185 (318) T protein:vir:10 144 IV-----------------------PTLAVPTAW---------DNGGKVRTDIAI------AIEQISTAAPTAYPAGVGS 185 (318) T ss_pred cc-----------------------ccccCCcCC---------CCcccccccchh------hhhhhhhhhhhhhhhhhhh Confidence 22 111111110 000111112222 2222222111111111111 Q ss_pred ccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCcce-----eCCe-EEEcCEEEEecCceeeeecc Q lcl|Aclame:pro 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLF-----KGEC-AMWRNILVRKYAGMPIRFYQ 308 (404) Q Consensus 235 ~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF-----~G~~-gm~ngvii~~~~~~~irf~~ 308 (404) .+..-+=..=+++|||.++..|.++++ |++... +..||+| +|.+ |+.-|+-+.--|.+|. T Consensus 186 ~~~~~GY~pdtIVlhP~~~~~l~~n~~---~~~~y~-------~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~---- 251 (318) T protein:vir:10 186 SDEYFGFIPDTIVMHYALLPILMDNEN---FMKVYE-------RNANYVSTAPDWTGNFPGSVMGLNVIRSRTFPI---- 251 (318) T ss_pred hhhccCccceeeEECHHHHHHHhcchh---hhhhhh-------ccchhhhhcccccccccceeeceEEeecCccCC---- Confidence 111111112389999999999999984 665432 2347776 3443 4456777766565431 Q ss_pred ceeEEeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccc-cCchhHHHHHHHhchhhccccCCC Q lcl|Aclame:pro 309 GSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTD-MDNRTEIAISWINGLKKIRFPEKS 387 (404) Q Consensus 309 ~~~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D-~g~~~~i~i~~i~G~~K~rF~~~~ 387 (404) +++++|=+++++.-+=..+-..-.|..|--| +|-+.+- |..=+. || T Consensus 252 ------------------------~~alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s---~~~~~~--~~---- 298 (318) T protein:vir:10 252 ------------------------DRVLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTES---YRADAS--HK---- 298 (318) T ss_pred ------------------------CeeEEEecCCcceeeccccceeeecccCCCCCCCCcchh---hheehh--ee---- Confidence 1356666555552211000011122222111 1111110 111111 11 Q ss_pred CCceEEEEEEEeeeecC Q lcl|Aclame:pro 388 GKMQDHGVIAVDTAVKL 404 (404) Q Consensus 388 g~~~DfGvi~idta~~~ 404 (404) .=+||.-=-.+++| T Consensus 299 ---~~~~V~~PkA~~~i 312 (318) T protein:vir:10 299 ---RALAVDQPKAALWL 312 (318) T ss_pred ---eeeeeeCcceeEEE Confidence 12233222234444 No 74 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=52.28 E-value=0.56 Score=21.97 Aligned_cols=311 Identities=16% Similarity=0.211 Sum_probs=142.4 Q ss_pred CCCcCchHHHHHHHHHHHHHhhc-----------------cchHHHHHHhhhhhhhhhccccccccCCCCCccEEEE--- Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANR-----------------NRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRI--- 60 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~--- 60 (404) -+-|+..|-.. .-.|=|.-.+ +--.+.++.++++-++-.+.---...+++.++-|-.- T Consensus 10 ~~~~~~~~~~e--~k~lr~~me~~et~~e~~~~~~~~~~~e~el~E~f~Kmm~G~~p~~eV~~~e~mtt~~a~IliP~vi 87 (393) T protein:vir:79 10 ESGFTETQVQE--QKSLRTRMERGETLAEADANKLALNEEETQILESFAKMMEGETPTNEVNLREFMATPSAQILIPRVI 87 (393) T ss_pred hccCchhHHHH--HHHHHHHhhhhhhhhhhhhhhhhcchhHHHHHHHHHHHhcCCCchhheehhhhhcCCCcceechhhh Confidence 11111111111 0111111111 1112444444433222111111111233333222211 Q ss_pred ------------------ecccCCCCcEEEEEEeeccccCceecCceeeeehhhhh-hcccEEEEeeeccccccCcchhh Q lcl|Aclame:pro 61 ------------------TDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLS-HADFSLKINQGRHLVDAGGRMSQ 121 (404) Q Consensus 61 ------------------~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leGnee~L~-~~sd~v~Idq~R~~V~~~gkms~ 121 (404) ++..-+.|.+..|.=+.-++..-|-....++ +.+|+ +..+.|.+.+.|-++.+. +|| T Consensus 88 s~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g~~Ra~~IgEGgE~~--~~sld~~T~dsv~~~~gK~G~~Ia--~Sq 163 (393) T protein:vir:79 88 VGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSIGIMRAYDVAEGQEIP--EDSIDWQTHESPEIRVGKSGIRLR--FTD 163 (393) T ss_pred hhhhhhcccchhHHHHHHHHHhhhcCcceeccchheeeecccccccccc--ccchhhhcCCceeEEechhhhhhh--hHH Confidence 1122246778887777766666665554444 77888 788899999999998874 566 Q ss_pred h---hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccceeeccccccccccccCccCCCCCCceEeccCCcc Q lcl|Aclame:pro 122 Q---RTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATS 198 (404) Q Consensus 122 q---rs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~ 198 (404) | +|..|+...+-....+-|+++.|+.+|..+ + .+ -|+-|..+..+++.-|+ |-+.. T Consensus 164 EmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~f---k---~~--------ghtvfDa~st~t~ahpt------Gr~~~- 222 (393) T protein:vir:79 164 EMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQF---R---SH--------GHTVFDNYSTNKLAHTT------GLDKN- 222 (393) T ss_pred HHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhh---h---cc--------cceeeeccccCccceee------cCCcc- Confidence 5 788999999999999999999999998443 1 11 22334444444444444 21111 Q ss_pred ccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhc--c Q lcl|Aclame:pro 199 FEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNR--A 276 (404) Q Consensus 199 ~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar--~ 276 (404) . .-.++||+.-+.++...+.... +++ =|++|||..|+.++++.... -.|.+|.-. . T Consensus 223 --~-~qNGTlSleDllDm~~av~~~h-------yt~---------svi~MHPLAWnv~AKna~me---~~~~na~gN~~~ 280 (393) T protein:vir:79 223 --G-VQNDTFSAEDFLDLIIAVMANE-------YTP---------SDLMMHPLAWTVFAKNELMG---SLQANPYGNYPA 280 (393) T ss_pred --c-cccccccHHHHHHHHHHHhccc-------CCc---------ceEEEcCchhhhhhhhhhhc---ceeeccccccCc Confidence 1 4578999887766655554322 221 38999999999999986433 222332210 0 Q ss_pred ccccCc------ceeCCeEEEcCEEEEecCceeeeeccceeEEeecCcccccccccccccchhhh---eeeccceeEEEe Q lcl|Aclame:pro 277 KGFNHP------LFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRA---MLLGAQALANAY 347 (404) Q Consensus 277 ~g~~nP------lF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~~~~aa~~~v~ra---lLlGaQAl~~A~ 347 (404) ++.+.- +-+|.+-.-=+|++ .|-+|. . . .+.- ---+.|+|+ +|| T Consensus 281 ~~~~ts~algp~~i~~~~~~nlnv~~--sPfvp~--d---~--------k~~r---Fd~~~Vd~NnvgvlL--------- 333 (393) T protein:vir:79 281 KGAPSSMALGPDSIQGRLPFNFNVNL--SPFIPL--D---K--------KSRR---FDVYAVDRNNVGVLL--------- 333 (393) T ss_pred cccchhhhhchhhhccccccceeEEE--eccccc--c---c--------ccce---eeEEEeecCCceEEE--------- Confidence 111100 00111000012222 222221 0 0 0000 001123332 233 Q ss_pred eecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCC---ceEEEEEEEeeeecC Q lcl|Aclame:pro 348 GQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGK---MQDHGVIAVDTAVKL 404 (404) Q Consensus 348 g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~---~~DfGvi~idta~~~ 404 (404) -. -..+.+.+| +.+.|+.|++....+|- .+-+| |..+-.| T Consensus 334 V~-----D~i~tdq~d---------dk~rdiq~iKl~ERYG~gvLn~gka---iavakNI 376 (393) T protein:vir:79 334 VR-----DDLKTDQWD---------EKARGLQNIKMIERYGIGILNEGKA---IAVAKNI 376 (393) T ss_pred Ee-----cCcceeccc---------cccccceeeeeeeeeceeeeeCCce---EEEEecc Confidence 11 122222222 24556666666554332 11222 2222233 No 75 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=47.36 E-value=0.7 Score=21.42 Aligned_cols=296 Identities=9% Similarity=0.046 Sum_probs=106.0 Q ss_pred CCCcCchHHHHHHH------HHHHHHhhccch-HH-HHHHhhhhhhhhhccccccccCCCCCccEEE---EecccCCCCc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQ------VALFTAANRNRS-MV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVR---ITDLNKQAGD 69 (404) Q Consensus 1 ~~~~~~~~a~~~~~------~~lft~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~---~~dL~k~~Gd 69 (404) +.........+... .++=+.+.-... .+ ..+...+..... ..+||.. +..+....|. T Consensus 88 ~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~------------~~~~l~~l~~~~~~~~~~g~ 155 (404) T protein:vir:10 88 ADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLK------------DTTDLYNMVDYEPVFTRSGS 155 (404) T ss_pred HHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHh------------hhhhHhhhhceeeccCCccc Confidence 00000000000000 000000000000 00 111111111000 1112111 1111112222 Q ss_pred EEEEEEeeccccCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 70 EVTFSIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAI 149 (404) Q Consensus 70 ~v~f~L~~~L~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~ 149 (404) .....+...-...+|..++........++|..-++.+....+-+.....+- +.+.++|....++.|.+.+....|+.+| T Consensus 156 ~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~la~~~~~~~~~~il 234 (404) T protein:vir:10 156 RTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLL-KFADKSLEDWIINWFVDKVRITRNAEIL 234 (404) T ss_pred eEEEEecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHH-hhcHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 111112222223344333333222234555544555555555554443322 4577899999999999999999999886 Q ss_pred HHhhhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCc Q lcl|Aclame:pro 150 VHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQP 229 (404) Q Consensus 150 ~~laG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~P 229 (404) .|.-.. .+.. +++ ... +...++.+...+.+.+..++.+.-..+ T Consensus 235 ---~G~g~~-~~~~------------gi~--------------~~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~----- 277 (404) T protein:vir:10 235 ---YGAGGD-EHAT------------GIM--------------TAN--KFKKITLPKSPALKDFKKCKNVELLNV----- 277 (404) T ss_pred ---hcCCCC-Cccc------------cee--------------ecc--ccceeeccccccHHHHHHHHHhhhhcc----- Confidence 332110 1111 110 000 011122233334555555443321111 Q ss_pred eEecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCccee-----CCeEEEcCEEEEecCceee Q lcl|Aclame:pro 230 VRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK-----GECAMWRNILVRKYAGMPI 304 (404) Q Consensus 230 v~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~-----G~~gm~ngvii~~~~~~~i 304 (404) ... --+++|||.-+..|++= | . +..+|||. |.-+++.|.++...+.. T Consensus 278 --~~~--------~~~~v~n~~~~~~L~~l----------k----d--~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~-- 329 (404) T protein:vir:10 278 --FKA--------TSSWIVNQDGFNYLDSL----------E----D--KTGRPYLQPDPKDPTQYRFLGLPVIELPND-- 329 (404) T ss_pred --ccC--------CCEEEEcHHHHHHHHHh----------h----c--cCCceeeccCcCCCCCccccceeeEEeccc-- Confidence 111 12578999888777651 1 1 12367775 44557778766432210 Q ss_pred eeccceeEEeecCcccccccccccccchhhheeecc--ceeEEEeeecCCCCcceeecc-ccc-CchhHHHHHHHhchhh Q lcl|Aclame:pro 305 RFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGA--QALANAYGQKAGGHFNMVEKK-TDM-DNRTEIAISWINGLKK 380 (404) Q Consensus 305 rf~~~~~~~~~~~~~~a~~~~~aa~~~v~ralLlGa--QAl~~A~g~~~g~r~~w~Ee~-~D~-g~~~~i~i~~i~G~~K 380 (404) .. .. + .-+-.+++|- +++.+... .++...+..+. .|| .+.+.+-+...+|++- T Consensus 330 -------------~~---~~--~---~~~~~~~~gd~s~~~~~~~~--~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v 386 (404) T protein:vir:10 330 -------------LL---LS--T---ESAIPVLLGDTKEAYKYVSD--GAYELATTNIGAGAFETNTTKARIIMRIDGNV 386 (404) T ss_pred -------------cc---CC--C---CCccEEEEEeccccEEEEEe--cceEEEEeccccchhhcCceEEEEEEeeccEE Confidence 00 00 0 1122467773 34333321 23333333222 112 1222222222222211 Q ss_pred ccccCCCCCceEEEEEEEeeeecC Q lcl|Aclame:pro 381 IRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 381 ~rF~~~~g~~~DfGvi~idta~~~ 404 (404) . +.+=|-++.+-+++.= T Consensus 387 ~-------~~~a~~~~~~~~aa~~ 403 (404) T protein:vir:10 387 K-------DSEALLIAEIPVESVQ 403 (404) T ss_pred e-------cccceEEEEeecccCC Confidence 1 1112222222222222 No 76 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=41.53 E-value=0.92 Score=20.77 Aligned_cols=290 Identities=11% Similarity=0.061 Sum_probs=103.8 Q ss_pred CCC--cCchHHHHHHHHHHHHHhhccchHHHHHHhhh----------------hhhhhhccccccccCCCCCccEEE-Ee Q lcl|Aclame:pro 1 MTT--VTSAQANKLYQVALFTAANRNRSMVNILTEQQ----------------EAPKAVSPDKKSTKQTSAGAPVVR-IT 61 (404) Q Consensus 1 ~~~--~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~gt~~~~~I~~-~~ 61 (404) .++ .+...+.....-+ |..........+.....+ ..+......+... -...++|.. ++ T Consensus 94 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~--~~~~~~l~~~~~ 170 (418) T protein:vir:10 94 ETPKTLGQLVTESEEMKG-MDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAP--PQRKMTIRDLLM 170 (418) T ss_pred chhhhhhHHhhhHHHHHH-HHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHH--HhhhhhHHhhcc Confidence 000 0000000000000 000000000000000000 0000000000000 001111111 11 Q ss_pred cccCCCCcEEEEEEeeccc--cCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHH Q lcl|Aclame:pro 62 DLNKQAGDEVTFSIMHKLS--KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTY 139 (404) Q Consensus 62 dL~k~~Gd~v~f~L~~~L~--G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w 139 (404) -. .-.|..+++.....-. ..+|...... .+..++|..-++.+.....-+.+...+-+ -+ .+|...-+..|.+- T Consensus 171 ~~-~~~~~~~~~~~~~~~~~~a~~v~E~~~~--~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds-~~l~~~i~~~l~~a 245 (418) T protein:vir:10 171 PG-QTSSSSIEYTVETGFTNNAAAVAEGAQK--PTSDLKFNLKNQPVRTIAHLFKASRQILD-DA-PALQSYIDGRARYG 245 (418) T ss_pred ee-eccCCceeEEEEecCCCceeeeccCccc--cccccceeeEEEeeeeEEEeehhhHHHHH-hH-HHHHHHHHHHHHHH Confidence 11 1122334444332211 1133222222 24445666666666666665655444432 23 48888999999999 Q ss_pred HHHHHHHHHHHHhhhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHH Q lcl|Aclame:pro 140 FNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLF 219 (404) Q Consensus 140 ~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~ 219 (404) ++...|..+| .| +..+. +| .++. |... ........++..+++.|-.++.. T Consensus 246 ~~~~~d~a~l---~G---~g~~~--------~p--~Gi~-~~~~-------------~~~~~~~~~~~~~~~~i~~~~~~ 295 (418) T protein:vir:10 246 LQLTEEGQIL---KG---DGTGA--------NI--LGIL-PQAS-------------AFMPSITLANATPIDKIRLALLQ 295 (418) T ss_pred HHHHHHHHHh---cc---CCCCc--------cc--cccc-cccc-------------cccccccccccccHHHHHHHHHh Confidence 9999999886 22 11111 11 1111 1000 00111222333344544444433 Q ss_pred HHhcCCCCCceEecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCccee----CCeEEEcCEE Q lcl|Aclame:pro 220 IDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK----GECAMWRNIL 295 (404) Q Consensus 220 a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~----G~~gm~ngvi 295 (404) +... .. . .-+++|||..+..|++=- . +..+|||. |.-+.+.|++ T Consensus 296 ~~~~-------~~--~-------~~~~v~n~~~~~~L~~lk--------------d--~~G~~i~~~~~~~~~~~l~G~p 343 (418) T protein:vir:10 296 AVLA-------EF--P-------ATGIVLNPIDWASIELTK--------------D--SQGRYIVGNPVNGTTPRLWNLP 343 (418) T ss_pred hccc-------cC--C-------CCEEEEcHHHHHHHHHhh--------------c--CCCceeccccccCCCceeccee Confidence 3211 00 0 125789999988887521 1 12256663 5567888888 Q ss_pred EEecCceeeeeccceeEEeecCcccccccccccccchhhheeecc--ceeEEEeeecCCCCcceeecccccCchhHHHHH Q lcl|Aclame:pro 296 VRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGA--QALANAYGQKAGGHFNMVEKKTDMDNRTEIAIS 373 (404) Q Consensus 296 i~~~~~~~irf~~~~~~~~~~~~~~a~~~~~aa~~~v~ralLlGa--QAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~ 373 (404) ++....+|- + -+++|- |++.++ -..++...|.++..++ T Consensus 344 V~~~~~~p~------------------------~-----~~~~gd~s~~~~~~--~~~~~~i~~~~~~~~~--------- 383 (418) T protein:vir:10 344 VVETQAMTA------------------------N-----EFLVGAFSMAAQIF--DRMEIEVLLSTENVDD--------- 383 (418) T ss_pred eEEcCCCCC------------------------C-----cEEEeeccceEEEE--EecceEEEEecccchh--------- Confidence 876665431 0 134443 222222 1123344444332211 Q ss_pred HHhchhhccccCCCCCceEEEEEEEeeeecC Q lcl|Aclame:pro 374 WINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 374 ~i~G~~K~rF~~~~g~~~DfGvi~idta~~~ 404 (404) +..+.-..|+... -|+++.-=+..+.+ T Consensus 384 f~~~~~~~r~~~~----~d~~~~~~~a~~~~ 410 (418) T protein:vir:10 384 FEKNMVSIRAEER----LALAVYRPESFVTG 410 (418) T ss_pred hhcCceEEEEEEe----eccEEecccceEEE Confidence 1112222221000 02222211122222 No 77 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=39.73 E-value=1 Score=20.57 Aligned_cols=292 Identities=14% Similarity=0.165 Sum_probs=128.9 Q ss_pred HHhhccch-H-----HHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeeccccCceecCceeee Q lcl|Aclame:pro 19 TAANRNRS-M-----VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEG 92 (404) Q Consensus 19 t~~~~~~~-~-----~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~~leG 92 (404) -+++.|.+ . -++|+..+..---++ .. ...|.++.|. +.||+|.++=+.. ++.+|....+ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~-----Lv----~~~~~~~~d~--g~GDtV~InsIg~----~tV~dY~~~~ 65 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEK-----LL----DVNIARVVDF--PDGDKLTIPSVGT----PVVRSRPEQG 65 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhh-----hh----hhhhhccccc--CCCCeEEeccccc----cccccccCCC Confidence 22333443 1 357776543211111 11 1112333343 4599999876544 4444444333 Q ss_pred e--hhhhhhcccEEEEeeec---cccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhh-hccccccccccee Q lcl|Aclame:pro 93 R--GEDLSHADFSLKINQGR---HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLA-GARGDFVADDTIL 166 (404) Q Consensus 93 n--ee~L~~~sd~v~Idq~R---~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~la-G~rg~~~n~~~~~ 166 (404) . -+.|+-...+|.|||.. +.|+- .+ . -...||+..+-...++=+++-.|+-....|. |+-....... T Consensus 66 ~i~~d~ltt~~~~l~IDq~KYfaf~VdD-D~-~--Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~--- 138 (322) T protein:vir:31 66 DFTFDNLDTGEISIILRDEVYAGNAISK-KL-R--QDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQND--- 138 (322) T ss_pred CcccccCCCceEEEEEehhhhhccccch-hH-H--HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCC--- Confidence 3 57788899999999965 45554 33 2 2678899999999998888888887754332 2211000000 Q ss_pred eccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEE Q lcl|Aclame:pro 167 PTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVL 246 (404) Q Consensus 167 p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~ 246 (404) | .-.|+. | +.++ .+.+.=++.++.|.++..++.+..-| ... +++ T Consensus 139 p---------~vin~~--~---~~iv--------~~gt~~~~ay~~lv~l~~kLdkanVP-------~~g-------R~v 182 (322) T protein:vir:31 139 P---------NVINGV--P---HRFV--------GTGTDQTMDVTDFSRVNYVMTQSKMP-------MGG-------MIG 182 (322) T ss_pred c---------ceecCC--c---ccee--------ccCCCchhhHHHHHHHHHHhccccCC-------CCC-------eEE Confidence 0 001111 1 0111 12223456688888888888876544 122 688 Q ss_pred EEchHHHHHHHh---------CcchHHHHHHHHHhhhccccccCcceeCCeEEEcCEEEEecCceeeeeccceeEEeecC Q lcl|Aclame:pro 247 YVTPRQWNDWYT---------STSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) Q Consensus 247 ~l~P~q~~dLr~---------d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~ 317 (404) +++|.+++.|.. |+. |-.+..+..++ |- .| +|-+-|+-|..--.++. ++ T Consensus 183 VV~P~~~~~L~~i~~~~~l~~D~r---f~~i~~sG~a~--g~---~~---Vg~~~GF~V~~SN~l~~-----------~~ 240 (322) T protein:vir:31 183 IIDPSVAHHLETITNISNISNNPR---WEGIVESGIAP--DM---QF---VRSVYGIDLFVSNLLAD-----------AN 240 (322) T ss_pred EeCchhhhhhhhhhhhhhhhcccc---ccccccccchh--hH---HH---HHHHhceeeeeeccccc-----------cc Confidence 999999887744 542 32222222111 11 12 34444544433222110 00 Q ss_pred cccccccccccccchhhheee-----ccceeEEEeeecCCCCcceeeccccc--CchhHHHHHHHhchhhccccCCCCCc Q lcl|Aclame:pro 318 NLTATTKEVAAATNIDRAMLL-----GAQALANAYGQKAGGHFNMVEKKTDM--DNRTEIAISWINGLKKIRFPEKSGKM 390 (404) Q Consensus 318 ~~~a~~~~~aa~~~v~ralLl-----GaQAl~~A~g~~~g~r~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rF~~~~g~~ 390 (404) -..-.+..++...+.-+++|+ |.-..+-+|=+. -..|.|=+ -...+...-+.+|-.=.|= T Consensus 241 ~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l------~~~e~~r~~~~~~d~~~~~~~~g~g~~r~------- 307 (322) T protein:vir:31 241 ETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEM------PTTKSFIDDYNDDLNTATTARWGNGLVRD------- 307 (322) T ss_pred cccccCcccccccceeecccccccchhhhhhhhHhhhh------hhhhcccCccccccceeeeeeecceeecc------- Confidence 000000011111111112221 222222222110 01121111 1112222333444333331 Q ss_pred eEEEEEEEeeeecC Q lcl|Aclame:pro 391 QDHGVIAVDTAVKL 404 (404) Q Consensus 391 ~DfGvi~idta~~~ 404 (404) |-=+.++=++.|. T Consensus 308 -e~l~~~~a~~~~~ 320 (322) T protein:vir:31 308 -ENLVCVLANADKV 320 (322) T ss_pred -cceEEEEeccccc Confidence 1122344455555 No 78 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=30.50 E-value=1.6 Score=19.51 Aligned_cols=274 Identities=13% Similarity=0.096 Sum_probs=115.2 Q ss_pred hhccchHH-----------HHHHhhhhhhhhhccccccccCCCCCccEEE-EecccCCCCcEEEEEEeeccccCceecCc Q lcl|Aclame:pro 21 ANRNRSMV-----------NILTEQQEAPKAVSPDKKSTKQTSAGAPVVR-ITDLNKQAGDEVTFSIMHKLSKRPTMGDE 88 (404) Q Consensus 21 ~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~-~~dL~k~~Gd~v~f~L~~~L~G~gv~Gd~ 88 (404) .+-|.-.+ ..++..+..... +.+++.. .+-+ .-.|...++..........|.+.+ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~------------~~s~l~~~~~~~-~~~~~~~~~~~~~~~~a~~v~E~~ 67 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVK------------NGSAAMKLAKAV-PMTKPEEEFTFMSGVGAFWVDEAE 67 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHH------------hcchhhhhceee-ecCCCcEEEEEEcCCceeeeecCc Confidence 23221100 112222111111 1122211 1222 222334444443333344443333 Q ss_pred eeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccceeec Q lcl|Aclame:pro 89 RVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPT 168 (404) Q Consensus 89 ~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~ 168 (404) . -.+...+|..-++.+.....-+.....+- +.+..||...-++.|++-+++..|+.+| .| +..+. |. T Consensus 68 ~--~~~~~~~f~~v~l~~~k~~~~~~is~ell-~ds~~~~~~~i~~~l~~a~~~~~d~a~l---~G---~g~~~----~~ 134 (299) T protein:vir:41 68 R--IQTSKPTFTKAKMRSKKMGVIIPTTKENL-NYSVTNFFSLMQAEIVEAFYKKFDQAVF---TG---VESPY----NW 134 (299) T ss_pred c--ccccccceeEEEEeeEEEEEeehhhHHHH-hcCHHHHHHHHHHHHHHHHHHHHHHHHh---hc---ccCcc----cc Confidence 2 23556677666666666555555543322 3567899999999999999999999887 22 21111 10 Q ss_pred cccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEE Q lcl|Aclame:pro 169 AEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYV 248 (404) Q Consensus 169 ~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l 248 (404) ++... ++...+..+.+..+++.|-++......... .+ -+++| T Consensus 135 -------gil~~---------------~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~---------------~~-~~~v~ 176 (299) T protein:vir:41 135 -------NILKS---------------ATDASNLVEETANKYDDLNEAIGLIEAEDL---------------EP-NGIAT 176 (299) T ss_pred -------ccccc---------------ccccceeeccccccHHHHHHHHHhhhcccC---------------Cc-CEEEE Confidence 11100 011112223444566656555554432110 01 36789 Q ss_pred chHHHHHHHhCcchHHHHHHHHHhhhccccccCccee----CCeEEEcCEEEEecCceeeeeccceeEEeecCccccccc Q lcl|Aclame:pro 249 TPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK----GECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTK 324 (404) Q Consensus 249 ~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~----G~~gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~~ 324 (404) ||..+..|++=. . +..+|||. ++.+.+.|.+++..+.+|. + T Consensus 177 n~~~~~~L~~lk----------d------~~G~~l~~~~~~~~~~~l~G~PV~~~~~~~~-----------------~-- 221 (299) T protein:vir:41 177 IRKQRVKYRSTK----------D------GNGMPIFNTATSNGVDDVLGLPIAYTPKYTF-----------------G-- 221 (299) T ss_pred cHHHHHHHHHhh----------c------cCCceeecCCcCCCCceecceeeEEecccCC-----------------C-- Confidence 999988888521 1 12366664 4556777887777665531 0 Q ss_pred ccccccchhhheeeccceeEEEeeecCCCCcceeeccc--ccCchhHHHH-HHHhchhhccccCCCCCceEEEEEEEeee Q lcl|Aclame:pro 325 EVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKT--DMDNRTEIAI-SWINGLKKIRFPEKSGKMQDHGVIAVDTA 401 (404) Q Consensus 325 ~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~--D~g~~~~i~i-~~i~G~~K~rF~~~~g~~~DfGvi~idta 401 (404) +. ...+++|--+-++ ++-..++.+.-.+|.. .+.+.-+..+ -+-.++-.+|.... -|+.+.-=..- T Consensus 222 --~~----~~~~~~gdfs~~~-i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~----~d~~v~~~~A~ 290 (299) T protein:vir:41 222 --DK----DISELVGDWNQAY-YGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFE----VGFMVVKDEAF 290 (299) T ss_pred --CC----ceEEEEEecccEE-EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEE----eccEEecccce Confidence 00 0125555444322 3333333333332211 0000000000 01122222221100 13333221112 Q ss_pred ecC Q lcl|Aclame:pro 402 VKL 404 (404) Q Consensus 402 ~~~ 404 (404) ++| T Consensus 291 ~~l 293 (299) T protein:vir:41 291 SAV 293 (299) T ss_pred EEE Confidence 222 No 79 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=30.02 E-value=1.6 Score=19.45 Aligned_cols=289 Identities=14% Similarity=0.054 Sum_probs=105.7 Q ss_pred CCCcCchHHHHHHHH--HHHHHhhcc-chHH-HHHHhhhhhhhhhccccccccCCCCCccEEE-EecccCCCCcEEEEEE Q lcl|Aclame:pro 1 MTTVTSAQANKLYQV--ALFTAANRN-RSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVR-ITDLNKQAGDEVTFSI 75 (404) Q Consensus 1 ~~~~~~~~a~~~~~~--~lft~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~-~~dL~k~~Gd~v~f~L 75 (404) +-....+...+.+.. ++-+.+... ..+| ..|...+..... ..+||.. ++-..-..|..+.+.. T Consensus 99 ~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~------------~~~~l~~~~~~~~~~~~~~~~~~~ 166 (409) T protein:vir:45 99 GASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMK------------SYGGIASVAQILTTSDGRTMEWAT 166 (409) T ss_pred hhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHH------------hhhhhhhhceeeecCCCceEEEEe Confidence 101111111111110 111100000 0000 112222111111 1112211 1111111222333333 Q ss_pred eeccc--cCceecCceeeeehhhhhhcccEEEEeee-ccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 76 MHKLS--KRPTMGDERVEGRGEDLSHADFSLKINQG-RHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHL 152 (404) Q Consensus 76 ~~~L~--G~gv~Gd~~leGnee~L~~~sd~v~Idq~-R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~l 152 (404) ..... +.+|...+ +-.+....|..-++..-.. .+-|.+...+-+. +.+||...-+..|++=+....|+.+| T Consensus 167 ~~~~~~~~~~v~E~~--~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~d-s~~~l~~~i~~~la~a~~~~~~~a~l--- 240 (409) T protein:vir:45 167 ADGTSEVGVLLGENE--EAGEEDTDFGMGSLGALKMTSKIIRVSNELLQD-SAIDMEAYLARRIAERIGRGEARYLI--- 240 (409) T ss_pred eccCccccccccccc--cccccccccceeeeeeeeeeeeehhhhHHHHhc-cHHHHHHHHHHHHHHHHHHHHHHHhh--- Confidence 32222 22332222 2234555655555554333 3445444443322 66888888888888888888888876 Q ss_pred hhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEe Q lcl|Aclame:pro 153 AGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRL 232 (404) Q Consensus 153 aG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~ 232 (404) . |+..... ....+++.+ . +......+++.++.+.|-++....... . T Consensus 241 ~---G~G~~~~--------~~p~Gil~~-~--------------~~~~~~~~~~~~~~d~i~~l~~~l~~~-------~- 286 (409) T protein:vir:45 241 Q---GTGAGTP--------KQPKGLAAS-V--------------TGTTQTAAANAVKWQEILALKHSIDPA-------Y- 286 (409) T ss_pred c---cCCCCCc--------cccceeeec-c--------------ccccccccccccchHHHHHHHHhhhhh-------h- Confidence 2 2111100 011122111 0 011122334556666554444433221 0 Q ss_pred cCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCccee-----CCeEEEcCEEEEecCceeeeec Q lcl|Aclame:pro 233 SGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK-----GECAMWRNILVRKYAGMPIRFY 307 (404) Q Consensus 233 ~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~-----G~~gm~ngvii~~~~~~~irf~ 307 (404) .....|++++||.-+..|++= | . +..+|||. |.-+.+.|.+++....+|- T Consensus 287 ------~~~a~~~~~~n~~~~~~l~~l----------k----d--~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~--- 341 (409) T protein:vir:45 287 ------RRGPKFRLAFNDNTLKLISEM----------E----D--GQGRPLWLPDIVGVAPASVLNVPYVIDQEIDD--- 341 (409) T ss_pred ------ccCCeEEEEECHHHHHHHHHh----------h----c--CCCceeeccCcCCCCCceecceeeEEecCcCC--- Confidence 111358999999887777641 1 1 22367765 4446777887766554430 Q ss_pred cceeEEeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCC Q lcl|Aclame:pro 308 QGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKS 387 (404) Q Consensus 308 ~~~~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~ 387 (404) . +++ ...+++|=-.-. ..+..+++...+..+.+--.+.+.+ . ...|| T Consensus 342 ---------------~---~~~---~~~i~~Gd~~~~-~i~~~~~~~~~~~~d~~~~~~~~~~-----~--~~~r~---- 388 (409) T protein:vir:45 342 ---------------I---GAG---KKFMFCGDFDRF-IIRRVRYMILKRLVERYAEYDQTGF-----L--AFHRF---- 388 (409) T ss_pred ---------------c---cCC---ccEEEEeehhhh-heeeccceEEEEeecccccCCcEEE-----E--EEEEe---- Confidence 0 000 012444431110 0111223334443332110111111 0 11133 Q ss_pred CCceEEEEEEEeeeecC Q lcl|Aclame:pro 388 GKMQDHGVIAVDTAVKL 404 (404) Q Consensus 388 g~~~DfGvi~idta~~~ 404 (404) |++++- +.|+++ T Consensus 389 ----d~~~~~-~~A~~~ 400 (409) T protein:vir:45 389 ----DCILED-TSAIKA 400 (409) T ss_pred ----ccEeec-hhheEE Confidence 333322 222222 No 80 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=28.95 E-value=1.7 Score=19.32 Aligned_cols=291 Identities=12% Similarity=0.030 Sum_probs=121.7 Q ss_pred hccch-H--HHHHHhhhhhhhhhccccccccCCCCCccEEEEe--ccc-CCCCcEEEEEEeeccccCceecCceeee-eh Q lcl|Aclame:pro 22 NRNRS-M--VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRIT--DLN-KQAGDEVTFSIMHKLSKRPTMGDERVEG-RG 94 (404) Q Consensus 22 ~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~--dL~-k~~Gd~v~f~L~~~L~G~gv~Gd~~leG-ne 94 (404) +-|+= . .++|+..+...-.+..-+... |.|-- |.. ...||+|++.......-.-..+. ...+ +- T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~l--------Vnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~-~~~~~~~ 71 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKT--------VDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTG-DISGQNK 71 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchh--------hcccCCcchhhcccCCEEEEeeCCcceeecccCc-ccCCccc Confidence 33331 1 255655543333222222111 11100 111 24799999987665543222221 1112 35 Q ss_pred hhhhhcccEEEEeeecc-ccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccceeecccccc Q lcl|Aclame:pro 95 EDLSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPE 173 (404) Q Consensus 95 e~L~~~sd~v~Idq~R~-~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg~~~n~~~~~p~~~~~~ 173 (404) ++|.-.+-.|.||+..| ++...++ ++.-...||- ..-+....=+++..|+.++..+.+... T Consensus 72 ~~l~e~~v~l~id~~k~va~~v~d~-E~~~~i~~~~-~~l~~A~~aLA~~vd~~ia~~~~~~a~---------------- 133 (423) T protein:vir:17 72 NNLISGKATGRVGNYITVAVEYQQL-EEAIKLNQLE-EILAPVRQRIVTDLETELAHFMMNNGA---------------- 133 (423) T ss_pred CccccceeEEEeeceeeeeeeecHH-HHhcChhHHH-HHHHHHHHHHHHHHHHHHHHHHhhccc---------------- Confidence 77777778899999987 5666542 2222334452 222222344566677666533322111 Q ss_pred ccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccccCCccEEEEEEchHHH Q lcl|Aclame:pro 174 FKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQW 253 (404) Q Consensus 174 ~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~~~~~~~yV~~l~P~q~ 253 (404) |.+..|. +..+ .++.|-.+...+.+.+-|- .+ +.+++.|+-+ T Consensus 134 ------~~~gt~~----------------t~~~--a~~~i~~a~~~Ld~~~vP~-------~~-------R~~Vv~p~~~ 175 (423) T protein:vir:17 134 ------LSLGSPN----------------TPIT--KWSDVAQTASFLKDLGVNE-------GE-------NYAVMDPWSA 175 (423) T ss_pred ------cccccCC----------------cccc--cHHHHHHHHHHHHhccCCc-------CC-------CEEEeChHHH Confidence 1000010 0011 2566677778887766551 11 5679999999 Q ss_pred HHHHhCcchHHHHHHHHHhhhccccccCcceeCCe-EEEcCEEEEecCceeeeeccceeEEeecCcccccccccccccch Q lcl|Aclame:pro 254 NDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGEC-AMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNI 332 (404) Q Consensus 254 ~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~G~~-gm~ngvii~~~~~~~irf~~~~~~~~~~~~~~a~~~~~aa~~~v 332 (404) ..|..+... .. + +. .+...-|=.|.+ |.+.|+-+.+..++|-. ..+.-.. +. .+ T Consensus 176 a~Ll~~~~~--~~---~-~~---~~~~~alr~g~i~G~i~GFdvy~Snnip~~-T~gt~~~-------------t~--~~ 230 (423) T protein:vir:17 176 QRLADAQTG--LH---A-SD---QLVRTAWENAQIPTNFGGIRALMSNGLASR-TQGAFGG-------------TL--TV 230 (423) T ss_pred HHHhccccc--ee---c-cc---ccchHHHhhccceeeecceEEEEeCCCccc-cccceec-------------ee--ee Confidence 998876421 11 1 11 122344656666 89999999988877621 1111000 00 00 Q ss_pred hhhee--eccceeEEEeeecCCCCcceee--cccccCchhHH-HHHHHhchhhcccc-CCCCCceEEEEEE-------Ee Q lcl|Aclame:pro 333 DRAML--LGAQALANAYGQKAGGHFNMVE--KKTDMDNRTEI-AISWINGLKKIRFP-EKSGKMQDHGVIA-------VD 399 (404) Q Consensus 333 ~ralL--lGaQAl~~A~g~~~g~r~~w~E--e~~D~g~~~~i-~i~~i~G~~K~rF~-~~~g~~~DfGvi~-------id 399 (404) .++-. .+++..... +..+.--.|.. ...--|+..-+ ++.++.=+.|-.+. .+.+..+-|.|.+ =+ T Consensus 231 ~~~~~v~~~a~~~~~~--~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~~ 308 (423) T protein:vir:17 231 KTQPTVTYNAVKDSYQ--FTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSSGD 308 (423) T ss_pred cccccccccccccccc--eeeeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEecccccccCc Confidence 00000 000000000 00000001110 01111221100 11111112232231 2233455666642 12 Q ss_pred eeecC Q lcl|Aclame:pro 400 TAVKL 404 (404) Q Consensus 400 ta~~~ 404 (404) +.++| T Consensus 309 ~tv~i 313 (423) T protein:vir:17 309 VTVTL 313 (423) T ss_pred eEEEe Confidence 33444 No 81 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=28.90 E-value=1.7 Score=19.31 Aligned_cols=286 Identities=9% Similarity=0.023 Sum_probs=118.3 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEee-cc Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMH-KL 79 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~-~L 79 (404) |-..++.. +.| .....+..+..++... ++|..+--....++..+++.... .. T Consensus 1 ma~~t~~~------G~l-----ip~~~~~~ii~~l~~~----------------s~i~~l~~~~~~~~~~~~~p~~~~~~ 53 (300) T protein:vir:95 1 MSEAQLSK------GNL-----FNPELVTKVINKVKGH----------------SSIAKLSPQKPIPFNGQREFVFDFDS 53 (300) T ss_pred CcccccCC------cce-----echhhHHHHHHHHHhh----------------hhhhhhcceeeccCCceEEEEEecCc Confidence 21111110 000 0111122222211111 11100000001111123333211 11 Q ss_pred ccCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhh--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 80 SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQ--RTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 80 ~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~q--rs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg 157 (404) ...+|-+++... +...+|..-++.+.....-+....++=++ -+..||-..-++.|.+=++...|+.+| -|.-. T Consensus 54 ~a~wv~Eg~~~~--~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l---~G~~~ 128 (300) T protein:vir:95 54 DIDIVAENGKKT--HGGVSLDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSI---HGINP 128 (300) T ss_pred ceEEeeCCcccc--cccccceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhh---hcccC Confidence 222343333322 55677777777776666666554433221 246899999999999999999999997 33110 Q ss_pred ccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccc Q lcl|Aclame:pro 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~ 237 (404) .. +... + |. ......+......+++...+.+.|.++.......+. +. T Consensus 129 ---~~----g~~~---------~----~~---~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~---------~~- 175 (300) T protein:vir:95 129 ---RT----KQAS---------T----II---GDNCFDKKVTQTVPFKDTNPDESMEDAVGMIDGSER---------DI- 175 (300) T ss_pred ---CC----CCCc---------c----cc---cccccccccceeecccccchHHHHHHHHHHhhhcCC---------Cc- Confidence 00 0000 0 00 000001111222334455666777777666653221 11 Q ss_pred cCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCccee-----CCeEEEcCEEEEecCceeeeeccceeE Q lcl|Aclame:pro 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK-----GECAMWRNILVRKYAGMPIRFYQGSKV 312 (404) Q Consensus 238 ~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~-----G~~gm~ngvii~~~~~~~irf~~~~~~ 312 (404) =+++|||..+..|++=.+ +..+|||. |.-+.+.|.+++-.+.+|- T Consensus 176 ------~~~vmn~~~~~~L~~lkd----------------~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~-------- 225 (300) T protein:vir:95 176 ------TGAILDPIFTTALSKMKN----------------AEGGKLYPELAWGGVPDAINGLAVDKNRTVSY-------- 225 (300) T ss_pred ------cEEEECHHHHHHHHHhhc----------------cCCCeeccCccccCCCceecceeeEEecCCCC-------- Confidence 257899999888875321 12255653 5567888877765443320 Q ss_pred EeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHH-HHhchhhccccCCCCCce Q lcl|Aclame:pro 313 LVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAIS-WINGLKKIRFPEKSGKMQ 391 (404) Q Consensus 313 ~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~-~i~G~~K~rF~~~~g~~~ 391 (404) ...... --+|+|--+-++.|+-..+..+.+.+ |+..-+-++. +-.++--.|+.. .- T Consensus 226 ----------~~~~~~-----~~~~~GDf~~~~~~~~~~~~~~~v~~----~~~~d~~~~~~f~~~~v~~r~~~----r~ 282 (300) T protein:vir:95 226 ----------SQTDPK-----NTAIVGDFETMFKWGYAKEVPMEIIK----YGDPDNSGRDLKGYNQIYIRCEA----YI 282 (300) T ss_pred ----------CCCCCc-----cEEEEeeccceEEEEEecccEEEEee----ccCCCCcchhhhhcCcEEEEEEE----ee Confidence 000000 12455654444455543444455443 2221111111 000100011100 01 Q ss_pred EEEEEEEeeeecC Q lcl|Aclame:pro 392 DHGVIAVDTAVKL 404 (404) Q Consensus 392 DfGvi~idta~~~ 404 (404) |++|.--...++| T Consensus 283 d~~v~~~~a~~~l 295 (300) T protein:vir:95 283 GWGIMDAASFARI 295 (300) T ss_pred cceeecccceEEE Confidence 4555443444444 No 82 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=26.95 E-value=1.9 Score=19.06 Aligned_cols=296 Identities=8% Similarity=0.023 Sum_probs=110.7 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhh-------hhhhhhccccccccCCCCCccEEEEecccCCCCcEEEE Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQ-------EAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTF 73 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f 73 (404) ...-.............|....++......-...+ ..|......+-... ...++|..+-....-.+.++++ T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~gg~liP~~~~~~Ii~~~--~~~~~l~~l~~~~~~~~~~~~~ 160 (421) T protein:vir:13 83 IINGDSKEEKRSLQLSAMSKTIRGIQLSEEERDIMSSTNNGAVIPQEFVNEFEKLK--EGYPSLKEHCHVIPVNRNAGKM 160 (421) T ss_pred ccccchhHHHHHHHHHHHHHhhhccchhHHHhhccccCCcceecchhhHHHHHHHH--HhhhhhhhhceeeeccCCceEE Confidence 11122222223333334444333332111000000 00000000000000 0112222111111222233444 Q ss_pred EEeeccccCceecCceeee---ehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 74 SIMHKLSKRPTMGDERVEG---RGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIV 150 (404) Q Consensus 74 ~L~~~L~G~gv~Gd~~leG---nee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~ 150 (404) .....-...+.. ..-|| .+..++|..-++.+....+-|.....|- +-+..||...-+..|.+-+....|..++. T Consensus 161 ~~~~~~~~~~~~--~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell-~ds~~~l~~~i~~~la~~~~~~~~~~i~~ 237 (421) T protein:vir:13 161 PVRAGASVDKLA--NLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNSLL-EDSEINFLEFVNEEFAEFAVNTENAEIVK 237 (421) T ss_pred EEeecCCcccee--eccccccccccccceeEEEeeeeeeEeehhhhHHHH-hhhHHHHHHHHHHHHHHHHHHHhhhhHhh Confidence 433332222110 11133 2445666666777777777776654433 23567777777777776666666655543 Q ss_pred HhhhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCce Q lcl|Aclame:pro 151 HLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPV 230 (404) Q Consensus 151 ~laG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv 230 (404) .+.|.. +.+...+.+-|.++...+.... + T Consensus 238 ~~~g~~----------------------------------------------~~~~~~~~d~i~~~~~~l~~~~---~-- 266 (421) T protein:vir:13 238 QAKAVL----------------------------------------------AEETINDYAGLVKTINSLVPNA---R-- 266 (421) T ss_pred hhhhcc----------------------------------------------ccccccchHHHHHHHHHhhhhh---c-- Confidence 222211 0001112333444444443211 0 Q ss_pred EecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCccee----CCeEEEcCEEEEecCceeeee Q lcl|Aclame:pro 231 RLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK----GECAMWRNILVRKYAGMPIRF 306 (404) Q Consensus 231 ~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~----G~~gm~ngvii~~~~~~~irf 306 (404) .. -+++|||.-+..|++= |. +..+|||. |.-+++.|.+++..+.+|. T Consensus 267 ----~~-------a~~v~n~~~~~~l~~l----------kd------~~G~~i~~~~~~~~~~tl~G~pV~~~~~~~~-- 317 (421) T protein:vir:13 267 ----KR-------AIIVTNSDGRAYLDGL----------MD------KQGRPLLKELSDGGDLVFKGRPVIELEESIF-- 317 (421) T ss_pred ----CC-------CEEEEcHHHHHHHHHh----------hc------CCCceeecCcCCCCCceecceeeEEeccccc-- Confidence 01 2678899888777641 11 12356664 5566788888776554431 Q ss_pred ccceeEEeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccC-chhHHHHHHHhchhhccccC Q lcl|Aclame:pro 307 YQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMD-NRTEIAISWINGLKKIRFPE 385 (404) Q Consensus 307 ~~~~~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g-~~~~i~i~~i~G~~K~rF~~ 385 (404) .. ++ .-.+++|--.-++.++..+++...|..+. +|. +.+.+-+...++.+...-.. T Consensus 318 ----------------~~---~~---~~~~~~gd~~~~~~~~~~~~~~v~~~~~~-~f~~~~~~~r~~~r~d~~~~~~~a 374 (421) T protein:vir:13 318 ----------------DV---GD---ETKFIVSDFKTLIKFMDRKQYLIDQSKEA-GYTKNETIARIIERFDVNSPLDKS 374 (421) T ss_pred ----------------cC---CC---ceEEEEEeccccEEEEEecceEEEeeccc-ccccCeeEEEEEeeecceeecchh Confidence 00 00 12356665333233333345555554432 111 11111111111111110000 Q ss_pred -CCCCceEEEEEEEeeeecC Q lcl|Aclame:pro 386 -KSGKMQDHGVIAVDTAVKL 404 (404) Q Consensus 386 -~~g~~~DfGvi~idta~~~ 404 (404) ..-....+++++-++-++= T Consensus 375 ~~~~~~~~~~a~v~~~~~~~ 394 (421) T protein:vir:13 375 SDAEKIRKFGVIVKLQEVLK 394 (421) T ss_pred hheeeecccceeeccccccC Confidence 0001123333322222111 No 83 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=25.65 E-value=2 Score=18.89 Aligned_cols=286 Identities=9% Similarity=0.003 Sum_probs=120.6 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHHhhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEEEeec-c Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK-L 79 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~L~~~-L 79 (404) ||..|.-..-..++.-++......++..+.. .. -.-.+..+++..... . T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~-~~-----------------------------~~~~~~~~~~p~~~~~~ 50 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLS-AQ-----------------------------KPIPFNGEKVFTFTMDS 50 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhc-ce-----------------------------eeccCCceEEEEEecCc Confidence 7776655444444444443333322221111 10 001111222222211 1 Q ss_pred ccCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhh--hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 80 SKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQ--QRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARG 157 (404) Q Consensus 80 ~G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~--qrs~~dlr~~ar~~L~~w~~~~~D~~~~~~laG~rg 157 (404) ...+|-+++.. .+...+|.+-++.+......+....++=+ .-...+|...-+..|++-+++..|+.+|.-.....| T Consensus 51 ~a~~v~Eg~~~--~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g 128 (298) T protein:vir:94 51 EIDVVAESGKK--THGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLG 128 (298) T ss_pred ceEEeeCCccc--cccccceeEEEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCC Confidence 12233322222 24566676666666666665555433322 135678999999999999999999999722100111 Q ss_pred ccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCceEecCccc Q lcl|Aclame:pro 158 DFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) Q Consensus 158 ~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv~~~g~~~ 237 (404) .+.. ... .+.+ .+..+......+.+....+.|.++.......... T Consensus 129 --~~~~----------~~~--~~~~----------~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~----------- 173 (298) T protein:vir:94 129 --TASA----------VIG--TNHF----------DSKVTQKVEAPRGIADPNGAIENAVELLTGVDAD----------- 173 (298) T ss_pred --cccc----------ccc--cccc----------ccccccccccccccccHHHHHHHHHHhhhhcCCC----------- Confidence 0000 000 0000 0111111112222333345565665555432110 Q ss_pred cCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCccee-----CCeEEEcCEEEEecCceeeeeccceeE Q lcl|Aclame:pro 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK-----GECAMWRNILVRKYAGMPIRFYQGSKV 312 (404) Q Consensus 238 ~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~-----G~~gm~ngvii~~~~~~~irf~~~~~~ 312 (404) .-+++|||..+..|++=.+ ...+|||. |..+.+.|+++.--+.+|- T Consensus 174 -----~~~~vmn~~~~~~l~~lkd----------------~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~-------- 224 (298) T protein:vir:94 174 -----VTGIAINPSFRSALAKQKD----------------LQGNALFPELKWGATPDTINGLPVDVNKTVSD-------- 224 (298) T ss_pred -----ccEEEEcHHHHHHHHHhhc----------------cCCCeeecCcccCCCCceecceeeEEeccccc-------- Confidence 1369999999988876211 11256664 4456777876654333220 Q ss_pred EeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeecccccCchhHHHHHHHhchhhccccCCCCCceE Q lcl|Aclame:pro 313 LVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQD 392 (404) Q Consensus 313 ~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~g~~~~i~i~~i~G~~K~rF~~~~g~~~D 392 (404) ...... . -+|+|--+-++.|+-.+++.+.+.+.....+ ..+. .+-.++--.|.... -| T Consensus 225 -----------~~~~~~---~-~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~--~~~~-~f~~~~v~~r~~~r----~~ 282 (298) T protein:vir:94 225 -----------MSLTQR---D-RAIIGDFANGFKWGYAKEVPLEVIQYGDPDN--SGLD-LKGYNQVYIRAELF----LG 282 (298) T ss_pred -----------ccCCCc---c-EEEEeeccceEEEEEecCceEEEeecCCCcC--cchh-hhhcCcEEEEEEEE----ec Confidence 000110 0 2677866655666654555555543221111 1111 01111111111000 02 Q ss_pred EEEEEEeeeecC Q lcl|Aclame:pro 393 HGVIAVDTAVKL 404 (404) Q Consensus 393 fGvi~idta~~~ 404 (404) +.+.-=...++| T Consensus 283 ~~~~~~~a~~~l 294 (298) T protein:vir:94 283 WGILDATKFARV 294 (298) T ss_pred cEeecccceEEE Confidence 222221222222 No 84 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=23.97 E-value=2.2 Score=18.67 Aligned_cols=293 Identities=11% Similarity=0.011 Sum_probs=112.2 Q ss_pred CCCcCchHHHHHHHHHHHHHhhccchHHHHHH------hhhhhhhhhccccccccCCCCCccEEEEecccCCCCcEEEEE Q lcl|Aclame:pro 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILT------EQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFS 74 (404) Q Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~gt~~~~~I~~~~dL~k~~Gd~v~f~ 74 (404) +.+-.....++....--|....+.. ..+-++ +....+......+-. .-.+.++|..+-....-.|...++. T Consensus 62 ~~~~~~~~~~~~~~~~~~~~~l~~~-~~~a~~~~t~~~gg~~vP~~~~~~ii~--~~~~~s~i~~~~~~~~~~~~~~~~~ 138 (371) T protein:vir:81 62 KEPLKPTVQVKENEVEAFVNHIRTR-FRNAMSEGSNQDGGYTVPQDIQTRINE--LRESKDALQNLITVEPVTTLSGSRV 138 (371) T ss_pred ccccccchhhHHHHHHHHHHHHHHH-HHHhhccCCCccCceeecHhHHHHHHH--HHHhhhhhhhhceeeeccCCceeEE Confidence 1111111111111222222221110 000010 000011110001100 0012233322111111112223332 Q ss_pred EeeccccCceecCceeeee----hhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 75 IMHKLSKRPTMGDERVEGR----GEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIV 150 (404) Q Consensus 75 L~~~L~G~gv~Gd~~leGn----ee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~~~~ 150 (404) ....-++ +. ..-.-||. ....+|..-++.+....+-+.....+- +-+.+||...-.+.|.+=+....|..++. T Consensus 139 ~~~~~~~-~~-a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~a~~~~~~~~i~~ 215 (371) T protein:vir:81 139 FKKRSQQ-TG-FVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELL-NDSTEAIVNTLVRWIGDESRVTRNGLIIN 215 (371) T ss_pred EEeecCC-cc-eeeeccccccccccccceeeEEeeeeEEEEeehhhHHHH-hhhhHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 2222111 11 11223332 233566677777777777665544332 34567888888888888888888877752 Q ss_pred HhhhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCCCce Q lcl|Aclame:pro 151 HLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPV 230 (404) Q Consensus 151 ~laG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi~Pv 230 (404) |. | + .+|+ ...+.+-|..+... .+.|. T Consensus 216 ---g~-g--~----------------------~~~~-------------------~~~~~~~i~~~~~~------~l~~~ 242 (371) T protein:vir:81 216 ---VL-N--T----------------------KAKT-------------------AIADLDGLKQIINV------QLDPV 242 (371) T ss_pred ---hc-c--c----------------------cccc-------------------ccccHHHHHHHHHh------hcchh Confidence 11 0 0 0111 01111212111111 11111 Q ss_pred EecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCccee-----CCeEEEcCEEEEecCceeee Q lcl|Aclame:pro 231 RLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK-----GECAMWRNILVRKYAGMPIR 305 (404) Q Consensus 231 ~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~-----G~~gm~ngvii~~~~~~~ir 305 (404) --. . =+++|||.-+..|++=.+ +..+|||. |.-+.+.|.++.....+|+. T Consensus 243 ~~~--~-------a~~vmn~~~~~~L~~lkd----------------~~g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~ 297 (371) T protein:vir:81 243 FRS--T-------SSVIVNQDAFNWLDTLKD----------------QNGQYLLQPSISSPTGRQLLGLPVVIVSNKVLA 297 (371) T ss_pred hhc--C-------CEEEEcHHHHHHHHHhhc----------------cCCCeeeecccCCCCCceecceeEEEecccccC Confidence 100 1 167899988888875211 12366775 44578889888776666542 Q ss_pred eccceeEEeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeeccccc--CchhHHHHHHHhchhhccc Q lcl|Aclame:pro 306 FYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDM--DNRTEIAISWINGLKKIRF 383 (404) Q Consensus 306 f~~~~~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~--g~~~~i~i~~i~G~~K~rF 383 (404) .. ...+.. ...-.+++|-=.-.+-++...++...|.++..|+ .+.+.+-+...+|.+-. T Consensus 298 ~~-----------~~~~~~------~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~-- 358 (371) T protein:vir:81 298 NR-----------VDGGTG------AQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMR-- 358 (371) T ss_pred cc-----------cccccc------CCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEe-- Confidence 11 000000 0112466663221112222245566666665543 23333333333333222 Q ss_pred cCCCCCceEEEEEEEeee Q lcl|Aclame:pro 384 PEKSGKMQDHGVIAVDTA 401 (404) Q Consensus 384 ~~~~g~~~DfGvi~idta 401 (404) +.+=|-++-+-+| T Consensus 359 -----~~~a~~~~~~~~A 371 (371) T protein:vir:81 359 -----DDEAFVFGEVQLA 371 (371) T ss_pred -----cccceEEEEEecC Confidence 1223444444444 No 85 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=22.11 E-value=2.5 Score=18.40 Aligned_cols=291 Identities=9% Similarity=0.047 Sum_probs=100.3 Q ss_pred CC-CcCchHHHHHHHHHHHHHhhccch--HHHHHH------hhhhhhhhhccccccccCCCCCccEE---EEecccCCCC Q lcl|Aclame:pro 1 MT-TVTSAQANKLYQVALFTAANRNRS--MVNILT------EQQEAPKAVSPDKKSTKQTSAGAPVV---RITDLNKQAG 68 (404) Q Consensus 1 ~~-~~~~~~a~~~~~~~lft~~~~~~~--~~~~~~------~~~~~~~~~~~~~~~~~gt~~~~~I~---~~~dL~k~~G 68 (404) +. .-....+....+ |....+... ..+..+ +....+......+-... -..++|. .+..++...| T Consensus 79 ~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~--~~~~~l~~~~~~~~~~~~~~ 153 (397) T protein:vir:48 79 LTKSEEEVKAGFVKD---FKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLV--RQYDSLQEYVNVENVTTLTG 153 (397) T ss_pred ccchhhHHHHHHHHH---HHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHH--HHHHHHHhhhceeeccCCcc Confidence 00 000000000000 111111111 000000 00000000000000000 0111111 1111222233 Q ss_pred cEEEEEEeeccc-cCceecCceeeeehhhhhhcccEEEEeeeccccccCcchhhhhhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 69 DEVTFSIMHKLS-KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQC 147 (404) Q Consensus 69 d~v~f~L~~~L~-G~gv~Gd~~leGnee~L~~~sd~v~Idq~R~~V~~~gkms~qrs~~dlr~~ar~~L~~w~~~~~D~~ 147 (404) ..+.+.....-. ...+.+.+... .....+|..=++.+....+-+.....+= +.+.+||...-++.|++=+....|.. T Consensus 154 ~~~~~~~~~~~~~a~~v~E~~~~~-~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~v~~~l~~~~~~~~d~~ 231 (397) T protein:vir:48 154 SRVYEKWADITGLAKLDDEAGSIG-TNDDPKLYPIRYAIKRYAGISTVTNSLL-ADSAENILAWLSGWIAKKVVVTRNKA 231 (397) T ss_pred eEEEEeecCCCcceeeeccccccc-cccccceeeEEeeheeeeeehhhHHHHH-hhchHHHHHHHHHHHHHHHHHHHHHH Confidence 322222211111 11222111110 1123455555555655555555443332 35789999999999999999999998 Q ss_pred HHHHhhhcccccccccceeeccccccccccccCccCCCCCCceEeccCCccccccccccccCHHHHHHHHHHHHhcCCCC Q lcl|Aclame:pro 148 AIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPL 227 (404) Q Consensus 148 ~~~~laG~rg~~~n~~~~~p~~~~~~~~~~~~N~v~apt~~r~~~~~~at~~~~i~a~D~~s~~~Id~a~~~a~~~~~pi 227 (404) +| .|.- ++ + ++ ..+ .+.+-|.++....+.. T Consensus 232 il---~G~g---~~------------------~---~~--------------~~~-----~~~d~i~~~~~~l~~~---- 261 (397) T protein:vir:48 232 IL---EAIA---TL------------------P---TK--------------PTL-----TKWDDIIDLQAKVDPA---- 261 (397) T ss_pred Hh---hccc---cc------------------c---cc--------------ccc-----ccHHHHHHHHHHhhhh---- Confidence 86 2211 00 0 00 001 1222233333332211 Q ss_pred CceEecCccccCCccEEEEEEchHHHHHHHhCcchHHHHHHHHHhhhccccccCccee-----CCeEEEcCEEEEecCce Q lcl|Aclame:pro 228 QPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFK-----GECAMWRNILVRKYAGM 302 (404) Q Consensus 228 ~Pv~~~g~~~~~~~~~yV~~l~P~q~~dLr~d~~~~~w~~~qk~A~ar~~g~~nPlF~-----G~~gm~ngvii~~~~~~ 302 (404) -.. . -+++|||..+..|++= |. +..+|||. |.-++++|.+++..+.. T Consensus 262 ---~~~--~-------a~~v~n~~~~~~L~~l----------kd------~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~ 313 (397) T protein:vir:48 262 ---IKQ--T-------SFFLTNTSGFTALKKV----------KN------AFGDYLMERDVKSPTGYSIDGFAVKEVADR 313 (397) T ss_pred ---hcC--C-------CEEEECHHHHHHHHHh----------hc------CCCceeeccCcCCCCCceeccceeEEeccc Confidence 010 1 3678999998888752 11 12356764 45578888776543321 Q ss_pred eeeeccceeEEeecCcccccccccccccchhhheeeccceeEEEeeecCCCCcceeeccccc-C-chhHHHHHHHh---- Q lcl|Aclame:pro 303 PIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDM-D-NRTEIAISWIN---- 376 (404) Q Consensus 303 ~irf~~~~~~~~~~~~~~a~~~~~aa~~~v~ralLlGaQAl~~A~g~~~g~r~~w~Ee~~D~-g-~~~~i~i~~i~---- 376 (404) + ......+ .-.+++|-=.-.+.++..++......++..++ . +.+.+-+...+ T Consensus 314 ~----------------~~~~~~~------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~ 371 (397) T protein:vir:48 314 W----------------LANASSG------AMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVA 371 (397) T ss_pred c----------------cCCcCCC------ceEEEEEeccceEEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEE Confidence 0 0000011 12355663221111222233333332221111 0 11111111111 Q ss_pred ----chhhccccCCCCCceEEEEEEE Q lcl|Aclame:pro 377 ----GLKKIRFPEKSGKMQDHGVIAV 398 (404) Q Consensus 377 ----G~~K~rF~~~~g~~~DfGvi~i 398 (404) ++.++.+.......-|++.++| T Consensus 372 ~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:48 372 TDTESFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred ecccceEEEEecccccCCCCccccCC Confidence 1222222222122223333333 Done!