Query         lcl|Aclame:protein:vir:100939|NCBI_annot:Gp5|genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089
Match_columns 430
No_of_seqs    66 out of 79
Neff          6.1
Searched_HMMs 1612
Date          Sun Dec  1 07:40:43 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_23 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_23_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:9265 Length: 430 # 100.0  2E-164  1E-167  918.1  37.5  430    1-430     1-430 (430)
  2 protein:vir:100939 Length: 430 100.0  2E-164  1E-167  918.1  37.5  430    1-430     1-430 (430)
  3 protein:vir:2106 Length: 430 # 100.0  3E-164  2E-167  917.4  37.2  430    1-430     1-430 (430)
  4 protein:vir:105522 Length: 423 100.0  1E-115  7E-119  650.8  32.6  401    1-428     1-423 (423)
  5 protein:vir:108303 Length: 418 100.0  1E-112  8E-116  634.1  32.9  400    1-430     1-417 (418)
  6 protein:vir:105374 Length: 423 100.0  1E-112  8E-116  633.9  31.4  396    1-428     1-423 (423)
  7 protein:vir:174 Length: 423 #  100.0  6E-112  4E-115  630.2  32.2  401    1-428     1-423 (423)
  8 protein:vir:3525 Length: 423 # 100.0  2E-111  1E-114  627.7  30.7  401    1-428     1-423 (423)
  9 protein:vir:99075 Length: 392  100.0 3.4E-39 2.1E-42  231.5  22.7  374    1-410     1-392 (392)
 10 protein:vir:7990 Length: 273 # 100.0   1E-31 6.5E-35  190.4  20.5  268    1-429     1-273 (273)
 11 protein:vir:102605 Length: 273 100.0 1.2E-31 7.3E-35  190.2  19.7  268    1-429     1-273 (273)
 12 protein:vir:105822 Length: 273 100.0 1.2E-31 7.3E-35  190.2  19.7  268    1-429     1-273 (273)
 13 protein:vir:94622 Length: 341  100.0 1.1E-31   7E-35  190.3  16.5  317    1-430     3-341 (341)
 14 protein:vir:80180 Length: 381   99.9   1E-26 6.4E-30  163.1  17.3  346    1-430    15-381 (381)
 15 protein:vir:1541 Length: 347 #  99.8 1.7E-21 1.1E-24  134.4  17.5  299    1-430     1-345 (347)
 16 protein:vir:3364 Length: 347 #  99.8 6.1E-21 3.8E-24  131.4  15.5  297    1-430     1-345 (347)
 17 protein:vir:94711 Length: 347   99.7 4.5E-20 2.8E-23  126.7  13.5  301    1-430     1-346 (347)
 18 protein:vir:10450 Length: 344   99.7   6E-20 3.8E-23  125.9  13.7  298    1-429     1-344 (344)
 19 protein:vir:8885 Length: 347 #  99.7 9.2E-19 5.7E-22  119.5  16.3  300    1-430     1-347 (347)
 20 protein:vir:94576 Length: 347   99.7   1E-18 6.4E-22  119.2  15.3  300    1-429     1-347 (347)
 21 protein:vir:78739 Length: 332   99.7   4E-19 2.5E-22  121.5  12.8  291    1-427     9-332 (332)
 22 protein:vir:96123 Length: 274   99.7 7.8E-18 4.8E-21  114.4  17.8  258    1-430     1-270 (274)
 23 protein:vir:93742 Length: 274   99.6 9.5E-18 5.9E-21  113.9  16.6  258    1-430     1-270 (274)
 24 protein:vir:1239 Length: 274 #  99.6 2.4E-17 1.5E-20  111.7  17.2  258    1-430     1-270 (274)
 25 protein:vir:2201 Length: 345 #  99.6 2.9E-17 1.8E-20  111.2  16.7  299    1-429     1-345 (345)
 26 protein:vir:3613 Length: 272 #  99.6 2.6E-17 1.6E-20  111.5  15.3  258    1-313     1-272 (272)
 27 protein:vir:3136 Length: 322 #  99.6   2E-17 1.3E-20  112.1  13.8  284    1-342     1-322 (322)
 28 protein:vir:100057 Length: 375  99.6 5.5E-16 3.4E-19  104.2  19.1  322    1-430    11-371 (375)
 29 protein:vir:95898 Length: 274   99.5 5.9E-16 3.7E-19  104.1  15.2  262    1-308     1-274 (274)
 30 protein:vir:96262 Length: 274   99.5 5.9E-16 3.7E-19  104.1  15.2  262    1-308     1-274 (274)
 31 protein:vir:94494 Length: 274   99.5 7.9E-16 4.9E-19  103.4  14.8  262    1-308     1-274 (274)
 32 protein:vir:97433 Length: 274   99.5 7.9E-16 4.9E-19  103.4  14.8  262    1-308     1-274 (274)
 33 protein:vir:80930 Length: 278   99.5   1E-15 6.3E-19  102.8  14.4  265    1-304     1-278 (278)
 34 protein:vir:103323 Length: 364  99.5 1.6E-14   1E-17   96.2  18.6  299    1-430     1-340 (364)
 35 protein:vir:99675 Length: 324   99.4 2.5E-15 1.5E-18  100.7  13.5  272   26-430     1-297 (324)
 36 protein:vir:96833 Length: 275   99.4   3E-14 1.9E-17   94.7  16.8  258    1-315     3-275 (275)
 37 protein:vir:3033 Length: 272 #  99.4 6.9E-14 4.3E-17   92.7  18.4  257    1-430     1-269 (272)
 38 protein:vir:9820 Length: 272 #  99.4 6.9E-14 4.3E-17   92.7  18.4  257    1-430     1-269 (272)
 39 protein:vir:105334 Length: 276  99.4 1.1E-13 6.5E-17   91.7  17.2  263    1-337     1-276 (276)
 40 protein:vir:80213 Length: 334   99.3 4.4E-14 2.7E-17   93.8  13.1  290    1-430     1-333 (334)
 41 protein:vir:739 Length: 231 #   99.2 1.9E-13 1.2E-16   90.3  12.0  223   38-313     1-231 (231)
 42 protein:vir:78935 Length: 335   99.1 3.7E-12 2.3E-15   83.3  13.6  290    1-430     1-330 (335)
 43 protein:vir:6324 Length: 335 #  99.0 1.8E-11 1.1E-14   79.5  15.0  290    1-430     1-330 (335)
 44 protein:vir:97031 Length: 402   99.0 7.8E-12 4.9E-15   81.5  11.2  291    1-430     1-340 (402)
 45 protein:vir:107120 Length: 329  98.8 1.2E-09 7.2E-13   69.6  18.2  259    1-430    30-306 (329)
 46 protein:vir:95107 Length: 270   98.8 6.9E-10 4.3E-13   70.8  15.5  257    1-320     1-270 (270)
 47 protein:vir:102655 Length: 322  98.8 1.1E-09 6.6E-13   69.8  16.0  277    1-320    13-322 (322)
 48 protein:vir:7019 Length: 401 #  98.7 4.6E-10 2.9E-13   71.8  12.6  291    1-430     1-334 (401)
 49 protein:vir:79008 Length: 299   98.7 2.3E-09 1.4E-12   67.9  15.7  289    1-335     1-299 (299)
 50 protein:vir:94800 Length: 319   98.6 3.1E-08 1.9E-11   61.7  18.0  286    1-340    19-319 (319)
 51 protein:vir:97331 Length: 319   98.6 3.1E-08 1.9E-11   61.7  18.0  286    1-340    19-319 (319)
 52 protein:vir:1781 Length: 221 #  98.6 2.1E-09 1.3E-12   68.2  11.2  203   79-401     1-221 (221)
 53 protein:vir:105645 Length: 400  98.5 3.8E-09 2.4E-12   66.7  12.2  291    1-430     1-334 (400)
 54 protein:vir:78920 Length: 290   97.9 4.7E-06 2.9E-09   49.8  16.6  274    1-340     1-290 (290)
 55 protein:vir:102335 Length: 312  97.4 8.1E-05   5E-08   43.0  17.2  297    1-337     1-312 (312)
 56 protein:vir:79712 Length: 285   96.8 0.00016   1E-07   41.4  13.4  269    1-306     1-285 (285)
 57 protein:vir:95875 Length: 401   95.8  0.0014 8.7E-07   36.2  14.2  312    1-430    19-400 (401)
 58 protein:vir:105464 Length: 346  95.8  0.0015 9.2E-07   36.1  13.6  315    1-356     1-346 (346)
 59 protein:vir:102944 Length: 330  94.6  0.0039 2.4E-06   33.8  14.3  296    1-370     1-330 (330)
 60 protein:vir:5974 Length: 324 #  92.4   0.012 7.3E-06   31.2  14.6  297    1-358     1-324 (324)
 61 protein:vir:1583 Length: 351 #  92.2   0.012 7.7E-06   31.0  16.0  316    1-349     1-351 (351)
 62 protein:vir:9927 Length: 295 #  91.8   0.014 8.8E-06   30.7  11.4  251    1-304     1-295 (295)
 63 protein:vir:78090 Length: 302   88.7   0.031 1.9E-05   28.9  13.9  269    1-302     1-302 (302)
 64 protein:vir:99523 Length: 311   87.5   0.039 2.4E-05   28.3  14.8  284    1-333     8-311 (311)
 65 protein:vir:4339 Length: 395 #  86.1   0.048   3E-05   27.8  16.8  266    1-367   117-395 (395)
 66 protein:vir:108211 Length: 318  79.4     0.1 6.5E-05   26.0  12.9  273    1-346     1-318 (318)
 67 protein:vir:95451 Length: 313   76.0    0.14 8.7E-05   25.3  16.8  282    1-430     1-312 (313)
 68 protein:vir:104085 Length: 320  75.0    0.15 9.4E-05   25.1  16.0  285    1-371    14-320 (320)
 69 protein:vir:2430 Length: 318 #  73.8    0.17  0.0001   24.9  17.7  269    1-430    14-313 (318)
 70 protein:vir:7771 Length: 330 #  67.5    0.25 0.00016   23.9  18.7  291    1-430     1-323 (330)
 71 protein:vir:106647 Length: 303  64.8    0.29 0.00018   23.5  11.5  262    1-335     1-303 (303)
 72 protein:vir:3870 Length: 400 #  59.7    0.39 0.00024   22.8  14.4  256    1-368   137-400 (400)
 73 protein:vir:9875 Length: 296 #  59.1     0.4 0.00025   22.8  11.0  257    1-340     1-296 (296)
 74 protein:vir:1638 Length: 298 #  55.6    0.47 0.00029   22.4  16.8  266    1-366     1-298 (298)
 75 protein:vir:80446 Length: 367   54.8     0.5 0.00031   22.3  15.3  295    1-344     1-367 (367)
 76 protein:vir:94771 Length: 298   52.1    0.56 0.00035   21.9  15.6  274    1-336     1-298 (298)
 77 protein:vir:1025 Length: 408 #  50.6     0.6 0.00037   21.8  16.6  278    1-382   116-408 (408)
 78 protein:vir:191 Length: 385 #   49.1    0.65  0.0004   21.6  16.5  266    1-368   105-385 (385)
 79 protein:vir:1886 Length: 385 #  49.1    0.65  0.0004   21.6  16.5  266    1-368   105-385 (385)
 80 protein:vir:3426 Length: 117 #  48.9    0.49  0.0003   22.3   5.8  112  179-340     1-117 (117)
 81 protein:vir:2344 Length: 397 #  48.4    0.67 0.00042   21.5  18.9  280    1-430    10-306 (397)
 82 protein:vir:78523 Length: 338   42.6    0.88 0.00054   20.9  18.9  291    1-430    12-335 (338)
 83 protein:vir:6212 Length: 434 #  40.3    0.98 0.00061   20.6  14.1  271    1-350   141-434 (434)
 84 protein:vir:4830 Length: 397 #  35.4     1.2 0.00076   20.1  13.4  267    1-342   109-397 (397)
 85 protein:vir:9704 Length: 394 #  31.4     1.5 0.00093   19.6  14.5  240    1-315   127-394 (394)
 86 protein:vir:102082 Length: 392  31.1     1.5 0.00095   19.6  12.7  269    1-336   106-392 (392)
 87 protein:vir:107593 Length: 392  31.1     1.5 0.00095   19.6  12.7  269    1-336   106-392 (392)
 88
protein:vir:105004 Length: 392 31.1 1.5 0.00095 19.6 12.7 269 1-336 106-392 (392) 89 protein:vir:102873 Length: 392 31.1 1.5 0.00095 19.6 12.7 269 1-336 106-392 (392) 90 protein:vir:101607 Length: 379 27.6 1.8 0.0011 19.2 16.4 258 1-313 109-379 (379) 91 protein:vir:4226 Length: 326 # 26.9 1.9 0.0012 19.1 17.9 279 1-430 22-323 (326) 92 protein:vir:9574 Length: 300 # 26.8 1.9 0.0012 19.0 16.0 275 1-372 1-300 (300) 93 protein:vir:100135 Length: 418 25.7 2 0.0013 18.9 15.5 269 1-337 136-418 (418) 94 protein:vir:8102 Length: 543 # 25.6 2 0.0013 18.9 16.5 275 1-368 249-543 (543) 95 protein:vir:395 Length: 117 # 23.6 2.3 0.0014 18.6 6.9 112 170-350 1-117 (117) 96 protein:vir:81100 Length: 415 22.7 2.4 0.0015 18.5 18.2 278 1-378 120-415 (415) 97 protein:vir:79987 Length: 415 22.7 2.4 0.0015 18.5 18.2 278 1-378 120-415 (415) 98 protein:vir:98339 Length: 415 22.7 2.4 0.0015 18.5 18.2 278 1-378 120-415 (415) 99 protein:vir:4856 Length: 293 # 22.3 2.5 0.0015 18.4 12.9 273 1-342 5-293 (293) 100 protein:vir:94673 Length: 419 22.0 2.5 0.0016 18.4 15.8 266 1-430 121-417 (419) 101 protein:vir:78223 Length: 333 21.6 2.6 0.0016 18.3 16.4 287 1-369 20-333 (333) 102 protein:vir:8187 Length: 311 # 21.0 2.7 0.0017 18.2 15.6 285 1-368 1-311 (311) 103 protein:vir:80684 Length: 315 20.9 2.7 0.0017 18.2 15.5 291 1-337 1-315 (315) 104 protein:vir:1383 Length: 421 # 20.7 2.7 0.0017 18.2 17.6 260 1-430 117-384 (421) No 1 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=100.00 E-value=2e-164 Score=918.08 Aligned_cols=430 Identities=100% Similarity=1.403 Sum_probs=422.4 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccCCccCCccCCCCcceEEEEec Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVNMG 80 (430) Q Consensus 1 
MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~~~s~~~~d~~e~sV~v~l~ 80 (430) |||||+++++++++|+|+.||+++||+|++++||+++++++|+|||||||+|+++++++|++.+.+++|++|++||++|+ T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~G~~~t~~~~~i~e~~v~~~v~ 80 (430) T protein:vir:92 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVNMG 80 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccccccCcccCCCCCccccceEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccceEecHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHHHHHHHHhCC Q lcl|Aclame:pro 81 EPDNDFFQLRADDLRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSREL 160 (430) Q Consensus 81 ~~k~V~~~~t~keL~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a~~~L~~~~a 160 (430) +||+|+|+|++|||++++++||+|+|||++|||+||+||+++++++++++.+.+...++.+...|+|+++++++|++++| T Consensus 81 ~~k~V~~~~~~kel~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L~~~~v 160 (430) T protein:vir:92 81 EPDNDFFQLRADDLRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSREL 160 (430) T ss_pred eeccceEEechhHhcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHHHHHHHHhcC Confidence 99999999999999999999999999999999999999999999999999887767777777889999999999999999 Q ss_pred CcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccccccceecccceeeeeE Q lcl|Aclame:pro 161 NRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVA 240 (430) Q Consensus 161 P~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~ 240 (430) |++++|++||||+++++|++++++++++++.+++|||+|+|||+++||||+|+++++++|++|+++++||+||+++++++ T Consensus 161 P~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~ 240 (430) T protein:vir:92 161 NRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVA 240 (430) T ss_pred 
CCCCCcEEEeChHHHHHHHhhhccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCceecccccccccc Confidence 99888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEEeecccccccccc Q lcl|Aclame:pro 241 WQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSL 320 (430) Q Consensus 241 ~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~ 320 (430) ++++.++++.++|++..++++|+||+||+||+|||+|||+|||+|||+++++|||+|+++.++++|+|||+|+|+++++. T Consensus 241 ~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~ 320 (430) T protein:vir:92 241 WQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSL 320 (430) T ss_pred ceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEeccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEecCCCcEEEEEEeeccccc Q lcl|Aclame:pro 321 SPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDIST 400 (430) Q Consensus 321 ~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~ 400 (430) ++++++|||||++|||+++|||+|.+++++||+||||||+|+|||||+|+|+++++.++++++|++||++||++|||+++ T Consensus 321 ~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~ 400 (430) T protein:vir:92 321 SPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDIST 400 (430) T ss_pred cccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEeccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 401 LSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 401 ~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) ++++||||+|||+|+|||||+||+|+||+| T Consensus 401 ~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 
430 (430) T protein:vir:92 401 LSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred CceEEEEeeeccceecCcceEEEEcCCCCC Confidence 999999999999999999999999999999 No 2 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=100.00 E-value=2e-164 Score=918.08 Aligned_cols=430 Identities=100% Similarity=1.403 Sum_probs=422.4 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccCCccCCccCCCCcceEEEEec Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVNMG 80 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~~~s~~~~d~~e~sV~v~l~ 80 (430) |||||+++++++++|+|+.||+++||+|++++||+++++++|+|||||||+|+++++++|++.+.+++|++|++||++|+ T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~G~~~t~~~~~i~e~~v~~~v~ 80 (430) T protein:vir:10 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVNMG 80 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccccccCcccCCCCCccccceEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccceEecHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHHHHHHHHhCC Q lcl|Aclame:pro 81 EPDNDFFQLRADDLRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSREL 160 (430) Q Consensus 81 ~~k~V~~~~t~keL~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a~~~L~~~~a 160 (430) +||+|+|+|++|||++++++||+|+|||++|||+||+||+++++++++++.+.+...++.+...|+|+++++++|++++| T Consensus 81 ~~k~V~~~~~~kel~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L~~~~v 160 (430) T protein:vir:10 81 EPDNDFFQLRADDLRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSREL 160 (430) T ss_pred eeccceEEechhHhcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHHHHHHHHhcC Confidence 
99999999999999999999999999999999999999999999999999887767777777889999999999999999 Q ss_pred CcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccccccceecccceeeeeE Q lcl|Aclame:pro 161 NRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVA 240 (430) Q Consensus 161 P~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~ 240 (430) |++++|++||||+++++|++++++++++++.+++|||+|+|||+++||||+|+++++++|++|+++++||+||+++++++ T Consensus 161 P~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~ 240 (430) T protein:vir:10 161 NRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVA 240 (430) T ss_pred CCCCCcEEEeChHHHHHHHhhhccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCceecccccccccc Confidence 99888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEEeecccccccccc Q lcl|Aclame:pro 241 WQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSL 320 (430) Q Consensus 241 ~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~ 320 (430) ++++.++++.++|++..++++|+||+||+||+|||+|||+|||+|||+++++|||+|+++.++++|+|||+|+|+++++. 
T Consensus 241 ~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~ 320 (430) T protein:vir:10 241 WQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSL 320 (430) T ss_pred ceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEeccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEecCCCcEEEEEEeeccccc Q lcl|Aclame:pro 321 SPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDIST 400 (430) Q Consensus 321 ~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~ 400 (430) ++++++|||||++|||+++|||+|.+++++||+||||||+|+|||||+|+|+++++.++++++|++||++||++|||+++ T Consensus 321 ~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~ 400 (430) T protein:vir:10 321 SPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDIST 400 (430) T ss_pred cccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEeccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 401 LSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 401 ~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) ++++||||+|||+|+|||||+||+|+||+| T Consensus 401 ~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:10 401 LSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred CceEEEEeeeccceecCcceEEEEcCCCCC Confidence 999999999999999999999999999999 No 3 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=100.00 E-value=2.6e-164 Score=917.38 Aligned_cols=430 Identities=99% Similarity=1.396 Sum_probs=423.4 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccCCccCCccCCCCcceEEEEec Q 
lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVNMG 80 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~~~s~~~~d~~e~sV~v~l~ 80 (430) ||||++++++|+++|+|+.||++|||+|+|++||+++++++|+||||+||+|+++++++|++.+.+++|++|++||++|+ T Consensus 1 Ma~~~~~~lti~~~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~~~G~~~t~~~~~~~e~~v~~~~~ 80 (430) T protein:vir:21 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVNMG 80 (430) T ss_pred CccccchhhHHHHHHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeeccccccccccccccCCCccceeeeEeEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccceEecHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHHHHHHHHhCC Q lcl|Aclame:pro 81 EPDNDFFQLRADDLRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSREL 160 (430) Q Consensus 81 ~~k~V~~~~t~keL~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a~~~L~~~~a 160 (430) +||+|+|+|++|||++.+++||+|+|||++|||+||+||+++++++++++.+.+...++.+..+|+|+++++++|+++++ T Consensus 81 ~~~~V~~~~~~kEl~~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L~~~~v 160 (430) T protein:vir:21 81 EPDNDFFQLRADDLRDETAYRRRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEEIMFSREL 160 (430) T ss_pred eeccceEEeehhHhcChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccccCCCCCCCCcchhhHHHHHHHHHHhcC Confidence 99999999999999999999999999999999999999999999999999988777777788899999999999999999 Q ss_pred CcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccccccceecccceeeeeE Q lcl|Aclame:pro 161 NRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVA 240 (430) Q Consensus 161 P~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~ 240 (430) |++++|++|+||++++++++++++++++++.+++|||+|+|||+++||||+|+++++|+|++|+++++||+||+|+++++ T Consensus 161 P~~~~R~~~~~p~~~~~l~~~l~~~~~~~~~~~~A~r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~tv~gA~~~~~~~ 240 (430) T protein:vir:21 161 
NRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVA 240 (430) T ss_pred CCCCCcEEEeChHHHHHHhhhhccccccccchhHHHhhcccccccchhhhhhhcCCcccccCccCcCceecccccccccc Confidence 99889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEEeecccccccccc Q lcl|Aclame:pro 241 WQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSL 320 (430) Q Consensus 241 ~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~ 320 (430) ++++.++++.++|++..++++|+||+||+||+|||+|||+|||+|||+++++|||+|+++.++++|+|||+|+|+++++. T Consensus 241 ~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~v~~itk~~~~~l~qf~V~a~~~~ttv~I~Pai~~~~~~~~ 320 (430) T protein:vir:21 241 WQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSL 320 (430) T ss_pred ceeccccccccccccceeeeeecccceecccEEEecceeeeccccccccCCcceEEEEEecCCceeEEeecccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEecCCCcEEEEEEeeccccc Q lcl|Aclame:pro 321 SPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDIST 400 (430) Q Consensus 321 ~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~ 400 (430) ++++++|||||++|||+++|||+|.+++++||+|||+||+|+|||||+|+|+++++.++++++|++|||+||++|||+++ T Consensus 321 ~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh~~A~~La~~pl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~ 400 (430) T protein:vir:21 321 SPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDIST 400 (430) T ss_pred cccccccceeccccccCceeEEeccCCcccceeEccceeEEEEecccCCCChhHhhheeeeeccccceEEEEEEcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 401 LSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 401 
~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) ++++||||+|||+++|||||+||+|+||+| T Consensus 401 ~~~~~r~DilyG~~~l~Pe~a~v~l~g~~~ 430 (430) T protein:vir:21 401 LSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred CceEEEEEeecCccccCcceEEEEcCCCCC Confidence 999999999999999999999999999999 No 4 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=100.00 E-value=1.1e-115 Score=650.76 Aligned_cols=401 Identities=16% Similarity=0.201 Sum_probs=341.4 Q ss_pred Cccchhh-HHHHHHHHHHHHHHhhcccchhhcccCChHHH--HhhcCCEEEEecCcccccccCC--ccCCc-cCCCCcce Q lcl|Aclame:pro 1 MALNEGQ-IVTLAVDEIIETISAITPMAQKAKKYTPPAAS--MQRSSNTIWMPVEQESPTQEGW--DLTDK-ATGLLELN 74 (430) Q Consensus 1 MAn~~~~-~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~--~~k~GdTV~ip~P~~~~~~~g~--~~s~~-~~d~~e~s 74 (430) |||||.+ .-+++.+++|+.|++++||+++| +|+|+.+ .+|+||||+||+|..++.+++. +.+++ .++++|++ T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV--~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~~~~~l~e~~ 78 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTV--DRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGKSKNSLISAK 78 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhh--ccCCCccccccccCCEEEEeeCCceeeecccCcccCcccccccccce Confidence 9999986 45888999999999999999999 5666555 4789999999999999988754 33443 57899999 Q ss_pred EEEEeccccccceEecHHHhc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHHH Q lcl|Aclame:pro 75 VAVNMGEPDNDFFQLRADDLR--DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAE 152 (430) Q Consensus 75 V~v~l~~~k~V~~~~t~keL~--~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a~ 152 (430) |+++||++|+++|+|+++|+. +.+| +|+|+|||++||++||.+|++.++..+++..++++.. 
...|+++++++ T Consensus 79 v~l~id~~k~~a~~v~d~E~~l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~vgt~~t~----~~a~~~~a~a~ 153 (423) T protein:vir:10 79 ATGEVGNYITVAVEYRQIEEALKLNQL-DQILVPINERMVTDLETELALFMMKHGALSLGSPNTP----IKKWSDVAQTA 153 (423) T ss_pred EEEEecceeeeeeeeChHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc----cccHHHHHHHH Confidence 999999999999999999964 4566 8999999999999999999988888888877765542 24699999999 Q ss_pred HHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccc-cccchhhhHHHhCCCcceecccccc-ccee Q lcl|Aclame:pro 153 ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTI-QRQVAGFDDVLRSPKLPVLTKSTAT-GITV 230 (430) Q Consensus 153 ~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~i-gr~~~Gfd~~~~~~~~~~~~~gt~~-~~tV 230 (430) ++|+++++|++ +|.+|++|+.+++|++++..+++.++..+++||+|+| || ++|||+ |+|+++|.|++|+.+ ..++ T Consensus 154 ~~L~~~~vP~~-~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~~~i~G~-~~GFdi-~~Sn~vp~~T~g~~~ga~~~ 230 (423) T protein:vir:10 154 SFLKDLGINSG-ENYAVMDPWAAQRLADAQSGLHVSEQLVRTAWENAQISGN-FGGIRA-LMSNGLASRTQGAFGGKLTV 230 (423) T ss_pred HHHhhccCCcC-CCEEEeCHHHHHHHhhhhhhhccccccchHHHHhccccee-ecceEE-EEecCCcccccccccceeee Confidence 99999999995 6889999999999999999999999999999999988 75 899996 579999999999966 4677 Q ss_pred cccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccc-----cccccceEEEEEee---- Q lcl|Aclame:pro 231 SGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKN-----VLAQDATFSVVRVV---- 301 (430) Q Consensus 231 ~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~-----~~~~l~~fvVt~~~---- 301 (430) +|+...+ .+........++...+++.+.+|+||+||+|||+|||+|||+||| ++++++||+|+++. 
T Consensus 231 ~~~~~vt-----~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~~a 305 (423) T protein:vir:10 231 KGTPEVN-----YDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDANAHS 305 (423) T ss_pred eeeeEEE-----ecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEecccccc Confidence 7654321 122222223455556677788899999999999999999999999 47999999999874 Q ss_pred -cCceeEEeeccccccccccccccccccccccccccCceeEEecCCC--ceeeeeecccceeEEEecccCCCCchhhcee Q lcl|Aclame:pro 302 -DGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKD--ARTNVFWADDAIRIVSQPIPANHELFAGMKT 378 (430) Q Consensus 302 -~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~--~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~ 378 (430) .+++|+|||+|++.. .+.+|+||+|+||++++|||+|+++ +++||+|||+||+|+|||||+|.+.+++ T Consensus 306 ~~~~tv~i~p~~~~~~------~~~~~~~V~a~~a~~~~vT~~~~~~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~--- 376 (423) T protein:vir:10 306 SGDVTVKISGVPIFDA------GYPQYNAVDRLLAEGDTVSVIGTSKQAMKPNLFYNKLFCGLGTIPLPKLHSIDSA--- 376 (423) T ss_pred cCceEEEecccccccc------CcccccceeccccCCceeEEeeccCCceeEEEEecCcceEEEEEcccCCCcccee--- Confidence 245799999998532 3678999999999999999999864 5799999999999999999988554433 Q ss_pred EEEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCC Q lcl|Aclame:pro 379 TSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQ 428 (430) Q Consensus 379 ~~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q 428 (430) +++.+|||+|+.++||+++++.+||||+|||+++|||||+|+++-+- T Consensus 377 ---~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:10 377 ---VATYEGFSIRVHKYADGDANKQMMRFDLLPAYVCYNPHMGGQFFGNP 423 (423) T ss_pred ---ecccccceEEEEEeeeccccceEEEEEeecceeeeccceEEEEEecC Confidence 34556999999999999999999999999999999999998888777 No 5 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=100.00 E-value=1.3e-112 Score=634.07 Aligned_cols=400 
Identities=20% Similarity=0.214 Sum_probs=343.0 Q ss_pred CccchhhHH--HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccCCccCCccCCCCcceEEEE Q lcl|Aclame:pro 1 MALNEGQIV--TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLELNVAVN 78 (430) Q Consensus 1 MAn~~~~~~--~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~~~s~~~~d~~e~sV~v~ 78 (430) ||.+.++++ +++.+++|+.|++++||+++| +|+|+.++.|+||||+||+|..+++++|... ..+|++|++++++ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv--~r~y~~e~~~~GDTV~I~vp~~~~v~dg~~~--~~~~~te~~v~l~ 76 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCV--YRNYEKTFGKVGDTIRLKLPYRVKSASGRTL--VKQPMVDQTIPFK 76 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhh--cCCCchHHhhCCCEEEEeeCCceeecccCCc--cccccccceEEEE Confidence 998766665 588999999999999999999 8999999999999999999999999999765 4679999999999 Q ss_pred eccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHHHHHHH Q lcl|Aclame:pro 79 MGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMF 156 (430) Q Consensus 79 l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a~~~L~ 156 (430) ||++|+++|+|+++| +.+.++.+|+++||+++||++||.+|+.++.. +++.++++++ ....|++++++++.|+ T Consensus 77 id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~-a~~~~gt~gt----~~~~~~~i~~a~~~Ld 151 (418) T protein:vir:10 77 IAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKK-AFHSSGTPGV----RPGAFIDFANAGAKQT 151 (418) T ss_pred EecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cccccccCCc----CcchHHHHHHHHHHHH Confidence 999999999999999 66789999999999999999999999987665 3444443332 2246999999999999 Q ss_pred HhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccccc-cceecccce Q lcl|Aclame:pro 157 SRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTAT-GITVSGAQS 235 (430) Q Consensus 157 ~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~-~~tV~ga~~ 235 (430) ++++|++++|.+|++|+.++.|.+++..++. +....++||+|+||| ++||++ |+++++|.|++|+.. 
+.+|+|+.+ T Consensus 152 ~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~-~~~~~~~lr~G~IG~-i~GF~V-~~S~nip~~tag~~~~t~~v~ga~~ 228 (418) T protein:vir:10 152 TYAVPQDGMRHAVLDPFTCASLSDEVTKLFK-ESMVEQAYKMGYRGN-VAAYEV-YESQNLPKHTVGDHGGTPLVNGTVV 228 (418) T ss_pred hcCCCCCCceEEEeCHHHHHHHhhhcccccc-ccccchhhheeeeee-eeceEE-EEecCCCcccccccccceeeecccc Confidence 9999996679999999999999988877654 445778999999998 899985 589999999999855 488999876 Q ss_pred eeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeec-----CceeEEee Q lcl|Aclame:pro 236 FKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD-----GTHVEITP 310 (430) Q Consensus 236 ~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~-----~~~v~I~p 310 (430) .+. ++..+++ +.+.+|+|++||+|||+||++||++|||+++.+|||+|+++++ +++|+||| T Consensus 229 ~~~---~~~~~~~-----------t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p 294 (418) T protein:vir:10 229 NGD---TVGFDGG-----------TASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISP 294 (418) T ss_pred cce---eEEEeec-----------ceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEecc Confidence 332 2222211 3456789999999999999999999999999999999999852 46899999 Q ss_pred ccccccccccc-----cccccccccccccccCceeEEecCC--CceeeeeecccceeEEEecccCCCCchhhceeEEEec Q lcl|Aclame:pro 311 KPVALDDVSLS-----PEQRAYANVNTSLADAMAVNILNVK--DARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSI 383 (430) Q Consensus 311 ~~v~~~~~~~~-----~~~~~~~nVsa~pA~~aavTv~~~~--~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~ 383 (430) +|+....+... ...++|+|||++||++++|||+|++ ++++||+|||+||+|+||||++|.|.... 
.+..+ T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~~a~~l~~~~l~~p~g~~~~---~~~~~ 371 (418) T protein:vir:10 295 SLNDGTATINNENGDPVSLTAYQNVTALPADNAPITVLGAANTTYEQNYLFHRDAIALAMIDLELPQSAVIK---SRAAD 371 (418) T ss_pred ccccccccccccccccccccCCCcccccccCcceeeeecccccceeeeeeeecceEEEEEeeccCCCCCCcc---eEEEe Confidence 99755543322 2356899999999999999999975 56899999999999999999999885433 45667 Q ss_pred CCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 384 PDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 384 ~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) |.+|||+||.++||+++++.+||||+|||+|+|||||+ ++|.||-+ T Consensus 372 ~~~G~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~-~~~~g~~~ 417 (418) T protein:vir:10 372 PETGLSLTLTGAYDINEQSEIHRIDAVWGADMIYGELA-LRLWGAAS 417 (418) T ss_pred ccCCeEEEEEEcccccccceEEEEEeecCceeecccce-EEEEeecC Confidence 88999999999999999999999999999999999995 99999999 No 6 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=100.00 E-value=1.4e-112 Score=633.88 Aligned_cols=396 Identities=16% Similarity=0.178 Sum_probs=332.2 Q ss_pred Cccchhh-HHHHHHHHHHHHHHhhcccchhhcccCChHHHH--hhcCCEEEEecCcccccccC--Ccc-CCccCCCCcce Q lcl|Aclame:pro 1 MALNEGQ-IVTLAVDEIIETISAITPMAQKAKKYTPPAASM--QRSSNTIWMPVEQESPTQEG--WDL-TDKATGLLELN 74 (430) Q Consensus 1 MAn~~~~-~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~--~k~GdTV~ip~P~~~~~~~g--~~~-s~~~~d~~e~s 74 (430) |||||+. +.+++.+++|+.|++++||+++| +|+|+.++ +|+||||+||+|..++..+. ++. 
+.+.+|++|++ T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lV--nr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e~~ 78 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTV--DRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGK 78 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhh--cccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCccccce Confidence 9999766 56999999999999999999999 56666555 68999999999998887664 333 35789999999 Q ss_pred EEEEeccccccceEecHHHhc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHHH Q lcl|Aclame:pro 75 VAVNMGEPDNDFFQLRADDLR--DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAE 152 (430) Q Consensus 75 V~v~l~~~k~V~~~~t~keL~--~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a~ 152 (430) |+++||++|++.|+|+++|+. +.+| +|+|+|||++||++||.+|+++++..++++.+.+++. ...|+++++++ T Consensus 79 v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~~gt~~t~----~~a~~~i~~a~ 153 (423) T protein:vir:10 79 ATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTP----ITKWSDVAQTA 153 (423) T ss_pred eEEEeeceeeeeeeechHHHhcChhhH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCcc----cchHHHHHHHH Confidence 999999999999999999965 3455 8999999999999999999999998888777665542 24699999999 Q ss_pred HHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccc-cccchhhhHHHhCCCcceeccccccc-cee Q lcl|Aclame:pro 153 ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTI-QRQVAGFDDVLRSPKLPVLTKSTATG-ITV 230 (430) Q Consensus 153 ~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~i-gr~~~Gfd~~~~~~~~~~~~~gt~~~-~tV 230 (430) +.|+++++|++ +|.+|++|+.+++|+++...+++.++..+++||+|+| || ++|||+ |+|+++|.|++|++++ .++ T Consensus 154 ~~Ld~~~vP~~-~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~-i~GFdv-~~Snnip~~T~gt~~~t~~~ 230 (423) T protein:vir:10 154 SFLKDLGVNEG-ENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTN-FGGIRA-LMSNGLASRTQGAFGGTLTV 230 (423) T ss_pred HHHHhccCCcC-CCEEEeChHHHHHHhccccceecccccchhhhhhccceee-ecceEE-EEeCCCccccccccccceee Confidence 99999999995 6889999999999999999999999999999999998 65 899996 5899999999998653 333 Q ss_pred 
cccceeeeeEEEEeeccccccccceeeE-----EEeeccceeecccEEEEcceeeccccccc-----cccccceEEEEEe Q lcl|Aclame:pro 231 SGAQSFKPVAWQLDNDGNKVNVDNRFAT-----VTLSATTGLKRGDKISFTGVKFLGQMAKN-----VLAQDATFSVVRV 300 (430) Q Consensus 231 ~ga~~~~~~~~t~~~~~~~~~~d~~~~~-----~~~s~tgtlk~GDv~TiaGV~~v~~~tk~-----~~~~l~~fvVt~~ 300 (430) +.+.+....+ ..+....+ .+++.+|+||+||+|||+||++|||+||| +++++++|+|+++ T Consensus 231 ~~~~~v~~~a----------~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~ 300 (423) T protein:vir:10 231 KTQPTVTYNA----------VKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTAD 300 (423) T ss_pred eecceecccc----------ccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEee Confidence 2222211000 01111122 23456799999999999999999999999 6699999999998 Q ss_pred ec-----CceeEEeeccccccccccccccccccccccccccCceeEEecCCC--ceeeeeecccceeEEEecccCCCCch Q lcl|Aclame:pro 301 VD-----GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKD--ARTNVFWADDAIRIVSQPIPANHELF 373 (430) Q Consensus 301 ~~-----~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~--~~~NlaFhr~A~~Latrpl~~p~~~~ 373 (430) .+ +++|+|+|+|+++. 
...+|+||+++||++++||++|+++ +++||+|||+||+|+|||||+|.+.+ T Consensus 301 ~~~~~~g~~tv~i~p~~i~~~------~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~~~~~~~ 374 (423) T protein:vir:10 301 ANSDSGGDVTVTLSGVPIYDT------TNPQYNSVSRQVEAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKLHSID 374 (423) T ss_pred eeeccCCceeeeccCcccccc------CCcccccccccccCCceeeccccccCCeeEEEEecCcceEEEEEcccCCCccc Confidence 63 35799999998752 4578999999999999999999864 57999999999999999999875544 Q ss_pred hhceeEEEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCC Q lcl|Aclame:pro 374 AGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQ 428 (430) Q Consensus 374 ~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q 428 (430) ++ +++.+|||+|+.++||+++++.+||||+|||+++|||||+|+++-+- T Consensus 375 ~~------~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:10 375 SA------VATYEGFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred ee------eccccCceEEEEEeeeccccceEEEEEeecceeeeccceEEEEEecC Confidence 33 44556999999999999999999999999999999999998888777 No 7 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=100.00 E-value=6.5e-112 Score=630.15 Aligned_cols=401 Identities=15% Similarity=0.166 Sum_probs=334.8 Q ss_pred Cccchhh-HHHHHHHHHHHHHHhhcccchhhcccCChHHHH--hhcCCEEEEecCccccccc--CCccC-CccCCCCcce Q lcl|Aclame:pro 1 MALNEGQ-IVTLAVDEIIETISAITPMAQKAKKYTPPAASM--QRSSNTIWMPVEQESPTQE--GWDLT-DKATGLLELN 74 (430) Q Consensus 1 MAn~~~~-~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~--~k~GdTV~ip~P~~~~~~~--g~~~s-~~~~d~~e~s 74 (430) |||||+. 
+.+++.+++|+.|++++||+++| +|+|+.++ +|+||||+||+|..+...+ +++.+ .+.+|++|++ T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lV--nr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e~~ 78 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTV--DRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGK 78 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhh--cccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCccccce Confidence 9999766 56999999999999999999999 56665554 6899999999999888765 34433 4689999999 Q ss_pred EEEEeccccccceEecHHHhc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHHH Q lcl|Aclame:pro 75 VAVNMGEPDNDFFQLRADDLR--DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAE 152 (430) Q Consensus 75 V~v~l~~~k~V~~~~t~keL~--~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a~ 152 (430) |+++||++|++.|+|+++|+. +.+| +|+|+|||++||++||.+|++++++.++++.+++++. ...|+++++++ T Consensus 79 v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~~gt~~t~----~~a~~~i~~a~ 153 (423) T protein:vir:17 79 ATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTP----ITKWSDVAQTA 153 (423) T ss_pred eEEEeeceeeeeeeecHHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCcc----cccHHHHHHHH Confidence 999999999999999999965 4455 8999999999999999999999988888777665543 24599999999 Q ss_pred HHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccc-cccchhhhHHHhCCCcceecccccc-ccee Q lcl|Aclame:pro 153 ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTI-QRQVAGFDDVLRSPKLPVLTKSTAT-GITV 230 (430) Q Consensus 153 ~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~i-gr~~~Gfd~~~~~~~~~~~~~gt~~-~~tV 230 (430) +.|+++++|++ +|.+|++|+.+++|+++...+++.++..+++||+|+| || ++|||+ |+|+++|.|++|+.+ +.++ T Consensus 154 ~~Ld~~~vP~~-~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~-i~GFdv-y~Snnip~~T~gt~~~t~~~ 230 (423) T protein:vir:17 154 SFLKDLGVNEG-ENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTN-FGGIRA-LMSNGLASRTQGAFGGTLTV 230 (423) T ss_pred HHHHhccCCcC-CCEEEeChHHHHHHhccccceecccccchHHHhhccceee-ecceEE-EEeCCCccccccceeceeee Confidence 99999999995 
6889999999999999999999999999999999998 65 999996 589999999999964 3444 Q ss_pred cccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccc-----cccccceEEEEEeec--- Q lcl|Aclame:pro 231 SGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKN-----VLAQDATFSVVRVVD--- 302 (430) Q Consensus 231 ~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~-----~~~~l~~fvVt~~~~--- 302 (430) ..+.+.... ...+......+ ....+++++|+|++||+|||+||++|||+||+ +++++++|+|+++.+ T Consensus 231 ~~~~~v~~~----a~~~~~~~~~~-~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a 305 (423) T protein:vir:17 231 KTQPTVTYN----AVKDSYQFTVT-LTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDS 305 (423) T ss_pred ccccccccc----ccccccceeee-eeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEecccccc Confidence 433322111 11111111111 23345567899999999999999999999999 668999999998763 Q ss_pred --CceeEEeeccccccccccccccccccccccccccCceeEEecCCC--ceeeeeecccceeEEEecccCCCCchhhcee Q lcl|Aclame:pro 303 --GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKD--ARTNVFWADDAIRIVSQPIPANHELFAGMKT 378 (430) Q Consensus 303 --~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~--~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~ 378 (430) +++|+|||+|++.. 
...+|+||+++||++++||++|+++ +++||+|||+||+|+|||||+|.+.+++ T Consensus 306 ~~~~tv~i~p~~i~~~------~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~--- 376 (423) T protein:vir:17 306 SGDVTVTLSGVPIYDT------TNPQYNSVSRQVAAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKLHSIDSA--- 376 (423) T ss_pred cCceEEEecCcccccc------CCcccccceecccCCceeeccccccCCeeEEEEecCcceEEEEEcccCCCcccee--- Confidence 35799999998753 3568999999999999999999864 5799999999999999999987543322 Q ss_pred EEEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCC Q lcl|Aclame:pro 379 TSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQ 428 (430) Q Consensus 379 ~~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q 428 (430) +++.+|||+|+.++||+++++.+||||+|||+++|||||+|+++-+- T Consensus 377 ---~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:17 377 ---VATYEGFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred ---ecccCCcEEEEEEecccccceeEEEEEeecceeeeccceEEEEEecC Confidence 45567999999999999999999999999999999999998888777 No 8 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=100.00 E-value=1.8e-111 Score=627.72 Aligned_cols=401 Identities=15% Similarity=0.179 Sum_probs=336.7 Q ss_pred CccchhhH-HHHHHHHHHHHHHhhcccchhhcccCChHHHH--hhcCCEEEEecCcccccccCC---ccCCccCCCCcce Q lcl|Aclame:pro 1 MALNEGQI-VTLAVDEIIETISAITPMAQKAKKYTPPAASM--QRSSNTIWMPVEQESPTQEGW---DLTDKATGLLELN 74 (430) Q Consensus 1 MAn~~~~~-~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~--~k~GdTV~ip~P~~~~~~~g~---~~s~~~~d~~e~s 74 (430) |||||++. .++++++.|+.|++++||+++| +|+|+.++ +|+||||+||+|..+++.+.. 
..+.+.+|++|.+ T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV--~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e~~ 78 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTV--DRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFSAK 78 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhc--ccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCccccccccce Confidence 99998775 5899999999999999999999 56665554 799999999999998887753 3456789999999 Q ss_pred EEEEeccccccceEecHHHhc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHHH Q lcl|Aclame:pro 75 VAVNMGEPDNDFFQLRADDLR--DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAE 152 (430) Q Consensus 75 V~v~l~~~k~V~~~~t~keL~--~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a~ 152 (430) |+++||++|++.|+|+++|+. +.+| +|+|+|+|++||++||.+|+..++..+++..+++++. ...|++|++++ T Consensus 79 v~l~id~~k~~a~~v~d~e~~l~i~~~-~~~l~~a~~ala~~vd~~l~~~l~~~a~~~vgt~~t~----~~~~~~i~~a~ 153 (423) T protein:vir:35 79 ATGKVGKYITVAVEWTQIEEALKLNQL-DQILSPIHERMVTDLETELAHFMMNNGALSLGSPNTA----IKKWADVAQTA 153 (423) T ss_pred eeEEeccceeccceeCHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCC----cchHHHHHHHH Confidence 999999999999999999964 4456 8999999999999999999998888777766655432 24599999999 Q ss_pred HHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccc-cccchhhhHHHhCCCcceeccccccc-cee Q lcl|Aclame:pro 153 ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTI-QRQVAGFDDVLRSPKLPVLTKSTATG-ITV 230 (430) Q Consensus 153 ~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~i-gr~~~Gfd~~~~~~~~~~~~~gt~~~-~tV 230 (430) +.|+++++|++ +|.+|++|+.++.|++++..+++.++..+++||+|+| || ++|||+ |+|+++|.|++|+.++ .++ T Consensus 154 ~~Ld~~~vP~~-~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~-i~GFdv-~~Snnvp~~T~gt~~~~~~v 230 (423) T protein:vir:35 154 SFIKDIGIKTG-ENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGN-FGGIRA-LMSNGLASRKQGDFDGAITV 230 (423) T ss_pred HHHHHhcCCcC-CCEEEeCHHHHHHHhccccceeccccchhHHHhhccceee-ecceEE-EEcCCCccccccccccceee Confidence 99999999995 6889999999999999999999999999999999988 76 899996 5799999999999654 556 Q ss_pred 
cccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeecccccccc-----ccccceEEEEEee---- Q lcl|Aclame:pro 231 SGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNV-----LAQDATFSVVRVV---- 301 (430) Q Consensus 231 ~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~-----~~~l~~fvVt~~~---- 301 (430) +++++.... ...+.....++ ....+++++|+|++||+|||+||++|||++|++ +++++||+|+++. T Consensus 231 ~~a~~v~~~----a~~~~~~~~~~-~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a 305 (423) T protein:vir:35 231 KTAPNVDYL----SVKDSYQFTVA-LTGATPSKTGFLKAGDQLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTA 305 (423) T ss_pred ccccccccc----cccccccceee-eeeeeeccCCcEEecceEEeeeeeeccccccceeecccCCceeEEEEeccccccc Confidence 665532221 11122222222 233466788999999999999999999999995 7999999999775 Q ss_pred -cCceeEEeeccccccccccccccccccccccccccCceeEEecCCC--ceeeeeecccceeEEEecccCCCCchhhcee Q lcl|Aclame:pro 302 -DGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKD--ARTNVFWADDAIRIVSQPIPANHELFAGMKT 378 (430) Q Consensus 302 -~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~--~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~ 378 (430) .+++|+|||+|++.. 
...+|+||+++||++++|||+|+++ +++||+|||+||+|+|||||+|.+.+++ T Consensus 306 ~g~~~v~i~p~~~~~~------~~~~~~~v~a~~a~~~~vt~~~~a~~~~~~nl~~~~~a~~l~~~~l~~~~~~~~~--- 376 (423) T protein:vir:35 306 SGDVTVKLSGVPIYDE------KNSQYNAVDAKVKAGDAVSIIGTAKQQMKPNLFYNKFFCGLGTIPLPKLHSLDSA--- 376 (423) T ss_pred cCceeEEccccccccC------CCcccccccccccCCceeeeeecCCCceeEEEeecCceeEEEEEccccCCcccee--- Confidence 246799999987642 3568999999999999999999864 4699999999999999999988654433 Q ss_pred EEEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCC Q lcl|Aclame:pro 379 TSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQ 428 (430) Q Consensus 379 ~~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q 428 (430) +.+.+|||+|+.++||+++++.+||||+|||+|+|||||+|+++.+- T Consensus 377 ---~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:35 377 ---VATYEGFSIRVHKYADGDANKQMMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred ---eccccCceEEEEEeeccccCceEEEEEeecceeeecccceEEEEecC Confidence 33456999999999999999999999999999999999998888777 No 9 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=3.4e-39 Score=231.45 Aligned_cols=374 Identities=13% Similarity=0.090 Sum_probs=209.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHh-hcCCEEEEecCccccccc------CCccCCccCCCCcc Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQ-RSSNTIWMPVEQESPTQE------GWDLTDKATGLLEL 73 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~-k~GdTV~ip~P~~~~~~~------g~~~s~~~~d~~e~ 73 (430) |||++.+ -+++.+++|+.|++++||+++| +|+|+.++. |+||||+||+|..+...+ +.....+.+++.+. 
T Consensus 1 Ma~~~~~-p~~~a~~~l~~l~~~lv~~~lv--~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) T protein:vir:99 1 MANAFSK-PTAVVDTAIQMLQNELILTNLV--WLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTED 77 (392) T ss_pred Ccccccc-HHHHHHHHHHHHHhhccchhhh--ccccccccccCCCCeEEEeecccccceeeeccccccCCcccccccccc Confidence 9999866 5677789999999999999999 789988875 789999999998876643 33445678899999 Q ss_pred eEEEEeccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHH Q lcl|Aclame:pro 74 NVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADA 151 (430) Q Consensus 74 sV~v~l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a 151 (430) +++++||++|++.|+|+++|+ .+.++.+++++|++++||++||.+|++.+... ....... .....+...+++|.++ T Consensus 78 ~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a-~~~~~~~-~~~~~~~~~~~~i~~a 155 (392) T protein:vir:99 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGA-PYEAAGA-VHEVAPDEFFKGVNGA 155 (392) T ss_pred eEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcc-ccccccc-ccccChhhhHHHHHHH Confidence 999999999999999999995 46689999999999999999999999877642 2222111 1112233458899999 Q ss_pred HHHHHHhCCCcCCCcEEEecHHHHHHHHHh--hhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceeccccc--cc Q lcl|Aclame:pro 152 EELMFSRELNRDMGTSYFFNPQDYKKAGYD--LTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTA--TG 227 (430) Q Consensus 152 ~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~--~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~--~~ 227 (430) ++.|+++++|. +|.++++|+.+..|+.+ +...++.+....++||+|+||+ ++||++| ++++++.++.-.. 
.+ T Consensus 156 ~~~L~~~~vP~--~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~-i~G~~v~-~s~~~~~~t~~a~~~~a 231 (392) T protein:vir:99 156 RRALNELYIPQ--GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGR-IYGYEIV-ESTLIPHGDAYLYHPTA 231 (392) T ss_pred HHHHhhcCCCC--CCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeee-eeeeEEE-eecccccccceeeeccc Confidence 99999999996 48899999999988754 3333444444567899999997 8999964 6887775532110 11 Q ss_pred c-eecccceeeeeEEEEeeccccccccce-eeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEee--cC Q lcl|Aclame:pro 228 I-TVSGAQSFKPVAWQLDNDGNKVNVDNR-FATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVV--DG 303 (430) Q Consensus 228 ~-tV~ga~~~~~~~~t~~~~~~~~~~d~~-~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~--~~ 303 (430) . ...+++.. +.... .+.....++. ....+....++.+ .|.+++..+....-+++..... |...... .. T Consensus 232 ~~~at~a~v~-~~~~~---~~~s~s~~~~v~~~~~~~~~~t~~-s~~~~v~~~~g~~~v~~~~~~~---~~~~~~~~~~~ 303 (392) T protein:vir:99 232 FIMATRAPAP-PMGAV---RSTAISGDQRIAMRWLVDYDSTIT-SNRSLIDTYFGLKVVEDPNGVG---FVRARKIHLIP 303 (392) T ss_pred cccccccccc-ccccc---ceeEEecccceecceeecccceee-ccccccceeEEEEEEeeccccc---eeeeeeeeeec Confidence 1 11111110 00000 0000000110 1111111122222 2444444433333222221111 1111000 11 Q ss_pred ceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEec Q lcl|Aclame:pro 304 THVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSI 383 (430) Q Consensus 304 ~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~ 383 (430) .++++.|..+.............-+..+.+|++... ...++-|.-+--..|+..-. | . .. .. 
T Consensus 304 ~~v~v~~v~~~~~~~~~~~~~~~~~~~t~~~~~~~~--------~~~~vtw~Ssn~~vAtV~~~---G-~----Vt-~v- 365 (392) T protein:vir:99 304 GSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDD--------VTALCDFESSATDKATVAAG---G-L----VT-GV- 365 (392) T ss_pred ceeeeeeeecccceeEeeeccceeEEEEEEecCCcc--------ccceEEEEEcCCeeEEEcCC---c-e----EE-EE- Confidence 123332211111000000000111112222222111 12445666555556666521 1 1 00 00 Q ss_pred CCCcEE-EEEEeecccccceEEEEEEee Q lcl|Aclame:pro 384 PDVGLN-GIFATQGDISTLSGLCRIALW 410 (430) Q Consensus 384 ~~~Gls-irv~~~yd~~~~~~~~rldvl 410 (430) .-|-. |.......-......|.+.|+ T Consensus 366 -~~G~atITa~~~~~~~~~t~t~~vtV~ 392 (392) T protein:vir:99 366 -AAGTSTVTATLVTPSGDREDTIVITVV 392 (392) T ss_pred -ecceEEEEEEEEcCCCcEEEEEEEEeC Confidence 00222 222111111123456777776 No 10 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.96 E-value=1e-31 Score=190.42 Aligned_cols=268 Identities=18% Similarity=0.126 Sum_probs=187.4 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccccc--CCccCCccCCCCcceEEEE Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE--GWDLTDKATGLLELNVAVN 78 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~--g~~~s~~~~d~~e~sV~v~ 78 (430) |||+... 
-+++-+++++.|++.+++++++ +|+++.+ .++||||+||++......+ +....+..+++.+..++++ T Consensus 1 MA~~~~~-pei~~~~v~~~~~~~lv~~~l~--~~~~~~~-~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) T protein:vir:79 1 MAFNNFI-PELWSDMLLEEWTAQTVFANLV--NREYEGI-ASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) T ss_pred Ccchhhh-HHHHHHHHHHHHHhhccchhhh--hcccccc-ccCCcEEEEeecCcccccccccCCCccCccccccceEEEE Confidence 9998543 4677788999999999999999 5677654 4679999999987766554 3334456788999999999 Q ss_pred eccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHHHHHHH Q lcl|Aclame:pro 79 MGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMF 156 (430) Q Consensus 79 l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a~~~L~ 156 (430) |++++++.|.+++.|. ...++ ++++++++.+||+++|.+++..+...+... +...+..+...++.|.++++.|+ T Consensus 77 id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~vD~~i~~~~~~a~~~~---~~~~~~~~~~~~~~i~~a~~~ld 152 (273) T protein:vir:79 77 IDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTAL---TGSAPSDADDAFDLIASALKELT 152 (273) T ss_pred EeeecccceeeccHHHHhhcccH-HHHHHHHHHHHHHHHHHHHHHHHhhccccc---ccccccchhhHHHHHHHHHHHhh Confidence 9999999999998773 33344 579999999999999999998876533222 11222222344778999999999 Q ss_pred HhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch-hhhhhhhccccccchhhhHHHhCCCcceecccccccceecccce Q lcl|Aclame:pro 157 SRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQS 235 (430) Q Consensus 157 ~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~-~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~ 235 (430) +.++|. ++|.++++|+.+..++..-..+...... 
....+|+|.||| +.||+++ +++++|.+ T Consensus 153 ~~~vP~-~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~-~~G~~i~-~s~~lp~~--------------- 214 (273) T protein:vir:79 153 KANVPN-VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN-LLGARIV-ESNNLRDT--------------- 214 (273) T ss_pred hccCCc-cCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeE-EeceEEE-eccccccc--------------- Confidence 999998 4699999999999887543222222222 346799999998 9999865 56555410 Q ss_pred eeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEEeeccccc Q lcl|Aclame:pro 236 FKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVAL 315 (430) Q Consensus 236 ~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~ 315 (430) .| T Consensus 215 ----------------------------------------~~-------------------------------------- 216 (273) T protein:vir:79 215 ----------------------------------------DD-------------------------------------- 216 (273) T ss_pred ----------------------------------------Cc-------------------------------------- Confidence 00 Q ss_pred cccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEecCCCcEEEEEEee Q lcl|Aclame:pro 316 DDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQ 395 (430) Q Consensus 316 ~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~ 395 (430) .+ =++||++||.++.+-..+-.+ +++ T Consensus 217 ------------------------~~---------~~a~~~~A~~~a~~~~~~e~~----------r~~----------- 242 (273) T protein:vir:79 217 ------------------------EQ---------FVAFHPSAAAYVSQIDTVEAL----------RDQ----------- 242 (273) T ss_pred ------------------------eE---------EEEEeccceeeeeehhhhhcc----------cCc----------- Confidence 00 045788888888765432100 111 Q ss_pred cccccceEEEEEEeecCceeeCcceeEEEcCCCC Q lcl|Aclame:pro 396 GDISTLSGLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) Q Consensus 396 yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~ 429 (430) +..-..++-+..||++++|||-.+++=+.=+ T Consensus 243 ---~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 
273 (273) T protein:vir:79 243 ---DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ---ccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 1112234456779999999996544332222 No 11 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.96 E-value=1.2e-31 Score=190.16 Aligned_cols=268 Identities=18% Similarity=0.144 Sum_probs=188.4 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccccc--CCccCCccCCCCcceEEEE Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE--GWDLTDKATGLLELNVAVN 78 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~--g~~~s~~~~d~~e~sV~v~ 78 (430) |||+... -++.-+++++.|++.+|+++++ +|+++.++ ++||||+||+|......+ +.......+++.+..++++ T Consensus 1 MA~~~~~-pe~~~~~v~~~~~~~lv~~~l~--~~~~~~~~-~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) T protein:vir:10 1 MAFNNFI-PELWSDMLLEEWTAQTVFANLV--NREYEGTA-SKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) T ss_pred Ccchhhh-HHHHHHHHHHHHHhhhccchhh--cccccccc-ccCceEEEeecccccccccccCCCccCccccccceEEEE Confidence 9998553 4677788999999999999999 57877665 679999999998876654 3344556788999999999 Q ss_pred eccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHHHHHHH Q lcl|Aclame:pro 79 MGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMF 156 (430) Q Consensus 79 l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a~~~L~ 156 (430) ||+++++.|.+++.|. 
...++ ++++++++.+||+++|.+++..+...+..+ ....+......++.|.++++.|+ T Consensus 77 id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~alA~~vD~~i~~~~~~a~~~~---~~~~~~~~~~~~~~i~~a~~~ld 152 (273) T protein:vir:10 77 IDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTAL---TGSAPTDADDAFDLIAKALKELT 152 (273) T ss_pred EeeeeecceEeecHHHhhhhccH-HHHHHHHHHHHHHHHHHHHHHHHhcccccc---ccccccchhHHHHHHHHHHHHhh Confidence 9999999999998663 33344 679999999999999999998776633322 11222222344788999999999 Q ss_pred HhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccc-hhhhhhhhccccccchhhhHHHhCCCcceecccccccceecccce Q lcl|Aclame:pro 157 SRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQS 235 (430) Q Consensus 157 ~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~-~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~ 235 (430) ++.+|. ++|.++++|+.+..|...-....+... ....++|+|.||| +.||+++ +++++|.+ . T Consensus 153 ~~~vP~-~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~-i~G~~v~-~s~~lp~~------------~-- 215 (273) T protein:vir:10 153 KANVPN-VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN-LLGARIV-ESNNLRDT------------D-- 215 (273) T ss_pred hcCCCc-CCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeE-EeceEEE-EecccccC------------C-- Confidence 999998 579999999999988754322222222 2456799999998 9999865 56655411 0 Q ss_pred eeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEEeeccccc Q lcl|Aclame:pro 236 FKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVAL 315 (430) Q Consensus 236 ~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~ 315 (430) + T Consensus 216 -----------------------------------------~-------------------------------------- 216 (273) T protein:vir:10 216 -----------------------------------------D-------------------------------------- 216 (273) T ss_pred -----------------------------------------c-------------------------------------- Confidence 0 Q ss_pred cccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEecCCCcEEEEEEee Q lcl|Aclame:pro 316 
DDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQ 395 (430) Q Consensus 316 ~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~ 395 (430) .+. ++||++||.++.+-..+- ..+++. . T Consensus 217 ------------------------~~~---------~~~~~~A~~~a~q~~~~e----------~~r~~~-~-------- 244 (273) T protein:vir:10 217 ------------------------EQF---------VAFHPSAAAYVSQIDTVE----------ALRDQD-S-------- 244 (273) T ss_pred ------------------------cEE---------EEEeccceeeeeeeehhh----------cccCCC-c-------- Confidence 000 567888888877654321 011111 1 Q ss_pred cccccceEEEEEEeecCceeeCcceeEEEcCCCC Q lcl|Aclame:pro 396 GDISTLSGLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) Q Consensus 396 yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~ 429 (430) ....++-+..||++++|||-.+++=+.=+ T Consensus 245 -----~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 245 -----FSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -----ceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 11223345679999999996554433333 No 12 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.96 E-value=1.2e-31 Score=190.16 Aligned_cols=268 Identities=18% Similarity=0.144 Sum_probs=188.4 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccccc--CCccCCccCCCCcceEEEE Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE--GWDLTDKATGLLELNVAVN 78 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~--g~~~s~~~~d~~e~sV~v~ 78 (430) |||+... 
-++.-+++++.|++.+|+++++ +|+++.++ ++||||+||+|......+ +.......+++.+..++++ T Consensus 1 MA~~~~~-pe~~~~~v~~~~~~~lv~~~l~--~~~~~~~~-~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) T protein:vir:10 1 MAFNNFI-PELWSDMLLEEWTAQTVFANLV--NREYEGTA-SKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) T ss_pred Ccchhhh-HHHHHHHHHHHHHhhhccchhh--cccccccc-ccCceEEEeecccccccccccCCCccCccccccceEEEE Confidence 9998553 4677788999999999999999 57877665 679999999998876654 3344556788999999999 Q ss_pred eccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHHHHHHH Q lcl|Aclame:pro 79 MGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMF 156 (430) Q Consensus 79 l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a~~~L~ 156 (430) ||+++++.|.+++.|. ...++ ++++++++.+||+++|.+++..+...+..+ ....+......++.|.++++.|+ T Consensus 77 id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~alA~~vD~~i~~~~~~a~~~~---~~~~~~~~~~~~~~i~~a~~~ld 152 (273) T protein:vir:10 77 IDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTAL---TGSAPTDADDAFDLIAKALKELT 152 (273) T ss_pred EeeeeecceEeecHHHhhhhccH-HHHHHHHHHHHHHHHHHHHHHHHhcccccc---ccccccchhHHHHHHHHHHHHhh Confidence 9999999999998663 33344 679999999999999999998776633322 11222222344788999999999 Q ss_pred HhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccc-hhhhhhhhccccccchhhhHHHhCCCcceecccccccceecccce Q lcl|Aclame:pro 157 SRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQS 235 (430) Q Consensus 157 ~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~-~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~ 235 (430) ++.+|. ++|.++++|+.+..|...-....+... ....++|+|.||| +.||+++ +++++|.+ . 
T Consensus 153 ~~~vP~-~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~-i~G~~v~-~s~~lp~~------------~-- 215 (273) T protein:vir:10 153 KANVPN-VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN-LLGARIV-ESNNLRDT------------D-- 215 (273) T ss_pred hcCCCc-CCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeE-EeceEEE-EecccccC------------C-- Confidence 999998 579999999999988754322222222 2456799999998 9999865 56655411 0 Q ss_pred eeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEEeeccccc Q lcl|Aclame:pro 236 FKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVAL 315 (430) Q Consensus 236 ~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~ 315 (430) + T Consensus 216 -----------------------------------------~-------------------------------------- 216 (273) T protein:vir:10 216 -----------------------------------------D-------------------------------------- 216 (273) T ss_pred -----------------------------------------c-------------------------------------- Confidence 0 Q ss_pred cccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEecCCCcEEEEEEee Q lcl|Aclame:pro 316 DDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQ 395 (430) Q Consensus 316 ~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~ 395 (430) .+. ++||++||.++.+-..+- ..+++. . 
T Consensus 217 ------------------------~~~---------~~~~~~A~~~a~q~~~~e----------~~r~~~-~-------- 244 (273) T protein:vir:10 217 ------------------------EQF---------VAFHPSAAAYVSQIDTVE----------ALRDQD-S-------- 244 (273) T ss_pred ------------------------cEE---------EEEeccceeeeeeeehhh----------cccCCC-c-------- Confidence 000 567888888877654321 011111 1 Q ss_pred cccccceEEEEEEeecCceeeCcceeEEEcCCCC Q lcl|Aclame:pro 396 GDISTLSGLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) Q Consensus 396 yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~ 429 (430) ....++-+..||++++|||-.+++=+.=+ T Consensus 245 -----~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 245 -----FSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred -----ceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 11223345679999999996554433333 No 13 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.96 E-value=1.1e-31 Score=190.26 Aligned_cols=317 Identities=12% Similarity=0.036 Sum_probs=195.5 Q ss_pred Cccchhh----------HH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccC-CccCCccC Q lcl|Aclame:pro 1 MALNEGQ----------IV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEG-WDLTDKAT 68 (430) Q Consensus 1 MAn~~~~----------~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g-~~~s~~~~ 68 (430) |.|.++- .+ ++.-+++++.|+..++++.+++ +++.++ +.||||+||++......+- ...+...+ T Consensus 3 ~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~---d~~~~~-~~Gdtv~ip~~g~~~~~d~~~~~~i~~~ 78 (341) T protein:vir:94 3 LGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVK---TWGAQV-KKGDTFHVPRISELGVEDKATDVPVGVQ 78 (341) T ss_pred chhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccc---cccccc-cCCceEEEeccCcceeeeecCCCccccc Confidence 6676665 44 6777888999999999999884 444333 5699999999887665541 22345667 Q ss_pred CCCcceEEEEeccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCC----CC- Q lcl|Aclame:pro 69 
GLLELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGT----NT- 141 (430) Q Consensus 69 d~~e~sV~v~l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~----~~- 141 (430) ++.+..++++||+++...|.+++.|. ...++.+++++++.++||+++|.+++..+...+..........++ +. T Consensus 79 ~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t~~~ 158 (341) T protein:vir:94 79 PVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAITGNG 158 (341) T ss_pred cccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccccCch Confidence 88899999999999999999999773 455778899999999999999999998776544322221111111 11 Q ss_pred -CCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCccee Q lcl|Aclame:pro 142 -ADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVL 220 (430) Q Consensus 142 -~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~ 220 (430) ...+..|.++++.|+++++|. ++|.+|++|+.+..|... ..+.+.+...+..+|+|.||+ ++||++ ++++++|.+ T Consensus 159 ~~~~~~~i~~a~~~Lde~~VP~-~gR~lvv~P~~~~~Ll~~-~~~~~~~~~g~~~l~~G~ig~-i~G~~V-~~Sn~lp~~ 234 (341) T protein:vir:94 159 QAFSFAVFLAARRLLLEADVPE-EKIVLLISPGQESALFTI-PQFISKDFINNAPIAQGQIGS-LMGVRV-IRTSLIGNN 234 (341) T ss_pred hhhhHHHHHHHHHHHhhcCCCc-cCCEEEeCHHHHHHHhhc-hhhhhhhccccchhheeeeee-EeceEE-EEecccccc Confidence 113677899999999999998 569999999999988753 333333444455699999997 999986 578888743 Q ss_pred cccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEe Q lcl|Aclame:pro 221 TKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRV 300 (430) Q Consensus 221 ~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~ 300 (430) ... . .+.++.. ....+..-.|.|+... 
T Consensus 235 ~~~---~-~~~~~~~------------------------------~~~~~~~~~i~~~~~~------------------- 261 (341) T protein:vir:94 235 SAT---G-WRNGAPT------------------------------IAPAEATPGFTGSRYL------------------- 261 (341) T ss_pred ccc---c-ccccccc------------------------------eecccccccccccccc------------------- Confidence 211 1 1111110 0011222223321000 Q ss_pred ecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEE-EecccCCCCchhhceeE Q lcl|Aclame:pro 301 VDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIV-SQPIPANHELFAGMKTT 379 (430) Q Consensus 301 ~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~La-trpl~~p~~~~~~~~~~ 379 (430) +.| . +-.+...-|+|||+|+..+ ...|+.- . .. .. T Consensus 262 ------------------------~~~----~-----------~~~~~~~gl~~~~~av~~~k~~~~~~~---~-~~-~~ 297 (341) T protein:vir:94 262 ------------------------PKQ----D-----------SFTSLPATFTGNSRPVHTAVMCHMDWA---A-AV-VS 297 (341) T ss_pred ------------------------ccc----c-----------cccccEEEEEEecccccceeeecchhh---h-cc-cc Confidence 000 0 0111234699999998766 2222110 0 00 00 Q ss_pred EEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcC-CCCC Q lcl|Aclame:pro 380 SFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLP-GQTA 430 (430) Q Consensus 380 ~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~-~q~~ 430 (430) ... .+.-+..+-+|.| ..+=..+||++++|||++..+-. +-|. 
T Consensus 298 ~~~--~~~~~~~~~~~~~------~i~~~~~~G~~~lrp~~~v~~~~~~~~~ 341 (341) T protein:vir:94 298 KAP--RVTQSFENREQVW------LMVGRQAYGARLYRPLHAVNIHTTGDTV 341 (341) T ss_pred ccc--cccccchhhhhhh------hhhhhhhhcccccCcceeEEEecCcCCC Confidence 000 0011122223433 23334579999999999743322 2222 No 14 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.91 E-value=1e-26 Score=163.06 Aligned_cols=346 Identities=12% Similarity=0.040 Sum_probs=204.2 Q ss_pred Cccchhh-HH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccccc-CCccCCccCCCCcceEEE Q lcl|Aclame:pro 1 MALNEGQ-IV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE-GWDLTDKATGLLELNVAV 77 (430) Q Consensus 1 MAn~~~~-~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~-g~~~s~~~~d~~e~sV~v 77 (430) |+++..+ .+ ++.-+++++.|++.+++.++++. ++++ .+.||||+||++......+ ........+++.+..+++ T Consensus 15 ~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~-~~~~---~~~GdTV~ip~~g~~~a~d~~~g~~i~~~~~~~~~~~i 90 (381) T protein:vir:80 15 VDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKK-IPFE---GKKGDLIHIPNISRAAVYDKQPQTPVNLQARTDSEFTF 90 (381) T ss_pred cchhhHHhhhhHHHHHHHHHHHHHhhhhhhcccc-ccce---eecCceEEeeccCcceeeeecCCCcccccccCCceEEE Confidence 5555333 33 68888999999999999999853 3443 2469999999987655433 122345667888999999 Q ss_pred EeccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCC--------------CCCC Q lcl|Aclame:pro 78 NMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAI--------------GTNT 141 (430) Q Consensus 78 ~l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~--------------~~~~ 141 (430) +||+.+...+.+++.|. ..-+..+++.+++..+||+++|.+++..+............+. ..+. 
T Consensus 91 tID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i~~~~~~~~~t~~~~ 170 (381) T protein:vir:80 91 TVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTLGDGTVNAHLTGTPA 170 (381) T ss_pred EEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccccccccchh Confidence 99999999999998663 3446778999999999999999999877654222111100000 0001 Q ss_pred CCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceec Q lcl|Aclame:pro 142 ADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLT 221 (430) Q Consensus 142 ~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~ 221 (430) ...++.|.++++.|++..+|. ++|.+|++|+.+..|+... .+.+.+......+|+|.||| ++||+++ ++.++|.. T Consensus 171 ~~t~~~i~~a~~~Lde~~VP~-egR~lvv~P~~~~~Ll~~~-~~~~ad~~~~~~l~~G~Ig~-i~G~~Vv-~Sn~lp~~- 245 (381) T protein:vir:80 171 PLTYAALLLAKQKLDEADVPQ-EGRIVMVSPAQYIDLLSIN-QFISVDFSQVKPVTSGVVGT-ILGMEVI-VTTQIGIN- 245 (381) T ss_pred hHHHHHHHHHHHHHhhcCCCc-CCcEEEeCHHHHHHHhhch-hhhhhhhccchhhhceeeeE-EcceEEE-eecccccc- Confidence 123578999999999999998 5799999999999987643 33344445567899999997 9999865 68888752 Q ss_pred ccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEee Q lcl|Aclame:pro 222 KSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVV 301 (430) Q Consensus 222 ~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~ 301 (430) .+++.. .+++. +...+-...+.. +.+.. .-...+.+++|.-|.-+......++-.++.+ .+.+. 
T Consensus 246 --~~t~~~-~~aga--p~~~~~~~~~~~--~~g~~-s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~--------~~~~~ 309 (381) T protein:vir:80 246 --SLTGYV-NGQGA--PTQPTPGVLGSP--YLPDQ-AGTANVVNTGSASDLAVSLSYFGLPVFSGAG--------ATAAD 309 (381) T ss_pred --ccccee-eeccc--cccccccccccc--ccccc-ccceeeeeeeeeeceeeeeeeccceeeecce--------eeecC Confidence 212211 11110 001000001111 11110 0011233456666655554432221111100 00000 Q ss_pred cCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEec--ccCCCCchhhceeE Q lcl|Aclame:pro 302 DGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQP--IPANHELFAGMKTT 379 (430) Q Consensus 302 ~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrp--l~~p~~~~~~~~~~ 379 (430) ..+++-+ + =.|.|.+-++++-| ++.. ++.. . T Consensus 310 ~~~~~~~---------------------------------------~---~~~~~~~~~~~~~~~~~~~~--~~~~--~- 342 (381) T protein:vir:80 310 GGQTLGS---------------------------------------F---GGANRWATAVVCHPDWLAVG--VQQN--V- 342 (381) T ss_pred CCceeee---------------------------------------e---hhhhhhhhhccccccccccc--ceeE--e- Confidence 0111111 0 11667775555544 2221 1111 1 Q ss_pred EEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 380 SFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 380 ~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) -++-+..+.+|.| ....|+. ||++.+||.| ||.|----. 
T Consensus 343 -----~~~~~~~~~~~~~----~~~~~~~--~~~~~~~~~~-~~~~~~~~~ 381 (381) T protein:vir:80 343 -----KSESSRETMYLAD----AFVTSCV--YGAKVFRPDH-CVLLHTSGI 381 (381) T ss_pred -----ecccchhheeehh----hhhhhhh--hccccccchh-hhhhhhcCC Confidence 1246677888887 5555655 9999999999 465532222 No 15 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.80 E-value=1.7e-21 Score=134.43 Aligned_cols=299 Identities=12% Similarity=0.061 Sum_probs=180.1 Q ss_pred Cccc-------------------hhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccccc-- Q lcl|Aclame:pro 1 MALN-------------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE-- 59 (430) Q Consensus 1 MAn~-------------------~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~-- 59 (430) |||. .+--++..-.+|+..|+...++..++... -.+.|++|+||.=......+ T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~------~~~~G~sv~i~~ig~~t~~~~~ 74 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLR------SIASGKSAQFPVIGRTKAAYLK 74 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccc------cccccceeEeeeccceeeeeec Confidence 5554 22235677788899999999999999642 24569999999854444432 Q ss_pred -CCccCCccCCCCcceEEEEeccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceee---- Q lcl|Aclame:pro 60 -GWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVIT---- 132 (430) Q Consensus 60 -g~~~s~~~~d~~e~sV~v~l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~---- 132 (430) |.....+.+++......++||+++...|.+.+-| ....+.-..+.+.+..+||.++|..++..+...+..... 
T Consensus 75 ~g~~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~ 154 (347) T protein:vir:15 75 PGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNEN 154 (347) T ss_pred cCCCCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 5444445567777788999999999888886533 233346667889999999999999998776543211100 Q ss_pred --ccCCCC-------C-CCCCc--------hhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhh Q lcl|Aclame:pro 133 --SPDAIG-------T-NTADA--------WNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEE 194 (430) Q Consensus 133 --~~~~~~-------~-~~~~~--------~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~ 194 (430) .++..+ + +...+ +.-+-++++.|+++.+|. ++|.+|++|+.+..|+.... +...+..... T Consensus 155 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~-~gR~~vv~P~~y~~LL~~~~-~~~~d~~~~~ 232 (347) T protein:vir:15 155 IEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPA-ADRTFYTTPDNYSAILAALM-PNAANYQALI 232 (347) T ss_pred ccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCc-cCCEEEeCHHHHHHHhcccc-cccccccccc Confidence 000000 0 00111 344667889999999998 46999999999998876543 3333334556 Q ss_pred hhhhccccccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEE Q lcl|Aclame:pro 195 AYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKIS 274 (430) Q Consensus 195 a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~T 274 (430) .+++|.||+ ++||++ +++.++|....+..... . 
T Consensus 233 ~~~~G~Vg~-i~G~~V-~~Sn~lp~~~~t~~~~~---------------------------------------------~ 265 (347) T protein:vir:15 233 DHERGTIRN-VMGFEV-VEVPHLTAGGAGDTRED---------------------------------------------A 265 (347) T ss_pred cccceEEEE-EeceEE-Eeccccccccccccccc---------------------------------------------c Confidence 799999996 999986 57877873211110000 0 Q ss_pred EcceeeccccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeee Q lcl|Aclame:pro 275 FTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFW 354 (430) Q Consensus 275 iaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaF 354 (430) ++| ..|...++ .+.++ ++ .-+-.+=|+| T Consensus 266 ~~g---------------~~~~~~~~---~~~~~--------------------------------~~--~f~~~~~l~~ 293 (347) T protein:vir:15 266 PAD---------------QKHAFPAT---SSTTV--------------------------------KV--ALDNVVGLFQ 293 (347) T ss_pred ccc---------------cccccccc---cccee--------------------------------ee--ccccceeeee Confidence 011 00000000 00000 00 0011234999 Q ss_pred cccceeEEEecccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 355 ADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 355 hr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) ||+|+..+..-.. .++ .++|........+--..||.+++|||.+ +.|.=++. 
T Consensus 294 h~~A~g~v~~~~~---------------------~~e--~~~~~~~~~d~i~~~~~~G~~vlrP~~a-v~~~~~~~ 345 (347) T protein:vir:15 294 HRSAVGTVKLKDL---------------------ALE--RARRANYQADQIIAKYAMGHGGLRPEAA-GAIVLPKV 345 (347) T ss_pred ccceeeeeEeece---------------------eee--ecccchhhhhhhehhhhcCCceeccccE-EEEecCCC Confidence 9999877663221 111 1112222222334456799999999996 55565666 No 16 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.77 E-value=6.1e-21 Score=131.40 Aligned_cols=297 Identities=14% Similarity=0.095 Sum_probs=177.2 Q ss_pred Cccchh-------------------hHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccccc-- Q lcl|Aclame:pro 1 MALNEG-------------------QIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE-- 59 (430) Q Consensus 1 MAn~~~-------------------~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~-- 59 (430) |||+-. --++..-.+|+..|+...++..+++. | -.+.|++|+||.=......+ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~-r-----~~~~G~sv~i~~iG~~t~~~~~ 74 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHML-R-----SIASGKSAQFPVIGRTKAAYLK 74 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhcc-c-----cccccceeEeeeccceeeeeec Confidence 775411 23467777888889999999999964 2 24569999999754444332 Q ss_pred -CCccCCccCCCCcceEEEEeccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccee----- Q lcl|Aclame:pro 60 -GWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVI----- 131 (430) Q Consensus 60 -g~~~s~~~~d~~e~sV~v~l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~----- 131 (430) |.....+.+|+......++||+++...|.+.+-| ...-++-..+.+.+..+||.++|..++......+.... 
T Consensus 75 ~g~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~ 154 (347) T protein:vir:33 75 PGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNEN 154 (347) T ss_pred CCCCCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Confidence 5444444556677788899999999888777633 33334555688899999999999999866543211110 Q ss_pred -----e---ccCCCC-CCCCCc--------hhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhh Q lcl|Aclame:pro 132 -----T---SPDAIG-TNTADA--------WNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEE 194 (430) Q Consensus 132 -----~---~~~~~~-~~~~~~--------~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~ 194 (430) + .+...+ ++...+ +..+-+++..|+++.+|. ++|.+|++|+.+..|+... .+...+....+ T Consensus 155 ~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~-~gR~~vv~P~~y~~Ll~~~-~~~~~d~~~~~ 232 (347) T protein:vir:33 155 IEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPA-ADRTFYTTPDNYSAILAAL-MPNAANYQALL 232 (347) T ss_pred cccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCc-cCcEEEeCHHHHHHHhccc-ccccccccccc Confidence 0 000111 111111 345778999999999998 4699999999998887543 23333333456 Q ss_pred hhhhccccccchhhhHHHhCCCcceecccccc-cceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEE Q lcl|Aclame:pro 195 AYRDGTIQRQVAGFDDVLRSPKLPVLTKSTAT-GITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKI 273 (430) Q Consensus 195 a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~-~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~ 273 (430) .+++|.||+ +.||++ +++.++|... +++. 
..++ T Consensus 233 ~~~~G~V~~-i~G~~V-~~Sn~lp~~~-~~~~~~~~~------------------------------------------- 266 (347) T protein:vir:33 233 DPERGTIRN-VMGFEV-VEVPHLTAGG-AGDTREDAP------------------------------------------- 266 (347) T ss_pred ccccceeEE-EeceeE-EEecccccCc-ccccccccc------------------------------------------- Confidence 799999996 899986 4677777321 1110 0001 Q ss_pred EEcceeeccccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeee Q lcl|Aclame:pro 274 SFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVF 353 (430) Q Consensus 274 TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~Nla 353 (430) +|-+. . |-+. . +.+. +. ..+-..=|+ T Consensus 267 --ag~~~-------~------~~~~---~--~~~~--------------------------------~~--a~~~~~gl~ 292 (347) T protein:vir:33 267 --ADQKH-------A------FPAT---S--STTV--------------------------------KV--ALDNVVGLF 292 (347) T ss_pred --ccccc-------c------ccCC---c--ccce--------------------------------ec--cccceeeee Confidence 11000 0 0000 0 0000 00 111123599 Q ss_pred ecccceeEEEe-cccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 354 WADDAIRIVSQ-PIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 354 Fhr~A~~Latr-pl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) |||+|+..+-. +|. ++ ..||.+.....++--..||.+++|||.+ +.|.=++. 
T Consensus 293 ~h~~A~g~v~~~~~~----------------------~e--~~r~~~~~~d~i~~~~~~G~~vlrP~~a-v~i~~~~~ 345 (347) T protein:vir:33 293 QHRSAVGTVKLKDLA----------------------LE--RARRANYQADQIIAKYAMGHGGLRPEAA-GAIVLPKV 345 (347) T ss_pred ecchhheeeeeecee----------------------ee--eccchhhhhHhhhhhhhcCCceecccce-EEEecCCC Confidence 99999865442 221 11 1122222223344556799999999996 44555555 No 17 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.72 E-value=4.5e-20 Score=126.67 Aligned_cols=301 Identities=17% Similarity=0.122 Sum_probs=176.9 Q ss_pred Cccch------------------hhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc---c Q lcl|Aclame:pro 1 MALNE------------------GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ---E 59 (430) Q Consensus 1 MAn~~------------------~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~---~ 59 (430) |||.- .--++.+..||+..|+...++..++..+ -.+.|++|+||.=...... . T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r------~i~~G~sv~i~~iG~~tv~~~t~ 74 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVR------TIQNGKSAQFPVMGRTSGVYLAP 74 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc------cccccceEEEecccceeeeeecC Confidence 65541 1123455666777788888888888532 2467999999975444443 2 Q ss_pred CCccCCccCCCCcceEEEEeccccccceEecH--HHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCC Q lcl|Aclame:pro 60 GWDLTDKATGLLELNVAVNMGEPDNDFFQLRA--DDLRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAI 137 (430) Q Consensus 60 g~~~s~~~~d~~e~sV~v~l~~~k~V~~~~t~--keL~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~ 137 (430) |-....+.+++....+.++||+++...|.+.+ +....-+....+.+.+..+||..+|..++.++...++......... 
T Consensus 75 G~~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~ 154 (347) T protein:vir:94 75 GERLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENI 154 (347) T ss_pred CCCcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 55555566677888899999999988877765 2234444555788999999999999999877765333211110000 Q ss_pred -----------CCCCC--C-------chhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhh Q lcl|Aclame:pro 138 -----------GTNTA--D-------AWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYR 197 (430) Q Consensus 138 -----------~~~~~--~-------~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r 197 (430) .+.+. + -+..+-+++..|+++.+|.+ +|.+|++|+.+..|+.. .............++ T Consensus 155 ~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~-~R~~vv~P~~~~~Ll~~-~~~~~~~~~~~~~~~ 232 (347) T protein:vir:94 155 AGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAG-DRYFYTTPDNYSAILAA-LMPNAANYAALIDPE 232 (347) T ss_pred CCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCC-CcEEEeCHHHHHHHhcc-chhhhhhcccccccc Confidence 00000 0 13446688999999999985 69999999999877643 222333333445689 Q ss_pred hccccccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcc Q lcl|Aclame:pro 198 DGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTG 277 (430) Q Consensus 198 ~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaG 277 (430) +|.||+ +.||+. +++.++|.-..+ -+. 
..+..-+.+| T Consensus 233 ~G~Vg~-i~G~~V-~~Sn~lp~~~~t----~~~-------------------------------------~~~~~~~~aG 269 (347) T protein:vir:94 233 TGNIRN-VMGFVV-VEVPHLVQGGAG----ETR-------------------------------------GDDGITIASG 269 (347) T ss_pred ccceEE-EeceEE-EecCcccccccc----ccc-------------------------------------ccCcceecCc Confidence 999996 899985 578888732100 000 0112222333 Q ss_pred eeeccccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeeccc Q lcl|Aclame:pro 278 VKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADD 357 (430) Q Consensus 278 V~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~ 357 (430) -. .-|.-+ ..+. | . +.-+.+.=|+|||+ T Consensus 270 ~~-------------~~~~~~---~~~~---------------------~-----------~----~~~~~~~~l~~h~~ 297 (347) T protein:vir:94 270 QK-------------HAFPAT---ASSD---------------------V-----------K----VTMDNVVGLFSHRS 297 (347) T ss_pred cc-------------cccccc---chhh---------------------h-----------c----ccccceeEEEeehh Confidence 00 000000 0000 0 0 01112345999999 Q ss_pred ceeEEEecccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEE--EeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 358 AIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRI--ALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 358 A~~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rl--dvlyG~~~v~Pe~agv~l~~q~~ 430 (430) |+..+-.-. +.++ .+||.+. ..+.| =..||.+++|||.++++-+. 
.| T Consensus 298 A~~~v~~~~---------------------~~~e--~~r~~~~--~~d~i~~~~~~G~~~~rP~~a~~~~~~-~A 346 (347) T protein:vir:94 298 AVGTVKLRD---------------------LALE--RDRDVDA--QGDLIVGKYAMGHGGLRPEAAGALVFS-PA 346 (347) T ss_pred hhhhhhccc---------------------cccc--chhchhh--HHHHhhhhhhhcCcccccceeEEEEec-CC Confidence 987554221 1111 1122111 11222 24699999999999888776 66 No 18 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.72 E-value=6e-20 Score=125.95 Aligned_cols=298 Identities=15% Similarity=0.146 Sum_probs=180.2 Q ss_pred Cccchhh--------------------HHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc-- Q lcl|Aclame:pro 1 MALNEGQ--------------------IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ-- 58 (430) Q Consensus 1 MAn~~~~--------------------~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~-- 58 (430) |||.-.- -+++.--||+..|+...++..++.. | -.+.|++++||.=-..+.+ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~-r-----~i~~g~s~~~~~iG~~~~~~~ 74 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMV-R-----SISSGKSAQFPVLGRTQAAYL 74 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhccccee-e-----eecccceEEEEeeceeEEEee Confidence 8876222 4577778899999999999999965 2 2456999999975333333 Q ss_pred -cCCccCCccCCCCcceEEEEeccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccee---e Q lcl|Aclame:pro 59 -EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVI---T 132 (430) Q Consensus 59 -~g~~~s~~~~d~~e~sV~v~l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~---~ 132 (430) .|.......+|+....+.++||+.+...|.+.+=| ...-++-..+.+.+.++||..+|..++..+++.+.... . 
T Consensus 75 ~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~ 154 (344) T protein:vir:10 75 APGENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNE 154 (344) T ss_pred ecCCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Confidence 25444444457777789999999999988888744 34445555688889999999999999877655332110 0 Q ss_pred c--------------cCCCCCCCCC----chhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhh Q lcl|Aclame:pro 133 S--------------PDAIGTNTAD----AWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEE 194 (430) Q Consensus 133 ~--------------~~~~~~~~~~----~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~ 194 (430) . .+...+.+.. -++.+-+++..|+++.+|. ++|.+|++|+.+..|.... .+.+....... T Consensus 155 ~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~-~gR~~vv~P~~y~~Ll~~~-~~~~~~~~~~~ 232 (344) T protein:vir:10 155 NITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPS-SDRVFYCDPDSYSAILAAL-MPNAANYAALI 232 (344) T ss_pred ccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCc-cCCEEEeChHHHHHHhhcc-ccccccccccc Confidence 0 0000011111 1344678999999999998 4699999999998775432 22333333456 Q ss_pred hhhhccccccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEE Q lcl|Aclame:pro 195 AYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKIS 274 (430) Q Consensus 195 a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~T 274 (430) .+++|.+++ ++||. ++++.++|....++... -+. |.... 
T Consensus 233 ~~~~G~V~~-v~G~~-V~~Sn~lp~~~~~~~~~-~~t--------------------------------------g~~~~ 271 (344) T protein:vir:10 233 DPEKGSIRN-VMGFE-VVEVPHLTAGGAGTSRE-GTT--------------------------------------GQKHA 271 (344) T ss_pred ceeeeEEEE-EeceE-EEeccccccccCCcccc-ccc--------------------------------------Ccccc Confidence 689999996 89997 67888876211110000 000 11111 Q ss_pred EcceeeccccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeee Q lcl|Aclame:pro 275 FTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFW 354 (430) Q Consensus 275 iaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaF 354 (430) +.+ . .+ ....+ ..+-+.=|+| T Consensus 272 ~~~----------------------~-~~----------------------------------~~~~~--~~s~~~~l~~ 292 (344) T protein:vir:10 272 FPA----------------------T-KS----------------------------------GNDKV--AKDNVIGLFM 292 (344) T ss_pred ccC----------------------C-cc----------------------------------cceee--ecceeEEEee Confidence 111 0 00 00111 1122245899 Q ss_pred cccceeEEEecccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCC Q lcl|Aclame:pro 355 ADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) Q Consensus 355 hr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~ 429 (430) ||+|+..+-.- .+.++.+|.-+-..+.... 
=.+||.+++|||.+|++..=+| T Consensus 293 h~~A~~~v~~~---------------------~~~~e~~r~~~~~~d~i~g--~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 293 HRSAVGTVKLR---------------------DLALERARRANFQADQIIA--KYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred chhhhhhhhhc---------------------cceeecccchhHHHHHHHH--HhhcccceecccceEEEEeecC Confidence 99998554321 1222222211111112222 2459999999999999999888 No 19 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.69 E-value=9.2e-19 Score=119.47 Aligned_cols=300 Identities=15% Similarity=0.118 Sum_probs=179.7 Q ss_pred Cccch-------------------hhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc--- Q lcl|Aclame:pro 1 MALNE-------------------GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--- 58 (430) Q Consensus 1 MAn~~-------------------~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~--- 58 (430) |||.- .--+++...||+..|+...++..++.. | -.+.|++++||.=-..... T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~-r-----~i~~G~sv~~~~iG~~~~~~~~ 74 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMV-R-----TIQNGKSASFPVMGRTKGYYLA 74 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhcccc-c-----cccCcceEEEeeecceeeeeec Confidence 77541 113466677888889999999999954 2 2467999999975444332 Q ss_pred cCCccCCccCCCCcceEEEEeccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceee---- Q lcl|Aclame:pro 59 EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVIT---- 132 (430) Q Consensus 59 ~g~~~s~~~~d~~e~sV~v~l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~---- 132 (430) .|........|+....+.++||+.+...|.+.+-| ....+.-..+.+.+.++||..+|.-++..+...+..... 
T Consensus 75 ~g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~ 154 (347) T protein:vir:88 75 PGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNEN 154 (347) T ss_pred cccCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 23332222235666789999999999998888755 223345557788899999999999998777654332111 Q ss_pred ccC-----CCCCCCCC-----------chhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhh Q lcl|Aclame:pro 133 SPD-----AIGTNTAD-----------AWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAY 196 (430) Q Consensus 133 ~~~-----~~~~~~~~-----------~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~ 196 (430) +.+ ..+.+... -++.+-+++..|+++.+|.+ +|.+|++|+.+..|+.... .......+...+ T Consensus 155 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~-gR~~vv~P~~y~~Ll~~~~-~~~~~~~~~~~~ 232 (347) T protein:vir:88 155 IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAG-DRRFYCAPEDYSAILSALM-PNAANYAALIDP 232 (347) T ss_pred cCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCC-CCEEEeCHHHHHHHhcchh-hhhhhhccccch Confidence 000 00001111 14567889999999999995 6889999999988865332 222233344568 Q ss_pred hhccccccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEc Q lcl|Aclame:pro 197 RDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFT 276 (430) Q Consensus 197 r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~Tia 276 (430) ++|.+|+ +.||+. +++.++|....+. .. .|+.+... 
T Consensus 233 ~~G~vg~-i~G~~V-~~s~nlp~~~~~~----~~--------------------------------------~~~~~~~t 268 (347) T protein:vir:88 233 ETGNIRN-VMGFEV-IEVPHLTVGGAGD----NN--------------------------------------PADGVAPT 268 (347) T ss_pred hcceeee-eccceE-EEeeccccccccc----cc--------------------------------------cccccccc Confidence 9999996 899985 5787777211111 00 01111111 Q ss_pred ceeeccccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecc Q lcl|Aclame:pro 277 GVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWAD 356 (430) Q Consensus 277 GV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr 356 (430) + ....| +..+. . . .-+..+.+.-|+||| T Consensus 269 ~-------------~~~~~-------~~~~~-----------------~-------------~--~~~d~~~~~~l~~~~ 296 (347) T protein:vir:88 269 N-------------QKHIF-------PATAT-----------------G-------------D--DRVAQNNVVGLFNHR 296 (347) T ss_pred c-------------ccccc-------ccccc-----------------c-------------c--cccccCcEEEEEech Confidence 1 00000 00000 0 0 000122246799999 Q ss_pred cceeEEE-ecccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEE--EeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 357 DAIRIVS-QPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRI--ALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 357 ~A~~Lat-rpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rl--dvlyG~~~v~Pe~agv~l~~q~~ 430 (430) +|+..+. 
.+|.+ +.+ |+.+ ...|.| =..||++++|||.++++-..-.| T Consensus 297 ~a~g~v~~~d~~~----------------------e~~--r~~~--~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 297 SAVGTVKLKDMAL----------------------ERA--RRPE--FQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred hhhhheeccccee----------------------eee--echh--hHHHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 9997763 33321 111 2211 112223 24699999999999888888878 No 20 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.68 E-value=1e-18 Score=119.20 Aligned_cols=300 Identities=16% Similarity=0.148 Sum_probs=176.0 Q ss_pred Cccchh-------------------hHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc--- Q lcl|Aclame:pro 1 MALNEG-------------------QIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--- 58 (430) Q Consensus 1 MAn~~~-------------------~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~--- 58 (430) |||.-. 
--+++.--||+..|+...++..++..+ -.+.|++++||.=.....+ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r------ti~~G~sv~~~~iG~~~~~~~~ 74 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVR------SIQSGKSAQFPVLGRTKAAYLQ 74 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhhe------eccccceEEeeeccceeEeeee Confidence 665411 134666778888999999999999642 2467999999975444443 Q ss_pred cCCccCCccCCCCcceEEEEeccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeec--- Q lcl|Aclame:pro 59 EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITS--- 133 (430) Q Consensus 59 ~g~~~s~~~~d~~e~sV~v~l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~--- 133 (430) .|.......+|+.-..+.++||+.+...|.+.+-| ...-+.-..+.+.+..+||..+|.-++..++..+.....+ T Consensus 75 ~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~ 154 (347) T protein:vir:94 75 PGENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNEN 154 (347) T ss_pred cCcCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 24433223346666678899999999888777633 3333444557888999999999999887766533321110 Q ss_pred cCCCC--------------CCC-CC---chhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhh Q lcl|Aclame:pro 134 PDAIG--------------TNT-AD---AWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEA 195 (430) Q Consensus 134 ~~~~~--------------~~~-~~---~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a 195 (430) +...+ ..+ .. -++.+-+++..|+++.+|.+ +|.+|++|+.+..|+...-.... +...... 
T Consensus 155 ~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~-~R~~vv~P~~y~~LLk~~~~~~~-~~~~~~~ 232 (347) T protein:vir:94 155 IAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSS-DRVFYTTPDNYSAILAALMPNAA-NYQALID 232 (347) T ss_pred cccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCC-CCEEEeChHHHHHHHHhhccccc-ccccccc Confidence 00000 000 00 14568899999999999984 69999999999988864322222 2223456 Q ss_pred hhhccccccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEE Q lcl|Aclame:pro 196 YRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISF 275 (430) Q Consensus 196 ~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~Ti 275 (430) +++|.|+. ++||. ++++.++|....+. ++. |.-.++ T Consensus 233 ~~~G~V~~-v~G~~-V~~Sn~~p~~~~~~------~~~------------------------------------~~~~~~ 268 (347) T protein:vir:94 233 PSTGSIRN-VMGFE-VIEVPHLTAGGAGD------NRA------------------------------------EEGVAP 268 (347) T ss_pred cccceeEE-eeceE-EEEcCccccccCcc------ccc------------------------------------cccccc Confidence 88999996 89997 56788876321110 000 111122 Q ss_pred cceeeccccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeec Q lcl|Aclame:pro 276 TGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWA 355 (430) Q Consensus 276 aGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFh 355 (430) ++- ..-|.... +.-|. 
+++ +-+..|+|| T Consensus 269 ~~~-------------~~~~~~~~------------------------~~~y~-----------~d~----~~~~~l~~~ 296 (347) T protein:vir:94 269 TNQ-------------KHAFPDTA------------------------SGDTR-----------VAL----DNVVGLFNH 296 (347) T ss_pred ccc-------------cccccccc------------------------ccccc-----------ccc----cceEEEEec Confidence 220 00000000 00010 011 113589999 Q ss_pred ccceeEEEecccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEE--EeecCceeeCcceeEEEcCCCC Q lcl|Aclame:pro 356 DDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRI--ALWYGVNATRPEAIGVGLPGQT 429 (430) Q Consensus 356 r~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rl--dvlyG~~~v~Pe~agv~l~~q~ 429 (430) ++|...+ -+- .++++.+ ||..... |-| =..||...+|||.+++++..-- T Consensus 297 ~~A~~tv--~~~-------------------~~~~e~~--~~~~~~~--~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 297 RSAVGTV--KLK-------------------DMALERA--RRANFQA--DQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hhhhhhh--hhc-------------------ccceeee--echhhhh--hhhhhhhhhcCcccccceeEEEEecCC Confidence 9987643 221 1222322 2222211 223 2459999999999877766544 No 21 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.68 E-value=4e-19 Score=121.47 Aligned_cols=291 Identities=13% Similarity=0.130 Sum_probs=174.0 Q ss_pred Cccch-------------hhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccccc---CCccC Q lcl|Aclame:pro 1 MALNE-------------GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE---GWDLT 64 (430) Q Consensus 1 MAn~~-------------~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~---g~~~s 64 (430) -+|+. .-.++....+|+..|+...++..+++. |+ .+.|+||+||.=...+.++ |.+.. 
T Consensus 9 ~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~-r~-----i~~G~tv~i~~ig~~~~~~~~~g~~l~ 82 (332) T protein:vir:78 9 LPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRS-YD-----LRGGKSKQFMFTGKLSAGYHTPGTPIV 82 (332) T ss_pred CCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhcccc-cc-----ccccceEEEEeccceeEeeecCCCCCC Confidence 11222 234467777899999999999999964 33 3579999999855554443 33332 Q ss_pred CccCCCCcceEEEEeccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccee---eccCCC-- Q lcl|Aclame:pro 65 DKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVI---TSPDAI-- 137 (430) Q Consensus 65 ~~~~d~~e~sV~v~l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~---~~~~~~-- 137 (430) ++ .|+....+.++||+.+...|.+.+=| ....+....+.+.+..+||.++|..++..+++.+.... +.++.. T Consensus 83 ~~-~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~~~ 161 (332) T protein:vir:78 83 GD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFHV 161 (332) T ss_pred CC-CCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccccccccc Confidence 22 25556778899999999888886522 33445667888999999999999999988877442211 111111 Q ss_pred --CCCCC-Cc---hhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhh-hccc-hhhhhhhhcc-ccccchhh Q lcl|Aclame:pro 138 --GTNTA-DA---WNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRD-IFGR-IPEEAYRDGT-IQRQVAGF 208 (430) Q Consensus 138 --~~~~~-~~---~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~-~~~~-~~~~a~r~g~-igr~~~Gf 208 (430) +.+.. ++ ++-|-+++..|+++.||. ++|.+|++|+.+..|+...-..+ +... .....+++|. 
|+ .+.|| T Consensus 162 ~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~-~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~-~i~G~ 239 (332) T protein:vir:78 162 NIGAGNTNDAQAIVDGFFEAAAVLDERSAPQ-EGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLY-SIAGI 239 (332) T ss_pred ccCCccccCHHHHHHHHHHHHHHHhhcCCCc-cCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeee-EEeee Confidence 11111 12 355789999999999998 46989999999988876432222 2212 2345688886 76 58999 Q ss_pred hHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeecccccccc Q lcl|Aclame:pro 209 DDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNV 288 (430) Q Consensus 209 d~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~ 288 (430) + ++++.++|... |+.. ..+ + ++|++ T Consensus 240 ~-V~~Sn~lp~~~-g~~~---~~~-------------------------------------~----~~~~~--------- 264 (332) T protein:vir:78 240 R-ILKSNNLAGLY-GQDL---SSA-------------------------------------A----VTGEN--------- 264 (332) T ss_pred E-EEecCccccCc-cccc---ccc-------------------------------------c----ccccc--------- Confidence 8 56888887211 1000 000 0 01100 Q ss_pred ccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccC Q lcl|Aclame:pro 289 LAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPA 368 (430) Q Consensus 289 ~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~ 368 (430) .-|.+ ++ +.+.=|+|||+|+.++..- T Consensus 265 ----n~~~~-------------------------------------------~~----~~~~~~~~h~~a~~~v~~~--- 290 (332) T protein:vir:78 265 ----NDYQV-------------------------------------------DA----SALAGLIFHREAAGCIQSV--- 290 (332) T ss_pred ----ccccc-------------------------------------------cc----ccceEEeecccceeeeeee--- Confidence 00100 01 1123489999997665421 Q ss_pred CCCchhhceeEEEecCCCcEEEEEEe-ecccccceEEEEEEeecCceeeCcceeEEEcCC Q lcl|Aclame:pro 369 NHELFAGMKTTSFSIPDVGLNGIFAT-QGDISTLSGLCRIALWYGVNATRPEAIGVGLPG 427 (430) Q Consensus 369 
p~~~~~~~~~~~~~~~~~Glsirv~~-~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~ 427 (430) ++++++.+ +|+.+......+=-..||++++|||.++++.+- T Consensus 291 ------------------~~~~~~t~~~~~~~~~~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 291 ------------------APTIQTTSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred ------------------ccchhhhhcccchhhhHhhhhhhhhhcCceecccceEEEeeC Confidence 11121111 111111111122234699999999999988877 No 22 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.66 E-value=7.8e-18 Score=114.38 Aligned_cols=258 Identities=14% Similarity=0.071 Sum_probs=164.7 Q ss_pred CccchhhHHHHHHHHHH-----HHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccc-cccc---CCccCCccCCCC Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEII-----ETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQES-PTQE---GWDLTDKATGLL 71 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl-----~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~-~~~~---g~~~s~~~~d~~ 71 (430) |||..+++.+++..|++ +.|++.++.+.++.. +++-+ .+.||||+||.-... ..++ |.+ ...+.+. T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~--~~~l~-g~~G~tv~ip~~~~~g~~~~~~~g~~--i~~~~it 75 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADI--DSTLV-GQPGDTLTFPAFTYSGDAQVIAEGEK--IPVDQIG 75 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccc--ccccc-CCCCCEEEEEeeccCCCccccCCCCc--Cchhhcc Confidence 99987776666666654 447888888888743 43323 257999999985322 2222 332 3455777 Q ss_pred cceEEEEeccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHH Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a 149 (430) .....++|++ +.--|.+++.+ +...+...++.+++.+.||+++|.+++..+.. +.+. .......++.|. 
T Consensus 76 ~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~-a~~~-------~~~~~~~~d~i~ 146 (274) T protein:vir:96 76 TSKREAKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKG-ATLT-------VEADITKLDGLQ 146 (274) T ss_pred cceeEEEEEe-eeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCCC-------cCcccccHHHHH Confidence 8889999966 56678999877 34457788999999999999999999976643 2211 112234578899 Q ss_pred HHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch-hhhhhhhccccccchhhhHHHhCCCcceecccccccc Q lcl|Aclame:pro 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~-~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~ 228 (430) +|...|++... ..|.++++|+.+..|..+....+..... ....+|+|.||+ +.||+- +.++++|. T Consensus 147 dA~~~l~d~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~-~~G~~V-i~s~~~p~--------- 212 (274) T protein:vir:96 147 TAIDKFNDEDL---EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGE-ALGAVI-VRSNKLNK--------- 212 (274) T ss_pred HHHHHhcccCC---CceEEEeCHHHHHHHHhcccccccccccccccceeecccce-ecCeeE-EEcCCCCc--------- Confidence 99999998875 3588999999888776543332222211 223455666664 556542 23332220 Q ss_pred eecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEE Q lcl|Aclame:pro 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I 308 (430) T Consensus 213 -------------------------------------------------------------------------------- 212 (274) T protein:vir:96 213 -------------------------------------------------------------------------------- 212 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEecCCCcE Q lcl|Aclame:pro 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) Q 
Consensus 309 ~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Gl 388 (430) . ..++||+.||.++...- + T Consensus 213 ------------------------------------~----t~~l~~~gA~~~~~~~~---------------------~ 231 (274) T protein:vir:96 213 ------------------------------------G----EALLAKKGAVKLITKRD---------------------F 231 (274) T ss_pred ------------------------------------c----eEEEEeCcceeeeecCC---------------------c Confidence 0 02566777777765432 1 Q ss_pred EEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 389 NGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 389 sirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) +++ .++|.+......+.+..||++.++|+-. |.|.-..| T Consensus 232 ~vE--~~Rd~~~~~d~i~~~~~yg~~~~~~~~v-v~~t~~~~ 270 (274) T protein:vir:96 232 FLE--KDRDASRKSTALYSDKHYVAYLYDESKV-VKITKGAG 270 (274) T ss_pred ccc--cccchhhcccEEEEeeEEEEEEEcCccE-EEEEcCcc Confidence 111 1223333445556667799999999985 78877777 No 23 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.64 E-value=9.5e-18 Score=113.92 Aligned_cols=258 Identities=15% Similarity=0.061 Sum_probs=163.6 Q ss_pred CccchhhHHHHHHH-----HHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccc-ccc---cCCccCCccCCCC Q lcl|Aclame:pro 1 MALNEGQIVTLAVD-----EIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQES-PTQ---EGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~~~~~~~~~~~-----~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~-~~~---~g~~~s~~~~d~~ 71 (430) |||..+++.+++.. .+++.+++.++.++++.. +++-++ +.|+||+||+-... ..+ +|.+ ...+.+. 
T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~--~~~l~g-~~G~tv~ip~~~~~g~~~~~~eg~~--i~~~~it 75 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEV--DSTLQG-QPGDTLTFPAFVYSGDAQVVAEGEK--IPTDILE 75 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccc--cccccC-CCCCEEEEEeeccCCCcccccCCCc--ccccccc Confidence 99998776654444 455668999999999843 443333 56999999985332 222 2333 3456777 Q ss_pred cceEEEEeccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHH Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a 149 (430) .++..+++.+ +.-.|++++.+. ...+...++.+++.+.|++++|.+++..+.... +.+ ......+++|. T Consensus 76 ~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~-~~~-------~~~~~~~d~i~ 146 (274) T protein:vir:93 76 TKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-LTV-------NADITKLNGLQ 146 (274) T ss_pred cceeEEEeee-ecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc-ccc-------cccccCHHHHH Confidence 8889999966 455799998773 345677899999999999999999997765422 111 11234678999 Q ss_pred HHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch-hhhhhhhccccccchhhhHHHhCCCcceecccccccc Q lcl|Aclame:pro 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~-~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~ 228 (430) +|...|++.+. ..|.++++|+.+..|..+....|..... 
....+++|.||+ +.||+- +.++++| T Consensus 147 dA~~~l~d~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~V-i~s~~~p---------- 211 (274) T protein:vir:93 147 SAIDKFNDEDL---EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE-ALGAII-VRTNKLE---------- 211 (274) T ss_pred HHHHHhhhccC---CccEEEeCHHHHHHHHhhhhhcccccccccccceeecccce-ecCeeE-EEcCCCC---------- Confidence 99999998765 3578999999998876543222221111 123455566654 555542 2222221 Q ss_pred eecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEE Q lcl|Aclame:pro 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I 308 (430) T Consensus 212 -------------------------------------------------------------------------------- 211 (274) T protein:vir:93 212 -------------------------------------------------------------------------------- 211 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEecCCCcE Q lcl|Aclame:pro 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) Q Consensus 309 ~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Gl 388 (430) . + .-++||+.||.+...+.. T Consensus 212 --------------------------~--------~-----t~~l~~~gai~~~~~~~~--------------------- 231 (274) T protein:vir:93 212 --------------------------A--------G-----TAILAKKGAVKLILKRDF--------------------- 231 (274) T ss_pred --------------------------c--------c-----eEEEEeCCeEEEEecCCc--------------------- Confidence 0 0 025667777776654321 Q ss_pred EEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 389 NGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 389 sirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) .++ .++|.+......+.+..||++.++|+-. 
|.|.-..+ T Consensus 232 ~vE--~~Rd~~~~~d~i~~~~~y~~~~~~~~~~-v~~t~~~~ 270 (274) T protein:vir:93 232 FLE--VARDASTKTTALYSDKHYVAYLYDESKA-VKITKGSG 270 (274) T ss_pred ccc--cccchhhcccEEEEEEEEEEEEEcCCce-EEEeeCcc Confidence 111 2223344445566777799999999985 66665555 No 24 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.63 E-value=2.4e-17 Score=111.66 Aligned_cols=258 Identities=16% Similarity=0.083 Sum_probs=164.5 Q ss_pred CccchhhHHHHHHHH-----HHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccc-ccc---cCCccCCccCCCC Q lcl|Aclame:pro 1 MALNEGQIVTLAVDE-----IIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQES-PTQ---EGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~~~~~~~~~~~~-----vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~-~~~---~g~~~s~~~~d~~ 71 (430) |||..+++.+++..| +++.|++.++.+.++. ++++-++ +.||||+||.-... ..+ +|.+. ..+.+. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~--~d~~l~g-~~G~tv~iP~~~~ig~a~~~~~g~~i--~~~~lt 75 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAE--VDSTLQG-QPGDTLTFPAFVYSGDAQVVAEGEKI--PTDILE 75 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccce--ecccccC-CCCCEEEEeeecCCCccccccCCCcc--chhhcc Confidence 999977766555555 5556889999999994 4544333 47999999984332 111 23333 345677 Q ss_pred cceEEEEeccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHH Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a 149 (430) .++..++|.+ +.--|++++.+. ...+...++.+++...||+++|.+|+..+.... +-+ ......++.|. 
T Consensus 76 ~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~-~~~-------~~~a~~~d~i~ 146 (274) T protein:vir:12 76 TKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-LTV-------NADITKLNGLQ 146 (274) T ss_pred cceeeEEeee-ecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhccc-ccc-------cccccCHHHHH Confidence 7788888866 566799998773 445777899999999999999999997765421 111 11234688899 Q ss_pred HHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch-hhhhhhhccccccchhhhHHHhCCCcceecccccccc Q lcl|Aclame:pro 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~-~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~ 228 (430) +|+..|.+... ..|.++++|+.+..|..+....|..... ....+|+|.||+ +.||.- +.++++|. T Consensus 147 dA~~~lgd~~~---~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~-~~G~~V-i~s~~~p~--------- 212 (274) T protein:vir:12 147 SAIDKFNDEDL---EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE-ALGAII-VRSNKLEA--------- 212 (274) T ss_pred HHHHHhccccc---cccEEEeCHHHHHHHHhhhhhhccccccccccceeccccee-ecCeeE-EEeCCCCc--------- Confidence 99999998753 4688999999998886554333332221 234456666665 666653 33322220 Q ss_pred eecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEE Q lcl|Aclame:pro 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I 308 (430) T Consensus 213 -------------------------------------------------------------------------------- 212 (274) T protein:vir:12 213 -------------------------------------------------------------------------------- 212 (274) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEecCCCcE Q lcl|Aclame:pro 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) Q 
Consensus 309 ~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Gl 388 (430) . ..++|++.||.+...+. + T Consensus 213 ------------------------------------~----t~~l~~~gA~~~~~~~~---------------------~ 231 (274) T protein:vir:12 213 ------------------------------------G----TAILAKKGAVKLILKRD---------------------F 231 (274) T ss_pred ------------------------------------c----eEEEEeccceeeeecCC---------------------c Confidence 0 01455556666543321 1 Q ss_pred EEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 389 NGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 389 sirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) .++ .++|++......+-+..||+++++|+-. |+|.-..+ T Consensus 232 ~vE--~~Rd~~~~~d~i~~~~~y~~~~~~~~~v-v~~t~~~~ 270 (274) T protein:vir:12 232 FLE--VARDASTKTTALYSDKHYVAYLYDESKA-VKITKGSG 270 (274) T ss_pred eec--cccchhhcccEEEeeeEEEEEEEcCCce-EEEEcCCc Confidence 111 2223333444555667799999999995 77776666 No 25 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.62 E-value=2.9e-17 Score=111.24 Aligned_cols=299 Identities=16% Similarity=0.149 Sum_probs=175.4 Q ss_pred Cccch--------------------hhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc-- Q lcl|Aclame:pro 1 MALNE--------------------GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ-- 58 (430) Q Consensus 1 MAn~~--------------------~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~-- 58 (430) ||+.. .--+++.--||+..|+...++..+++. 
| -.+-|+++++|.=-..+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~-r-----~i~~gks~~~~~iG~~~~~~~ 74 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMV-R-----SISSGKSAQFPVLGRTQAAYL 74 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhccccee-e-----eccccceEEEeeecceEEEee Confidence 44432 223466667889999999999999964 2 2345999999975444433 Q ss_pred -cCCccCCccCCCCcceEEEEeccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeec-- Q lcl|Aclame:pro 59 -EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITS-- 133 (430) Q Consensus 59 -~g~~~s~~~~d~~e~sV~v~l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~-- 133 (430) .|.....+..|+.-....++||+.+...|.+.+=| ...-++-..+-+++.++||..+|.-++..+.+.+.....+ T Consensus 75 ~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~ 154 (345) T protein:vir:22 75 APGENLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNE 154 (345) T ss_pred ecCCCCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 24332222223333456699999999888888633 3444555568889999999999999887776543321111 Q ss_pred -c--------------CCCCCCCCC----chhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhh Q lcl|Aclame:pro 134 -P--------------DAIGTNTAD----AWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEE 194 (430) Q Consensus 134 -~--------------~~~~~~~~~----~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~ 194 (430) + +...+.+.+ -|..+-++++.|+++.+|.+ +|.+|++|+.+..|.... .+......... 
T Consensus 155 ~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~-~R~~vv~P~~y~~Ll~~~-~~~~~~~~~~~ 232 (345) T protein:vir:22 155 NIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAA-DRVFYCDPDSYSAILAAL-MPNAANYAALI 232 (345) T ss_pred cccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCcc-CCEEEeChHHHHHHhccc-ccccccccccc Confidence 0 010011111 16668899999999999995 699999999998775433 22233333456 Q ss_pred hhhhccccccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEE Q lcl|Aclame:pro 195 AYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKIS 274 (430) Q Consensus 195 a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~T 274 (430) .+++|.+++ ++||. ++++.++|....++..+ .+ .++... T Consensus 233 ~~~~G~V~~-i~G~~-V~~sn~lp~~~~~~~~~---~~------------------------------------~~~~~~ 271 (345) T protein:vir:22 233 DPEKGSIRN-VMGFE-VVEVPHLTAGGAGTARE---GT------------------------------------TGQKHV 271 (345) T ss_pred ccccceEEE-EeceE-EEecccccccccCcccc---Cc------------------------------------cccccc Confidence 689999996 89997 77888777321111000 00 122222 Q ss_pred EcceeeccccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeee Q lcl|Aclame:pro 275 FTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFW 354 (430) Q Consensus 275 iaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaF 354 (430) +... 
++ ....++ ..+.+.=|+| T Consensus 272 ~~~~-----------------------~g---------------------------------~~~~~~--~~~~~~~l~~ 293 (345) T protein:vir:22 272 FPAN-----------------------KG---------------------------------EGNVKV--AKDNVIGLFM 293 (345) T ss_pred cccc-----------------------cc---------------------------------ceeeee--ccCceEEEEE Confidence 2210 00 000000 1122345999 Q ss_pred cccceeEEEecccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCC Q lcl|Aclame:pro 355 ADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) Q Consensus 355 hr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~ 429 (430) ||+|+..+-.- + ++++.+|.=+-..+...+ =.+||.+++|||.++++..--+ T Consensus 294 h~~A~~~v~~~-~--------------------~~~e~~r~~~~~~d~I~~--~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 294 HRSAVGTVKLR-D--------------------LALERARRANFQADQIIA--KYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred ehhheeeeeee-c--------------------ceeeeeechhHHHHHHHH--HHhcCCcccccceeEEEEEeeC Confidence 99987644321 1 222222211111112122 2459999999999999987766 No 26 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.60 E-value=2.6e-17 Score=111.51 Aligned_cols=258 Identities=19% Similarity=0.153 Sum_probs=151.9 Q ss_pred CccchhhHHHHHHHH-----HHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccc----cccCCccCCccCCCC Q lcl|Aclame:pro 1 MALNEGQIVTLAVDE-----IIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESP----TQEGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~~~~~~~~~~~~-----vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~----~~~g~~~s~~~~d~~ 71 (430) |||...++.+++..| +++.|++.++.+.++...+.+ + .+.||||+||+-.... ..+|.+. ..+.+. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l--~-g~~G~ti~iP~~~~~gda~~~~eg~~i--~~~~lt 75 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTL--Q-GQPGNTLKFPAFTYIGDAADVAEGGEI--SLDKIG 75 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhcccccccccc--c-cCCCCEEEEeeeccCccccccCCCCcc--ChhhcC Confidence 999877766555444 445688899999988543333 2 2479999999833221 1234333 345666 Q ss_pred cceEEEEeccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHH Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a 149 (430) .+...+++.+ +.-.|.+++.+ +...+...++.+++...||+++|.+|+..+.. +. .. ......++++. T Consensus 76 ~~~~~~~i~~-~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~-~~------~~--~~~~~~~d~i~ 145 (272) T protein:vir:36 76 TTTKSVTIKK-AAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKT-TS------QT--VSTKANVDGVQ 145 (272) T ss_pred CcceeEeeeh-hhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc-cc------cc--ccccccHHHHH Confidence 6778888854 55578888866 34467778899999999999999999865542 11 11 12334678899 Q ss_pred HHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccccccce Q lcl|Aclame:pro 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGIT 229 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~t 229 (430) +|+..|.+.+.+. |.++++|..+..|.++....+.........+++|.||+ +.||+ ++.++++|. +++.... 
T Consensus 146 ~A~~~lgd~~~~~---~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~-~~G~~-Vv~s~~~p~---~~~~~~~ 217 (272) T protein:vir:36 146 AALDIFNDEDAQA---YVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYAD-VLGAQ-IVRSKKLAE---GSALMFK 217 (272) T ss_pred HHHHHhhhcCCCc---eEEEEcHHHHHHHhcccccccccccccccceeeeccce-ecCee-EEEeCCCCC---CceeEEE Confidence 9999999998874 77999999998887654433333344567899999997 99998 567888773 2221111 Q ss_pred ---ecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCcee Q lcl|Aclame:pro 230 ---VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHV 306 (430) Q Consensus 230 ---V~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v 306 (430) ..||-.... ....... ..|- -.+..|.|.- +++|.+--.-...-+ T Consensus 218 ~~~~~gA~~~~~------~~~~~vE-~~R~---------~~~~~d~i~~----------------~~~y~~~v~~~~~vv 265 (272) T protein:vir:36 218 IVSNSPALKLVL------KRGVQVE-TDRD---------IVTKTTVITA----------------DEHYAAYLYDLTKVV 265 (272) T ss_pred EEecccceeeee------cCCcccc-cccc---------hhhcCcEEEE----------------EEEEEEEEEcCccEE Confidence 122211000 0000000 0010 0111222221 122322211122234 Q ss_pred EEeeccc Q lcl|Aclame:pro 307 EITPKPV 313 (430) Q Consensus 307 ~I~p~~v 313 (430) .++-+.+ T Consensus 266 ~~t~~g~ 272 (272) T protein:vir:36 266 NITFTGV 272 (272) T ss_pred EEeecCC Confidence 4444444 No 27 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=99.59 E-value=2e-17 Score=112.08 Aligned_cols=284 Identities=12% Similarity=0.026 Sum_probs=156.3 Q ss_pred Cc--cchhhH-----HHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccC-CccCCccCCCCc Q lcl|Aclame:pro 1 MA--LNEGQI-----VTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEG-WDLTDKATGLLE 72 (430) Q Consensus 1 MA--n~~~~~-----~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g-~~~s~~~~d~~e 72 (430) |+ |+-++. 
-++-.++++..|++.++-.+++++ .+ .+.||||+||......+++= .......+++.. T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~-~d-----~g~GDtV~InsIg~~tV~dY~~~~~i~~d~ltt 74 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARV-VD-----FPDGDKLTIPSVGTPVVRSRPEQGDFTFDNLDT 74 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcc-cc-----cCCCCeEEeccccccccccccCCCCcccccCCC Confidence 66 444443 367778999999999987776643 11 15799999998777666652 223345678888 Q ss_pred ceEEEEeccccccceEecHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeec---------c---CCCC Q lcl|Aclame:pro 73 LNVAVNMGEPDNDFFQLRADDLRD--ETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITS---------P---DAIG 138 (430) Q Consensus 73 ~sV~v~l~~~k~V~~~~t~keL~~--~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~---------~---~~~~ 138 (430) ..+.++||+.|+-.|.+++ |+.. .++...+.+.+..+|+..+|.-+++..+..+..+-.. | ...+ T Consensus 75 ~~~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~g 153 (322) T protein:vir:31 75 GEISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTG 153 (322) T ss_pred ceEEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccC Confidence 8999999999999999999 6531 2344556677778888889888877666544211111 1 1223 Q ss_pred CCCCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHH--hhhhhhhc-------cchhhhhhhhccccccchhhh Q lcl|Aclame:pro 139 TNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGY--DLTKRDIF-------GRIPEEAYRDGTIQRQVAGFD 209 (430) Q Consensus 139 ~~~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~--~~~~l~~~-------~~~~~~a~r~g~igr~~~Gfd 209 (430) ++....|+.+-..+..|++..+|+ ++|.+|++|+.++.|.. ..+.+... 
...+.+.+| .+|+ ++||| T Consensus 154 t~~~~ay~~lv~l~~kLdkanVP~-~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~--~Vg~-~~GF~ 229 (322) T protein:vir:31 154 TDQTMDVTDFSRVNYVMTQSKMPM-GGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQ--FVRS-VYGID 229 (322) T ss_pred CCchhhHHHHHHHHHHhccccCCC-CCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHH--HHHH-Hhcee Confidence 444567999999999999999999 57999999998875522 11111111 111222222 3886 89998 Q ss_pred HHHhCCCcce--ecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccc Q lcl|Aclame:pro 210 DVLRSPKLPV--LTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKN 287 (430) Q Consensus 210 ~~~~~~~~~~--~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~ 287 (430) ++.|.+++. ++-=.+....++++++.. .-.++.-.+ +-.++-+--++-|. T Consensus 230 -V~~SN~l~~~~~~i~aG~d~~~t~ag~~n-~f~~~~~~~--------------------------~~~~~~~~~~l~~~ 281 (322) T protein:vir:31 230 -LFVSNLLADANETINAGGDARSTTAGKCN-MFMNVSDMG--------------------------LLPFVVAWKEMPTT 281 (322) T ss_pred -eeeeccccccccccccCcccccccceeec-ccccccchh--------------------------hhhhhhHhhhhhhh Confidence 667777641 110011112233333321 111111111 11111111122221 Q ss_pred cccccc-----eEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEE Q lcl|Aclame:pro 288 VLAQDA-----TFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNI 342 (430) Q Consensus 288 ~~~~l~-----~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv 342 (430) --..++ -++++.+...+- ++|.- +-.-.|+.+.+|+ T Consensus 282 e~~r~~~~~~d~~~~~~~~g~g~--~r~e~-----------------l~~~~a~~~~~~~ 322 (322) T protein:vir:31 282 KSFIDDYNDDLNTATTARWGNGL--VRDEN-----------------LVCVLANADKVTF 322 (322) T ss_pred hcccCccccccceeeeeeeccee--ecccc-----------------eEEEEeccccccC Confidence 111111 134443332221 01111 1112344556666 No 28 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.56 E-value=5.5e-16 
Score=104.23 Aligned_cols=322 Identities=15% Similarity=0.101 Sum_probs=177.6 Q ss_pred Cccch------------hhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccccc---CCccCC Q lcl|Aclame:pro 1 MALNE------------GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE---GWDLTD 65 (430) Q Consensus 1 MAn~~------------~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~---g~~~s~ 65 (430) ++|.. .--++..--||+..|+...++..+++. | -.+-|++++|+.=-..+.+. |....+ T Consensus 11 ~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~-r-----ti~~Gksv~f~~iG~~t~~~~t~G~~i~~ 84 (375) T protein:vir:10 11 RSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTK-R-----TLKNGKSLQFIYTGRMTSSFHTPGTPILG 84 (375) T ss_pred ccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccc-c-----ccccCceEEEEeeeeeEEeeecCCcCcCC Confidence 22211 223466667888889999999988864 2 23459999999754444433 322211 Q ss_pred c-cCCCCcceEEEEeccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccC------- Q lcl|Aclame:pro 66 K-ATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPD------- 135 (430) Q Consensus 66 ~-~~d~~e~sV~v~l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~------- 135 (430) + -.|+--....++||+.+...|.+.+=| ...-++-..+.+.+.++||..+|..++..+++.+.....+.+ T Consensus 85 ~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~~~~~~G 164 (375) T protein:vir:10 85 NADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSATNFVEPG 164 (375) T ss_pred ccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Confidence 1 113222335599999999888888633 344455566888999999999999999888764422211000 Q ss_pred -C---CCCCC-----CC---chhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhh--hhhhccchhhhhhhhccc Q lcl|Aclame:pro 136 -A---IGTNT-----AD---AWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLT--KRDIFGRIPEEAYRDGTI 201 (430) Q Consensus 136 -~---~~~~~-----~~---~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~--~l~~~~~~~~~a~r~g~i 201 (430) . .+++. .+ -++.+-+++..|+++.||. 
++|.+|++|+.+..|+.+.- .+.+........+.+|.+ T Consensus 165 g~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~-~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~~~~~~g~v 243 (375) T protein:vir:10 165 GTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSS-QGRCAVLNPRQYYALIQDIGSNGLVNRDVQGSALQSGNGV 243 (375) T ss_pred cceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCC-CCCEEEeChHHHHHHHhcCCccceeeecccccceeccceE Confidence 0 00000 11 1466889999999999997 56999999999988775432 222323334456778888 Q ss_pred cccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeec Q lcl|Aclame:pro 202 QRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFL 281 (430) Q Consensus 202 gr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v 281 (430) ++ ++||. ++++.++|..+. + +... |+.. . .+..-+.++.+.+..- T Consensus 244 ~~-i~Gv~-V~~Sn~lP~~~~-~--~~~~-g~~~-~-------------------------~~a~~~~~~~~~~~~~--- 288 (375) T protein:vir:10 244 IE-IAGIH-IYKSMNIPFLGK-Y--GVKY-GGTT-G-------------------------ETSPGNLGSHIGPTPE--- 288 (375) T ss_pred EE-EeceE-EEEecccccccc-c--cccc-cccc-c-------------------------ccchhhhhccccccCC--- Confidence 85 89997 778988885422 1 1111 1110 0 0000111222221110 Q ss_pred cccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeE Q lcl|Aclame:pro 282 GQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRI 361 (430) Q Consensus 282 ~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~L 361 (430) . ..+ ++...+.=...+.. 
.+-+.-|+|||+|..- T Consensus 289 -----------~-~~~---------------------------------~~g~~~~y~~d~~~-~~~~~~~~~~~~A~g~ 322 (375) T protein:vir:10 289 -----------N-ANA---------------------------------TGGVNNDYGTNAEL-GAKSCGLIFQKEAAGV 322 (375) T ss_pred -----------c-cee---------------------------------eccccccccccccc-cCceEEEEEchhheee Confidence 0 000 00000000000100 0113559999999765 Q ss_pred EEecccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 362 VSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 362 atrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) + .-+.+ ..+.+ +=..++.+|.| ....+ +.||...+|||+|+.+=.+=+| T Consensus 323 v-~~~~~------~~~~~-------~~~~~~~~q~~----~i~~~--~a~G~~~lrp~~av~l~~~~~~ 371 (375) T protein:vir:10 323 V-EAIGP------QVQVT-------NGDVSVIYQGD----VILGR--MAMGADYLNPAAAVELYIGATA 371 (375) T ss_pred e-eeecc------ccccc-------cchhhheeeee----eeeee--eeeccCccCceeEEEEecCcCc Confidence 5 11110 00000 11123334443 33333 4599999999999888777666 No 29 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.51 E-value=5.9e-16 Score=104.08 Aligned_cols=262 Identities=15% Similarity=0.065 Sum_probs=153.7 Q ss_pred CccchhhHHHHHHHH-----HHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccc-cc---cCCccCCccCCCC Q lcl|Aclame:pro 1 MALNEGQIVTLAVDE-----IIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESP-TQ---EGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~~~~~~~~~~~~-----vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~-~~---~g~~~s~~~~d~~ 71 (430) |||..+++.+++..| +++.|.+.++.+.++..-+++ ++ +.||||+||+..... .+ +|.+ ...+.+. 
T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l--~g-~~G~tv~iP~~~~ig~a~~~~~g~~--i~~~~lt 75 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTL--VG-QPGDTLTFPAFIYSGDAKVVAEGEK--IPTDILE 75 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccc--cC-CCCCEEEeeeecCCCccccccCCCc--cchhhcc Confidence 999877766555554 445588888888887432222 22 569999999854322 22 2333 3455777 Q ss_pred cceEEEEeccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHH Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a 149 (430) ..+..++|.+ +.-.|.+++.+. ...+...++.+++...||+++|.+|+..+.... +.+ .+ ....++.|. T Consensus 76 ~~~~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~-~~~------~~-~~~~~d~i~ 146 (274) T protein:vir:95 76 TKKREAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK-LTV------EA-DITKLTGLQ 146 (274) T ss_pred cceeEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc-ccc------cc-cccCHHHHH Confidence 8889999965 566799998773 345788899999999999999999997776522 221 11 224588899 Q ss_pred HHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccc-hhhhhhhhccccccchhhhHHHhCCCcceecccccccc Q lcl|Aclame:pro 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~-~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~ 228 (430) +|...|.+... ..|.++++|+.+..|..+....+.... .....+|+|.||+ +.||+- +.+++++. 
++ ++ T Consensus 147 ~A~~~lgd~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~V-i~s~~~~~---~t--~~ 216 (274) T protein:vir:95 147 TAIDKFNDEDL---EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGE-ALGAVI-VRSNKLEA---GT--AI 216 (274) T ss_pred HHHHHhccccc---cccEEEeCHHHHHHHHhhccccccccccccccceeccccce-ecCeEE-EEeCCCCC---ce--EE Confidence 99999998764 468899999999988765433333322 2346789999997 999985 56777752 22 22 Q ss_pred eecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEE Q lcl|Aclame:pro 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I 308 (430) .. |.+.++.. ........ ..|- -.+.-|.+..--+|.++-... .-+|.-....++++. T Consensus 217 l~-~~gA~~~~----~~~~~~vE-~~Rd---------~~~~~d~i~~~~~y~~~~~~~-------~~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 217 LA-KKGAVKLI----TKRDFFLE-TDRD---------PSTKTTALYSDKHYVAYLYDE-------SKAVKITKGSGSLEM 274 (274) T ss_pred EE-eccceeee----ecCCcccc-cccc---------cccccCEEEEeEEEEEEEEcC-------CcEEEEEcCCccccC Confidence 21 11111100 00000000 0010 122334444443333221111 011111122233333 No 30 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.51 E-value=5.9e-16 Score=104.08 Aligned_cols=262 Identities=15% Similarity=0.065 Sum_probs=153.7 Q ss_pred CccchhhHHHHHHHH-----HHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccc-cc---cCCccCCccCCCC Q lcl|Aclame:pro 1 MALNEGQIVTLAVDE-----IIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESP-TQ---EGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~~~~~~~~~~~~-----vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~-~~---~g~~~s~~~~d~~ 71 (430) |||..+++.+++..| +++.|.+.++.+.++..-+++ ++ +.||||+||+..... .+ +|.+ ...+.+. 
T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l--~g-~~G~tv~iP~~~~ig~a~~~~~g~~--i~~~~lt 75 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTL--VG-QPGDTLTFPAFIYSGDAKVVAEGEK--IPTDILE 75 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccc--cC-CCCCEEEeeeecCCCccccccCCCc--cchhhcc Confidence 999877766555554 445588888888887432222 22 569999999854322 22 2333 3455777 Q ss_pred cceEEEEeccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHH Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a 149 (430) ..+..++|.+ +.-.|.+++.+. ...+...++.+++...||+++|.+|+..+.... +.+ .+ ....++.|. T Consensus 76 ~~~~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~-~~~------~~-~~~~~d~i~ 146 (274) T protein:vir:96 76 TKKREAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK-LTV------EA-DITKLTGLQ 146 (274) T ss_pred cceeEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc-ccc------cc-cccCHHHHH Confidence 8889999965 566799998773 345788899999999999999999997776522 221 11 224588899 Q ss_pred HHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccc-hhhhhhhhccccccchhhhHHHhCCCcceecccccccc Q lcl|Aclame:pro 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~-~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~ 228 (430) +|...|.+... ..|.++++|+.+..|..+....+.... .....+|+|.||+ +.||+- +.+++++. 
++ ++ T Consensus 147 ~A~~~lgd~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~V-i~s~~~~~---~t--~~ 216 (274) T protein:vir:96 147 TAIDKFNDEDL---EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGE-ALGAVI-VRSNKLEA---GT--AI 216 (274) T ss_pred HHHHHhccccc---cccEEEeCHHHHHHHHhhccccccccccccccceeccccce-ecCeEE-EEeCCCCC---ce--EE Confidence 99999998764 468899999999988765433333322 2346789999997 999985 56777752 22 22 Q ss_pred eecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEE Q lcl|Aclame:pro 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I 308 (430) .. |.+.++.. ........ ..|- -.+.-|.+..--+|.++-... .-+|.-....++++. T Consensus 217 l~-~~gA~~~~----~~~~~~vE-~~Rd---------~~~~~d~i~~~~~y~~~~~~~-------~~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 217 LA-KKGAVKLI----TKRDFFLE-TDRD---------PSTKTTALYSDKHYVAYLYDE-------SKAVKITKGSGSLEM 274 (274) T ss_pred EE-eccceeee----ecCCcccc-cccc---------cccccCEEEEeEEEEEEEEcC-------CcEEEEEcCCccccC Confidence 21 11111100 00000000 0010 122334444443333221111 011111122233333 No 31 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.50 E-value=7.9e-16 Score=103.37 Aligned_cols=262 Identities=13% Similarity=0.057 Sum_probs=156.0 Q ss_pred CccchhhHHHHHHHH-----HHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcc-cccc---cCCccCCccCCCC Q lcl|Aclame:pro 1 MALNEGQIVTLAVDE-----IIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQE-SPTQ---EGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~~~~~~~~~~~~-----vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~-~~~~---~g~~~s~~~~d~~ 71 (430) |||..+++.+++..| +++.+++.++.+.++.. +++-++ +.|+||+||+-.. ...+ +|.+ ...+.+. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~--d~~l~g-~~G~tv~iP~~~~~g~a~~~~~g~~--i~~~~lt 75 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEV--DSTLQG-QPGDTLTFPAFVYSGDAQVVAEGEK--IPTDILE 75 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhccccee--cccccC-CCCCEEEEeeecCCCccccccCCCc--ccccccc Confidence 999977766555554 45558888888888843 433332 4699999997322 1222 2433 3455777 Q ss_pred cceEEEEeccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHH Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a 149 (430) ..+..+++.+. .--|++++++. ...+...++.+++.+.||+++|.+++..+..... .+ .....+++.|. T Consensus 76 ~~~~~~~i~~~-~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~-~~-------~~~~~~~d~i~ 146 (274) T protein:vir:94 76 TKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-TV-------NADITKLNGLQ 146 (274) T ss_pred cceeEEEeeee-cceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc-cc-------cccccCHHHHH Confidence 77888999664 45689998773 4457788999999999999999999977654222 11 11234578999 Q ss_pred HHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch-hhhhhhhccccccchhhhHHHhCCCcceecccccccc Q lcl|Aclame:pro 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~-~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~ 228 (430) +|+..|++.+.. .|.++++|+.+..|..+....|..... ....+++|.||+ +.||+ ++.++++|. 
+ +++ T Consensus 147 dA~~~l~d~~~~---~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~p~---~--t~~ 216 (274) T protein:vir:94 147 SAIDKFNDEDLE---PMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE-ALGAI-IVRTNKLEA---G--TAI 216 (274) T ss_pred HHHHHhhccCCC---ceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccce-ecCee-EEEcCCCCc---c--eEE Confidence 999999987653 578999999999887654333332222 345689999997 89998 557777872 2 222 Q ss_pred eecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEE Q lcl|Aclame:pro 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I 308 (430) .+ |.+..+.. ........ ..|- -.+.-|.+..--+|.+.-.. . .-+|....++++++. T Consensus 217 l~-~~gA~~~~----~~~~~~vE-~~Rd---------~~~~~d~i~~~~~y~~~~~~------~-~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 217 LA-KKGAVKLI----LKRDFFLE-VARD---------ASTKTTALYSDKHYVAYLYD------E-SKAVKITKGSGSLEM 274 (274) T ss_pred EE-eCcceEee----ecCCceec-cccc---------hhhcccEEEEEEEEEEEEEc------C-CceEEEecCcccccC Confidence 22 11111110 00000000 0010 12334555555544431111 0 112222223344444 No 32 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.50 E-value=7.9e-16 Score=103.37 Aligned_cols=262 Identities=13% Similarity=0.057 Sum_probs=156.0 Q ss_pred CccchhhHHHHHHHH-----HHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcc-cccc---cCCccCCccCCCC Q lcl|Aclame:pro 1 MALNEGQIVTLAVDE-----IIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQE-SPTQ---EGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~~~~~~~~~~~~-----vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~-~~~~---~g~~~s~~~~d~~ 71 (430) |||..+++.+++..| +++.+++.++.+.++.. +++-++ +.|+||+||+-.. ...+ +|.+ ...+.+. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~--d~~l~g-~~G~tv~iP~~~~~g~a~~~~~g~~--i~~~~lt 75 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEV--DSTLQG-QPGDTLTFPAFVYSGDAQVVAEGEK--IPTDILE 75 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhccccee--cccccC-CCCCEEEEeeecCCCccccccCCCc--ccccccc Confidence 999977766555554 45558888888888843 433332 4699999997322 1222 2433 3455777 Q ss_pred cceEEEEeccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHH Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a 149 (430) ..+..+++.+. .--|++++++. ...+...++.+++.+.||+++|.+++..+..... .+ .....+++.|. T Consensus 76 ~~~~~~~i~~~-~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~-~~-------~~~~~~~d~i~ 146 (274) T protein:vir:97 76 TKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-TV-------NADITKLNGLQ 146 (274) T ss_pred cceeEEEeeee-cceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc-cc-------cccccCHHHHH Confidence 77888999664 45689998773 4457788999999999999999999977654222 11 11234578999 Q ss_pred HHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch-hhhhhhhccccccchhhhHHHhCCCcceecccccccc Q lcl|Aclame:pro 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~-~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~ 228 (430) +|+..|++.+.. .|.++++|+.+..|..+....|..... ....+++|.||+ +.||+ ++.++++|. 
+ +++ T Consensus 147 dA~~~l~d~~~~---~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~p~---~--t~~ 216 (274) T protein:vir:97 147 SAIDKFNDEDLE---PMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGE-ALGAI-IVRTNKLEA---G--TAI 216 (274) T ss_pred HHHHHhhccCCC---ceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccce-ecCee-EEEcCCCCc---c--eEE Confidence 999999987653 578999999999887654333332222 345689999997 89998 557777872 2 222 Q ss_pred eecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEE Q lcl|Aclame:pro 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I 308 (430) .+ |.+..+.. ........ ..|- -.+.-|.+..--+|.+.-.. . .-+|....++++++. T Consensus 217 l~-~~gA~~~~----~~~~~~vE-~~Rd---------~~~~~d~i~~~~~y~~~~~~------~-~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 217 LA-KKGAVKLI----LKRDFFLE-VARD---------ASTKTTALYSDKHYVAYLYD------E-SKAVKITKGSGSLEM 274 (274) T ss_pred EE-eCcceEee----ecCCceec-cccc---------hhhcccEEEEEEEEEEEEEc------C-CceEEEecCcccccC Confidence 22 11111110 00000000 0010 12334555555544431111 0 112222223344444 No 33 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.49 E-value=1e-15 Score=102.79 Aligned_cols=265 Identities=14% Similarity=0.107 Sum_probs=155.8 Q ss_pred CccchhhHH-----HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccc-cc---cCCccCCccCCCC Q lcl|Aclame:pro 1 MALNEGQIV-----TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESP-TQ---EGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~~~~~~-----~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~-~~---~g~~~s~~~~d~~ 71 (430) |||...++. +++-+.+++.|++.+++++++.. +++-+ .+.||||+||.-.... .+ +|.+ ...+++. 
T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~--~~~l~-g~~G~tv~ip~~~~~g~a~~~~~g~~--i~~~~lt 75 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPI--DNSLE-GQPGSEITVPKYKYIGDAQDVAEGAA--IDYSALE 75 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhccccee--ccccc-CCCCCEEEEeeeccCCcceeecCCCc--Ccccccc Confidence 997544433 44555566779999999998844 32222 2579999999854322 12 1332 3445777 Q ss_pred cceEEEEeccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCC-CchhHH Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTA-DAWNFV 148 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~-~~~~d~ 148 (430) ..+..++|++.+ --|++++.+. ...+..+++.+++.+.|+.++|.+|+..+... .+.. .++....+. ..+..| T Consensus 76 ~~~~~~~i~~~~-~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a-~~~~--~~~~t~~~~~~~~~~~ 151 (278) T protein:vir:80 76 TESVKHGIKKAG-KGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTT-TLEV--KGAINIGLIDKIENTF 151 (278) T ss_pred cceeeEeeehhh-ccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cccc--ccccccchhhhHHHHH Confidence 888999997754 4799998773 44678899999999999999999999877642 2221 122212222 225678 Q ss_pred HHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhcc-chhhhhhhhccccccchhhhHHHhCCCcceeccccccc Q lcl|Aclame:pro 149 ADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFG-RIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATG 227 (430) Q Consensus 149 a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~-~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~ 227 (430) .+++..|++..+|. . +.++++|+.+..|..+....+... ......+++|.||+ +.||+. +.++++|. 
+ ++ T Consensus 152 ~da~~~l~~~~~~~-~-~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~-~~G~~V-i~s~~~p~---~--t~ 222 (278) T protein:vir:80 152 TDAPDAIEDESITT-T-GVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGE-LLGWEI-VRTKKLAD---G--NA 222 (278) T ss_pred HHHHHhhcccCCCc-c-cEEEECHHHHHHHHhhhhhhccccccccccceeecccee-ecceeE-EEcCCCCc---c--eE Confidence 89999999999997 3 458999999988875543333322 22345689999997 999985 56777763 2 23 Q ss_pred ceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCc Q lcl|Aclame:pro 228 ITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGT 304 (430) Q Consensus 228 ~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~ 304 (430) +.+ +.+.++. ......... +.|- -.+.-|.+..--+|+++-... .-.++....+|+ T Consensus 223 ~l~-~~gAi~~----~~~~~~~vE-~~Rd---------~~~~~d~i~~~~~yg~~v~~~------~~~v~it~~a~~ 278 (278) T protein:vir:80 223 LAV-KAGALKT----FLKRNLLAE-SGRD---------MDHKLTKFNADQHYAVALVDE------TKAVKVVPVAGN 278 (278) T ss_pred EEE-eccceee----eecCCcccc-cccc---------hhhccceeeeeeEEEEEEEcC------cceEEEeeccCC Confidence 332 1111110 000000000 0011 123345555554454432221 112232223344 No 34 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.45 E-value=1.6e-14 Score=96.21 Aligned_cols=299 Identities=14% Similarity=0.034 Sum_probs=170.7 Q ss_pred Cccc--------------hhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccC-CccCC Q lcl|Aclame:pro 1 MALN--------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEG-WDLTD 65 (430) Q Consensus 1 MAn~--------------~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g-~~~s~ 65 (430) |++- ..--++.+.-||+..|+...++..++.. |+ .+.|++++||.=-......- ..... 
T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~-rt-----i~~gkS~q~~~iG~~~~~~~~~G~~l 74 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDV-QE-----VVGTNSVSNKYIGETELQVLSPGKSP 74 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee-ee-----ecccceEEeeeeeeeEEeeeccCccc Confidence 4321 1123466667888889998888888854 22 56899999998644443221 11233 Q ss_pred ccCCCCcceEEEEeccccccceEecH-HH-hccHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cceeeccCCC-C-- Q lcl|Aclame:pro 66 KATGLLELNVAVNMGEPDNDFFQLRA-DD-LRDET-AYRHRIQSAARKLANNVELKVANMAAEMG-SLVITSPDAI-G-- 138 (430) Q Consensus 66 ~~~d~~e~sV~v~l~~~k~V~~~~t~-ke-L~~~~-~~~r~l~pAm~~LAn~Id~dl~~~~~~~a-s~~~~~~~~~-~-- 138 (430) ..+++.-..+.++||.-+...+.+.+ +| +..-+ .-..+-+.+-++||...|+-++..++..+ ++........ + T Consensus 75 d~~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~ 154 (364) T protein:vir:10 75 DASPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAG 154 (364) T ss_pred CCCCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccC Confidence 45667777899999998876666554 22 22222 12344577888999999999887665432 1111110000 0 Q ss_pred -------CCCCCc--------hhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch--hhhhhhhccc Q lcl|Aclame:pro 139 -------TNTADA--------WNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI--PEEAYRDGTI 201 (430) Q Consensus 139 -------~~~~~~--------~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~--~~~a~r~g~i 201 (430) ....++ ..-|-++...|+++.+|.+ +|.++++|+-+..|+.. ..+.+.+.. 
....+++|.+ T Consensus 155 ~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~-~R~~vv~P~~y~~Ll~~-~~lvn~d~~~~~~~~~~~G~v 232 (364) T protein:vir:10 155 HGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTS-ELCGLMPWTAFNCLRDA-DRIVDKSYTIAASDNTVDGFV 232 (364) T ss_pred CcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCcc-ccEEEeChHHHHHHhcC-CccccccccccCCCcccccee Confidence 001111 1235578899999999995 59999999999887653 223322211 3456899999 Q ss_pred cccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeec Q lcl|Aclame:pro 202 QRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFL 281 (430) Q Consensus 202 gr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v 281 (430) +. +.||. ++++.++|.. +++ .++.|... +=-++-+| T Consensus 233 ~~-v~Gv~-Vv~Sn~lP~~-~~~---~~~t~~~t----------------------------------~h~ls~~~---- 268 (364) T protein:vir:10 233 LK-SWNTP-IVPSNRFPKL-SDN---TEGTGNTK----------------------------------HHKLSNAG---- 268 (364) T ss_pred EE-EeceE-EEeccccccc-ccc---cccccccc----------------------------------cccccccc---- Confidence 96 99997 7889888733 111 11111100 00011111 Q ss_pred cccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeE Q lcl|Aclame:pro 282 GQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRI 361 (430) Q Consensus 282 ~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~L 361 (430) ....|-|+++. 
+...=+.|||+|..- T Consensus 269 ---------~g~~y~v~~d~---------------------------------------------~~~~~~~f~~~Al~t 294 (364) T protein:vir:10 269 ---------NGNRYDVTAGQ---------------------------------------------TSAQAVLFTQDALLV 294 (364) T ss_pred ---------CCccccccccc---------------------------------------------ceeEEEEEecceEEE Confidence 11122233221 112337899986553 Q ss_pred EEecccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEEE--eecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 362 VSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIA--LWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 362 atrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rld--vlyG~~~v~Pe~agv~l~~q~~ 430 (430) .-+- +++.++++.-+ ...|=+| ..||...+|||.++++..++++ T Consensus 295 --v~~~-------------------~~t~e~~~~~~----~~~~~ida~~a~G~g~lRPeaa~~i~~~~~~ 340 (364) T protein:vir:10 295 --GRTI-------------------SITGDIFYEKK----EKTWYIDTFLAEGAIPDRWEAVAVVTAADTA 340 (364) T ss_pred --EEEe-------------------cceeeeeeccc----eeeeeeeeehcccCcccCccceEEEEecCCC Confidence 3221 23344433222 1222232 3499999999999999999998 No 35 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=99.44 E-value=2.5e-15 Score=100.65 Aligned_cols=272 Identities=15% Similarity=0.144 Sum_probs=152.6 Q ss_pred cchhhcccCChHHHHhhcCCEEEEecCcccccc--c-CCccCCccCCCCcceEEEEeccccccceEecHHH--hccHHHH Q lcl|Aclame:pro 26 MAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--E-GWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAY 100 (430) Q Consensus 26 ma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~--~-g~~~s~~~~d~~e~sV~v~l~~~k~V~~~~t~ke--L~~~~~~ 100 (430) |-|.| +.|++++||.=-..+.. 
+ |...-.+.+++......++||+.+...|.+.+=| ...-++- T Consensus 1 ~vr~i-----------~~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr 69 (324) T protein:vir:99 1 MTRTI-----------TSGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVR 69 (324) T ss_pred Ceeee-----------ecCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccch Confidence 33333 46999999975333332 2 4433333456667778899999999998888744 3444566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccc--------eeeccCCCC---CCCC-C------c-hhHHHHHHHHHHHhCCC Q lcl|Aclame:pro 101 RHRIQSAARKLANNVELKVANMAAEMGSL--------VITSPDAIG---TNTA-D------A-WNFVADAEELMFSRELN 161 (430) Q Consensus 101 ~r~l~pAm~~LAn~Id~dl~~~~~~~as~--------~~~~~~~~~---~~~~-~------~-~~d~a~a~~~L~~~~aP 161 (430) ..+.+++.++||..+|.-++.++...+.. ..+..++.. .++. . . ++.+-+++..|+++.+| T Consensus 70 ~e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP 149 (324) T protein:vir:99 70 SEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIP 149 (324) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCC Confidence 68889999999999999887776543211 111000000 0011 0 1 44567899999999999 Q ss_pred cCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccccccceecccceeeeeEE Q lcl|Aclame:pro 162 RDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAW 241 (430) Q Consensus 162 ~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~ 241 (430) . ++|.+|++|+.+..|. +...+.+........+++|.|++ ++||. ++++.++|.. 
.++...-.-+ T Consensus 150 ~-~gR~~vv~P~~y~~Ll-~~~~~~~~~~~~~~~~~~G~V~~-i~Gf~-V~~Sn~lp~~-~~t~~~~a~~---------- 214 (324) T protein:vir:99 150 A-GDRTFYTDPDTYSAIL-AALMPNAANYAALIDPETGNIRN-VMGFE-VVETPHMTAQ-MVTNPTDAFD---------- 214 (324) T ss_pred C-CCCEEEeChHHHHHHh-hcccccccccccccceecceEEE-EeceE-EEecCCcccc-cccccccccc---------- Confidence 8 4699999999987664 43333333334556799999996 99997 6789888832 1111000000 Q ss_pred EEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEEeeccccccccccc Q lcl|Aclame:pro 242 QLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLS 321 (430) Q Consensus 242 t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~ 321 (430) ++ |=.++++| +. ..+ T Consensus 215 ---------------------~~-----~~~~~~~~----------------------~~-----~~~------------ 229 (324) T protein:vir:99 215 ---------------------GT-----GHIFPATG----------------------DS-----TTT------------ 229 (324) T ss_pred ---------------------cc-----cccccccc----------------------cc-----ccc------------ Confidence 00 01122222 00 000 Q ss_pred cccccccccccccccCceeEEecCCCceeeeeecccce-eEEEecccCCCCchhhceeEEEecCCCcEEEEEEeeccccc Q lcl|Aclame:pro 322 PEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAI-RIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDIST 400 (430) Q Consensus 322 ~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~-~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~ 400 (430) +.| + +..+.+.=|+||++|. +....+|.+ + .. . ..-++.| T Consensus 230 ---~ky----~-----------~d~~~~~gl~~~~~a~~tv~~~~~~~--e------~~--~--------~~~~~~d--- 270 (324) T protein:vir:99 230 ---GKM----T-----------VGADNVVGLFVHRSAVATLKLKDMAL--E------RA--R--------RPEYQAD--- 270 (324) T ss_pred ---ccc----c-----------cccCceeEEEEehhheEEEeeeccee--c------ce--e--------chhhHHH--- Confidence 000 0 0011234589999975 233333321 1 11 0 1112222 Q ss_pred ceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 401 LSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 401 ~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) .... 
=..||.+.+|||.++++-..-.+ T Consensus 271 -~i~~--~~a~G~~~lRPe~a~~v~l~~~~ 297 (324) T protein:vir:99 271 -QIIA--KYAMGHGGLRPEAVGAIIFEDGE 297 (324) T ss_pred -hhhh--hhhhcCcccccceEEEEEEccCc Confidence 1111 23599999999999876543332 No 36 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.40 E-value=3e-14 Score=94.72 Aligned_cols=258 Identities=14% Similarity=0.087 Sum_probs=149.8 Q ss_pred Cccc--hhhHH--HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccc----ccCCccCCccCCCCc Q lcl|Aclame:pro 1 MALN--EGQIV--TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT----QEGWDLTDKATGLLE 72 (430) Q Consensus 1 MAn~--~~~~~--~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~----~~g~~~s~~~~d~~e 72 (430) |||. ++.++ +++-+-+++.+++.++.++++..-..+ + .+.||||+||+...... .+|.+. ..+.+.. T Consensus 3 ~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l--~-g~~G~tv~iP~~~~ig~a~~~~~g~~i--~~~~lt~ 77 (275) T protein:vir:96 3 LENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTL--V-GQPGNTITFPAFVYSGDAKVVPEGEEI--PIDLIET 77 (275) T ss_pred CcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccc--c-CCCCCEEEeeeeccCCccccccCCCCc--chhhccc Confidence 6664 33332 444455666799999999998432222 2 24699999998543321 123333 3456667 Q ss_pred ceEEEEeccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHH Q lcl|Aclame:pro 73 LNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVAD 150 (430) Q Consensus 73 ~sV~v~l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~ 150 (430) +...+++ +++.--|.+++++. ...+...++.+++...||+++|.+|+..+... .+. . 
......++.|.+ T Consensus 78 ~~~~~~i-~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a-~~~------~-~~~~~~~d~i~d 148 (275) T protein:vir:96 78 KKRQATI-RKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGA-TLK------V-EADITKLAGLQT 148 (275) T ss_pred ceeeEEe-ehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc-ccc------c-cccccCHHHHHH Confidence 7788888 44677789998774 34577889999999999999999998766542 111 1 112345788999 Q ss_pred HHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccc-hhhhhhhhccccccchhhhHHHhCCCcceecccccccce Q lcl|Aclame:pro 151 AEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGIT 229 (430) Q Consensus 151 a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~-~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~t 229 (430) |+..|.+... ..|.++++|+.+..|.......+.... .....+++|.||+ +.||+- +.++++|. ++ ++. T Consensus 149 A~~~lgd~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~-~~G~~V-i~s~~~p~---~t--~~i 218 (275) T protein:vir:96 149 AIDKFNDEDL---EPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGE-ALGAII-VRSNKIKE---GE--AIL 218 (275) T ss_pred HHHHhccccC---CccEEEeCHHHHHHHHhcccccccccccccccceeccccce-ecCeeE-EEeCCCCc---ce--EEE Confidence 9999987653 468899999999888765433333222 2345689999997 999985 56777762 22 222 Q ss_pred ecccceeeeeEEEEee-ccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCc---e Q lcl|Aclame:pro 230 VSGAQSFKPVAWQLDN-DGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGT---H 305 (430) Q Consensus 230 V~ga~~~~~~~~t~~~-~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~---~ 305 (430) . +.+ +..... ...... ..|- -.+.-|.+..- ++|.+ .+..+. . 
T Consensus 219 ~-~~g-----A~~~~~~~~~~vE-~~Rd---------~~~~~d~i~~~----------------~~y~~-~~~~~~~vv~ 265 (275) T protein:vir:96 219 A-KRG-----AVKLITKRDFFLE-TERH---------ASHKSTALFSD----------------KHYVA-YLYDESKVVK 265 (275) T ss_pred E-ecc-----ceeeeecCCcccc-cccc---------hhhcCcEEEEe----------------EEEEE-EEEcCccEEE Confidence 2 111 111100 000000 0000 01222444333 23333 222322 2 Q ss_pred eEEeeccccc Q lcl|Aclame:pro 306 VEITPKPVAL 315 (430) Q Consensus 306 v~I~p~~v~~ 315 (430) ++..|+.+-. T Consensus 266 ~t~~~~~~~~ 275 (275) T protein:vir:96 266 ITKSASGLGV 275 (275) T ss_pred EEecccccCC Confidence 4455554321 No 37 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.39 E-value=6.9e-14 Score=92.75 Aligned_cols=257 Identities=16% Similarity=0.135 Sum_probs=149.1 Q ss_pred CccchhhHH-----HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccc----cccCCccCCccCCCC Q lcl|Aclame:pro 1 MALNEGQIV-----TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESP----TQEGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~~~~~~-----~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~----~~~g~~~s~~~~d~~ 71 (430) ||+..++.. +++-+.+++.|++.++.++++.. ++.-+ .+.|++|+||+-.... ..+|... ..+++. 
T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~--~~~~~-g~~G~tv~iP~~~~~~~a~~v~eg~~i--~~~~~~ 75 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEV--DTTLE-GQPGTTLTVPKWDYIGDAEDVAEGEAI--PMTQLG 75 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccc--ccccc-CCCCCEEEEEEecCCCCcccccCCCcc--cccccc Confidence 996544433 34445555678889998888843 33222 2479999999742211 1234333 345777 Q ss_pred cceEEEEeccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHH Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a 149 (430) ...+.+++.+.. .-|.+++++. ...+...++.++..+.+++++|.+++..+.. +.+. ..+...++++. T Consensus 76 ~~~~~~~~~~~~-~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~-a~~~--------~~~~~t~d~i~ 145 (272) T protein:vir:30 76 FKKTTMTIKKAG-KGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSK-STQT--------VEATATVDGVS 145 (272) T ss_pred cceEEEEeeeee-eeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcc-cccc--------cccccCHHHHH Confidence 778899997644 4589998774 3456778999999999999999999875432 2111 12234688999 Q ss_pred HHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhcc-chhhhhhhhccccccchhhhHHHhCCCcceecccccccc Q lcl|Aclame:pro 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFG-RIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~-~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~ 228 (430) +|...|.+.+.+ .+.++++|..+..|.......+... 
+.....+++|.+| T Consensus 146 da~~~l~~~~~~---~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig-------------------------- 196 (272) T protein:vir:30 146 KALDIFNDEDDA---ETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYG-------------------------- 196 (272) T ss_pred HHHHHHhccCCC---ccEEEEcHHHHHHHHHhccccccccccccccccccccch-------------------------- Confidence 999999988655 4779999998877643221111100 0001112222222 Q ss_pred eecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEE Q lcl|Aclame:pro 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I 308 (430) +|.|+ .|... T Consensus 197 ---------------------------------------------~i~G~-----------------~Vi~s-------- 206 (272) T protein:vir:30 197 ---------------------------------------------EVLGV-----------------QIVRS-------- 206 (272) T ss_pred ---------------------------------------------hhcCe-----------------eEEEc-------- Confidence 34441 11110 Q ss_pred eeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEecCCCcE Q lcl|Aclame:pro 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) Q Consensus 309 ~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Gl 388 (430) +.+ |. ...++|++.||.++.+.- + T Consensus 207 -~~~---------------------p~-------------~t~~~~~~~a~~~~~~~~---------------------~ 230 (272) T protein:vir:30 207 -RKC---------------------PK-------------GTAYMVRKGALRIMLKRN---------------------T 230 (272) T ss_pred -CCC---------------------Cc-------------ceEEEEcCCeEEEEecCC---------------------c Confidence 000 00 012667778877765431 2 Q ss_pred EEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 389 NGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 389 sirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) .++. +.|.+......+...-||++.++|+-. 
|.+.-..| T Consensus 231 ~ve~--~r~~~~~~~~i~~~~~~~~~v~~~~~v-v~~t~~~a 269 (272) T protein:vir:30 231 MVET--DRDITKAINQIVANKHYGVYLYKAEKA-VKITLKDA 269 (272) T ss_pred eeee--ccccccceeEEEEEEEEEEEEEcCCce-EEEEeccc Confidence 2221 223334445555666799999999964 55533333 No 38 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.39 E-value=6.9e-14 Score=92.75 Aligned_cols=257 Identities=16% Similarity=0.135 Sum_probs=149.1 Q ss_pred CccchhhHH-----HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccc----cccCCccCCccCCCC Q lcl|Aclame:pro 1 MALNEGQIV-----TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESP----TQEGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~~~~~~-----~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~----~~~g~~~s~~~~d~~ 71 (430) ||+..++.. +++-+.+++.|++.++.++++.. ++.-+ .+.|++|+||+-.... ..+|... ..+++. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~--~~~~~-g~~G~tv~iP~~~~~~~a~~v~eg~~i--~~~~~~ 75 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEV--DTTLE-GQPGTTLTVPKWDYIGDAEDVAEGEAI--PMTQLG 75 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccc--ccccc-CCCCCEEEEEEecCCCCcccccCCCcc--cccccc Confidence 996544433 34445555678889998888843 33222 2479999999742211 1234333 345777 Q ss_pred cceEEEEeccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHH Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a 149 (430) ...+.+++.+.. .-|.+++++. ...+...++.++..+.+++++|.+++..+.. +.+. ..+...++++. 
T Consensus 76 ~~~~~~~~~~~~-~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~-a~~~--------~~~~~t~d~i~ 145 (272) T protein:vir:98 76 FKKTTMTIKKAG-KGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSK-STQT--------VEATATVDGVS 145 (272) T ss_pred cceEEEEeeeee-eeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcc-cccc--------cccccCHHHHH Confidence 778899997644 4589998774 3456778999999999999999999875432 2111 12234688999 Q ss_pred HHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhcc-chhhhhhhhccccccchhhhHHHhCCCcceecccccccc Q lcl|Aclame:pro 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFG-RIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~-~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~ 228 (430) +|...|.+.+.+ .+.++++|..+..|.......+... +.....+++|.+| T Consensus 146 da~~~l~~~~~~---~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig-------------------------- 196 (272) T protein:vir:98 146 KALDIFNDEDDA---ETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYG-------------------------- 196 (272) T ss_pred HHHHHHhccCCC---ccEEEEcHHHHHHHHHhccccccccccccccccccccch-------------------------- Confidence 999999988655 4779999998877643221111100 0001112222222 Q ss_pred eecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEE Q lcl|Aclame:pro 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEI 308 (430) Q Consensus 229 tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I 308 (430) +|.|+ .|... 
T Consensus 197 ---------------------------------------------~i~G~-----------------~Vi~s-------- 206 (272) T protein:vir:98 197 ---------------------------------------------EVLGV-----------------QIVRS-------- 206 (272) T ss_pred ---------------------------------------------hhcCe-----------------eEEEc-------- Confidence 34441 11110 Q ss_pred eeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEecCCCcE Q lcl|Aclame:pro 309 TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGL 388 (430) Q Consensus 309 ~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Gl 388 (430) +.+ |. ...++|++.||.++.+.- + T Consensus 207 -~~~---------------------p~-------------~t~~~~~~~a~~~~~~~~---------------------~ 230 (272) T protein:vir:98 207 -RKC---------------------PK-------------GTAYMVRKGALRIMLKRN---------------------T 230 (272) T ss_pred -CCC---------------------Cc-------------ceEEEEcCCeEEEEecCC---------------------c Confidence 000 00 012667778877765431 2 Q ss_pred EEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 389 NGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 389 sirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) .++. +.|.+......+...-||++.++|+-. 
|.+.-..| T Consensus 231 ~ve~--~r~~~~~~~~i~~~~~~~~~v~~~~~v-v~~t~~~a 269 (272) T protein:vir:98 231 MVET--DRDITKAINQIVANKHYGVYLYKAEKA-VKITLKDA 269 (272) T ss_pred eeee--ccccccceeEEEEEEEEEEEEEcCCce-EEEEeccc Confidence 2221 223334445555666799999999964 55533333 No 39 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.36 E-value=1.1e-13 Score=91.73 Aligned_cols=263 Identities=15% Similarity=0.103 Sum_probs=152.2 Q ss_pred CccchhhHHHHHHHHHH-----HHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccc----cccCCccCCccCCCC Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEII-----ETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESP----TQEGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl-----~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~----~~~g~~~s~~~~d~~ 71 (430) |||..+++.+++..|++ +.+++.++.+.++.. +.+-+ .+.|+||+||.-.... ..+|.+. ..+.+. T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~--~~~l~-g~~G~ti~iP~~~~igda~~~~eg~~i--~~~~lt 75 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADI--DSTLV-GQPGDTLTFPAFVYSGDATVVPEGQKI--PVDKIE 75 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhccccee--ccccc-CCCCCEEEeeeecCCCccccccCCCcc--Cccccc Confidence 99987776666665554 448888888888843 33322 2579999999743321 2234333 344566 Q ss_pred cceEEEEeccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHH Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a 149 (430) -++..+++ +++.--|.+++++. ...+...+++++....||+++|.+|+..+......+ +.....++.|. 
T Consensus 76 ~~~~~a~i-~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~--------~~~~~t~d~i~ 146 (276) T protein:vir:10 76 TNRREAKI-HKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTV--------SADIGTLAGLE 146 (276) T ss_pred cceeeEEe-ehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--------cccccCHHHHH Confidence 67778888 44677788888774 345778899999999999999999997766522211 11223578899 Q ss_pred HHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccc-hhhhhhhhccccccchhhhHHHhCCCcceecccccccc Q lcl|Aclame:pro 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~-~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~ 228 (430) +|...|.++.. ..+.++++|..++.|.......+.... .....+++|+||+ +.||+ ++.++++|. + +.+ T Consensus 147 ~A~~~lgd~~~---~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~-~~G~~-Vi~s~~~p~---~--t~~ 216 (276) T protein:vir:10 147 AAIDTFDDEDL---EPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGE-ALGAV-IVRSKKLDE---G--EAI 216 (276) T ss_pred HHHHHhccccC---cccEEEEcHHHHHHHHHhccccccccccccccceeccccce-eccee-EEEcCCCCc---c--eEE Confidence 99999998765 357899999999988765433333222 2344589999997 89997 556777762 1 122 Q ss_pred eecccceeeeeEEEEee-ccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeE Q lcl|Aclame:pro 229 TVSGAQSFKPVAWQLDN-DGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVE 307 (430) Q Consensus 229 tV~ga~~~~~~~~t~~~-~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~ 307 (430) .. +.+ +...-. ....... .|- -.+.-|.++. +++|.|--....+-+. 
T Consensus 217 l~-~~g-----Ai~~~~~~~~~vE~-dRd---------~~~~~d~i~~----------------~~~y~~~~~~~~~vv~ 264 (276) T protein:vir:10 217 LA-KRG-----AVKLITKRDFFLET-DRD---------PSTKTTALYS----------------DKHYVAYLYDESKAVK 264 (276) T ss_pred EE-ecc-----ceeeeecCCceeec-ccc---------hhhcccEEEE----------------eeEEEEEEEcCcceEE Confidence 21 111 111000 0000000 000 0111233332 2345443222334555 Q ss_pred EeeccccccccccccccccccccccccccC Q lcl|Aclame:pro 308 ITPKPVALDDVSLSPEQRAYANVNTSLADA 337 (430) Q Consensus 308 I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~ 337 (430) |.++. .+ .|++. T Consensus 265 ~t~~~-------~~-----------~~~~~ 276 (276) T protein:vir:10 265 VTKGA-------GT-----------TDSGA 276 (276) T ss_pred EecCC-------cC-----------CcCCC Confidence 54442 11 11111 No 40 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.32 E-value=4.4e-14 Score=93.83 Aligned_cols=290 Identities=12% Similarity=0.056 Sum_probs=167.8 Q ss_pred Cccc----h------------hhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccccc-CCcc Q lcl|Aclame:pro 1 MALN----E------------GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE-GWDL 63 (430) Q Consensus 1 MAn~----~------------~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~-g~~~ 63 (430) |+|= + .--+++.--||+..|+...++..++.. |. .+.|+|++||.=...+.+. -... 
T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~-r~-----i~~G~s~~~~~iG~~~~~~~~~g~ 74 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNV-RS-----LRGTNQLRVDRVGASTIAGRKAGE 74 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhcccee-ee-----ccccceEEEeeecceeeeeecCCC Confidence 5432 1 112356666888889999999999864 32 4779999999754444332 1123 Q ss_pred CCccCCCCcceEEEEeccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeec-------c Q lcl|Aclame:pro 64 TDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITS-------P 134 (430) Q Consensus 64 s~~~~d~~e~sV~v~l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~-------~ 134 (430) +...+++....+.++||..+...|.+.+=| +..-++-..+-+++.++||...|..++..+++.+...--. + T Consensus 75 ~l~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~ 154 (334) T protein:vir:80 75 ELVVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHD 154 (334) T ss_pred CCCCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccC Confidence 344566777789999999999888887633 3333455567788889999999998877766544321100 0 Q ss_pred C----CCCCCCCC-c-------hhHHHHHHHHHHHhCCCcC--CCcEEEecHHHHHHHHHhhhhhhhc---cchhhhhhh Q lcl|Aclame:pro 135 D----AIGTNTAD-A-------WNFVADAEELMFSRELNRD--MGTSYFFNPQDYKKAGYDLTKRDIF---GRIPEEAYR 197 (430) Q Consensus 135 ~----~~~~~~~~-~-------~~d~a~a~~~L~~~~aP~~--~~R~~vl~p~~~a~~~~~~~~l~~~---~~~~~~a~r 197 (430) | ...+++.. . ..-+-.|++.|+++.+|.. .+|.+|++|+.+..|+..- .+.+. ......-|. 
T Consensus 155 G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~-r~~n~d~~~s~~~~~~~ 233 (334) T protein:vir:80 155 GILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHD-RLMNVEFGAKEGGNSFV 233 (334) T ss_pred CcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhccc-ccccceecccccccccc Confidence 0 00111111 1 1235589999999999942 4699999999998887642 22222 111223478 Q ss_pred hccccccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcc Q lcl|Aclame:pro 198 DGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTG 277 (430) Q Consensus 198 ~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaG 277 (430) +|.+++ ++||. ++++.++|.. ..+ .++ . | + T Consensus 234 ~g~i~~-v~G~~-V~~Sn~~P~~-~~t-----~~~-------------------------------~-----g------~ 263 (334) T protein:vir:80 234 GGRIAM-LNGVR-VVETPRFPQS-AIT-----ANA-------------------------------L-----G------A 263 (334) T ss_pred ceeEEE-EeceE-EEeecCCCCc-ccc-----ccc-------------------------------c-----c------c Confidence 888886 88986 5677777611 000 000 0 0 1 Q ss_pred eeeccccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeeccc Q lcl|Aclame:pro 278 VKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADD 357 (430) Q Consensus 278 V~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~ 357 (430) . | |+.+. 
..+.++=++||++ T Consensus 264 ~----------------~---------------------------------~~~ag-----------d~t~~~~~~~~~~ 283 (334) T protein:vir:80 264 D----------------F---------------------------------NVTDA-----------EVRRKMITFIPSM 283 (334) T ss_pred c----------------c---------------------------------ccccc-----------cccceEEEEEeCc Confidence 0 0 00110 1112345899999 Q ss_pred ceeEEEecccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 358 AIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 358 A~~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) |+.-+-.. .++.+.+|.-+-..+...+. .+||.+.+|||-++++---=+- T Consensus 284 Al~t~~~~---------------------~~~~e~~~~~~~~~d~i~~~--~a~G~g~lRPeaa~vv~~~~~~ 333 (334) T protein:vir:80 284 ALISAQVH---------------------PVSAQFWEEKKDFGHYLDTF--QSYNIGQRRPDAVAVHDITVTN 333 (334) T ss_pred eEEEEEEe---------------------ecceeeeechhhHHHHHHHH--HHcCCceeccceEEEEEEeeec Confidence 87644322 12222222221111111111 4699999999998877433333 No 41 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.23 E-value=1.9e-13 Score=90.35 Aligned_cols=223 Identities=17% Similarity=0.120 Sum_probs=129.7 Q ss_pred HHHhhcCCEEEEecCccccc---ccCCccCCccCCCCcceEEEEeccccccceEecHHHh-c-cHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 38 ASMQRSSNTIWMPVEQESPT---QEGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADDL-R-DETAYRHRIQSAARKLA 112 (430) Q Consensus 38 ~~~~k~GdTV~ip~P~~~~~---~~g~~~s~~~~d~~e~sV~v~l~~~k~V~~~~t~keL-~-~~~~~~r~l~pAm~~LA 112 (430) +.+-+.||||++|+ +.... .+|...+ ...+.-.+..++| ++..-.|++++++. . 
..++....-+|....|| T Consensus 1 ~~~~~~Gdtit~P~-~iGda~~v~eG~~i~--~~~l~~t~~~atI-k~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA 76 (231) T protein:vir:73 1 ENGINLANLCEYPN-DIGDAADVAEGGEIS--LDKIGTTTKSVTI-KKAAKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) T ss_pred CccccCCceEEecc-cccchhhhcCCCcCC--hhhccccceeeeE-eeeccceeeeHHHHhhccCchHHHHHHHHHHHHH Confidence 56678999999995 22121 2243332 3344555677777 44567899999883 2 45667788899999999 Q ss_pred HHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchh Q lcl|Aclame:pro 113 NNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIP 192 (430) Q Consensus 113 n~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~ 192 (430) ++||.||+..+.. +.+.+ .+...+..+.+|...|.+... ..+.++++|.+...|...........+.. T Consensus 77 ~kvD~di~~~~~~-a~l~~--------~~~~t~d~i~~A~~~fgde~~---~~~vivv~p~~~~~Lrk~~~~~~~~~~~g 144 (231) T protein:vir:73 77 NKVDDDLLKAAKT-TSQTV--------STKANVDGVQAALDIFNDEDA---QAYVLIVNPKDAAKIRKDANAKNIGSEVG 144 (231) T ss_pred HhhhHHHHHhhcc-ccccc--------cccccHHHHHHHHHHhccccc---cceEEEEcchHHHhhhhccchhhhhhhhc Confidence 9999999876554 22221 123457789999999998764 35789999999988876443322223345 Q ss_pred hhhhhhccccccchhhhHHHhCCCcceecccccccce---ecccceeeeeEEEEeeccccccccceeeEEEeeccceeec Q lcl|Aclame:pro 193 EEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGIT---VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKR 269 (430) Q Consensus 193 ~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~t---V~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~ 269 (430) ...+|+|.||+ +.|++ ++.|.+++. |++.-.. ..||-....... -.+. 
+.|- -.++ T Consensus 145 ~~i~~~G~iG~-i~G~~-Vi~S~~~~~---~~~~~~~~i~~~gAl~~~~k~~------~~vE-tdRd---------~~~k 203 (231) T protein:vir:73 145 ANALINGTYAD-VLGAQ-IVRSKKLAE---GSALMFKIVSNSPALKLVLKRG------VQVE-TDRD---------IVTK 203 (231) T ss_pred cceeeecccce-EcceE-EEEcCCCCC---CceeeeeEEeeccceeeeeccc------ceee-cccc---------cccc Confidence 67889999997 99997 677887762 3222111 123322111100 0000 0000 0122 Q ss_pred ccEEEEcceeeccccccccccccceEEEEEeecCceeEEeeccc Q lcl|Aclame:pro 270 GDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPV 313 (430) Q Consensus 270 GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v 313 (430) -|.++.. ++|.|--.....-+.|+-+.+ T Consensus 204 ~~~i~~~----------------~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 204 TTVITAD----------------EHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred ccEEEEe----------------EEEEEEEEcCccEEEEEeecC Confidence 2333222 233332222233455555555 No 42 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.10 E-value=3.7e-12 Score=83.25 Aligned_cols=290 Identities=13% Similarity=0.107 Sum_probs=166.7 Q ss_pred Cccc--------------hhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccC-CccCC Q lcl|Aclame:pro 1 MALN--------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEG-WDLTD 65 (430) Q Consensus 1 MAn~--------------~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g-~~~s~ 65 (430) |.|= ..--+++.--||+..|+...++..++.. | -.|.|+++++|.=-....+.- ...+. 
T Consensus 1 ms~~~~~t~~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~-r-----ti~~g~s~~~~~iG~~~~~~~~pG~~l 74 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNI-R-----DLRGSNVVRLDRLGNVEAKGRRAGEEL 74 (335) T ss_pred CCccccccccccccccchhhhhhhhhhhHHHHHHHHhhhhccccce-e-----eeccceeEEEeeeeeeeecccccCccc Confidence 4431 2223466667889999999999999864 2 247899999998655444321 12233 Q ss_pred ccCCCCcceEEEEeccccccceEecH-HH-hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeec--cCCC--C- Q lcl|Aclame:pro 66 KATGLLELNVAVNMGEPDNDFFQLRA-DD-LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITS--PDAI--G- 138 (430) Q Consensus 66 ~~~d~~e~sV~v~l~~~k~V~~~~t~-ke-L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~--~~~~--~- 138 (430) ..+.+......++||.-....+.+.+ +| +..-+.-..+-++..++||...|+-++..+++.+...--+ ++.. | T Consensus 75 ~~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~ 154 (335) T protein:vir:78 75 ERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV 154 (335) T ss_pred CCCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCc Confidence 45566677889999998766666554 22 3332333457788889999999999887776644322111 0000 0 Q ss_pred ------CCC-C--C---chhHHHHHHHHHHHhCCCcC--CCcEEEecHHHHHHHHHhhhhhhhcc---chhhhhhhhccc Q lcl|Aclame:pro 139 ------TNT-A--D---AWNFVADAEELMFSRELNRD--MGTSYFFNPQDYKKAGYDLTKRDIFG---RIPEEAYRDGTI 201 (430) Q Consensus 139 ------~~~-~--~---~~~d~a~a~~~L~~~~aP~~--~~R~~vl~p~~~a~~~~~~~~l~~~~---~~~~~a~r~g~i 201 (430) ++. . . ...-+.++...|+++-+|.. .+|.++++|+-+..|+.. ..+.+.. 
.....-+++|.+ T Consensus 155 ~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~-~~l~n~~~~~s~~~~~~~~g~v 233 (335) T protein:vir:78 155 LEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEH-DKLMSVEYQATGATNDYVKSRV 233 (335) T ss_pred ceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcc-ccccccccccccccccccccee Confidence 000 0 1 12346778889999999963 369999999999888754 2222322 112345888999 Q ss_pred cccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeec Q lcl|Aclame:pro 202 QRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFL 281 (430) Q Consensus 202 gr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v 281 (430) ++ ++||. ++++.++|.. ++++. .+ |..+ T Consensus 234 ~~-v~Gv~-V~~Sn~lP~~-~~t~~---------------~l--------------------------g~a~-------- 261 (335) T protein:vir:78 234 AI-LNGVK-VLETPRFATK-AISAH---------------PL--------------------------GRHF-------- 261 (335) T ss_pred EE-eeceE-EEeeccCCCC-CCccc---------------cc--------------------------cccC-------- Confidence 86 88996 7788877721 00000 00 0000 Q ss_pred cccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeE Q lcl|Aclame:pro 282 GQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRI 361 (430) Q Consensus 282 ~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~L 361 (430) |+++. 
..+.+.=++||++|+.- T Consensus 262 -----------------------------------------------n~~~~-----------d~~~~~~~~~~~~Al~t 283 (335) T protein:vir:78 262 -----------------------------------------------NVSAE-----------EAERQIALFLPSKTLIT 283 (335) T ss_pred -----------------------------------------------Ccccc-----------cccceEEEEEecceEEE Confidence 00000 00112348899998766 Q ss_pred EEecccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEE-cCCCCC Q lcl|Aclame:pro 362 VSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVG-LPGQTA 430 (430) Q Consensus 362 atrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~-l~~q~~ 430 (430) +-.- .+..++++.-+-..+...++ ..||...+|||.++++ +-|=-| T Consensus 284 ~~~~---------------------~~~~e~~~~~~~~~~~i~~~--~a~G~g~lRPe~a~~i~~tg~~~ 330 (335) T protein:vir:78 284 AQVA---------------------PVQAKLWEDHDQFSWVLDTF--QMYNIGARRPDTAGAIELKGIEA 330 (335) T ss_pred EEEE---------------------ecccceeeccchhhHhhhHH--HHcCCcccCcceEEEEEecCCCc Confidence 5221 12223322222111122222 3499999999998765 334333 No 43 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.04 E-value=1.8e-11 Score=79.47 Aligned_cols=290 Identities=11% Similarity=0.056 Sum_probs=162.9 Q ss_pred Cccc--------------hhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccC-CccCC Q lcl|Aclame:pro 1 MALN--------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEG-WDLTD 65 (430) Q Consensus 1 MAn~--------------~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g-~~~s~ 65 (430) |.|= ..--+++.--||+..|+...++..++.. |. -+.|+++++|.--..+.+.- ...+. 
T Consensus 1 ms~~~~~tr~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~-rt-----i~~g~s~~~~~iG~~~~~~~~pG~~l 74 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNI-RD-----LRGSNVVRLDRLGNVEAKGRRAGEEL 74 (335) T ss_pred CCCcccchhhhcccccchhheehhhhhhhHHHHHHhhhhhccccce-ee-----eccceeEEEeeeeeeeeecccCCcCc Confidence 4431 1112356666888889999999988854 22 47799999998655554421 11233 Q ss_pred ccCCCCcceEEEEeccccccceEecH-HH-hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCC----- Q lcl|Aclame:pro 66 KATGLLELNVAVNMGEPDNDFFQLRA-DD-LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIG----- 138 (430) Q Consensus 66 ~~~d~~e~sV~v~l~~~k~V~~~~t~-ke-L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~----- 138 (430) ..+.+......++||.-....+.+.+ +| +..-+.-..+-+..-++||...|..++..+++.+...--.....+ T Consensus 75 ~~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~ 154 (335) T protein:vir:63 75 ERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV 154 (335) T ss_pred CCCCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCc Confidence 44455666888999988766655554 22 222233335777888999999999998777775543221110000 Q ss_pred -------CCC-CCch----hHHHHHHHHHHHhCCCcC--CCcEEEecHHHHHHHHHhhhhhhhcc---chhhhhhhhccc Q lcl|Aclame:pro 139 -------TNT-ADAW----NFVADAEELMFSRELNRD--MGTSYFFNPQDYKKAGYDLTKRDIFG---RIPEEAYRDGTI 201 (430) Q Consensus 139 -------~~~-~~~~----~d~a~a~~~L~~~~aP~~--~~R~~vl~p~~~a~~~~~~~~l~~~~---~~~~~a~r~g~i 201 (430) ..+ .++. .-+-++...|+++.+|.. ++|.++++|+-+..|+.. ..+.+.. 
......+.+|.+ T Consensus 155 ~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~-~~l~n~~~~~s~~~~~~~~g~v 233 (335) T protein:vir:63 155 LEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEH-DKLMNVEYQATGATNDYVKSRV 233 (335) T ss_pred ceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcc-ccccccccccccccccccCcee Confidence 000 1111 235688899999999952 359999999999888754 2233322 112345788999 Q ss_pred cccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeec Q lcl|Aclame:pro 202 QRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFL 281 (430) Q Consensus 202 gr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v 281 (430) ++ ++||. ++++.++|.. .+++. . + |+.+ .+ T Consensus 234 ~~-v~Gv~-V~~sn~lP~~-~~t~~----------------------~-----------------l--g~a~--n~---- 263 (335) T protein:vir:63 234 AI-LNGVK-VLETPRFATK-AIAAH----------------------P-----------------L--GRHF--NV---- 263 (335) T ss_pred EE-eeceE-EEeeccCCCC-Ccccc----------------------c-----------------c--cccC--Cc---- Confidence 86 88996 7788777721 00000 0 0 1100 00 Q ss_pred cccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeE Q lcl|Aclame:pro 282 GQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRI 361 (430) Q Consensus 282 ~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~L 361 (430) ++. 
..+.+.=++||++|..- T Consensus 264 -------------------------------------------------~~~-----------d~~~~~~~~~~~~Al~t 283 (335) T protein:vir:63 264 -------------------------------------------------SAE-----------ESERQIALFLPSKTLIT 283 (335) T ss_pred -------------------------------------------------ccc-----------ccceeEEEEEecceEEE Confidence 000 00112348999997654 Q ss_pred EEecccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcC-CCCC Q lcl|Aclame:pro 362 VSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLP-GQTA 430 (430) Q Consensus 362 atrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~-~q~~ 430 (430) +-..- ++.++++.-+-..+...++ ..||...+|||.++++=- |=-| T Consensus 284 ~~~~~---------------------vt~e~~~~~~~~~~~i~~~--~a~G~g~lRPe~a~~i~~tg~~~ 330 (335) T protein:vir:63 284 AQVAP---------------------VQAKLWEDNEKFSWVLDTF--QMYNIGARRPDTAGAIELKGIGA 330 (335) T ss_pred EEEee---------------------cccceeeccchhhHHhHHH--HHcCCcccccceEEEEEEcCCCc Confidence 43321 1122222211111121222 349999999999876532 2222 No 44 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=99.00 E-value=7.8e-12 Score=81.47 Aligned_cols=291 Identities=13% Similarity=0.048 Sum_probs=162.5 Q ss_pred Cccc--------------hhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc--c-CCcc Q lcl|Aclame:pro 1 MALN--------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--E-GWDL 63 (430) Q Consensus 1 MAn~--------------~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~--~-g~~~ 63 (430) |++- ..--++.+.-||+..|+...++..++.. |. .+.|+++++|.=-..... + |. 
T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~v-rt-----i~~GkS~qf~~iG~~~a~y~~~G~-- 72 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDV-QT-----VTGTNTVSNKYLGETELQVLAPGQ-- 72 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee-ee-----ecccceEEEEEEeeeEEeeecccc-- Confidence 4321 1123456667788889888888888854 22 568999999985333332 2 32 Q ss_pred CCccCCCCcceEEEEeccccccceEecH-HH-hccHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-e---eeccCC Q lcl|Aclame:pro 64 TDKATGLLELNVAVNMGEPDNDFFQLRA-DD-LRDET-AYRHRIQSAARKLANNVELKVANMAAEMGSL-V---ITSPDA 136 (430) Q Consensus 64 s~~~~d~~e~sV~v~l~~~k~V~~~~t~-ke-L~~~~-~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~-~---~~~~~~ 136 (430) ....+++.-..+.++||.-....+.+.+ +| +..-+ .-..+-+..-++||...|+-++..++..+.. . .+.+.. T Consensus 73 ~ldg~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~ 152 (402) T protein:vir:97 73 SPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) T ss_pred ccCCCCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcc Confidence 2344566666788999887765544443 22 22112 1124447778899999999988777653321 1 111111 Q ss_pred CC------C-CCC----Cc----hhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccc--hhhhhhhhc Q lcl|Aclame:pro 137 IG------T-NTA----DA----WNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR--IPEEAYRDG 199 (430) Q Consensus 137 ~~------~-~~~----~~----~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~--~~~~a~r~g 199 (430) .+ + ... .+ ..-|-++...|++..+|.+ +|.++++|+-+..|+.. 
..+.+..- .....+++| T Consensus 153 ~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~-dRv~vv~P~~y~~Ll~~-~rl~n~d~~~~~~g~~~~G 230 (402) T protein:vir:97 153 KGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDIS-DVAIMMPWKFFNALRDA-DRIVDKTYTISQSGATING 230 (402) T ss_pred cccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCcc-ccEEEeChHHHHHHhhc-ccccchhhccccCCccccc Confidence 11 1 100 11 1335688899999999995 59999999988887653 22222221 234558899 Q ss_pred cccccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEccee Q lcl|Aclame:pro 200 TIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVK 279 (430) Q Consensus 200 ~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~ 279 (430) .++. +.||. ++++.++|.. ++ .+.+ =-++-+| T Consensus 231 ~v~~-v~Gv~-Vv~SnnlP~~--a~----~it~--------------------------------------~~ls~a~-- 262 (402) T protein:vir:97 231 FVLS-SYNCP-VIPSNRFPTF--AQ----DQAH--------------------------------------HLLSNED-- 262 (402) T ss_pred eeEE-EeceE-EEecCccccc--cc----cccc--------------------------------------cccccCC-- Confidence 9985 89997 6788888831 00 0000 0000011 Q ss_pred eccccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccce Q lcl|Aclame:pro 280 FLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAI 359 (430) Q Consensus 280 ~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~ 359 (430) ....|-++++ -+.+.=++|||+|+ T Consensus 263 -----------~G~~y~~t~d---------------------------------------------~t~~~~~~f~~~Av 286 (402) T protein:vir:97 263 -----------NGYRYDPIAE---------------------------------------------MNGAVAVLFTSDAL 286 (402) T ss_pred -----------CCccCCcCcc---------------------------------------------cceeEEEEEecceE Confidence 0112222221 11123488999855 Q ss_pred eEEEecccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEEE--eecCceeeCcceeEEEcCCC--C----C Q lcl|Aclame:pro 360 RIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIA--LWYGVNATRPEAIGVGLPGQ--T----A 430 (430) 
Q Consensus 360 ~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rld--vlyG~~~v~Pe~agv~l~~q--~----~ 430 (430) .- .-+. .++-++|++-+ ...|=|| .+||...+|||-+||+...- | . T Consensus 287 ~t--vk~~-------------------~vT~~~~~d~r----~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~t~~~~~ 340 (402) T protein:vir:97 287 LV--GRTI-------------------EVTGDIFYEKK----EKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAG 340 (402) T ss_pred EE--EEee-------------------ccccchhhchh----HHHHHHHHHHHhCCcccCccceEEEEEecccccccCC Confidence 43 2221 13334333322 1222233 46999999999999995544 2 2 No 45 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=98.85 E-value=1.2e-09 Score=69.56 Aligned_cols=259 Identities=11% Similarity=0.035 Sum_probs=135.8 Q ss_pred Cccch--------hhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccccc-CCccCCccCCCC Q lcl|Aclame:pro 1 MALNE--------GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE-GWDLTDKATGLL 71 (430) Q Consensus 1 MAn~~--------~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~-g~~~s~~~~d~~ 71 (430) .||.. .+.-.-++|+++. .+.+.-+.+++ ++++ ..-|++|+||.-......| .+....+.+++. T Consensus 30 ~~~~~~~~nt~~l~~k~~~~LD~~~~--~~~~s~~~~~N--~~~e---~~~g~tVkIp~i~~~gl~DY~R~~g~~~g~vt 102 (329) T protein:vir:10 30 FANKSVEPGDTLLKNKHVGILEKVTA--ANSYSAPAVIS--NDAI---FMQGRSFTVIKGDVTELKDYKRNATNEFDHPQ 102 (329) T ss_pred hcCCccCCchhHHHHHHHHHHHHHHH--hhceeeeeecc--ccee---eccCcEEEEeeecccccccccCCCCccccccc Confidence 33331 1112222333321 12223333443 3333 3469999999864433332 223334556777 Q ss_pred cceEEEEeccccccceEecHHHh---ccHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhH Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADDL---RDETAYRHRI-QSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNF 147 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~keL---~~~~~~~r~l-~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d 147 (430) .....++|++.|...|.+.+-|. +..-....++ +.+-..++-+||......++..+..... 
.+.....-|+. T Consensus 103 ~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~~~~----~~~t~~nay~~ 178 (329) T protein:vir:10 103 IQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAKHLT----VGSGADAQYDA 178 (329) T ss_pred cceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccc----cccCHHHHHHH Confidence 88999999999999999886442 2211122332 3345678889998888777665443221 11112234888 Q ss_pred HHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccc-hhhhhhhhccccccchhhhHHHhCCCcceecccccc Q lcl|Aclame:pro 148 VADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTAT 226 (430) Q Consensus 148 ~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~-~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~ 226 (430) +..+...|+++++|. +|.+++.|+.+.-|..+ .++.... ...+-+++|.+| T Consensus 179 i~~a~~~Lde~~vp~--~Rvl~VtP~~~~~Lk~~--~~f~~~~~~~~~~~~~g~Vg------------------------ 230 (329) T protein:vir:10 179 VLDVSVELDEIGAGA--SRILFVTPKFYKGIKKF--VIELPQGDNRQQVLGKGVQG------------------------ 230 (329) T ss_pred HHHHHHHHHhcCCCC--CcEEEeCHHHHHHHHhh--hhhhccccccccceeeeeee------------------------ Confidence 999999999999993 69999999988655321 1111110 011112222222 Q ss_pred cceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCcee Q lcl|Aclame:pro 227 GITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHV 306 (430) Q Consensus 227 ~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v 306 (430) +|.|+ .|..+ T Consensus 231 -----------------------------------------------~idG~-----------------~Ii~v------ 240 (329) T protein:vir:10 231 -----------------------------------------------ELDGF-----------------TIVKV------ 240 (329) T ss_pred -----------------------------------------------eecCe-----------------EEEEe------ Confidence 23331 01110 Q ss_pred EEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecc--c--CCCCchhhceeEEEe Q lcl|Aclame:pro 307 EITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPI--P--ANHELFAGMKTTSFS 
382 (430) Q Consensus 307 ~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl--~--~p~~~~~~~~~~~~~ 382 (430) |... . ..|.+ ++.|++|+..+..=- + .|..+. T Consensus 241 ---ps~~----------~-------------k~in~---------ii~~~~A~~~~~K~~~~~~~~p~~~~--------- 276 (329) T protein:vir:10 241 ---PSKM----------L-------------QGVEA---------MAVIGEVMASPIQANEAKLNSNVPGM--------- 276 (329) T ss_pred ---cCCc----------c-------------cceeE---------EEEcCCceeeeeeeeeeeeeCCCCcc--------- Confidence 0000 0 01111 677888877665522 1 111100 Q ss_pred cCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 383 IPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 383 ~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) .|- .|+-=..||+++++|+..+|.....++ T Consensus 277 ---~a~---------------~v~gr~yyd~~V~~~k~~~I~~~~~~a 306 (329) T protein:vir:10 277 ---FGT---------------LAEQMLYTGAFVPEHLQKYIFTIGGKE 306 (329) T ss_pred ---chh---------------eeeeeeeeeeEEEccccCEEEEecccC Confidence 010 111223499999999999988877766 No 46 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=98.80 E-value=6.9e-10 Score=70.81 Aligned_cols=257 Identities=17% Similarity=0.207 Sum_probs=143.2 Q ss_pred Ccc-chhhHH--HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcc----cccccCCccCCccCCCCcc Q lcl|Aclame:pro 1 MAL-NEGQIV--TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQE----SPTQEGWDLTDKATGLLEL 73 (430) Q Consensus 1 MAn-~~~~~~--~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~----~~~~~g~~~s~~~~d~~e~ 73 (430) ||- .++.++ +++-+-|.+++++.+++++++. .++.-+ .+.|+||+||.=.. -...+|.+.+. 
..+.-+ T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~--~d~~L~-g~~G~ti~~P~~~~igdae~~~eg~~i~~--~~lt~~ 75 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAV--TDDTLV-GQPGDTITRPKYAYIGAAEDLQEGVAMDT--TQMSMT 75 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccc--cccccC-CCCCCEEEeeeecCCCccccccCCCccch--hhcccc Confidence 996 476654 5666777777888888888883 233222 35799999997211 11123333322 233344 Q ss_pred eEEEEeccccccceEecHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHH Q lcl|Aclame:pro 74 NVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADA 151 (430) Q Consensus 74 sV~v~l~~~k~V~~~~t~keL--~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a 151 (430) ...++| ++..-.|.++++.. ...+....+.++....||+++|.+|+..+.. +.+.. .......+|.+| T Consensus 76 ~~~a~i-~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~-a~~~~--------~~~~t~~~~~dA 145 (270) T protein:vir:95 76 TTKVTV-KETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNK-SKQTA--------TVSADATGILDA 145 (270) T ss_pred hheeee-ehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhcc-ccccc--------ccccCHHHHHHH Confidence 555666 44456788888773 3356778889999999999999999866553 22111 123456789999 Q ss_pred HHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceeccccccccee- Q lcl|Aclame:pro 152 EELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITV- 230 (430) Q Consensus 152 ~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV- 230 (430) ..+|.+..-. ...++++|..++.|..+. 
++...+.....+++|.||+ +.|+.-++.+..++ ...++.+ T Consensus 146 ~~~lgd~~~~---~~~i~vhs~~~~~Lrk~~--~~~~~~~~~~~~~~G~ig~-~~G~~Viv~s~~~~-----~~~~~l~~ 214 (270) T protein:vir:95 146 IEVFNSENDE---DYVLYVNPKDYNKLVKSL--FKVGGNVQDRAISKGDLVE-IVGVSDIVKSKRVS-----ENTAFLQR 214 (270) T ss_pred HHHhccccCC---CcEEEEcHHHHHHHHhhh--cccccccccchhcccccce-ecceeEEEeCCCCC-----ceeEEEEe Confidence 9999776533 356999999999887654 2333334556789999997 89996555444322 1223332 Q ss_pred cccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeec-Cc--eeE Q lcl|Aclame:pro 231 SGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD-GT--HVE 307 (430) Q Consensus 231 ~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~-~~--~v~ 307 (430) .||-..... .+.... ..|- -.+.=|.+... ++|+|- ... .. .++ T Consensus 215 ~gAi~~~~~------~~~~vE-tdRd---------~~~~~d~i~~~----------------~~y~v~-~~~~skvv~~t 261 (270) T protein:vir:95 215 YGAMEIVNK------KKPEAY-TDFD---------ILKRTHLLSTN----------------YHYSVN-LKDETGVVKVT 261 (270) T ss_pred ccceeeeec------CCceee-eccc---------hhhcccEEEee----------------eEEEEE-EEccceEEEEE Confidence 122110000 000000 0010 01112333332 334432 322 22 234 Q ss_pred Eeecccccccccc Q lcl|Aclame:pro 308 ITPKPVALDDVSL 320 (430) Q Consensus 308 I~p~~v~~~~~~~ 320 (430) +.|++-. .. 
T Consensus 262 ~~~a~~~----~~ 270 (270) T protein:vir:95 262 FKPSGSL----EM 270 (270) T ss_pred ecCCCCc----CC Confidence 4444422 11 No 47 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.79 E-value=1.1e-09 Score=69.76 Aligned_cols=277 Identities=14% Similarity=0.089 Sum_probs=126.7 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccch--hhcccCCh--HHHHhhcCCEEEEecCccccccc-CCc--cCCc-----c- Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQ--KAKKYTPP--AASMQRSSNTIWMPVEQESPTQE-GWD--LTDK-----A- 67 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vma~--lV~~y~~~--~~~~~k~GdTV~ip~P~~~~~~~-g~~--~s~~-----~- 67 (430) |+.++.+. ++..|.++..|+. .=++.++. .+...++++++..+....+.... +.. .+.. . T Consensus 13 Ms~~i~~~-------fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~dtp~ 85 (322) T protein:vir:10 13 IAGDIDQA-------FVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQSADGTYPTPV 85 (322) T ss_pred eechhhhH-------HHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccccccccCcccCCCc Confidence 66664442 2223333333320 00223333 22245667777777654443321 100 0100 0 Q ss_pred CCCCcceEEEEeccccccceEecHHHh-c-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCC-------- Q lcl|Aclame:pro 68 TGLLELNVAVNMGEPDNDFFQLRADDL-R-DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAI-------- 137 (430) Q Consensus 68 ~d~~e~sV~v~l~~~k~V~~~~t~keL-~-~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~-------- 137 (430) .+.-...+.+.+ ..+.+.+.+.+.|+ + ..+.-.-+.+.+..+|+.+.|.-++..+...+. .+.+++. 
T Consensus 86 ~~~~~~~r~~~~-~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~--~~~~gt~v~~~ss~~ 162 (322) T protein:vir:10 86 NNKPFAKRRTNV-DTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPAS--IKGTGQPVEFLATQE 162 (322) T ss_pred cccccceEEEee-cccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccc--ccccccccccCCCcc Confidence 112234555555 44467788777663 2 223335677889999999999877765554332 1111111 Q ss_pred -CC-CCCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhh-hhccccccchhhhHHHhC Q lcl|Aclame:pro 138 -GT-NTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAY-RDGTIQRQVAGFDDVLRS 214 (430) Q Consensus 138 -~~-~~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~-r~g~igr~~~Gfd~~~~~ 214 (430) +. +....+..+..|++.|+++.||.+.+|.+|++|+.+..|+. ...+...+....+++ ++|.+|+ ++||.|+ .+ T Consensus 163 i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~-d~~~ts~D~~~~~~l~~~G~ig~-~lGf~~i-~s 239 (322) T protein:vir:10 163 IGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQ-ITEATSADYTSAMDLQSKGIITN-WMGYTWI-VS 239 (322) T ss_pred cccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhc-chhhhhhhcccchhhhhcCeeee-eeeEEEE-Ee Confidence 11 11233778999999999999998777999999999988874 222333333345555 7899997 9999985 56 Q ss_pred CCcceeccccc---ccceecccceeeeeEEEEeeccccccccceeeEEEeeccceee----cccEEEEcceeeccccccc Q lcl|Aclame:pro 215 PKLPVLTKSTA---TGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLK----RGDKISFTGVKFLGQMAKN 287 (430) Q Consensus 215 ~~~~~~~~gt~---~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk----~GDv~TiaGV~~v~~~tk~ 287 (430) .++|.. .++. 
+...+-|.....--++ ..+ +.+..+ ..++=-..+-.+-.+++ T Consensus 240 ~~lp~~-~~t~~~~~~~~~~~~~~~~~~a~----~k~--------------Av~~a~~~dv~~~i~~~~~~~~a~~I~-- 298 (322) T protein:vir:10 240 TRLDKF-DPTQWGMAAEDGPQGDEIWCIAM----TDM--------------ALGYHSCKDIWTKVAEDPSASFAWRIY-- 298 (322) T ss_pred ccCCcc-ccccccccccCCCCccceeEEEE----ecC--------------ceeEEEeeeeeEEeeccCCcchhhhhh-- Confidence 777622 1111 0111111111110111 000 000000 01111111111000110 Q ss_pred cccccceEEEEEeecCceeEEeecccccccccc Q lcl|Aclame:pro 288 VLAQDATFSVVRVVDGTHVEITPKPVALDDVSL 320 (430) Q Consensus 288 ~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~ 320 (430) ....|-.+..-..+=|.|- ...++ T Consensus 299 ---~~~~~Ga~ri~~~gVv~i~------~~e~~ 322 (322) T protein:vir:10 299 ---SAFTADCVRVEDEHIFKLR------LKNSL 322 (322) T ss_pred ---hhhhhCceEeccCcEEEEE------EeccC Confidence 0000001111011222221 11111 No 48 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.74 E-value=4.6e-10 Score=71.77 Aligned_cols=291 Identities=14% Similarity=0.065 Sum_probs=162.8 Q ss_pred Cccc--------------hhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc--c-CCcc Q lcl|Aclame:pro 1 MALN--------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--E-GWDL 63 (430) Q Consensus 1 MAn~--------------~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~--~-g~~~ 63 (430) |++- ..-.++.+.-||+..|+...++..++.. | -.+.|+++++|.--..+.. + |. 
T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~v-R-----ti~~gkS~qf~~~G~s~~~~~~pG~-- 72 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDV-Q-----TVTGTNTVSNKYLGETELQVLAPGQ-- 72 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhccccee-e-----eecccceEEEEEeeeeEeeeecCCC-- Confidence 4431 1223455556777788888888877753 2 2678999999985333332 2 32 Q ss_pred CCccCCCCcceEEEEeccccccceEecH-HH-hccHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhccc----ceeeccCC Q lcl|Aclame:pro 64 TDKATGLLELNVAVNMGEPDNDFFQLRA-DD-LRDETA-YRHRIQSAARKLANNVELKVANMAAEMGS----LVITSPDA 136 (430) Q Consensus 64 s~~~~d~~e~sV~v~l~~~k~V~~~~t~-ke-L~~~~~-~~r~l~pAm~~LAn~Id~dl~~~~~~~as----~~~~~~~~ 136 (430) +...+++.-..+.++||.-....+.+.+ +| ++.-+. -..+-+..-++||...|+-|+..++..+. .....|.. T Consensus 73 ~ld~~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~ 152 (401) T protein:vir:70 73 SPAATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRV 152 (401) T ss_pred CcCCCCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCc Confidence 2344566666788999998877666654 22 222121 12455667788999999988777755221 11111111 Q ss_pred CCCC-----------CC-C---chhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch--hhhhhhhc Q lcl|Aclame:pro 137 IGTN-----------TA-D---AWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI--PEEAYRDG 199 (430) Q Consensus 137 ~~~~-----------~~-~---~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~--~~~a~r~g 199 (430) .+.+ .. . -...|-+++..|+++.+|. + |.+++.|..+..++.+...+.+..-. 
....+.+| T Consensus 153 ~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~-~-r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G 230 (401) T protein:vir:70 153 KGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDI-S-DVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQG 230 (401) T ss_pred CCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCc-c-ceEEEcCHHHHHHHHhcCcccchhhccccCCccccc Confidence 1100 00 1 1234678999999999995 4 56777666555554444444443322 23568899 Q ss_pred cccccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEccee Q lcl|Aclame:pro 200 TIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVK 279 (430) Q Consensus 200 ~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~ 279 (430) .++. ++||. ++++.++|.... .+. |.-++-+| T Consensus 231 ~v~~-vaGv~-Vv~SnnlP~~a~------~it--------------------------------------~~~ls~a~-- 262 (401) T protein:vir:70 231 FTLS-SYNCP-VIPSNRFPKYSQ------GQT--------------------------------------HHLLSNED-- 262 (401) T ss_pred eEEE-EeceE-EEeecccccccc------ccc--------------------------------------cccccccC-- Confidence 9985 89996 678888873100 000 11112222 Q ss_pred eccccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccce Q lcl|Aclame:pro 280 FLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAI 359 (430) Q Consensus 280 ~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~ 359 (430) ....|-++++ -+.+.=|.|||+|. 
T Consensus 263 -----------~G~~y~~~~d---------------------------------------------~s~~~~v~f~~~Av 286 (401) T protein:vir:70 263 -----------NGYRYDPLPA---------------------------------------------MNGAIAVLFTADAL 286 (401) T ss_pred -----------CCccCCCCcc---------------------------------------------ccceeEEEEehhhe Confidence 1112222222 11123488999965 Q ss_pred eEEEecccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEEE--eecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 360 RIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIA--LWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 360 ~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rld--vlyG~~~v~Pe~agv~l~~q~~ 430 (430) .-+ .-+ .|+.++|++-+ ...|=|| .+||...+|||-++|+...-+. T Consensus 287 ~tv-k~~--------------------~lt~~~~~d~r----~~~~~id~~~a~g~g~~RPeaa~vv~~k~~~ 334 (401) T protein:vir:70 287 LVG-RSI--------------------DVTGDIFYEKK----EKTYYIDTFMAEGAIPDRWEAVSVVTTKRNT 334 (401) T ss_pred EEE-Eee--------------------ccccchhhhhh----hhHHHHHHHHHhCCcccchhheEEEeecCcc Confidence 431 111 23334433322 2334454 4699999999999999877775 No 49 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=98.71 E-value=2.3e-09 Score=67.95 Aligned_cols=289 Identities=11% Similarity=-0.020 Sum_probs=134.4 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHH-hhcCCEEEEecCccccccc-CCcc-CCccCCCCcceEEE Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASM-QRSSNTIWMPVEQESPTQE-GWDL-TDKATGLLELNVAV 77 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~-~k~GdTV~ip~P~~~~~~~-g~~~-s~~~~d~~e~sV~v 77 (430) ||+ ++ ..++.-..+++.|+..++.+.|.. ++++.+- +.-|++|+||.=......+ .+.. 
.-...++.-...++ T Consensus 1 MA~-~n-~a~~~~~~Ld~~~~~~l~~~~L~~--~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~~t~ 76 (299) T protein:vir:79 1 MAA-LN-YAKEYSNVLAQAYPYTLNFGDLYA--TPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAWEPK 76 (299) T ss_pred Ccc-ch-hHHHHHHHHHHHHHhhceeeeecc--CcccceeeecCCCEEEEeccccccccccccCCCcccccccCcceeEE Confidence 993 22 246666777778889999888873 3433221 2337999999743332222 1111 22333555678899 Q ss_pred EeccccccceEecHHHh---ccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCC-CCchhHHHHHH Q lcl|Aclame:pro 78 NMGEPDNDFFQLRADDL---RDETAYRHRIQS-AARKLANNVELKVANMAAEMGSLVITSPDAIGTNT-ADAWNFVADAE 152 (430) Q Consensus 78 ~l~~~k~V~~~~t~keL---~~~~~~~r~l~p-Am~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~-~~~~~d~a~a~ 152 (430) +|++.|...|.+..-|. +.......++.. +-..++-+||......++..+......... .+.+ ..-++.+-.+. T Consensus 77 ~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~~~~~-~~~T~~n~y~~i~~~~ 155 (299) T protein:vir:79 77 VLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGNTADT-TVLTTTNVLEVFDKLM 155 (299) T ss_pred EeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCCcccc-cccCHHHHHHHHHHHH Confidence 99999999999984322 111112233322 224567788887766655544322221111 1112 22377889999 Q ss_pred HHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch-hhhhhhhccccccchhhhHHHhCCCcceecccccccceec Q lcl|Aclame:pro 153 ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVS 231 (430) Q Consensus 153 ~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~-~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ 231 (430) ..|++.++|. ++|.++++|+.+.-|.. .......... ...-.++|.+|+ +.||. +++.+.- +.. 
+.-.+ .+ T Consensus 156 ~~lde~~vP~-~~rvl~vtp~~~~~L~~-~~~f~k~~~~~~~~~~~~g~Vg~-idG~~-Ii~Vps~-r~~--t~~~~-~~ 227 (299) T protein:vir:79 156 EKMTEARVPE-NGRILYVTPVVNTLIKN-AKEIQRTVNIKDAGTSLNRQTTD-IDTVK-IIKVPSN-LMK--TAYDF-TT 227 (299) T ss_pred HHHHhcCCCC-CCeEEEeCHHHHHHHhh-chhhhcccccccccceeeeeeee-ecceE-EEEechh-hcC--cccee-cc Confidence 9999999998 57999999998875542 2111111111 122467999996 89996 6553221 111 00000 11 Q ss_pred ccceeeeeEEEEeeccccccccceeeEEEeecc-ceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEEee Q lcl|Aclame:pro 232 GAQSFKPVAWQLDNDGNKVNVDNRFATVTLSAT-TGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITP 310 (430) Q Consensus 232 ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~t-gtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p 310 (430) |... +.. +-.++ ...+.-++. .-.|--.+-.| .|-+-+.--.+.+++.--+.- |.. T Consensus 228 G~~~-~~~-------ak~in----~ii~~~~a~~~~~K~~~~~~~------~P~~~~~~~~~~~~r~y~d~~-----v~~ 284 (299) T protein:vir:79 228 GWKV-GAG-------AKQIF----MSLVHPSAIITPVSYQFSKLD------EPTAVTEGKYFYFEESFEDVF-----ILN 284 (299) T ss_pred Cccc-cCc-------ccccc----eEEEcCCeeeeeEeeeeEEee------cCCCCCccceeeeeeeeeeee-----eec Confidence 1100 000 00000 000000010 11122111112 122222211122222222210 000 Q ss_pred ccccccccccccccccccccccccc Q lcl|Aclame:pro 311 KPVALDDVSLSPEQRAYANVNTSLA 335 (430) Q Consensus 311 ~~v~~~~~~~~~~~~~~~nVsa~pA 335 (430) .- ..+-|.|+.++-+ T Consensus 285 nk----------~~~i~~~~~~a~~ 299 (299) T protein:vir:79 285 KK----------ADAIQFVVEGAGA 299 (299) T ss_pred cc----------cCeEEEEeeecCC Confidence 00 0011222222211 No 50 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=98.57 E-value=3.1e-08 Score=61.73 Aligned_cols=286 Identities=12% Similarity=0.046 Sum_probs=135.4 Q ss_pred Cccc--hhh--HHHHHHHHHHHHHHhhcccch-h-hcccCChHHHHhhcCCEEEEecCccccccc-CCccCCccCCCCcc Q lcl|Aclame:pro 1 MALN--EGQ--IVTLAVDEIIETISAITPMAQ-K-AKKYTPPAASMQRSSNTIWMPVEQESPTQE-GWDLTDKATGLLEL 
73 (430) Q Consensus 1 MAn~--~~~--~~~~~~~~vl~~l~~~~Vma~-l-V~~y~~~~~~~~k~GdTV~ip~P~~~~~~~-g~~~s~~~~d~~e~ 73 (430) .||+ +++ .+.-.+...|+.+....+.+. + ++ ++++- .-|++|+||.=......| .+....+.+++.-. T Consensus 19 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N--~~~e~---~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~ 93 (319) T protein:vir:94 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALIS--NDAIF---MEGRSFTVMKGDTTELKDYKRNATNEFDHPKIE 93 (319) T ss_pred hhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccC--cceEe---ccCcEEEEeeecccccccccCCCCcccCCcccc Confidence 5555 111 333344455555555555443 2 22 22222 369999999743322222 22334455677788 Q ss_pred eEEEEeccccccceEecHHHh---ccHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHH Q lcl|Aclame:pro 74 NVAVNMGEPDNDFFQLRADDL---RDETAYRHRI-QSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 74 sV~v~l~~~k~V~~~~t~keL---~~~~~~~r~l-~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a 149 (430) ..+++|++.|...|.+.+-|. +..-....++ +.+-..++-+||......++..+.-... .+.....-|..+. T Consensus 94 ~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~----~~~t~~n~y~~i~ 169 (319) T protein:vir:94 94 ETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLT----VGTGSDAQYDAVL 169 (319) T ss_pred eeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccccc----cccCHHHHHHHHH Confidence 999999999999999887442 2211122233 3344567778998877776665433221 1111223488899 Q ss_pred HHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccccccce Q lcl|Aclame:pro 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGIT 229 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~t 229 (430) .+...|++.++| . +|.++++|+.+.-|..+ ..........++.+++|.+|+ +.||. +++..+.. +..-.+. 
T Consensus 170 ~a~~~Lde~~VP-~-~Rvl~Vtp~~~~~L~~~-~~f~~~~~~~~~~~~~g~Vg~-idG~~-Vi~vps~~----~k~in~i 240 (319) T protein:vir:94 170 DVSVELDEIKAP-E-NRVLFVSPTFYKGIKKF-VIALPQGDTRQQVLGKGVQGE-LDGFV-IVKVPTKL----LQGLQAI 240 (319) T ss_pred HHHHHHHhcCCC-C-CcEEEeCHHHHHHHHhh-hhhhccccccccceeeeecee-ecCeE-EEEecccc----cccceEE Confidence 999999999999 3 69999999988766433 222223333445678999996 89996 55543221 1111111 Q ss_pred ecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEc----ceeeccccccccccccceEEEEEeecCce Q lcl|Aclame:pro 230 VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFT----GVKFLGQMAKNVLAQDATFSVVRVVDGTH 305 (430) Q Consensus 230 V~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~Tia----GV~~v~~~tk~~~~~l~~fvVt~~~~~~~ 305 (430) -|....-.....++....-.+..++-.- .--.++-.|.|.+. |+|- T Consensus 241 -~~h~~A~~~~~k~~~~~~~~p~~~~~a~----~v~gr~y~d~~V~~~k~~~Iy~------------------------- 290 (319) T protein:vir:94 241 -AVVGEVLASPIQADLAKTNSNIPGMFGT----LAEQLLYTGAFVPEHLQKYIFT------------------------- 290 (319) T ss_pred -EEcCCeeeeeeeeeeeeccCCCccccce----eeeeeeeeeeEEeccccceEEE------------------------- Confidence 1111100011111111000000000000 00123333333332 1111 Q ss_pred eEEeeccccccccccccccccccccccccccCcee Q lcl|Aclame:pro 306 VEITPKPVALDDVSLSPEQRAYANVNTSLADAMAV 340 (430) Q Consensus 306 v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aav 340 (430) +.++ ..++...++.+++-=.|.|-..--. 
T Consensus 291 --~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (319) T protein:vir:94 291 --IGGT----EVATKRDGVDAHADNVAKPSGSLEM 319 (319) T ss_pred --eecC----CcccCCCccccccccccCCcccccC Confidence 1111 0011111112222222222222111 No 51 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=98.57 E-value=3.1e-08 Score=61.73 Aligned_cols=286 Identities=12% Similarity=0.046 Sum_probs=135.4 Q ss_pred Cccc--hhh--HHHHHHHHHHHHHHhhcccch-h-hcccCChHHHHhhcCCEEEEecCccccccc-CCccCCccCCCCcc Q lcl|Aclame:pro 1 MALN--EGQ--IVTLAVDEIIETISAITPMAQ-K-AKKYTPPAASMQRSSNTIWMPVEQESPTQE-GWDLTDKATGLLEL 73 (430) Q Consensus 1 MAn~--~~~--~~~~~~~~vl~~l~~~~Vma~-l-V~~y~~~~~~~~k~GdTV~ip~P~~~~~~~-g~~~s~~~~d~~e~ 73 (430) .||+ +++ .+.-.+...|+.+....+.+. + ++ ++++- .-|++|+||.=......| .+....+.+++.-. T Consensus 19 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N--~~~e~---~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~ 93 (319) T protein:vir:97 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALIS--NDAIF---MEGRSFTVMKGDTTELKDYKRNATNEFDHPKIE 93 (319) T ss_pred hhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccC--cceEe---ccCcEEEEeeecccccccccCCCCcccCCcccc Confidence 5555 111 333344455555555555443 2 22 22222 369999999743322222 22334455677788 Q ss_pred eEEEEeccccccceEecHHHh---ccHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHH Q lcl|Aclame:pro 74 NVAVNMGEPDNDFFQLRADDL---RDETAYRHRI-QSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVA 149 (430) Q Consensus 74 sV~v~l~~~k~V~~~~t~keL---~~~~~~~r~l-~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a 149 (430) ..+++|++.|...|.+.+-|. +..-....++ +.+-..++-+||......++..+.-... .+.....-|..+. 
T Consensus 94 ~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~----~~~t~~n~y~~i~ 169 (319) T protein:vir:97 94 ETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLT----VGTGSDAQYDAVL 169 (319) T ss_pred eeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccccc----cccCHHHHHHHHH Confidence 999999999999999887442 2211122233 3344567778998877776665433221 1111223488899 Q ss_pred HHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccccccce Q lcl|Aclame:pro 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGIT 229 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~t 229 (430) .+...|++.++| . +|.++++|+.+.-|..+ ..........++.+++|.+|+ +.||. +++..+.. +..-.+. T Consensus 170 ~a~~~Lde~~VP-~-~Rvl~Vtp~~~~~L~~~-~~f~~~~~~~~~~~~~g~Vg~-idG~~-Vi~vps~~----~k~in~i 240 (319) T protein:vir:97 170 DVSVELDEIKAP-E-NRVLFVSPTFYKGIKKF-VIALPQGDTRQQVLGKGVQGE-LDGFV-IVKVPTKL----LQGLQAI 240 (319) T ss_pred HHHHHHHhcCCC-C-CcEEEeCHHHHHHHHhh-hhhhccccccccceeeeecee-ecCeE-EEEecccc----cccceEE Confidence 999999999999 3 69999999988766433 222223333445678999996 89996 55543221 1111111 Q ss_pred ecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEc----ceeeccccccccccccceEEEEEeecCce Q lcl|Aclame:pro 230 VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFT----GVKFLGQMAKNVLAQDATFSVVRVVDGTH 305 (430) Q Consensus 230 V~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~Tia----GV~~v~~~tk~~~~~l~~fvVt~~~~~~~ 305 (430) -|....-.....++....-.+..++-.- .--.++-.|.|.+. 
|+|- T Consensus 241 -~~h~~A~~~~~k~~~~~~~~p~~~~~a~----~v~gr~y~d~~V~~~k~~~Iy~------------------------- 290 (319) T protein:vir:97 241 -AVVGEVLASPIQADLAKTNSNIPGMFGT----LAEQLLYTGAFVPEHLQKYIFT------------------------- 290 (319) T ss_pred -EEcCCeeeeeeeeeeeeccCCCccccce----eeeeeeeeeeEEeccccceEEE------------------------- Confidence 1111100011111111000000000000 00123333333332 1111 Q ss_pred eEEeeccccccccccccccccccccccccccCcee Q lcl|Aclame:pro 306 VEITPKPVALDDVSLSPEQRAYANVNTSLADAMAV 340 (430) Q Consensus 306 v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aav 340 (430) +.++ ..++...++.+++-=.|.|-..--. T Consensus 291 --~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (319) T protein:vir:97 291 --IGGT----EVATKRDGVDAHADNVAKPSGSLEM 319 (319) T ss_pred --eecC----CcccCCCccccccccccCCcccccC Confidence 1111 0011111112222222222222111 No 52 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=98.55 E-value=2.1e-09 Score=68.19 Aligned_cols=203 Identities=15% Similarity=0.090 Sum_probs=107.0 Q ss_pred eccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCCC------CCC----CCch- Q lcl|Aclame:pro 79 MGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIG------TNT----ADAW- 145 (430) Q Consensus 79 l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~------~~~----~~~~- 145 (430) ||.-..-+|.+.+-| +..-+.-..+-+++.++||.++|.-++..+++.+......++-.+ +++ ...| T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 555555555555433 333445567778899999999999999888875543221110000 011 1124 Q ss_pred hHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhh-hhhhhccch-hhhhhhhc-cccccchhhhHHHhCCCcceecc Q lcl|Aclame:pro 146 
NFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDL-TKRDIFGRI-PEEAYRDG-TIQRQVAGFDDVLRSPKLPVLTK 222 (430) Q Consensus 146 ~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~-~~l~~~~~~-~~~a~r~g-~igr~~~Gfd~~~~~~~~~~~~~ 222 (430) +-|-++++.|+++.+|. ++|.++++|..+..|+... ..+.+.... +..-+|+| .+++ +.||+ ++++.++|.. . T Consensus 81 dai~~a~~~LdekdVP~-~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~-v~G~~-V~~SnnlP~~-~ 156 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPM-DGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYV-NAGIR-IYKSNVLASL-Y 156 (221) T ss_pred HHHHHHHHHHhhcCCCC-CCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeee-ecCcE-EEEeccCCcc-c Confidence 45788999999999998 5788999998888887532 222222222 22237888 4885 89997 7889888832 1 Q ss_pred cccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeec Q lcl|Aclame:pro 223 STATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD 302 (430) Q Consensus 223 gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~ 302 (430) |+ .+ +. .+| .|.+.+ +..+| |.+ T Consensus 157 gt--~~-~~------------------------------------~ag-~~~~~~----~~~~~--------yr~----- 179 (221) T protein:vir:17 157 GT--NL-VT------------------------------------DPG-DATTSG----ENNGS--------YRP----- 179 (221) T ss_pred cc--cc-cc------------------------------------CCc-cccccc----ccccc--------ccc----- Confidence 11 00 00 011 111111 00000 000 Q ss_pred CceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEE--EecccCCCCchhhceeEE Q lcl|Aclame:pro 303 GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIV--SQPIPANHELFAGMKTTS 380 (430) Q Consensus 303 ~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~La--trpl~~p~~~~~~~~~~~ 380 (430) ..+.+.=|+|||+|..-+ +.|+..|+= . 
.++ T Consensus 180 ------------------------------------------~fs~~~glv~~~~Avgtvkl~~~~~~~~~----~-~~~ 212 (221) T protein:vir:17 180 ------------------------------------------AITDRAGLVFHKEAADTVEVLLPPSRPPL----V-ISM 212 (221) T ss_pred ------------------------------------------cccceEEEEEcchheeeeeeecCCCCCce----e-eee Confidence 111134599999997654 333333211 1 111 Q ss_pred EecCCCcEEEEEEeecccccc Q lcl|Aclame:pro 381 FSIPDVGLNGIFATQGDISTL 401 (430) Q Consensus 381 ~~~~~~Glsirv~~~yd~~~~ 401 (430) +||| ++... T Consensus 213 -------~~~~-----~~~~~ 221 (221) T protein:vir:17 213 -------FSIR-----RPDRR 221 (221) T ss_pred -------eecc-----CCCCC Confidence 2222 11111 No 53 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.54 E-value=3.8e-09 Score=66.74 Aligned_cols=291 Identities=15% Similarity=0.083 Sum_probs=160.2 Q ss_pred Cccc--------------hhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc--c-CCcc Q lcl|Aclame:pro 1 MALN--------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--E-GWDL 63 (430) Q Consensus 1 MAn~--------------~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~--~-g~~~ 63 (430) |++- ..-.++.+.-||+..|+...++..++.. | -.+.|+|+++|.--....+ + |. 
T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~v-R-----tI~~gkS~qf~~lG~s~a~y~~pG~-- 72 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDV-Q-----TVTGTNTVSNKYLGETELQVLAPGQ-- 72 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhccccee-e-----eecccceEEEEEeeeeEEeeecCCC-- Confidence 4331 1123455556777788888888877753 2 2678999999985333322 2 32 Q ss_pred CCccCCCCcceEEEEeccccccceEecH-HH-hccHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcc----cceeeccCC Q lcl|Aclame:pro 64 TDKATGLLELNVAVNMGEPDNDFFQLRA-DD-LRDET-AYRHRIQSAARKLANNVELKVANMAAEMG----SLVITSPDA 136 (430) Q Consensus 64 s~~~~d~~e~sV~v~l~~~k~V~~~~t~-ke-L~~~~-~~~r~l~pAm~~LAn~Id~dl~~~~~~~a----s~~~~~~~~ 136 (430) +...+++.-..+.++||.-....+.+-+ +| ++.=+ --..+-+..-++||...|+-++..++..+ ....+-++. T Consensus 73 ~ldg~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g 152 (400) T protein:vir:10 73 SPAATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRV 152 (400) T ss_pred CcCCCCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCc Confidence 2344556666788899888765555543 22 22111 11244456667899999998887665532 111111111 Q ss_pred CCC--------CCCC---c----hhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch--hhhhhhhc Q lcl|Aclame:pro 137 IGT--------NTAD---A----WNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI--PEEAYRDG 199 (430) Q Consensus 137 ~~~--------~~~~---~----~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~--~~~a~r~g 199 (430) ... .... + ...|-+|...|+++.+|. + |.+++.|..+..++...-.|.+.... 
....+.+| T Consensus 153 ~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~-~-d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g 230 (400) T protein:vir:10 153 KGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDI-S-DVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQG 230 (400) T ss_pred cccccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCc-c-ceEEEcCHHHHHHHHhCCcccchhccccCCCccccc Confidence 100 0111 1 123567888899999996 4 55777666666554443334433322 22457788 Q ss_pred cccccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEccee Q lcl|Aclame:pro 200 TIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVK 279 (430) Q Consensus 200 ~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~ 279 (430) .+.. ++||. ++++.++|.. ++. -.+..++-+| T Consensus 231 ~v~~-v~Gv~-Iv~Sn~lP~~------------a~~--------------------------------~~~~~lS~a~-- 262 (400) T protein:vir:10 231 FVLS-SYNCP-VIPSNRFPKY------------SQG--------------------------------QKHHLLSNED-- 262 (400) T ss_pred eEEE-EeceE-EEeeCcCCcc------------cCc--------------------------------ccccccccCC-- Confidence 8874 88885 6788877721 100 0012222222 Q ss_pred eccccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccce Q lcl|Aclame:pro 280 FLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAI 359 (430) Q Consensus 280 ~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~ 359 (430) ....|-++++. +.+.=|.|||+|. 
T Consensus 263 -----------~G~~y~~t~d~---------------------------------------------s~~~av~F~~sAv 286 (400) T protein:vir:10 263 -----------NGYRYDPIAEM---------------------------------------------NGAIAVLFTADAL 286 (400) T ss_pred -----------CCccCCccccc---------------------------------------------cceeEEEEehhhe Confidence 11222232221 1123488999965 Q ss_pred eEEEecccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEEE--eecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 360 RIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIA--LWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 360 ~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rld--vlyG~~~v~Pe~agv~l~~q~~ 430 (430) .-+ .-+ .|+.++||+=+ ...|=|| .+||...+|||-++|+..+-++ T Consensus 287 ~tv-k~~--------------------~lt~~~~~d~r----~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~ 334 (400) T protein:vir:10 287 LVG-RSI--------------------DVIGDIFYEKK----EKTYYIDTFMSEGAIPDRWEAVSVVTTKRQS 334 (400) T ss_pred EEE-Eee--------------------ccccccccchh----hHHHHHHHHHHhCCcccchhheEEEEecCCc Confidence 431 111 23444443322 2334454 4699999999999999999888 No 54 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=97.90 E-value=4.7e-06 Score=49.78 Aligned_cols=274 Identities=10% Similarity=0.017 Sum_probs=125.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcc--cccccCCccCCccCCCCcceEEEE Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQE--SPTQEGWDLTDKATGLLELNVAVN 78 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~--~~~~~g~~~s~~~~d~~e~sV~v~ 78 (430) ||.++.++-. ..+.+.|...++.+.+... +++ ..-|++|+||.=.. 
.+.++ +..+....++.-..-+.+ T Consensus 1 Main~a~~~~---~~Ld~~~~~~~~t~~l~~~--~~~---~~ggktVkI~~i~~~gl~DY~-R~~g~~~g~v~~~~et~t 71 (290) T protein:vir:78 1 MAINYVDKYG---KELDQKLVFGTYTNELETP--NLL---WLDAKTFKIQTITTTGLKAHT-RNKGYNEGSASNTNKSYT 71 (290) T ss_pred CchhHHHHHH---HHHHHHHHhhheeeecccc--cee---eccCCEEEEeeeccCcccccc-cCCCcccCccccceeeEE Confidence 9998765444 4455556688888887743 222 23599999997332 22222 222223345555677899 Q ss_pred eccccccceEecH---HHhccHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCC-CCchhHHHHHHH Q lcl|Aclame:pro 79 MGEPDNDFFQLRA---DDLRDETAYRHRI-QSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNT-ADAWNFVADAEE 153 (430) Q Consensus 79 l~~~k~V~~~~t~---keL~~~~~~~r~l-~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~-~~~~~d~a~a~~ 153 (430) |++.|...|.+.. +|-+..-....+. +.+-..++-+||......++..+......... +.+ ..-+.-+-.+.. T Consensus 72 l~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~~~~~~--t~t~~n~~~~i~~~~~ 149 (290) T protein:vir:78 72 IDFDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNSNSVAE--EITKDNVFTKLKAAIR 149 (290) T ss_pred eeccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccCccccc--ccCHHHHHHHHHHHHH Confidence 9999999999973 3321111111221 33445677888887666555544322111111 112 223666777777 Q ss_pred HHHHhCCCcCCCcEEEecHHHHHHHHH--hhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccccccceec Q lcl|Aclame:pro 154 LMFSRELNRDMGTSYFFNPQDYKKAGY--DLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVS 231 (430) Q Consensus 154 ~L~~~~aP~~~~R~~vl~p~~~a~~~~--~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ 231 (430) .|++ +|. ++|.++++|+.+.-|.. .+......+. ..+-..+|.+|+ +.||. +++.....++. +.... 
.+ T Consensus 150 ~lde--vp~-~~rvl~vtp~~~~lL~~~~~f~r~~~~~~-~~~~~i~~~V~~-idG~~-ii~vps~~r~~--t~~~f-~~ 220 (290) T protein:vir:78 150 KVKK--YGT-QNLVMYVSPDVMAALELSDDFVRAINVQN-IGPSSIETRITA-IDGTR-IVEVEAEDRFY--DTFDF-TD 220 (290) T ss_pred HHHh--cCC-CCeEEEECHHHHHHHhhChhhhccccccc-cccccccceeee-ecCcE-EEEecccchhh--hhhhh-cc Confidence 8876 887 58999999998875432 2222111111 122334888886 88985 55543221211 00000 00 Q ss_pred ccceeeeeEEEEeeccccccccceeeEEEeecc-ceeecccEEEEcceeecccccccc-ccccceEEEEEeec-----Cc Q lcl|Aclame:pro 232 GAQSFKPVAWQLDNDGNKVNVDNRFATVTLSAT-TGLKRGDKISFTGVKFLGQMAKNV-LAQDATFSVVRVVD-----GT 304 (430) Q Consensus 232 ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~t-gtlk~GDv~TiaGV~~v~~~tk~~-~~~l~~fvVt~~~~-----~~ 304 (430) |... . ..+-.+| ...+.-++. .-.|---+-.|+ |-+-+. .+++-+++.--+.- .. T Consensus 221 G~~~---~-----~~ak~in----~ii~~~~a~i~~~K~~~~~~~~------P~~~~~~d~~~~~~r~y~d~~v~~nk~~ 282 (290) T protein:vir:78 221 GYKP---A-----AGAKKLN----FLLVNKGSVVGGAKHASIYLHA------PGSVGQGDGWLYQYRVYHDIFVLDQQKD 282 (290) T ss_pred cccc---c-----CCcccee----EEEEcCCceeeeeeeeEEEeeC------CCCCcCcceeeeeeeeeeeeeeeccccC Confidence 1000 0 0000000 011111111 112221111221 222111 22333433333321 00 Q ss_pred eeEEeeccccccccccccccccccccccccccCcee Q lcl|Aclame:pro 305 HVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAV 340 (430) Q Consensus 305 ~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aav 340 (430) .|-++-+ | T Consensus 283 ~i~~~~~----------------------------~ 290 (290) T protein:vir:78 283 GVIASTE----------------------------V 290 (290) T ss_pred eeEEEee----------------------------C Confidence 1111000 0 No 55 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=97.38 E-value=8.1e-05 Score=43.01 Aligned_cols=297 Identities=8% Similarity=-0.008 Sum_probs=119.3 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCc--ccccccCCccC--CccCCCCcceEE Q lcl|Aclame:pro 1 
MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQ--ESPTQEGWDLT--DKATGLLELNVA 76 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~--~~~~~~g~~~s--~~~~d~~e~sV~ 76 (430) |||++. ..++....+-+.+...++-+-+-. -+...+ +.-|++|+||.=. -.+.++ |+.. -+..++....-+ T Consensus 1 Mantl~-ya~~~~~~LD~~~~~~~~s~~l~~--~~~~v~-~~ggktVkIp~i~~~gl~DY~-R~~g~~~~~g~v~~~~et 75 (312) T protein:vir:10 1 MANTLA-YGQVLQQGLDKQATQELLTGWMDS--NAKQIK-YEGGKEVKIGKLSTDGLGDYS-RGSANAYVGGDVKFEYET 75 (312) T ss_pred CCcchh-HHHHHHHHHHHHHHhhhccccccC--CCceEE-EecCcEEEEEeeecccccccc-cccCCcccccccccccee Confidence 998874 233333333333445554443320 011111 2448999999732 222222 1111 122355666789 Q ss_pred EEeccccccceEecHHHh---ccHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcccceeeccCCCCC--CCCC-chhHHH Q lcl|Aclame:pro 77 VNMGEPDNDFFQLRADDL---RDETAYRHRIQ-SAARKLANNVELKVANMAAEMGSLVITSPDAIGT--NTAD-AWNFVA 149 (430) Q Consensus 77 v~l~~~k~V~~~~t~keL---~~~~~~~r~l~-pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~--~~~~-~~~d~a 149 (430) .+|++-+...|.+..-|. +..-....+.. .+-...+=+||+.-...++..+....+....... .+.. -+..+- T Consensus 76 ~tl~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~T~~ni~~~i~ 155 (312) T protein:vir:10 76 KTMTQDRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDTNVEYSYSVNSSTIINKIK 155 (312) T ss_pred EEeeecccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccccccccccccCHHHHHHHHH Confidence 999999999998883222 11111222222 2333455678877555555444333222222111 1222 367788 Q ss_pred HHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch-hhhhhhhccccccchhhhHHHhCCCcceecccccccc Q lcl|Aclame:pro 150 DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 150 ~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~-~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~ 228 (430) .+...|++.++|. +|.+++.|+.+. ++.+. ..+..... ..+-..++.++. +-|+. 
+++.+.- +.- +.+ T Consensus 156 ~~~~~lde~~vp~--~rvl~vTp~~~~-lLk~~-~~~~~~~~~~~~~~i~~~V~~-iDgv~-Ii~VPs~-r~~----t~~ 224 (312) T protein:vir:10 156 TGIKIIRENGYNG--PLVCHLTYDSMF-AIEEK-VLEKLTAVTFAQGGIQTQVPS-IDGCA-LIKTPQN-RMY----SSI 224 (312) T ss_pred HHHHHHHHccCCC--ceEEEeChHHHH-HHhhh-hhceecccccccceeeeeeee-ecccE-EEEchhh-hcc----cee Confidence 9999999999993 799999999774 44432 22221111 112223455543 55553 3333221 110 111 Q ss_pred e-ecccceeee-eEEEEeeccccccccceeeEEEeecc-ceeecccEEEEcceeeccccccccccccceEEEEEeecCce Q lcl|Aclame:pro 229 T-VSGAQSFKP-VAWQLDNDGNKVNVDNRFATVTLSAT-TGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTH 305 (430) Q Consensus 229 t-V~ga~~~~~-~~~t~~~~~~~~~~d~~~~~~~~s~t-gtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~ 305 (430) . ..|....-+ ..+..+..+-.+| ...+.-++. .-.|.-.+-.|+= + ++....+++-+++.--|.- T Consensus 225 ~f~dG~t~~~~~gg~~~~~~ak~IN----fiiv~~~a~i~~~K~~~~~if~P--~---~~~~~d~~~~~~R~Y~D~f--- 292 (312) T protein:vir:10 225 LLNDGTTSNQTAGGYLKGTKALDTN----FIIAPVDVPLAITKQDKMRIFDP--E---TNQTANAWSMDYRRYHDLW--- 292 (312) T ss_pred eeccCcccccccCceeecCcccccc----eEEeCCceeeceeeeeeeeeeCC--C---CCCCcceeeeeeeeeeeee--- Confidence 1 011100000 0000000110000 111111221 1233322222221 0 1111223444444333311 Q ss_pred eEEeeccccccccccccccccccccccccccC Q lcl|Aclame:pro 306 VEITPKPVALDDVSLSPEQRAYANVNTSLADA 337 (430) Q Consensus 306 v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~ 337 (430) | +.. 
..-+-|.|+..+-.-+ T Consensus 293 --v-----~~n-----k~~~Iyv~~k~a~~~~ 312 (312) T protein:vir:10 293 --V-----TDN-----KANSVYANFKDAKPVG 312 (312) T ss_pred --e-----ecc-----ccCeEEEEeecccCCC Confidence 0 000 0001122222111111 No 56 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=96.80 E-value=0.00016 Score=41.35 Aligned_cols=269 Identities=13% Similarity=0.047 Sum_probs=118.4 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCc---ccccccCCccCCccCCCCcceEEE Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQ---ESPTQEGWDLTDKATGLLELNVAV 77 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~---~~~~~~g~~~s~~~~d~~e~sV~v 77 (430) ||+++.+.-.-.++ +.|...++.+.+++.--+.... ..=|.+|+||+=+ ..+.+ .+....+..++.-..-+. T Consensus 1 Main~~~k~~~~ld---~~~~~~~~~~~l~~~~n~~~~~-~~gak~VkIp~ist~~gl~dY-~R~~g~~~g~v~~~~et~ 75 (285) T protein:vir:79 1 MTVVLDSKDLARID---EEYKADSQVWSYLTGGNGVTQR-FRGHNEVRINKLSGFVDATAY-KRGQDNARKTISVGKETV 75 (285) T ss_pred CcchhhHHHHHHHH---HHHHHhhhhhhhcccCCcceeE-ecCCCEEEEeeeccccccccc-ccccCccccccceeeeEE Confidence 99997554433333 3334555666665431111111 2337899999732 23322 233333445666667899 Q ss_pred EeccccccceEecHHHh--ccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCC-CCchhHHHHHHH Q lcl|Aclame:pro 78 NMGEPDNDFFQLRADDL--RDETAYRHRIQS-AARKLANNVELKVANMAAEMGSLVITSPDAIGTNT-ADAWNFVADAEE 153 (430) Q Consensus 78 ~l~~~k~V~~~~t~keL--~~~~~~~r~l~p-Am~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~-~~~~~d~a~a~~ 153 (430) +|++-+.+.|.+..-|. +-.-....+++. .-...+=+||+.-...++..+..... . +.+ ..-+..+-.+.. 
T Consensus 76 tl~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~~~~~---~--~~T~~nv~~~i~~~~~ 150 (285) T protein:vir:79 76 KLTHEDWFGYDLDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAAKKAT---D--SITKDNALDAYDTAEA 150 (285) T ss_pred EeeccccceecccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhcccccc---c--ccCHHHHHHHHHHHHH Confidence 99999999888874333 100111122211 11234457777655555544432211 1 112 223777899999 Q ss_pred HHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceec-ccccc----c- Q lcl|Aclame:pro 154 LMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLT-KSTAT----G- 227 (430) Q Consensus 154 ~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~-~gt~~----~- 227 (430) .|++.++|. +|.+++.|+.+.-| .+... +.......+....|.|-|++..+|-.+.--.+|.-. ++... . T Consensus 151 ~lde~~vp~--~rvl~vTp~~~~~L-k~s~~-~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~Inf 226 (285) T protein:vir:79 151 YMFDNEVPG--GFVMFVSSAYYTAL-KQSAA-VTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVNF 226 (285) T ss_pred HHHHcCCCC--ceEEEEChHHHHHH-Hhhhh-hheecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhccE Confidence 999999993 69999999987644 33322 222211223344455555566665211111111111 11111 1 Q ss_pred ceecccceeeeeE-EEEeec--cccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCc Q lcl|Aclame:pro 228 ITVSGAQSFKPVA-WQLDND--GNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGT 304 (430) Q Consensus 228 ~tV~ga~~~~~~~-~t~~~~--~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~ 304 (430) +.|.......+.- ..+..- +.--..|++.. ...+=+|+|.+.- |.. .=|+-... 
+ T Consensus 227 iiv~~~a~i~~~K~~~~~~f~P~~~~~~d~~~~-------~~R~Y~d~fv~~n--------k~~----~Iy~~~~a--~- 284 (285) T protein:vir:79 227 ILTPLSAIAPIVKYDSVSVIDPSTDRSGNRWTI-------KGLSYYDAIVLDN--------AKK----GIYVAATA--G- 284 (285) T ss_pred EEecCceeccceeeeeeEeECCCCCCCcceeee-------eeeeeeeeeehhh--------ccc----eeeeeecc--c- Confidence 1121111111000 000000 00000111100 1233356666542 100 01222111 1 Q ss_pred ee Q lcl|Aclame:pro 305 HV 306 (430) Q Consensus 305 ~v 306 (430) | T Consensus 285 -~ 285 (285) T protein:vir:79 285 -V 285 (285) T ss_pred -C Confidence 1 No 57 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=95.82 E-value=0.0014 Score=36.23 Aligned_cols=312 Identities=13% Similarity=0.108 Sum_probs=123.7 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc-----cCCccCCccC---CC-- Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ-----EGWDLTDKAT---GL-- 70 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~-----~g~~~s~~~~---d~-- 70 (430) |..+.. .-+.+++.|+.-+..+|+.++...| .--++.|.||.++....+... .|.+.++++- .+ T Consensus 19 ~~~~~~--t~y~~~k~L~~Aa~~lv~~~fA~~~----piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a~G~~~~~g~~y~ 92 (401) T protein:vir:95 19 NSDQMQ--TFFWLKKAIITARKEQYFMPLASVT----NMPKHYGKTIKVYEYVPLLDDRNINDQGIDASGATIVNGNLYG 92 (401) T ss_pred ccceee--ehhhHHHHHhhhhhhhhhhhccccc----ccccccCCeEEEEecccccccccchhcCCCcccccccCccccc Confidence 333322 2356688888877778887777432 223568999998876554442 2333222200 00 Q ss_pred ---CcceEEEE----------ec-------------cccccceEecHHHh-cc-HH-----HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 71 ---LELNVAVN----------MG-------------EPDNDFFQLRADDL-RD-ET-----AYRHRIQSAARKLANNVEL 117 (430) Q Consensus 71 ---~e~sV~v~----------l~-------------~~k~V~~~~t~keL-~~-~~-----~~~r~l~pAm~~LAn~Id~ 117 (430) .-+++..+ ++ +|..-..+||+..+ .. +. ++.+.|+.+...--..|-. 
T Consensus 93 ~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~g~~~~t~d~i~~ 172 (401) T protein:vir:95 93 SSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMNGATQITEAVLQK 172 (401) T ss_pred cccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhhhhhhhHHHHHHH Confidence 01122221 11 22233344554332 11 11 1223333332222222334 Q ss_pred HHHHHH--Hhcccc---eeeccCCCCCCCCCchhHHHHHHHHHHHhCCCcC----------------CCcEEEecHHHHH Q lcl|Aclame:pro 118 KVANMA--AEMGSL---VITSPDAIGTNTADAWNFVADAEELMFSRELNRD----------------MGTSYFFNPQDYK 176 (430) Q Consensus 118 dl~~~~--~~~as~---~~~~~~~~~~~~~~~~~d~a~a~~~L~~~~aP~~----------------~~R~~vl~p~~~a 176 (430) ||++.. +-+++. +....+....++.-.+.++-.+..+|++|-+|+- .-|.+++.|+... T Consensus 173 dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~va~~h~~L~~ 252 (401) T protein:vir:95 173 DLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGATRVMYVGSELVP 252 (401) T ss_pred HHHhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCccccccceEEEEecCchh Confidence 444222 111111 2222223333444468899999999999999971 1244566663333 Q ss_pred HHH--Hhhhh--hhhcc--chhhhhhhhccccccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccc Q lcl|Aclame:pro 177 KAG--YDLTK--RDIFG--RIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKV 250 (430) Q Consensus 177 ~~~--~~~~~--l~~~~--~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~ 250 (430) .+. .++-+ -|.+- -...+.+-+|+||. +.+|+++.+. ..- |-+|.+. ...|++. 
T Consensus 253 di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~-i~~vR~i~~p-~~~-~w~~ag~--~a~~~~~--------------- 312 (401) T protein:vir:95 253 ELKAMKDLFGNKAFIETQHYADAGTIMNGEVGS-IDKFRIIQVP-EML-HWAGAGA--QATGANP--------------- 312 (401) T ss_pred HHHHHHHhcCCCCceehhhcCCccccccccccc-cCceeEEecc-cce-eecCCcc--ccccccc--------------- Confidence 221 11111 01111 11344566777774 6777765322 221 1111100 0000000 Q ss_pred cccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEEeecccccccccccccccccccc Q lcl|Aclame:pro 251 NVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANV 330 (430) Q Consensus 251 ~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nV 330 (430) | |.+.-+.++++..+||-+| T Consensus 313 -------------------~------------------------y~~~~~~~gg~~dVyp~lV----------------- 332 (401) T protein:vir:95 313 -------------------G------------------------YRTSMVSGQEHYDVYPMLV----------------- 332 (401) T ss_pred -------------------c------------------------cccccccCCCcceeeeeeE----------------- Confidence 0 0111112344444555443 Q ss_pred ccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEEEee Q lcl|Aclame:pro 331 NTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALW 410 (430) Q Consensus 331 sa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rldvl 410 (430) +-++|| ++.||+- ++.+..-.....-|+-|- ....|+-.......+=.+ T Consensus 333 -----------------------~G~dAf--~~~~l~g--~g~~~~~~~ivk~pG~~~----ad~~DPlgQ~g~vgwK~~ 381 (401) T protein:vir:95 333 -----------------------VGDDSF--TSIGFQT--DGKSLKFTVMTKMPGKET----ADRNDPYGETGFSSIKWY 381 (401) T ss_pred -----------------------Eccccc--eeccccc--CCccccceeEeecCCcCC----CCCCCcccceehhhhhhh Confidence 112221 1222211 000000000000010000 000122222233344566 Q ss_pred cCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 411 YGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 411 yG~~~v~Pe~agv~l~~q~~ 430 (430) ||+..++||+. 
++|.---- T Consensus 382 ~a~~vL~~e~m-~~ies~a~ 400 (401) T protein:vir:95 382 YGILVKRPERL-ALIKTVAP 400 (401) T ss_pred hhhheecccee-EEEEeecC Confidence 88888888885 43321111 No 58 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=95.77 E-value=0.0015 Score=36.10 Aligned_cols=315 Identities=7% Similarity=-0.052 Sum_probs=125.8 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHH-HhhcCCEEEEecCc---ccccccCCccCCccCCCCcceEE Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAAS-MQRSSNTIWMPVEQ---ESPTQEGWDLTDKATGLLELNVA 76 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~-~~k~GdTV~ip~P~---~~~~~~g~~~s~~~~d~~e~sV~ 76 (430) ||.++.+.-+-.+++.+ ....+.+.++- -.+.... ...-|++|+||.=+ -.+.++-........++.-..-+ T Consensus 1 Mainya~~~~~~Ld~~~---~~~~lts~~l~-~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~~~et 76 (346) T protein:vir:10 1 MTINYAEKYQAAVQQAF---YDGHLYSAELW-NSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSNDWDS 76 (346) T ss_pred CcchhHHHHHHHHHHHH---Hhhhccchhhc-ccccccceEecCCCEEEEEEeeeecccccccccCCcccccccccceeE Confidence 99998776555555544 33433333320 0111111 11248999999743 23333311112112345556889 Q ss_pred EEeccccccceEecHHHh---ccHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcccce-eeccCCCCCCCC-CchhHHHH Q lcl|Aclame:pro 77 VNMGEPDNDFFQLRADDL---RDETAYRHRIQ-SAARKLANNVELKVANMAAEMGSLV-ITSPDAIGTNTA-DAWNFVAD 150 (430) Q Consensus 77 v~l~~~k~V~~~~t~keL---~~~~~~~r~l~-pAm~~LAn~Id~dl~~~~~~~as~~-~~~~~~~~~~~~-~~~~d~a~ 150 (430) .+|++-+...|.+..-|. +..-....++. ..-...+=+||..-...++..+... .+...+. +.+. .-+.-+-. 
T Consensus 77 ~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~~~~~~~~-a~T~~ni~~~i~~ 155 (346) T protein:vir:10 77 YELKNERYWSTLVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAHDGGITTN-TLDEKNILPAFDN 155 (346) T ss_pred EEeeccccceecccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhcccccccc-ccCHHHHHHHHHH Confidence 999999999998883221 11111122221 1222344477766444433322111 1110111 1122 23677888 Q ss_pred HHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceeccccccccee Q lcl|Aclame:pro 151 AEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITV 230 (430) Q Consensus 151 a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV 230 (430) +...|++.++|. ++|.+++.|+.+.-| .+...........+.-.-+|.+|+ +.||. +++.+.-.-. +...+ . T Consensus 156 ~~~~lde~~vp~-~~rvl~vTp~~~~lL-k~s~~f~k~~~v~~~~~i~~~V~s-iDGv~-Ii~VPs~r~~---t~~~f-~ 227 (346) T protein:vir:10 156 MMLDFDEARIPS-TNRILYVTPKTNAIL-KRAEAMNRALTLKDPNNIQRTVYS-LDDVT-IRVVPSDLMQ---TAYDF-S 227 (346) T ss_pred HHHHHHHccCCC-CCeEEEECHHHHHHH-hhchhheeccccccccccceeeee-ecCeE-EEEcchhhcc---cchhh-c Confidence 999999999998 579999999988744 322211111111111123788875 78884 5553222111 10000 0 Q ss_pred cccceeeeeEEEEeeccccccccceeeEEEeec-cceeecccEEEEcceeecccccccccc-ccceEEEEEeec-----C Q lcl|Aclame:pro 231 SGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSA-TTGLKRGDKISFTGVKFLGQMAKNVLA-QDATFSVVRVVD-----G 303 (430) Q Consensus 231 ~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~-tgtlk~GDv~TiaGV~~v~~~tk~~~~-~l~~fvVt~~~~-----~ 303 (430) +|... . ..+-.+| ...+.-++ ..-.|--.+-.|+ |. -|.-+ ++-+++.--+.- . 
T Consensus 228 ~G~~~---~-----t~ak~IN----fiiv~~~A~ia~~K~~~~~if~------P~-~~~~g~~l~~~R~Y~D~fv~~nk~ 288 (346) T protein:vir:10 228 DGSKI---I-----DTAKQIE----MFLIYNGVQIAPEKYSFVGFDQ------PS-AATSGNYLYYEQSYDDVLLLNTKT 288 (346) T ss_pred cCccc---c-----CCcccee----EEEECCceeeeeeeeeeeEeeC------CC-CCcccceeeeeeeeeeeeeecccc Confidence 11000 0 0000000 00000011 1112222222222 21 11111 233322222211 1 Q ss_pred ce--eEEeeccccccccccccccccccccccccccCceeE----EecCCC--c-----e-eeeeecc Q lcl|Aclame:pro 304 TH--VEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVN----ILNVKD--A-----R-TNVFWAD 356 (430) Q Consensus 304 ~~--v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavT----v~~~~~--~-----~-~NlaFhr 356 (430) .. +.+..++...-... --.+.|.++-.|- |+-+|+ + . -=|+.-| T Consensus 289 ~~Iyv~~~~a~~~~~~~~---------~~~~kpt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (346) T protein:vir:10 289 KGIQFVVSDKPKKDQEQS---------GQDAKPTAESTLEEIKAYLDKNHIDYTGKTKKDELLALVK 346 (346) T ss_pred ceEEEeeecccccCccCc---------ccccCcccccchHHHHHHhcccccccccccchhhHHhhcC Confidence 11 22222211100000 0012333322221 222221 1 1 1133334 No 59 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=94.64 E-value=0.0039 Score=33.81 Aligned_cols=296 Identities=13% Similarity=0.078 Sum_probs=117.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhccc-------CCh--HHHHhhcCCEEEEecCcccccccCCccCCccCCCC Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKY-------TPP--AASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y-------~~~--~~~~~k~GdTV~ip~P~~~~~~~g~~~s~~~~d~~ 71 (430) |||+..++.+++..||+..+-.....-+ ++| .+. +......|++|.+|. +..-+|- ++++. 
T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~--~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~---~~~l~G~-----~~~~~ 70 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAK--SAFVQSGIAVSDERVSKNITSGGLLVNMPF---WNDLTGD-----SEVLG 70 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHh--hhhhhcccccccHHHHHHhhcCCCEEEecc---cccCCCc-----ccccC Confidence 9998888877777777766444333221 112 121 222335799999997 2222221 11221 Q ss_pred cceEEE-----------EeccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc------c-ee Q lcl|Aclame:pro 72 ELNVAV-----------NMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGS------L-VI 131 (430) Q Consensus 72 e~sV~v-----------~l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as------~-~~ 131 (430) |+...+ ..-+...--|..++.. +.-.+..+++-++-....+.+.+.+|+..+...-+ . .. T Consensus 71 dg~~~i~~~ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~ 150 (330) T protein:vir:10 71 NGDKALETGKITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGAL 150 (330) T ss_pred CCccccchhhcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhh Confidence 211111 1111112223333322 33345555555554444555555555554432110 0 00 Q ss_pred -eccCCCCCCCCC--chhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhh Q lcl|Aclame:pro 132 -TSPDAIGTNTAD--AWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGF 208 (430) Q Consensus 132 -~~~~~~~~~~~~--~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gf 208 (430) .......+.... +.+.+.+|...|.|+.-. ...++++|..+.+|..+. ...+.. .++ -++.||. +.|. 
T Consensus 151 ~~~~~~~~~~~~a~~s~~~l~~A~~~~GD~~~~---~~~ivmhS~v~~~L~~~~-li~~~~--~s~--~~~~i~~-~~G~ 221 (330) T protein:vir:10 151 EETHVSDQSKASTGIDAGMVLDAKQLLGDSADQ---VTAIAMHSAVYTKLQKDN-LIQYIQ--PTT--ATINIPT-YLGY 221 (330) T ss_pred hhhheecccccccccCHHHHHHHHHHhcccccc---ceEEEEcHHHHHHHHHhh-hhhhhc--ccc--cCccccc-ccce Confidence 000000011112 245688999999887643 467999999999887532 221111 111 1467875 7787 Q ss_pred hHHHhCCCcceecccccccce-ecccceeeeeEEEEeeccccccccceeeEEEeeccceeecc-cEEEEcceeecccccc Q lcl|Aclame:pro 209 DDVLRSPKLPVLTKSTATGIT-VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRG-DKISFTGVKFLGQMAK 286 (430) Q Consensus 209 d~~~~~~~~~~~~~gt~~~~t-V~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~G-Dv~TiaGV~~v~~~tk 286 (430) . +..++.+|.. .+....+. ..||-..+... .....++..-. -.++| |.+...=-|.+|| T Consensus 222 ~-VivdD~~p~~-~~~yt~yl~~~GAi~~~~~~-----~~~~v~~EtdR---------d~~~g~~~l~~r~~~~~hp--- 282 (330) T protein:vir:10 222 R-VIIDDGIAPT-GDIYTSYLFRTGSIGLNTGN-----PSGLTTFETSR---------EAAKGNDMIYTRRALVMHP--- 282 (330) T ss_pred E-EEEeCCCCCC-CCceeEEEEecCceeeeccc-----CCccccccccC---------CccccceEEEEeeEEEeee--- Confidence 5 4567777632 12222222 22221111000 00000000000 00111 2222222222222 Q ss_pred ccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecc Q lcl|Aclame:pro 287 NVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPI 366 (430) Q Consensus 287 ~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl 366 (430) .+-.|....+..+ ..+|..- ..++ +++ =++.|.+.+|.+|---- T Consensus 283 ----~G~s~~~~~~~~~---~~sPt~~-------------------~L~~--------~~N--W~~v~~~k~i~iv~~~~ 326 (330) T protein:vir:10 283 ----YGVKWTGAEVDAG---NITPSNA-------------------DLAK--------FKN--WKRVYEPKNIGIIALKH 326 (330) T ss_pred ----eeeeecccccccC---cCCcChH-------------------HhcC--------CcC--cccccChhhcceEEEEE Confidence 0001111000000 0111110 0000 111 12455555555444332 Q ss_pred cCCC Q lcl|Aclame:pro 367 PANH 370 (430) Q Consensus 367 ~~p~ 370 (430) -+.. 
T Consensus 327 ~~~~ 330 (330) T protein:vir:10 327 KIGK 330 (330) T ss_pred ecCC Confidence 2211 No 60 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=92.36 E-value=0.012 Score=31.17 Aligned_cols=297 Identities=15% Similarity=0.044 Sum_probs=121.5 Q ss_pred CccchhhHHHHHHHHHHHHH-----Hhh--cccchhhcccCChHHHH--hhcCCEEEEecCccc--cccc---CCccCCc Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETI-----SAI--TPMAQKAKKYTPPAASM--QRSSNTIWMPVEQES--PTQE---GWDLTDK 66 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l-----~~~--~Vma~lV~~y~~~~~~~--~k~GdTV~ip~P~~~--~~~~---g~~~s~~ 66 (430) ||.. ++.+++..||...+ .+. +.=+..|...-+.+.-+ +..|++|++|.-... ..++ +.+.+ T Consensus 1 MA~T--~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~-- 76 (324) T protein:vir:59 1 MAYT--KISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLV-- 76 (324) T ss_pred CCce--eeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccc-- Confidence 9955 22333333333332 111 11122221101112222 346999999975433 1111 21111 Q ss_pred cCCCCcceEEEEeccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc------ccceeeccCCCC Q lcl|Aclame:pro 67 ATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEM------GSLVITSPDAIG 138 (430) Q Consensus 67 ~~d~~e~sV~v~l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~------as~~~~~~~~~~ 138 (430) ...+.-++-..++ +++.-.|+.++.. +.-.++.+++-++-...++++++.+|+..+... +.....++.+. 
T Consensus 77 ~~~l~t~~~~a~i-~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~~- 154 (324) T protein:vir:59 77 PQKINAGQDKAVL-ILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDISGTA- 154 (324) T ss_pred hhhcccceeeEEE-EeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeeeeccc- Confidence 1122222222222 2344446666533 455677777777777788888888888765421 12222332221 Q ss_pred CCCCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcc Q lcl|Aclame:pro 139 TNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLP 218 (430) Q Consensus 139 ~~~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~ 218 (430) ...-....+.+|...|-|+.- .-..++++|..+.+|..+.- ..+. ...-..+.||. +.|.. ++.++.+| T Consensus 155 -~~~~s~~~l~~A~~~~GD~~~---~~~~ivmhS~v~~~L~~~~l-i~~~----~~s~~~~~i~~-~~G~~-VivdD~~p 223 (324) T protein:vir:59 155 -DGIYSAETFVDASYKLGDHES---LLTAIGMHSATMASAVKQDL-IEFV----KDSQSGIRFPT-YMNKR-VIVDDSMP 223 (324) T ss_pred -cceecHHHHHHHHHHhCCccc---CcEEEEEchHHHHHHHHhhh-hhhc----cccccCceeee-ecccE-EEEeCCCC Confidence 112235678899999988753 34678999999999875421 1111 11112467775 77876 45777777 Q ss_pred eecc-ccc---ccce-ecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccc Q lcl|Aclame:pro 219 VLTK-STA---TGIT-VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDA 293 (430) Q Consensus 219 ~~~~-gt~---~~~t-V~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~ 293 (430) .... ++. .++. ..||-..+... .......| |.. 
.+.=|.+.-..-|.+||.= T Consensus 224 ~~~~~~~~~~y~s~l~~~GAi~~~~~~-----~~v~vE~d-Rd~---------~~g~~~l~~r~~~~~~p~G-------- 280 (324) T protein:vir:59 224 VETLEDGTKVFTSYLFGAGALGYAEGQ-----PEVPTETA-RNA---------LGSQDILINRKHFVLHPRG-------- 280 (324) T ss_pred ccccCCCCceEEEEEEecCeEEEeecC-----CCcceecc-cCc---------cccceEEEEeeEEEeEeee-------- Confidence 5322 111 1221 23332111100 00000000 000 0111333333333333310 Q ss_pred eEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccc Q lcl|Aclame:pro 294 TFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDA 358 (430) Q Consensus 294 ~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A 358 (430) |.-+.. ..-.++|..--+ ...+.+.+.| +-=+|.++- |-+.=+| T Consensus 281 -~s~~~~---~~~~~sPt~~~L--~~~~NW~~v~--------~~k~i~i~~-------~~~~~~~ 324 (324) T protein:vir:59 281 -VKFTEN---AMAGTTPTDEEL--ANGANWQRVY--------DPKKIRIVQ-------FKHRLQA 324 (324) T ss_pred -EEeccc---ccCCCCCChhhh--cCCccccccc--------CccccceEE-------EEeeccC Confidence 111100 001123332111 1122222211 111222211 2222233 No 61 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=92.21 E-value=0.012 Score=31.04 Aligned_cols=316 Identities=8% Similarity=0.018 Sum_probs=118.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccc-------chhhcccCChHHHHhhcCCEEEEecCccc--cccc---CCccCCccC Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPM-------AQKAKKYTPPAASMQRSSNTIWMPVEQES--PTQE---GWDLTDKAT 68 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vm-------a~lV~~y~~~~~~~~k~GdTV~ip~P~~~--~~~~---g~~~s~~~~ 68 (430) ||.. ++.+++..||...+-.+..+ |..|..--..+......|++|+||.=... ..++ +.+.+ .. 
T Consensus 1 MA~T--~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~--~~ 76 (351) T protein:vir:15 1 MAET--HLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDID--VN 76 (351) T ss_pred CCce--eeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccc--hh Confidence 9965 33344444444332222111 11121100122333457999999972211 1111 11111 11 Q ss_pred CCCcceEEEEeccccccceEecHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---c----cceeeccCCCCC Q lcl|Aclame:pro 69 GLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEM---G----SLVITSPDAIGT 139 (430) Q Consensus 69 d~~e~sV~v~l~~~k~V~~~~t~ke--L~~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~---a----s~~~~~~~~~~~ 139 (430) .+.-++-..++ +.+.-.|..++.. +...++.+++-.+-...++.+++.+|+..+... . ...+......+. T Consensus 77 kitt~~~~a~i-~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~t~~~~~ 155 (351) T protein:vir:15 77 NLTSGKQQGIK-FYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKVYDQTKVSPS 155 (351) T ss_pred eecccceeEEE-EeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccceecccccccc Confidence 11111222222 2222224444432 445677777777766777788888887765421 1 111222222222 Q ss_pred CCCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcce Q lcl|Aclame:pro 140 NTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPV 219 (430) Q Consensus 140 ~~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~ 219 (430) ...-+...+.+|...|-+..=.. -..++++|..+.+|..+. -+.+... ++ .+..||. +.|.. ++.++.+|. 
T Consensus 156 ~~~is~~~l~~A~~~~GD~~~~~--~~~ivmhS~v~~~L~~~~-li~~~~~--s~--~~~~i~t-~~G~~-VivdD~~p~ 226 (351) T protein:vir:15 156 EPMFGAKGFTGAIGLMGDLQDTA--FGAIAVNSATYSLMKVQG-LIETIQP--QN--GATPFEA-YNGLR-IVLDDDIEI 226 (351) T ss_pred ccccCHHHHHHHHHHhccccccc--eEEEEEChHHHHHHHhhh-hhhhccc--cc--cCcccce-ecceE-EEEcCCCcc Confidence 22234577999999987632222 255788999998886532 1222111 11 2456886 88986 557888876 Q ss_pred ecccccc----cc-eecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccE---EEEcceeecccc--ccccc Q lcl|Aclame:pro 220 LTKSTAT----GI-TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDK---ISFTGVKFLGQM--AKNVL 289 (430) Q Consensus 220 ~~~gt~~----~~-tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv---~TiaGV~~v~~~--tk~~~ 289 (430) ...+... ++ ...||-..+.... ...++ |... ..++-..|-- +. +-..|+.|-+.. ++-.. T Consensus 227 ~~~~~~~~~ytsyl~~~GAi~~~~~~~-------~ve~~-rd~~-~~~g~d~l~~-r~~~~~hp~G~s~~~~~~~~~~~s 296 (351) T protein:vir:15 227 DLTDKTKPVSTSYIFAPGAVRYSTNMR-------STETK-YDPL-INGGQDVIVQ-KRVGTIHVAGTSIKASFSPSKASF 296 (351) T ss_pred ccCCCCCceeEEEEEecceeeeecCCc-------Cccee-eccc-CCCCceEEEE-eeeeeeeeeeeeecccccccCcCC Confidence 5433211 22 2344432221110 00000 0000 0000000110 11 111222221110 00000 Q ss_pred cccceEEEEEeecCceeEE----eeccccccccccccccccccccccccccCceeEEecCCCce Q lcl|Aclame:pro 290 AQDATFSVVRVVDGTHVEI----TPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDAR 349 (430) Q Consensus 290 ~~l~~fvVt~~~~~~~v~I----~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~ 349 (430) +- -++..++++++. -|+-|+...- ..+.-.+++..-.+.+|=|-.-..++. 
T Consensus 297 Pt-----~~~L~~~~NW~~v~~~d~k~I~iv~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 351 (351) T protein:vir:15 297 PT-----IDELAKSSTWEVVDGIDVRSIGVVAY----TAQLDPALTPGAQMPAADTSTDTGTTK 351 (351) T ss_pred cC-----hHHhcCCcccccccCCCccccceEEE----EEecCcccccCCcCcCCCCccccCCCC Confidence 00 001122334333 2433332100 000000000000000000000001111 No 62 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=91.81 E-value=0.014 Score=30.73 Aligned_cols=251 Identities=12% Similarity=0.041 Sum_probs=95.2 Q ss_pred Cccc-hhhHHHHH---HHHHHHHHHhh----cccchhhcccCChHHHHhhcCCEEEEecCcccccccCCccCCccCCCCc Q lcl|Aclame:pro 1 MALN-EGQIVTLA---VDEIIETISAI----TPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDKATGLLE 72 (430) Q Consensus 1 MAn~-~~~~~~~~---~~~vl~~l~~~----~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~~~s~~~~d~~e 72 (430) ||-. +.+.-+.. -=+++..|+.+ +-+-. |.+..|.+ .|+||++|+=.. .| .++|+.| T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lg-i~r~~p~a-----~G~tIt~pK~~~----tg-----da~dVaE 65 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLG-VTRRETLT-----NDLKIQTYKWEV----TL-----DQTDPGE 65 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhc-cccccccc-----cCCeEEeeeeee----ec-----ccccccC Confidence 8876 43322221 11223333211 11110 11222322 499999998321 12 2223333 Q ss_pred c--------------eEEEEeccccccceEecHHHhccHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccC Q lcl|Aclame:pro 73 L--------------NVAVNMGEPDNDFFQLRADDLRDETA---YRHRIQSAARKLANNVELKVANMAAEMGSLVITSPD 135 (430) Q Consensus 73 ~--------------sV~v~l~~~k~V~~~~t~keL~~~~~---~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~ 135 (430) + ...+++.|.+.. .|+|.+.+..+ ..+-=++=.+.|+++|+.|++..++. +.+-+. 
T Consensus 66 Ge~Iplskvt~~~~~t~t~kikK~rK~---tTdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lkt-at~t~t--- 138 (295) T protein:vir:99 66 GETIPLSKVTRTKDKDYTVKWFKKRRA---TTAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKT-KPTKVK--- 138 (295) T ss_pred CcccchhhheeeeeeeeEEEeeeeccc---ccHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhcc-Cceeee--- Confidence 3 355666665552 36666532211 11222445567999999999976653 222211 Q ss_pred CCCCCCCCch-hHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhh-hhccchhhhhhhhccccccchhhhHHHh Q lcl|Aclame:pro 136 AIGTNTADAW-NFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKR-DIFGRIPEEAYRDGTIQRQVAGFDDVLR 213 (430) Q Consensus 136 ~~~~~~~~~~-~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l-~~~~~~~~~a~r~g~igr~~~Gfd~~~~ 213 (430) ...+ ..++.+...|+...--.+....++++|.+.+.++++..-. ....+...+.+. +|.|+..+++ T Consensus 139 ------g~~lq~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~------nfLG~q~II~ 206 (295) T protein:vir:99 139 ------GVGLQKALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLK------NFLGMQNVIV 206 (295) T ss_pred ------hhhHHHHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhhhhhhhhhh------hhhccceEEE Confidence 1111 1234443334332222333467999999999887654331 111111122222 4778765666 Q ss_pred CCCcceeccccccc-----ce-----ecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccc Q lcl|Aclame:pro 214 SPKLPVLTKSTATG-----IT-----VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQ 283 (430) Q Consensus 214 ~~~~~~~~~gt~~~-----~t-----V~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~ 283 (430) +..++ .|..-. +. ++|..-. .......+.. +..+++-+....=.-.|.+.+.|+... 
T Consensus 207 S~kv~---~G~~~aT~~~Ni~~ay~~~~~g~l~--~~f~~~~D~t------glIg~~h~~~~~~~t~et~~~~~~~lf-- 273 (295) T protein:vir:99 207 MPSVP---EGKIYSTAVENLVFASLNVKGGDLG--GLFADFTDET------GLIAAARNRQLSNLTYESVFFGANVLF-- 273 (295) T ss_pred cccCC---CceEEEeeccceEEEEecCCchhhh--hhhhhccCcc------cceEEEeccccceeeehhhhHhHHHhc-- Confidence 76665 232111 11 1110000 0000001100 001111000000000111111111000 Q ss_pred cccccccccceEEEEEeec-------Cc Q lcl|Aclame:pro 284 MAKNVLAQDATFSVVRVVD-------GT 304 (430) Q Consensus 284 ~tk~~~~~l~~fvVt~~~~-------~~ 304 (430) +.-...+|..... || T Consensus 274 ------pE~~dgiv~~tI~~~~~~~~~~ 295 (295) T protein:vir:99 274 ------AEIPEGVVEATIEAAAVPGIGG 295 (295) T ss_pred ------ccccceEEEEEEecCcCCCCCC Confidence 0001112222211 22 No 63 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=88.73 E-value=0.031 Score=28.89 Aligned_cols=269 Identities=13% Similarity=0.019 Sum_probs=108.0 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccc-------ccccCCccCCccCCCCcc Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQES-------PTQEGWDLTDKATGLLEL 73 (430) Q Consensus 1 MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~-------~~~~g~~~s~~~~d~~e~ 73 (430) |||++. ..++....+-+.|...++-+-|.. -+.... ..=|.+|+||.=+.. +.++ |+..-+..++.-. 
T Consensus 1 Mantl~-ya~~~~~~Ld~~~~~~~~t~~l~~--~~~~v~-~~Gak~vkIp~is~~~~~TsGl~dy~-R~~g~~~g~v~~~ 75 (302) T protein:vir:78 1 MANSLA-LAQIYQDNIDKAIAVNSKSAFLEA--NPNNVQ-YNGGNTIKIADISFGSGTTGDLKAYN-RSTGFTQGSVTLA 75 (302) T ss_pred CCchhH-HHHHHHHHHHHHHHhhhceeeccc--CCceEE-EecCcEEEEEEEEeeccccccccccc-cccCccccceeee Confidence 999874 334444444444455555554431 111111 334789999974321 1121 2222222344455 Q ss_pred eEEEEeccccccceEecHHHh---ccHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcccceeeccCCCCC-CCC-CchhH Q lcl|Aclame:pro 74 NVAVNMGEPDNDFFQLRADDL---RDETAYRHRIQ-SAARKLANNVELKVANMAAEMGSLVITSPDAIGT-NTA-DAWNF 147 (430) Q Consensus 74 sV~v~l~~~k~V~~~~t~keL---~~~~~~~r~l~-pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~-~~~-~~~~d 147 (430) .-+.+|++-+...|.+..-|. +..-....+.. ..-...+=+||+.-...++..+....+....... .+. .-+.+ T Consensus 76 ~et~tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~t~~nvl~~ 155 (302) T protein:vir:78 76 WSDYTLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVGGVIDLSKPDASAQALMGD 155 (302) T ss_pred eeeEEeeeccceeeeccccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccCccccccccchhHHHHHHH Confidence 677899999998888874322 11111122221 1223344577776554444333322222222111 111 22567 Q ss_pred HHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcce--e----- Q lcl|Aclame:pro 148 VADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPV--L----- 220 (430) Q Consensus 148 ~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~--~----- 220 (430) +..+...|++. ++|.+++.|+.+.-| .+...... ....+.+.+|.|-|.+..+|.+ .--.+|. . 
T Consensus 156 i~~~~~~~~e~-----~~~vl~vtp~~~~~L-k~a~~~~~--~~~~~~~~~~~i~~~V~~lDgv-~Ii~VPs~r~~t~~~ 226 (302) T protein:vir:78 156 IATAMELVDDS-----NQLILVTSPTTLAGL-LNTALIRE--SKNTQVLRRGEVDTKITFIQDV-EVLQVPSEYLYDKVA 226 (302) T ss_pred HHHHHHHhhcc-----CCeEEEEChHHHHHH-hcchhhcc--ceeccccccccccceeeeeccc-EEEEchhhhccccee Confidence 77888888874 379999999977644 33222211 1122233444444555444431 1111111 1 Q ss_pred -ccccc---c----c-ceec-ccceeeeeEEEEeeccccccccc--eeeEEEeeccceeecccEEEEcceeecccccccc Q lcl|Aclame:pro 221 -TKSTA---T----G-ITVS-GAQSFKPVAWQLDNDGNKVNVDN--RFATVTLSATTGLKRGDKISFTGVKFLGQMAKNV 288 (430) Q Consensus 221 -~~gt~---~----~-~tV~-ga~~~~~~~~t~~~~~~~~~~d~--~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~ 288 (430) +.|.. + . +.|. .|...-.....+..-.-..+.+| +.. ...+=+|+|.+.-- .. T Consensus 227 f~~G~~~~~~ak~INfiiv~~~a~ia~~K~~~~~if~P~~~~~gd~~l~-------~~R~Y~D~fV~~nk--------~~ 291 (302) T protein:vir:78 227 PKVGVPDYTGAKKIPYMIFKRDAPTGIVKTDKVRVFEPDTNQSADAYKV-------DLRLYHDLIVPKNQ--------RP 291 (302) T ss_pred ccCCccccCCccceeEEEECCCeeeeeeeeeeeEeeCCCCCCCcceeee-------eeeeEeeeeeeccc--------cC Confidence 11110 0 0 0010 00000000000000110111100 000 12233566655421 00 Q ss_pred ccccceEE-EEEeec Q lcl|Aclame:pro 289 LAQDATFS-VVRVVD 302 (430) Q Consensus 289 ~~~l~~fv-Vt~~~~ 302 (430) .=|+ +++..+ T Consensus 292 ----gI~~~~~~~~~ 302 (302) T protein:vir:78 292 ----GIIKASFGTIA 302 (302) T ss_pred ----eEEEeeccccC Confidence 0011 122222 No 64 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=87.46 E-value=0.039 Score=28.33 Aligned_cols=284 Identities=12% Similarity=0.044 Sum_probs=107.8 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccccc-CCccCCccCCCCcceEEEEe Q lcl|Aclame:pro 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE-GWDLTDKATGLLELNVAVNM 79 (430) Q Consensus 1 
MAn~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~-g~~~s~~~~d~~e~sV~v~l 79 (430) ||.+..+.-+-.|++++ ...++-+-+.. + +.+.+.-|.+|+||+=....-.+ .|+..-+..++.-..-+.+| T Consensus 8 mAlnya~~~~~~Ld~~~---~~~~~t~~l~~---~-~~~~~~Gak~VkIp~i~~~gl~dY~R~~g~~~g~v~~~~et~tl 80 (311) T protein:vir:99 8 RGFNYVTKDGNLLDQKI---TAGLFTAALGT---P-EVDLVNGGRSFTLKTISTSGLKDHTRGKGFNSGTISDEKTIYTM 80 (311) T ss_pred hHHHHHHHHHHHHHHHH---Hhhhcccceec---C-chheeecCCEEEEEeeeeccccccccccCccccceeeeeeEEEe Confidence 55544444444455444 34554444432 2 23344458899999743211111 12222334555666788999 Q ss_pred ccccccceEecH---HHhccHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcccceeeccCCCC----------CCCCC-c Q lcl|Aclame:pro 80 GEPDNDFFQLRA---DDLRDETAYRHRIQ-SAARKLANNVELKVANMAAEMGSLVITSPDAIG----------TNTAD-A 144 (430) Q Consensus 80 ~~~k~V~~~~t~---keL~~~~~~~r~l~-pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~----------~~~~~-~ 144 (430) ++-+...|.+.. +|-+..-....+.. ..-...+=+||+.-...++..+....+...... +.+.+ - T Consensus 81 ~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~~~lt~~nv 160 (311) T protein:vir:99 81 GQDRDVEFYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTEETLDETNA 160 (311) T ss_pred eeccceeeecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccccchhhhccccccccccCHHHH Confidence 999999998883 22111111111111 111223345665544444443333222211100 01111 1 Q ss_pred hhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhH--HHhC-CCcceec Q lcl|Aclame:pro 145 WNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDD--VLRS-PKLPVLT 221 (430) Q Consensus 145 ~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~--~~~~-~~~~~~~ 221 (430) ++.+-.+...|++ +|. ++|.+++.|+.+. ++.+...... ........++.|-|.+..+|. +.+. 
+.-.-.+ T Consensus 161 l~~l~~~~~~~~~--v~~-~~rvl~vTp~~~~-lLk~~~~~~r--~~~~~~~~~~~i~~~V~~lDgv~Ii~V~ps~r~~t 234 (311) T protein:vir:99 161 YSQLKTGIGKVRK--YGT-QNLVGYVSSEVMD-ALERSKEFTR--NITNQNVGTTALESRITSIDGVQLIEVYESNRFMT 234 (311) T ss_pred HHHHHHHHHHHHh--cCC-CCeEEEEChHHHH-HHhhchhhhe--eeecccccccccccccceecCeEEEEecCchhhcc Confidence 4556667777776 687 5799999999876 4443322211 011122223334344444432 2222 1110000 Q ss_pred ccccccceecccceeeeeEEEEeeccccccccceeeEEEeecc-ceeecccEEEEcceeeccccccccccccceEEEEEe Q lcl|Aclame:pro 222 KSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSAT-TGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRV 300 (430) Q Consensus 222 ~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~t-gtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~ 300 (430) + ... .+|... + ..+-.+| ...+.-++. .-.|.--+-.|+= +.|+. ..+++-+++.--+ T Consensus 235 ~---~~f-t~G~~~-~-------~~ak~IN----fiiv~~~a~i~~~K~~~v~~f~P--~~~~~---gd~~l~~~R~Y~D 293 (311) T protein:vir:99 235 K---YDF-TDGAKP-T-------EDAKAIN----FLVVAKPAVISIVKENAVFLFAP--GQHTD---GDGYLYQNRLYHD 293 (311) T ss_pred h---hhh-cCCccc-c-------Ccccccc----eEEeCCCeeeeeeeeeeeeeeCC--CCCCC---cceeeeeeeeeee Confidence 0 000 011100 0 0000000 011111111 1122211222210 11111 1133444333333 Q ss_pred ecCceeEEeeccccccccccccccccccccccc Q lcl|Aclame:pro 301 VDGTHVEITPKPVALDDVSLSPEQRAYANVNTS 333 (430) Q Consensus 301 ~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~ 333 (430) .- |...- .-+-|.|+..+ T Consensus 294 ~f-----v~~nk----------~~~Iyv~~k~A 311 (311) T protein:vir:99 294 LF-----IKKHK----------RDGIFVSVKKA 311 (311) T ss_pred ee-----eeccc----------cCeEEEeeecC Confidence 11 00000 00112222221 No 65 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=86.12 E-value=0.048 Score=27.82 Aligned_cols=266 Identities=11% Similarity=-0.009 Sum_probs=103.9 Q ss_pred Cccchhh-HHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccCCcc---CCccCCCCcceEE Q lcl|Aclame:pro 1 
MALNEGQ-IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDL---TDKATGLLELNVA 76 (430) Q Consensus 1 MAn~~~~-~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~~~---s~~~~d~~e~sV~ 76 (430) ..-+-+. +.+.+..++|+.++...++.++++...- .|.++.+|+.......-.+.. .....++.=..+. T Consensus 117 ~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~-------~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~ 189 (395) T protein:vir:43 117 IDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTT-------ESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELEN 189 (395) T ss_pred cCCCCccccchhhHHHHHHHHHhhhhHHhhccceec-------CCCceEEEEEecCCCceeeecCCccccccccceeEEE Confidence 2222222 3466778899999999999999865422 255677776433221111111 1111222223444 Q ss_pred EEeccccccceEecHHHhccHHHHHHHHHHH-HHHHHHHHHHHHHHHHH--------hcccceeeccCCCCCCCCCchhH Q lcl|Aclame:pro 77 VNMGEPDNDFFQLRADDLRDETAYRHRIQSA-ARKLANNVELKVANMAA--------EMGSLVITSPDAIGTNTADAWNF 147 (430) Q Consensus 77 v~l~~~k~V~~~~t~keL~~~~~~~r~l~pA-m~~LAn~Id~dl~~~~~--------~~as~~~~~~~~~~~~~~~~~~d 147 (430) +++.+-. +-+.+|.+=|.+..+.+++|... .++++..+|..+++-.- ...........+........+.+ T Consensus 190 ~~~~k~~-~~~~is~ell~d~~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~ 268 (395) T protein:vir:43 190 APVRTIA-HLFKASRQILDDASALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDR 268 (395) T ss_pred EeeeeEE-EeehhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccchhHHH Confidence 4443333 33455544344444556666554 35788888887764210 00000011111111111223666 Q ss_pred HHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceeccccccc Q lcl|Aclame:pro 148 VADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATG 227 (430) Q Consensus 148 ~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~ 227 (430) +..+-..|.....+. -..+++|.+...+. .+ ... -||.+ +.+ . 
..+ ++ T Consensus 269 i~~~~~~~~~~~~~~---~~~vmn~~~~~~l~-~l---kd~------------~G~~i------~~~--~---~~~--~~ 316 (395) T protein:vir:43 269 IRLAILQAQLAEFPA---SGIVLNPIDWALIE-LN---KDA------------ENRYI------IGS--P---QNG--TT 316 (395) T ss_pred HHHHHHhhccccCCC---cEEEEcHHHHHHHH-Hh---hcc------------CCcee------ccc--c---ccC--CC Confidence 767766666555443 24899999876553 21 111 12321 110 0 001 11 Q ss_pred ceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeE Q lcl|Aclame:pro 228 ITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVE 307 (430) Q Consensus 228 ~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~ 307 (430) .++.|-+- . .+..+.+|+++ | | +..++....+-.+-+|. T Consensus 317 ~~l~G~pV-----v---------------------~~~~~~~~~~~-~-g-------------d~~~~~~~~~~~~~~i~ 355 (395) T protein:vir:43 317 PTLWRLPV-----V---------------------ETQAITQDEFL-T-G-------------AFSLGAQIFDRMDIEVL 355 (395) T ss_pred ceecceee-----E---------------------EcCCCCCCcEE-E-E-------------eccceEEEEEecceEEE Confidence 12222110 0 00112223322 1 1 11222111222222333 Q ss_pred EeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEeccc Q lcl|Aclame:pro 308 ITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIP 367 (430) Q Consensus 308 I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~ 367 (430) +++..-. .+ --|-.++-+..--. =...+++||+..+.+-. 
T Consensus 356 ~~~~~~~-----------~f------~~~~~~~r~~~r~d---~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 356 VSTENDK-----------DF------ENNMVTIRAEERLA---FAVYRPEAFVTGSLTAS 395 (395) T ss_pred Eeccccc-----------hh------hcCcEEEEEEEeec---cEEecccceEEEEeccC Confidence 3221000 00 00000111100000 02234555555444433 No 66 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=79.41 E-value=0.1 Score=25.96 Aligned_cols=273 Identities=8% Similarity=0.001 Sum_probs=104.9 Q ss_pred Cccc-------------hhhHH--HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEec--Cccccccc--CC Q lcl|Aclame:pro 1 MALN-------------EGQIV--TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPV--EQESPTQE--GW 61 (430) Q Consensus 1 MAn~-------------~~~~~--~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~--P~~~~~~~--g~ 61 (430) |-|- .+-++ +.++...|..+-....++.++ |+. .+++.+-.|.... |. +..-+ +. T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~l--f~~---~~a~~~~~v~f~~~~p~-~~~~d~e~V 74 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESL--FRN---GGANPNGVVAYNEGNPS-FLEDDVADV 74 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhh--hhc---ccccccceeEEEecccc-cccCcHhhc Confidence 2111 00000 112222222233455555555 332 2244444555544 21 10000 00 Q ss_pred ccCCc--cCCCCcceEEEEeccccccceEecHHHhc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeeccCCC Q lcl|Aclame:pro 62 DLTDK--ATGLLELNVAVNMGEPDNDFFQLRADDLR--DETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAI 137 (430) Q Consensus 62 ~~s~~--~~d~~e~sV~v~l~~~k~V~~~~t~keL~--~~~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~ 137 (430) ...+. ..+...+.-.+...+.....|++|.|... 
.-+..+|.++++.++++.++|..++.............+.+ T Consensus 75 aEggEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~- 153 (318) T protein:vir:10 75 AEFGEIPVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTA- 153 (318) T ss_pred cCcccccccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcC- Confidence 00000 01111222233233344677889988853 34566788888999999999999887543321111111110 Q ss_pred CCCCCCchhHHHHHHHHHHHhC------------CCcC-CCcEEEecHHHHHHHHHhhhhhhhcc-ch--hhhhhh-hcc Q lcl|Aclame:pro 138 GTNTADAWNFVADAEELMFSRE------------LNRD-MGTSYFFNPQDYKKAGYDLTKRDIFG-RI--PEEAYR-DGT 200 (430) Q Consensus 138 ~~~~~~~~~d~a~a~~~L~~~~------------aP~~-~~R~~vl~p~~~a~~~~~~~~l~~~~-~~--~~~a~r-~g~ 200 (430) ..+.....+|+..|........ .+.+ ..-.+|++|....-+.++..-+..-. +. .....+ .|. T Consensus 154 w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~ 233 (318) T protein:vir:10 154 WDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGN 233 (318) T ss_pred CCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhccccccc Confidence 0111122334444433221100 0100 11358999999888876544322211 11 122222 477 Q ss_pred ccccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceee Q lcl|Aclame:pro 201 IQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKF 280 (430) Q Consensus 201 igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~ 280 (430) +++.++||+++ .+.++|. ++ .+.+-. +.. |++.-.--++..+.|. 
T Consensus 234 ~~g~~lGl~vi-~s~~~p~---~~--alvlq~-g~v----------------------------G~~~d~~pl~~t~~~~ 278 (318) T protein:vir:10 234 FPGSVMGLNVI-RSRTFPI---DR--VLIMER-GTV----------------------------GFYSDTRPLQFTALYP 278 (318) T ss_pred ccceeeceEEe-ecCccCC---Ce--eEEEec-CCc----------------------------ceeeccccceeeeccc Confidence 76678899976 6777773 22 233321 110 1111011123333221 Q ss_pred cccc---ccccccccceEEEE--EeecCceeEEeeccccccccccccccccccccccccccCceeEEecCC Q lcl|Aclame:pro 281 LGQM---AKNVLAQDATFSVV--RVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVK 346 (430) Q Consensus 281 v~~~---tk~~~~~l~~fvVt--~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~ 346 (430) =+-. .+......+-+.++ ++..+ +- .--||=+++. T Consensus 279 egg~~~g~~~~s~~~~~~~~~~~~V~~P-------kA------------------------~~~itgi~~~ 318 (318) T protein:vir:10 279 EGNGPNGGPTESYRADASHKRALAVDQP-------KA------------------------ALWLTGIVTP 318 (318) T ss_pred CCCCCCCCcchhhheehheeeeeeeeCc-------ce------------------------eEEEeeccCC Confidence 0000 01111111112222 11111 11 0011111111 No 67 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=75.97 E-value=0.14 Score=25.25 Aligned_cols=282 Identities=17% Similarity=0.150 Sum_probs=135.8 Q ss_pred Cccchhh----HHHHHHHHHHHHHHhhcc---cchhhcccCChHHHHhhcCCEEEEecCcccccccCC-ccCCccCCCCc Q lcl|Aclame:pro 1 MALNEGQ----IVTLAVDEIIETISAITP---MAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGW-DLTDKATGLLE 72 (430) Q Consensus 1 MAn~~~~----~~~~~~~~vl~~l~~~~V---ma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~-~~s~~~~d~~e 72 (430) |-..... ..++-.++++..|-+.+. .+|.|.. ...||++.||-=-....++-. 
+..--..++-- T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~D--------F~~G~~L~I~tiGs~~~~~~~E~~~~~~~~i~T 72 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSD--------FGSGETLHIKTIGSVTLQEAEEDTPLIYNPIET 72 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhcc--------CCCCCEEEecccCceeeeccccCCCeeeccccc Confidence 5433222 123333444433322211 2222222 236999999852222222210 01111234445 Q ss_pred ceEEEEeccccccceEecHHHhc-cHHHHHHHH----HHHHHHHHHHHHHHHHHHHHhcc-----cce----eeccCCCC Q lcl|Aclame:pro 73 LNVAVNMGEPDNDFFQLRADDLR-DETAYRHRI----QSAARKLANNVELKVANMAAEMG-----SLV----ITSPDAIG 138 (430) Q Consensus 73 ~sV~v~l~~~k~V~~~~t~keL~-~~~~~~r~l----~pAm~~LAn~Id~dl~~~~~~~a-----s~~----~~~~~~~~ 138 (430) +-+++.|.+-+.-.+.++.+ |+ +..+..|+. ....|++-..-+.|+++.-.++- +.. .+.--..+ T Consensus 73 GEIt~~i~~Y~G~A~~vt~~-LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~ 151 (313) T protein:vir:95 73 GEITFQITEYKGDAWYVTDD-LREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAE 151 (313) T ss_pred ceEEEEEEeecCChhhhhhh-hhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEecc Confidence 56888888888877777653 44 333444443 33567888888999887654321 111 11111223 Q ss_pred CCCCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhh--hh-hhhccchhhhhhhhcc-----ccccchhhhH Q lcl|Aclame:pro 139 TNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDL--TK-RDIFGRIPEEAYRDGT-----IQRQVAGFDD 210 (430) Q Consensus 139 ~~~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~--~~-l~~~~~~~~~a~r~g~-----igr~~~Gfd~ 210 (430) +.+.-++++|...+-.|++..+|. ++|..++||-.++-+-+-. +. ..++... 
-++.|+ .=|.++|+| T Consensus 152 T~~~~~~~~~~~~~~~~~~a~~P~-~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~---I~ESG~A~~~~Fi~~~YG~D- 226 (313) T protein:vir:95 152 TNGVFALKHLIAMRLAFDKANVPA-EGRVFIVDPVAEATLNGLVTITHDVTDFGKM---ILESGMARGQRFIMNLYGWD- 226 (313) T ss_pred CCceehhhHHHHhhhhhhhccCCc-cceEEEEcchhhhhhhhhheeecccccccce---eeeccCCchhHHHHHHhhhh- Confidence 444456889999999999999999 5799999999887653321 11 1111111 122222 223466776 Q ss_pred HHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeecccccccccc Q lcl|Aclame:pro 211 VLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLA 290 (430) Q Consensus 211 ~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~ 290 (430) ++-|..+- + |+. .|++ .||.=|.|-.|.- T Consensus 227 i~~SN~L~-----------~--AN~----------------~D~~-------tT~~G~~~NlFM~--------------- 255 (313) T protein:vir:95 227 ILTSNRLH-----------V--ANY----------------NDGT-------TTGNGYVGNLFMC--------------- 255 (313) T ss_pred hhhhhhhh-----------h--ccc----------------cccc-------cccCceeeeeeee--------------- Confidence 44443221 0 000 0111 0111122332221 Q ss_pred ccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCC Q lcl|Aclame:pro 291 QDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANH 370 (430) Q Consensus 291 ~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~ 370 (430) |.. 
-++.+ |.-|-|-||..+ T Consensus 256 ------i~D---~~~~P---------------------------------------------------~~~AWr~MP~s~ 275 (313) T protein:vir:95 256 ------ILD---DQTKP---------------------------------------------------IMGAWRRMPKSE 275 (313) T ss_pred ------eec---ccccc---------------------------------------------------eeeeeccccccc Confidence 000 01110 122233444432 Q ss_pred CchhhceeEEEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 371 ELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 371 ~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) + ..+.. -+-.+-.++|| ||+-.+|-|--++++++-+| T Consensus 276 ~---~~~~~----------------~~~~~~~~~~R----~G~Gi~R~~~L~~~~~~A~~ 312 (313) T protein:vir:95 276 G---ERNKD----------------RARDEHVVRCR----YGFGIQRLDTLGLLATSATA 312 (313) T ss_pred c---ccccc----------------cccccceeeee----ecccceeecceeEEEecccc Confidence 2 11111 11122244555 99999999999999999999 No 68 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=75.01 E-value=0.15 Score=25.08 Aligned_cols=285 Identities=11% Similarity=0.028 Sum_probs=102.8 Q ss_pred Cccc----hhh-HHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc----cCCccCCccCCCC Q lcl|Aclame:pro 1 MALN----EGQ-IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ----EGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~----~~~-~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~----~g~~~s~~~~d~~ 71 (430) |+.. .+. +.+....++|+.+++..++.++++.+.- .|.++++|+-...... .|... ...+.. 
T Consensus 14 ~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~-------~~~~~~~p~~~~~~~a~~v~E~~~~--~~~~~~ 84 (320) T protein:vir:10 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPM-------GTTGQKIPHWIGDVSAQWIGEGDMK--PITKGN 84 (320) T ss_pred hhccccccccccccHHHHHHHHHHHHhccchhhhcceeec-------cCCceEEEEEeCCcceEEecCCccc--cccccc Confidence 3332 232 4577788999999999999888865321 2567788874322111 12111 112222 Q ss_pred cceEEEEeccccccceEecHHHhccH-HHHHHHH-HHHHHHHHHHHHHHHHHHHHhccc-------ceeeccCCCCCCCC Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADDLRDE-TAYRHRI-QSAARKLANNVELKVANMAAEMGS-------LVITSPDAIGTNTA 142 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~keL~~~-~~~~r~l-~pAm~~LAn~Id~dl~~~~~~~as-------~~~~~~~~~~~~~~ 142 (430) =..+.++..+ -.+-+.+|.+=|++. ...+++| +.-.++++..+|..++.---.... ......... .... T Consensus 85 f~~v~~~~~k-~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~-~~~~ 162 (320) T protein:vir:10 85 MTSQNIAPHK-IATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPG-GATA 162 (320) T ss_pred eeEEEEeeEE-EEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceecc-cccc Confidence 2233333322 133455555555542 2234444 555578888888887631110000 000000000 0011 Q ss_pred Cc---hh-HHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcc Q lcl|Aclame:pro 143 DA---WN-FVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLP 218 (430) Q Consensus 143 ~~---~~-d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~ 218 (430) .+ +. ++..+...+.....+ .-..+++|.....+. .+ ...+ ||.+ ..+... 
.+ T Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~v~n~~~~~~L~-~l---kd~~------------G~~l-~~~~~~-~~--- 218 (320) T protein:vir:10 163 SDLTAYDAVAVNGLSLLVNAKKK---WTHTLLDDIVEPILN-GA---KDKN------------GRPL-FIESTY-TD--- 218 (320) T ss_pred cccccHHHHHHHHHhhhhcccCC---CcEEEEcHHHHHHHH-Hh---hccC------------Ccee-eccccc-cC--- Confidence 11 11 233333333333333 235899999887664 22 1111 2211 000000 00 Q ss_pred eecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEE Q lcl|Aclame:pro 219 VLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVV 298 (430) Q Consensus 219 ~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt 298 (430) ......+.++.|-+- -.+..+..|+.+-+-| +...++ . T Consensus 219 --~~~~~~~~~i~g~pv--------------------------~~~~~~~~~~~~~~~g-------------d~~~~~-~ 256 (320) T protein:vir:10 219 --ENSPFRAGRIVSRPT--------------------------ILSDHVADGTTVGYMG-------------DFRNVI-W 256 (320) T ss_pred --ccccccCceeeeeee--------------------------EecCCCCCCceEEEEe-------------ecceEE-E Confidence 000000111111110 0011122344333323 222322 2 Q ss_pred EeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCC Q lcl|Aclame:pro 299 RVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHE 371 (430) Q Consensus 299 ~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~ 371 (430) +...+-++.++.......... ..+..++.---|-.++-+.---. 
=...+++||+..+--= .|+- T Consensus 257 ~~~~~~~i~~~~~~~~~~~~~-----~~~~~~~~f~~~~~~~r~~~~~d---~~v~~~~a~~~l~~~~-ap~~ 320 (320) T protein:vir:10 257 GQVGGLSFDVTDQATLNLGTP-----TEPNFVSLWQHNLVAVRVEAEYA---FHNNDKDAFVKLTNVV-TPDA 320 (320) T ss_pred EEecCeEEEEeecceeeeccc-----cccccchhhhcCcEEEEEEEeec---cEEecccceEEEEecc-CCCC Confidence 222222333332211110000 00000000001111111100000 0335566776644211 1211 No 69 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=73.84 E-value=0.17 Score=24.87 Aligned_cols=269 Identities=12% Similarity=0.042 Sum_probs=112.5 Q ss_pred Cccc----hhh-HHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc----cCCccCCccCCCC Q lcl|Aclame:pro 1 MALN----EGQ-IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ----EGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~----~~~-~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~----~g~~~s~~~~d~~ 71 (430) |++. .+. +-+.+..++|+.+++..++.++++.. +- .+.++++|+-...... .|... ..++.. T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~-~~------~~~~~~ip~~~~~~~a~~v~Eg~~~--~~~~~~ 84 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKV-PM------GTTGQKIPHWVGDVSAQWIGEGDMK--PITKGN 84 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhhhccee-ec------cCCceEEEEEeCCcceEEecCCccc--cccccc Confidence 3322 223 34677789999999999999998643 21 2456777763221111 12111 112222 Q ss_pred cceEEEEeccccccceEecHHHhccH-HHHHHHH-HHHHHHHHHHHHHHHHHHHHhcc--cc---eeeccCCCCCCCCCc Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADDLRDE-TAYRHRI-QSAARKLANNVELKVANMAAEMG--SL---VITSPDAIGTNTADA 144 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~keL~~~-~~~~r~l-~pAm~~LAn~Id~dl~~~~~~~a--s~---~~~~~~~~~~~~~~~ 144 (430) =..+.++..+- .+-+.+|.+-|++. ...+++| +...++++.++|..+++---... .. .............+. 
T Consensus 85 f~~i~~~~~k~-~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (318) T protein:vir:24 85 MTSQTIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADTTGATTV 163 (318) T ss_pred eeEEEEeeEEE-EEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccccccccccch Confidence 22344443222 23455555545542 2234444 55667899999988863110000 00 000000000111111 Q ss_pred h-hHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceeccc Q lcl|Aclame:pro 145 W-NFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKS 223 (430) Q Consensus 145 ~-~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~g 223 (430) + .++..+...+.....+ .-..+++|.....+. .+ ...+ ||.+ +... . ..+ T Consensus 164 ~~~~~~~~~~~~~~~~~~---~~~~v~n~~~~~~L~-~l---kd~~------------G~~l------~~~~-~---~~~ 214 (318) T protein:vir:24 164 YDQVAVNGLSLLVNDGKK---WTHTLLDDITEPILN-GA---KDQN------------GRPL------FIES-T---YGE 214 (318) T ss_pred HHHHHHHHHHhhccccCC---CCEEEEcHHHHHHHH-Hh---hccC------------Ccee------ecCc-c---ccC Confidence 2 2233333333332222 234799999876553 21 1111 2321 1110 0 000 Q ss_pred ccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecC Q lcl|Aclame:pro 224 TATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDG 303 (430) Q Consensus 224 t~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~ 303 (430) +.. 
++.-+ ++-|+ T Consensus 215 ~~~---------------------------------------~~~~~---~i~g~------------------------- 227 (318) T protein:vir:24 215 AAS---------------------------------------PFRSG---RIVAR------------------------- 227 (318) T ss_pred ccc---------------------------------------cccCc---eEEEE------------------------- Confidence 000 00000 11110 Q ss_pred ceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEec Q lcl|Aclame:pro 304 THVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSI 383 (430) Q Consensus 304 ~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~ 383 (430) .+.+++.+ +++..+-++|..+ -+.+..+- T Consensus 228 -pv~~~~~~----------------------~~~~~~~~~gdfs----------~~~~~~~~------------------ 256 (318) T protein:vir:24 228 -PTILSDHV----------------------VEGTTVGFMGDFS----------QLIWGQIG------------------ 256 (318) T ss_pred -eeEEeCCC----------------------CCCccEEEEeecc----------eEEEEEec------------------ Confidence 00000000 0011111112110 01111110 Q ss_pred CCCcEEEEEEeec--------------ccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 384 PDVGLNGIFATQG--------------DISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 384 ~~~Glsirv~~~y--------------d~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) |+++++.+++ ...+++..+|...-+|+++++|+-. 
+.|-+-++ T Consensus 257 ---~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~-~~i~~~~a 313 (318) T protein:vir:24 257 ---GLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAF-VALTNVVS 313 (318) T ss_pred ---CeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccce-EEEEeecc Confidence 1222111111 1345678888999999999999875 66888877 No 70 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=67.46 E-value=0.25 Score=23.87 Aligned_cols=291 Identities=13% Similarity=0.057 Sum_probs=117.0 Q ss_pred Cccc------------h-hhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccCCccC--- Q lcl|Aclame:pro 1 MALN------------E-GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLT--- 64 (430) Q Consensus 1 MAn~------------~-~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~~~s--- 64 (430) ||-. . ..+.+.+.+++|+.+++..++.++++.. + -.+..+++|+-...... .+... T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~-~------~~~~~~~~p~~~~~~~a-~~v~Eg~~ 72 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKV-P------MGPTGISIPHWTGAVSA-SWTGEAER 72 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhccee-e------ccCCceEEEEEcCCcce-eEecCCCc Confidence 3322 1 2244677889999999999998888542 1 12455777764322221 11111 Q ss_pred CccCCCCcceEEEEeccccccceEecHHHhccH-HHHHHHH-HHHHHHHHHHHHHHHHHHH---------Hhcccceeec Q lcl|Aclame:pro 65 DKATGLLELNVAVNMGEPDNDFFQLRADDLRDE-TAYRHRI-QSAARKLANNVELKVANMA---------AEMGSLVITS 133 (430) Q Consensus 65 ~~~~d~~e~sV~v~l~~~k~V~~~~t~keL~~~-~~~~r~l-~pAm~~LAn~Id~dl~~~~---------~~~as~~~~~ 133 (430) ....++.=..+.++..+- ..-+.+|.+=|++. ...+++| +.-.++++.++|..+++-- .......... 
T Consensus 73 ~~~~~~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~ 151 (330) T protein:vir:77 73 KPITKGSFGKQELEPVKI-TTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSL 151 (330) T ss_pred cccccceeeEEEEeEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCcccccccccccccee Confidence 111222222344444221 23345555445442 2234444 4555788888988776210 0000000000 Q ss_pred cCCCC-C---CCCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhh Q lcl|Aclame:pro 134 PDAIG-T---NTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFD 209 (430) Q Consensus 134 ~~~~~-~---~~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd 209 (430) ..... + .....+.++..+...+.....+. ...+++|.....+.. + ... -||.+ T Consensus 152 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~---~~~vmn~~~~~~l~~-l---kd~------------~G~~l---- 208 (330) T protein:vir:77 152 ADTNLTTASGPQGNAYLAVNNALSLLVNSGKKW---TGTLLDNVTEPILNT-A---VDG------------NGRPL---- 208 (330) T ss_pred ecccccccccccchhHHHHHHHHHhhhhcCCCc---cEEEEcHHHHHHHHH-H---hcc------------CCcee---- Confidence 00000 1 11122556666666666665553 247999998766532 1 111 13322 Q ss_pred HHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccc Q lcl|Aclame:pro 210 DVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVL 289 (430) Q Consensus 210 ~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~ 289 (430) +... ......+...+.++.|-+-.-.. ..+ + ++. 
.....-|-| T Consensus 209 --~~~~-~~~~~~~~~~~~~l~G~PV~~~~---------~~p-~---------~~~---~~~~~~~~g------------ 251 (330) T protein:vir:77 209 --FVES-TYTEQVGAIREGRILGRPTYVAD---------NVV-N---------GTV---GNRVVGVMG------------ 251 (330) T ss_pred --ecCc-cccccccccCCceecceeeEEec---------ccc-C---------CCC---CCccEEEEE------------ Confidence 1000 00000000011111111100000 000 0 000 001111111 Q ss_pred cccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCC Q lcl|Aclame:pro 290 AQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPAN 369 (430) Q Consensus 290 ~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p 369 (430) +..+|+ ..+..+-+|.++.... +... T Consensus 252 -d~s~~~-i~~~~~~~i~~~~e~~----------------------------------------------------~~~~ 277 (330) T protein:vir:77 252 -DFSQVI-WGQIGGLSFDVTDQAT----------------------------------------------------LDFG 277 (330) T ss_pred -ecceEE-EEEecCcEEEEeecce----------------------------------------------------eeec Confidence 111221 1111111222211100 0000 Q ss_pred CCchhhceeEEEecCCCcEEEEEEeec-ccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 370 HELFAGMKTTSFSIPDVGLNGIFATQG-DISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 370 ~~~~~~~~~~~~~~~~~Glsirv~~~y-d~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) .+..+.. .+..+ ...++...+|.-.-+|+++++|+-. 
+.|-+.++ T Consensus 278 -------------~~~~~~~--~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~-~~i~~~~~ 323 (330) T protein:vir:77 278 -------------EEQGGVW--VPKLISLWQHNMVAVRCEAEFAFMVNDKDAF-VKLTDQVA 323 (330) T ss_pred -------------ccccccc--cccccchhhcCcEEEEEEEEeccEEecccce-EEEEeccC Confidence 0000000 00000 1334577788888899999999874 77888888 No 71 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=64.85 E-value=0.29 Score=23.51 Aligned_cols=262 Identities=10% Similarity=0.029 Sum_probs=92.9 Q ss_pred Cc--------------------cchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccC Q lcl|Aclame:pro 1 MA--------------------LNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEG 60 (430) Q Consensus 1 MA--------------------n~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g 60 (430) |+ |+|++.+..++ +.|- |.|+.|.+ .|.+|++.+=..+ .+.| T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~--------~~LG----v~r~~pla-----~Gt~iktyK~~~~-~y~g 62 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLF--------EALA----IQNKIPMN-----VGSALKQYRFKVE-DSEK 62 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHH--------HHhh----hhcccccc-----CCceeeeeeeece-eecc Confidence 33 33333222222 2221 12333433 6889987762111 1112 Q ss_pred CccCCccCCCCcc--------------eEEEEeccccccceEecHHHhccHHH---HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 61 WDLTDKATGLLEL--------------NVAVNMGEPDNDFFQLRADDLRDETA---YRHRIQSAARKLANNVELKVANMA 123 (430) Q Consensus 61 ~~~s~~~~d~~e~--------------sV~v~l~~~k~V~~~~t~keL~~~~~---~~r~l~pAm~~LAn~Id~dl~~~~ 123 (430) .++|+.|+ ...+++.|++..- |+|-+-+..+ ..+-=++=.+.|+++|+.|++..+ T Consensus 63 -----da~dVaEGe~Iplskvt~~~~~t~~~~~kK~rK~t---TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~l 134 (303) T protein:vir:10 63 -----PNGDVAEGDVIPLTKVTREQVDITELQFAKYRKST---SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETL 134 (303) T ss_pred 
-----ccccccCCcccchhhheeeecceEEEEeecccccc---cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHH Confidence 22233332 3567777766522 6665522111 112224455679999999999777 Q ss_pred HhcccceeeccCCCCCCCCCchhHHHHHHHHHHHh-CC--CcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhcc Q lcl|Aclame:pro 124 AEMGSLVITSPDAIGTNTADAWNFVADAEELMFSR-EL--NRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGT 200 (430) Q Consensus 124 ~~~as~~~~~~~~~~~~~~~~~~d~a~a~~~L~~~-~a--P~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~ 200 (430) +.....+.. +. .+......+..|......+ .. ..+.+-.+|+||.+.+.++++..-.....+..-+.+. T Consensus 135 ktaT~t~~~---t~--~t~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~L~--- 206 (303) T protein:vir:10 135 KSAIENGKR---TN--KTKLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQFGVNLLT--- 206 (303) T ss_pred hhccccccc---cc--ceeecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhhhhhhhh--- Confidence 653222211 11 1112233444443322211 01 1123457999999999987643221111111222333 Q ss_pred ccccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeecc-ccccccceeeEEEeeccceeecccEEEEccee Q lcl|Aclame:pro 201 IQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDG-NKVNVDNRFATVTLSATTGLKRGDKISFTGVK 279 (430) Q Consensus 201 igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~-~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~ 279 (430) +|.|+. +.++..++ .|+.-..... . ..+.+.. ++.- ++.-..++-.|| +=||. T Consensus 207 ---nfLG~~-II~S~kv~---~G~~~~T~~~---N-----i~~ay~~~~g~l--~~~f~~t~D~tg---------lIGv~ 260 (303) T protein:vir:10 207 ---PYVGVK-IVEFADVP---QGEVWMTVAE---N-----LNVAYANPRGEL--SRAFAFATDATG---------FVGVL 260 (303) T ss_pred ---hhhcce-EEEeccCC---CceEEEeecc---c-----eEEEEecCchhh--hhhhhhcccccc---------ceEEE Confidence 377776 45666655 2321110000 0 0000000 0000 000001111111 11110 Q ss_pred eccccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccc Q lcl|Aclame:pro 280 FLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLA 335 (430) Q Consensus 280 ~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA 335 (430) - ++.++..+.. 
+ ..-+.++++|.++ +..-...-+ -.-++.-|+ T Consensus 261 h-~~~~~~~t~e--T------~~~~~~~lfpE~~--dgiv~~ti~--~~e~~~~~~ 303 (303) T protein:vir:10 261 H-DIQPQRLTSD--T------IYASAISMFPENI--DAVIKVTIK--KDEAGELPS 303 (303) T ss_pred e-ccccceeeeh--h------HhHhHHHhccccc--ceEEEEEEe--ccccCCCCC Confidence 0 0000000000 0 0011133333321 000000000 000112232 No 72 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=59.67 E-value=0.39 Score=22.84 Aligned_cols=256 Identities=10% Similarity=0.030 Sum_probs=98.7 Q ss_pred Cccch-hhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccCC-ccCCccC---CCCcce Q lcl|Aclame:pro 1 MALNE-GQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGW-DLTDKAT---GLLELN 74 (430) Q Consensus 1 MAn~~-~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~-~~s~~~~---d~~e~s 74 (430) +.... +.++ +.+..++|+.++...++-++++...- .+.++.+|++......-++ ......+ +..=.. T Consensus 137 ~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 209 (400) T protein:vir:38 137 VKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQA-------STQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKP 209 (400) T ss_pred ccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEec-------cCcceEEEEEecCCCcccccccccccccccccccee Confidence 11111 1233 44567777777777777777753211 2446677776433221111 1111222 222223 Q ss_pred EEEEeccccccceEecHHHhccHHH-HHHHHHH-HHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHHH Q lcl|Aclame:pro 75 VAVNMGEPDNDFFQLRADDLRDETA-YRHRIQS-AARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAE 152 (430) Q Consensus 75 V~v~l~~~k~V~~~~t~keL~~~~~-~~r~l~p-Am~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a~ 152 (430) +.++..+-. .-+.+|.+=|++..+ .+.+|.. ..++|+..+|..++. . 
.++..+.+...++++..+- T Consensus 210 i~~~~~k~~-~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~---------~--~~~~~~~~~~~~~~~~~~~ 277 (400) T protein:vir:38 210 VNWSVETYR-QALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVAT---------L--LKGFTAKTISSVDDLKHIN 277 (400) T ss_pred eEeehhhee-eehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhh---------c--cccccccccccHHHHHHHH Confidence 444432222 233444332333222 4444433 334566666665541 1 1122233455677776664 Q ss_pred HHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccccccceecc Q lcl|Aclame:pro 153 ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSG 232 (430) Q Consensus 153 ~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~g 232 (430) ....+ | ..+-..+++|.+...+. .+ ... -||++ +.. . ... +.+.++.| T Consensus 278 ~~~~~---~-~~~a~~v~~~~~~~~l~-~l---kd~------------~G~~i------~~~-~---~~~--~~~~~l~G 325 (400) T protein:vir:38 278 NVDLD---P-AYSRVIIASQSFYNFLD-TV---KDG------------NGRYL------LQD-S---ILT--PSGKSVLG 325 (400) T ss_pred Hhhhh---h-hhCcEEEEcHHHHHHHH-Hh---hcc------------CCCee------eec-C---cCC--CCcccccc Confidence 43322 2 23455889998876543 22 111 13322 111 0 011 11122333 Q ss_pred cceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEEeecc Q lcl|Aclame:pro 233 AQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKP 312 (430) Q Consensus 233 a~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~ 312 (430) -+-.. .+. .....+||.+-|-| +..+|+...+-.+-++..+... 
T Consensus 326 ~pv~~-----~~~------------------~~~~~~g~~~~~~g-------------d~s~~~~~~~~~~~~~~~~~~~ 369 (400) T protein:vir:38 326 MPIAV-----VSD------------------DTLGAAGEAHAFLG-------------DIKRAILFANRADFMVRWVDDQ 369 (400) T ss_pred ceeEE-----ecc------------------cccCCCCceEEEEE-------------eccccEEEEeecceEEEEeccc Confidence 22100 000 00011355544444 3344433333223333332211 Q ss_pred ccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccC Q lcl|Aclame:pro 313 VALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPA 368 (430) Q Consensus 313 v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~ 368 (430) . . ....+.|..+-- -..|++||+........ T Consensus 370 ~-----~-~~~~~~~~r~d~-------------------~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 370 I-----Y-GQFLQAGMRFGV-------------------SVADEKAGYFLTYTPKA 400 (400) T ss_pred c-----c-ceeEEEEEEecc-------------------EEecccceEEEEeecCC Confidence 0 0 000111111111 23355555555544322 No 73 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=59.13 E-value=0.4 Score=22.77 Aligned_cols=257 Identities=11% Similarity=0.054 Sum_probs=90.9 Q ss_pred Cc-------------cchhhHHHH-H---HHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccCCcc Q lcl|Aclame:pro 1 MA-------------LNEGQIVTL-A---VDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDL 63 (430) Q Consensus 1 MA-------------n~~~~~~~~-~---~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~~~ 63 (430) |- ..|.....| + |.+-|..|.+.|-+ .++.|.+ .|+||+..+...+. 
T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv----~r~~pla-----~GstIkt~k~~~y~------- 64 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGV----TRKISVS-----EGMTLKTYAGYDVT------- 64 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhh----ccccccc-----CCCEEeeccceeee------- Confidence 22 222221111 0 11111112222211 2222322 39999765422221 Q ss_pred CCccCCCCcc--------------eEEEEeccccccceEecHHHhccHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 64 TDKATGLLEL--------------NVAVNMGEPDNDFFQLRADDLRDETA---YRHRIQSAARKLANNVELKVANMAAEM 126 (430) Q Consensus 64 s~~~~d~~e~--------------sV~v~l~~~k~V~~~~t~keL~~~~~---~~r~l~pAm~~LAn~Id~dl~~~~~~~ 126 (430) ..++|+.|+ ...+++.|.+.. .|+|-+.+..+ ..+-=++=.+.|+++|+.|++..++.. T Consensus 65 -gda~dVaEGe~Iplskvt~~~~~t~t~~ikK~rK~---tTdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~Lkta 140 (296) T protein:vir:98 65 -LAEGNVPEGEVIPLSKVERKIHSEKKIELKKYRKA---TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG 140 (296) T ss_pred -eccccccCCcccchhhheeeecceEEEEeeccccc---cCHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcc Confidence 222333333 355667666554 26665532211 112224455679999999998766542 Q ss_pred ccceeeccCCCCCCCCCchhH-----HHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccc Q lcl|Aclame:pro 127 GSLVITSPDAIGTNTADAWNF-----VADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTI 201 (430) Q Consensus 127 as~~~~~~~~~~~~~~~~~~d-----~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~i 201 (430) . +-... ....+.. +.++...+++.+ +....+++||.+.+.++++.. + .. 
+.++ -+.+ T Consensus 141 T-~t~~~-------t~~~lQ~Ala~~~~~l~~~feded---~~~~V~FVnP~D~a~ylg~a~-i-t~----qt~f-G~ty 202 (296) T protein:vir:98 141 T-GTQDA-------LGAGLQGALASAWGKLQVLFEDYG---SERAIVFANSLDVAEYIAKAG-I-TT----QTAF-GLTY 202 (296) T ss_pred c-ceeee-------chhhHHHHHHHHhhhhhhhccccC---CCceEEEEehHHHHHHhcCCc-c-ch----hhee-chhh Confidence 2 11110 0112221 223334555553 235779999999998876542 1 11 1111 1222 Q ss_pred cccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeec Q lcl|Aclame:pro 202 QRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFL 281 (430) Q Consensus 202 gr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v 281 (430) ..+|.|- .++++..++ .|..-......= .+.+... -|.. + +..+.--+|--=+=||. T Consensus 203 l~nfLG~-~II~S~kV~---~G~~~~T~~~Ni--------~~ay~~~----~~~~----l-~~~f~~~~d~tglIGv~-- 259 (296) T protein:vir:98 203 LVDFTGT-VIISTNDVT---KGEIWATVPENI--------IFAYINP----NNSE----L-AKEFNLYGDPTGYIGMN-- 259 (296) T ss_pred hhhcccc-EEEEcCcCC---CceEEEeeecce--------EEEeecc----cccc----h-hhhhccccccccceEEE-- Confidence 2235553 355666665 333211111000 0111000 0000 0 00000011211122221 Q ss_pred cccccccccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCcee Q lcl|Aclame:pro 282 GQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAV 340 (430) Q Consensus 282 ~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aav 340 (430) |... 
.+ .-++ +...-+.++++|.++ +..- -...+++ | T Consensus 260 h~~~----~~--~~t~-eT~~~~~~~lfpE~~--dgiv-------~~tI~~~------~ 296 (296) T protein:vir:98 260 HFQE----NT--TLTI-QTLLVSGMLMYPERI--DGIV-------KVTLTPG------V 296 (296) T ss_pred eccc----cc--eeee-hhHhHhHHHhccccc--ceEE-------EEEecCC------C Confidence 1000 00 0000 000112244555432 0000 0001111 1 No 74 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=55.64 E-value=0.47 Score=22.36 Aligned_cols=266 Identities=14% Similarity=0.027 Sum_probs=101.1 Q ss_pred CccchhhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccCCccC---CccCCCCcceEE Q lcl|Aclame:pro 1 MALNEGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLT---DKATGLLELNVA 76 (430) Q Consensus 1 MAn~~~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~~~s---~~~~d~~e~sV~ 76 (430) ||.+-+.++ +.+..++|+.++...++.++++...- .+..+++|+-...... .+... ....++.=.++. T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~-------~~~~~~ip~~~~~~~a-~~v~E~~~~~~~~~~f~~v~ 72 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPI-------PFNGEKVFTFTMDSEI-DVVAESGKKTHGGVTLAPQT 72 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeec-------cCCceEEEEEecCcce-EEecCCccccccccceeEEE Confidence 999988755 67889999999999999888864321 2345677763222211 11111 111222222333 Q ss_pred EEeccccccceEecHHHhc--cH--HHHHH-HHHHHHHHHHHHHHHHHHHHHHhccc---ceee---c------cCCCCC Q lcl|Aclame:pro 77 VNMGEPDNDFFQLRADDLR--DE--TAYRH-RIQSAARKLANNVELKVANMAAEMGS---LVIT---S------PDAIGT 139 (430) Q Consensus 77 v~l~~~k~V~~~~t~keL~--~~--~~~~r-~l~pAm~~LAn~Id~dl~~~~~~~as---~~~~---~------~~~~~~ 139 (430) ++.-+-. .-+.+|.+=|+ .+ ...++ +.+...++++..+|..+++-.-...+ ...+ . ...... 
T Consensus 73 l~~~k~a-~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 151 (298) T protein:vir:16 73 MVPIKVE-YGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPR 151 (298) T ss_pred EeeeeEE-EeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccccccccccccc Confidence 4332222 23455544453 22 12233 34556678888888887642100000 0000 0 000111 Q ss_pred CCCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch--hhhhhhhccccccchhhhHHHhCCCc Q lcl|Aclame:pro 140 NTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI--PEEAYRDGTIQRQVAGFDDVLRSPKL 217 (430) Q Consensus 140 ~~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~--~~~a~r~g~igr~~~Gfd~~~~~~~~ 217 (430) .....+.|+..+...+.....+. -..++||.+...+.. + ...++. -......|.-++ +.|+- +.-++.+ T Consensus 152 ~~~~~~~~i~~~~~~~~~~~~~~---~~~vmn~~~~~~l~~-l---kd~~G~~i~~~~~~~~~~~~-l~G~P-V~~~~~v 222 (298) T protein:vir:16 152 GIADPNGAIENAVELLTGVDADV---TGIAINPSFRSALAK-Q---KDLQDNALFPELKWGATPDT-INGLP-VDVNKTV 222 (298) T ss_pred ccccHHHHHHHHHHHhhhcCCCc---cEEEEcHHHHHHHHH-h---hccCCCeeecCcccCCCCce-eccee-eEEeccc Confidence 11223556777776676666553 238899998876642 2 111100 000011111111 22221 1111111 Q ss_pred ceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEE Q lcl|Aclame:pro 218 PVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSV 297 (430) Q Consensus 218 ~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvV 297 (430) +.. . . +....+-.||. .+++. 
T Consensus 223 ~~~--------------------------~-~------------~~~~~~~~GDf--------------------s~~~~ 243 (298) T protein:vir:16 223 SDM--------------------------S-L------------TQRDRAIIGDF--------------------ANGFK 243 (298) T ss_pred ccc--------------------------c-C------------CCccEEEEeec--------------------cceEE Confidence 100 0 0 00001112221 11111 Q ss_pred EEeecCceeEEeeccccccccccc---------cccccccccccccccCceeEEecCCCceeeeeecccceeEEEecc Q lcl|Aclame:pro 298 VRVVDGTHVEITPKPVALDDVSLS---------PEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPI 366 (430) Q Consensus 298 t~~~~~~~v~I~p~~v~~~~~~~~---------~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl 366 (430) .....+-++++.+.. +.+.. ...++...+...+ .+.+||+.....= T Consensus 244 ~~~~~~~~~~~~~~~----~~~~~~~~~f~~~~v~~ra~~r~d~~v-------------------~~~~a~~~l~~at 298 (298) T protein:vir:16 244 WGYAKEVPLEVIQYG----DPDNSGLDLKGYNQVYIRAELFLGWGI-------------------LDATKFARVTEAN 298 (298) T ss_pred EEEecCceEEEeecc----CCcCcchhhhhcCcEEEEEEEEEccEe-------------------ecccceEEEeecC Confidence 111112223332211 00000 0001111111111 2222322221110 No 75 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=54.76 E-value=0.5 Score=22.25 Aligned_cols=295 Identities=14% Similarity=0.062 Sum_probs=117.2 Q ss_pred Cc--cchhhHHHHHHHHHHHHHHhhcccchhhccc-------CChHH--HHhhcCCEEEEecCcccccccCCccCCcc-- Q lcl|Aclame:pro 1 MA--LNEGQIVTLAVDEIIETISAITPMAQKAKKY-------TPPAA--SMQRSSNTIWMPVEQESPTQEGWDLTDKA-- 67 (430) Q Consensus 1 MA--n~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y-------~~~~~--~~~k~GdTV~ip~P~~~~~~~g~~~s~~~-- 67 (430) |+ |+.+++.+++..|+...+-.+..+-+ +.+ ++.+- ....-|++|+||.=.. -+|-.. +. 
T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~--~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~---L~g~~~--n~~~ 73 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPEL--TAFFLSGAVASNDFLSQFLSAPGRLINIPFWRD---LDSLEP--NYGS 73 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhh--hhhhhcceeecCHHHHHHhhcCCCEEEeeeecc---CCCCcc--ccCC Confidence 88 55677788888888877655554321 112 22222 2346699999997222 222111 10 Q ss_pred -CCCCcceEEEEecccccc------ceEecHHHhccHHHHHHHHHHHHHHHHHHHH---H-HHHHHHHh----------- Q lcl|Aclame:pro 68 -TGLLELNVAVNMGEPDND------FFQLRADDLRDETAYRHRIQSAARKLANNVE---L-KVANMAAE----------- 125 (430) Q Consensus 68 -~d~~e~sV~v~l~~~k~V------~~~~t~keL~~~~~~~r~l~pAm~~LAn~Id---~-dl~~~~~~----------- 125 (430) +|..+ -.|-+++..+.+ --.|+..+|.-.......++....+++...+ + .|+..+.- T Consensus 74 d~~~~~-~t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~ 152 (367) T protein:vir:80 74 DNPNVE-APIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFA 152 (367) T ss_pred CCCccc-ccccccccchheeeeehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchh Confidence 11111 111233333221 1345666654322222222222222332222 2 22222211 Q ss_pred ---------------cccceeeccCCCCCC-CCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhcc Q lcl|Aclame:pro 126 ---------------MGSLVITSPDAIGTN-TADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFG 189 (430) Q Consensus 126 ---------------~as~~~~~~~~~~~~-~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~ 189 (430) .....+.++...+.. ..-+.+.|.+|++.|-|++-- --.++++|.-+.+|... .-+.+.. 
T Consensus 153 ~~~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~~---l~~i~mHS~V~~~L~~~-~li~~i~ 228 (367) T protein:vir:80 153 TIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGS---IAAIAVHSMVYKRMTNN-DEIEFIP 228 (367) T ss_pred hhhhhhccccccccccCceeeeeeccCCCccceecHHHHHHHHHHhcccccc---ccEEEEchHHHHHHHhc-ccccccc Confidence 134455555444322 223466799999999987654 35589999999887543 2232222 Q ss_pred chhhhhhhhccccccchhhhHHHhCCCcceecccccccce--e--cccceeeeeEEEEe----eccccccccceeeEEEe Q lcl|Aclame:pro 190 RIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGIT--V--SGAQSFKPVAWQLD----NDGNKVNVDNRFATVTL 261 (430) Q Consensus 190 ~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~t--V--~ga~~~~~~~~t~~----~~~~~~~~d~~~~~~~~ 261 (430) . ++ .+..|++ +.|.. +..++.+|....|..+-+| + .||-.++....... -+.-+.+..|... T Consensus 229 ~--sd--~~~~i~t-y~G~~-VIvDD~~Pv~~~~a~~~yttYlfg~GAi~~~~~~~~~~~E~~Rd~~~~~~gG~d~---- 298 (367) T protein:vir:80 229 D--SK--GQLTIPT-YMGKV-VIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEY---- 298 (367) T ss_pred C--CC--Cccccce-eccee-EEEeCCCcccccCCCceEEEEEEecceeeecccCCccceecccchhhhcCCceEE---- Confidence 1 11 1456786 77875 5578889877655433333 2 33322111110000 0000000000000 Q ss_pred eccceeeccc--EEEEcceeeccccccccccccceEEEEEeecCc---------eeEEeecccccccccccccccccc-- Q lcl|Aclame:pro 262 SATTGLKRGD--KISFTGVKFLGQMAKNVLAQDATFSVVRVVDGT---------HVEITPKPVALDDVSLSPEQRAYA-- 328 (430) Q Consensus 262 s~tgtlk~GD--v~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~---------~v~I~p~~v~~~~~~~~~~~~~~~-- 328 (430) |--=. ++--.|+.| ....+.+++ +.+.+|.. 
..-..++.+.+.|- T Consensus 299 -----L~~Rr~~~~hP~G~s~---------------~~~~v~~~~~~~~~~~~~~~~~sPt~--~eLa~~~NW~~v~d~K 356 (367) T protein:vir:80 299 -----ILERKEWIVHPGGFNW---------------LDADVTIPDNTGSPSGITSGPPAITL--ANLANPDNWERVTYRK 356 (367) T ss_pred -----EEeeeeEEeecceeee---------------cccccccccccccccccccccCCCCh--HHhcCCcccccccchh Confidence 00001 111223222 211111000 00011111 01111112222210 Q ss_pred ccccccccCceeEEec Q lcl|Aclame:pro 329 NVNTSLADAMAVNILN 344 (430) Q Consensus 329 nVsa~pA~~aavTv~~ 344 (430) ++. =+.+..-| T Consensus 357 ~I~-----iv~~it~g 367 (367) T protein:vir:80 357 NVP-----MAFLVTKG 367 (367) T ss_pred hcc-----eEEEEecC Confidence 000 01233333 No 76 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=52.10 E-value=0.56 Score=21.95 Aligned_cols=274 Identities=13% Similarity=0.053 Sum_probs=101.7 Q ss_pred CccchhhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccc----ccCCccCCccCCCCcceE Q lcl|Aclame:pro 1 MALNEGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT----QEGWDLTDKATGLLELNV 75 (430) Q Consensus 1 MAn~~~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~----~~g~~~s~~~~d~~e~sV 75 (430) ||.+-+.++ +.+..++++.++...++.++++...- .+..+++|+-..... ..|... 
...+..=.++ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~-------~~~~~~~p~~~~~~~a~~v~Eg~~~--~~~~~~f~~v 71 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPI-------PFNGEKVFTFTMDSEIDVVAESGKK--THGGVTLAPQ 71 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeec-------cCCceEEEEEecCcceEEeeCCccc--cccccceeEE Confidence 999988866 66778999999999999888854321 224566766321111 122111 1122222234 Q ss_pred EEEeccccccceEecHHHhc--cH--HHHHHHH-HHHHHHHHHHHHHHHHHHHHhccc---ceeec---cC------CCC Q lcl|Aclame:pro 76 AVNMGEPDNDFFQLRADDLR--DE--TAYRHRI-QSAARKLANNVELKVANMAAEMGS---LVITS---PD------AIG 138 (430) Q Consensus 76 ~v~l~~~k~V~~~~t~keL~--~~--~~~~r~l-~pAm~~LAn~Id~dl~~~~~~~as---~~~~~---~~------~~~ 138 (430) .++..+-. .-+.+|.+=|+ .+ ...+++| +.-.++++.++|..+++-.-...+ ...+. .. ..+ T Consensus 72 ~l~~~k~~-~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) T protein:vir:94 72 TMVPIKVE-YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAP 150 (298) T ss_pred EEeeeEEE-EeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccc Confidence 44332222 23444444342 11 1223333 445567888888777632100000 00000 00 011 Q ss_pred CCCCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch-hhhhhhhccccccchhhhHHHhCCCc Q lcl|Aclame:pro 139 TNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKL 217 (430) Q Consensus 139 ~~~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~-~~~a~r~g~igr~~~Gfd~~~~~~~~ 217 (430) +.....+.|+..+-..+.....+. ...+++|.+...+.. +.. 
..++- -......|.-++ +.|+- +.-++.+ T Consensus 151 ~~~~~~~~~i~~~~~~~~~~~~~~---~~~vmn~~~~~~l~~-lkd--~~G~~l~~~~~~~~~~~t-l~G~P-V~~~~~v 222 (298) T protein:vir:94 151 RGIADPNGAIENAVELLTGVDADV---TGIAINPSFRSALAK-QKD--LQGNALFPELKWGATPDT-INGLP-VDVNKTV 222 (298) T ss_pred cccccHHHHHHHHHHhhhhcCCCc---cEEEEcHHHHHHHHH-hhc--cCCCeeecCcccCCCCce-eccee-eEEeccc Confidence 111223667878777777665553 348999998877643 210 11110 011112222222 44543 2233333 Q ss_pred ceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEE Q lcl|Aclame:pro 218 PVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSV 297 (430) Q Consensus 218 ~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvV 297 (430) +.- .++.....+-|-.+.. . .+-....-.++--|-.+..|. +.+.- ..+.-.|++ T Consensus 223 ~~~-~~~~~~~~~~Gdfs~~---~----------------~~~~~~~~~~~~~~~~~~d~~----~~~~f-~~~~v~~r~ 277 (298) T protein:vir:94 223 SDM-SLTQRDRAIIGDFANG---F----------------KWGYAKEVPLEVIQYGDPDNS----GLDLK-GYNQVYIRA 277 (298) T ss_pred ccc-cCCCccEEEEeeccce---E----------------EEEEecCceEEEeecCCCcCc----chhhh-hcCcEEEEE Confidence 311 0000001111100000 0 000000001111010001110 00000 001111333 Q ss_pred EEeecCceeEEeec-ccccccccccccccccccccccccc Q lcl|Aclame:pro 298 VRVVDGTHVEITPK-PVALDDVSLSPEQRAYANVNTSLAD 336 (430) Q Consensus 298 t~~~~~~~v~I~p~-~v~~~~~~~~~~~~~~~nVsa~pA~ 336 (430) ..-.+. .+ ..|. ++.+. 
.|+ T Consensus 278 ~~r~~~-~~-~~~~a~~~l~-----------------~~t 298 (298) T protein:vir:94 278 ELFLGW-GI-LDATKFARVT-----------------EAN 298 (298) T ss_pred EEEecc-Ee-ecccceEEEE-----------------ecC Confidence 222111 01 1111 11000 000 No 77 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=50.63 E-value=0.6 Score=21.78 Aligned_cols=278 Identities=11% Similarity=-0.016 Sum_probs=111.5 Q ss_pred Cccc----hhhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccc--ccCCcc-CCccCC--- Q lcl|Aclame:pro 1 MALN----EGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT--QEGWDL-TDKATG--- 69 (430) Q Consensus 1 MAn~----~~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~--~~g~~~-s~~~~d--- 69 (430) |... -+.++ +.+..++|+.++...++.++++.+.- .+....+++|..... .-.+.. ....++ T Consensus 116 ~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~ 188 (408) T protein:vir:10 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-------STSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDN 188 (408) T ss_pred hhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeec-------cCCcceEEEeeccccccceeeecCccccccccC Confidence 1110 11223 45667788888888888888754321 123444555432211 111111 111111 Q ss_pred CCcceEEEEeccccccceEecHHHhccHHH-HHHHHHH-HHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhH Q lcl|Aclame:pro 70 LLELNVAVNMGEPDNDFFQLRADDLRDETA-YRHRIQS-AARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNF 147 (430) Q Consensus 70 ~~e~sV~v~l~~~k~V~~~~t~keL~~~~~-~~r~l~p-Am~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d 147 (430) ..=..|.++..+-. .-+.+|.+=|++..+ .+.+|.. ..+.++..+|..++. 
+.....+..+...|.| T Consensus 189 ~~~~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~----------g~g~~~~~~~~~~~~~ 257 (408) T protein:vir:10 189 PQLTIIKYLIKRYA-GIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIE----------VMKAAPKKPTIAKFDD 257 (408) T ss_pred cceeeEEeeeeeEE-eeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhh----------cccccccccccccHHH Confidence 12234444443332 234455444554332 3555544 446788888877752 1112223345567888 Q ss_pred HHHHH-HHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccccc Q lcl|Aclame:pro 148 VADAE-ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTAT 226 (430) Q Consensus 148 ~a~a~-~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~ 226 (430) +..+. ..|+....+ +-..+++|.+++.+. .+ ...+ ||.+ +..+ .. .+. T Consensus 258 l~~~~~~~~~~~~~~---~a~~v~n~~~~~~l~-~l---kd~~------------G~~i------~~~~----~~--~~~ 306 (408) T protein:vir:10 258 VITMINTAVDPAIIA---TSSLLTNQSGLNKLA-LV---KTAE------------GKYL------LEPD----PT--KPN 306 (408) T ss_pred HHHHHHHhhhhhhcc---CCEEEEcHHHHHHHH-Hh---hccC------------CceE------eccC----cC--CCC Confidence 76654 333322221 234788998876553 22 1111 2322 1100 01 112 Q ss_pred cceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCcee Q lcl|Aclame:pro 227 GITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHV 306 (430) Q Consensus 227 ~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v 306 (430) +.++.|-+-.- ... ... 
+..-.|+..-|-| +...|++..+-.+-+| T Consensus 307 ~~~l~G~PV~~-----~~~--~~~--------------~~~~~~~~~i~~g-------------d~~~~~~~~~~~~~~v 352 (408) T protein:vir:10 307 SYLIKGKQVIV-----VAD--RWL--------------PNTGSTVYPLYYG-------------DMSQAITLFDRENMSL 352 (408) T ss_pred CceecceeeEE-----ecc--ccc--------------CccCCCceEEEEE-------------ehhccEEEEEecceEE Confidence 22333332100 000 000 0011233333333 3344444444334445 Q ss_pred EEeeccccccccccccccccccccccccccCceeEEecCCCcee-eeeecccceeEEEecccCCCCchhhceeEEEe Q lcl|Aclame:pro 307 EITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDART-NVFWADDAIRIVSQPIPANHELFAGMKTTSFS 382 (430) Q Consensus 307 ~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~-NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~ 382 (430) .+++..-.. ....-+.+... .+. =...|.+||.+.+-.-..|.-|....-+.+.. T Consensus 353 ~~~~~~~~~-------------------f~~~~~~~r~~--~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:10 353 LPTNIGAGA-------------------FETDTTKIRVI--DRFDVKATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) T ss_pred EEcccccch-------------------hhcCceEEEEE--EeeccEEeccccEEEEEeeccccCCCCCCCCCcccC Confidence 444332100 00001111111 011 14556777777776554443333332222111 No 78 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=49.15 E-value=0.65 Score=21.62 Aligned_cols=266 Identities=14% Similarity=0.032 Sum_probs=93.4 Q ss_pred Cccch---hh-HHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccCCc---cCCccCCCCcc Q lcl|Aclame:pro 1 MALNE---GQ-IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWD---LTDKATGLLEL 73 (430) Q Consensus 1 MAn~~---~~-~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~~---~s~~~~d~~e~ 73 (430) |.... +. +.+.+..++++.++...++-++++...- .|.++.+|+-......-.+. ......+..=. 
T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~ 177 (385) T protein:vir:19 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRT-------SSNALEYVREEVFTNNADVVAEKALKPESDITFS 177 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhcceecc-------cCcceEEEEEecCCcceeeeccCcccccccccee Confidence 22221 11 3456677788888888888887754211 24456665421111111111 11111232223 Q ss_pred eEEEEeccccccceEecHHHhccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhc---ccce---eeccCCCCCCCCCchh Q lcl|Aclame:pro 74 NVAVNMGEPDNDFFQLRADDLRDETAYRHRIQS-AARKLANNVELKVANMAAEM---GSLV---ITSPDAIGTNTADAWN 146 (430) Q Consensus 74 sV~v~l~~~k~V~~~~t~keL~~~~~~~r~l~p-Am~~LAn~Id~dl~~~~~~~---as~~---~~~~~~~~~~~~~~~~ 146 (430) .+.+++.+-. .-+.+|.+=|.+....+++|.. -.++++..+|..++.---.. .+.. ..........+...+. T Consensus 178 ~~~~~~~k~~-~~~~is~ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d 256 (385) T protein:vir:19 178 KQTANVKTIA-HWVQASRQVMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRAD 256 (385) T ss_pred EEEEeeeeEE-EeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHH Confidence 3444443322 3345554324433445555544 44678888887776311000 0000 0000011112223467 Q ss_pred HHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccc-hhhhhhhhccccccchhhhHHHhCCCcceeccccc Q lcl|Aclame:pro 147 FVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTA 225 (430) Q Consensus 147 d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~-~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~ 225 (430) ++..+...|.....+. -..+++|.+.+.+.. 
+ ...++ -.-.-...|.-++ +.|+- +..++.+ T Consensus 257 ~i~~~~~~l~~~~~~~---~~~~~~~~~~~~l~~-l---kd~~G~~l~~~~~~~~~~~-l~G~p-V~~~~~~-------- 319 (385) T protein:vir:19 257 IIAHAIYQVTESEFSA---SGIVLNPRDWHNIAL-L---KDNEGRYIFGGPQAFTSNI-MWGLP-VVPTKAQ-------- 319 (385) T ss_pred HHHHHHHhhccccCCC---CEEEEcHHHHHHHHH-h---hcCCCceeccCcccCCCce-eccee-eEEcCcC-------- Confidence 7877777776655553 248999998876542 1 11110 0000000111111 22221 1112111 Q ss_pred ccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCce Q lcl|Aclame:pro 226 TGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTH 305 (430) Q Consensus 226 ~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~ 305 (430) .+|+++- | +..++.+..+-.+-+ T Consensus 320 ------------------------------------------p~~~~~~--g-------------d~~~~~~~~~~~~~~ 342 (385) T protein:vir:19 320 ------------------------------------------AAGTFTV--G-------------GFDMASQVWDRMDAT 342 (385) T ss_pred ------------------------------------------CCCcEEE--e-------------ecccEEEEEEecceE Confidence 1222111 1 011111111111111 Q ss_pred eEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccC Q lcl|Aclame:pro 306 VEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPA 368 (430) Q Consensus 306 v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~ 368 (430) |.++...-.. -.......+.+..+.-. ..+++||+.++-.-.. 
T Consensus 343 v~~~~~~~~~-~~~~~~~~~~~~r~~~~-------------------v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 343 VEVSREDRDN-FVKNMLTILCEERLALA-------------------HYRPTAIIKGTFSSGS 385 (385) T ss_pred EEEeccccch-hhcCcEEEEEEEeeccE-------------------EecccceEEEEeccCC Confidence 1111100000 00000000000111111 1222233222222111 No 79 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=49.15 E-value=0.65 Score=21.62 Aligned_cols=266 Identities=14% Similarity=0.032 Sum_probs=93.4 Q ss_pred Cccch---hh-HHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccCCc---cCCccCCCCcc Q lcl|Aclame:pro 1 MALNE---GQ-IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWD---LTDKATGLLEL 73 (430) Q Consensus 1 MAn~~---~~-~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~~---~s~~~~d~~e~ 73 (430) |.... +. +.+.+..++++.++...++-++++...- .|.++.+|+-......-.+. ......+..=. T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~ 177 (385) T protein:vir:18 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRT-------SSNALEYVREEVFTNNADVVAEKALKPESDITFS 177 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhcceecc-------cCcceEEEEEecCCcceeeeccCcccccccccee Confidence 22221 11 3456677788888888888887754211 24456665421111111111 11111232223 Q ss_pred eEEEEeccccccceEecHHHhccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhc---ccce---eeccCCCCCCCCCchh Q lcl|Aclame:pro 74 NVAVNMGEPDNDFFQLRADDLRDETAYRHRIQS-AARKLANNVELKVANMAAEM---GSLV---ITSPDAIGTNTADAWN 146 (430) Q Consensus 74 sV~v~l~~~k~V~~~~t~keL~~~~~~~r~l~p-Am~~LAn~Id~dl~~~~~~~---as~~---~~~~~~~~~~~~~~~~ 146 (430) .+.+++.+-. .-+.+|.+=|.+....+++|.. -.++++..+|..++.---.. .+.. ..........+...+. 
T Consensus 178 ~~~~~~~k~~-~~~~is~ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d 256 (385) T protein:vir:18 178 KQTANVKTIA-HWVQASRQVMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRAD 256 (385) T ss_pred EEEEeeeeEE-EeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHH Confidence 3444443322 3345554324433445555544 44678888887776311000 0000 0000011112223467 Q ss_pred HHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccc-hhhhhhhhccccccchhhhHHHhCCCcceeccccc Q lcl|Aclame:pro 147 FVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTA 225 (430) Q Consensus 147 d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~-~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~ 225 (430) ++..+...|.....+. -..+++|.+.+.+.. + ...++ -.-.-...|.-++ +.|+- +..++.+ T Consensus 257 ~i~~~~~~l~~~~~~~---~~~~~~~~~~~~l~~-l---kd~~G~~l~~~~~~~~~~~-l~G~p-V~~~~~~-------- 319 (385) T protein:vir:18 257 IIAHAIYQVTESEFSA---SGIVLNPRDWHNIAL-L---KDNEGRYIFGGPQAFTSNI-MWGLP-VVPTKAQ-------- 319 (385) T ss_pred HHHHHHHhhccccCCC---CEEEEcHHHHHHHHH-h---hcCCCceeccCcccCCCce-eccee-eEEcCcC-------- Confidence 7877777776655553 248999998876542 1 11110 0000000111111 22221 1112111 Q ss_pred ccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCce Q lcl|Aclame:pro 226 TGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTH 305 (430) Q Consensus 226 ~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~ 305 (430) .+|+++- | +..++.+..+-.+-+ T Consensus 320 ------------------------------------------p~~~~~~--g-------------d~~~~~~~~~~~~~~ 342 (385) T protein:vir:18 320 ------------------------------------------AAGTFTV--G-------------GFDMASQVWDRMDAT 342 (385) T ss_pred ------------------------------------------CCCcEEE--e-------------ecccEEEEEEecceE Confidence 1222111 1 011111111111111 Q ss_pred eEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccC Q lcl|Aclame:pro 306 
VEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPA 368 (430) Q Consensus 306 v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~ 368 (430) |.++...-.. -.......+.+..+.-. ..+++||+.++-.-.. T Consensus 343 v~~~~~~~~~-~~~~~~~~~~~~r~~~~-------------------v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 343 VEVSREDRDN-FVKNMLTILCEERLALA-------------------HYRPTAIIKGTFSSGS 385 (385) T ss_pred EEEeccccch-hhcCcEEEEEEEeeccE-------------------EecccceEEEEeccCC Confidence 1111100000 00000000000111111 1222233222222111 No 80 >protein:vir:3426 Length: 117 # NCBI annotation: head-tail joining protein # Family: family:all:1908 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040589;genbank:gi:9626253;genbank:GeneID:2703484 Probab=48.88 E-value=0.49 Score=22.29 Aligned_cols=112 Identities=20% Similarity=0.201 Sum_probs=40.3 Q ss_pred HHhhhhhhh-ccchhhhhhhhccccccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceee Q lcl|Aclame:pro 179 GYDLTKRDI-FGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFA 257 (430) Q Consensus 179 ~~~~~~l~~-~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~ 257 (430) ..++-.||- +-....++++ +.| |. ...++.+.+.+.+|+|--.. + ......+.+...++..- T Consensus 1 m~~~dNlfd~a~~~aD~~i~-----~~f-g~--------~a~i~~~~g~~~~i~gVFDd-P--~~~~~~~gG~~i~~s~P 63 (117) T protein:vir:34 1 MADFDNLFDAAIARADETIR-----GYM-GT--------SATITSGEQSGAVIRGVFDD-P--ENISYAGQGVRVEGSSP 63 (117) T ss_pred CCcccchhHHHHhhcchhhH-----hhc-Ce--------eEEEEeCCCcceEEEEEecC-c--cchhhccCCEEeecCCc Confidence 001111100 0000111111 111 11 12334444455555553221 1 11222222233333334 Q ss_pred EEEeecc--ceeecccEEEEcceeeccccccccccccceEEEEEee-c-CceeEEeeccccccccccccccccccccccc Q lcl|Aclame:pro 258 TVTLSAT--TGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVV-D-GTHVEITPKPVALDDVSLSPEQRAYANVNTS 333 (430) Q Consensus 258 ~~~~s~t--gtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~-~-~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~ 333 (430) .+.+..+ ..||++|.+||+| ++|.|+... + .|.-.|..+- . .. 
T Consensus 64 ~L~vk~aDv~~l~r~D~v~I~G---------------~~y~V~~~~PD~~G~~~l~L~r----------g--------~p 110 (117) T protein:vir:34 64 SLFVRTDEVRQLRRGDTLTIGE---------------ENFWVDRVSPDDGGSCHLWLGR----------G--------VP 110 (117) T ss_pred EEEeeechhhccCCCCEEEECC---------------CeeEeeecccCCCceEEEEeec----------C--------CC Confidence 4444222 4699999999999 677777532 1 2332232221 0 01 Q ss_pred cccCcee Q lcl|Aclame:pro 334 LADAMAV 340 (430) Q Consensus 334 pA~~aav 340 (430) |+++-.= T Consensus 111 p~~~~~~ 117 (117) T protein:vir:34 111 PAVNRRR 117 (117) T ss_pred CccccCC Confidence 1111111 No 81 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=48.37 E-value=0.67 Score=21.53 Aligned_cols=280 Identities=13% Similarity=0.052 Sum_probs=110.7 Q ss_pred Cccc----hh-hHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc----cCCccCCccCCCC Q lcl|Aclame:pro 1 MALN----EG-QIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ----EGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~----~~-~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~----~g~~~s~~~~d~~ 71 (430) |+.. .+ -+.+.+..++|+.+++..++.++++.. +- .+.++++|+-...... +|. .....+.. T Consensus 10 ~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~-~~------~~~~~~ip~~~~~~~a~wv~Eg~--~~~~s~~~ 80 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKI-PM------GATGIVIPHWTGDVSAQWIGEGD--MKPITKGN 80 (397) T ss_pred HhhccCCCCccccchhHHHHHHHHHHhccchhhhccee-ec------cCCceEEEEEcCCcceEEecCCc--cccccccc Confidence 3332 12 245777889999999999988887542 21 2456777763222211 121 11122332 Q ss_pred cceEEEEeccccccceEecHHHhccHH-HHHHHH-HHHHHHHHHHHHHHHHHHHHhc---ccceeeccCCCCCCCCCchh Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADDLRDET-AYRHRI-QSAARKLANNVELKVANMAAEM---GSLVITSPDAIGTNTADAWN 146 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~keL~~~~-~~~r~l-~pAm~~LAn~Id~dl~~~~~~~---as~~~~~~~~~~~~~~~~~~ 146 (430) =.++.+++.+ -..-+.+|.+=|++.. 
..+.+| +.-.++++.++|..++.---.. ..............+..... T Consensus 81 f~~v~l~~~k-~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~~~~~~ 159 (397) T protein:vir:23 81 MTKRDVHPAK-IATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISPNAYQG 159 (397) T ss_pred eeEEEEeeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeecccchhH Confidence 3345554422 2333555554455432 234555 4555789999998886311000 00000000011111222344 Q ss_pred HHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccc-- Q lcl|Aclame:pro 147 FVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKST-- 224 (430) Q Consensus 147 d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt-- 224 (430) ++..+...|.....+. -..+++|.....+.. +... -||.+ +... ...+. T Consensus 160 ~~~~~~~~l~~~~~~~---a~~vmn~~~~~~L~~----lkd~------------~G~~i------~~~~----~~~~~~~ 210 (397) T protein:vir:23 160 LGVSGLTKLVTDGKKW---THTLLDDTVEPVLNG----SVDA------------NGRPL------FVES----TYESLTT 210 (397) T ss_pred HHHHHHHhhhhcccCC---CEEEEcHHHHHHHHH----hhcc------------CCcee------eccc----ccccccc Confidence 5555555555555443 347899887765532 1111 13322 1110 00000 Q ss_pred -cccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecC Q lcl|Aclame:pro 225 -ATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDG 303 (430) Q Consensus 225 -~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~ 303 (430) ....++.|-+- -.+..+.+|++.-|-| +...++ .....+ T Consensus 211 ~~~~~tl~G~Pv--------------------------~~s~~~~~g~~~~~~g-------------Dfs~~~-i~~~~~ 250 (397) T protein:vir:23 211 PFREGRILGRPT--------------------------ILSDHVAEGDVVGYAG-------------DFSQII-WGQVGG 250 (397) T ss_pred cccCceeeeeeE--------------------------EEeCCCCCCceEEEEe-------------ecceEE-EEEEec Confidence 00001111100 0000112233322222 001111 011000 Q ss_pred ceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEec Q 
lcl|Aclame:pro 304 THVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSI 383 (430) Q Consensus 304 ~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~ 383 (430) -.+.++ |++- +... .+ T Consensus 251 i~i~~~----------------------------------------------~e~~------~~~~------------~~ 266 (397) T protein:vir:23 251 LSFDVT----------------------------------------------DQAT------LNLG------------SQ 266 (397) T ss_pred eEEEEe----------------------------------------------eeee------eeec------------cc Confidence 001110 0000 0000 00 Q ss_pred CCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 384 PDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 384 ~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) + .+-.+- . ..+++..+|.-.-+++++++||.. +.|..... T Consensus 267 ~-~~~~~~-l----f~~d~v~~ra~~r~d~~v~~~~a~-~~~~~~~~ 306 (397) T protein:vir:23 267 E-SPNFVS-L----WQHNLVAVRVEAEYGLLINDVNAF-VKLTFDPV 306 (397) T ss_pred c-ccceee-e----eeccceeEEEEeeeccceecccce-EEEeeccc Confidence 0 000000 1 122366677777777888888875 44444333 No 82 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=42.61 E-value=0.88 Score=20.89 Aligned_cols=291 Identities=12% Similarity=0.025 Sum_probs=113.7 Q ss_pred Cc---------cchhhHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc--cC-----Cc-- Q lcl|Aclame:pro 1 MA---------LNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--EG-----WD-- 62 (430) Q Consensus 1 MA---------n~~~~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~--~g-----~~-- 62 (430) |. ....-+-+.+.+++++.++...++.+++++. + -.|..+++|+-...+.. .+ +. 
T Consensus 12 ~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~-~------~~~~~~~ip~~~~~~~a~~v~~~~~~~~~E 84 (338) T protein:vir:78 12 AGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENI-P------ISYGETIIPTTVKRPEVGQVGVGTSNEQRE 84 (338) T ss_pred cccccccceecccccccchHHHHHHHHHHHhhchhhhhccee-e------ccCCceEEEEEecCccceeecccccccccc Confidence 11 1111233667899999999999999998642 1 13567777773221110 00 00 Q ss_pred -cCCccCCCCcceEEEEeccccccceEecHHHhccH-HHHHHHH-HHHHHHHHHHHHHHHHHHHHhcccc-eeecc---- Q lcl|Aclame:pro 63 -LTDKATGLLELNVAVNMGEPDNDFFQLRADDLRDE-TAYRHRI-QSAARKLANNVELKVANMAAEMGSL-VITSP---- 134 (430) Q Consensus 63 -~s~~~~d~~e~sV~v~l~~~k~V~~~~t~keL~~~-~~~~r~l-~pAm~~LAn~Id~dl~~~~~~~as~-~~~~~---- 134 (430) ......+..=.++.++.-+- ..-+.+|.+=|++. ...+++| +.-.++++..+|..+++---..... ..++. T Consensus 85 g~~~~~~~~~f~~v~l~~~k~-~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~~~ 163 (338) T protein:vir:78 85 GGTKPLSGTAWDTRSVAPIKL-ATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTNNV 163 (338) T ss_pred cccccccccceeEEEEEEEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccc Confidence 00011222222344443222 23445555444443 2334554 4455678888888876311000000 00000 Q ss_pred -------CCCCCCCCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchh Q lcl|Aclame:pro 135 -------DAIGTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAG 207 (430) Q Consensus 135 -------~~~~~~~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~G 207 (430) ....+.....+.++..+.+.+..+.- . .....+++|...+.|. 
.+..+...+ ||.+ T Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~m~~~~~~~L~-~~~~l~d~~------------g~~l-- 226 (338) T protein:vir:78 164 IVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTD-V-DFNGWAADPRYRARLL-RSQAYRDAN------------GNVD-- 226 (338) T ss_pred cccccccccccccchhhHHHHHHHHHHhhhhcc-c-cceEEEEchHHHHHHH-HHhhhccCC------------Ccee-- Confidence 00011111224556666555543332 2 2234788988877663 332222211 2222 Q ss_pred hhHHHhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccc Q lcl|Aclame:pro 208 FDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKN 287 (430) Q Consensus 208 fd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~ 287 (430) +... .. . |-.-+|-|+- T Consensus 227 ----~~~~----~~--~---------------------------------------------~~~~~l~G~P-------- 243 (338) T protein:vir:78 227 ----PTRI----NL--A---------------------------------------------ASAGDLLGLP-------- 243 (338) T ss_pred ----eccc----cc--C---------------------------------------------CCCceeeeee-------- Confidence 1000 00 0 1111222310 Q ss_pred cccccceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEeccc Q lcl|Aclame:pro 288 VLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIP 367 (430) Q Consensus 288 ~~~~l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~ 367 (430) .+++.. + |... .++.+...+-++|.- ...+...+..+.+..-+=. 
T Consensus 244 -------V~~~~~-----i-------p~~~--------------~~~~~~~~~~~~gdf--s~~~~~~~~~~~i~~~~~~ 288 (338) T protein:vir:78 244 -------VQFGKA-----V-------GGDL--------------GAATDSKVRVVGGDF--SQLKYGFADEIRVKMSDTA 288 (338) T ss_pred -------EEEccc-----c-------Cccc--------------cccCCcccEEEEEec--ceEEEEeecccEEEEeecc Confidence 000000 0 0000 000000011111211 0111111111111110000 Q ss_pred CCCCchhhceeEEEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 368 ANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 368 ~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) .. ....+|. -+.....+ +++..+|.-.-+|+++++|+.. +.|-+=++ T Consensus 289 ~~---------~~~~~~~--~~~~~~~~----~~~~~~r~~~r~d~~v~~~~a~-~~l~~~~~ 335 (338) T protein:vir:78 289 TL---------TDNTSPT--PQTVSMWQ----TNQIAILIEVTFGWLLGDKQAF-VKFVDDED 335 (338) T ss_pred cc---------ccccccc--ccchhhhh----cCcEEEEEEEEeccEeecccce-EEEecccC Confidence 00 0000000 01111122 3478889999999999999986 55555555 No 83 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=40.32 E-value=0.98 Score=20.64 Aligned_cols=271 Identities=10% Similarity=0.052 Sum_probs=90.2 Q ss_pred Cccch-----hhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccCCccCCc-----cCC Q lcl|Aclame:pro 1 MALNE-----GQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDK-----ATG 69 (430) Q Consensus 1 MAn~~-----~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~~~s~~-----~~d 69 (430) +|.+. +.++ +.+..+|++.++...++.+++++. + .+..+++|+............... 
..| T Consensus 141 ~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~-~-------~~~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~ 212 (434) T protein:vir:62 141 RALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGV-K-------TKENIKYPVLVKKAEAQGHKNERTNNEMPETD 212 (434) T ss_pred hhhcccccccceecchhhHHHHHHhhhhhhhhhhhccee-c-------cCCceEEEEEecCCcccceecccccccccccc Confidence 22111 1123 445566888888888888887653 2 223467776433222221111111 122 Q ss_pred CCcceEEEEeccccccceEecHHHhccHHH-HHHHHH-HHHHHHHHHHHHHHHHHHHhcc--cceeeccC-CCCCCCCCc Q lcl|Aclame:pro 70 LLELNVAVNMGEPDNDFFQLRADDLRDETA-YRHRIQ-SAARKLANNVELKVANMAAEMG--SLVITSPD-AIGTNTADA 144 (430) Q Consensus 70 ~~e~sV~v~l~~~k~V~~~~t~keL~~~~~-~~r~l~-pAm~~LAn~Id~dl~~~~~~~a--s~~~~~~~-~~~~~~~~~ 144 (430) ..=..|.++..+-.. -+.+|.+=|.+..+ .+.+|. .-...|+..+|..+++---... .......+ ...+..... T Consensus 213 ~~f~~v~~~~~k~~~-~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~~ 291 (434) T protein:vir:62 213 IEFDEIELSPTEFDA-LATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTDEKNL 291 (434) T ss_pred cceeeEEeeheeeEe-ehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccccccch Confidence 222233333322222 23343333444322 345543 4456788888887762100000 00000000 111112233 Q ss_pred hhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccc Q lcl|Aclame:pro 145 WNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKST 224 (430) Q Consensus 145 ~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt 224 (430) |+++......|.....+ +=..++||.+...+. .+ ...+ ||++ |. .+.+ ... 
T Consensus 292 ~d~l~~l~~~l~~~~~~---~a~~v~n~~~~~~L~-~l---kd~~------------G~~l----~~-~~~~-----~~~ 342 (434) T protein:vir:62 292 YDALVKMKNTPVKEVRK---KARWVLNTAALTKIE-TM---KTDD------------GFPL----LR-PFNQ-----AEG 342 (434) T ss_pred hhHHHHHHhhcchhhhc---CCEEEEcHHHHHHHH-Hh---hccC------------CCEe----ec-cCCC-----ccC Confidence 66666554444332222 123589999876553 22 1111 2221 00 0000 001 Q ss_pred cccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeec---cc---EEEEcceeeccccccccccccce-EEE Q lcl|Aclame:pro 225 ATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKR---GD---KISFTGVKFLGQMAKNVLAQDAT-FSV 297 (430) Q Consensus 225 ~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~---GD---v~TiaGV~~v~~~tk~~~~~l~~-fvV 297 (430) +.+.++.|-+-.-..... .+. .+....+.. |-++. +| ..+|.- .+ -+.. ...+. |++ T Consensus 343 g~~~tl~G~pV~~~~~~~---~~~----~~~~~~i~~---Gdfs~~~i~~~~g~~~i~~---~~--~~~~-~~~~v~~~~ 406 (434) T protein:vir:62 343 GIGYTLLGFPVEEEDAID---IPD----SPDTPVFYF---GDFSKFYIQDVIGSLEVQK---LV--ELFS-RTNRVGFRI 406 (434) T ss_pred CCCceecceeeEEecCcc---Ccc----CCCceEEEE---eeccceEEEEeeceeEEEe---eh--hhhc-ccCceEEEE Confidence 112223332210000000 000 000000000 01111 11 111110 00 0000 11222 555 Q ss_pred EEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCcee Q lcl|Aclame:pro 298 VRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDART 350 (430) Q Consensus 298 t~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~ 350 (430) ..-.++--| .+|.- -+.+.+.++..+.. 
T Consensus 407 ~~r~Dgk~i-~~~~~------------------------~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 407 WNLLDAQLI-HSPFE------------------------VPVYKYVLKAPTGA 434 (434) T ss_pred Eeeecceee-cCccc------------------------ceEEEEEeccCCCC Confidence 544333211 11211 11222222211111 No 84 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=35.43 E-value=1.2 Score=20.09 Aligned_cols=267 Identities=9% Similarity=0.046 Sum_probs=98.5 Q ss_pred Cccc----hhhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc--cCCccC-Ccc---CC Q lcl|Aclame:pro 1 MALN----EGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--EGWDLT-DKA---TG 69 (430) Q Consensus 1 MAn~----~~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~--~g~~~s-~~~---~d 69 (430) |+.. -+.++ +.+..++|+.++...++.++++.+.- .+.+..++.+...... -.+... ... ++ T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~ 181 (397) T protein:vir:48 109 KTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENV-------TTLTGSRVYEKWADITGLAKLDDEAGSIGTNDD 181 (397) T ss_pred hhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeec-------cCCcceEEEEeecCCCcceeeeccccccccccc Confidence 3221 12233 56678888889999999888864321 2344444443221110 011111 111 11 Q ss_pred CCcceEEEEeccccccceEecHHHhccHHH-HHHHHH-HHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhH Q lcl|Aclame:pro 70 LLELNVAVNMGEPDNDFFQLRADDLRDETA-YRHRIQ-SAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNF 147 (430) Q Consensus 70 ~~e~sV~v~l~~~k~V~~~~t~keL~~~~~-~~r~l~-pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d 147 (430) ..=..|.++..+- ..-+.+|.+=|++..+ .+++|. ...++++..+|..++. 
+.....+..+...|.+ T Consensus 182 ~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~----------G~g~~~~~~~~~~~d~ 250 (397) T protein:vir:48 182 PKLYPIRYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILE----------AIATLPTKPTLTKWDD 250 (397) T ss_pred cceeeEEeeheee-eeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhh----------cccccccccccccHHH Confidence 2223444444322 2234454444544332 344443 4446777888887752 2222233445677899 Q ss_pred HHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceeccccccc Q lcl|Aclame:pro 148 VADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATG 227 (430) Q Consensus 148 ~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~ 227 (430) +..+...|.....+. -..+++|.+...|. .+ ...+ ||++..-+ . ..+++ T Consensus 251 i~~~~~~l~~~~~~~---a~~v~n~~~~~~L~-~l---kd~~------------G~~i~~~~----------~--~~~~~ 299 (397) T protein:vir:48 251 IIDLQAKVDPAIKQT---SFFLTNTSGFTALK-KV---KNAF------------GDYLMERD----------V--KSPTG 299 (397) T ss_pred HHHHHHHhhhhhcCC---CEEEECHHHHHHHH-Hh---hcCC------------CceeeccC----------c--CCCCC Confidence 988887777665543 34789999876553 22 1111 22211000 0 01112 Q ss_pred ceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccE---EEE---ccee-eccccc-cccccccceEEEEE Q lcl|Aclame:pro 228 ITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDK---ISF---TGVK-FLGQMA-KNVLAQDATFSVVR 299 (430) Q Consensus 228 ~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv---~Ti---aGV~-~v~~~t-k~~~~~l~~fvVt~ 299 (430) .++.|-+-...........+ . +...+.-||- +.+ .|+. .+.+.. +....+...|++.. 
T Consensus 300 ~~l~G~PV~~~~~~~~~~~~-----~---------~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~ 365 (397) T protein:vir:48 300 YSIDGFAVKEVADRWLANAS-----S---------GAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVID 365 (397) T ss_pred ceeccceeEEecccccCCcC-----C---------CceEEEEEeccceEEEEeecceEEEEeccchhhhhcCceeEEEEe Confidence 23333221000000000000 0 0000111110 000 0100 000000 00111111233332 Q ss_pred eecCceeEEeec-cccccccccccccccccccccccccCceeEE Q lcl|Aclame:pro 300 VVDGTHVEITPK-PVALDDVSLSPEQRAYANVNTSLADAMAVNI 342 (430) Q Consensus 300 ~~~~~~v~I~p~-~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv 342 (430) -.++ .+ +.|. ++.+. +..++..|++..++-| T Consensus 366 r~d~-~~-~~~~a~~~~~----------~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:48 366 RFDV-VA-TDTESFVPAS----------FKAIADQKGNLGSTAV 397 (397) T ss_pred eecc-EE-ecccceEEEE----------ecccccCCCCccccCC Confidence 2221 11 1111 11110 1112222222222212 No 85 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=31.43 E-value=1.5 Score=19.62 Aligned_cols=240 Identities=15% Similarity=0.118 Sum_probs=86.0 Q ss_pred Ccc--c---hhhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccCCcc-CCcc---CCC Q lcl|Aclame:pro 1 MAL--N---EGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDL-TDKA---TGL 70 (430) Q Consensus 1 MAn--~---~~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~~~-s~~~---~d~ 70 (430) ++. + -+.++ +.+..++++.++...++.++++.. +- .+.+..+|++......-++.. .... .+. 
T Consensus 127 ~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~-~~------~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~ 199 (394) T protein:vir:97 127 QKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVY-QA------KKASGKYPVLQRATTKMVTVAELEKNPALAKP 199 (394) T ss_pred hccccccccccccChHHHHHHHHHHhhhhhhhhhhceee-ec------cCcceEEEEEecCCCccceecccccccccccc Confidence 111 0 11122 445566777777888887777542 10 223456665432221111111 1111 112 Q ss_pred CcceEEEEeccccccceEecHHHhccHHH-HHHHHHH-HHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHH Q lcl|Aclame:pro 71 LELNVAVNMGEPDNDFFQLRADDLRDETA-YRHRIQS-AARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFV 148 (430) Q Consensus 71 ~e~sV~v~l~~~k~V~~~~t~keL~~~~~-~~r~l~p-Am~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~ 148 (430) .=..|.++..+-. .-+.+|.+=|.+..+ .+.+|.. -.+.|+..+|..++.- .++..+.+...+.++ T Consensus 200 ~~~~v~l~~~k~~-~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g-----------~~~~~~~~~~~~~~~ 267 (394) T protein:vir:97 200 DFKDVAWNIDTYR-GAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKV-----------LKSFTTKTVKNLDEI 267 (394) T ss_pred cceeEEeehhhee-eehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhc-----------cccccccccccHHHH Confidence 2234444443322 223444333443322 3444433 3456667777666521 112223345567777 Q ss_pred HHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccccccc Q lcl|Aclame:pro 149 ADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGI 228 (430) Q Consensus 149 a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~ 228 (430) ..+-..+.+ |. .+-..++||.+.+.+.. + ...+ ||++ +..+ .+. +.+. 
T Consensus 268 ~~~~~~~~~---~~-~~a~~v~n~~~~~~l~~-l---kd~~------------G~~i------~~~~----~~~--~~~~ 315 (394) T protein:vir:97 268 KALLNGGFD---PA-YNVSLIVSQSFYQTLDT-L---KDGN------------GRYL------LQDD----ITA--VSGK 315 (394) T ss_pred HHHHHhhhh---hh-hCCEEEEcHHHHHHHHH-h---hccC------------CCee------eecC----cCC--CCCc Confidence 665443322 22 23458999998765532 2 1111 2221 1000 001 1111 Q ss_pred eecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccE---EEEc---c--eeeccccccccccccceEEEEEe Q lcl|Aclame:pro 229 TVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDK---ISFT---G--VKFLGQMAKNVLAQDATFSVVRV 300 (430) Q Consensus 229 tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv---~Tia---G--V~~v~~~tk~~~~~l~~fvVt~~ 300 (430) ++.|-+-.. .+.... +.+++--||. +.|. + +.+... ....+.|++..- T Consensus 316 ~l~G~pv~~-----~~~~~~--------------~~~~~~~gd~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~r 371 (394) T protein:vir:97 316 VLLGKPVFV-----LSDEVL--------------GANKAFIGDFKRGVLFADRKDLGLRWADN-----EIYGQYLQAVLR 371 (394) T ss_pred eeccceeEE-----eccccc--------------CCccEEEeeccccEEEEEecceEEEEecc-----cccceeEEEEEE Confidence 222221100 000000 0001111210 0000 0 000000 001122333322 Q ss_pred ecC--------ceeEEeeccccc Q lcl|Aclame:pro 301 VDG--------THVEITPKPVAL 315 (430) Q Consensus 301 ~~~--------~~v~I~p~~v~~ 315 (430) .++ -.++++|.+.|+ T Consensus 372 ~d~~v~~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 372 FGVSKVDDKAGYYVTFTPEPLPL 394 (394) T ss_pred EccEEecccceEEEEecccccCC Confidence 221 124444444433 No 86 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=31.09 E-value=1.5 Score=19.58 Aligned_cols=269 Identities=11% Similarity=0.011 Sum_probs=89.9 Q ss_pred Cccc----hhhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc-cCCcc-CCccCC---C Q lcl|Aclame:pro 1 MALN----EGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ-EGWDL-TDKATG---L 70 (430) Q Consensus 1 
MAn~----~~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~-~g~~~-s~~~~d---~ 70 (430) |... -+.++ +.+..++++.+++..++.++++.. +- .+.+...++|...... -+|.. ....++ . T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~-~~------~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 178 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE-PV------RTRSGSRVLEKNSDMIPFAEITEMGEIPETDNP 178 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee-ec------cCCceeEEEEeecCCccceeecccccccccccc Confidence 2211 11233 445567777788888877777432 10 1333334443221110 11111 111111 1 Q ss_pred CcceEEEEeccccccceEecHHHhccHH-HHHHHHHH-HHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHH Q lcl|Aclame:pro 71 LELNVAVNMGEPDNDFFQLRADDLRDET-AYRHRIQS-AARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFV 148 (430) Q Consensus 71 ~e~sV~v~l~~~k~V~~~~t~keL~~~~-~~~r~l~p-Am~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~ 148 (430) .-..|.++..+- .+-+.+|.+=|.+.. ..+.+|.. -.++|+..+|..+++ + .++....+...|.++ T Consensus 179 ~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~----------g-~g~~~~~~~~~~d~i 246 (392) T protein:vir:10 179 KFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILG----------V-IEKLTKQAIKSLDDI 246 (392) T ss_pred cceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhh----------c-cccccccCccCHHHH Confidence 122344444222 344555555454432 23444433 335677777766642 1 122233455678888 Q ss_pred HHHH-HHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccc-hh-hhhhhhccccccchhhhHHHhCCCcceeccccc Q lcl|Aclame:pro 149 ADAE-ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IP-EEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTA 225 (430) Q Consensus 149 a~a~-~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~-~~-~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~ 225 (430) ..+- ..|.....+ +-..+++|.+.+.+. .+ ...++ -. 
..-+..|.-++ +.|+.-+...++...-..+.+ T Consensus 247 ~~~~~~~l~~~~~~---~a~~vm~~~~~~~L~-~l---kd~~G~~l~~~~~~~~~~~t-llG~~~v~~~~~~~~~~~~~~ 318 (392) T protein:vir:10 247 KDVLNVKLDPAISP---NAILLTNQDGFNYLD-KL---KDKDGKYILQSDPTQKNKKL-FAGTNPVVVVSNRFLKSKGTT 318 (392) T ss_pred HHHHHHhhhhhhcc---CCEEEEcHHHHHHHH-Hh---hccCCCeEeecCccCCcccc-ccCcccEEEecccccCCCccc Confidence 7764 344433333 244899999877663 22 11110 00 00111121111 222211110000000000100 Q ss_pred cc--ceecccceeeeeEEEEeecccccccccee-eEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeec Q lcl|Aclame:pro 226 TG--ITVSGAQSFKPVAWQLDNDGNKVNVDNRF-ATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD 302 (430) Q Consensus 226 ~~--~tV~ga~~~~~~~~t~~~~~~~~~~d~~~-~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~ 302 (430) .+ ..+-| |... ..+.....-+|+..| ..+.++ ..+...|++..-.+ T Consensus 319 ~~~~~~~~g--------------------dfs~~~~i~~~~~~~~~~~~---~~~~~f--------~~~~~~~r~~~r~d 367 (392) T protein:vir:10 319 AKKAPLIIG--------------------DLKEAIVLFKREDMELASTD---VGGKAF--------TRNTLDLRAIQRDD 367 (392) T ss_pred CCceEEEEE--------------------ehhceEEEEeecceEEEEec---cccchh--------hcCceEEEEEEeec Confidence 00 00000 0000 000000000111100 011111 01112244433222 Q ss_pred CceeEEeecccccccccccccccccccccccccc Q lcl|Aclame:pro 303 GTHVEITPKPVALDDVSLSPEQRAYANVNTSLAD 336 (430) Q Consensus 303 ~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~ 336 (430) + .+....+++.+..... 
-+++.|+- T Consensus 368 ~-~v~~~~a~~~l~~~~~--------a~~~~~~~ 392 (392) T protein:vir:10 368 V-QMWDNEAAVYGEIDLS--------APVEQPQG 392 (392) T ss_pred c-EEecccceEEEEeccc--------ccccCCCC Confidence 1 1111111111111110 11122222 No 87 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=31.09 E-value=1.5 Score=19.58 Aligned_cols=269 Identities=11% Similarity=0.011 Sum_probs=89.9 Q ss_pred Cccc----hhhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc-cCCcc-CCccCC---C Q lcl|Aclame:pro 1 MALN----EGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ-EGWDL-TDKATG---L 70 (430) Q Consensus 1 MAn~----~~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~-~g~~~-s~~~~d---~ 70 (430) |... -+.++ +.+..++++.+++..++.++++.. +- .+.+...++|...... -+|.. ....++ . T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~-~~------~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 178 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE-PV------RTRSGSRVLEKNSDMIPFAEITEMGEIPETDNP 178 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee-ec------cCCceeEEEEeecCCccceeecccccccccccc Confidence 2211 11233 445567777788888877777432 10 1333334443221110 11111 111111 1 Q ss_pred CcceEEEEeccccccceEecHHHhccHH-HHHHHHHH-HHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHH Q lcl|Aclame:pro 71 LELNVAVNMGEPDNDFFQLRADDLRDET-AYRHRIQS-AARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFV 148 (430) Q Consensus 71 ~e~sV~v~l~~~k~V~~~~t~keL~~~~-~~~r~l~p-Am~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~ 148 (430) .-..|.++..+- .+-+.+|.+=|.+.. ..+.+|.. 
-.++|+..+|..+++ + .++....+...|.++ T Consensus 179 ~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~----------g-~g~~~~~~~~~~d~i 246 (392) T protein:vir:10 179 KFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILG----------V-IEKLTKQAIKSLDDI 246 (392) T ss_pred cceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhh----------c-cccccccCccCHHHH Confidence 122344444222 344555555454432 23444433 335677777766642 1 122233455678888 Q ss_pred HHHH-HHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccc-hh-hhhhhhccccccchhhhHHHhCCCcceeccccc Q lcl|Aclame:pro 149 ADAE-ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IP-EEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTA 225 (430) Q Consensus 149 a~a~-~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~-~~-~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~ 225 (430) ..+- ..|.....+ +-..+++|.+.+.+. .+ ...++ -. ..-+..|.-++ +.|+.-+...++...-..+.+ T Consensus 247 ~~~~~~~l~~~~~~---~a~~vm~~~~~~~L~-~l---kd~~G~~l~~~~~~~~~~~t-llG~~~v~~~~~~~~~~~~~~ 318 (392) T protein:vir:10 247 KDVLNVKLDPAISP---NAILLTNQDGFNYLD-KL---KDKDGKYILQSDPTQKNKKL-FAGTNPVVVVSNRFLKSKGTT 318 (392) T ss_pred HHHHHHhhhhhhcc---CCEEEEcHHHHHHHH-Hh---hccCCCeEeecCccCCcccc-ccCcccEEEecccccCCCccc Confidence 7764 344433333 244899999877663 22 11110 00 00111121111 222211110000000000100 Q ss_pred cc--ceecccceeeeeEEEEeecccccccccee-eEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeec Q lcl|Aclame:pro 226 TG--ITVSGAQSFKPVAWQLDNDGNKVNVDNRF-ATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD 302 (430) Q Consensus 226 ~~--~tV~ga~~~~~~~~t~~~~~~~~~~d~~~-~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~ 302 (430) .+ ..+-| |... 
..+.....-+|+..| ..+.++ ..+...|++..-.+ T Consensus 319 ~~~~~~~~g--------------------dfs~~~~i~~~~~~~~~~~~---~~~~~f--------~~~~~~~r~~~r~d 367 (392) T protein:vir:10 319 AKKAPLIIG--------------------DLKEAIVLFKREDMELASTD---VGGKAF--------TRNTLDLRAIQRDD 367 (392) T ss_pred CCceEEEEE--------------------ehhceEEEEeecceEEEEec---cccchh--------hcCceEEEEEEeec Confidence 00 00000 0000 000000000111100 011111 01112244433222 Q ss_pred CceeEEeecccccccccccccccccccccccccc Q lcl|Aclame:pro 303 GTHVEITPKPVALDDVSLSPEQRAYANVNTSLAD 336 (430) Q Consensus 303 ~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~ 336 (430) + .+....+++.+..... -+++.|+- T Consensus 368 ~-~v~~~~a~~~l~~~~~--------a~~~~~~~ 392 (392) T protein:vir:10 368 V-QMWDNEAAVYGEIDLS--------APVEQPQG 392 (392) T ss_pred c-EEecccceEEEEeccc--------ccccCCCC Confidence 1 1111111111111110 11122222 No 88 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=31.09 E-value=1.5 Score=19.58 Aligned_cols=269 Identities=11% Similarity=0.011 Sum_probs=89.9 Q ss_pred Cccc----hhhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc-cCCcc-CCccCC---C Q lcl|Aclame:pro 1 MALN----EGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ-EGWDL-TDKATG---L 70 (430) Q Consensus 1 MAn~----~~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~-~g~~~-s~~~~d---~ 70 (430) |... -+.++ +.+..++++.+++..++.++++.. +- .+.+...++|...... -+|.. ....++ . 
T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~-~~------~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 178 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE-PV------RTRSGSRVLEKNSDMIPFAEITEMGEIPETDNP 178 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee-ec------cCCceeEEEEeecCCccceeecccccccccccc Confidence 2211 11233 445567777788888877777432 10 1333334443221110 11111 111111 1 Q ss_pred CcceEEEEeccccccceEecHHHhccHH-HHHHHHHH-HHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHH Q lcl|Aclame:pro 71 LELNVAVNMGEPDNDFFQLRADDLRDET-AYRHRIQS-AARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFV 148 (430) Q Consensus 71 ~e~sV~v~l~~~k~V~~~~t~keL~~~~-~~~r~l~p-Am~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~ 148 (430) .-..|.++..+- .+-+.+|.+=|.+.. ..+.+|.. -.++|+..+|..+++ + .++....+...|.++ T Consensus 179 ~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~----------g-~g~~~~~~~~~~d~i 246 (392) T protein:vir:10 179 KFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILG----------V-IEKLTKQAIKSLDDI 246 (392) T ss_pred cceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhh----------c-cccccccCccCHHHH Confidence 122344444222 344555555454432 23444433 335677777766642 1 122233455678888 Q ss_pred HHHH-HHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccc-hh-hhhhhhccccccchhhhHHHhCCCcceeccccc Q lcl|Aclame:pro 149 ADAE-ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IP-EEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTA 225 (430) Q Consensus 149 a~a~-~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~-~~-~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~ 225 (430) ..+- ..|.....+ +-..+++|.+.+.+. .+ ...++ -. 
..-+..|.-++ +.|+.-+...++...-..+.+ T Consensus 247 ~~~~~~~l~~~~~~---~a~~vm~~~~~~~L~-~l---kd~~G~~l~~~~~~~~~~~t-llG~~~v~~~~~~~~~~~~~~ 318 (392) T protein:vir:10 247 KDVLNVKLDPAISP---NAILLTNQDGFNYLD-KL---KDKDGKYILQSDPTQKNKKL-FAGTNPVVVVSNRFLKSKGTT 318 (392) T ss_pred HHHHHHhhhhhhcc---CCEEEEcHHHHHHHH-Hh---hccCCCeEeecCccCCcccc-ccCcccEEEecccccCCCccc Confidence 7764 344433333 244899999877663 22 11110 00 00111121111 222211110000000000100 Q ss_pred cc--ceecccceeeeeEEEEeecccccccccee-eEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeec Q lcl|Aclame:pro 226 TG--ITVSGAQSFKPVAWQLDNDGNKVNVDNRF-ATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD 302 (430) Q Consensus 226 ~~--~tV~ga~~~~~~~~t~~~~~~~~~~d~~~-~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~ 302 (430) .+ ..+-| |... ..+.....-+|+..| ..+.++ ..+...|++..-.+ T Consensus 319 ~~~~~~~~g--------------------dfs~~~~i~~~~~~~~~~~~---~~~~~f--------~~~~~~~r~~~r~d 367 (392) T protein:vir:10 319 AKKAPLIIG--------------------DLKEAIVLFKREDMELASTD---VGGKAF--------TRNTLDLRAIQRDD 367 (392) T ss_pred CCceEEEEE--------------------ehhceEEEEeecceEEEEec---cccchh--------hcCceEEEEEEeec Confidence 00 00000 0000 000000000111100 011111 01112244433222 Q ss_pred CceeEEeecccccccccccccccccccccccccc Q lcl|Aclame:pro 303 GTHVEITPKPVALDDVSLSPEQRAYANVNTSLAD 336 (430) Q Consensus 303 ~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~ 336 (430) + .+....+++.+..... 
-+++.|+- T Consensus 368 ~-~v~~~~a~~~l~~~~~--------a~~~~~~~ 392 (392) T protein:vir:10 368 V-QMWDNEAAVYGEIDLS--------APVEQPQG 392 (392) T ss_pred c-EEecccceEEEEeccc--------ccccCCCC Confidence 1 1111111111111110 11122222 No 89 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=31.09 E-value=1.5 Score=19.58 Aligned_cols=269 Identities=11% Similarity=0.011 Sum_probs=89.9 Q ss_pred Cccc----hhhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc-cCCcc-CCccCC---C Q lcl|Aclame:pro 1 MALN----EGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ-EGWDL-TDKATG---L 70 (430) Q Consensus 1 MAn~----~~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~-~g~~~-s~~~~d---~ 70 (430) |... -+.++ +.+..++++.+++..++.++++.. +- .+.+...++|...... -+|.. ....++ . T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~-~~------~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 178 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE-PV------RTRSGSRVLEKNSDMIPFAEITEMGEIPETDNP 178 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee-ec------cCCceeEEEEeecCCccceeecccccccccccc Confidence 2211 11233 445567777788888877777432 10 1333334443221110 11111 111111 1 Q ss_pred CcceEEEEeccccccceEecHHHhccHH-HHHHHHHH-HHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHH Q lcl|Aclame:pro 71 LELNVAVNMGEPDNDFFQLRADDLRDET-AYRHRIQS-AARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFV 148 (430) Q Consensus 71 ~e~sV~v~l~~~k~V~~~~t~keL~~~~-~~~r~l~p-Am~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~ 148 (430) .-..|.++..+- .+-+.+|.+=|.+.. ..+.+|.. 
-.++|+..+|..+++ + .++....+...|.++ T Consensus 179 ~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~----------g-~g~~~~~~~~~~d~i 246 (392) T protein:vir:10 179 KFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILG----------V-IEKLTKQAIKSLDDI 246 (392) T ss_pred cceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhh----------c-cccccccCccCHHHH Confidence 122344444222 344555555454432 23444433 335677777766642 1 122233455678888 Q ss_pred HHHH-HHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccc-hh-hhhhhhccccccchhhhHHHhCCCcceeccccc Q lcl|Aclame:pro 149 ADAE-ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IP-EEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTA 225 (430) Q Consensus 149 a~a~-~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~-~~-~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~ 225 (430) ..+- ..|.....+ +-..+++|.+.+.+. .+ ...++ -. ..-+..|.-++ +.|+.-+...++...-..+.+ T Consensus 247 ~~~~~~~l~~~~~~---~a~~vm~~~~~~~L~-~l---kd~~G~~l~~~~~~~~~~~t-llG~~~v~~~~~~~~~~~~~~ 318 (392) T protein:vir:10 247 KDVLNVKLDPAISP---NAILLTNQDGFNYLD-KL---KDKDGKYILQSDPTQKNKKL-FAGTNPVVVVSNRFLKSKGTT 318 (392) T ss_pred HHHHHHhhhhhhcc---CCEEEEcHHHHHHHH-Hh---hccCCCeEeecCccCCcccc-ccCcccEEEecccccCCCccc Confidence 7764 344433333 244899999877663 22 11110 00 00111121111 222211110000000000100 Q ss_pred cc--ceecccceeeeeEEEEeecccccccccee-eEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeec Q lcl|Aclame:pro 226 TG--ITVSGAQSFKPVAWQLDNDGNKVNVDNRF-ATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD 302 (430) Q Consensus 226 ~~--~tV~ga~~~~~~~~t~~~~~~~~~~d~~~-~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~ 302 (430) .+ ..+-| |... 
..+.....-+|+..| ..+.++ ..+...|++..-.+ T Consensus 319 ~~~~~~~~g--------------------dfs~~~~i~~~~~~~~~~~~---~~~~~f--------~~~~~~~r~~~r~d 367 (392) T protein:vir:10 319 AKKAPLIIG--------------------DLKEAIVLFKREDMELASTD---VGGKAF--------TRNTLDLRAIQRDD 367 (392) T ss_pred CCceEEEEE--------------------ehhceEEEEeecceEEEEec---cccchh--------hcCceEEEEEEeec Confidence 00 00000 0000 000000000111100 011111 01112244433222 Q ss_pred CceeEEeecccccccccccccccccccccccccc Q lcl|Aclame:pro 303 GTHVEITPKPVALDDVSLSPEQRAYANVNTSLAD 336 (430) Q Consensus 303 ~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~ 336 (430) + .+....+++.+..... -+++.|+- T Consensus 368 ~-~v~~~~a~~~l~~~~~--------a~~~~~~~ 392 (392) T protein:vir:10 368 V-QMWDNEAAVYGEIDLS--------APVEQPQG 392 (392) T ss_pred c-EEecccceEEEEeccc--------ccccCCCC Confidence 1 1111111111111110 11122222 No 90 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=27.65 E-value=1.8 Score=19.15 Aligned_cols=258 Identities=11% Similarity=0.033 Sum_probs=98.0 Q ss_pred Cc--cchhh-HHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccc-ccCCccC---CccCCCCcc Q lcl|Aclame:pro 1 MA--LNEGQ-IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT-QEGWDLT---DKATGLLEL 73 (430) Q Consensus 1 MA--n~~~~-~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~-~~g~~~s---~~~~d~~e~ 73 (430) |. .+... +.+....++|..++...++.++++...- .+.++.+|+...... ...+... ....++.=. 
T Consensus 109 ~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~ 181 (379) T protein:vir:10 109 MTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSI-------SGGTYTFVRENGAGEGAIGAQVEGATKGQKDYDIS 181 (379) T ss_pred cccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeec-------cCCceEEEEeecCCCcccccccCCcccccccccee Confidence 11 11222 2344556777777888888888765322 356778877543221 1111111 111233334 Q ss_pred eEEEEeccccccceEecHHHhccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHHH Q lcl|Aclame:pro 74 NVAVNMGEPDNDFFQLRADDLRDETAYRHRIQS-AARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAE 152 (430) Q Consensus 74 sV~v~l~~~k~V~~~~t~keL~~~~~~~r~l~p-Am~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a~ 152 (430) .|.+.+.+-.. -+.+|.+=|.+..+.+.+|.. -.+.|+..+|..+++-. ...++.+..+....+.+.++..+. T Consensus 182 ~i~~~~~k~~~-~~~iS~ell~D~~~l~~~i~~~la~~~~~~~~~~~~~g~-----~~~~~~~~~~~~~~~~~d~i~~~~ 255 (379) T protein:vir:10 182 MIDVNTDFIAG-FTRYSKKMANNLPFLTSFIPNALRRDYAKAENAAFNAVL-----AANATASTEIITNKNKVEMLINEI 255 (379) T ss_pred eeEeeeeeEEe-eehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhccc-----ccccccccccccCcccHHHHHHHH Confidence 55555544443 245555435554556666654 33577777777664211 111111112222234567777766 Q ss_pred HHHHHhCCCcCCCcEEEecHHHHHHHHHhh--hhhhhccchhhhhhhhccccccchhhhHHHhCCCcceeccccccccee Q lcl|Aclame:pro 153 ELMFSRELNRDMGTSYFFNPQDYKKAGYDL--TKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITV 230 (430) Q Consensus 153 ~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~--~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV 230 (430) ..+...+.+.+ ..++||.++..+.... .+....+ ..-....|.-. 
.+.|+- +..++.++ +|+ +.+ T Consensus 256 ~~~~~~~~~~~---~~vmn~~~~~~l~~lkd~~G~~l~~--~~~~~~~~~~~-~l~G~p-vv~s~~~~---ag~---~~~ 322 (379) T protein:vir:10 256 AKQENLDFPVT---AIVLRPTDYYDILVTQKSVGAGYGL--PGVVTQDNGVL-RINGIP-LFRATWLA---ANK---YYV 322 (379) T ss_pred HhhhhccCCCC---EEEEcHHHHHHHHHhhccCCceecc--CCccCCCCCcc-eeccee-eEecCCCC---CCc---eEE Confidence 66666655542 3899999887653211 1111000 00000111111 134443 22333332 111 111 Q ss_pred cccceeeeeEEEEeeccccccccceeeEEEeeccc--eeecccEEEEcceeeccccccccccccceEEEEEeecC-ceeE Q lcl|Aclame:pro 231 SGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATT--GLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDG-THVE 307 (430) Q Consensus 231 ~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tg--tlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~-~~v~ 307 (430) |-.+ .... .. . +-..+.++-.. .+.. |.+.|-...-+ . +.| ..+ .=+. T Consensus 323 -gdf~--~~~~-~~-~--------~~~~i~~~~~~~~~f~~-~~~~~r~~~R~------~------~~v---~~p~a~v~ 373 (379) T protein:vir:10 323 -GDWT--RVTK-VT-T--------EGLSLEFSEVEGTNFVK-NNITARIEAQV------A------LAV---EQPAALIF 373 (379) T ss_pred -eecc--cEEE-EE-E--------eceEEEEeecccccccC-CcEEEEEEEEe------c------cEE---ecCccEEE Confidence 1110 0000 00 0 00011111110 0111 12222110000 0 001 011 1122 Q ss_pred Eeeccc Q lcl|Aclame:pro 308 ITPKPV 313 (430) Q Consensus 308 I~p~~v 313 (430) |+-+-| T Consensus 374 ~~~~~~ 379 (379) T protein:vir:10 374 GDFTAV 379 (379) T ss_pred EEecCC Confidence 222222 No 91 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=26.87 E-value=1.9 Score=19.05 Aligned_cols=279 Identities=11% Similarity=0.042 Sum_probs=106.4 Q ss_pred Cccc-hhh-HHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccc----cccCCccCCccCCCCcce Q lcl|Aclame:pro 1 MALN-EGQ-IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESP----TQEGWDLTDKATGLLELN 74 (430) Q Consensus 1 
MAn~-~~~-~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~----~~~g~~~s~~~~d~~e~s 74 (430) ...+ .+. +-+.+.+++|+.+++..++.+++++..- .+.++++|+=.... ...|...+ .+++.=.+ T Consensus 22 ~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~-------~~~~~~~p~~~~~~~a~~v~Eg~~~~--~~~~~f~~ 92 (326) T protein:vir:42 22 TGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPM-------GTTGQKIPHWTGDVSASWIGEGDMKP--ITKGNMTS 92 (326) T ss_pred ccccCCcceechhhHHHHHHHHHhcchhhhhcceeec-------cCCceEEEEEeCCcceEEecCCcccc--ccccceeE Confidence 1222 122 4466778899999999999888865321 24566776522111 11221111 12222223 Q ss_pred EEEEeccccccceEecHHHhccH-HHHHHHHHH-HHHHHHHHHHHHHHHHHH-------hcccceeeccCCCCCCC--CC Q lcl|Aclame:pro 75 VAVNMGEPDNDFFQLRADDLRDE-TAYRHRIQS-AARKLANNVELKVANMAA-------EMGSLVITSPDAIGTNT--AD 143 (430) Q Consensus 75 V~v~l~~~k~V~~~~t~keL~~~-~~~~r~l~p-Am~~LAn~Id~dl~~~~~-------~~as~~~~~~~~~~~~~--~~ 143 (430) +.++..+. ..-+.+|.+-|++. ...+.+|.. -.++++..+|..++.--- .............++.. .. T Consensus 93 i~~~~~k~-~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~~~~~~~~~~~~~~~~~ 171 (326) T protein:vir:42 93 QTIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTTKEVSLVDPDGTGSNADL 171 (326) T ss_pred EEEeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccceeecccccccccc Confidence 44444222 23445554444432 233455544 446789999988863100 00000000001111111 11 Q ss_pred chhHHH--HHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceec Q lcl|Aclame:pro 144 AWNFVA--DAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLT 221 (430) Q Consensus 144 ~~~d~a--~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~ 221 (430) .+.+.. .....+. ..... .-..+++|.....+.. +...+ ||++ +... .. 
T Consensus 172 ~~~~~~~~~~~~~~~--~~~~~-~a~~v~n~~~~~~L~~----lkd~~------------G~~l------~~~~----~~ 222 (326) T protein:vir:42 172 TVYDAVAVNALSLLV--NAGKK-WTHTLLDDITEPILNG----AKDKS------------GRPL------FIES----TY 222 (326) T ss_pred hhHHHHHHHHHhhhh--hhccC-ccEEEEeHHHHHHHHH----hhccC------------Ccee------eccc----cc Confidence 122221 1111111 11111 2336889888765532 21111 2322 1100 00 Q ss_pred cccc---ccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEE Q lcl|Aclame:pro 222 KSTA---TGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVV 298 (430) Q Consensus 222 ~gt~---~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt 298 (430) .|.. .+.++.|-+- -.+..+.+|+.+.+.| +..++ T Consensus 223 ~~~~~~~~~~~l~G~pv--------------------------~~~~~~~~~~~~~~~G-------------d~s~~--- 260 (326) T protein:vir:42 223 TEENSPFRLGRIVARPT--------------------------ILSDHVASGTVVGYQG-------------DFRQL--- 260 (326) T ss_pred cCccccccCceeeeeeE--------------------------EEcCCCCCCceEEEEe-------------ecceE--- Confidence 0000 0001111000 0000111222222222 00111 Q ss_pred EeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEE-EecccCCCCchhhce Q lcl|Aclame:pro 299 RVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIV-SQPIPANHELFAGMK 377 (430) Q Consensus 299 ~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~La-trpl~~p~~~~~~~~ 377 (430) +...+..+.+- ++..-+ T Consensus 261 -----------------------------------------------------~~~~~~~~~v~~~~e~~~--------- 278 (326) T protein:vir:42 261 -----------------------------------------------------VWGQVGGLSFDVTDQATL--------- 278 (326) T ss_pred -----------------------------------------------------EEEEecceEEEEeeccee--------- Confidence 11111111110 000000 Q ss_pred eEEEecCCCcEEEEEEeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 378 TTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 378 ~~~~~~~~~Glsirv~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) . ...+..+-.+.. 
..+++..+|...-+++++++|+-. +.|.+=.+ T Consensus 279 -~-~~~~~~~~~~~~-----~~~d~~~~r~~~~~d~~v~~~~a~-~~l~~~~~ 323 (326) T protein:vir:42 279 -N-LGTPQAPNFVSL-----WQHNLVAVRVEAEYAFHCNDKDAF-VKLTNVDA 323 (326) T ss_pred -e-ecccccccchhh-----hhcCcEEEEEEEEeccEEecccce-EEEeeccc Confidence 0 000000000111 224578888888899999999874 66777766 No 92 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=26.81 E-value=1.9 Score=19.05 Aligned_cols=275 Identities=11% Similarity=-0.002 Sum_probs=101.9 Q ss_pred Cccchhh----HHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccCCccCC---ccCCCCcc Q lcl|Aclame:pro 1 MALNEGQ----IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTD---KATGLLEL 73 (430) Q Consensus 1 MAn~~~~----~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~~~s~---~~~d~~e~ 73 (430) ||..-.. +-+.+.+++|+.+++..++.++.... + -.+..+++|+=..... -.|.... ...+..=. T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~-~------~~~~~~~~p~~~~~~~-a~wv~Eg~~~~~s~~~f~ 72 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQK-P------IPFNGQREFVFDFDSD-IDIVAENGKKTHGGVSLD 72 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhccee-e------ccCCceEEEEEecCcc-eEEeeCCcccccccccce Confidence 9988444 44778899999999999998887542 2 1234566665211111 1111111 11122222 Q ss_pred eEEEEeccccccceEecHHHhc--cH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccce----------eeccCCCC Q lcl|Aclame:pro 74 NVAVNMGEPDNDFFQLRADDLR--DE---TAYRHRIQSAARKLANNVELKVANMAAEMGSLV----------ITSPDAIG 138 (430) Q Consensus 74 sV~v~l~~~k~V~~~~t~keL~--~~---~~~~r~l~pAm~~LAn~Id~dl~~~~~~~as~~----------~~~~~~~~ 138 (430) .+.++..+- .+-+.+|.+=|+ .+ ++...+.+.-.++++.++|..++.-.-...+.. ........ 
T Consensus 73 ~v~l~~~k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 151 (300) T protein:vir:95 73 PVTIVPLKV-EYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVP 151 (300) T ss_pred eeEeeeEEE-EEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeec Confidence 333333221 223444443332 11 222334455667888888888863210000000 00000111 Q ss_pred CCCCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch-h-hhhhhhccccccchhhhHHHhCCC Q lcl|Aclame:pro 139 TNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-P-EEAYRDGTIQRQVAGFDDVLRSPK 216 (430) Q Consensus 139 ~~~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~-~-~~a~r~g~igr~~~Gfd~~~~~~~ 216 (430) ..+...|.++..+...+...+... ...+++|.+...+.. +...++. . ......|.-++ +.|+- +..++. T Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~---~~~vmn~~~~~~L~~----lkd~~G~~i~~~~~~~~~~~~-l~G~P-v~~s~~ 222 (300) T protein:vir:95 152 FKDTNPDESMEDAVGMIDGSERDI---TGAILDPIFTTALSK----MKNAEGGKLYPELAWGGVPDA-INGLA-VDKNRT 222 (300) T ss_pred ccccchHHHHHHHHHHhhhcCCCc---cEEEECHHHHHHHHH----hhccCCCeeccCccccCCCce-eccee-eEEecC Confidence 122345778888887777655443 248999998776532 2111110 0 01111111111 23332 111111 Q ss_pred cceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEE Q lcl|Aclame:pro 217 LPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFS 296 (430) Q Consensus 217 ~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fv 296 (430) ++.- . +. ....+-.||- .++. 
T Consensus 223 v~~~---~---------------------~~---------------~~~~~~~GDf--------------------~~~~ 243 (300) T protein:vir:95 223 VSYS---Q---------------------TD---------------PKNTAIVGDF--------------------ETMF 243 (300) T ss_pred CCCC---C---------------------CC---------------CccEEEEeec--------------------cceE Confidence 1100 0 00 0001112331 0001 Q ss_pred EEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceee-eeecccceeEEEecccCCCCc Q lcl|Aclame:pro 297 VVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTN-VFWADDAIRIVSQPIPANHEL 372 (430) Q Consensus 297 Vt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~N-laFhr~A~~Latrpl~~p~~~ 372 (430) ....-.+-++++.+..- .+.. .++. .....|.|..- .+.. ...|++||+..+-. +| T Consensus 244 ~~~~~~~~~~~v~~~~~----~d~~-------~~~~--f~~~~v~~r~~--~r~d~~v~~~~a~~~l~~~-----~g 300 (300) T protein:vir:95 244 KWGYAKEVPMEIIKYGD----PDNS-------GRDL--KGYNQIYIRCE--AYIGWGIMDAASFARIVKT-----GG 300 (300) T ss_pred EEEEecccEEEEeeccC----CCCc-------chhh--hhcCcEEEEEE--EeecceeecccceEEEecC-----CC Confidence 11111122233322110 0000 0000 00001111000 0111 22344444443211 11 No 93 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=25.66 E-value=2 Score=18.90 Aligned_cols=269 Identities=16% Similarity=0.064 Sum_probs=95.0 Q ss_pred Cccc---hh-hHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccccCCcc---CCccCCCCcc Q lcl|Aclame:pro 1 MALN---EG-QIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDL---TDKATGLLEL 73 (430) Q Consensus 1 MAn~---~~-~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~g~~~---s~~~~d~~e~ 73 (430) +... .+ -+-+.+..+++..++...++.++++...- .|.++.+|+.......-++.. .....++.=. 
T Consensus 136 ~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~ 208 (418) T protein:vir:10 136 VGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQT-------SSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFN 208 (418) T ss_pred ccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeec-------cCCceeEEEEecCCCceeeeccCcccccccccee Confidence 1111 11 13355666788888888888888754211 245667766432211111111 1111222223 Q ss_pred eEEEEeccccccceEecHHHhccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh---cccceeec---cCCCCCCCCCchh Q lcl|Aclame:pro 74 NVAVNMGEPDNDFFQLRADDLRDETAYRHRIQS-AARKLANNVELKVANMAAE---MGSLVITS---PDAIGTNTADAWN 146 (430) Q Consensus 74 sV~v~l~~~k~V~~~~t~keL~~~~~~~r~l~p-Am~~LAn~Id~dl~~~~~~---~as~~~~~---~~~~~~~~~~~~~ 146 (430) .|.++..+-. .-+.+|.+=|.+....+.+|.. ..++++..+|..++.---. ..+..... .......+...+. T Consensus 209 ~v~~~~~k~~-~~~~is~ell~ds~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 287 (418) T protein:vir:10 209 LKNQPVRTIA-HLFKASRQILDDAPALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITLANATPID 287 (418) T ss_pred eEEEeeeeEE-EeehhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccHH Confidence 4444443332 2345554434433445555544 4567888888877631000 00000000 0001111223456 Q ss_pred HHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccccc Q lcl|Aclame:pro 147 FVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTAT 226 (430) Q Consensus 147 d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~ 226 (430) ++..+-..+...+.+.. ..+++|.+...+.. +. ...++-.-.-..++.-++ +.|+. +..++++|.. 
+ T Consensus 288 ~i~~~~~~~~~~~~~~~---~~v~n~~~~~~L~~-lk--d~~G~~i~~~~~~~~~~~-l~G~p-V~~~~~~p~~---~-- 354 (418) T protein:vir:10 288 KIRLALLQAVLAEFPAT---GIVLNPIDWASIEL-TK--DSQGRYIVGNPVNGTTPR-LWNLP-VVETQAMTAN---E-- 354 (418) T ss_pred HHHHHHHhhccccCCCC---EEEEcHHHHHHHHH-hh--cCCCceeccccccCCCce-eccee-eEEcCCCCCC---c-- Confidence 66666555554444432 37899998876642 21 111110000011233332 44553 2334444311 0 Q ss_pred cceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCcee Q lcl|Aclame:pro 227 GITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHV 306 (430) Q Consensus 227 ~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v 306 (430) + +-|-.+. ...+ .+....++.++ .-.+ +.-..+.-.|++..-.++. + T Consensus 355 -~-~~gd~s~---~~~~--------~~~~~~~i~~~-----------~~~~--------~~f~~~~~~~r~~~~~d~~-~ 401 (418) T protein:vir:10 355 -F-LVGAFSM---AAQI--------FDRMEIEVLLS-----------TENV--------DDFEKNMVSIRAEERLALA-V 401 (418) T ss_pred -E-EEeeccc---eEEE--------EEecceEEEEe-----------cccc--------hhhhcCceEEEEEEeeccE-E Confidence 0 0010000 0000 00000000000 0000 0001111123333222211 1 Q ss_pred EEeeccccccccccccccccccccccccccC Q lcl|Aclame:pro 307 EITPKPVALDDVSLSPEQRAYANVNTSLADA 337 (430) Q Consensus 307 ~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~ 337 (430) .--.+++... 
+ ..++.| T Consensus 402 ~~~~a~~~~~-------------~-~~~~~g 418 (418) T protein:vir:10 402 YRPESFVTGA-------------L-VEQAGG 418 (418) T ss_pred ecccceEEEE-------------e-ccCCCC Confidence 1101111000 0 011111 No 94 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=25.62 E-value=2 Score=18.89 Aligned_cols=275 Identities=13% Similarity=0.105 Sum_probs=96.1 Q ss_pred Cccc-----hhhHH-HHHHHHHH-HHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc----cCCccCCccCC Q lcl|Aclame:pro 1 MALN-----EGQIV-TLAVDEII-ETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ----EGWDLTDKATG 69 (430) Q Consensus 1 MAn~-----~~~~~-~~~~~~vl-~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~----~g~~~s~~~~d 69 (430) ++.. -+.++ +.+..++| ..++...++.+++++. +. .| .+++|+-...... .|... ...+ T Consensus 249 ~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~-~~------~g-~~~~~~~~~~~~a~~v~Eg~~~--~~~~ 318 (543) T protein:vir:81 249 RAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQV-VA------TG-DVWHGVSSAAVQWSWDAEFEEV--SDDS 318 (543) T ss_pred hhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccc-cC------Cc-ceEEEEecCCcceeecccCccc--cccc Confidence 1111 11223 22333444 4466777777776532 11 23 3555542222111 12111 1122 Q ss_pred CCcceEEEEeccccccceEecHHHhccHHHHHHHHHH-HHHHHHHHHHHHHHHHHH---hccccee---eccC--CCCCC Q lcl|Aclame:pro 70 LLELNVAVNMGEPDNDFFQLRADDLRDETAYRHRIQS-AARKLANNVELKVANMAA---EMGSLVI---TSPD--AIGTN 140 (430) Q Consensus 70 ~~e~sV~v~l~~~k~V~~~~t~keL~~~~~~~r~l~p-Am~~LAn~Id~dl~~~~~---~~as~~~---~~~~--~~~~~ 140 (430) +.=..+.++..+-. +-+.+|-+=|.+.-..+.+|.. -.+.++..+|..++.--- ...+... .... +..+. 
T Consensus 319 ~~~~~i~~~~~k~~-~~~~is~ell~d~~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~ 397 (543) T protein:vir:81 319 PEFGQPEIPVKKAQ-GFVPISIEALQDEANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEIAPVTA 397 (543) T ss_pred cccceeeeeeeeeE-eeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhccccccccccccc Confidence 22233444443332 2345554334333233444433 345677777776652100 0000000 0000 11112 Q ss_pred CCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCccee Q lcl|Aclame:pro 141 TADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVL 220 (430) Q Consensus 141 ~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~ 220 (430) ....|.|+..+-..|.....+. -..+++|.+++.+.. + ...+ ||++ +.. + T Consensus 398 ~~~~~~~~~~~~~~l~~~~~~~---~~~v~n~~~~~~l~~-l---kd~~------------G~~l------~~~--~--- 447 (543) T protein:vir:81 398 ETFALADVYAVYEQLAARHRRQ---GAWLANNLIYNKIRQ-F---DTQG------------GAGL------WTT--I--- 447 (543) T ss_pred ccccHHHHHHHHHhhhccccCC---cEEEEcHHHHHHHHH-h---hcCC------------Ccee------ccC--c--- Confidence 2334777766665554333222 247899988776642 2 1111 2221 110 0 Q ss_pred cccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEe Q lcl|Aclame:pro 221 TKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRV 300 (430) Q Consensus 221 ~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~ 300 (430) . .+.+.++.|-+- ..+-.... 
....+..+|+..-|-| +...|.+ .+ T Consensus 448 ~--~g~~~~l~G~pv----~~~~~~~~--------------~~~~~~~~~~~~i~~g-------------d~~~~~i-~~ 493 (543) T protein:vir:81 448 G--NGEPSQLLGRPV----GEAEAMDA--------------NWNTSASADNFVLLYG-------------NFQNYVI-AD 493 (543) T ss_pred C--CCCCccccceee----EEeccccc--------------cccccccCCcceEEEe-------------eccceeE-Ee Confidence 0 011122333211 00000000 0001233344443334 2333333 23 Q ss_pred ecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccC Q lcl|Aclame:pro 301 VDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPA 368 (430) Q Consensus 301 ~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~ 368 (430) ..+-+|.++|....... .....+.+..-.-+ -=-..+++||++...+-.- T Consensus 494 ~~~~~i~~~~~~~~~~~-----------------~~~~~~~~~~~~r~-d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 494 RIGMTVEFIPHLFGTNR-----------------RPNGSRGWFAYYRM-GADVVNPNAFRLLNVETAS 543 (543) T ss_pred ecccEEEEeccccccch-----------------hhcCceEEEEEEee-ccEeecccceEEEEecccC Confidence 23344555444211000 00001112110000 0022445666555544322 No 95 >protein:vir:395 Length: 117 # NCBI annotation: gp10 # Family: family:all:1908 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046905;genbank:gi:9630475;genbank:GeneID:1261649 Probab=23.57 E-value=2.3 Score=18.61 Aligned_cols=112 Identities=21% Similarity=0.235 Sum_probs=35.2 Q ss_pred ecHHHHHHHHHhhhhhhh-ccchhhhhhhhccccccchhhhHHHhCCCcceecccccccceecccceeeeeEEEEeeccc Q lcl|Aclame:pro 170 FNPQDYKKAGYDLTKRDI-FGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGN 248 (430) Q Consensus 170 l~p~~~a~~~~~~~~l~~-~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~ 248 (430) |. ++-.||- .-....++++ +. .|... ..+.|...+.+|+|--. -+. .+..... 
T Consensus 1 m~---------~~dNlFd~ama~aD~aI~-----~~-~g~~a--------~i~~g~~~~rti~gVFD-dP~--~~~~~ag 54 (117) T protein:vir:39 1 MA---------DFDNLFDEAMSRADGAIR-----GV-MGTEA--------KVMSGTLSGATLVGVFD-DPE--NIGYAGA 54 (117) T ss_pred CC---------cccchHHHHHHhhhHHHH-----Hh-cCceE--------EEEeCCCCceEEEEEec-Ccc--ccccccC Confidence 11 1111100 0000112221 11 12221 22233333344444321 111 0111111 Q ss_pred cccccceeeEEEee--ccceeecccEEEEcceeeccccccccccccceEEEEEee-c-CceeEEeecccccccccccccc Q lcl|Aclame:pro 249 KVNVDNRFATVTLS--ATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVV-D-GTHVEITPKPVALDDVSLSPEQ 324 (430) Q Consensus 249 ~~~~d~~~~~~~~s--~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~-~-~~~v~I~p~~v~~~~~~~~~~~ 324 (430) +...-+..-.+.+- -...|+.||.+||+| ++|.|+... + +|.-.|+.+- +.+ T Consensus 55 gg~ie~saP~LfvktaDv~gl~r~D~vtI~g---------------~~y~V~~~~pDg~G~~~l~L~r-------g~p-- 110 (117) T protein:vir:39 55 GIRVEGTSPTLFVKTSTVSQLQRMDTLTING---------------RQFWVDRVGPDDCGSCHIWLGN-------GTP-- 110 (117) T ss_pred ceEEeccCcEEEEeeccccccCCCCEEEECC---------------CceEEeeeccCCCceEEEEeec-------CCC-- Confidence 11111111222221 124699999999999 677776542 2 2332332220 000 Q ss_pred ccccccccccccCceeEEecCCCcee Q lcl|Aclame:pro 325 RAYANVNTSLADAMAVNILNVKDART 350 (430) Q Consensus 325 ~~~~nVsa~pA~~aavTv~~~~~~~~ 350 (430) |+++-. . 
T Consensus 111 ---------p~~~~~----------~ 117 (117) T protein:vir:39 111 ---------PASSRR----------R 117 (117) T ss_pred ---------CCccCC----------C Confidence 111100 0 No 96 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=22.71 E-value=2.4 Score=18.49 Aligned_cols=278 Identities=10% Similarity=0.049 Sum_probs=114.6 Q ss_pred Cccch-----hhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccc-ccCCcc-CCccCCC-- Q lcl|Aclame:pro 1 MALNE-----GQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT-QEGWDL-TDKATGL-- 70 (430) Q Consensus 1 MAn~~-----~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~-~~g~~~-s~~~~d~-- 70 (430) +++.. +.++ +.+..++++.++...++.++++.+.- .+...++|+|..... .-.+.. ....++. T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~ 192 (415) T protein:vir:81 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV-------TNGSGKYPVVRQSEVAALEKVEELEENPELAV 192 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeec-------cCCceeEEEEeecCCccceeeccccccCcccc Confidence 22211 1134 35567788888888888887754211 233344555432211 111111 1122211 Q ss_pred -CcceEEEEeccccccceEecHHHhccHHH-HHHHHH-HHHHHHHHHHHHHHHHHHHhcccce-----eeccCCCCCCCC Q lcl|Aclame:pro 71 -LELNVAVNMGEPDNDFFQLRADDLRDETA-YRHRIQ-SAARKLANNVELKVANMAAEMGSLV-----ITSPDAIGTNTA 142 (430) Q Consensus 71 -~e~sV~v~l~~~k~V~~~~t~keL~~~~~-~~r~l~-pAm~~LAn~Id~dl~~~~~~~as~~-----~~~~~~~~~~~~ 142 (430) .-.++.+++.+-. .-+.+|.+=|.+..+ .+.+|. .-.+.++..+|..++.-.-...... ..........+. 
T Consensus 193 ~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~ 271 (415) T protein:vir:81 193 KPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKA 271 (415) T ss_pred cceeeEEeeeeeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccc Confidence 1234444443332 224444443443322 455554 3446778888887764221111000 001111122234 Q ss_pred CchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecc Q lcl|Aclame:pro 143 DAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTK 222 (430) Q Consensus 143 ~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~ 222 (430) ..|+++.++...|.+..... -..+++|.+...+. . +...+ ||++ +... + . T Consensus 272 ~~~~~i~~~~~~~~~~~~~~---~~~v~n~~~~~~l~-~---lkd~~------------G~~l------~~~~-~---~- 321 (415) T protein:vir:81 272 KSLDDIKDAINLNVKPNYEH---NVAIVSQTMFAKLD-K---MKDKL------------GNYL------IQPD-V---K- 321 (415) T ss_pred cchhHHHHHHHhhhhhccCC---CEEEEcHHHHHHHH-H---hhccC------------Ccee------eccC-c---C- Confidence 56888888777776655543 23789998876553 2 21111 3322 1110 0 1 Q ss_pred cccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeec Q lcl|Aclame:pro 223 STATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD 302 (430) Q Consensus 223 gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~ 302 (430) .+.+.++.|-+-. ..+.. ..-.+||..-+-| +..+|++..+-. 
T Consensus 322 -~~~~~~l~G~pV~-----~~~~~------------------~~~~~~~~~~~~G-------------d~~~~~~~~~~~ 364 (415) T protein:vir:81 322 -EKTQQRLLGAKIE-----ILPDE------------------VLGQKGNNTLIIG-------------NLKDAIVLFDRS 364 (415) T ss_pred -CCCCceecceeeE-----Eeccc------------------ccCCCCccEEEEE-------------ehhccEEEEeec Confidence 1122233332210 00000 0001244433333 333433333222 Q ss_pred CceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhcee Q lcl|Aclame:pro 303 GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKT 378 (430) Q Consensus 303 ~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~ 378 (430) +-++..++... ++.. - -++.-++. -..|++||+..+-.-+.-.+|+-+... T Consensus 365 ~~~v~~~~~~~-------------~~~~-----~-~~~~r~d~------~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:81 365 QYQASWTDYMH-------------FGEC-----L-MIAVRQDC------RILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ceEEEEecccc-------------CceE-----E-EEEEEecc------EEeccccEEEEEEeccCCCCCccccCC Confidence 22333322110 0000 0 01111121 456889999988886543344433333 No 97 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=22.71 E-value=2.4 Score=18.49 Aligned_cols=278 Identities=10% Similarity=0.049 Sum_probs=114.6 Q ss_pred Cccch-----hhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccc-ccCCcc-CCccCCC-- Q lcl|Aclame:pro 1 MALNE-----GQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT-QEGWDL-TDKATGL-- 70 (430) Q Consensus 1 MAn~~-----~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~-~~g~~~-s~~~~d~-- 70 (430) +++.. +.++ +.+..++++.++...++.++++.+.- .+...++|+|..... .-.+.. ....++. 
T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~ 192 (415) T protein:vir:79 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV-------TNGSGKYPVVRQSEVAALEKVEELEENPELAV 192 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeec-------cCCceeEEEEeecCCccceeeccccccCcccc Confidence 22211 1134 35567788888888888887754211 233344555432211 111111 1122211 Q ss_pred -CcceEEEEeccccccceEecHHHhccHHH-HHHHHH-HHHHHHHHHHHHHHHHHHHhcccce-----eeccCCCCCCCC Q lcl|Aclame:pro 71 -LELNVAVNMGEPDNDFFQLRADDLRDETA-YRHRIQ-SAARKLANNVELKVANMAAEMGSLV-----ITSPDAIGTNTA 142 (430) Q Consensus 71 -~e~sV~v~l~~~k~V~~~~t~keL~~~~~-~~r~l~-pAm~~LAn~Id~dl~~~~~~~as~~-----~~~~~~~~~~~~ 142 (430) .-.++.+++.+-. .-+.+|.+=|.+..+ .+.+|. .-.+.++..+|..++.-.-...... ..........+. T Consensus 193 ~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~ 271 (415) T protein:vir:79 193 KPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKA 271 (415) T ss_pred cceeeEEeeeeeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccc Confidence 1234444443332 224444443443322 455554 3446778888887764221111000 001111122234 Q ss_pred CchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecc Q lcl|Aclame:pro 143 DAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTK 222 (430) Q Consensus 143 ~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~ 222 (430) ..|+++.++...|.+..... -..+++|.+...+. . +...+ ||++ +... + . 
T Consensus 272 ~~~~~i~~~~~~~~~~~~~~---~~~v~n~~~~~~l~-~---lkd~~------------G~~l------~~~~-~---~- 321 (415) T protein:vir:79 272 KSLDDIKDAINLNVKPNYEH---NVAIVSQTMFAKLD-K---MKDKL------------GNYL------IQPD-V---K- 321 (415) T ss_pred cchhHHHHHHHhhhhhccCC---CEEEEcHHHHHHHH-H---hhccC------------Ccee------eccC-c---C- Confidence 56888888777776655543 23789998876553 2 21111 3322 1110 0 1 Q ss_pred cccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeec Q lcl|Aclame:pro 223 STATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD 302 (430) Q Consensus 223 gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~ 302 (430) .+.+.++.|-+-. ..+.. ..-.+||..-+-| +..+|++..+-. T Consensus 322 -~~~~~~l~G~pV~-----~~~~~------------------~~~~~~~~~~~~G-------------d~~~~~~~~~~~ 364 (415) T protein:vir:79 322 -EKTQQRLLGAKIE-----ILPDE------------------VLGQKGNNTLIIG-------------NLKDAIVLFDRS 364 (415) T ss_pred -CCCCceecceeeE-----Eeccc------------------ccCCCCccEEEEE-------------ehhccEEEEeec Confidence 1122233332210 00000 0001244433333 333433333222 Q ss_pred CceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhcee Q lcl|Aclame:pro 303 GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKT 378 (430) Q Consensus 303 ~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~ 378 (430) +-++..++... ++.. - -++.-++. -..|++||+..+-.-+.-.+|+-+... 
T Consensus 365 ~~~v~~~~~~~-------------~~~~-----~-~~~~r~d~------~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:79 365 QYQASWTDYMH-------------FGEC-----L-MIAVRQDC------RILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ceEEEEecccc-------------CceE-----E-EEEEEecc------EEeccccEEEEEEeccCCCCCccccCC Confidence 22333322110 0000 0 01111121 456889999988886543344433333 No 98 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=22.71 E-value=2.4 Score=18.49 Aligned_cols=278 Identities=10% Similarity=0.049 Sum_probs=114.6 Q ss_pred Cccch-----hhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccc-ccCCcc-CCccCCC-- Q lcl|Aclame:pro 1 MALNE-----GQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT-QEGWDL-TDKATGL-- 70 (430) Q Consensus 1 MAn~~-----~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~-~~g~~~-s~~~~d~-- 70 (430) +++.. +.++ +.+..++++.++...++.++++.+.- .+...++|+|..... .-.+.. ....++. T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~ 192 (415) T protein:vir:98 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV-------TNGSGKYPVVRQSEVAALEKVEELEENPELAV 192 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeec-------cCCceeEEEEeecCCccceeeccccccCcccc Confidence 22211 1134 35567788888888888887754211 233344555432211 111111 1122211 Q ss_pred -CcceEEEEeccccccceEecHHHhccHHH-HHHHHH-HHHHHHHHHHHHHHHHHHHhcccce-----eeccCCCCCCCC Q lcl|Aclame:pro 71 -LELNVAVNMGEPDNDFFQLRADDLRDETA-YRHRIQ-SAARKLANNVELKVANMAAEMGSLV-----ITSPDAIGTNTA 142 (430) Q Consensus 71 -~e~sV~v~l~~~k~V~~~~t~keL~~~~~-~~r~l~-pAm~~LAn~Id~dl~~~~~~~as~~-----~~~~~~~~~~~~ 142 (430) .-.++.+++.+-. .-+.+|.+=|.+..+ .+.+|. .-.+.++..+|..++.-.-...... ..........+. 
T Consensus 193 ~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~ 271 (415) T protein:vir:98 193 KPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKA 271 (415) T ss_pred cceeeEEeeeeeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccc Confidence 1234444443332 224444443443322 455554 3446778888887764221111000 001111122234 Q ss_pred CchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecc Q lcl|Aclame:pro 143 DAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTK 222 (430) Q Consensus 143 ~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~ 222 (430) ..|+++.++...|.+..... -..+++|.+...+. . +...+ ||++ +... + . T Consensus 272 ~~~~~i~~~~~~~~~~~~~~---~~~v~n~~~~~~l~-~---lkd~~------------G~~l------~~~~-~---~- 321 (415) T protein:vir:98 272 KSLDDIKDAINLNVKPNYEH---NVAIVSQTMFAKLD-K---MKDKL------------GNYL------IQPD-V---K- 321 (415) T ss_pred cchhHHHHHHHhhhhhccCC---CEEEEcHHHHHHHH-H---hhccC------------Ccee------eccC-c---C- Confidence 56888888777776655543 23789998876553 2 21111 3322 1110 0 1 Q ss_pred cccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeec Q lcl|Aclame:pro 223 STATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD 302 (430) Q Consensus 223 gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~ 302 (430) .+.+.++.|-+-. ..+.. ..-.+||..-+-| +..+|++..+-. 
T Consensus 322 -~~~~~~l~G~pV~-----~~~~~------------------~~~~~~~~~~~~G-------------d~~~~~~~~~~~ 364 (415) T protein:vir:98 322 -EKTQQRLLGAKIE-----ILPDE------------------VLGQKGNNTLIIG-------------NLKDAIVLFDRS 364 (415) T ss_pred -CCCCceecceeeE-----Eeccc------------------ccCCCCccEEEEE-------------ehhccEEEEeec Confidence 1122233332210 00000 0001244433333 333433333222 Q ss_pred CceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhcee Q lcl|Aclame:pro 303 GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKT 378 (430) Q Consensus 303 ~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~ 378 (430) +-++..++... ++.. - -++.-++. -..|++||+..+-.-+.-.+|+-+... T Consensus 365 ~~~v~~~~~~~-------------~~~~-----~-~~~~r~d~------~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:98 365 QYQASWTDYMH-------------FGEC-----L-MIAVRQDC------RILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ceEEEEecccc-------------CceE-----E-EEEEEecc------EEeccccEEEEEEeccCCCCCccccCC Confidence 22333322110 0000 0 01111121 456889999988886543344433333 No 99 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=22.27 E-value=2.5 Score=18.43 Aligned_cols=273 Identities=11% Similarity=0.021 Sum_probs=99.3 Q ss_pred Cccc----hhhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcc--cccccCCccC-CccC--C- Q lcl|Aclame:pro 1 MALN----EGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQE--SPTQEGWDLT-DKAT--G- 69 (430) Q Consensus 1 MAn~----~~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~--~~~~~g~~~s-~~~~--d- 69 (430) |+.. -+.++ +.+..++++.++...++.++++...- .+++-.+++|.. ....-++... 
...+ + T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~-------~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 77 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENV-------TTLTGSRVYEKWTDITGLANIDDEAGKIADIDD 77 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeec-------cCCcceEEEEeecCCCcceeeecCCcccccccc Confidence 4443 22233 56667899999999999888754321 112223333321 1111112111 1111 1 Q ss_pred CCcceEEEEeccccccceEecHHHhccHHH-HHHHH-HHHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhH Q lcl|Aclame:pro 70 LLELNVAVNMGEPDNDFFQLRADDLRDETA-YRHRI-QSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNF 147 (430) Q Consensus 70 ~~e~sV~v~l~~~k~V~~~~t~keL~~~~~-~~r~l-~pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d 147 (430) ..=..+.++..+-. .-+.+|.+=|++..+ .+.+| +...++++..+|..++.- .....+......|.| T Consensus 78 ~~~~~i~l~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g----------~~~~~~~~~~~~~d~ 146 (293) T protein:vir:48 78 PKLSLIKYTIKRYA-GISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGV----------VDKLPTKPTLTKWDD 146 (293) T ss_pred cceeEEEEeeeEEE-EeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhc----------cccccccccccCHHH Confidence 12234444443333 235565555555332 34444 334456777777766521 111222334567999 Q ss_pred HHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch--hhhhhhhccccccchhhhHHH-hCCCcceecccc Q lcl|Aclame:pro 148 VADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI--PEEAYRDGTIQRQVAGFDDVL-RSPKLPVLTKST 224 (430) Q Consensus 148 ~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~--~~~a~r~g~igr~~~Gfd~~~-~~~~~~~~~~gt 224 (430) +.++...|.....+. -..+++|.+.+.+. .+. ..++. -...+.+|.-++ +.|+--+. .+..++... . 
T Consensus 147 i~~~~~~l~~~~~~~---a~~vmn~~~~~~L~-~lk---d~~g~~l~~~~~~~~~~~~-l~G~Pv~~~~~~~~~~~~--~ 216 (293) T protein:vir:48 147 IIDLEAKVDPAIKQT---SFFLTNTSGFTALK-KVK---NALGDYLMERDVKSPTGYS-IAGFAVKEISDRWLPNAS--S 216 (293) T ss_pred HHHHHHhhhhhhcCC---CEEEEcHHHHHHHH-Hhh---ccCCceEeecCcCCCCCce-ecceeeEEecccccCCcc--C Confidence 988888776544332 34789999877653 221 11110 011122232222 33432110 011111000 0 Q ss_pred cccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCc Q lcl|Aclame:pro 225 ATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGT 304 (430) Q Consensus 225 ~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~ 304 (430) +....+-|.-+ .. ..+.....-+++. ....+ +.-..+...|++..-.++ T Consensus 217 ~~~~~~~gd~~--~~-----------------~~~~~~~~~~i~~---~~~~~--------~~~~~~~~~~r~~~r~d~- 265 (293) T protein:vir:48 217 GVMPLYFGDLK--QA-----------------VTLFDRQQMSLLS---TNIGG--------GAFETDTTKVRVIDRFDV- 265 (293) T ss_pred CceEEEEEecc--ce-----------------EEEEEecceEEEE---ecccc--------hhhhcCeEEEEEEEeeCc- Confidence 00000000000 00 0000000000000 00011 111112223444433222 Q ss_pred eeEEeeccccccccccccccccccccccccccCceeEE Q lcl|Aclame:pro 305 HVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNI 342 (430) Q Consensus 305 ~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv 342 (430) ....|.-+... 
.+..+...|++..+.-| T Consensus 266 -~~~~~~a~~~l---------~~~~~~~~~~~~~~~~~ 293 (293) T protein:vir:48 266 -VATDTEAFVPA---------SFKAIADQKGNIGSTAV 293 (293) T ss_pred -EEecccceEEE---------EeeccccCCccccccCC Confidence 11222211100 01111222333222212 No 100 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=22.03 E-value=2.5 Score=18.39 Aligned_cols=266 Identities=9% Similarity=0.035 Sum_probs=104.6 Q ss_pred Cccc-------hhhHHHHHHHHHHHH-HHhhcccchhhcccCChHHHHhhcCCEEEEecCccccc--cc-----CCccCC Q lcl|Aclame:pro 1 MALN-------EGQIVTLAVDEIIET-ISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT--QE-----GWDLTD 65 (430) Q Consensus 1 MAn~-------~~~~~~~~~~~vl~~-l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~--~~-----g~~~s~ 65 (430) ++.. ...+++..+.+.|.. .+...++.++++... -.+.++.+|+-..... .. .+...+ T Consensus 121 ~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg 193 (419) T protein:vir:94 121 RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQN-------ADYNVLEYIRDTSGTAGAGSTWNKAAVVPEG 193 (419) T ss_pred cccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeee-------ccCCceeeeeeccccccccccCcccceecCC Confidence 1111 111233333443333 345556666664321 1344555554211111 00 011000 Q ss_pred ---ccCCCCcceEEEEeccccccceEecHHHhccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh--ccc-----ce--ee Q lcl|Aclame:pro 66 ---KATGLLELNVAVNMGEPDNDFFQLRADDLRDETAYRHRIQS-AARKLANNVELKVANMAAE--MGS-----LV--IT 132 (430) Q Consensus 66 ---~~~d~~e~sV~v~l~~~k~V~~~~t~keL~~~~~~~r~l~p-Am~~LAn~Id~dl~~~~~~--~as-----~~--~~ 132 (430) ...++.=..+.+++.+-.. -+.+|.+=|++..+.+.+|.. ..++++..+|..++.---. ..+ .. .. 
T Consensus 194 ~~~~~~~~~~~~i~~~~~k~~~-~~~is~ell~d~~~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~ 272 (419) T protein:vir:94 194 TAKPQSTLSFDTITTTLKTVAH-WLPITRQAADDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQ 272 (419) T ss_pred ccccccccceeeEEeeeeeEEE-eehhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccc Confidence 1122222244444433332 345554445544455666655 5678888999888631000 000 00 00 Q ss_pred cc-CCCCCCCCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHH Q lcl|Aclame:pro 133 SP-DAIGTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDV 211 (430) Q Consensus 133 ~~-~~~~~~~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~ 211 (430) .+ ..........+.++..+...+.....+. -..+++|.+...+.. +...+ |+++ .+ T Consensus 273 ~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~---~~~v~n~~~~~~l~~----~k~~~------------~~~~---~~- 329 (419) T protein:vir:94 273 QPKPTAPATDEPPLVDIRRAKTVAEIAGFPP---DGVVVHPQDWESIEL----DQAPG------------SGVF---RV- 329 (419) T ss_pred ccccccccccchhHHHHHHHHHhhhhccCCC---CEEEEcHHHHHHHHH----HhhcC------------CCce---ee- Confidence 00 0000111223667777777776655543 248899987665532 11111 1111 00 Q ss_pred HhCCCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccc Q lcl|Aclame:pro 212 LRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQ 291 (430) Q Consensus 212 ~~~~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~ 291 (430) ..+ +. . |..-+|-|+. 
T Consensus 330 ~~~--~~-----~---------------------------------------------~~~~~l~G~p------------ 345 (419) T protein:vir:94 330 IAN--VQ-----G---------------------------------------------EATPRIWGLN------------ 345 (419) T ss_pred cCC--cc-----c---------------------------------------------CCCcccccee------------ Confidence 000 00 0 0111233310 Q ss_pred cceEEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCC Q lcl|Aclame:pro 292 DATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHE 371 (430) Q Consensus 292 l~~fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~ 371 (430) .+++... |++ . + ++|. .++++.++- T Consensus 346 ---V~~~~~~--------------------------------~~~-~-~-~~gd---------~~~~~~~~~-------- 370 (419) T protein:vir:94 346 ---VVSTVAI--------------------------------AQG-T-A-LVGG---------FRQGATLWS-------- 370 (419) T ss_pred ---eEEcCCC--------------------------------CCc-c-E-EEee---------ccceEEEEE-------- Confidence 0011000 000 0 0 0110 111111111 Q ss_pred chhhceeEEEecCCCcEEEEEEee--cccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 372 LFAGMKTTSFSIPDVGLNGIFATQ--GDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 372 ~~~~~~~~~~~~~~~Glsirv~~~--yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) ..|+++.+..+ .++.++...+|+..-+|+++++|+- .+++-.-.| T Consensus 371 -------------~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a-~~~~~~~aa 417 (419) T protein:vir:94 371 -------------RQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKA-FVRVTFAAA 417 (419) T ss_pred -------------ecceEEEEeccccchhhcCcEEEEEEEeeccEEecccc-EEEEEeccC Confidence 11344443332 2245667777777778888888875 355555555 No 101 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=21.55 E-value=2.6 Score=18.32 Aligned_cols=287 Identities=9% Similarity=0.024 Sum_probs=99.0 Q ss_pred 
Cccchh-hHHHHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCcccccc----cCCccC----CccCCCC Q lcl|Aclame:pro 1 MALNEG-QIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ----EGWDLT----DKATGLL 71 (430) Q Consensus 1 MAn~~~-~~~~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~----~g~~~s----~~~~d~~ 71 (430) |.-.-+ -+-+.+.+++++.+++..++.+++++. +- .+..+++|+-...... .|.... ...+... T Consensus 20 ~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~-~~------~~~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~~~~ 92 (333) T protein:vir:78 20 LAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQI-PI------SYGETIIPTTVKRPEVGQVGVGTSNEQREGGLKPLSG 92 (333) T ss_pred eecCCccccchhHHHHHHHHHHhhchhhhhccee-ec------cCCceEEEEEeCCceeEeecCcccccccccccccccc Confidence 221112 233667899999999999999988642 21 2456677764322211 111110 0011111 Q ss_pred cceEEEEecccc-ccceEecHHHhccH-HHHHHHHH-HHHHHHHHHHHHHHHHHHHhcc-cceeeccC-----------C Q lcl|Aclame:pro 72 ELNVAVNMGEPD-NDFFQLRADDLRDE-TAYRHRIQ-SAARKLANNVELKVANMAAEMG-SLVITSPD-----------A 136 (430) Q Consensus 72 e~sV~v~l~~~k-~V~~~~t~keL~~~-~~~~r~l~-pAm~~LAn~Id~dl~~~~~~~a-s~~~~~~~-----------~ 136 (430) ..--++++.-.| .+-+.+|.+=|++. ...+++|+ .-.++++..+|..+++---... ....+... . T Consensus 93 ~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~~~~~~~~ 172 (333) T protein:vir:78 93 TAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIANTTNVDY 172 (333) T ss_pred cceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcccccccccccccccccccc Confidence 122233442222 23344444434442 23445554 3446788888887763110000 00000000 0 Q ss_pred CCCCCCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccch--hhhhhhhccccccchhhhHHHhC Q lcl|Aclame:pro 137 IGTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI--PEEAYRDGTIQRQVAGFDDVLRS 214 (430) Q Consensus 137 ~~~~~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~--~~~a~r~g~igr~~~Gfd~~~~~ 214 (430) ........++++..+...+..+.- . .....+++|.+...|. .+..+...++. 
-..-...|.-++ +.|+- +..+ T Consensus 173 ~~~~~~~~~~~i~~~~~~~~~~~~-~-~~~~~vmn~~~~~~L~-~~~~~~d~~G~~i~~~~~~~~~~~~-l~G~P-v~~~ 247 (333) T protein:vir:78 173 LQETGDPLLDRLLDGYDLVSANTD-V-EFNGWAVDPRFRAHLL-RAQAYRDANGNVDPSRINLAAQTGD-VLGLP-AQFG 247 (333) T ss_pred cccccchhHHHHHHHHHhhccccc-c-CceEEEEcchHHHHHH-HHhhhcCCCCceeecCccccCCCce-eecee-eEEc Confidence 011112235566666555543321 1 1234788998877664 33222221110 001111222222 33432 2222 Q ss_pred CCcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccce Q lcl|Aclame:pro 215 PKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDAT 294 (430) Q Consensus 215 ~~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~ 294 (430) ++++.-. +....+...-|-| +... T Consensus 248 ~~i~~~~-------------------------------------------~~~~~~~~~~~~g-------------D~~~ 271 (333) T protein:vir:78 248 RAVGGDL-------------------------------------------GAAVDSKTRIIGG-------------DFSQ 271 (333) T ss_pred cccCCCc-------------------------------------------cccCCCccEEEEE-------------eccc Confidence 2222110 0000011111222 1112 Q ss_pred EEEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceee-eeecccceeEEEecccCC Q lcl|Aclame:pro 295 FSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTN-VFWADDAIRIVSQPIPAN 369 (430) Q Consensus 295 fvVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~N-laFhr~A~~Latrpl~~p 369 (430) |.+ .+..+-++.+++..-+.+. +... ++. ....-|.+.. ..+.. 
..-|++||+..+-.= .| T Consensus 272 ~~~-g~~~~~~i~~~~~~~~~~~-~~~~-------~~~--~~~~~v~~r~--~~r~d~~v~~~~a~~~l~~~~-a~ 333 (333) T protein:vir:78 272 LKF-GFADEIRIKMSDTATLTDS-GSAT-------VSM--WQTNQIAILI--EVTFGWLLGDKQAFVKFVDDE-QP 333 (333) T ss_pred EEE-EEeeccEEEEecccccccc-ccce-------eeh--hhcCcEEEEE--EEEEccEEecccceEEEeccC-CC Confidence 221 1212222222222100000 0000 000 0000011100 00111 123334444332111 11 No 102 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=21.02 E-value=2.7 Score=18.24 Aligned_cols=285 Identities=11% Similarity=0.009 Sum_probs=101.1 Q ss_pred Cccc-hhh-HH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccc----ccCCccCCccCCCCcc Q lcl|Aclame:pro 1 MALN-EGQ-IV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT----QEGWDLTDKATGLLEL 73 (430) Q Consensus 1 MAn~-~~~-~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~----~~g~~~s~~~~d~~e~ 73 (430) ||.- -+. ++ +.+.+++|+.+++..++.++++.. +- .+..+++|+-..... ..|...+ ..+..=. T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i-~~------~~~~~~~p~~~~~~~a~wv~Eg~~~~--~~~~~f~ 71 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAE-PQ------EFGEQQYMTLTAPPRGEVVGEGAQKS--ESTATFA 71 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhccee-ec------CCCceEEEEEeCCceeEEeecCcccc--cccceee Confidence 6665 333 22 667799999999999999998643 21 234577776321111 1222111 1232223 Q ss_pred eEEEEeccccccceEecHHHhc--cHH--HHHHHH-HHHHHHHHHHHHHHHHHHHHhccc-ceeeccC-------CCCCC Q lcl|Aclame:pro 74 NVAVNMGEPDNDFFQLRADDLR--DET--AYRHRI-QSAARKLANNVELKVANMAAEMGS-LVITSPD-------AIGTN 140 (430) Q Consensus 74 sV~v~l~~~k~V~~~~t~keL~--~~~--~~~r~l-~pAm~~LAn~Id~dl~~~~~~~as-~~~~~~~-------~~~~~ 140 (430) ++.++..+- .+-+.+|.|=|+ .++ ..+++| +...++|+..+|..++.---.... ...++.. ..... 
T Consensus 72 ~v~l~~~kl-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~ 150 (311) T protein:vir:81 72 PVTAIPRKV-QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT 150 (311) T ss_pred EEEEeeEEE-EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeec Confidence 444444232 233444444332 221 234444 556678888888877632100000 0000000 00011 Q ss_pred C---CCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccc-hh-hhhhhhccccccchhhhHHHhCC Q lcl|Aclame:pro 141 T---ADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IP-EEAYRDGTIQRQVAGFDDVLRSP 215 (430) Q Consensus 141 ~---~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~-~~-~~a~r~g~igr~~~Gfd~~~~~~ 215 (430) . ...+.++..+...+...+... ...+++|.+...+.. + ...++ -. ......+.-++ +.|+- +..++ T Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~~---~~~vmn~~~~~~l~~-l---kd~~G~~l~~~~~~~~~~~t-l~G~P-v~~~~ 221 (311) T protein:vir:81 151 TGTSATPDLAVEAAVGLVLGDNLSP---DGVALDNTFSFMLAT-Q---RDSQGRKLYPELGFGTDVAS-FAGLN-AAVSD 221 (311) T ss_pred ccccchHHHHHHHHHHHhhhcCCCc---eEEEEcHHHHHHHHh-h---hccCCCeeecCccccCCCce-eccee-EEecc Confidence 1 122445555555555444332 348999998876632 2 11110 00 00111111111 22322 11122 Q ss_pred CcceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceE Q lcl|Aclame:pro 216 KLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATF 295 (430) Q Consensus 216 ~~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~f 295 (430) .++...... ... . 
.......++..-+.| +-.+| T Consensus 222 ~i~~~~~~~--------~~~---------~-----------------~~~~~~~~~~~~~~g-------------Dfs~~ 254 (311) T protein:vir:81 222 TVRGGPEAV--------TAS---------T-----------------GVYRTTNPNVKAIAG-------------DFSAF 254 (311) T ss_pred ccccccccc--------ccc---------c-----------------chhcccCCccEEEEE-------------ecccE Confidence 221100000 000 0 000011122222222 22333 Q ss_pred EEEEeecCceeEEeeccccccccccccccccccccccccccCceeEEecCCCceee-eeecccceeEEEecccC Q lcl|Aclame:pro 296 SVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTN-VFWADDAIRIVSQPIPA 368 (430) Q Consensus 296 vVt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~N-laFhr~A~~Latrpl~~ 368 (430) ++ ....+-++.+.+..- ++ +.++. .....|.+.. ..+.. -..+.+||+..+..-.. T Consensus 255 ~i-~~~~~~~~~~~~~~~----~~--------~~~~~--~~~~~v~~r~--~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 255 RW-GVQVSIPLELIEFGD----PD--------GLGDL--KRQNQIAIRA--EVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred EE-EEeccceEEEeccCC----CC--------cchhh--hhcCcEEEEE--EEEeccEeecccceEEEEeeccC Confidence 32 122233344433210 00 00000 0000111100 00000 11223333333222111 No 103 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=20.91 E-value=2.7 Score=18.23 Aligned_cols=291 Identities=10% Similarity=-0.051 Sum_probs=98.1 Q ss_pred Cccchhh----HH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccc----ccccCCccCCccCCCC Q lcl|Aclame:pro 1 MALNEGQ----IV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQES----PTQEGWDLTDKATGLL 71 (430) Q Consensus 1 MAn~~~~----~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~----~~~~g~~~s~~~~d~~ 71 (430) ||+.... ++ +.+.+++|+.+++..++-++++.. +- .+..+++|+=... -...|... ..++.. 
T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i-~~------~~~~~~ip~~~~~~~a~wv~Eg~~~--~~s~~~ 71 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQ-PT------IFGPVKGAVFSGVPRAKIVGEGEVK--PSASVD 71 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhccee-ec------CCCceEEEEEeCCcceEEeeCCccc--cccccc Confidence 9976321 33 566688999999999998888643 21 2345677662111 11112111 112222 Q ss_pred cceEEEEeccccccceEecHHHhccH--H---HHHHHHH-HHHHHHHHHHHHHHHHHHHhccc-ceeeccCC------CC Q lcl|Aclame:pro 72 ELNVAVNMGEPDNDFFQLRADDLRDE--T---AYRHRIQ-SAARKLANNVELKVANMAAEMGS-LVITSPDA------IG 138 (430) Q Consensus 72 e~sV~v~l~~~k~V~~~~t~keL~~~--~---~~~r~l~-pAm~~LAn~Id~dl~~~~~~~as-~~~~~~~~------~~ 138 (430) =..+.++.-+- .+-+.+|.+=|++. + ..+.+|. .-.++|+.++|..+++---..++ ...+.... .. T Consensus 72 f~~v~l~~~kl-~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~ 150 (315) T protein:vir:80 72 VSAFTAQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIV 150 (315) T ss_pred eeeeEeeeeeE-EeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCcccccccccccccccee Confidence 22333333221 22344444434322 2 1344443 44567888888776521000000 00000000 00 Q ss_pred CCCCCchhHHHHHHHHHHHhCCCcCCCcEEEecHHHHHHHHHhhhh--hhhccchhhhhhhhccccccchhhhHHHhCCC Q lcl|Aclame:pro 139 TNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTK--RDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPK 216 (430) Q Consensus 139 ~~~~~~~~d~a~a~~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~--l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~ 216 (430) ..+...+.|+..+-..+.....-. ....+++|.....+...... 
..............|.-++ +.|+- +..+++ T Consensus 151 ~~~~~~~~d~~~~~~~~~~~~~~~--~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~t-l~G~P-V~~~~~ 226 (315) T protein:vir:80 151 DATDSATADLVKAVGLIAGAGLQV--PNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDN-WRGLN-VGASST 226 (315) T ss_pred eccccchHHHHHHHHHHhhccCcc--ceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCce-eccee-eEecCc Confidence 111233677776666655433332 22388999987766422111 0000000001111222222 34443 222333 Q ss_pred cceecccccccceecccceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEE Q lcl|Aclame:pro 217 LPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFS 296 (430) Q Consensus 217 ~~~~~~gt~~~~tV~ga~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fv 296 (430) ++........+. ....- .|.....+.....-.++.=|--+-.++ .++- -..+.-.|+ T Consensus 227 ~~~~~~~~~~~~---------~~~~~---------GDfs~~~~g~~~~~~i~i~~~~~~~~~-~~~~----~~~~~v~~r 283 (315) T protein:vir:80 227 VSGAPEMSPASG---------VKAIV---------GDFSRVHWGFQRNFPIELIEYGDPDQT-GRDL----KGHNEVMVR 283 (315) T ss_pred CCcccccccccc---------cEEEE---------eecccEEEEEecCeeEEEeccccccCc-ccch----hhcCcEEEE Confidence 331111100000 00000 000000000000000000000000000 0000 000111234 Q ss_pred EEEeecCceeEEeeccccccccccccccccccccccccccC Q lcl|Aclame:pro 297 VVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADA 337 (430) Q Consensus 297 Vt~~~~~~~v~I~p~~v~~~~~~~~~~~~~~~nVsa~pA~~ 337 (430) ++....+. 
|.---+++.+...++ + ....||.| T Consensus 284 ~~~r~~~~-v~~~~a~~~l~~~~a-~-------~~~~~~~~ 315 (315) T protein:vir:80 284 AEAVLYVA-IESLDSFAVVKEKAA-P-------KPNPPAEN 315 (315) T ss_pred EEEEecce-eecccceEEEeeccC-C-------CCCCCCCC Confidence 43332221 111011111111111 0 11224444 No 104 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=20.71 E-value=2.7 Score=18.20 Aligned_cols=260 Identities=11% Similarity=0.073 Sum_probs=113.4 Q ss_pred Cccch-hhHH-HHHHHHHHHHHHhhcccchhhcccCChHHHHhhcCCEEEEecCccccccc-CCc-cCCc--cCCCCcce Q lcl|Aclame:pro 1 MALNE-GQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE-GWD-LTDK--ATGLLELN 74 (430) Q Consensus 1 MAn~~-~~~~-~~~~~~vl~~l~~~~Vma~lV~~y~~~~~~~~k~GdTV~ip~P~~~~~~~-g~~-~s~~--~~d~~e~s 74 (430) +.... +.++ +.+..++++.++...++.++++...- .+.++++|++....... ++. .... ..++.=.. T Consensus 117 ~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~ 189 (421) T protein:vir:13 117 MSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPV-------NRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQP 189 (421) T ss_pred cccCCcceecchhhHHHHHHHHHhhhhhhhhceeeec-------cCCceEEEEeecCCccceeeccccccccccccceeE Confidence 11111 1133 44456777778888888888854321 24567777754433221 111 1111 11222223 Q ss_pred EEEEeccccccceEecHHHhccHHH-HHHHHH-HHHHHHHHHHHHHHHHHHHhcccceeeccCCCCCCCCCchhHHHHHH Q lcl|Aclame:pro 75 VAVNMGEPDNDFFQLRADDLRDETA-YRHRIQ-SAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAE 152 (430) Q Consensus 75 V~v~l~~~k~V~~~~t~keL~~~~~-~~r~l~-pAm~~LAn~Id~dl~~~~~~~as~~~~~~~~~~~~~~~~~~d~a~a~ 152 (430) +.+++.+-.+ -+.+|.+=|.+..+ .+.+|. ...+.++..+|..+++.... + ....+...|.++..+. 
T Consensus 190 i~~~~~k~~~-~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g-------~---~~~~~~~~~d~i~~~~ 258 (421) T protein:vir:13 190 MAYDIDDYGL-LAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAKA-------V---LAEETINDYAGLVKTI 258 (421) T ss_pred EEeeeeeeEe-ehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhh-------c---cccccccchHHHHHHH Confidence 4444433332 24444443444322 344443 34467778888877653221 1 1123345688888877 Q ss_pred HHHHHhCCCcCCCcEEEecHHHHHHHHHhhhhhhhccchhhhhhhhccccccchhhhHHHhCCCcceecccccccceecc Q lcl|Aclame:pro 153 ELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSG 232 (430) Q Consensus 153 ~~L~~~~aP~~~~R~~vl~p~~~a~~~~~~~~l~~~~~~~~~a~r~g~igr~~~Gfd~~~~~~~~~~~~~gt~~~~tV~g 232 (430) ..|.....+. -..++||.+...+. . +... -||++ |.. + ..+++. T Consensus 259 ~~l~~~~~~~---a~~v~n~~~~~~l~-~---lkd~------------~G~~i------~~~---~----~~~~~~---- 302 (421) T protein:vir:13 259 NSLVPNARKR---AIIVTNSDGRAYLD-G---LMDK------------QGRPL------LKE---L----SDGGDL---- 302 (421) T ss_pred HHhhhhhcCC---CEEEEcHHHHHHHH-H---hhcC------------CCcee------ecC---c----CCCCCc---- Confidence 7777655553 23789998876553 2 2111 13322 211 0 011111 Q ss_pred cceeeeeEEEEeeccccccccceeeEEEeeccceeecccEEEEcceeeccccccccccccceEEEEEeecCceeEEeecc Q lcl|Aclame:pro 233 AQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKP 312 (430) Q Consensus 233 a~~~~~~~~t~~~~~~~~~~d~~~~~~~~s~tgtlk~GDv~TiaGV~~v~~~tk~~~~~l~~fvVt~~~~~~~v~I~p~~ 312 (430) +|-|. 
| .+++ +.+ T Consensus 303 -----------------------------------------tl~G~----p-----------V~~~-----------~~~ 315 (421) T protein:vir:13 303 -----------------------------------------VFKGR----P-----------VIEL-----------EES 315 (421) T ss_pred -----------------------------------------eecce----e-----------eEEe-----------ccc Confidence 22230 0 0000 000 Q ss_pred ccccccccccccccccccccccccCceeEEecCCCceeeeeecccceeEEEecccCCCCchhhceeEEEecCCCcEEEEE Q lcl|Aclame:pro 313 VALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIF 392 (430) Q Consensus 313 v~~~~~~~~~~~~~~~nVsa~pA~~aavTv~~~~~~~~NlaFhr~A~~Latrpl~~p~~~~~~~~~~~~~~~~~Glsirv 392 (430) +. ++.+...-++|.- ++++.+.. ..|+++.+ T Consensus 316 -~~------------------~~~~~~~~~~gd~---------~~~~~~~~---------------------~~~~~v~~ 346 (421) T protein:vir:13 316 -IF------------------DVGDETKFIVSDF---------KTLIKFMD---------------------RKQYLIDQ 346 (421) T ss_pred -cc------------------cCCCceEEEEEec---------cccEEEEE---------------------ecceEEEe Confidence 00 0000000011100 00111110 11455555 Q ss_pred EeecccccceEEEEEEeecCceeeCcceeEEEcCCCCC Q lcl|Aclame:pro 393 ATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) Q Consensus 393 ~~~yd~~~~~~~~rldvlyG~~~v~Pe~agv~l~~q~~ 430 (430) ..+....++...+|+-.-+++++++||-...+-....+ T Consensus 347 ~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 384 (421) T protein:vir:13 347 SKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRKFG 384 (421) T ss_pred ecccccccCeeEEEEEeeecceeecchhhheeeecccc Confidence 55555556677777777778888888875443333322 Done!