Query lcl|NC_011054.1_cdsid_YP_002014223.1 [gene=BOOMER_7] [protein=gp7] [protein_id=YP_002014223.1] [location=5007..5915] Match_columns 302 No_of_seqs 131 out of 1092 Neff 9.8 Searched_HMMs 1612 Date Thu Nov 7 14:01:05 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_7 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_7_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:2504 Length: 305 # 100.0 9.7E-69 6E-72 393.5 30.2 301 1-301 1-305 (305) 2 protein:vir:105905 Length: 304 100.0 3E-58 1.9E-61 335.9 29.8 282 1-293 9-304 (304) 3 protein:vir:94142 Length: 304 100.0 3E-58 1.9E-61 335.9 29.8 282 1-293 9-304 (304) 4 protein:vir:7771 Length: 330 # 100.0 5.4E-58 3.3E-61 334.6 30.3 293 1-302 10-330 (330) 5 protein:vir:80684 Length: 315 100.0 1.2E-57 7.1E-61 332.7 29.1 291 1-302 1-314 (315) 6 protein:vir:8187 Length: 311 # 100.0 2.1E-57 1.3E-60 331.3 29.9 286 1-295 1-311 (311) 7 protein:vir:5739 Length: 366 # 100.0 4E-57 2.5E-60 329.8 28.7 284 1-294 64-366 (366) 8 protein:vir:41 Length: 299 # N 100.0 7.2E-57 4.5E-60 328.4 28.8 280 1-295 6-299 (299) 9 protein:vir:9574 Length: 300 # 100.0 1.1E-56 6.7E-60 327.4 29.6 285 1-294 1-300 (300) 10 protein:vir:9759 Length: 303 # 100.0 1.2E-56 7.2E-60 327.3 29.5 285 1-294 1-303 (303) 11 protein:vir:97148 Length: 324 100.0 2.5E-56 1.5E-59 325.5 29.1 286 1-302 27-323 (324) 12 protein:vir:1638 Length: 298 # 100.0 3.8E-56 2.4E-59 324.4 29.3 283 1-293 1-298 (298) 13 protein:vir:1433 Length: 435 # 100.0 2.3E-56 1.4E-59 325.7 27.6 284 1-296 132-435 (435) 14 protein:vir:2430 Length: 318 # 100.0 4.2E-56 2.6E-59 324.2 29.1 285 1-301 14-318 (318) 15 protein:vir:2344 Length: 397 # 100.0 2.1E-56 1.3E-59 325.9 27.3 285 1-302 10-321 (397) 16 protein:vir:80376 Length: 435 100.0 3.6E-56 2.2E-59 324.5 28.1 284 1-296 132-435 (435) 17 protein:vir:485 Length: 407 # 100.0 3.5E-56 2.2E-59 324.6 27.9 285 1-301 106-407 (407) 18 protein:vir:104085 Length: 320 100.0 7.4E-56 4.6E-59 322.8 29.5 287 1-301 14-320 (320) 19 protein:vir:99749 Length: 324 100.0 1.1E-55 6.9E-59 321.9 29.0 286 1-302 27-323 (324) 20 protein:vir:103955 Length: 324 100.0 1.2E-55 7.3E-59 321.7 29.1 286 1-302 27-323 (324) 21 protein:vir:78830 Length: 324 100.0 1.4E-55 8.6E-59 321.3 29.1 286 1-302 27-323 (324) 22 protein:vir:96392 Length: 324 100.0 1.4E-55 8.6E-59 321.3 29.1 286 1-302 27-323 (324) 23 protein:vir:4226 Length: 326 # 100.0 2.9E-55 1.8E-58 319.6 29.7 285 1-297 20-326 (326) 24 protein:vir:9309 Length: 324 # 100.0 2.6E-55 1.6E-58 319.8 29.2 286 1-302 27-324 (324) 25 protein:vir:4456 Length: 401 # 100.0 1.1E-55 7.1E-59 321.8 26.6 278 1-294 107-401 (401) 26 protein:vir:105038 Length: 428 100.0 1.8E-55 1.1E-58 320.7 27.6 285 1-294 125-428 (428) 27 protein:vir:96223 Length: 324 100.0 3.7E-55 2.3E-58 319.0 29.1 286 1-302 27-323 (324) 28 protein:vir:78223 Length: 333 100.0 4.7E-55 2.9E-58 318.4 29.2 294 1-295 10-333 (333) 29 protein:vir:94771 Length: 298 100.0 4.3E-55 2.7E-58 318.6 28.8 283 1-293 1-298 (298) 30 protein:vir:95763 Length: 297 100.0 3.9E-55 2.4E-58 318.9 28.5 277 1-294 9-297 (297) 31 protein:vir:100247 Length: 425 100.0 3.7E-55 2.3E-58 319.0 27.5 283 1-295 130-425 (425) 32 protein:vir:6242 Length: 390 # 100.0 4E-55 2.5E-58 318.8 27.2 272 1-295 110-390 (390) 33 protein:vir:78523 Length: 338 100.0 1.4E-54 8.9E-58 315.8 29.7 296 1-301 10-338 (338) 34 protein:vir:99920 Length: 311 100.0 1.1E-54 6.9E-58 316.4 28.5 285 1-294 1-311 (311) 35 protein:vir:7855 Length: 497 # 100.0 1.6E-54 1E-57 315.5 27.4 284 1-302 151-497 (497) 36 protein:vir:101650 Length: 497 100.0 1.6E-54 1E-57 315.5 27.4 284 1-302 151-497 (497) 37 protein:vir:102082 Length: 392 100.0 2.7E-53 1.7E-56 308.8 28.8 270 1-301 106-392 (392) 38 protein:vir:107593 Length: 392 100.0 2.7E-53 1.7E-56 308.8 28.8 270 1-301 106-392 (392) 39 protein:vir:102873 Length: 392 100.0 2.7E-53 1.7E-56 308.8 28.8 270 1-301 106-392 (392) 40 protein:vir:105004 Length: 392 100.0 2.7E-53 1.7E-56 308.8 28.8 270 1-301 106-392 (392) 41 protein:vir:4830 Length: 397 # 100.0 1.7E-53 1E-56 309.9 27.5 271 1-302 109-393 (397) 42 protein:vir:4997 Length: 397 # 100.0 1.9E-53 1.2E-56 309.6 27.5 271 1-302 109-393 (397) 43 protein:vir:4953 Length: 397 # 100.0 3.1E-53 1.9E-56 308.4 28.0 271 1-302 109-394 (397) 44 protein:vir:1328 Length: 392 # 100.0 2.4E-53 1.5E-56 309.0 26.6 272 1-295 110-392 (392) 45 protein:vir:1025 Length: 408 # 100.0 6.4E-53 4E-56 306.7 28.4 272 1-302 116-406 (408) 46 protein:vir:104256 Length: 458 100.0 5.4E-53 3.3E-56 307.1 27.8 282 1-294 162-458 (458) 47 protein:vir:4856 Length: 293 # 100.0 6.4E-53 4E-56 306.7 28.2 271 1-302 5-289 (293) 48 protein:vir:93616 Length: 645 100.0 8.9E-53 5.5E-56 305.9 28.0 280 1-300 338-645 (645) 49 protein:vir:102119 Length: 404 100.0 1.3E-52 8.3E-56 305.0 28.3 283 1-300 110-404 (404) 50 protein:vir:3845 Length: 395 # 100.0 1.8E-52 1.1E-55 304.2 27.9 272 1-302 107-391 (395) 51 protein:vir:4600 Length: 415 # 100.0 4E-52 2.5E-55 302.4 29.1 279 1-302 121-410 (415) 52 protein:vir:4700 Length: 415 # 100.0 4E-52 2.5E-55 302.4 29.1 279 1-302 121-410 (415) 53 protein:vir:100135 Length: 418 100.0 2.5E-52 1.6E-55 303.4 27.5 273 1-301 135-418 (418) 54 protein:vir:4092 Length: 390 # 100.0 3E-52 1.9E-55 303.0 27.3 283 1-302 84-375 (390) 55 protein:vir:8102 Length: 543 # 100.0 2E-52 1.2E-55 304.0 26.3 278 1-295 250-543 (543) 56 protein:vir:7409 Length: 408 # 100.0 4.8E-52 3E-55 302.0 28.1 272 1-302 116-404 (408) 57 protein:vir:81100 Length: 415 100.0 5.6E-52 3.5E-55 301.5 28.2 279 1-302 121-410 (415) 58 protein:vir:98339 Length: 415 100.0 5.6E-52 3.5E-55 301.5 28.2 279 1-302 121-410 (415) 59 protein:vir:79987 Length: 415 100.0 5.6E-52 3.5E-55 301.5 28.2 279 1-302 121-410 (415) 60 protein:vir:81160 Length: 371 100.0 6.6E-52 4.1E-55 301.2 28.4 263 1-294 91-371 (371) 61 protein:vir:3991 Length: 404 # 100.0 7.8E-52 4.9E-55 300.8 28.2 272 1-302 116-404 (404) 62 protein:vir:97053 Length: 390 100.0 6.3E-52 3.9E-55 301.3 27.1 267 1-292 113-390 (390) 63 protein:vir:1268 Length: 397 # 100.0 8.2E-52 5.1E-55 300.7 27.8 263 1-294 123-397 (397) 64 protein:vir:4339 Length: 395 # 100.0 8.3E-52 5.2E-55 300.6 27.8 272 1-294 113-395 (395) 65 protein:vir:4511 Length: 409 # 100.0 7E-52 4.3E-55 301.0 26.9 280 1-297 117-409 (409) 66 protein:vir:10364 Length: 390 100.0 1.2E-51 7.3E-55 299.8 27.5 267 1-292 113-390 (390) 67 protein:vir:191 Length: 385 # 100.0 1E-51 6.2E-55 300.2 26.7 270 1-295 105-385 (385) 68 protein:vir:1886 Length: 385 # 100.0 1E-51 6.2E-55 300.2 26.7 270 1-295 105-385 (385) 69 protein:vir:81070 Length: 390 100.0 1.9E-51 1.2E-54 298.6 27.4 267 1-292 113-390 (390) 70 protein:vir:95376 Length: 425 100.0 1.9E-51 1.2E-54 298.6 27.0 273 1-300 138-425 (425) 71 protein:vir:9410 Length: 415 # 100.0 5.4E-51 3.4E-54 296.2 28.5 279 1-302 121-410 (415) 72 protein:vir:81227 Length: 413 100.0 5.6E-51 3.5E-54 296.1 27.8 273 1-299 118-413 (413) 73 protein:vir:96762 Length: 632 100.0 5.2E-51 3.2E-54 296.3 25.4 269 1-293 357-632 (632) 74 protein:vir:6212 Length: 434 # 100.0 1.7E-50 1.1E-53 293.4 26.7 275 1-302 141-434 (434) 75 protein:vir:101291 Length: 381 100.0 1.2E-50 7.4E-54 294.3 25.6 285 1-302 76-378 (381) 76 protein:vir:9509 Length: 381 # 100.0 1.2E-50 7.4E-54 294.3 25.6 285 1-302 76-378 (381) 77 protein:vir:98635 Length: 377 100.0 2.8E-51 1.8E-54 297.7 21.8 276 1-294 79-377 (377) 78 protein:vir:100172 Length: 394 100.0 1.4E-49 8.7E-53 288.4 28.5 267 1-302 111-392 (394) 79 protein:vir:101607 Length: 379 100.0 7.3E-50 4.5E-53 290.0 26.2 262 1-294 107-379 (379) 80 protein:vir:1383 Length: 421 # 100.0 7.4E-50 4.6E-53 289.9 26.1 268 1-302 114-396 (421) 81 protein:vir:100884 Length: 389 100.0 1.9E-49 1.2E-52 287.6 27.7 266 1-301 109-389 (389) 82 protein:vir:100632 Length: 381 100.0 8E-50 4.9E-53 289.8 24.7 284 1-302 76-375 (381) 83 protein:vir:95963 Length: 395 100.0 6E-49 3.7E-52 285.0 26.2 280 1-302 86-386 (395) 84 protein:vir:8420 Length: 477 # 100.0 3.8E-49 2.4E-52 286.0 25.0 288 1-300 157-477 (477) 85 protein:vir:3870 Length: 400 # 100.0 7.4E-49 4.6E-52 284.5 26.4 257 1-295 133-400 (400) 86 protein:vir:78350 Length: 383 100.0 4.2E-49 2.6E-52 285.8 24.7 284 1-301 83-383 (383) 87 protein:vir:9643 Length: 377 # 100.0 8.7E-49 5.4E-52 284.1 26.1 277 1-294 79-377 (377) 88 protein:vir:9704 Length: 394 # 100.0 1.7E-48 1.1E-51 282.4 26.9 258 1-300 128-394 (394) 89 protein:vir:1084 Length: 437 # 100.0 1.3E-48 7.8E-52 283.2 24.8 268 1-302 156-436 (437) 90 protein:vir:78640 Length: 352 100.0 2E-48 1.2E-51 282.1 22.9 266 1-300 83-352 (352) 91 protein:vir:94673 Length: 419 100.0 3.7E-47 2.3E-50 275.2 27.3 277 1-296 123-419 (419) 92 protein:vir:2685 Length: 387 # 100.0 1.4E-47 8.9E-51 277.4 21.4 266 1-300 118-387 (387) 93 protein:vir:96978 Length: 387 100.0 1.4E-47 8.9E-51 277.4 21.4 266 1-300 118-387 (387) 94 protein:vir:94424 Length: 387 100.0 1.4E-47 8.9E-51 277.4 21.4 266 1-300 118-387 (387) 95 protein:vir:9361 Length: 402 # 100.0 1.5E-47 9.6E-51 277.2 20.8 264 1-300 133-402 (402) 96 protein:vir:93881 Length: 387 100.0 6.4E-47 4E-50 273.8 23.0 264 1-300 118-387 (387) 97 protein:vir:80128 Length: 466 100.0 2.6E-46 1.6E-49 270.5 23.3 286 1-302 148-457 (466) 98 protein:vir:962 Length: 397 # 100.0 8.7E-46 5.4E-49 267.6 23.8 255 1-294 132-397 (397) 99 protein:vir:4197 Length: 314 # 100.0 1.2E-37 7.3E-41 223.0 24.0 281 1-297 14-314 (314) 100 protein:vir:4159 Length: 315 # 100.0 4.5E-37 2.8E-40 219.8 22.7 275 1-292 19-315 (315) 101 protein:vir:97397 Length: 517 100.0 1.8E-35 1.1E-38 211.1 21.0 273 1-297 239-517 (517) 102 protein:vir:3158 Length: 321 # 100.0 1.4E-33 8.5E-37 200.8 25.7 287 1-302 18-321 (321) 103 protein:vir:9820 Length: 272 # 100.0 1.2E-30 7.2E-34 184.7 23.3 258 1-297 1-272 (272) 104 protein:vir:3033 Length: 272 # 100.0 1.2E-30 7.2E-34 184.7 23.3 258 1-297 1-272 (272) 105 protein:vir:4074 Length: 480 # 99.9 1.4E-30 8.5E-34 184.3 12.7 260 1-297 210-480 (480) 106 protein:vir:94933 Length: 330 99.9 3.5E-23 2.2E-26 143.7 19.9 277 1-295 25-330 (330) 107 protein:vir:3613 Length: 272 # 99.8 1.2E-22 7.2E-26 140.9 19.2 259 1-294 1-272 (272) 108 protein:vir:93742 Length: 274 99.8 3.2E-22 2E-25 138.4 21.5 260 1-298 1-274 (274) 109 protein:vir:80930 Length: 278 99.8 1.9E-20 1.2E-23 128.7 21.5 263 1-295 1-278 (278) 110 protein:vir:105334 Length: 276 99.8 1.1E-20 7E-24 130.0 19.9 262 1-302 1-276 (276) 111 protein:vir:96123 Length: 274 99.8 2.7E-20 1.7E-23 127.8 20.9 260 1-298 1-274 (274) 112 protein:vir:96833 Length: 275 99.8 2.1E-20 1.3E-23 128.4 19.7 260 1-298 1-275 (275) 113 protein:vir:94494 Length: 274 99.8 1E-19 6.4E-23 124.7 21.4 260 1-298 1-274 (274) 114 protein:vir:97433 Length: 274 99.8 1E-19 6.4E-23 124.7 21.4 260 1-298 1-274 (274) 115 protein:vir:97255 Length: 310 99.7 8.5E-19 5.3E-22 119.7 22.8 280 1-294 1-310 (310) 116 protein:vir:79928 Length: 393 99.7 1.1E-19 6.5E-23 124.6 17.3 292 1-302 59-386 (393) 117 protein:vir:1239 Length: 274 # 99.7 1E-18 6.3E-22 119.2 20.3 260 1-298 1-274 (274) 118 protein:vir:95898 Length: 274 99.7 1.5E-18 9.1E-22 118.3 20.9 260 1-298 1-274 (274) 119 protein:vir:96262 Length: 274 99.7 1.5E-18 9.1E-22 118.3 20.9 260 1-298 1-274 (274) 120 protein:vir:95107 Length: 270 99.6 2E-17 1.3E-20 112.1 18.9 259 1-299 1-270 (270) 121 protein:vir:99424 Length: 360 99.4 6.1E-14 3.8E-17 93.0 19.7 283 1-301 23-360 (360) 122 protein:vir:7990 Length: 273 # 99.4 8.7E-14 5.4E-17 92.2 18.8 253 1-294 1-273 (273) 123 protein:vir:105822 Length: 273 99.4 2.2E-13 1.4E-16 90.0 19.5 253 1-294 1-273 (273) 124 protein:vir:102605 Length: 273 99.4 2.2E-13 1.4E-16 90.0 19.5 253 1-294 1-273 (273) 125 protein:vir:739 Length: 231 # 99.4 4.9E-14 3.1E-17 93.5 15.8 222 35-294 1-231 (231) 126 protein:vir:108211 Length: 318 99.3 5.3E-13 3.3E-16 87.9 16.3 281 1-299 1-318 (318) 127 protein:vir:8324 Length: 410 # 99.1 2.9E-12 1.8E-15 83.9 14.6 260 1-292 131-410 (410) 128 protein:vir:94622 Length: 341 99.1 1.4E-11 8.7E-15 80.1 15.9 283 1-297 1-341 (341) 129 protein:vir:93858 Length: 400 99.1 5.3E-12 3.3E-15 82.4 13.4 268 1-292 117-400 (400) 130 protein:vir:8885 Length: 347 # 99.1 1.8E-11 1.1E-14 79.4 16.4 283 1-295 1-347 (347) 131 protein:vir:94711 Length: 347 99.1 1.2E-11 7.3E-15 80.5 14.7 282 1-295 1-347 (347) 132 protein:vir:10450 Length: 344 99.0 3E-11 1.8E-14 78.3 15.9 280 1-294 1-344 (344) 133 protein:vir:94576 Length: 347 99.0 6.4E-11 4E-14 76.5 17.8 281 1-294 1-347 (347) 134 protein:vir:100057 Length: 375 99.0 3.4E-10 2.1E-13 72.5 21.6 290 1-302 1-375 (375) 135 protein:vir:80213 Length: 334 99.0 1.1E-10 6.5E-14 75.3 18.6 281 1-299 1-334 (334) 136 protein:vir:6324 Length: 335 # 99.0 2.6E-10 1.6E-13 73.1 20.0 280 1-301 1-335 (335) 137 protein:vir:2201 Length: 345 # 99.0 1.1E-10 7E-14 75.1 17.2 278 1-294 1-345 (345) 138 protein:vir:78739 Length: 332 98.9 2.4E-10 1.5E-13 73.3 17.0 270 1-292 7-332 (332) 139 protein:vir:78935 Length: 335 98.9 7.7E-10 4.8E-13 70.5 19.6 283 1-301 1-335 (335) 140 protein:vir:102944 Length: 330 98.9 6.2E-10 3.8E-13 71.1 18.7 276 1-302 1-302 (330) 141 protein:vir:95318 Length: 328 98.9 2.9E-10 1.8E-13 72.9 16.7 225 1-232 6-328 (328) 142 protein:vir:3364 Length: 347 # 98.9 5.4E-10 3.3E-13 71.4 18.1 282 1-296 1-347 (347) 143 protein:vir:5974 Length: 324 # 98.9 1.6E-09 1E-12 68.8 20.0 271 1-302 1-296 (324) 144 protein:vir:80180 Length: 381 98.9 1.1E-09 6.7E-13 69.8 18.7 282 1-302 1-347 (381) 145 protein:vir:103323 Length: 364 98.9 1.2E-08 7.2E-12 64.1 23.7 285 1-302 1-345 (364) 146 protein:vir:103285 Length: 296 98.8 1.8E-09 1.1E-12 68.5 17.4 274 1-295 1-296 (296) 147 protein:vir:80068 Length: 301 98.8 5.4E-09 3.4E-12 65.9 19.4 274 3-292 1-301 (301) 148 protein:vir:79642 Length: 329 98.8 3.7E-09 2.3E-12 66.8 18.0 279 1-295 26-329 (329) 149 protein:vir:107687 Length: 319 98.8 3.1E-09 1.9E-12 67.2 17.5 274 1-292 21-319 (319) 150 protein:vir:1541 Length: 347 # 98.7 8.3E-09 5.1E-12 64.9 18.7 283 1-296 1-347 (347) 151 protein:vir:99675 Length: 324 98.7 2.7E-09 1.6E-12 67.6 15.3 255 34-302 1-306 (324) 152 protein:vir:1583 Length: 351 # 98.6 1.3E-08 8.3E-12 63.7 17.5 275 1-302 1-300 (351) 153 protein:vir:104342 Length: 314 98.6 1.1E-08 6.6E-12 64.3 16.8 273 1-295 19-314 (314) 154 protein:vir:107826 Length: 331 98.6 1.2E-08 7.2E-12 64.1 16.1 225 1-232 1-331 (331) 155 protein:vir:98525 Length: 331 98.6 1.2E-08 7.2E-12 64.1 16.1 225 1-232 1-331 (331) 156 protein:vir:107388 Length: 331 98.6 1.2E-08 7.2E-12 64.1 16.1 225 1-232 1-331 (331) 157 protein:vir:103759 Length: 330 98.5 2.7E-08 1.6E-11 62.1 15.3 225 1-232 1-330 (330) 158 protein:vir:3136 Length: 322 # 98.4 4.9E-08 3E-11 60.7 15.3 278 1-299 1-322 (322) 159 protein:vir:105645 Length: 400 98.4 1.3E-07 8.2E-11 58.3 16.8 291 1-302 1-341 (400) 160 protein:vir:9927 Length: 295 # 98.4 5.5E-08 3.4E-11 60.4 14.4 255 1-302 1-295 (295) 161 protein:vir:97031 Length: 402 98.4 1.4E-07 8.5E-11 58.2 16.6 285 1-302 1-348 (402) 162 protein:vir:7019 Length: 401 # 98.3 1.3E-07 8.1E-11 58.3 15.9 292 1-302 1-345 (401) 163 protein:vir:7324 Length: 335 # 98.3 1.4E-07 8.8E-11 58.1 15.0 225 1-233 1-335 (335) 164 protein:vir:102655 Length: 322 98.1 1.3E-06 8.2E-10 52.8 17.2 279 1-295 1-322 (322) 165 protein:vir:99075 Length: 392 98.1 1.7E-06 1.1E-09 52.2 17.4 269 1-302 1-316 (392) 166 protein:vir:8843 Length: 317 # 98.0 9.3E-06 5.8E-09 48.2 20.6 277 1-296 1-317 (317) 167 protein:vir:79548 Length: 652 97.8 7.9E-06 4.9E-09 48.6 16.9 271 1-291 359-652 (652) 168 protein:vir:94070 Length: 339 97.8 8.8E-06 5.5E-09 48.3 16.6 277 1-292 35-339 (339) 169 protein:vir:9875 Length: 296 # 97.8 3.9E-06 2.4E-09 50.3 14.2 258 1-295 1-296 (296) 170 protein:vir:5255 Length: 304 # 97.7 9.7E-06 6E-09 48.1 15.9 273 1-291 1-304 (304) 171 protein:vir:106647 Length: 303 97.5 7.6E-06 4.7E-09 48.6 12.1 261 1-300 1-303 (303) 172 protein:vir:101557 Length: 336 97.5 7.2E-06 4.5E-09 48.8 11.8 276 1-292 31-336 (336) 173 protein:vir:95512 Length: 693 97.4 5.3E-05 3.3E-08 44.0 16.2 273 1-292 394-693 (693) 174 protein:vir:3643 Length: 336 # 97.3 1.2E-05 7.5E-09 47.5 11.0 274 1-292 31-336 (336) 175 protein:vir:103886 Length: 302 97.0 0.00021 1.3E-07 40.8 16.2 274 1-302 1-302 (302) 176 protein:vir:78558 Length: 336 97.0 5.3E-05 3.3E-08 44.0 11.9 276 1-292 31-336 (336) 177 protein:vir:107732 Length: 379 97.0 8.8E-05 5.5E-08 42.8 13.1 276 1-292 56-379 (379) 178 protein:vir:99576 Length: 388 96.7 9.6E-05 6E-08 42.6 11.7 282 1-292 65-388 (388) 179 protein:vir:107120 Length: 329 96.7 0.00041 2.5E-07 39.2 22.5 270 1-302 30-313 (329) 180 protein:vir:96079 Length: 382 96.6 0.00018 1.1E-07 41.2 12.6 274 1-292 63-382 (382) 181 protein:vir:348 Length: 321 # 96.4 0.00055 3.4E-07 38.5 13.9 285 1-292 1-321 (321) 182 protein:vir:106734 Length: 336 96.4 0.00022 1.4E-07 40.6 11.6 276 1-292 31-336 (336) 183 protein:vir:95131 Length: 325 96.1 0.00098 6.1E-07 37.1 16.6 274 1-302 1-299 (325) 184 protein:vir:80446 Length: 367 96.1 0.001 6.5E-07 36.9 17.7 285 1-302 1-331 (367) 185 protein:vir:78387 Length: 349 95.7 0.0016 1E-06 35.9 17.9 280 1-302 1-321 (349) 186 protein:vir:108303 Length: 418 95.6 0.0018 1.1E-06 35.7 20.7 264 1-295 1-418 (418) 187 protein:vir:96792 Length: 315 95.4 0.0021 1.3E-06 35.3 16.6 264 1-302 1-286 (315) 188 protein:vir:94800 Length: 319 95.4 0.0022 1.3E-06 35.2 22.6 269 1-302 19-303 (319) 189 protein:vir:97331 Length: 319 95.4 0.0022 1.3E-06 35.2 22.6 269 1-302 19-303 (319) 190 protein:vir:94989 Length: 349 95.2 0.0025 1.5E-06 34.9 18.4 281 1-302 1-321 (349) 191 protein:vir:861 Length: 318 # 95.0 0.00034 2.1E-07 39.6 7.5 265 1-290 35-318 (318) 192 protein:vir:105522 Length: 423 94.6 0.0039 2.4E-06 33.8 17.3 259 1-302 1-281 (423) 193 protein:vir:1663 Length: 393 # 94.6 0.00037 2.3E-07 39.4 6.6 265 1-290 110-393 (393) 194 protein:vir:93966 Length: 400 94.2 0.00044 2.7E-07 39.0 6.1 265 1-290 117-400 (400) 195 protein:vir:95451 Length: 313 92.4 0.011 7.1E-06 31.2 16.4 274 1-296 1-313 (313) 196 protein:vir:98566 Length: 355 91.9 0.014 8.5E-06 30.8 17.9 288 1-302 16-354 (355) 197 protein:vir:1781 Length: 221 # 91.4 0.016 9.9E-06 30.4 12.0 185 88-302 1-211 (221) 198 protein:vir:1153 Length: 338 # 91.0 0.018 1.1E-05 30.2 17.3 281 1-296 16-338 (338) 199 protein:vir:3525 Length: 423 # 89.0 0.029 1.8E-05 29.0 15.4 259 1-302 1-302 (423) 200 protein:vir:95603 Length: 463 88.9 0.03 1.8E-05 29.0 16.0 289 1-302 26-352 (463) 201 protein:vir:99311 Length: 463 88.9 0.03 1.8E-05 29.0 16.0 289 1-302 26-352 (463) 202 protein:vir:1829 Length: 355 # 87.9 0.035 2.2E-05 28.5 17.1 288 1-302 16-354 (355) 203 protein:vir:5694 Length: 357 # 86.2 0.047 2.9E-05 27.8 15.6 295 1-301 16-357 (357) 204 protein:vir:79008 Length: 299 84.8 0.058 3.6E-05 27.4 21.8 266 1-296 1-299 (299) 205 protein:vir:95875 Length: 401 84.2 0.062 3.9E-05 27.2 15.9 289 1-296 9-401 (401) 206 protein:vir:174 Length: 423 # 84.0 0.064 4E-05 27.1 17.3 269 1-302 1-336 (423) 207 protein:vir:94870 Length: 318 82.2 0.063 3.9E-05 27.2 8.3 266 1-290 35-318 (318) 208 protein:vir:96666 Length: 462 82.0 0.081 5E-05 26.6 14.7 278 1-302 26-333 (462) 209 protein:vir:102823 Length: 470 80.2 0.097 6E-05 26.1 9.8 260 1-302 18-303 (470) 210 protein:vir:6061 Length: 357 # 79.0 0.11 6.8E-05 25.9 15.9 288 1-302 16-351 (357) 211 protein:vir:2016 Length: 357 # 77.1 0.13 8E-05 25.5 14.5 287 1-301 16-357 (357) 212 protein:vir:5942 Length: 523 # 74.6 0.16 9.7E-05 25.0 11.4 284 1-299 188-523 (523) 213 protein:vir:100851 Length: 514 74.6 0.16 9.7E-05 25.0 9.4 254 1-302 45-320 (514) 214 protein:vir:106286 Length: 534 74.5 0.16 9.8E-05 25.0 19.7 281 1-302 87-517 (534) 215 protein:vir:104011 Length: 337 73.5 0.17 0.0001 24.8 18.7 280 1-297 16-337 (337) 216 protein:vir:79171 Length: 337 72.4 0.18 0.00011 24.6 18.7 280 1-297 16-337 (337) 217 protein:vir:100331 Length: 342 70.4 0.21 0.00013 24.3 16.6 282 1-300 16-342 (342) 218 protein:vir:79157 Length: 339 69.5 0.22 0.00014 24.2 16.4 282 1-302 16-339 (339) 219 protein:vir:80835 Length: 464 69.0 0.23 0.00014 24.1 15.2 289 1-302 22-368 (464) 220 protein:vir:80491 Length: 467 64.2 0.3 0.00019 23.4 12.0 272 1-302 31-329 (467) 221 protein:vir:78920 Length: 290 63.9 0.31 0.00019 23.4 20.7 261 1-293 1-290 (290) 222 protein:vir:63741 Length: 468 63.0 0.33 0.0002 23.3 11.8 272 1-302 32-330 (468) 223 protein:vir:105374 Length: 423 61.7 0.35 0.00022 23.1 18.0 255 1-302 1-281 (423) 224 protein:vir:78186 Length: 337 54.5 0.5 0.00031 22.2 17.8 280 1-301 16-337 (337) 225 protein:vir:78777 Length: 358 53.3 0.53 0.00033 22.1 16.1 282 1-302 20-352 (358) 226 protein:vir:93696 Length: 364 50.9 0.6 0.00037 21.8 15.7 282 1-302 1-364 (364) 227 protein:vir:5670 Length: 514 # 46.6 0.73 0.00045 21.3 19.8 281 1-302 76-498 (514) 228 protein:vir:270 Length: 341 # 45.2 0.78 0.00048 21.2 15.8 279 1-302 20-338 (341) 229 protein:vir:103370 Length: 418 38.5 1.1 0.00066 20.4 15.3 280 1-301 69-418 (418) 230 protein:vir:6901 Length: 522 # 38.2 1.1 0.00067 20.4 19.4 280 1-302 80-505 (522) 231 protein:vir:98856 Length: 343 37.2 1.1 0.0007 20.3 18.2 283 1-302 16-341 (343) 232 protein:vir:96442 Length: 418 35.4 1.2 0.00076 20.1 16.8 280 1-301 69-418 (418) 233 protein:vir:106998 Length: 468 29.9 1.6 0.001 19.4 19.8 279 1-302 63-449 (468) 234 protein:vir:100603 Length: 529 27.9 1.8 0.0011 19.2 17.5 285 1-302 79-509 (529) 235 protein:vir:101039 Length: 529 22.3 2.5 0.0015 18.4 19.0 285 1-302 79-517 (529) 236 protein:vir:3783 Length: 336 # 22.0 2.5 0.0016 18.4 17.2 277 1-297 13-336 (336) 237 protein:vir:103463 Length: 521 20.7 2.7 0.0017 18.2 19.7 287 1-302 79-509 (521) No 1 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=9.7e-69 Score=393.46 Aligned_cols=301 Identities=73% Similarity=1.112 Sum_probs=277.9 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) ||.++++++|.+||++++++|++.+++.++|+++++++++.++.+++|+.+..+.+.|++|++..+++.+++++++|+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~i 80 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceeeE Confidence 99999999999999999999999999999999999999999999999999999999999999999998899999999999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccchH Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDANE 160 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (302) ++++||++++++||+|+++||.++++++|+++|++++++++|+++|+|+|++.|.......+.............+.... T Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) T protein:vir:25 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) T ss_pred EeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccccchhh Confidence 99999999999999999999999999999999999999999999999999998888777776665555555555556666 Q ss_pred HHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecccccCcceEeecccccCCCcceEEEEecceEE Q lcl|NC_011054. 161 DDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDESFNGFGTYFNANGAWPVGVAEALVVDSSRVR 240 (302) Q Consensus 161 ~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~ 240 (302) ..+.+.+.++...+....+..+.|+||+.++..|+++||++|||+|+++.+.|+|+.+++......++..+++|||++++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~l~G~Pv~~~~~~~~~~~~~~~~~gd~s~~~ 240 (305) T protein:vir:25 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVK 240 (305) T ss_pred hHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecCCcccccceEEcCccCCCCCccEEEEEecceEE Confidence 77778888888777777777888999999999999999999999999999999999999888777788899999999999 Q ss_pred EEeecCcEEEEeeccccc----chhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCC Q lcl|NC_011054. 241 IGVRQDITVKFLDQATVG----SINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDG 301 (302) Q Consensus 241 ~~~~~~~~i~~~~~~~~~----~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~ 301 (302) +++++++++++++++... .+++|++||+.+|++.|+||.+.||++|++++++++++|+|++ T Consensus 241 i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~pa~ 305 (305) T protein:vir:25 241 IGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) T ss_pred EEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccCCCC Confidence 999999999999887543 4668999999999999999999999999999999999999999 No 2 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=3e-58 Score=335.94 Aligned_cols=282 Identities=24% Similarity=0.306 Sum_probs=242.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) +...+++++|.+||++++++|++.+++.++|+++++++|++++.+++|+.++.+.+.|++|+++.++ ++++|+++ T Consensus 9 ~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~-----~~~~~~~i 83 (304) T protein:vir:10 9 GNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQT-----SKPEYAQA 83 (304) T ss_pred ccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCccccc-----ccceeeEE Confidence 5556677889999999999999999999999999999999999999999999999999999988665 57899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCC--cccccccccccccccccceeeccccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKP--SSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~--~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +++++|++++++||+|+++||.++++++|.++|++++++++|+++|+|+|++ .+............. ..... T Consensus 84 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~------~~~~~ 157 (304) T protein:vir:10 84 EMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEK------GNVVT 157 (304) T ss_pred EEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccc------ccccc Confidence 9999999999999999999999999999999999999999999999999864 333332222222111 11112 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec--ccccCcceEeecccccCCCcceEEEEec Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD--ESFNGFGTYFNANGAWPVGVAEALVVDS 236 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~--~~~~g~p~~~~~~~~~~~~~~~~~~gd~ 236 (302) .....++++.+++..+...+..++.|+||++++..|++++|++|||+|++ ..+.|+|+.+.+......+++.+++||| T Consensus 158 ~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gd~ 237 (304) T protein:vir:10 158 DTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANGNEIMGLPLSYTGADVYDKKKSLALMGDW 237 (304) T ss_pred cccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCCccccceeeEEecccccCCCCcEEEEEeh Confidence 22334566777888888888889999999999999999999999999986 4788999998888877778889999999 Q ss_pred ceEEEEeecCcEEEEeeccccc----------chhhhcCCcEEEEEEEEeccEEeccccEEEEeeec Q lcl|NC_011054. 237 SRVRIGVRQDITVKFLDQATVG----------SINLAERDMIALRLKARFAYVLGNGATAVGDNKTP 293 (302) Q Consensus 237 ~~~~~~~~~~~~i~~~~~~~~~----------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~ 293 (302) +++++++++++++++++++... .+++|++||+.||+++|+|+++.+|+||++|+.+. T Consensus 238 ~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 238 DYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred hhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 9999999999999999886532 34689999999999999999999999999999887 No 3 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=3e-58 Score=335.94 Aligned_cols=282 Identities=24% Similarity=0.306 Sum_probs=242.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) +...+++++|.+||++++++|++.+++.++|+++++++|++++.+++|+.++.+.+.|++|+++.++ ++++|+++ T Consensus 9 ~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~-----~~~~~~~i 83 (304) T protein:vir:94 9 GNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQT-----SKPEYAQA 83 (304) T ss_pred ccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCccccc-----ccceeeEE Confidence 5556677889999999999999999999999999999999999999999999999999999988665 57899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCC--cccccccccccccccccceeeccccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKP--SSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~--~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +++++|++++++||+|+++||.++++++|.++|++++++++|+++|+|+|++ .+............. ..... T Consensus 84 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~------~~~~~ 157 (304) T protein:vir:94 84 EMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEK------GNVVT 157 (304) T ss_pred EEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccc------ccccc Confidence 9999999999999999999999999999999999999999999999999864 333332222222111 11112 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec--ccccCcceEeecccccCCCcceEEEEec Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD--ESFNGFGTYFNANGAWPVGVAEALVVDS 236 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~--~~~~g~p~~~~~~~~~~~~~~~~~~gd~ 236 (302) .....++++.+++..+...+..++.|+||++++..|++++|++|||+|++ ..+.|+|+.+.+......+++.+++||| T Consensus 158 ~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gd~ 237 (304) T protein:vir:94 158 DTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANGNEIMGLPLSYTGADVYDKKKSLALMGDW 237 (304) T ss_pred cccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCCccccceeeEEecccccCCCCcEEEEEeh Confidence 22334566777888888888889999999999999999999999999986 4788999998888877778889999999 Q ss_pred ceEEEEeecCcEEEEeeccccc----------chhhhcCCcEEEEEEEEeccEEeccccEEEEeeec Q lcl|NC_011054. 237 SRVRIGVRQDITVKFLDQATVG----------SINLAERDMIALRLKARFAYVLGNGATAVGDNKTP 293 (302) Q Consensus 237 ~~~~~~~~~~~~i~~~~~~~~~----------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~ 293 (302) +++++++++++++++++++... .+++|++||+.||+++|+|+++.+|+||++|+.+. T Consensus 238 ~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 238 DYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred hhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 9999999999999999886532 34689999999999999999999999999999887 No 4 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=5.4e-58 Score=334.56 Aligned_cols=293 Identities=22% Similarity=0.287 Sum_probs=240.5 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |+. ++.++|.++|++++++|++.+++.++|+++++++++.++.+++|+.++.+.+.|++|++++++ ++++|+++ T Consensus 10 ~~~-~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-----~~~~f~~i 83 (330) T protein:vir:77 10 QVA-LTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVSASWTGEAERKPI-----TKGSFGKQ 83 (330) T ss_pred hcc-ccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceeEecCCCcccc-----ccceeeEE Confidence 443 345566788889999999999999999999999999998899999999999999999988665 57899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccc--cceeeccccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAAN--QDYTIVPGDA 158 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~--~~~~~~~~~~ 158 (302) ++++||++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++.+. .+......... ......+... T Consensus 84 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~--~g~~~~~~~~~~~~~~~~~~~~~ 161 (330) T protein:vir:77 84 ELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAF--KGYLAETTKVVSLADTNLTTASG 161 (330) T ss_pred EEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCcc--ccccccccccceeeccccccccc Confidence 9999999999999999999999999999999999999999999999999975432 11111111111 1111222233 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc------------cccCcceEeecccccC- Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE------------SFNGFGTYFNANGAWP- 225 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~------------~~~g~p~~~~~~~~~~- 225 (302) .....++++.+++..+...+..++.|+||++++..|+++||++|||||++. ++.|+|+.+.+..... T Consensus 162 ~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p~~~ 241 (330) T protein:vir:77 162 PQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADNVVNGT 241 (330) T ss_pred ccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEeccccCCC Confidence 344556677777777777788889999999999999999999999999753 4678898888765433 Q ss_pred -CCcceEEEEecceEEEEeecCcEEEEeeccccc------------chhhhcCCcEEEEEEEEeccEEeccccEEEEeee Q lcl|NC_011054. 226 -VGVAEALVVDSSRVRIGVRQDITVKFLDQATVG------------SINLAERDMIALRLKARFAYVLGNGATAVGDNKT 292 (302) Q Consensus 226 -~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~------------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~ 292 (302) .++..+++|||++++++++++++++++++.... .+++|++|++.||+++|+|+++.+|+||++++.+ T Consensus 242 ~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~ 321 (330) T protein:vir:77 242 VGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQ 321 (330) T ss_pred CCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceEEEEec Confidence 345678999999999999999999999886532 3568999999999999999999999999999998 Q ss_pred cccccCCCCC Q lcl|NC_011054. 293 PVGAVVPDGS 302 (302) Q Consensus 293 ~a~~~~p~~~ 302 (302) .+++ +|+-. T Consensus 322 ~~~~-~~~~~ 330 (330) T protein:vir:77 322 VAGT-DPEEE 330 (330) T ss_pred cCCc-CCCCC Confidence 8877 66666 No 5 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=1.2e-57 Score=332.74 Aligned_cols=291 Identities=15% Similarity=0.168 Sum_probs=234.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) ||..+++.||++||++++++||+.+++.|+++++++++|+.++.++||+.++.+.|+|++|++++++ ++++|+++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~-----s~~~f~~v 75 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPS-----ASVDVSAF 75 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeCCccccc-----cccceeee Confidence 9999999999999999999999999999999999999999998999999999999999999987654 67999999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTS----LLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~----~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) ++++||++++++||+|+++++..+ ++++|.++|++++++++|.++|+|+|..++....+........... ...+ T Consensus 76 ~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~--~~~~ 153 (315) T protein:vir:80 76 TAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNI--VDAT 153 (315) T ss_pred EeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccce--eecc Confidence 999999999999999999988765 7899999999999999999999998865544333322222111111 1111 Q ss_pred cchHHHHHHHhhhhhhhhhh-cccCccEEEecHHHHHHHHhhhcCCCc-----eeee------cccccCcceEeeccccc Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAA-AGYMPDTLLASLGFRFDVANLRDANGN-----PIFR------DESFNGFGTYFNANGAW 224 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~-~~~~~~~~v~~~~~~~~l~~l~d~~g~-----~i~~------~~~~~g~p~~~~~~~~~ 224 (302) ...+ +++.+++..+.. .+..++.|+||+.++..|++|+|.+|+ |+|. +.++.|+|+.+.+.... T Consensus 154 ~~~~----~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~ 229 (315) T protein:vir:80 154 DSAT----ADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSG 229 (315) T ss_pred ccch----HHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceeeEecCcCCc Confidence 1122 334445544433 344556799999999999999877654 5553 24689999988876543 Q ss_pred C-----CCcceEEEEecceEEEEeecCcEEEEeecccc--cchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccccc Q lcl|NC_011054. 225 P-----VGVAEALVVDSSRVRIGVRQDITVKFLDQATV--GSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAV 297 (302) Q Consensus 225 ~-----~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~--~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~ 297 (302) . .++..+++|||++++|+.+++++++++++++. ..+++|++|++.||+++|+||+|.+|+||++|+.+.++.. T Consensus 230 ~~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~ 309 (315) T protein:vir:80 230 APEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKP 309 (315) T ss_pred ccccccccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCC Confidence 2 23457899999999999999999999987642 2457899999999999999999999999999998888877 Q ss_pred CCCCC Q lcl|NC_011054. 298 VPDGS 302 (302) Q Consensus 298 ~p~~~ 302 (302) +|.+- T Consensus 310 ~~~~~ 314 (315) T protein:vir:80 310 NPPAE 314 (315) T ss_pred CCCCC Confidence 77777 No 6 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=2.1e-57 Score=331.27 Aligned_cols=286 Identities=18% Similarity=0.196 Sum_probs=234.2 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) ||..++ ||++||+++.++||+.+++.+++++++++++++++..++|+.++.+.++|++|+++.++ ++++|+++ T Consensus 1 mat~~~--gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~-----~~~~f~~v 73 (311) T protein:vir:81 1 MVALAT--GTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSE-----STATFAPV 73 (311) T ss_pred CceecC--CceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCceeEEeecCccccc-----ccceeeEE Confidence 885554 78999999999999999999999999999999999999999999999999999988664 67899999 Q ss_pred EeeeeeEEEeehhHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVD---DASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~---ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) ++++||+++++++|+|+++ ++..+++++|.+++++++++++|.++++|+++++|....+.............. +. T Consensus 74 ~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~--~~ 151 (311) T protein:vir:81 74 TAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVEL--TT 151 (311) T ss_pred EEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeee--cc Confidence 9999999999999999996 456789999999999999999999999998766554443333332222211111 12 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec-------ccccCcceEeecccc------- Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD-------ESFNGFGTYFNANGA------- 223 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~-------~~~~g~p~~~~~~~~------- 223 (302) .+......++.+++..+...+.+++.|+||+.++.+|++|||++|||+|++ .++.|+|+.+.+... T Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~ 231 (311) T protein:vir:81 152 GTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVT 231 (311) T ss_pred cccchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEecccccccccccc Confidence 222333455666777777777788889999999999999999999999964 467889988765432 Q ss_pred -------cCCCcceEEEEecceEEEEeecCcEEEEeeccc-ccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccc Q lcl|NC_011054. 224 -------WPVGVAEALVVDSSRVRIGVRQDITVKFLDQAT-VGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVG 295 (302) Q Consensus 224 -------~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~-~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~ 295 (302) ....+..+++|||++++++.+++++++++++.. ....++|++|++.+|+++|+|++|.+|+||++++++.-+ T Consensus 232 ~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 232 ASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred cccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 123456789999999999999999999998763 334578999999999999999999999999999987655 No 7 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=4e-57 Score=329.77 Aligned_cols=284 Identities=17% Similarity=0.151 Sum_probs=230.8 Q ss_pred CCCcc-CCCcceecchHHHHHHHHHHHhhhhhhhh-cceeecCCCceEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADIS-RSEVATLIQEAYANDLLASAKKGSTVLQA-FPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t-~~~~g~liP~~~~~~ii~~~~~~s~l~~~-~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) |+..+ ++.||++||+++.++||+.+++.++++++ ++.+++.++.+++|+.++++.++|++|+++.++ ++++|+ T Consensus 64 ~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~~~a~wv~E~~~~~~-----s~~~f~ 138 (366) T protein:vir:57 64 MAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGGATAGYVGEGKDVVA-----TGATFD 138 (366) T ss_pred hhccccccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCCcceeeeccCccccc-----ccccee Confidence 44333 34688999999999999999999999998 888999888999999999999999999987654 678999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC---Ccccccccccccccccccceeecc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---PSSWVSPALLPAAVAANQDYTIVP 155 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---~~g~~~~~~~~~~~~~~~~~~~~~ 155 (302) +|++++||+++++++|+|+++||.++++++|+++|++++++++|++||+|+|+ |.|+.+.... ......... T Consensus 139 ~i~~~~~k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~-----~~~~~~~~~ 213 (366) T protein:vir:57 139 DVKLSAKTMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATA-----ANRLVAWTG 213 (366) T ss_pred EEEEeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccc-----ccceeeccc Confidence 99999999999999999999999999999999999999999999999999883 5555433221 111111122 Q ss_pred ccchHHHHHHHhhhhh--hhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec---ccccCcceEeeccccc----CC Q lcl|NC_011054. 156 GDANEDDLIGCINRAS--KAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD---ESFNGFGTYFNANGAW----PV 226 (302) Q Consensus 156 ~~~~~~~~~~~i~~~~--~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~---~~~~g~p~~~~~~~~~----~~ 226 (302) ...+...+..++..+. ......+...+.|+||+.++..|++|+|++|+|+|.+ +++.|+|+.+.+.... .. T Consensus 214 t~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pvv~s~~ip~~~~~~~ 293 (366) T protein:vir:57 214 TAINLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRDGNGNKVYPEMSQGILKGYPIQRTSAIPANLGDDG 293 (366) T ss_pred cccchhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhccCCceeccCCCCCeecceeeEEccccccccccCC Confidence 2233333333332222 2233445678889999999999999999999999953 4688999988766432 33 Q ss_pred CcceEEEEecceEEEEeecCcEEEEeeccccc-----chhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 227 GVAEALVVDSSRVRIGVRQDITVKFLDQATVG-----SINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 227 ~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~-----~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) +...++||||+++++++++++++++++++++. .+++|++|++.+|+++|+||++.||++|++++++.| T Consensus 294 ~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 294 NESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred CccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 55679999999999999999999999987653 246899999999999999999999999999999999 No 8 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=7.2e-57 Score=328.38 Aligned_cols=280 Identities=21% Similarity=0.307 Sum_probs=238.7 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |+..++++++.+||++++++||+.+++.++|+++++++|++++..++|+.+ .+.+.|++|+++.++ ++++|+++ T Consensus 6 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~-----~~~~f~~v 79 (299) T protein:vir:41 6 DTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMS-GVGAFWVDEAERIQT-----SKPTFTKA 79 (299) T ss_pred CcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEc-CCceeeeecCccccc-----cccceeEE Confidence 999999999999999999999999999999999999999999999999876 578999999988665 57899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccchH Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDANE 160 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (302) ++.++|++++++||+|+++||..+++++|.++|++++++++|+++|+|+|++.+. ++........ .....+..+ T Consensus 80 ~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~---gil~~~~~~~--~~~~~~~~~- 153 (299) T protein:vir:41 80 KMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNW---NILKSATDAS--NLVEETANK- 153 (299) T ss_pred EEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccc---cccccccccc--eeecccccc- Confidence 9999999999999999999999999999999999999999999999999865322 1111111111 112222233 Q ss_pred HHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc------cccCcceEeecccccCCCcceEEEE Q lcl|NC_011054. 161 DDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE------SFNGFGTYFNANGAWPVGVAEALVV 234 (302) Q Consensus 161 ~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~------~~~g~p~~~~~~~~~~~~~~~~~~g 234 (302) ++++.+++..+...+..++.|+||++++.+|++|+|++|+|||+++ .+.|+|+.+.+....+.++..+++| T Consensus 154 ---~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~g 230 (299) T protein:vir:41 154 ---YDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGVDDVLGLPIAYTPKYTFGDKDISELVG 230 (299) T ss_pred ---HHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCceecceeeEEecccCCCCCceEEEEE Confidence 3456666677777888889999999999999999999999999764 4678899888887767777889999 Q ss_pred ecceEEEEeecCcEEEEeeccccc--------chhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccc Q lcl|NC_011054. 235 DSSRVRIGVRQDITVKFLDQATVG--------SINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVG 295 (302) Q Consensus 235 d~~~~~~~~~~~~~i~~~~~~~~~--------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~ 295 (302) ||++++++++++++++++++.+.. .+++|++|++.+|+++|+|+++.+|+||++++.+.|. T Consensus 231 dfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 231 DWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred ecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 999999999999999999987542 2457999999999999999999999999999988777 No 9 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=1.1e-56 Score=327.41 Aligned_cols=285 Identities=14% Similarity=0.166 Sum_probs=229.9 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) ||..+++. |.+||++++.+||+.+++.++++++++++++.++..++|+.++.+.|+|++|+++.+ +++++|+++ T Consensus 1 ma~~t~~~-G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~-----~s~~~f~~v 74 (300) T protein:vir:95 1 MSEAQLSK-GNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDIDIVAENGKKT-----HGGVSLDPV 74 (300) T ss_pred CcccccCC-cceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEeeCCcccc-----cccccceee Confidence 99887775 568999999999999999999999999999998889999999999999999998755 478999999 Q ss_pred EeeeeeEEEeehhHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVD---DASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~---ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) ++++||++++++||+|+++ ++.++++++|.++|++++++++|+++|+|++.++|.................... T Consensus 75 ~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~--- 151 (300) T protein:vir:95 75 TIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVP--- 151 (300) T ss_pred EeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeec--- Confidence 9999999999999999995 5678999999999999999999999999976544432211111111111111111 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec-------ccccCcceEeecccccC--CCc Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD-------ESFNGFGTYFNANGAWP--VGV 228 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~-------~~~~g~p~~~~~~~~~~--~~~ 228 (302) .+....++.+.++...+...+++++.|+||+.++.+|++|||++|||||.+ .++.|+|+.+.+..... ..+ T Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 231 (300) T protein:vir:95 152 FKDTNPDESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQTDPK 231 (300) T ss_pred ccccchHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecceeeEEecCCCCCCCCCc Confidence 112233466777777777778888899999999999999999999999953 46889999887765433 345 Q ss_pred ceEEEEecceE-EEEeecCcEEEEeecccc--cchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 229 AEALVVDSSRV-RIGVRQDITVKFLDQATV--GSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 229 ~~~~~gd~~~~-~~~~~~~~~i~~~~~~~~--~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) ..+++|||+++ .++.|++++++++++... ..+++|++||+.+|+++|+||++.+|+||++++++.= T Consensus 232 ~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 232 NTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred cEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 56889999875 499999999999876532 3456899999999999999999999999999986532 No 10 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=1.2e-56 Score=327.26 Aligned_cols=285 Identities=14% Similarity=0.145 Sum_probs=231.5 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |+ +++.+|.+||++++++||+.+++.++++++++++|+.++..++|+.++++.+.|++|+++.++ ++++|+++ T Consensus 1 m~--t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~~~-----s~~~f~~v 73 (303) T protein:vir:97 1 MG--TETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGKKTH-----GGLSLEPV 73 (303) T ss_pred Cc--ccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEeecCccccc-----cccceeeE Confidence 88 444578999999999999999999999999999999999999999999999999999988654 67899999 Q ss_pred EeeeeeEEEeehhHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVD---DASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~---ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) ++++||+++++++|+|+++ ++..+++++|.++|++++++++|.++|+|++.++|...... ..... ....+..... T Consensus 74 ~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~-~~~~~-~~~~~~~~~~ 151 (303) T protein:vir:97 74 TIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVI-GTNHF-DSKVTQVVKF 151 (303) T ss_pred EeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccc-ccccc-cccccccccc Confidence 9999999999999999994 56789999999999999999999999999875443221111 10000 0001111111 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc--------cccCcceEeecccccC---- Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE--------SFNGFGTYFNANGAWP---- 225 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~--------~~~g~p~~~~~~~~~~---- 225 (302) .+.+..++++.+++..+...+..++.|+||++++.+|++|||++|+|+|.++ .+.|+|+.+....... T Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~ 231 (303) T protein:vir:97 152 TESEDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLKSSVNTTVGAGADEA 231 (303) T ss_pred ccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecceeeEEecccCCccccC Confidence 2223345677777777777788889999999999999999999999999754 5789999887764332 Q ss_pred CCcceEEEEecc-eEEEEeecCcEEEEeeccc--ccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 226 VGVAEALVVDSS-RVRIGVRQDITVKFLDQAT--VGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 226 ~~~~~~~~gd~~-~~~~~~~~~~~i~~~~~~~--~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) .....+++|||+ .+.++.+++++++++++.. ...+++|++||+.+|+++|+|+++.+|+||++|+++++ T Consensus 232 ~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 232 ESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred CCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 344568999995 5789999999999987653 23467899999999999999999999999999999988 No 11 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=2.5e-56 Score=325.45 Aligned_cols=286 Identities=19% Similarity=0.253 Sum_probs=239.7 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) +....+++++.+||++++++|++.+++.++|+++++++|++++.+++|+.++.+.+.|++|++++++ ++++|+++ T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~-----~~~~f~~v 101 (324) T protein:vir:97 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIET-----SKATWVNA 101 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCceEEEEEecCcceeEeccCccccc-----cccceeEE Confidence 4444556688999999999999999999999999999999999999999999999999999988654 67899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccchH Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDANE 160 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (302) ++++||++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++. .+.++..... .......+..++ T Consensus 102 ~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~--~~~gi~~~~~---~~~~~~~~~~~~ 176 (324) T protein:vir:97 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIE---KTNKVIKGDFTQ 176 (324) T ss_pred EEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccCCCCc--cCcccccccc---ccceeccccCCH Confidence 99999999999999999999999999999999999999999999999998531 2222222111 112222333344 Q ss_pred HHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec---ccccCcceEeecccccCCCcceEEEEecc Q lcl|NC_011054. 161 DDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD---ESFNGFGTYFNANGAWPVGVAEALVVDSS 237 (302) Q Consensus 161 ~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~---~~~~g~p~~~~~~~~~~~~~~~~~~gd~~ 237 (302) +++.++...+...++.++.|+||+.++..|++++|++|||+|.+ +++.|+|+.+.. ..+.+++.+++|||+ T Consensus 177 ----~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~~~~~~~~~tl~G~PV~~~~--~~~~~~~~~~~gd~~ 250 (324) T protein:vir:97 177 ----DNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDTLDGLPVVNLK--SSNLKRGELITGDFD 250 (324) T ss_pred ----HHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCCCccccceeeEeec--CCCCCcceEEEEecc Confidence 44556667777788889999999999999999999999999974 467888876654 345677889999999 Q ss_pred eEEEEeecCcEEEEeeccccc--------chhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 238 RVRIGVRQDITVKFLDQATVG--------SINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 238 ~~~~~~~~~~~i~~~~~~~~~--------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) +++++++++++++++++.... .+++|++|++.||+++|+|+++.+|+||++++.+.++...|.|- T Consensus 251 ~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~ 323 (324) T protein:vir:97 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGE 323 (324) T ss_pred cEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCCCCCCCCC Confidence 999999999999999987543 35789999999999999999999999999999887776666667 No 12 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=3.8e-56 Score=324.39 Aligned_cols=283 Identities=13% Similarity=0.154 Sum_probs=230.9 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) || .++|.++|++++++||+.++++++|++++++++++++..++|+.++.+.++|++|+++.++ ++++|+++ T Consensus 1 ma----~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~-----~~~~f~~v 71 (298) T protein:vir:16 1 MV----LNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTH-----GGVTLAPQ 71 (298) T ss_pred Cc----ccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEecCCccccc-----cccceeEE Confidence 77 3457899999999999999999999999999999988899999999999999999987654 67899999 Q ss_pred EeeeeeEEEeehhHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVD---DASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~---ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) ++++||+++++++|+|+++ ++..+++++|.++|++++++++|.++++|++.++|........ .............. T Consensus 72 ~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~-~~~~~~~~~~~~~~ 150 (298) T protein:vir:16 72 TMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGT-NHFDSKVTQKVEAP 150 (298) T ss_pred EEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccc-cccccccccccccc Confidence 9999999999999999996 4567999999999999999999999999976554433222111 11111111122222 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeeccccc--CCCc Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAW--PVGV 228 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~--~~~~ 228 (302) ......++++.+++..+...+..++.|+||++++..|++|||++|||||++. .+.|+|+.+...... ..++ T Consensus 151 ~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~ 230 (298) T protein:vir:16 151 RGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) T ss_pred cccccHHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCc Confidence 2234445677788888888888888999999999999999999999999763 678899988776543 3455 Q ss_pred ceEEEEecceE-EEEeecCcEEEEeecccc--cchhhhcCCcEEEEEEEEeccEEeccccEEEEeeec Q lcl|NC_011054. 229 AEALVVDSSRV-RIGVRQDITVKFLDQATV--GSINLAERDMIALRLKARFAYVLGNGATAVGDNKTP 293 (302) Q Consensus 229 ~~~~~gd~~~~-~~~~~~~~~i~~~~~~~~--~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~ 293 (302) ..+++|||+++ .++.+++++++++++... ...++|++||+.+|+++|+|+++.||+||++++++. T Consensus 231 ~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 67899999875 589999999999886532 346789999999999999999999999999999876 No 13 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=2.3e-56 Score=325.66 Aligned_cols=284 Identities=16% Similarity=0.202 Sum_probs=235.4 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhh-cceeecCCCceEEEEEeCCcceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQA-FPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~-~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) |...+...||.+||+++.++||+.+++.++++++ ++.+++.++.+++|+.++.+.+.|++|++..++ ++++|++ T Consensus 132 ~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~-----~~~~f~~ 206 (435) T protein:vir:14 132 LNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPT-----TQQQFDD 206 (435) T ss_pred cccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCCceEEEEEeCCcceeeeccCccccc-----cccceeE Confidence 6666777889999999999999999999999998 788899888999999999999999999987654 6789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhcccCC---Ccccccccccccccccccceeec Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDAS--TSLLEEIAALGGQAIGKKLDQAVIFGTDK---PSSWVSPALLPAAVAANQDYTIV 154 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---~~g~~~~~~~~~~~~~~~~~~~~ 154 (302) |++.++|++++++||+|+++|+. +++++||.++|++++++++|++|++|+|+ |.|++....... .... T Consensus 207 i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~-------~~~~ 279 (435) T protein:vir:14 207 LKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSN-------VITA 279 (435) T ss_pred EEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccc-------eecc Confidence 99999999999999999999985 46999999999999999999999999885 556544322111 1111 Q ss_pred cccchHHHHHHHhhhhhhhhhhc--ccCccEEEecHHHHHHHHhhhcCCCceeeec---ccccCcceEeeccccc----C Q lcl|NC_011054. 155 PGDANEDDLIGCINRASKAVAAA--GYMPDTLLASLGFRFDVANLRDANGNPIFRD---ESFNGFGTYFNANGAW----P 225 (302) Q Consensus 155 ~~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~---~~~~g~p~~~~~~~~~----~ 225 (302) ....+.+....++.++...+... ++.++.|+||+.++..|+++||++|||||.+ +++.|+|+.+++.... . T Consensus 280 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pv~~~~~~p~~~~~~ 359 (435) T protein:vir:14 280 SDASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGET 359 (435) T ss_pred ccccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeecceeEeeccccccccCC Confidence 22233444445556665555432 4557789999999999999999999999953 4688999988765432 2 Q ss_pred CCcceEEEEecceEEEEeecCcEEEEeecccccc-----hhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccc Q lcl|NC_011054. 226 VGVAEALVVDSSRVRIGVRQDITVKFLDQATVGS-----INLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGA 296 (302) Q Consensus 226 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~-----~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~ 296 (302) .....+++|||++|++++|+++++++++++.+.. ..+|++|++.||+++|+||++.+|+||+++++.+|++ T Consensus 360 ~~~~~i~~gd~s~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 360 GKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred CccceEEEeecccEEEEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 3456799999999999999999999999876532 4689999999999999999999999999999999999 No 14 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=4.2e-56 Score=324.15 Aligned_cols=285 Identities=20% Similarity=0.242 Sum_probs=233.9 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |+..+++++|.+||+++.++||+.+++.++|+++++++|+.++..+||+.++.+.++|++|++++++ ++++|+++ T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~-----~~~~f~~i 88 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVSAQWIGEGDMKPI-----TKGNMTSQ 88 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceEEecCCccccc-----cccceeEE Confidence 8888889999999999999999999999999999999999999999999999999999999988665 57899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccchH Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDANE 160 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (302) ++++||+++++++|+|+++||.++++++|.++|++++++++|+++|+|+|++.+......... ...... .+ .. T Consensus 89 ~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~---~~~~~~--~~--~~ 161 (318) T protein:vir:24 89 TIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKA---ISIADT--TG--AT 161 (318) T ss_pred EEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCccccccccc---cccccc--cc--cc Confidence 999999999999999999999999999999999999999999999999987543322211111 111111 11 11 Q ss_pred HHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeeccc------------ccCcceEeecccccCCCc Q lcl|NC_011054. 161 DDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDES------------FNGFGTYFNANGAWPVGV 228 (302) Q Consensus 161 ~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~------------~~g~p~~~~~~~~~~~~~ 228 (302) ....+.+.++...+...+..++.|+||++++..|+++||++|||||++.. +.|+|+.+... .+.++ T Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~--~~~~~ 239 (318) T protein:vir:24 162 TVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSDH--VVEGT 239 (318) T ss_pred chHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEEeCC--CCCCc Confidence 22234455666667777888899999999999999999999999998753 33445444433 34556 Q ss_pred ceEEEEecceEEEEeecCcEEEEeeccccc--------chhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCC Q lcl|NC_011054. 229 AEALVVDSSRVRIGVRQDITVKFLDQATVG--------SINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPD 300 (302) Q Consensus 229 ~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~--------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~ 300 (302) ..+++|||++++++++++++|+++++++.. .+++|++|++.||+++|+|+++.+|+||++|+...++.- + T Consensus 240 ~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~~~--~ 317 (318) T protein:vir:24 240 TVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSGGG--E 317 (318) T ss_pred cEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccCCC--C Confidence 778999999999999999999999987643 356899999999999999999999999999998766552 2 Q ss_pred C Q lcl|NC_011054. 301 G 301 (302) Q Consensus 301 ~ 301 (302) | T Consensus 318 ~ 318 (318) T protein:vir:24 318 G 318 (318) T ss_pred C Confidence 3 No 15 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=2.1e-56 Score=325.87 Aligned_cols=285 Identities=20% Similarity=0.259 Sum_probs=233.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |+..++++++.++|++++++||+.+++.++|+++++++++.++.++||+.+..+.+.|++|++++++ ++++|+++ T Consensus 10 ~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~-----s~~~f~~v 84 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWIGEGDMKPI-----TKGNMTKR 84 (397) T ss_pred HhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEecCCccccc-----cccceeEE Confidence 8888888888889999999999999999999999999999998999999999999999999988665 67899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccchH Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDANE 160 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (302) ++++||++++++||+|+++|+.++++++|+++|++++++++|+++|+|+|++.+........ . ......+.... T Consensus 85 ~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~-----~-~~~~~~~~~~~ 158 (397) T protein:vir:23 85 DVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQS-----N-KTQSISPNAYQ 158 (397) T ss_pred EEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccc-----c-ceeeecccchh Confidence 99999999999999999999999999999999999999999999999999876543221111 1 11112222223 Q ss_pred HHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc------------cccCcceEeecccccCCCc Q lcl|NC_011054. 161 DDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE------------SFNGFGTYFNANGAWPVGV 228 (302) Q Consensus 161 ~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~------------~~~g~p~~~~~~~~~~~~~ 228 (302) +.+.++...+...+..++.|+||++++..|+++||++|||||+++ .+.|+|+.+.... +.++ T Consensus 159 ----~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~--~~g~ 232 (397) T protein:vir:23 159 ----GLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHV--AEGD 232 (397) T ss_pred ----HHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCC--CCCc Confidence 333444445566677889999999999999999999999999875 3566777666553 3455 Q ss_pred ceEEEEecceEEEEeecCcEEEEeeccccc--------chhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccC-- Q lcl|NC_011054. 229 AEALVVDSSRVRIGVRQDITVKFLDQATVG--------SINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVV-- 298 (302) Q Consensus 229 ~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~--------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~-- 298 (302) ..+++|||+++++++++++.++++++.+.. .+++|++||+.||+++|+||++.+|+||++++.++.+... T Consensus 233 ~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~~ 312 (397) T protein:vir:23 233 VVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTYAL 312 (397) T ss_pred eEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccccceeee Confidence 667899999999999999999999887643 4568999999999999999999999999999986543321 Q ss_pred --C---CCC Q lcl|NC_011054. 299 --P---DGS 302 (302) Q Consensus 299 --p---~~~ 302 (302) | .|+ T Consensus 313 ~~~~~~~~~ 321 (397) T protein:vir:23 313 DLDGASAGN 321 (397) T ss_pred cccccCcce Confidence 1 222 No 16 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=3.6e-56 Score=324.54 Aligned_cols=284 Identities=17% Similarity=0.204 Sum_probs=234.3 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhh-cceeecCCCceEEEEEeCCcceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQA-FPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~-~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) +...+++.||++||+++.++||+.+++.++++++ ++.+++..+.+++|+.++.+.+.|++|++..++ ++++|++ T Consensus 132 ~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~~~~p~~~~~~~a~~v~E~~~~~~-----~~~~f~~ 206 (435) T protein:vir:80 132 LNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPT-----TQQQFDD 206 (435) T ss_pred hcccCCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCceEEEEEeCCcceeeeccCccccc-----cccceee Confidence 4556667789999999999999999999999998 788999998999999999999999999987654 6789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhcccCC---Ccccccccccccccccccceeec Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDAS--TSLLEEIAALGGQAIGKKLDQAVIFGTDK---PSSWVSPALLPAAVAANQDYTIV 154 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---~~g~~~~~~~~~~~~~~~~~~~~ 154 (302) |++.++|++++++||+|+++|+. ++++++|.++|++++++++|.+||+|+|+ |.|+........ .... T Consensus 207 i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~-------~~~~ 279 (435) T protein:vir:80 207 LKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGN-------VITA 279 (435) T ss_pred EEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccc-------eeec Confidence 99999999999999999999985 47999999999999999999999999884 555544332111 1112 Q ss_pred cccchHHHHHHHhhhhhhhhhh--cccCccEEEecHHHHHHHHhhhcCCCceeee---cccccCcceEeeccccc----C Q lcl|NC_011054. 155 PGDANEDDLIGCINRASKAVAA--AGYMPDTLLASLGFRFDVANLRDANGNPIFR---DESFNGFGTYFNANGAW----P 225 (302) Q Consensus 155 ~~~~~~~~~~~~i~~~~~~~~~--~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~---~~~~~g~p~~~~~~~~~----~ 225 (302) ....+.+.+..++.+++..+.. .+..++.|+||+.++..|++++|++|+|+|. ++++.|+|+.+.+.... . T Consensus 280 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~l~G~pv~~~~~~p~~~~~~ 359 (435) T protein:vir:80 280 SDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGEA 359 (435) T ss_pred ccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCceeccCCCCCeEeeeeeEEeccccccccCC Confidence 2223334444445555444433 2456788999999999999999999999995 45788999988776432 2 Q ss_pred CCcceEEEEecceEEEEeecCcEEEEeecccccc-----hhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccc Q lcl|NC_011054. 226 VGVAEALVVDSSRVRIGVRQDITVKFLDQATVGS-----INLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGA 296 (302) Q Consensus 226 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~-----~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~ 296 (302) .+...+++|||+++++++++++++++++++++.+ +++|++|++.||++.|+||++.+|+||+++++..|++ T Consensus 360 ~~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 360 GKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred CCcceEEEEEcccEEEEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 3456799999999999999999999999876533 4689999999999999999999999999999999999 No 17 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=3.5e-56 Score=324.59 Aligned_cols=285 Identities=13% Similarity=0.085 Sum_probs=232.9 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |...++++||++||++++++|++.+++.++|+++++++++.++.+.+|+..+++.+.|++|++..++. +.++|+++ T Consensus 106 ~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~----~~~~f~~i 181 (407) T protein:vir:48 106 LQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNLGGTTSGWVGETDARPET----ATSKLGLI 181 (407) T ss_pred hhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCcceeeeccccccccc----ccccceeE Confidence 88888899999999999999999999999999999999999999999999999999999999887653 35799999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Cccccccccccccccc---cc---cee Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAA---NQ---DYT 152 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~---~~---~~~ 152 (302) ++.+||++++++||+|+++|+.+++++||.++|++++++++|.++++|+|+ |.|++........... +. ... T Consensus 182 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~ 261 (407) T protein:vir:48 182 EPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIAS 261 (407) T ss_pred EeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeeccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999875 5555533221111100 00 111 Q ss_pred eccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeeccccc- Q lcl|NC_011054. 153 IVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAW- 224 (302) Q Consensus 153 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~- 224 (302) ...+..+ ++++.+++..+...+..++.|+||++++..|++|||++|||||+++ ++.|+|+.+.+.... T Consensus 262 ~~~~~~~----~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~ 337 (407) T protein:vir:48 262 GAASGVT----ADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDNDGNYLWRPGIELGQPSSLAGYGIVENEQMPDI 337 (407) T ss_pred ccccccC----hHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecCcCCc Confidence 1122223 3555666667777788888999999999999999999999999764 578889887776443 Q ss_pred CCCcceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCC Q lcl|NC_011054. 225 PVGVAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDG 301 (302) Q Consensus 225 ~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~ 301 (302) ..+...++||||++ |.++++.++++... .|+++|++.||+++|+|+++.+|+||++++.++++..+-++ T Consensus 338 ~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d--------~~~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 338 AADAKAIAFGNFKRGYTIVDRIGTRILRD--------PYTNKPFVGFYTTKRTGGMLVDSQAIKLMKIGAATRQKAAA 407 (407) T ss_pred cCCccEEEEEeccccEEEEEeeceEEEee--------ccccCCcEEEEEEEEeccEEecccceEEEEeeccCCCCCCC Confidence 34556788999986 77899999887653 25789999999999999999999999999988776655555 No 18 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=7.4e-56 Score=322.85 Aligned_cols=287 Identities=19% Similarity=0.215 Sum_probs=235.5 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |+..+++++|.+||++++++||+.+++.++|+++++++++.++.+++|+.++.+.+.|++|++++++ ++++|+++ T Consensus 14 ~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~-----~~~~f~~v 88 (320) T protein:vir:10 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGEGDMKPI-----TKGNMTSQ 88 (320) T ss_pred hhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecCCccccc-----cccceeEE Confidence 8888888888899999999999999999999999999999998999999999999999999988664 67899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccchH Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDANE 160 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (302) ++++||++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++.+.............. ......... T Consensus 89 ~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~---~~~~~~~~~ 165 (320) T protein:vir:10 89 NIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLAD---PGGATASDL 165 (320) T ss_pred EEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCccccccccccccee---ccccccccc Confidence 99999999999999999999999999999999999999999999999999765433333222222111 111111112 Q ss_pred HHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc------------cccCcceEeecccccCCCc Q lcl|NC_011054. 161 DDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE------------SFNGFGTYFNANGAWPVGV 228 (302) Q Consensus 161 ~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~------------~~~g~p~~~~~~~~~~~~~ 228 (302) ....+.+.++...+...+..++.|+||++++.+|++|||++|+|+|.+. .+.|+|+.+.... +.++ T Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~--~~~~ 243 (320) T protein:vir:10 166 TAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDHV--ADGT 243 (320) T ss_pred ccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeeEecCCC--CCCc Confidence 2223456666777777888899999999999999999999999999753 2455666555442 3445 Q ss_pred ceEEEEecceEEEEeecCcEEEEeeccccc--------chhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCC Q lcl|NC_011054. 229 AEALVVDSSRVRIGVRQDITVKFLDQATVG--------SINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPD 300 (302) Q Consensus 229 ~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~--------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~ 300 (302) ..+++|||++++++++++++++++++.... .+++|++|++.||+++|+|+++.+|+||++|++.. +|+ T Consensus 244 ~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~----ap~ 319 (320) T protein:vir:10 244 TVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVV----TPD 319 (320) T ss_pred eEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEecc----CCC Confidence 567899999999999999999999887643 35689999999999999999999999999999654 355 Q ss_pred C Q lcl|NC_011054. 301 G 301 (302) Q Consensus 301 ~ 301 (302) + T Consensus 320 ~ 320 (320) T protein:vir:10 320 A 320 (320) T ss_pred C Confidence 5 No 19 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=1.1e-55 Score=321.86 Aligned_cols=286 Identities=18% Similarity=0.253 Sum_probs=238.3 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) +....+.+++.+||++++++|++.+++.++|+++++++|+.++.++||+.++.+.+.|++|+++.++ ++++|+++ T Consensus 27 ~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-----~~~~~~~v 101 (324) T protein:vir:99 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIET-----SKATWVNA 101 (324) T ss_pred cceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEeccCccccc-----cccceeEE Confidence 3334445567799999999999999999999999999999999999999999999999999988664 57899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccchH Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDANE 160 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (302) +++++|+++++++|+|+++|+.++++++|.++|++++++++|+++|+|+|+.. .+.+..... ........+..++ T Consensus 102 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~--~~~~~~~~~---~~~~~~~~~~~~~ 176 (324) T protein:vir:99 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSI---EKTNKVIKGDFTQ 176 (324) T ss_pred EEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCc--cCccccccc---cccceeccccCCH Confidence 99999999999999999999999999999999999999999999999988632 122222211 1112222333343 Q ss_pred HHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec---ccccCcceEeecccccCCCcceEEEEecc Q lcl|NC_011054. 161 DDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD---ESFNGFGTYFNANGAWPVGVAEALVVDSS 237 (302) Q Consensus 161 ~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~---~~~~g~p~~~~~~~~~~~~~~~~~~gd~~ 237 (302) +++.+++..+...+..++.|+||+++|..|++++|++|||+|.+ +++.|+|+.+.. ..+.+++.+++|||+ T Consensus 177 ----~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~PVv~~~--~~~~~~~~~i~gd~~ 250 (324) T protein:vir:99 177 ----DNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDTLDGLPVVNLK--SSNLKRGELITGDFD 250 (324) T ss_pred ----HHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCCCccccceeEEeec--CCCCCcceEEEEecc Confidence 45566677777778888999999999999999999999999864 467888886654 345677889999999 Q ss_pred eEEEEeecCcEEEEeeccccc--------chhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 238 RVRIGVRQDITVKFLDQATVG--------SINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 238 ~~~~~~~~~~~i~~~~~~~~~--------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) +++++++++++|+++++.... .+++|++|++.+|+++|||+++.+|+||++++.+.++...|.|- T Consensus 251 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~~~~~ 323 (324) T protein:vir:99 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGE 323 (324) T ss_pred cEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCC Confidence 999999999999999987543 25689999999999999999999999999999988777766666 No 20 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=1.2e-55 Score=321.72 Aligned_cols=286 Identities=19% Similarity=0.251 Sum_probs=237.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) +....+.+++.+||++++++|++.+++.++|+++++++|+.++.+++|+.++.+.+.|++|+++.++ ++++|+++ T Consensus 27 ~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-----~~~~~~~v 101 (324) T protein:vir:10 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIET-----SKATWVNA 101 (324) T ss_pred cceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceeEeccCccccc-----cccceeEE Confidence 3334455567799999999999999999999999999999999999999999999999999988664 57899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccchH Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDANE 160 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (302) ++++||+++++++|+|+++|+.+++++||.++|++++++++|+++|+|+|++. .+.+...... .......+..++ T Consensus 102 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~--~~~~i~~~~~---~~~~~~~~~~t~ 176 (324) T protein:vir:10 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIE---KTNKVIKGDFTQ 176 (324) T ss_pred EEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCc--cCcccccccc---ccceeccccCCH Confidence 99999999999999999999999999999999999999999999999988642 1112222111 112222333343 Q ss_pred HHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec---ccccCcceEeecccccCCCcceEEEEecc Q lcl|NC_011054. 161 DDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD---ESFNGFGTYFNANGAWPVGVAEALVVDSS 237 (302) Q Consensus 161 ~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~---~~~~g~p~~~~~~~~~~~~~~~~~~gd~~ 237 (302) +++.++...+...+..++.|+||++++..|++++|++|||+|.+ +.+.|+|+.+.. ..+.+++.+++|||+ T Consensus 177 ----~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~PV~~~~--~~~~~~~~~~~gd~~ 250 (324) T protein:vir:10 177 ----DNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDTLDGLPVVNLK--SSNLKRGELITGDFD 250 (324) T ss_pred ----HHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeecCCCCccccceeEEeec--CCCCCcceEEEEecc Confidence 45566667777778888999999999999999999999999964 457888876653 345677889999999 Q ss_pred eEEEEeecCcEEEEeeccccc--------chhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 238 RVRIGVRQDITVKFLDQATVG--------SINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 238 ~~~~~~~~~~~i~~~~~~~~~--------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) +++++++++++++++++.... .+++|++|++++|+++|||+++.+|+||++++++.++...|.|- T Consensus 251 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:10 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGE 323 (324) T ss_pred cEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCC Confidence 999999999999999987542 25689999999999999999999999999999988777655556 No 21 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=1.4e-55 Score=321.33 Aligned_cols=286 Identities=19% Similarity=0.248 Sum_probs=237.2 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) +...++++++.+||+++.++||+.+++.++|+++++++|+.++.+++|+.++.+.++|++|+++.++ ++++|+++ T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-----~~~~~~~v 101 (324) T protein:vir:78 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIET-----SKATWVNA 101 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEecCCccccc-----cccceeEE Confidence 6666677888999999999999999999999999999999998899999999999999999988664 67899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccchH Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDANE 160 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (302) ++++||++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++. .+.++..... .......+..+ T Consensus 102 ~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~--~~~gi~~~~~---~~~~~~~~~~t- 175 (324) T protein:vir:78 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIE---KTNKVIKGDFT- 175 (324) T ss_pred EEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--cCcccccccc---ccceecccccc- Confidence 99999999999999999999999999999999999999999999999988532 1122221111 11122223333 Q ss_pred HHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec---ccccCcceEeecccccCCCcceEEEEecc Q lcl|NC_011054. 161 DDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD---ESFNGFGTYFNANGAWPVGVAEALVVDSS 237 (302) Q Consensus 161 ~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~---~~~~g~p~~~~~~~~~~~~~~~~~~gd~~ 237 (302) ++++.++...+...+..++.|+||++++..|++++|++|||++.+ ..+.|+|+.+.. ....+++.+++|||+ T Consensus 176 ---~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PV~~~~--~~~~~~~~~~~gd~~ 250 (324) T protein:vir:78 176 ---QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLK--SSNLKRGELITGDFD 250 (324) T ss_pred ---HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecCCCCCcccceeeEeeC--CCCCCcceEEEEecc Confidence 445566666677778888999999999999999999999999864 467888876653 345678889999999 Q ss_pred eEEEEeecCcEEEEeecccc--------cchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 238 RVRIGVRQDITVKFLDQATV--------GSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 238 ~~~~~~~~~~~i~~~~~~~~--------~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) ++++++++++++++++++.. ..+++|++|++.||+++|+||++.+|+||++|+++.+....-.|- T Consensus 251 ~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:78 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) T ss_pred eEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCCCCC Confidence 99999999999999988753 235689999999999999999999999999999887666332233 No 22 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=1.4e-55 Score=321.33 Aligned_cols=286 Identities=19% Similarity=0.248 Sum_probs=237.2 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) +...++++++.+||+++.++||+.+++.++|+++++++|+.++.+++|+.++.+.++|++|+++.++ ++++|+++ T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-----~~~~~~~v 101 (324) T protein:vir:96 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIET-----SKATWVNA 101 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEecCCccccc-----cccceeEE Confidence 6666677888999999999999999999999999999999998899999999999999999988664 67899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccchH Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDANE 160 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (302) ++++||++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++. .+.++..... .......+..+ T Consensus 102 ~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~--~~~gi~~~~~---~~~~~~~~~~t- 175 (324) T protein:vir:96 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIE---KTNKVIKGDFT- 175 (324) T ss_pred EEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCC--cCcccccccc---ccceecccccc- Confidence 99999999999999999999999999999999999999999999999988532 1122221111 11122223333 Q ss_pred HHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec---ccccCcceEeecccccCCCcceEEEEecc Q lcl|NC_011054. 161 DDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD---ESFNGFGTYFNANGAWPVGVAEALVVDSS 237 (302) Q Consensus 161 ~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~---~~~~g~p~~~~~~~~~~~~~~~~~~gd~~ 237 (302) ++++.++...+...+..++.|+||++++..|++++|++|||++.+ ..+.|+|+.+.. ....+++.+++|||+ T Consensus 176 ---~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PV~~~~--~~~~~~~~~~~gd~~ 250 (324) T protein:vir:96 176 ---QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLK--SSNLKRGELITGDFD 250 (324) T ss_pred ---HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecCCCCCcccceeeEeeC--CCCCCcceEEEEecc Confidence 445566666677778888999999999999999999999999864 467888876653 345678889999999 Q ss_pred eEEEEeecCcEEEEeecccc--------cchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 238 RVRIGVRQDITVKFLDQATV--------GSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 238 ~~~~~~~~~~~i~~~~~~~~--------~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) ++++++++++++++++++.. ..+++|++|++.||+++|+||++.+|+||++|+++.+....-.|- T Consensus 251 ~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:96 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) T ss_pred eEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCCCCC Confidence 99999999999999988753 235689999999999999999999999999999887666332233 No 23 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=2.9e-55 Score=319.59 Aligned_cols=285 Identities=20% Similarity=0.211 Sum_probs=227.9 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |...+ +.+|.++|++++++||+.+++.++++++++++|++++.+++|+.++.+.+.|++|++++++ ++++|+++ T Consensus 20 ~~~~~-~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-----~~~~f~~i 93 (326) T protein:vir:42 20 AQTGD-SMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGDVSASWIGEGDMKPI-----TKGNMTSQ 93 (326) T ss_pred eeccc-cCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCCcceEEecCCccccc-----cccceeEE Confidence 54433 4456689999999999999999999999999999999999999999999999999988765 57899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCc--ccccccccccccccccceeeccccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPS--SWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~--g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +++++|+++++++|+|+++||..+++++|.++|++++++++|+++|+|+|++. |+.... ..........+...+.. T Consensus 94 ~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~--~~~~~~~~~~~~~~~~~ 171 (326) T protein:vir:42 94 TIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTT--KEVSLVDPDGTGSNADL 171 (326) T ss_pred EEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccc--cccceeecccccccccc Confidence 99999999999999999999999999999999999999999999999998643 322111 11111111111111111 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeeccc------------ccCcceEeecccccCC Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDES------------FNGFGTYFNANGAWPV 226 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~------------~~g~p~~~~~~~~~~~ 226 (302) +.. ...+..+...+...+...+.|+||++++..|++|||++|||||++.. +.|+|+.+.+.. +. T Consensus 172 ~~~--~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~--~~ 247 (326) T protein:vir:42 172 TVY--DAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHV--AS 247 (326) T ss_pred hhH--HHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeEEEcCCC--CC Confidence 111 12234445555666777889999999999999999999999998653 456677666543 44 Q ss_pred CcceEEEEecceEEEEeecCcEEEEeeccccc--------chhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccccc Q lcl|NC_011054. 227 GVAEALVVDSSRVRIGVRQDITVKFLDQATVG--------SINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAV 297 (302) Q Consensus 227 ~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~--------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~ 297 (302) ++..+++|||++++++++++++++++++.... .+++|++|++.||+++|+|+++.||+||++|++++++.. T Consensus 248 ~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 248 GTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDATEA 326 (326) T ss_pred CceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccccCC Confidence 56677899999999999999999999887542 356899999999999999999999999999998877664 No 24 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=2.6e-55 Score=319.82 Aligned_cols=286 Identities=19% Similarity=0.261 Sum_probs=235.2 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) +....+++++.+||++++++|++.+++.++|+++++++|+.++..+||+.++.+.++|++|++++++ ++++|+++ T Consensus 27 ~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~-----~~~~f~~i 101 (324) T protein:vir:93 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIET-----SKATWVNA 101 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeeecCCccccc-----cccceeEE Confidence 4445556677899999999999999999999999999999999899999999999999999988665 57899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccchH Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDANE 160 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (302) +++++|++++++||+|+++||.++++++|.++|++++++++|+++|+|+|++. .+....... ........+..+ T Consensus 102 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~--~~~~~~~~~---~~~~~~~~~~~~- 175 (324) T protein:vir:93 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSI---EKTNKVIKGDFT- 175 (324) T ss_pred EEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC--cCccccccc---cccceecccccc- Confidence 99999999999999999999999999999999999999999999999988531 111111111 111222223333 Q ss_pred HHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec---ccccCcceEeecccccCCCcceEEEEecc Q lcl|NC_011054. 161 DDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD---ESFNGFGTYFNANGAWPVGVAEALVVDSS 237 (302) Q Consensus 161 ~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~---~~~~g~p~~~~~~~~~~~~~~~~~~gd~~ 237 (302) ++++.+++..+...+..++.|+||++++..|++++|++|||++.+ ..+.|+|+.+.. ....+++.+++|||+ T Consensus 176 ---~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PVv~~~--~~~~~~~~i~~gdfs 250 (324) T protein:vir:93 176 ---QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLK--SSNLKRGELITGDFD 250 (324) T ss_pred ---HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecCCCCCcccceeeEeec--CCCCCcceEEEEecc Confidence 355666777777778888999999999999999999999999863 467888886653 335677889999999 Q ss_pred eEEEEeecCcEEEEeeccccc--------chhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccc-cCCCCC Q lcl|NC_011054. 238 RVRIGVRQDITVKFLDQATVG--------SINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGA-VVPDGS 302 (302) Q Consensus 238 ~~~~~~~~~~~i~~~~~~~~~--------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~-~~p~~~ 302 (302) ++++++++++++++++++... .+++|++|++.||+++|+||++.+|+||++|+.+.++. +||-=- T Consensus 251 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:93 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred eEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCCCCCCC Confidence 999999999999999987543 35689999999999999999999999999999776655 333222 No 25 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=1.1e-55 Score=321.79 Aligned_cols=278 Identities=13% Similarity=0.089 Sum_probs=229.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |+.++.++||++||+++.++|++.+++.++|++++++++++++.+++|+..+++.+.|++|+++.++. ..++|++| T Consensus 107 ~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~----~~~~~~~v 182 (401) T protein:vir:44 107 LQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGGTASGWVGETDTRSQT----ATSRLGLI 182 (401) T ss_pred hhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCccceeeccccccCcc----ccccceee Confidence 88888899999999999999999999999999999999999999999999999999999999876642 35799999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccccccccccccc---c---cee Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAAN---Q---DYT 152 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~---~---~~~ 152 (302) ++++||++++++||+|+++|+.+++++||.++|++++++++|.++|+|+|+ |.|++............ . ..+ T Consensus 183 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t 262 (401) T protein:vir:44 183 EPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVS 262 (401) T ss_pred eeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeecccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999885 56665433222211111 0 111 Q ss_pred eccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccc-c Q lcl|NC_011054. 153 IVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGA-W 224 (302) Q Consensus 153 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~-~ 224 (302) ...+..+ ++.+.+++..+...+..++.|+||++++..|++|+|++|||||+++ ++.|+|+.+.+... . T Consensus 263 ~~~~~~~----~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~ 338 (401) T protein:vir:44 263 GEATAVT----ADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIAENEQMPDI 338 (401) T ss_pred ccccccC----HHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEecCcCCc Confidence 1122223 4455566666677777888999999999999999999999999764 57889988776643 3 Q ss_pred CCCcceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 225 PVGVAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 225 ~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) ..+...+++|||++ |.++++.++++.+. .++++|++.||+++|+|+++.+|+||+.++.+.+ T Consensus 339 ~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~--------~~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 339 AADAKAIAFGNFKRGYTIVDRIGTRILRD--------PYTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred cCCccEEEEeehhccEEEEEecceEEeee--------ccccCCcEEEEEEEEeccEEecccceEEEEeecC Confidence 45566688999976 77899999887644 2578999999999999999999999999998776 No 26 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=1.8e-55 Score=320.65 Aligned_cols=285 Identities=16% Similarity=0.150 Sum_probs=229.3 Q ss_pred CC-CccCCCcceecchHHHHHHHHHHHhhhhhhhh-cceeecCCCceEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MA-DISRSEVATLIQEAYANDLLASAKKGSTVLQA-FPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma-~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~-~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) ++ ..+++.||++||+++.++||+.+++.++|+++ ++.+|+.++.+++|+.++.+.++|++|+++.++ ++++|+ T Consensus 125 ~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~~-----~~~~f~ 199 (428) T protein:vir:10 125 MAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNGNMSLPRLAGGATASYTGENQDAKV-----SEARFD 199 (428) T ss_pred hhhcccccCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCcceEEEEEeCCcceeeeccCccccc-----ccccee Confidence 22 23344688999999999999999999999999 788899888899999999999999999987654 678999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC---Ccccccccccccccccccceeecc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---PSSWVSPALLPAAVAANQDYTIVP 155 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---~~g~~~~~~~~~~~~~~~~~~~~~ 155 (302) +|++.++|++++++||+|+++||.+++++||.++|++++++++|++||+|+|+ |.|+......... ....... T Consensus 200 ~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~----~~~~~~~ 275 (428) T protein:vir:10 200 DVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNR----LLPWAAD 275 (428) T ss_pred eEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccc----ccccccc Confidence 99999999999999999999999999999999999999999999999999884 5565543322111 1111112 Q ss_pred ccchHHHHHHHhhhh--hhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc---cccCcceEeeccccc----CC Q lcl|NC_011054. 156 GDANEDDLIGCINRA--SKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE---SFNGFGTYFNANGAW----PV 226 (302) Q Consensus 156 ~~~~~~~~~~~i~~~--~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~---~~~g~p~~~~~~~~~----~~ 226 (302) ...+.+.+...+..+ .......+..++.|+||+.++..|++|+|++|||||++. ++.|+|+.+.+.... .. T Consensus 276 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~~~ 355 (428) T protein:vir:10 276 AAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRDGNGNKVYPEMAQGMLKGYPIQRTSAIPANLGEGG 355 (428) T ss_pred ccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeeceeeEEeccccccccCCC Confidence 222333333322222 223334455678899999999999999999999999643 678999988765432 34 Q ss_pred CcceEEEEecceEEEEeecCcEEEEeeccccc-----chhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 227 GVAEALVVDSSRVRIGVRQDITVKFLDQATVG-----SINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 227 ~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~-----~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) +...++||||+++++++++++++++++++... .+.+|++|++.||+++||||++.+|+||+.+++..| T Consensus 356 ~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 356 KESEIYFADFNDVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred ccceEEEEecceEEEEEecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 56789999999999999999999999886542 246899999999999999999999999999999999 No 27 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=3.7e-55 Score=318.99 Aligned_cols=286 Identities=19% Similarity=0.256 Sum_probs=235.3 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) +....+++++.+||++++++|++.++++++|+++++++|++++..+||+.++.+.+.|++|+++.++ ++++|+++ T Consensus 27 ~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-----~~~~f~~v 101 (324) T protein:vir:96 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIET-----SKATWVNA 101 (324) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeeecCCccccc-----cccceeEE Confidence 3333345577799999999999999999999999999999999999999999999999999988654 67899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccchH Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDANE 160 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (302) +++++|++++++||+|+++|+..+++++|.++|++++++++|+++|+|+|++. .+.+..... ........+..++ T Consensus 102 ~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~--~~~~~~~~~---~~~~~~~~~~~~~ 176 (324) T protein:vir:96 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSI---KKTNKVIKGDFTQ 176 (324) T ss_pred EEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCC--cCccccccc---cccceecccccch Confidence 99999999999999999999999999999999999999999999999988532 122221211 1112222333344 Q ss_pred HHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec---ccccCcceEeecccccCCCcceEEEEecc Q lcl|NC_011054. 161 DDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD---ESFNGFGTYFNANGAWPVGVAEALVVDSS 237 (302) Q Consensus 161 ~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~---~~~~g~p~~~~~~~~~~~~~~~~~~gd~~ 237 (302) +++.+++..+...+..++.|+||++++..|++++|++|||++.+ ..+.|+|+.+.. ....+++.+++|||+ T Consensus 177 ----~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~~~~~~~~~~~l~G~PV~~~~--~~~~~~~~~~~gd~s 250 (324) T protein:vir:96 177 ----DNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLK--SSNLKRGELITGDFD 250 (324) T ss_pred ----HHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecCCCCCcccceeeEeec--CCCCCcceEEEEecc Confidence 44555666667778888999999999999999999999999864 467888886643 345677889999999 Q ss_pred eEEEEeecCcEEEEeeccccc--------chhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 238 RVRIGVRQDITVKFLDQATVG--------SINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 238 ~~~~~~~~~~~i~~~~~~~~~--------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) ++++++++++++++++++... .+++|++|++.||+++|+||++.+|+||++|+.+.++-..-.|- T Consensus 251 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~ 323 (324) T protein:vir:96 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) T ss_pred eEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCCCCCC Confidence 999999999999999987542 35689999999999999999999999999999876665554455 No 28 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=4.7e-55 Score=318.41 Aligned_cols=294 Identities=20% Similarity=0.245 Sum_probs=232.6 Q ss_pred CCCccCCCcc------eecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeecccccccc---cccc Q lcl|NC_011054. 1 MADISRSEVA------TLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPE---GVKP 71 (302) Q Consensus 1 Ma~~t~~~~g------~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~---~~~~ 71 (302) |+..+..+++ .++|+++.++|++.+++.+++++++++++++++..++|+.++.+.+.|++|++.... +.++ T Consensus 10 ~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~ 89 (333) T protein:vir:78 10 NSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGVGTSNEQREGGLKP 89 (333) T ss_pred hcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEeecCccccccccccccc Confidence 4444433333 389999999999999999999999999999999999999999999999999865422 3467 Q ss_pred ccccceeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccce Q lcl|NC_011054. 72 TSEATWADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDY 151 (302) Q Consensus 72 ~s~~~f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~ 151 (302) +++++|+++++++||+++++++|+|+++|+..+++++|+++|++++++++|.++|+|+|+..+....+.......... . T Consensus 90 ~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~~~-~ 168 (333) T protein:vir:78 90 LSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIANT-T 168 (333) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCccccccccccccccc-c Confidence 889999999999999999999999999999999999999999999999999999999997554443333222211111 1 Q ss_pred eeccccchHHHHHHHhhhhhhhhhh-cccCccEEEecHHHHHHHHh---hhcCCCceeeecc-------cccCcceEeec Q lcl|NC_011054. 152 TIVPGDANEDDLIGCINRASKAVAA-AGYMPDTLLASLGFRFDVAN---LRDANGNPIFRDE-------SFNGFGTYFNA 220 (302) Q Consensus 152 ~~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~~~v~~~~~~~~l~~---l~d~~g~~i~~~~-------~~~g~p~~~~~ 220 (302) .........+..++++.+++..+.. .+..++.|+|||.++..|++ ++|++|+|+|.+. ++.|+|+.+.. T Consensus 169 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l~G~Pv~~~~ 248 (333) T protein:vir:78 169 NVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTGDVLGLPAQFGR 248 (333) T ss_pred cccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCCCceeeceeeEEcc Confidence 1111122233344556666665544 35667789999999988765 6799999999753 57788888776 Q ss_pred cccc-----CCCcceEEEEecceEEEEeecCcEEEEeeccccc-----chhhhcCCcEEEEEEEEeccEEeccccEEEEe Q lcl|NC_011054. 221 NGAW-----PVGVAEALVVDSSRVRIGVRQDITVKFLDQATVG-----SINLAERDMIALRLKARFAYVLGNGATAVGDN 290 (302) Q Consensus 221 ~~~~-----~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~-----~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt 290 (302) .... ..++..+++|||++++++++++++++++++++.. ..++|++|++.+|+++|+|+++.+|+||++++ T Consensus 249 ~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~ 328 (333) T protein:vir:78 249 AVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFV 328 (333) T ss_pred ccCCCccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEEe Confidence 5432 2345679999999999999999999999887543 24689999999999999999999999999998 Q ss_pred eeccc Q lcl|NC_011054. 291 KTPVG 295 (302) Q Consensus 291 ~~~a~ 295 (302) ++.++ T Consensus 329 ~~~a~ 333 (333) T protein:vir:78 329 DDEQP 333 (333) T ss_pred ccCCC Confidence 77655 No 29 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=4.3e-55 Score=318.62 Aligned_cols=283 Identities=13% Similarity=0.152 Sum_probs=231.3 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) ||. ++|.+||+++.++|++.++++++++++++.++++++..++|+.++.+.++|++|+++.++ ++++|+++ T Consensus 1 ma~----~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-----~~~~f~~v 71 (298) T protein:vir:94 1 MVL----NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTH-----GGVTLAPQ 71 (298) T ss_pred Cee----ccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCccccc-----cccceeEE Confidence 764 457899999999999999999999999999999998899999999999999999987654 68899999 Q ss_pred EeeeeeEEEeehhHHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDD---ASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~d---s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) ++++||+++.+++|+|++++ +..+++++|.++|++++++++|.++|+|++.+.|.......... ............ T Consensus 72 ~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~-~~~~~~~~~~~~ 150 (298) T protein:vir:94 72 TMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNH-FDSKVTQKVEAP 150 (298) T ss_pred EEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccc-cccccccccccc Confidence 99999999999999999964 45789999999999999999999999996654443322221111 111111112222 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeeccccc--CCCc Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAW--PVGV 228 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~--~~~~ 228 (302) ......++++.+++..+...+..++.|+||++++.+|++|||++|||+|++. ++.|+|+.+...... ...+ T Consensus 151 ~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~~ 230 (298) T protein:vir:94 151 RGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) T ss_pred cccccHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCc Confidence 2334456778888888888888889999999999999999999999999753 578899887775432 3455 Q ss_pred ceEEEEecceE-EEEeecCcEEEEeecccc--cchhhhcCCcEEEEEEEEeccEEeccccEEEEeeec Q lcl|NC_011054. 229 AEALVVDSSRV-RIGVRQDITVKFLDQATV--GSINLAERDMIALRLKARFAYVLGNGATAVGDNKTP 293 (302) Q Consensus 229 ~~~~~gd~~~~-~~~~~~~~~i~~~~~~~~--~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~ 293 (302) ..+++|||++. .++.+++++++++++... ..+++|++|++.+|+++|+||++.||+||++++++. T Consensus 231 ~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 67899999875 599999999999886542 345689999999999999999999999999999776 No 30 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=3.9e-55 Score=318.85 Aligned_cols=277 Identities=20% Similarity=0.247 Sum_probs=231.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCC-ceEEEEEeCCcceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTK-TTHLPVLATLPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) |...+++++|.+||++++++|++.+++.++|++++++++++++ ...+|+..+.+.+.|++|+++.++ ++++|++ T Consensus 9 ~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-----~~~~f~~ 83 (297) T protein:vir:95 9 ENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKIKT-----DKPEVVP 83 (297) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCccccc-----cccceeE Confidence 7778888899999999999999999999999999999999765 467888889999999999988654 5789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccch Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDAN 159 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (302) ++++++|++++++||+|+++|+..+++++|.+++++++++++|+++|+|+|++.+.. +.... ........+..+ T Consensus 84 v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~g---i~~~~---~~~~~~~~~~~t 157 (297) T protein:vir:95 84 VTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANS---VAKAA---KDANKVIGGPIN 157 (297) T ss_pred EEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccc---ccccc---cccceecccccC Confidence 999999999999999999999999999999999999999999999999998743321 11111 111222233344 Q ss_pred HHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc--cccCcceEeecccccCCCcceEEEEecc Q lcl|NC_011054. 160 EDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE--SFNGFGTYFNANGAWPVGVAEALVVDSS 237 (302) Q Consensus 160 ~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~--~~~g~p~~~~~~~~~~~~~~~~~~gd~~ 237 (302) ++ ++.++...+...+..++.|+||++++.+|++|+|++|+|+|++. .+.|+|+.+.. ....+++.+++|||+ T Consensus 158 ~~----~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~~~~~l~G~Pv~~~~--~~~~~~~~~~~gd~s 231 (297) T protein:vir:95 158 YD----NILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDKAANTIDGITTVDLK--SARFEKGDLLAGDFD 231 (297) T ss_pred HH----HHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecCCCCcccceeeEeec--CCCCCCceEEEEecc Confidence 44 45556666677788889999999999999999999999999864 56788876543 344677889999999 Q ss_pred eEEEEeecCcEEEEeeccccc--------chhhhcCCcEEEEEEEEeccEEeccccEEEEee-ecc Q lcl|NC_011054. 238 RVRIGVRQDITVKFLDQATVG--------SINLAERDMIALRLKARFAYVLGNGATAVGDNK-TPV 294 (302) Q Consensus 238 ~~~~~~~~~~~i~~~~~~~~~--------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~-~~a 294 (302) +++++++++++++++++.+.. .+++|++|++.+|+++|+|+++.+|+||++|+. ||+ T Consensus 232 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 232 NLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred cEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 999999999999999887542 356899999999999999999999999999984 334 No 31 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=3.7e-55 Score=319.00 Aligned_cols=283 Identities=15% Similarity=0.085 Sum_probs=227.9 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |...++++||++||++++++|++.+++.++|+++++++++.++..++|+.++++.+.|++|++..++. ..++|+++ T Consensus 130 l~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~----~~~~f~~v 205 (425) T protein:vir:10 130 LNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNMGGTTSGWVGEASQRPQT----NAATFQPL 205 (425) T ss_pred hhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEcCCcceeeeccccccccc----ccccccee Confidence 88889999999999999999999999999999999999999999999999999999999999887652 34789999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccccccccccccc--cceeeccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAAN--QDYTIVPG 156 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~--~~~~~~~~ 156 (302) +++++|++++++||+|+++|+.++++++|.++|++++++++|.+||+|+|+ |.|++............ ........ T Consensus 206 ~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~ 285 (425) T protein:vir:10 206 SFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNS 285 (425) T ss_pred eeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeeccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999875 55655432211110000 00000111 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccc-cCCCc Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGA-WPVGV 228 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~-~~~~~ 228 (302) .......++++.+++..+...+..++.|+||++++.+|++|+|++|||||+++ ++.|+|+.++++.. ...+. T Consensus 286 ~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~ 365 (425) T protein:vir:10 286 GAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAGQPATLAGYPVTEVPDMPDVAANS 365 (425) T ss_pred cccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHHhhcCCCceeeccCccCCCCceecceeeEEecCcCCccCCc Confidence 11222334555666667777788888999999999999999999999999764 57788988877654 33455 Q ss_pred ceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccc Q lcl|NC_011054. 229 AEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVG 295 (302) Q Consensus 229 ~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~ 295 (302) ..++||||++ |+++++.++++... .++.+|++.||++.|+|+++.+|+||+.++...+- T Consensus 366 ~~i~~Gd~~~~~~i~~~~~~~v~~d--------~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 366 TPILFGDFQQTYLIIDRIGVRVLRD--------PYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred cEEEEEehhccEEEEEecceEEEec--------ccccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 6789999987 67888988776543 24789999999999999999999999988866444 No 32 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=4e-55 Score=318.81 Aligned_cols=272 Identities=19% Similarity=0.165 Sum_probs=219.7 Q ss_pred CCCccCCCcceecchHHHHHHH-HHHHhhhhhhhhcceeecCCC-ceEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLL-ASAKKGSTVLQAFPTVNMGTK-TTHLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii-~~~~~~s~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) ...++++++|.++|+++..++| +.++..++++++++++++.++ .+++|+.++.+.+.|++|++++++ ++++|+ T Consensus 110 ~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~-----~~~~f~ 184 (390) T protein:vir:62 110 KRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSASIVGETAEIPE-----SYPATA 184 (390) T ss_pred hhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccccc-----ccccee Confidence 3335555556666666665555 556667778889999998764 589999999999999999988765 678999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) ++++++||++++++||+|+++||.++++++|.++|++++++++|.+||+|+|+|.|++......... ......... T Consensus 185 ~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~p~Gi~~~~~~~~~~----~~~~~~~~~ 260 (390) T protein:vir:62 185 QRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQPRGILTDASPATAT----FLATDTDSK 260 (390) T ss_pred eeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCCccccccccccccccc----eeccccccc Confidence 9999999999999999999999999999999999999999999999999999998887643222111 111112233 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccccCCCcceE Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAWPVGVAEA 231 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~~~~~~~~ 231 (302) +++. +.+++..+...+..++.|+||++++..|++|||++|||||+++ ++.|+|+.+.++. ....+ T Consensus 261 ~~~~----l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~g~~~~l~G~Pv~~~~~~----p~~~i 332 (390) T protein:vir:62 261 VSDA----LIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQYLWQSGLTVGAPSLFNGKVVETDDGM----PADKI 332 (390) T ss_pred chHH----HHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCCeeecCCcCCCccceecccceEEecCC----CCccE Confidence 4444 4445555556666778899999999999999999999999875 4678888776553 34568 Q ss_pred EEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccc Q lcl|NC_011054. 232 LVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVG 295 (302) Q Consensus 232 ~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~ 295 (302) +||||++|+++++++++++++.+ .+|++|++.||+++|+|+++.+|+||+.|+.++++ T Consensus 333 ~~gd~s~~~i~~~~~~~v~~~~~------~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 333 LFADLSKYRVRFAGSLRVDRSVD------AKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPGA 390 (390) T ss_pred EEeeccceeEEeecceEEEeecc------ccccCCcEEEEEEEEeCcEeechhheEEEEeecCC Confidence 89999999999999999998876 36899999999999999999999999999988877 No 33 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=1.4e-54 Score=315.78 Aligned_cols=296 Identities=21% Similarity=0.244 Sum_probs=228.9 Q ss_pred CCCccCCCc------ceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeecccccc--cc-cccc Q lcl|NC_011054. 1 MADISRSEV------ATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATE--PE-GVKP 71 (302) Q Consensus 1 Ma~~t~~~~------g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~--~~-~~~~ 71 (302) |+..+...+ +.+||++++++||+.+++.++|+++|++++++++.+++|+.+..+.+.|++++... .| +.++ T Consensus 10 ~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~~~~Eg~~~~ 89 (338) T protein:vir:78 10 NTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSNEQREGGTKP 89 (338) T ss_pred hhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeeccccccccccccccc Confidence 444443333 44899999999999999999999999999999999999999887777666543211 11 2355 Q ss_pred ccccceeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccce Q lcl|NC_011054. 72 TSEATWADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDY 151 (302) Q Consensus 72 ~s~~~f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~ 151 (302) +++++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|.++|+|+|+.++....+.......... . T Consensus 90 ~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~~~~~~~-~ 168 (338) T protein:vir:78 90 LSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTNNVIVNT-T 168 (338) T ss_pred ccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccccccc-c Confidence 678999999999999999999999999999999999999999999999999999999997554444333332222211 1 Q ss_pred eeccccchHHHHHHHhhhhhhhhhh-cccCccEEEecHHHHHHHH---hhhcCCCceeeecc-------cccCcceEeec Q lcl|NC_011054. 152 TIVPGDANEDDLIGCINRASKAVAA-AGYMPDTLLASLGFRFDVA---NLRDANGNPIFRDE-------SFNGFGTYFNA 220 (302) Q Consensus 152 ~~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~~~v~~~~~~~~l~---~l~d~~g~~i~~~~-------~~~g~p~~~~~ 220 (302) ............++.+.++...+.. .....+.|+||+.++..|+ +++|++|||+|.+. ++.|+|+.+.+ T Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~~l~G~PV~~~~ 248 (338) T protein:vir:78 169 NVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLAASAGDLLGLPVQFGK 248 (338) T ss_pred ccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeecccccCCCCceeeeeeEEEcc Confidence 1111222334455666666665543 4556778999999998875 56899999999653 57889998876 Q ss_pred ccc-----cCCCcceEEEEecceEEEEeecCcEEEEeecccc--------cchhhhcCCcEEEEEEEEeccEEeccccEE Q lcl|NC_011054. 221 NGA-----WPVGVAEALVVDSSRVRIGVRQDITVKFLDQATV--------GSINLAERDMIALRLKARFAYVLGNGATAV 287 (302) Q Consensus 221 ~~~-----~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~--------~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~ 287 (302) ... ...++..+++|||++++++++++++++++++++. ..+++|++|++.+|+++|+||++.||+||+ T Consensus 249 ~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~ 328 (338) T protein:vir:78 249 AVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFV 328 (338) T ss_pred ccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEeecccceE Confidence 543 2235577999999999999999999999998753 235789999999999999999999999999 Q ss_pred EEeeecccccCCCC Q lcl|NC_011054. 288 GDNKTPVGAVVPDG 301 (302) Q Consensus 288 ~lt~~~a~~~~p~~ 301 (302) +|++..++ ++ T Consensus 329 ~l~~~~~~----~~ 338 (338) T protein:vir:78 329 KFVDDEDP----DA 338 (338) T ss_pred EEecccCC----CC Confidence 99985444 44 No 34 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=1.1e-54 Score=316.37 Aligned_cols=285 Identities=18% Similarity=0.198 Sum_probs=225.2 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) ||..+ +++|.+||++++++|++.++++++|+++++++|++++..+||+.++.+.++|++|++++++ ++++|+++ T Consensus 1 Mat~t-t~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~-----~~~~f~~v 74 (311) T protein:vir:99 1 MATFG-TGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQKSS-----TTGEFDFV 74 (311) T ss_pred Cceec-CCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCccccc-----ccceeeEE Confidence 99665 5677899999999999999999999999999999988899999999999999999988664 67899999 Q ss_pred EeeeeeEEEeehhHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVD---DASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~---ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) ++++||+++++++|+|+++ |+..+++++|+++|++++++++|+++|+|+|+++|....+............+. .. T Consensus 75 ~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~--~~ 152 (311) T protein:vir:99 75 TSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVEL--TA 152 (311) T ss_pred EEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeec--cc Confidence 9999999999999999995 667899999999999999999999999999876554433322222111111111 11 Q ss_pred chHHHHHHHhhhhhhhhhhc--ccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccc----- Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAA--GYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGA----- 223 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~--~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~----- 223 (302) .+...+.+++.+++..+... ...++.|+||+.++..|++|||++|||||++. .+.|+|+.+..... T Consensus 153 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~ 232 (311) T protein:vir:99 153 DTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDEA 232 (311) T ss_pred cccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeEeeccccccccc Confidence 22233344555555554433 34556799999999999999999999999753 57899998765421 Q ss_pred -------cCCCcceEEEEecce-EEEEeecCcEEEEeeccc-ccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 224 -------WPVGVAEALVVDSSR-VRIGVRQDITVKFLDQAT-VGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 224 -------~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~-~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) .......+++|||++ +.++.+++++++.++++. ..++++|++||+.+|+++|+||++.|| +|++++...| T Consensus 233 ~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~~~A 311 (311) T protein:vir:99 233 DPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIENAVA 311 (311) T ss_pred ccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecCh-hHeeeecccC Confidence 122445678999987 558999999999988763 345779999999999999999999996 5677777666 No 35 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=1.6e-54 Score=315.50 Aligned_cols=284 Identities=12% Similarity=0.045 Sum_probs=221.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-Ccceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLAT-LPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) |..++++++|++||+++..+||+.+++.++|++++++++++++.++||+.++ .+.+.|++|++..++ ++++|++ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~-----s~~~f~~ 225 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-----SSEEFAR 225 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCccccc-----cccccee Confidence 7788888999999999999999999999999999999999999999999876 468999999987654 6799999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccccccccccccccceee---- Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAANQDYTI---- 153 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~~~~~~---- 153 (302) |++.+||++++++||+||++|+. ++++||.++|++++++++|.+||+|+|+ |.|++................. T Consensus 226 i~~~~~k~a~~~~iS~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) T protein:vir:78 226 VYEQVGKVANALTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) T ss_pred eEeeeeeeEeecHhHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhh Confidence 99999999999999999999975 6999999999999999999999999885 5565543322111110000000 Q ss_pred ----------------------------------------ccccchHHHHHHHhhhhhhhh-hhcccCccEEEecHHHHH Q lcl|NC_011054. 154 ----------------------------------------VPGDANEDDLIGCINRASKAV-AAAGYMPDTLLASLGFRF 192 (302) Q Consensus 154 ----------------------------------------~~~~~~~~~~~~~i~~~~~~~-~~~~~~~~~~v~~~~~~~ 192 (302) .....+.......+..+...+ ...+..++.|+||+.+|. T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~ 384 (497) T protein:vir:78 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) T ss_pred hhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHH Confidence 000001111222233333333 345567778999999999 Q ss_pred HHHhhhcCCCceeeecc-------------cccCcceEeecccccCCCcceEEEEecce--EEEEeecCcEEEEeecccc Q lcl|NC_011054. 193 DVANLRDANGNPIFRDE-------------SFNGFGTYFNANGAWPVGVAEALVVDSSR--VRIGVRQDITVKFLDQATV 257 (302) Q Consensus 193 ~l~~l~d~~g~~i~~~~-------------~~~g~p~~~~~~~~~~~~~~~~~~gd~~~--~~~~~~~~~~i~~~~~~~~ 257 (302) .|++|||++|||||+++ ++.|+|+.+.... ..+.+++|||++ +.++++.+++|+++++.. T Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~----~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~- 459 (497) T protein:vir:78 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLI----PLGTILVGHFAPSVIQTARREGVTMQMTNSNG- 459 (497) T ss_pred HHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCC----CCCceEEeecccceEEEEEecccEEEeecccc- Confidence 99999999999999763 4668888776654 345678999986 457899999999987643 Q ss_pred cchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 258 GSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 258 ~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) ++|++|++.||+++|+|+.|.+|+||++++.+.++ +|| T Consensus 460 ---~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~----~~~ 497 (497) T protein:vir:78 460 ---TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA----TGS 497 (497) T ss_pred ---hhhhcCcEEEEEEEeecceeeccccEEEEEecCCc----cCC Confidence 46999999999999999999999999999976433 344 No 36 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=1.6e-54 Score=315.50 Aligned_cols=284 Identities=12% Similarity=0.045 Sum_probs=221.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-Ccceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLAT-LPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) |..++++++|++||+++..+||+.+++.++|++++++++++++.++||+.++ .+.+.|++|++..++ ++++|++ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~-----s~~~f~~ 225 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-----SSEEFAR 225 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCccccc-----cccccee Confidence 7788888999999999999999999999999999999999999999999876 468999999987654 6799999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccccccccccccccceee---- Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAANQDYTI---- 153 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~~~~~~---- 153 (302) |++.+||++++++||+||++|+. ++++||.++|++++++++|.+||+|+|+ |.|++................. T Consensus 226 i~~~~~k~a~~~~iS~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) T protein:vir:10 226 VYEQVGKVANALTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) T ss_pred eEeeeeeeEeecHhHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhh Confidence 99999999999999999999975 6999999999999999999999999885 5565543322111110000000 Q ss_pred ----------------------------------------ccccchHHHHHHHhhhhhhhh-hhcccCccEEEecHHHHH Q lcl|NC_011054. 154 ----------------------------------------VPGDANEDDLIGCINRASKAV-AAAGYMPDTLLASLGFRF 192 (302) Q Consensus 154 ----------------------------------------~~~~~~~~~~~~~i~~~~~~~-~~~~~~~~~~v~~~~~~~ 192 (302) .....+.......+..+...+ ...+..++.|+||+.+|. T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~ 384 (497) T protein:vir:10 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWE 384 (497) T ss_pred hhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHH Confidence 000001111222233333333 345567778999999999 Q ss_pred HHHhhhcCCCceeeecc-------------cccCcceEeecccccCCCcceEEEEecce--EEEEeecCcEEEEeecccc Q lcl|NC_011054. 193 DVANLRDANGNPIFRDE-------------SFNGFGTYFNANGAWPVGVAEALVVDSSR--VRIGVRQDITVKFLDQATV 257 (302) Q Consensus 193 ~l~~l~d~~g~~i~~~~-------------~~~g~p~~~~~~~~~~~~~~~~~~gd~~~--~~~~~~~~~~i~~~~~~~~ 257 (302) .|++|||++|||||+++ ++.|+|+.+.... ..+.+++|||++ +.++++.+++|+++++.. T Consensus 385 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~----~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~- 459 (497) T protein:vir:10 385 LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLI----PLGTILVGHFAPSVIQTARREGVTMQMTNSNG- 459 (497) T ss_pred HHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCC----CCCceEEeecccceEEEEEecccEEEeecccc- Confidence 99999999999999763 4668888776654 345678999986 457899999999987643 Q ss_pred cchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 258 GSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 258 ~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) ++|++|++.||+++|+|+.|.+|+||++++.+.++ +|| T Consensus 460 ---~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~----~~~ 497 (497) T protein:vir:10 460 ---TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA----TGS 497 (497) T ss_pred ---hhhhcCcEEEEEEEeecceeeccccEEEEEecCCc----cCC Confidence 46999999999999999999999999999976433 344 No 37 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=2.7e-53 Score=308.75 Aligned_cols=270 Identities=15% Similarity=0.163 Sum_probs=223.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc--eEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT--THLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) |+.+++++||.+||+++.++|++.+++.++|++++++++++++. +.+|+..+.+.+.|++|+++.++. +.++|+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~----~~~~~~ 181 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPET----DNPKFS 181 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeeccccccccc----ccccce Confidence 88888889999999999999999999999999999999987654 456777788899999999886642 358999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) ++++.+||++++++||+|+++||.++++++|.++|++++++++|.++++|+|+... .+.. T Consensus 182 ~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~--------------------~~~~ 241 (392) T protein:vir:10 182 NVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK--------------------QAIK 241 (392) T ss_pred eEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--------------------cCcc Confidence 99999999999999999999999999999999999999999999999999875321 1112 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEee-cc-----cccC Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFN-AN-----GAWP 225 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~-~~-----~~~~ 225 (302) +++.+.+.+ ...+...+..++.|+||+++|..|++|||++|||||+++ ++.|.|+.+. .+ .... T Consensus 242 ~~d~i~~~~---~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~ 318 (392) T protein:vir:10 242 SLDDIKDVL---NVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTT 318 (392) T ss_pred CHHHHHHHH---HHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCccc Confidence 334333333 235566777788999999999999999999999999765 4667654332 22 2233 Q ss_pred CCcceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeec-ccccCCCC Q lcl|NC_011054. 226 VGVAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTP-VGAVVPDG 301 (302) Q Consensus 226 ~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~-a~~~~p~~ 301 (302) .++..+++|||+. |.++++.+++++++++.. .+|++|++.||+++|+|+++.+|++|++++.++ +++++|+| T Consensus 319 ~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~----~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 319 AKKAPLIIGDLKEAIVLFKREDMELASTDVGG----KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred CCceEEEEEehhceEEEEeecceEEEEecccc----chhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 5667789999987 678999999999987643 469999999999999999999999999999765 66667999 No 38 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=2.7e-53 Score=308.75 Aligned_cols=270 Identities=15% Similarity=0.163 Sum_probs=223.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc--eEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT--THLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) |+.+++++||.+||+++.++|++.+++.++|++++++++++++. +.+|+..+.+.+.|++|+++.++. +.++|+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~----~~~~~~ 181 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPET----DNPKFS 181 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeeccccccccc----ccccce Confidence 88888889999999999999999999999999999999987654 456777788899999999886642 358999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) ++++.+||++++++||+|+++||.++++++|.++|++++++++|.++++|+|+... .+.. T Consensus 182 ~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~--------------------~~~~ 241 (392) T protein:vir:10 182 NVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK--------------------QAIK 241 (392) T ss_pred eEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--------------------cCcc Confidence 99999999999999999999999999999999999999999999999999875321 1112 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEee-cc-----cccC Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFN-AN-----GAWP 225 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~-~~-----~~~~ 225 (302) +++.+.+.+ ...+...+..++.|+||+++|..|++|||++|||||+++ ++.|.|+.+. .+ .... T Consensus 242 ~~d~i~~~~---~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~ 318 (392) T protein:vir:10 242 SLDDIKDVL---NVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTT 318 (392) T ss_pred CHHHHHHHH---HHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCccc Confidence 334333333 235566777788999999999999999999999999765 4667654332 22 2233 Q ss_pred CCcceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeec-ccccCCCC Q lcl|NC_011054. 226 VGVAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTP-VGAVVPDG 301 (302) Q Consensus 226 ~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~-a~~~~p~~ 301 (302) .++..+++|||+. |.++++.+++++++++.. .+|++|++.||+++|+|+++.+|++|++++.++ +++++|+| T Consensus 319 ~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~----~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 319 AKKAPLIIGDLKEAIVLFKREDMELASTDVGG----KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred CCceEEEEEehhceEEEEeecceEEEEecccc----chhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 5667789999987 678999999999987643 469999999999999999999999999999765 66667999 No 39 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=2.7e-53 Score=308.75 Aligned_cols=270 Identities=15% Similarity=0.163 Sum_probs=223.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc--eEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT--THLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) |+.+++++||.+||+++.++|++.+++.++|++++++++++++. +.+|+..+.+.+.|++|+++.++. +.++|+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~----~~~~~~ 181 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPET----DNPKFS 181 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeeccccccccc----ccccce Confidence 88888889999999999999999999999999999999987654 456777788899999999886642 358999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) ++++.+||++++++||+|+++||.++++++|.++|++++++++|.++++|+|+... .+.. T Consensus 182 ~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~--------------------~~~~ 241 (392) T protein:vir:10 182 NVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK--------------------QAIK 241 (392) T ss_pred eEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--------------------cCcc Confidence 99999999999999999999999999999999999999999999999999875321 1112 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEee-cc-----cccC Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFN-AN-----GAWP 225 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~-~~-----~~~~ 225 (302) +++.+.+.+ ...+...+..++.|+||+++|..|++|||++|||||+++ ++.|.|+.+. .+ .... T Consensus 242 ~~d~i~~~~---~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~ 318 (392) T protein:vir:10 242 SLDDIKDVL---NVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTT 318 (392) T ss_pred CHHHHHHHH---HHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCccc Confidence 334333333 235566777788999999999999999999999999765 4667654332 22 2233 Q ss_pred CCcceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeec-ccccCCCC Q lcl|NC_011054. 226 VGVAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTP-VGAVVPDG 301 (302) Q Consensus 226 ~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~-a~~~~p~~ 301 (302) .++..+++|||+. |.++++.+++++++++.. .+|++|++.||+++|+|+++.+|++|++++.++ +++++|+| T Consensus 319 ~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~----~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 319 AKKAPLIIGDLKEAIVLFKREDMELASTDVGG----KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred CCceEEEEEehhceEEEEeecceEEEEecccc----chhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 5667789999987 678999999999987643 469999999999999999999999999999765 66667999 No 40 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=2.7e-53 Score=308.75 Aligned_cols=270 Identities=15% Similarity=0.163 Sum_probs=223.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc--eEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT--THLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) |+.+++++||.+||+++.++|++.+++.++|++++++++++++. +.+|+..+.+.+.|++|+++.++. +.++|+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~----~~~~~~ 181 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPET----DNPKFS 181 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeeccccccccc----ccccce Confidence 88888889999999999999999999999999999999987654 456777788899999999886642 358999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) ++++.+||++++++||+|+++||.++++++|.++|++++++++|.++++|+|+... .+.. T Consensus 182 ~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~--------------------~~~~ 241 (392) T protein:vir:10 182 NVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK--------------------QAIK 241 (392) T ss_pred eEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--------------------cCcc Confidence 99999999999999999999999999999999999999999999999999875321 1112 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEee-cc-----cccC Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFN-AN-----GAWP 225 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~-~~-----~~~~ 225 (302) +++.+.+.+ ...+...+..++.|+||+++|..|++|||++|||||+++ ++.|.|+.+. .+ .... T Consensus 242 ~~d~i~~~~---~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~ 318 (392) T protein:vir:10 242 SLDDIKDVL---NVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTT 318 (392) T ss_pred CHHHHHHHH---HHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCccc Confidence 334333333 235566777788999999999999999999999999765 4667654332 22 2233 Q ss_pred CCcceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeec-ccccCCCC Q lcl|NC_011054. 226 VGVAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTP-VGAVVPDG 301 (302) Q Consensus 226 ~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~-a~~~~p~~ 301 (302) .++..+++|||+. |.++++.+++++++++.. .+|++|++.||+++|+|+++.+|++|++++.++ +++++|+| T Consensus 319 ~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~----~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 319 AKKAPLIIGDLKEAIVLFKREDMELASTDVGG----KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred CCceEEEEEehhceEEEEeecceEEEEecccc----chhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 5667789999987 678999999999987643 469999999999999999999999999999765 66667999 No 41 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=1.7e-53 Score=309.94 Aligned_cols=271 Identities=14% Similarity=0.082 Sum_probs=229.3 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE---eCCcceeeeccccccccccccccccce Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVL---ATLPGASWVSESATEPEGVKPTSEATW 77 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~---~~~~~a~~v~E~~~~~~~~~~~s~~~f 77 (302) |+..++++||++||++++++||+.+++.++|+++++++|++++...+++. +..+.++|++|+++.++. ++++| T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~----~~~~~ 184 (397) T protein:vir:48 109 KTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTN----DDPKL 184 (397) T ss_pred hhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccccccccc----cccce Confidence 88888899999999999999999999999999999999998876665643 345679999999887652 35899 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 78 ADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 78 ~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) ++|+++++|++++++||+|+++|+.+++++||.++|++++++++|.++|+|+|+.... .+. T Consensus 185 ~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~~-------------------~~~ 245 (397) T protein:vir:48 185 YPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIATLPTK-------------------PTL 245 (397) T ss_pred eeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-------------------ccc Confidence 9999999999999999999999999999999999999999999999999999864311 111 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccc---cCCC Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGA---WPVG 227 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~---~~~~ 227 (302) .++ +.+.++...+...+..++.|+||++++..|++|||++|||||+++ .+.|+|+.++.+.. ...+ T Consensus 246 ~~~----d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~ 321 (397) T protein:vir:48 246 TKW----DDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSG 321 (397) T ss_pred ccH----HHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceeccceeEEecccccCCcCCC Confidence 233 345556666677778889999999999999999999999999754 57889988776422 2356 Q ss_pred cceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 228 VAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 228 ~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) +..+++|||+. +.++++++++++++++.. .+|++|++.||+++|+|+++.+|++|++++.+.++..+|+-+ T Consensus 322 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~ 393 (397) T protein:vir:48 322 AMPLYFGDLKQAVTLFDRQQMSLLSTNIGG----GAFETDTTKIRVIDRFDVVATDTESFVPASFKAIADQKGNLG 393 (397) T ss_pred ceEEEEEeccceEEEEeecceEEEEeccch----hhhhcCceeEEEEeeeccEEecccceEEEEecccccCCCCcc Confidence 77899999996 568999999999887643 469999999999999999999999999999998888777766 No 42 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=1.9e-53 Score=309.64 Aligned_cols=271 Identities=15% Similarity=0.080 Sum_probs=228.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceE--EEEEe-CCcceeeeccccccccccccccccce Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTH--LPVLA-TLPGASWVSESATEPEGVKPTSEATW 77 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~--~p~~~-~~~~a~~v~E~~~~~~~~~~~s~~~f 77 (302) |+..++++||.+||+++.++|++.+++.++|++++++++++++..+ +|+.. ..+.+.|++|+++.++. ..++| T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~----~~~~~ 184 (397) T protein:vir:49 109 KTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGGQIGQN----DDPKL 184 (397) T ss_pred hhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeeccccccccc----cccce Confidence 9989999999999999999999999999999999999998876544 55443 34689999999887652 34799 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 78 ADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 78 ~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) ++|+++++|++++++||+|+++|+..++++||.++|++++++++|.+||+|+|+.+. ..+. T Consensus 185 ~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~~-------------------~~~~ 245 (397) T protein:vir:49 185 SLIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGTLPN-------------------KPTL 245 (397) T ss_pred eeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-------------------cccc Confidence 999999999999999999999999999999999999999999999999999986421 0111 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccc---cCCC Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGA---WPVG 227 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~---~~~~ 227 (302) .++ +.+.+++..+...+..++.|+||+++|..|++|||++|||||+++ ++.|+|+.++.+.. ...+ T Consensus 246 ~~~----d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~ 321 (397) T protein:vir:49 246 AKW----DDIIDLQAKVDPAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGG 321 (397) T ss_pred cCH----HHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCceecceeeEEecccccccccCC Confidence 233 345556666777788899999999999999999999999999754 57888988765432 2345 Q ss_pred cceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 228 VAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 228 ~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) +..+++|||+. |+++++++++++++++.. .+|++|++.||+++|+|+++.+|+||++++.++.+..+|..+ T Consensus 322 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~----~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~ 393 (397) T protein:vir:49 322 AMPLYFGDLKQAVTLFDRQHLSLLSTNIGG----GAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAIADQKAKLS 393 (397) T ss_pred ceeEEEeeccceEEEEeecccEEEEecccc----chhhcCeeeEEEEEeeccEEecccceEEEEecccccccCccc Confidence 67799999986 778999999999987653 469999999999999999999999999999887777666666 No 43 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=3.1e-53 Score=308.44 Aligned_cols=271 Identities=15% Similarity=0.089 Sum_probs=225.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc--eEEEEEe-CCcceeeeccccccccccccccccce Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT--THLPVLA-TLPGASWVSESATEPEGVKPTSEATW 77 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~~p~~~-~~~~a~~v~E~~~~~~~~~~~s~~~f 77 (302) |+..++++||++||+++.++|++.+++.++|+++++++++++.. +.+|+.. ..+.+.|++|+++.++ ++.++| T Consensus 109 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~----~~~~~~ 184 (397) T protein:vir:49 109 KTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIAD----VDDPKL 184 (397) T ss_pred hhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeeecCcccccc----ccccce Confidence 88889999999999999999999999999999999999987554 5556554 4578999999988764 256899 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 78 ADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 78 ~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) +++++++||++++++||+|+++|+.+++++||.++|++++++++|.++++|+|+.... .+. T Consensus 185 ~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~~-------------------~~~ 245 (397) T protein:vir:49 185 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAALPTK-------------------PTL 245 (397) T ss_pred eeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------------------ccc Confidence 9999999999999999999999999999999999999999999999999998864311 111 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccc---cCCC Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGA---WPVG 227 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~---~~~~ 227 (302) .++ +.+.+++..+...+..++.|+||++++..|++|||++|||||+++ .+.|+|+.++.+.. ...+ T Consensus 246 ~~~----d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~ 321 (397) T protein:vir:49 246 TKW----DDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGG 321 (397) T ss_pred ccH----HHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecccccccccCC Confidence 223 445566666677778889999999999999999999999999764 68889988766422 2345 Q ss_pred cceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccccc-CCCCC Q lcl|NC_011054. 228 VAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAV-VPDGS 302 (302) Q Consensus 228 ~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~-~p~~~ 302 (302) +..+++|||++ |+++++++++++++++.. .+|++|++.||++.|+|+++.+|++|++++.+.++.. ...+| T Consensus 322 ~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~ 394 (397) T protein:vir:49 322 AMPLYFGDLKQAVTLFDRQHMSLLSTNIGG----GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNLGS 394 (397) T ss_pred ceeEEEeeccceEEEEeecceEEEEecccc----chhhcCceeEEEEeeeCcEEecccceEEEEeecccCCCCCccc Confidence 66799999996 678999999999987643 4699999999999999999999999999997765543 33344 No 44 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=2.4e-53 Score=309.04 Aligned_cols=272 Identities=18% Similarity=0.152 Sum_probs=218.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhh-hhhhhhcceeecCCC-ceEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKG-STVLQAFPTVNMGTK-TTHLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~-s~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) ....+++++|.++|+++.+++|..++.. ++++++++++++.++ .+.+|+.++.+.++|++|+++.++ ++++|+ T Consensus 110 ~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-----~~~~f~ 184 (392) T protein:vir:13 110 KRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGETAEIPE-----SYPATT 184 (392) T ss_pred hhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccccc-----ccccee Confidence 4445666677788888888887766555 567778888887654 589999999999999999988665 678999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccccccccccccccceeeccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) ++++++||++++++||+|+++|+.+++++||.++|++++++++|.+||+|+|+ |.|++...... .. .......+ T Consensus 185 ~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~--~~--~~~~~~~~ 260 (392) T protein:vir:13 185 QRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGA--NA--AFGEADAD 260 (392) T ss_pred eEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccccc--cc--cccccccc Confidence 99999999999999999999999999999999999999999999999999875 55554322111 00 01111122 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccccCCCcc Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAWPVGVA 229 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~~~~~~ 229 (302) ..++ +.+.+++..+...+..++.|+||++++..|++|+|++|||||+++ ++.|+|+.+.... ... T Consensus 261 ~~~~----d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~----~~~ 332 (392) T protein:vir:13 261 SKVS----DALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAPDTFNGKVVETDDGM----PAD 332 (392) T ss_pred cccH----HHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEcCCC----CCC Confidence 2333 445556666666677788999999999999999999999999865 4778888776553 356 Q ss_pred eEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccc Q lcl|NC_011054. 230 EALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVG 295 (302) Q Consensus 230 ~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~ 295 (302) .+++|||++|+++++++++++.+++ .+|.+|++.||++.|+|+++.||+||+.++.++++ T Consensus 333 ~i~~Gdf~~~~i~~~~~~~i~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 333 KVLFADLSKYRVRFAGSLRVDRSVD------AKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPAA 392 (392) T ss_pred cEEEeeccceeEEeecceEEEeecc------ccccCCcEEEEEEEEeccEEecccceEEEEeeccC Confidence 7899999999999999999998866 36899999999999999999999999999988777 No 45 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=6.4e-53 Score=306.74 Aligned_cols=272 Identities=15% Similarity=0.107 Sum_probs=223.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE--e-CCcceeeeccccccccccccccccce Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVL--A-TLPGASWVSESATEPEGVKPTSEATW 77 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~--~-~~~~a~~v~E~~~~~~~~~~~s~~~f 77 (302) |..+++++||++||++++++||+.+++.++|+++++++++++...++|+. . ..+.+.|++|+++.++. +.++| T Consensus 116 ~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~----~~~~~ 191 (408) T protein:vir:10 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDL----DNPQL 191 (408) T ss_pred hhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeecCccccccc----cCcce Confidence 88888899999999999999999999999999999999998766555543 3 44779999999887652 45899 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 78 ADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 78 ~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) ++|++++||++++++||+|+++|+.+++++||.++|++++++++|.+|++|+|+.... .+. T Consensus 192 ~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~~-------------------~~~ 252 (408) T protein:vir:10 192 TIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK-------------------PTI 252 (408) T ss_pred eeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-------------------ccc Confidence 9999999999999999999999999999999999999999999999999998864210 111 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeeccc---ccCCC Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANG---AWPVG 227 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~---~~~~~ 227 (302) .+++.+.+ .+...+...+..++.|+||+++|..|+++||++|||||+++ ++.|+|+.+..+. ....+ T Consensus 253 ~~~~~l~~---~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~ 329 (408) T protein:vir:10 253 AKFDDVIT---MINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGST 329 (408) T ss_pred ccHHHHHH---HHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCcCCCCCceecceeeEEecccccCccCCC Confidence 22333332 23345566677788999999999999999999999999764 6789999887642 22345 Q ss_pred cceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccccc-----CCCC Q lcl|NC_011054. 228 VAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAV-----VPDG 301 (302) Q Consensus 228 ~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~-----~p~~ 301 (302) +..+++|||+. |.++++++++++++++.. ..|++|++.||+++|+|+++.+|++|++++.+++++. +|++ T Consensus 330 ~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~----~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~ 405 (408) T protein:vir:10 330 VYPLYYGDMSQAITLFDRENMSLLPTNIGA----GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTS 405 (408) T ss_pred ceEEEEEehhccEEEEEecceEEEEccccc----chhhcCceEEEEEEeeccEEeccccEEEEEeeccccCCCCCCCCCc Confidence 56789999997 678999999999987653 4589999999999999999999999999998775443 3444 Q ss_pred C Q lcl|NC_011054. 302 S 302 (302) Q Consensus 302 ~ 302 (302) | T Consensus 406 ~ 406 (408) T protein:vir:10 406 T 406 (408) T ss_pred c Confidence 4 No 46 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=5.4e-53 Score=307.15 Aligned_cols=282 Identities=13% Similarity=0.081 Sum_probs=229.4 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccc-cccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVK-PTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~-~~s~~~f~~ 79 (302) ....+.+++|.++|++++++|++.+++.++|+++++++|+.++...+|+.++.+.+.|++|++..++... ..++++|++ T Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~ 241 (458) T protein:vir:10 162 NQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKE 241 (458) T ss_pred hhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecCCcceeeccccccccccccccccccccee Confidence 2233456788999999999999999999999999999999999999999999999999999998887643 346789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccccccccccccccceeecccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) ++++++|++++++||+|+++|+.+++++||.++|++++++++|.+||+|+|+ |.|+...................... T Consensus 242 i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 321 (458) T protein:vir:10 242 IHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKADGSVL 321 (458) T ss_pred eEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeeccccccccc Confidence 9999999999999999999999999999999999999999999999998875 56665543322221111111111222 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec-----------ccccCcceEeecccccCC Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD-----------ESFNGFGTYFNANGAWPV 226 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~-----------~~~~g~p~~~~~~~~~~~ 226 (302) .+ ++.+.+++..+...+..++.|+||+.+|..|++|+|++|||||++ .++.|+|+.++....... T Consensus 322 ~~----~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~ 397 (458) T protein:vir:10 322 VT----AKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKA 397 (458) T ss_pred cc----HHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEcccccccc Confidence 23 345556666777778888999999999999999999999999864 247789998887766666 Q ss_pred CcceEEEEecc-eEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 227 GVAEALVVDSS-RVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 227 ~~~~~~~gd~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) +...+++|||+ .|.++++.++++.+.+ ++.+|++.||++.|+|+.+.+|++|++.+.+.+ T Consensus 398 ~~~~~~~~~f~~~~~~~~~~~~~v~~d~--------~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 398 NSAEFAVIVYKDNFVMPRQRAVTVERER--------QAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred CCcceEEEEecccEEEEEeeceEEEeec--------ccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 67788999995 5789999999887543 467899999999999999999999998665544 No 47 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=6.4e-53 Score=306.72 Aligned_cols=271 Identities=15% Similarity=0.095 Sum_probs=224.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc--eEEEEEe-CCcceeeeccccccccccccccccce Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT--THLPVLA-TLPGASWVSESATEPEGVKPTSEATW 77 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~~p~~~-~~~~a~~v~E~~~~~~~~~~~s~~~f 77 (302) |+..++++||++||++++++|++.++++++|+++++++|+.+.. +.+|... ..+.+.|++|+++.++. ++++| T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~----~~~~~ 80 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADI----DDPKL 80 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccc----cccce Confidence 99999999999999999999999999999999999999987654 5566654 46789999999887653 46899 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 78 ADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 78 ~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) +++++++||+++++++|+|+++|+.++++++|.+++++++++++|++|+.|+|+... ..+. T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~-------------------~~~~ 141 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPT-------------------KPTL 141 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccc-------------------cccc Confidence 999999999999999999999999999999999999999999999999998775321 1122 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeeccccc---CCC Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAW---PVG 227 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~---~~~ 227 (302) .+++ ++.+++..+...+..++.|+||++++..|++|||++|||||+++ .+.|+|+.++.+... ..+ T Consensus 142 ~~~d----~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~ 217 (293) T protein:vir:48 142 TKWD----DIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSG 217 (293) T ss_pred cCHH----HHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecceeeEEecccccCCccCC Confidence 2344 44555566666777888999999999999999999999999865 578889887655332 234 Q ss_pred cceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 228 VAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 228 ~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) +..+++|||++ |+++++++++++++++.. ++|++|++.+|+++|+|+++.+|+||++++.+.+........ T Consensus 218 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~----~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~ 289 (293) T protein:vir:48 218 VMPLYFGDLKQAVTLFDRQQMSLLSTNIGG----GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNIG 289 (293) T ss_pred ceEEEEEeccceEEEEEecceEEEEecccc----hhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCCcccc Confidence 56789999987 678999999999987643 469999999999999999999999999999776555222222 No 48 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=8.9e-53 Score=305.95 Aligned_cols=280 Identities=12% Similarity=0.109 Sum_probs=219.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecC----CCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMG----TKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~----~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) |...+...||.++|+++.++||+.+++.+++++++...... .+.+++|+.++++.++|++|++..+ +++++ T Consensus 338 ~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~-----~s~~~ 412 (645) T protein:vir:93 338 TTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKP-----LTKFD 412 (645) T ss_pred ccccccccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCcccc-----ccccc Confidence 33344445888999999999999999999999987553222 2357999999999999999998755 47899 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCC-cccccccccccccccccceeecc Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKP-SSWVSPALLPAAVAANQDYTIVP 155 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~-~g~~~~~~~~~~~~~~~~~~~~~ 155 (302) |+++++++||+++++++|+||++|+.+++++||.++|++++++++|.+||+|+|.+ .+..+.++.... .. ... T Consensus 413 f~~v~l~~~kla~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~-----~~-~~~ 486 (645) T protein:vir:93 413 FESITFSHAKVSAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDV-----KG-TAS 486 (645) T ss_pred eeEEEEeeEEEEEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccc-----cc-ccc Confidence 99999999999999999999999999999999999999999999999999998753 222222221111 11 111 Q ss_pred ccchHHHHHHHhhhhhhhhhhc--ccCccEEEecHHHHHHHHhhhcCCCceeee-----cccccCcceEeecccccCCCc Q lcl|NC_011054. 156 GDANEDDLIGCINRASKAVAAA--GYMPDTLLASLGFRFDVANLRDANGNPIFR-----DESFNGFGTYFNANGAWPVGV 228 (302) Q Consensus 156 ~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~-----~~~~~g~p~~~~~~~~~~~~~ 228 (302) +.... .++..++..+... ....+.|+||+.++..|++|||++|+|+|. ++++.|+|+.+..... T Consensus 487 ~~~~~----~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~~~~~~~~~tL~G~PV~~s~~vp----- 557 (645) T protein:vir:93 487 SGNPD----ADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNALGQKEYPDMTLLGGSFQGLPVIVSQYVG----- 557 (645) T ss_pred ccchH----HHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccCCceeecCCCCCCceeeceeeEEeccCC----- Confidence 11122 2334444444332 334568999999999999999999999984 3468899998877643 Q ss_pred ceEEEEecceEEEEeecCcEEEEeeccccc----------------chhhhcCCcEEEEEEEEeccEEeccccEEEEeee Q lcl|NC_011054. 229 AEALVVDSSRVRIGVRQDITVKFLDQATVG----------------SINLAERDMIALRLKARFAYVLGNGATAVGDNKT 292 (302) Q Consensus 229 ~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~----------------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~ 292 (302) ..+++|||+.+++|+++++.|..++++++. .+++|++||+++|+++|+||++.||+||++|++. T Consensus 558 ~~~~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~ 637 (645) T protein:vir:93 558 DQLVLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGV 637 (645) T ss_pred cceeEeccccEEEEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecc Confidence 236899999999999999999988877652 3568999999999999999999999999999999 Q ss_pred cccccCCC Q lcl|NC_011054. 293 PVGAVVPD 300 (302) Q Consensus 293 ~a~~~~p~ 300 (302) .|++...- T Consensus 638 ~~g~~~~~ 645 (645) T protein:vir:93 638 NYGSASGG 645 (645) T ss_pred cCCcccCC Confidence 99984333 No 49 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=1.3e-52 Score=304.99 Aligned_cols=283 Identities=10% Similarity=0.082 Sum_probs=227.2 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCC--CceEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGT--KTTHLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~--~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) |...++++||.+||+++.++|++.+++.++|+++++++++++ +.+.||+..+.+.+.|++|++..++.. .+++|+ T Consensus 110 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~~---~~~~f~ 186 (404) T protein:vir:10 110 ISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLSENQQIPTNG---DNGKLE 186 (404) T ss_pred hccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCcceeeccccccccccc---ccccee Confidence 888888999999999999999999999999999999998864 457788888999999999998876532 358999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +++++++|++++++||+|+++|+.+++++||.++|++++++++|.+||+|+|++.. +.++.... .......++.. T Consensus 187 ~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~--~~gi~~~~---~~~~~~~~~~~ 261 (404) T protein:vir:10 187 RFNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGGDEH--ATGIMTAN---KFKKITLPKSP 261 (404) T ss_pred eeEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCc--ccceeecc---ccceeeccccc Confidence 99999999999999999999999999999999999999999999999999885321 11111111 11112222333 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccc--cCCCcc Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGA--WPVGVA 229 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~--~~~~~~ 229 (302) +.+.+.+.+. ..+...+..++.|+||+++|..|++|||++|||+|.++ .+.|+|+.++.+.. ...++. T Consensus 262 ~~~~~~~~~~---~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~ 338 (404) T protein:vir:10 262 ALKDFKKCKN---VELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAI 338 (404) T ss_pred cHHHHHHHHH---hhhhccccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCccccceeeEEecccccCCCCCcc Confidence 3443333222 24556667778899999999999999999999999764 57888887654422 234567 Q ss_pred eEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCC Q lcl|NC_011054. 230 EALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPD 300 (302) Q Consensus 230 ~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~ 300 (302) .+++|||++ +.++.+++++++++++.. ..|++|++.||+++|+|+++.+|+||++++.+.++. |+ T Consensus 339 ~~~~gd~s~~~~~~~~~~~~i~~~~~~~----~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~aa~--~~ 404 (404) T protein:vir:10 339 PVLLGDTKEAYKYVSDGAYELATTNIGA----GAFETNTTKARIIMRIDGNVKDSEALLIAEIPVESV--QA 404 (404) T ss_pred EEEEEeccccEEEEEecceEEEEecccc----chhhcCceEEEEEEeeccEEecccceEEEEeecccC--CC Confidence 789999986 678899999999887643 458999999999999999999999999999887766 55 No 50 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=1.8e-52 Score=304.22 Aligned_cols=272 Identities=13% Similarity=0.089 Sum_probs=223.2 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEE--Ee-CCcceeeeccccccccccccccccce Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPV--LA-TLPGASWVSESATEPEGVKPTSEATW 77 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~--~~-~~~~a~~v~E~~~~~~~~~~~s~~~f 77 (302) ....++++||++||++++++||+.+++.++|+++++++|++++...+++ .. ..+.+.|++|+++.++. +.++| T Consensus 107 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~----~~~~f 182 (395) T protein:vir:38 107 SGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESALIGDN----DDPEL 182 (395) T ss_pred hccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccccccccccccccc----cccce Confidence 4445566789999999999999999999999999999999876555543 33 35678999999887653 35899 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 78 ADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 78 ~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) ++|++++||++++++||+|+++|+.++++++|.++|++++++++|.+|++|+|++.... +. T Consensus 183 ~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~-------------------~~ 243 (395) T protein:vir:38 183 TVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPKKP-------------------TI 243 (395) T ss_pred eeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-------------------cc Confidence 99999999999999999999999999999999999999999999999999988643210 11 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccccC--CCc Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAWP--VGV 228 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~~--~~~ 228 (302) .+++.+.+.+ ...+...+..++.|+||+.+|..|++|+|++|||||+++ ++.|+|+.++.+...+ .++ T Consensus 244 ~~~~~i~~~~---~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~ 320 (395) T protein:vir:38 244 SQFDNIKDLE---NNTLDPAIESTSSFITNQSGYNILSKVKDADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGS 320 (395) T ss_pred ccHHHHHHHH---HHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceeccceeEEecccccCcCCCc Confidence 1233333322 234556667788899999999999999999999999754 5788998887653332 456 Q ss_pred ceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 229 AEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 229 ~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) ..+++|||++ |+++++++++++++++.. .+|++|++.||++.|+|+++.+|++|++++.++++...|..| T Consensus 321 ~~i~~gd~~~~~~i~~~~~~~i~~~~~~~----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~ 391 (395) T protein:vir:38 321 HPLYFGDLKQGITLFDRQQMQIDTTNVGA----GSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVANQAQGTA 391 (395) T ss_pred ceEEEEeccccEEEEEecceEEEEecccc----chhhcCceEEEEEEeeccEEecccceEEEEeecccCCCCCcc Confidence 6789999986 778999999999988643 469999999999999999999999999999998887777777 No 51 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=4e-52 Score=302.35 Aligned_cols=279 Identities=13% Similarity=0.098 Sum_probs=224.7 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE--eCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVL--ATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~--~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) .+..++++|+.+||+++.++|++.+++.++|++++++++++++..++|+. .+.+.+.|++|+++.++. +.++|+ T Consensus 121 ~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~----~~~~~~ 196 (415) T protein:vir:46 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPEL----AVKPFF 196 (415) T ss_pred hccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccccccc----ccccee Confidence 34456677889999999999999999999999999999999887777754 567789999999887653 468999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +|+++++|++++++||+|+++|+.+++++||.++|++++++++|.++++|+|++....... .... .......++.. T Consensus 197 ~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~--~~~~--~~~~~~~~~~~ 272 (415) T protein:vir:46 197 QLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS--GFEK--EGKKLEVKKAK 272 (415) T ss_pred eEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccc--cccc--ccceecccccc Confidence 9999999999999999999999999999999999999999999999999988643322111 1111 11122223333 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeeccccc-CCCcce Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAW-PVGVAE 230 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~-~~~~~~ 230 (302) +++ ++.+++..+...++.++.|+||+++|..|++|+|++|||||+++ .+.|+|+.+..+... ..++.. T Consensus 273 ~~~----~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:46 273 SLD----DIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred chH----HHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccE Confidence 444 44555556666677888999999999999999999999999754 577889887765543 345667 Q ss_pred EEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 231 ALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 231 ~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) +++|||++ |++++++++++++++ |.++++.+|+++|+|+++.+|+||++++.++++. |.|| T Consensus 349 ~~~gd~~~~~~~~~~~~~~v~~~~---------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~--~~~~ 410 (415) T protein:vir:46 349 LIIGNLKDAIVLFDRSQYQASWTD---------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSER--GEGD 410 (415) T ss_pred EEEEehhccEEEEeecceEEEeec---------cccCceEEEEEEEeccEEeccccEEEEEeeccCC--CCCC Confidence 99999997 667889999998875 4567788999999999999999999999987666 9999 No 52 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=4e-52 Score=302.35 Aligned_cols=279 Identities=13% Similarity=0.098 Sum_probs=224.7 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE--eCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVL--ATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~--~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) .+..++++|+.+||+++.++|++.+++.++|++++++++++++..++|+. .+.+.+.|++|+++.++. +.++|+ T Consensus 121 ~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~----~~~~~~ 196 (415) T protein:vir:47 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPEL----AVKPFF 196 (415) T ss_pred hccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccccccc----ccccee Confidence 34456677889999999999999999999999999999999887777754 567789999999887653 468999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +|+++++|++++++||+|+++|+.+++++||.++|++++++++|.++++|+|++....... .... .......++.. T Consensus 197 ~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~--~~~~--~~~~~~~~~~~ 272 (415) T protein:vir:47 197 QLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS--GFEK--EGKKLEVKKAK 272 (415) T ss_pred eEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccc--cccc--ccceecccccc Confidence 9999999999999999999999999999999999999999999999999988643322111 1111 11122223333 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeeccccc-CCCcce Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAW-PVGVAE 230 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~-~~~~~~ 230 (302) +++ ++.+++..+...++.++.|+||+++|..|++|+|++|||||+++ .+.|+|+.+..+... ..++.. T Consensus 273 ~~~----~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:47 273 SLD----DIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred chH----HHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccE Confidence 444 44555556666677888999999999999999999999999754 577889887765543 345667 Q ss_pred EEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 231 ALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 231 ~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) +++|||++ |++++++++++++++ |.++++.+|+++|+|+++.+|+||++++.++++. |.|| T Consensus 349 ~~~gd~~~~~~~~~~~~~~v~~~~---------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~--~~~~ 410 (415) T protein:vir:47 349 LIIGNLKDAIVLFDRSQYQASWTD---------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSER--GEGD 410 (415) T ss_pred EEEEehhccEEEEeecceEEEeec---------cccCceEEEEEEEeccEEeccccEEEEEeeccCC--CCCC Confidence 99999997 667889999998875 4567788999999999999999999999987666 9999 No 53 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=2.5e-52 Score=303.45 Aligned_cols=273 Identities=15% Similarity=0.123 Sum_probs=224.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-Ccceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLAT-LPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) ....+++++|.+||++++++|++.+++.++|+++++.++++++.+++|+.++ .+.+.|++|+++.++ ++++|++ T Consensus 135 ~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-----~~~~f~~ 209 (418) T protein:vir:10 135 TVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTVETGFTNNAAAVAEGAQKPT-----SDLKFNL 209 (418) T ss_pred hccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeEEEEecCCCceeeeccCccccc-----cccceee Confidence 4455677789999999999999999999999999999999998899999876 688999999987654 6789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC---Ccccccccccccccccccceeeccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---PSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) |+++++|++++++||+|+++++ .++++||.++|++++++++|.+||+|+|+ |.|++...... ......++ T Consensus 210 v~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~------~~~~~~~~ 282 (418) T protein:vir:10 210 KNQPVRTIAHLFKASRQILDDA-PALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAF------MPSITLAN 282 (418) T ss_pred EEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccc------cccccccc Confidence 9999999999999999999987 58999999999999999999999999886 44554432111 11111111 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec------ccccCcceEeecccccCCCcce Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD------ESFNGFGTYFNANGAWPVGVAE 230 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~------~~~~g~p~~~~~~~~~~~~~~~ 230 (302) ...++++.+++..+...+..++.|+||+.++..|++++|++|||||.+ ..+.|+|+.+.... ..+. T Consensus 283 ----~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~l~G~pV~~~~~~----p~~~ 354 (418) T protein:vir:10 283 ----ATPIDKIRLALLQAVLAEFPATGIVLNPIDWASIELTKDSQGRYIVGNPVNGTTPRLWNLPVVETQAM----TANE 354 (418) T ss_pred ----cccHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccccccCCCceecceeeEEcCCC----CCCc Confidence 222345566666777778888899999999999999999999999963 35778888776543 3556 Q ss_pred EEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCC Q lcl|NC_011054. 231 ALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDG 301 (302) Q Consensus 231 ~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~ 301 (302) +++|||++ +++++++++++.++++.. .+|++|++.||+++|+||++.+|+||++++.+++++ | T Consensus 355 ~~~gd~s~~~~~~~~~~~~i~~~~~~~----~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~~----g 418 (418) T protein:vir:10 355 FLVGAFSMAAQIFDRMEIEVLLSTENV----DDFEKNMVSIRAEERLALAVYRPESFVTGALVEQAG----G 418 (418) T ss_pred EEEeeccceEEEEEecceEEEEecccc----hhhhcCceEEEEEEeeccEEecccceEEEEeccCCC----C Confidence 89999987 678899999999887643 469999999999999999999999999999775443 2 No 54 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=3e-52 Score=303.02 Aligned_cols=283 Identities=11% Similarity=0.044 Sum_probs=222.4 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) ++..+++++|++||+++.++|++.+++.++|+++++++|++++...+|+.++.+.+.|++|++++++ .++++|++| T Consensus 84 ~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~~~~~a~~~~E~~~~~~----~~~~~f~~i 159 (390) T protein:vir:40 84 IAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVGDVATAWWGPLCAEIKE----VLDNGFDKI 159 (390) T ss_pred HhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEcCCcceeeeccccccCc----cccccceee Confidence 7778888999999999999999999999999999999999999999999999999999999887654 357899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccccccccccccccceeeccccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) ++++||++++++||+|+++|+.+++++||+++|++++++++|++||+|+|+ |.|++........ ........... T Consensus 160 ~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~---~~~~~~~~~~~ 236 (390) T protein:vir:40 160 QTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTA---GEHPVKTATPL 236 (390) T ss_pred EeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeeccccccc---ccccccccccc Confidence 999999999999999999999999999999999999999999999999875 5565543211111 11111122223 Q ss_pred hHHHHHHHhhhhhhhhh---hcccCccEEEecHHHH-H---HHHhhhcCCCceeeecccccCcceEeecccccCCCcceE Q lcl|NC_011054. 159 NEDDLIGCINRASKAVA---AAGYMPDTLLASLGFR-F---DVANLRDANGNPIFRDESFNGFGTYFNANGAWPVGVAEA 231 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~---~~~~~~~~~v~~~~~~-~---~l~~l~d~~g~~i~~~~~~~g~p~~~~~~~~~~~~~~~~ 231 (302) +.....+++..+...+. .....++.|+||+.++ . .++.++|.+|+|+|... ..|+|+.+.... .++.+ T Consensus 237 t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~v~~~~-~~g~pvv~~~~~----p~~~i 311 (390) T protein:vir:40 237 TDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMTPQGVWVTGIL-PVPLEIVQSVAV----PVGKA 311 (390) T ss_pred chhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccCCCCccccccC-CCceeEEEcCCC----CCCcE Confidence 33444444444444332 2345577899998874 3 34578999999998653 457777655443 35568 Q ss_pred EEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 232 LVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 232 ~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) +||||++|+++++++++++++++. +|.+|++.||+..|+|+++.+++||+.++.+++.. +|+-+ T Consensus 312 ~~Gd~s~~~i~~~~~~~v~~~~~~------~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~~~-~~~~~ 375 (390) T protein:vir:40 312 VAGRAKDYFMGIGSEQVIRTSTEY------RLLDDETLYYAKQYANGRPKDNSSFLVFDITGLEG-SPAID 375 (390) T ss_pred EEEeeceEEEEeecceEEEecchh------hhhcCcEEEEEEEEeCCEEecccceEEEEeeccCC-CCCCC Confidence 999999999999999999988753 58999999999999999999999999998665432 22222 No 55 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=2e-52 Score=304.02 Aligned_cols=278 Identities=12% Similarity=0.053 Sum_probs=225.1 Q ss_pred CCCccCCCcceecchHHHHHHH-HHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLL-ASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii-~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) ....++++||++||+++..+|| +.++..+++.+++++.++ ++.+.+|+.++.+.+.|++|++++++ ++++|++ T Consensus 250 ~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~-~g~~~~~~~~~~~~a~~v~Eg~~~~~-----~~~~~~~ 323 (543) T protein:vir:81 250 AMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA-TGDVWHGVSSAAVQWSWDAEFEEVSD-----DSPEFGQ 323 (543) T ss_pred hcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccC-CcceEEEEecCCcceeecccCccccc-----cccccce Confidence 3345677899999999998876 667888999999998766 45688999999999999999988654 6789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC---Ccccccccccccccccccceeeccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---PSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) |+++++|++++++||+|+++|+ +++.++|.+.|++++++++|.+||+|+|+ |.|+....... .....+.... T Consensus 324 i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~----~~~~~~~~~~ 398 (543) T protein:vir:81 324 PEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGT----AAEIAPVTAE 398 (543) T ss_pred eeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhcccc----cccccccccc Confidence 9999999999999999999997 69999999999999999999999999985 45554322111 1111112222 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec------ccccCcceEeecccccC----- Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD------ESFNGFGTYFNANGAWP----- 225 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~------~~~~g~p~~~~~~~~~~----- 225 (302) .. .++++.+++..+...+..++.|+||+.+|..|++++|++|+|||.+ +++.|+|+.++...... T Consensus 399 ~~----~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~ 474 (543) T protein:vir:81 399 TF----ALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGGAGLWTTIGNGEPSQLLGRPVGEAEAMDANWNTSA 474 (543) T ss_pred cc----cHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCCCceeccCcCCCCCccccceeeEEeccccccccccc Confidence 22 3455666667777777788899999999999999999999999975 35889999888764322 Q ss_pred -CCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccc Q lcl|NC_011054. 226 -VGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVG 295 (302) Q Consensus 226 -~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~ 295 (302) .+...++||||++|+++++++++|+++.+.... ..|.+|++.||+++|+||++.+|+||++++.+.++ T Consensus 475 ~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~--~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 475 SADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGT--NRRPNGSRGWFAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred cCCcceEEEeeccceeEEeecccEEEEecccccc--chhhcCceEEEEEEeeccEeecccceEEEEecccC Confidence 355678999999999999999999988765322 35788999999999999999999999999988776 No 56 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=4.8e-52 Score=301.95 Aligned_cols=272 Identities=15% Similarity=0.118 Sum_probs=222.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEE--EEEeC-Ccceeeeccccccccccccccccce Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHL--PVLAT-LPGASWVSESATEPEGVKPTSEATW 77 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~--p~~~~-~~~a~~v~E~~~~~~~~~~~s~~~f 77 (302) |...++++||.+||++++++||+.+++.++|+++++.+|++++...+ ++..+ .+.+.|++|+++.++. ++++| T Consensus 116 ~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~----~~~~~ 191 (408) T protein:vir:74 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDL----DNPRL 191 (408) T ss_pred hcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccccccccccccc----cccce Confidence 77788888999999999999999999999999999999998765544 44433 4677899999887642 56899 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 78 ADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 78 ~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) ++|+++++|++++++||+|+++|+.+++++||.++|++++++++|.++|+|+|+... ..+. T Consensus 192 ~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~-------------------~~~~ 252 (408) T protein:vir:74 192 TIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPK-------------------KPTI 252 (408) T ss_pred eeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------------------cccc Confidence 999999999999999999999999999999999999999999999999999886321 0111 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecc---cccCCC Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNAN---GAWPVG 227 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~---~~~~~~ 227 (302) .+++.+.+. +...+...+..++.|+||+.++.+|++|||++|+|||+++ .+.|+|+.+..+ +....+ T Consensus 253 ~~~~~i~~~---~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~ 329 (408) T protein:vir:74 253 ANFDDVITM---INTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGST 329 (408) T ss_pred ccHHHHHHH---HHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCCCCceecceeeEEecCcccccccCC Confidence 233333332 3345566777788999999999999999999999999754 578899887754 333456 Q ss_pred cceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc---cccCCCCC Q lcl|NC_011054. 228 VAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV---GAVVPDGS 302 (302) Q Consensus 228 ~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a---~~~~p~~~ 302 (302) +..+++|||++ |.++++++++++++++.. ..|++|++.+|+++|+|+++.+|+||++++.+++ .+.+|..+ T Consensus 330 ~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~----~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 404 (408) T protein:vir:74 330 VYPLYYGDMSQAITLFDRENMSLLPTNIGA----GAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIADQVGNFKTTT 404 (408) T ss_pred cceEEEEehhccEEEEEecceEEEEecccc----chhhcceeeEEEEEeeCcEEecccceEEEEeecccCCCCCCCCCc Confidence 77899999996 678999999999987643 4589999999999999999999999999997543 33344444 No 57 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=5.6e-52 Score=301.55 Aligned_cols=279 Identities=12% Similarity=0.066 Sum_probs=224.2 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEE--EEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHL--PVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~--p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) ....++++||.+||+++.++|++.+++.++|++++++++|+++..++ ++..+...+.|++|+++.++. +.++|+ T Consensus 121 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~----~~~~~~ 196 (415) T protein:vir:81 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPEL----AVKPFF 196 (415) T ss_pred hccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcc----ccccee Confidence 44456677889999999999999999999999999999998765554 456677889999999887653 457999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +++++++|++++++||+|+++|+.+++++||.++|++++++++|.++++|+|++.+...... ... ........+.. T Consensus 197 ~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~--~~~--~~~~~~~~~~~ 272 (415) T protein:vir:81 197 QLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSG--FEK--EGKKLEVKKAK 272 (415) T ss_pred eEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccc--ccc--ccccccccccc Confidence 99999999999999999999999999999999999999999999999999886443221111 111 11122223333 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccccC-CCcce Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAWP-VGVAE 230 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~~-~~~~~ 230 (302) ++ +++.+++..+...++.++.|+||+++|..|+++||++|||||+++ .+.|+|+.+..+...+ .++.. T Consensus 273 ~~----~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:81 273 SL----DDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred ch----hHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccE Confidence 44 444555556666677889999999999999999999999999764 5788898887665433 45667 Q ss_pred EEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 231 ALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 231 ~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) ++||||++ |++++++++++++++ |.++++.+|+++|+|+++.+|+||++++.++++. |.|| T Consensus 349 ~~~Gd~~~~~~~~~~~~~~v~~~~---------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~--~~~~ 410 (415) T protein:vir:81 349 LIIGNLKDAIVLFDRSQYQASWTD---------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSER--GEGD 410 (415) T ss_pred EEEEehhccEEEEeecceEEEEec---------cccCceEEEEEEEeccEEeccccEEEEEEeccCC--CCCc Confidence 99999997 567889999998775 3456678999999999999999999999986665 9999 No 58 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=5.6e-52 Score=301.55 Aligned_cols=279 Identities=12% Similarity=0.066 Sum_probs=224.2 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEE--EEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHL--PVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~--p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) ....++++||.+||+++.++|++.+++.++|++++++++|+++..++ ++..+...+.|++|+++.++. +.++|+ T Consensus 121 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~----~~~~~~ 196 (415) T protein:vir:98 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPEL----AVKPFF 196 (415) T ss_pred hccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcc----ccccee Confidence 44456677889999999999999999999999999999998765554 456677889999999887653 457999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +++++++|++++++||+|+++|+.+++++||.++|++++++++|.++++|+|++.+...... ... ........+.. T Consensus 197 ~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~--~~~--~~~~~~~~~~~ 272 (415) T protein:vir:98 197 QLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSG--FEK--EGKKLEVKKAK 272 (415) T ss_pred eEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccc--ccc--ccccccccccc Confidence 99999999999999999999999999999999999999999999999999886443221111 111 11122223333 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccccC-CCcce Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAWP-VGVAE 230 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~~-~~~~~ 230 (302) ++ +++.+++..+...++.++.|+||+++|..|+++||++|||||+++ .+.|+|+.+..+...+ .++.. T Consensus 273 ~~----~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:98 273 SL----DDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred ch----hHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccE Confidence 44 444555556666677889999999999999999999999999764 5788898887665433 45667 Q ss_pred EEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 231 ALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 231 ~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) ++||||++ |++++++++++++++ |.++++.+|+++|+|+++.+|+||++++.++++. |.|| T Consensus 349 ~~~Gd~~~~~~~~~~~~~~v~~~~---------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~--~~~~ 410 (415) T protein:vir:98 349 LIIGNLKDAIVLFDRSQYQASWTD---------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSER--GEGD 410 (415) T ss_pred EEEEehhccEEEEeecceEEEEec---------cccCceEEEEEEEeccEEeccccEEEEEEeccCC--CCCc Confidence 99999997 567889999998775 3456678999999999999999999999986665 9999 No 59 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=5.6e-52 Score=301.55 Aligned_cols=279 Identities=12% Similarity=0.066 Sum_probs=224.2 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEE--EEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHL--PVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~--p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) ....++++||.+||+++.++|++.+++.++|++++++++|+++..++ ++..+...+.|++|+++.++. +.++|+ T Consensus 121 ~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~----~~~~~~ 196 (415) T protein:vir:79 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPEL----AVKPFF 196 (415) T ss_pred hccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcc----ccccee Confidence 44456677889999999999999999999999999999998765554 456677889999999887653 457999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +++++++|++++++||+|+++|+.+++++||.++|++++++++|.++++|+|++.+...... ... ........+.. T Consensus 197 ~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~--~~~--~~~~~~~~~~~ 272 (415) T protein:vir:79 197 QLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSG--FEK--EGKKLEVKKAK 272 (415) T ss_pred eEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccc--ccc--ccccccccccc Confidence 99999999999999999999999999999999999999999999999999886443221111 111 11122223333 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccccC-CCcce Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAWP-VGVAE 230 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~~-~~~~~ 230 (302) ++ +++.+++..+...++.++.|+||+++|..|+++||++|||||+++ .+.|+|+.+..+...+ .++.. T Consensus 273 ~~----~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:79 273 SL----DDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred ch----hHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccE Confidence 44 444555556666677889999999999999999999999999764 5788898887665433 45667 Q ss_pred EEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 231 ALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 231 ~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) ++||||++ |++++++++++++++ |.++++.+|+++|+|+++.+|+||++++.++++. |.|| T Consensus 349 ~~~Gd~~~~~~~~~~~~~~v~~~~---------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~--~~~~ 410 (415) T protein:vir:79 349 LIIGNLKDAIVLFDRSQYQASWTD---------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSER--GEGD 410 (415) T ss_pred EEEEehhccEEEEeecceEEEEec---------cccCceEEEEEEEeccEEeccccEEEEEEeccCC--CCCc Confidence 99999997 567889999998775 3456678999999999999999999999986665 9999 No 60 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=6.6e-52 Score=301.18 Aligned_cols=263 Identities=13% Similarity=0.084 Sum_probs=221.3 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc--eEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT--THLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) |+..++++||.+||+++.++|++.+++.++|+++++.++++++. +.+++..+.+.+.|++|+++.++ ++.++|+ T Consensus 91 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~----~~~~~f~ 166 (371) T protein:vir:81 91 MSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGE----KATPQFT 166 (371) T ss_pred hccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeecccccccc----cccccee Confidence 99999999999999999999999999999999999999998765 44556667789999999987654 2568999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +++++++|+++.++||+|+++|+.+++++||.++|++++++++|.++++|+|+.... +.. T Consensus 167 ~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~~--------------------~~~ 226 (371) T protein:vir:81 167 LLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKAKT--------------------AIA 226 (371) T ss_pred eEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--------------------ccc Confidence 999999999999999999999999999999999999999999999999998864210 111 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeeccccc------- Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAW------- 224 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~------- 224 (302) +.+.+... +...+...+..++.|+||+++|..|++|||++|||||+++ ++.|+|+.++++... T Consensus 227 ~~~~i~~~---~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 303 (371) T protein:vir:81 227 DLDGLKQI---INVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSPTGRQLLGLPVVIVSNKVLANRVDGG 303 (371) T ss_pred cHHHHHHH---HHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCeeeecccCCCCCceecceeEEEecccccCcccccc Confidence 22333222 2234455666778999999999999999999999999754 578899988876432 Q ss_pred -CCCcceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 225 -PVGVAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 225 -~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) ......+++|||+. +.++++.+++++++++.. ++|++|++.||++.|+|+++.+|++|++++.+.+ T Consensus 304 ~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~----~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 304 TGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAM----DAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred ccCCcceEEEEehhceEEEEeecceEEEEecccc----chhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 24567799999986 678899999999987643 4699999999999999999999999999998877 No 61 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=7.8e-52 Score=300.77 Aligned_cols=272 Identities=16% Similarity=0.123 Sum_probs=222.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE---eCCcceeeeccccccccccccccccce Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVL---ATLPGASWVSESATEPEGVKPTSEATW 77 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~---~~~~~a~~v~E~~~~~~~~~~~s~~~f 77 (302) |..+++++||++||++++++|++.+++.++|+++++++|++++...+++. +..+.+.|++|+++.++ ++.++| T Consensus 116 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~----~~~~~f 191 (404) T protein:vir:39 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPD----LDNPRL 191 (404) T ss_pred hhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCcccccc----ccccce Confidence 78888899999999999999999999999999999999998776555543 34477999999988664 256899 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 78 ADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 78 ~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) ++++++++|++++++||+|+++|+.+++++||.++|++++++++|+++|+|+|+... ..+. T Consensus 192 ~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~~-------------------~~~~ 252 (404) T protein:vir:39 192 TIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPK-------------------KPTI 252 (404) T ss_pred eeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-------------------cccc Confidence 999999999999999999999999999999999999999999999999999886321 0111 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccc---cCCC Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGA---WPVG 227 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~---~~~~ 227 (302) .+++.+.+.+ ...+...+..++.|+||+++|..|++|||++|||||+++ .+.|+|+.++.+.. .... T Consensus 253 ~~~~~i~~~~---~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~ 329 (404) T protein:vir:39 253 AKFDDVITMI---NTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGST 329 (404) T ss_pred ccHHHHHHHH---HHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceecceeEEEecccccCccCCC Confidence 2233333332 234455666778899999999999999999999999754 57899998876533 2345 Q ss_pred cceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc---cccCCCCC Q lcl|NC_011054. 228 VAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV---GAVVPDGS 302 (302) Q Consensus 228 ~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a---~~~~p~~~ 302 (302) +..+++|||++ +.++++++++++++++.. ++|++|++.+|+++|+|+++.+|+||++++.+++ +...|+|- T Consensus 330 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~~~~~~~~~~ 404 (404) T protein:vir:39 330 VYPLYYGDMSQAITLFDRENMSLLPTNIGA----GAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAIADQVGNFTAGK 404 (404) T ss_pred ccEEEEEeccccEEEEeecceEEEEeccch----hhhhhceeeEEEEeeeccEEecccceEEEEeeccccCCCCCCCCC Confidence 56799999986 678999999999987643 4689999999999999999999999999997654 33445555 No 62 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=6.3e-52 Score=301.30 Aligned_cols=267 Identities=15% Similarity=0.113 Sum_probs=225.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCC-cceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATL-PGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) +...+++++|.++|++++++||+.+++.++|+++++.+++.++.+++|+.+.. +.+.|++|+++.++ ++++|++ T Consensus 113 ~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-----~~~~~~~ 187 (390) T protein:vir:97 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPE-----SSLKFAK 187 (390) T ss_pred hhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEEEecCCcceeeecCCccccc-----cccceeE Confidence 77778888999999999999999999999999999999999999999998764 68999999988665 6789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC---Ccccccccccccccccccceeeccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---PSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) +++++||+++.+++|+|+++|+ .+++++|.++|++++++++|.++|+|+|+ |.|+.+.... .. ... T Consensus 188 i~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~------~~----~~~ 256 (390) T protein:vir:97 188 KTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATT------YA----APT 256 (390) T ss_pred EEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeecccc------cc----ccc Confidence 9999999999999999999997 58999999999999999999999999885 3444332211 11 111 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc------cccCcceEeecccccCCCcce Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE------SFNGFGTYFNANGAWPVGVAE 230 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~------~~~g~p~~~~~~~~~~~~~~~ 230 (302) ..+.+..++.+.++...+...+..++.|+||+++|..|++|||++|+|||+++ .+.|+|+.+.+. ..++. T Consensus 257 ~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~----~~~~~ 332 (390) T protein:vir:97 257 TIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQA----MAPGE 332 (390) T ss_pred cccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCccCCCCceecceeeEEcCC----CCCCc Confidence 12234445667777778888888899999999999999999999999999754 567888877654 34567 Q ss_pred EEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeee Q lcl|NC_011054. 231 ALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKT 292 (302) Q Consensus 231 ~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~ 292 (302) +++|||++ |.++++.+++++++++. .+|++|++.||+++|+||++.+|+||++++-. T Consensus 333 ~~~gd~~~~~~~~~~~~~~i~~~~~~-----~~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 333 FLVGAFDLAAQIFDQWDARVEIGYVN-----DDFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred EEEEeccceEEEEEecceEEEEeecc-----cccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 89999986 67889999999988654 35899999999999999999999999999977 No 63 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=8.2e-52 Score=300.65 Aligned_cols=263 Identities=13% Similarity=0.065 Sum_probs=220.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCC--ceEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTK--TTHLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~--~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) |+.+++++||.+||+++.++||+.+++.++|++++++++++++ .+.+++.++.+.+.|++|+++.++. +.++|+ T Consensus 123 ~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~----~~~~~~ 198 (397) T protein:vir:12 123 MSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFSPVEELGNLPEI----DQPRFT 198 (397) T ss_pred ccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCcceeeeccccccccc----ccccce Confidence 8888899999999999999999999999999999999998754 5667777888999999999886542 468999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +|+++++|++++++||+|+++|+.+++++||.++|++++++++|.+|++|+|+.+. .+.. T Consensus 199 ~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~~~--------------------~g~~ 258 (397) T protein:vir:12 199 KVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIASLKK--------------------VDID 258 (397) T ss_pred eEEeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc--------------------cccc Confidence 99999999999999999999999999999999999999999999999999886321 0111 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccc--cCCCcc Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGA--WPVGVA 229 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~--~~~~~~ 229 (302) +.+.+.+. +...+...+..++.|+||+++|.+|++|||++|||+|+++ ++.|+|+.++.+.. ...++. T Consensus 259 ~~~~i~~~---~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~ 335 (397) T protein:vir:12 259 GLDGIKKA---LNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKA 335 (397) T ss_pred cHHHHHHH---HhhccchhhhCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCccccceeeEEecccccccCCCcc Confidence 22322222 3245566777888999999999999999999999999754 57788987776532 334567 Q ss_pred eEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 230 EALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 230 ~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) .+++|||+. +.++++++++++++++.. .+|++|++.||+++|+|+++.+|+||++++.|.- T Consensus 336 ~~~~gd~~~~~~~~~~~~~~i~~~~~~~----~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 336 PLIIGNLKEAIVLFDREQQSIASTDTGA----GAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred EEEEEehhceEEEEeecceEEEEecccc----chhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 799999987 568889999999887643 4689999999999999999999999999998755 No 64 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=8.3e-52 Score=300.62 Aligned_cols=272 Identities=16% Similarity=0.102 Sum_probs=225.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-Ccceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLAT-LPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) +...+++.+|.++|++++++||+.+++.++|+++++.+++.++.+++|+.++ .+.+.|++|+++.++ ++++|++ T Consensus 113 ~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-----~~~~~~~ 187 (395) T protein:vir:43 113 AITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYVRETGFVNNAAPVSEGTQKPY-----SDLTFEL 187 (395) T ss_pred hhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEEEEEecCCCceeeecCCccccc-----cccceeE Confidence 4445667788999999999999999999999999999999998899999876 478999999987654 6789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCC---cccccccccccccccccceeeccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKP---SSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~---~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) ++++++|++++++||+|+++++. ++++||.++|++++++++|.++|+|+|+. .|+.... .. .....+. T Consensus 188 i~~~~~k~~~~~~is~ell~d~~-~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~-----~~---~~~~~~~ 258 (395) T protein:vir:43 188 ENAPVRTIAHLFKASRQILDDAS-ALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQA-----QA---YAPPSGV 258 (395) T ss_pred EEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccc-----cc---ccccccc Confidence 99999999999999999999864 79999999999999999999999998853 3333221 11 1111222 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec------ccccCcceEeecccccCCCcce Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD------ESFNGFGTYFNANGAWPVGVAE 230 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~------~~~~g~p~~~~~~~~~~~~~~~ 230 (302) ....+..++.+.+++..+...+..++.|+||++++..|++++|++|||||.+ +.+.|+|+.+... ..++. T Consensus 259 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~l~G~pVv~~~~----~~~~~ 334 (395) T protein:vir:43 259 VVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIELNKDAENRYIIGSPQNGTTPTLWRLPVVETQA----ITQDE 334 (395) T ss_pred ccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhccCCceeccccccCCCceecceeeEEcCC----CCCCc Confidence 2334455677788888888888889999999999999999999999999964 3567888766654 34566 Q ss_pred EEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 231 ALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 231 ~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) +++|||++ ++++++++++++++++.. .+|++|++.||+++|+||++.+|+||++++.+++ T Consensus 335 ~~~gd~~~~~~~~~~~~~~i~~~~~~~----~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 335 FLTGAFSLGAQIFDRMDIEVLVSTEND----KDFENNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred EEEEeccceEEEEEecceEEEEecccc----chhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 89999987 668889999999887643 4699999999999999999999999999998877 No 65 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=7e-52 Score=301.04 Aligned_cols=280 Identities=13% Similarity=0.081 Sum_probs=222.3 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeC-Ccceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLAT-LPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~-~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) |...++++||++||+++.++|++.+++.++|++++++++++++. ..++...+ ...+.|++|+++.++ ++++|. T Consensus 117 ~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~-----~~~~f~ 191 (409) T protein:vir:45 117 QGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEVGVLLGENEEAGE-----EDTDFG 191 (409) T ss_pred ccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCccccccccccccccc-----cccccc Confidence 77778888999999999999999999999999999999997764 44555544 356789999987654 678999 Q ss_pred eEEeeeeeEE-EeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 79 DRTLVAEEVA-VIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 79 ~i~l~~~ki~-~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) ++++.++|++ ++++||+|+++|+.+++++||.++|++++++++|++||+|+|++....+.++...... .......+. T Consensus 192 ~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~--~~~~~~~~~ 269 (409) T protein:vir:45 192 MGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTG--TTQTAAANA 269 (409) T ss_pred eeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeecccc--ccccccccc Confidence 9999999986 5789999999999999999999999999999999999999997544333333332221 122223334 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccE--EEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeeccccc-CCC Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDT--LLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAW-PVG 227 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~--~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~-~~~ 227 (302) .+++.+ .+++..+...+..++. |+||+.++..|++|||++|||||+++ ++.|+|+.+.+.... ..+ T Consensus 270 ~~~d~i----~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~ 345 (409) T protein:vir:45 270 VKWQEI----LALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAG 345 (409) T ss_pred cchHHH----HHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecCcCCccCC Confidence 445444 4454555555555554 57899999999999999999999764 588899988876543 345 Q ss_pred cceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccccc Q lcl|NC_011054. 228 VAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAV 297 (302) Q Consensus 228 ~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~ 297 (302) +..+++|||++|++++++++.++++++. ++++|++.||++.|+|+++.+|+||+.++.++++.- T Consensus 346 ~~~i~~Gd~~~~~i~~~~~~~~~~~~d~------~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 346 KKFMFCGDFDRFIIRRVRYMILKRLVER------YAEYDQTGFLAFHRFDCILEDTSAIKALVGKGSVGG 409 (409) T ss_pred ccEEEEeehhhhheeeccceEEEEeecc------cccCCcEEEEEEEEeccEeechhheEEEEeccCCCC Confidence 5678899999999999999999988763 578999999999999999999999999997654432 No 66 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=1.2e-51 Score=299.80 Aligned_cols=267 Identities=15% Similarity=0.107 Sum_probs=221.0 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCC-cceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATL-PGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) +...+++.+|.++|+++.++||+.+++.++|+++++++++.++.+++|+.++. +.+.|++|+++.++ ++++|++ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-----~~~~~~~ 187 (390) T protein:vir:10 113 ASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPE-----SSLKFAK 187 (390) T ss_pred hhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCCcceeeecCCccccc-----cccceeE Confidence 44455566777889999999999999999999999999999999999998865 68999999987654 6789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC---Ccccccccccccccccccceeeccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---PSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) ++++++|++++++||+|+++|+. ++++||.++|++++++++|+++|+|+|+ |.|++...... . ... T Consensus 188 i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~-----~-----~~~ 256 (390) T protein:vir:10 188 KTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTY-----A-----APT 256 (390) T ss_pred EEEeeEEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccccccccccc-----c-----ccc Confidence 99999999999999999999975 8999999999999999999999999885 44443322111 1 111 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc------cccCcceEeecccccCCCcce Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE------SFNGFGTYFNANGAWPVGVAE 230 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~------~~~g~p~~~~~~~~~~~~~~~ 230 (302) .......++.+.+++..+...+..++.|+|||++|..|++|+|++|||||+++ .+.|+|+.+.... ..+. T Consensus 257 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~~~~~~~l~G~pv~~~~~~----p~~~ 332 (390) T protein:vir:10 257 TIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAM----APGE 332 (390) T ss_pred cccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCcCcCCceecceeeEEcCCC----CCCc Confidence 11223334566777777788888999999999999999999999999999764 4678888776543 3566 Q ss_pred EEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeee Q lcl|NC_011054. 231 ALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKT 292 (302) Q Consensus 231 ~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~ 292 (302) +++|||++ |.++++++++++++++. .+|++|++.||++.|+|+++.+|+||++++.. T Consensus 333 ~~~gdf~~~~~~~~~~~~~i~~~~~~-----~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 333 FLVGAFDLAAQIFDQWDARVEIGYVN-----DDFQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EEEEeccceEEEEEecceEEEEeecc-----cccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 89999986 56789999999988764 35899999999999999999999999999977 No 67 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=1e-51 Score=300.18 Aligned_cols=270 Identities=16% Similarity=0.138 Sum_probs=223.7 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-Ccceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLAT-LPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) |. .+++.+|.++|++++.+||+.+++.++|+++++++++.++.+++|+.+. .+.+.|++|+++.++ ++++|++ T Consensus 105 ~~-~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-----~~~~~~~ 178 (385) T protein:vir:19 105 LG-SDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPE-----SDITFSK 178 (385) T ss_pred hc-cccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCccccc-----cccceeE Confidence 44 3445567788999999999999999999999999999988899999875 578999999987654 6789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCc---ccccccccccccccccceeeccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPS---SWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~---g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) +++++||++++++||+|+++|+ .+++++|.++|++++++++|.++|+|+|++. |+.... ... .... T Consensus 179 ~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~-----~~~-----~~~~ 247 (385) T protein:vir:19 179 QTANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVA-----TAY-----DTSL 247 (385) T ss_pred EEEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccc-----ccc-----cccc Confidence 9999999999999999999987 5799999999999999999999999988642 333221 111 1111 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec------ccccCcceEeecccccCCCcce Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD------ESFNGFGTYFNANGAWPVGVAE 230 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~------~~~~g~p~~~~~~~~~~~~~~~ 230 (302) ..+.+..++.+.+++..+...+..++.|+||++++..|+++||++|||||++ ..+.|+|+.+.... .++. T Consensus 248 ~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~----p~~~ 323 (385) T protein:vir:19 248 NATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQ----AAGT 323 (385) T ss_pred cccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceecceeeEEcCcC----CCCc Confidence 1223334566777777888888889999999999999999999999999964 35778888776553 4567 Q ss_pred EEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccc Q lcl|NC_011054. 231 ALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVG 295 (302) Q Consensus 231 ~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~ 295 (302) +++|||+. |+++++++++++++++.. ++|++|++.||+++|+|+++.+|+||++++.+.++ T Consensus 324 ~~~gd~~~~~~~~~~~~~~v~~~~~~~----~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 324 FTVGGFDMASQVWDRMDATVEVSREDR----DNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred EEEeecccEEEEEEecceEEEEecccc----chhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 99999986 778999999999887653 46999999999999999999999999999998877 No 68 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=1e-51 Score=300.18 Aligned_cols=270 Identities=16% Similarity=0.138 Sum_probs=223.7 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-Ccceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLAT-LPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) |. .+++.+|.++|++++.+||+.+++.++|+++++++++.++.+++|+.+. .+.+.|++|+++.++ ++++|++ T Consensus 105 ~~-~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-----~~~~~~~ 178 (385) T protein:vir:18 105 LG-SDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPE-----SDITFSK 178 (385) T ss_pred hc-cccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCccccc-----cccceeE Confidence 44 3445567788999999999999999999999999999988899999875 578999999987654 6789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCc---ccccccccccccccccceeeccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPS---SWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~---g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) +++++||++++++||+|+++|+ .+++++|.++|++++++++|.++|+|+|++. |+.... ... .... T Consensus 179 ~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~-----~~~-----~~~~ 247 (385) T protein:vir:18 179 QTANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVA-----TAY-----DTSL 247 (385) T ss_pred EEEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccc-----ccc-----cccc Confidence 9999999999999999999987 5799999999999999999999999988642 333221 111 1111 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec------ccccCcceEeecccccCCCcce Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD------ESFNGFGTYFNANGAWPVGVAE 230 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~------~~~~g~p~~~~~~~~~~~~~~~ 230 (302) ..+.+..++.+.+++..+...+..++.|+||++++..|+++||++|||||++ ..+.|+|+.+.... .++. T Consensus 248 ~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~----p~~~ 323 (385) T protein:vir:18 248 NATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQ----AAGT 323 (385) T ss_pred cccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceecceeeEEcCcC----CCCc Confidence 1223334566777777888888889999999999999999999999999964 35778888776553 4567 Q ss_pred EEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccc Q lcl|NC_011054. 231 ALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVG 295 (302) Q Consensus 231 ~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~ 295 (302) +++|||+. |+++++++++++++++.. ++|++|++.||+++|+|+++.+|+||++++.+.++ T Consensus 324 ~~~gd~~~~~~~~~~~~~~v~~~~~~~----~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 324 FTVGGFDMASQVWDRMDATVEVSREDR----DNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred EEEeecccEEEEEEecceEEEEecccc----chhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 99999986 778999999999887653 46999999999999999999999999999998877 No 69 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=1.9e-51 Score=298.65 Aligned_cols=267 Identities=16% Similarity=0.114 Sum_probs=222.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCC-cceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATL-PGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) +...+++++|.++|+++..+||+.+++.++|+++++++++.++.+++|+.++. +.+.|++|+++.++ ++++|++ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-----~~~~~~~ 187 (390) T protein:vir:81 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPE-----SSLKFAK 187 (390) T ss_pred hccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEEEEEecCCcceeeecCCccccc-----ccceeeE Confidence 55667788889999999999999999999999999999999999999998765 68999999987654 6789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC---Ccccccccccccccccccceeeccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---PSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) +++++||+++.++||+|+++|+ .+++++|.++|++++++++|.+||+|+|+ |.|+....... .... T Consensus 188 i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~------~~~~---- 256 (390) T protein:vir:81 188 KTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTY------AAPT---- 256 (390) T ss_pred EEEeeeEEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeeccccc------cccc---- Confidence 9999999999999999999997 57999999999999999999999999886 33443322111 1111 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc------cccCcceEeecccccCCCcce Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE------SFNGFGTYFNANGAWPVGVAE 230 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~------~~~g~p~~~~~~~~~~~~~~~ 230 (302) .......++.+.+++..+...+..++.|+|||++|..|++|||++|||||++. .+.|+|+.+... ..++. T Consensus 257 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pv~~~~~----~p~~~ 332 (390) T protein:vir:81 257 TIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQA----MAPGE 332 (390) T ss_pred ccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcccccCceecceeeEEcCC----CCCCc Confidence 11223334566677777788888899999999999999999999999999754 567888776654 34567 Q ss_pred EEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeee Q lcl|NC_011054. 231 ALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKT 292 (302) Q Consensus 231 ~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~ 292 (302) +++|||++ |++.++++++++++++. .+|++|++.||+++|+|+++.+|+||++++.. T Consensus 333 ~~~gd~~~~~~~~~~~~~~v~~~~~~-----~~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 333 FLVGAFDLAAQIFDQWDARVEIGYVG-----EDFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EEEEehhceEEEEEecceEEEEeccc-----chhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 89999987 67788999999988654 26899999999999999999999999999977 No 70 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=1.9e-51 Score=298.61 Aligned_cols=273 Identities=18% Similarity=0.134 Sum_probs=216.7 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) ++..+++++|++||+++.++|++.+++.++|+++++++++++ ..++|+..+.+.+.|++|+++.++. ..++|++| T Consensus 138 ~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~g-~~~ip~~~~~~~a~~v~E~~~~~~~----~~~~f~~i 212 (425) T protein:vir:95 138 RNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVKG-TTRILVDTDTSPATWIEQSGALPTG----DVGTIASI 212 (425) T ss_pred HhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecCc-eeEEEEecCCccccccccccccccc----ccccccee Confidence 555667789999999999999999999999999999999865 5789999999999999999987653 23689999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCC----cccccccccccccccccceeeccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKP----SSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~----~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) ++++||++++++||+|+++|+.+++++||.++|++++++++|+++|+|+|++ .|++.. ... ......... T Consensus 213 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~-----~~~-~~~~~~~~~ 286 (425) T protein:vir:95 213 DFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPS-----LPP-ENQVTVEAD 286 (425) T ss_pred eeeheeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecc-----ccc-ccccccccc Confidence 9999999999999999999999999999999999999999999999999853 454432 111 111222233 Q ss_pred cchHHHHHHHhhhhhhhhhhc--ccCccEEEecHHHH----HHHHhhhcCCCceeeec-----ccccCcceEeecccccC Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAA--GYMPDTLLASLGFR----FDVANLRDANGNPIFRD-----ESFNGFGTYFNANGAWP 225 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~--~~~~~~~v~~~~~~----~~l~~l~d~~g~~i~~~-----~~~~g~p~~~~~~~~~~ 225 (302) ..+++.+.+ +...+... ....+.|+||+.++ ..|+.++|++|||||+. +.+.|+|+.+.+.. T Consensus 287 ~~~~~~~~~----~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~~~~~~~l~G~pvv~~~~~--- 359 (425) T protein:vir:95 287 NNLLKNLVK----QIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLPNLRTPDLLGLRVVFNNFL--- 359 (425) T ss_pred cchHHHHHH----HHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeeccCCCCCccccceeeEEcCcC--- Confidence 334444444 43333322 23456789998874 34677889999999974 35678888766654 Q ss_pred CCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCC Q lcl|NC_011054. 226 VGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPD 300 (302) Q Consensus 226 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~ 300 (302) .+..++||||++|++++++++++.++++. +|.+|++.||++.|+|+++.+|+||+.++.++ ++.++ T Consensus 360 -~~~~i~~Gd~~~~~~~~~~~~~i~~~~~~------~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~--~~~g~ 425 (425) T protein:vir:95 360 -DDDTVLFGEFEQYTLVERENITIDSSTHV------KFTEDQTAFRGKGRFDGKPVKPEAFVLVTITD--PVQGA 425 (425) T ss_pred -CCccEEEEecccEEEEeecceEEEeeccc------ccccCceEEEEEEeeCcEeecccceEEEEecC--cCCCC Confidence 35568999999999999999999998763 58999999999999999999999999999663 22233 No 71 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=5.4e-51 Score=296.15 Aligned_cols=279 Identities=12% Similarity=0.068 Sum_probs=223.0 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceE--EEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTH--LPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~--~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) -...++++||.+||+++.++|++.+++.++|++++++++++++..+ +++..+.+.+.|++|+++.++. +.++|+ T Consensus 121 ~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~----~~~~~~ 196 (415) T protein:vir:94 121 GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPEL----AVKPFF 196 (415) T ss_pred hhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceecccccccccc----ccccce Confidence 3334566789999999999999999999999999999999876554 4556678899999999887653 457999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +|++++||++++++||+|+++|+.+++++||.++|++++++++|++|++|+|++.+...... ... .......++.. T Consensus 197 ~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~--~~~--~~~~~~~~~~~ 272 (415) T protein:vir:94 197 QLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSG--FEK--EGKKLEVKKAK 272 (415) T ss_pred eeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccc--ccc--ccccccccccc Confidence 99999999999999999999999999999999999999999999999999886443221111 111 11122222333 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccccC-CCcce Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAWP-VGVAE 230 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~~-~~~~~ 230 (302) ++ +++.+++..+...++.++.|+||+++|.+|+++||++|||||.++ .+.|+|+.++.....+ .++.. T Consensus 273 ~~----~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:94 273 SL----DDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred ch----HHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCceecceeeEEecccccCCCCccE Confidence 34 445555556666677789999999999999999999999999754 5778898877664433 45667 Q ss_pred EEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 231 ALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 231 ~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) +++|||+. |++++++++++++++ |.++++.+|+++|+|+++.+|+||++++.++++. |.|| T Consensus 349 i~~gd~~~~~~~~~~~~~~v~~~~---------~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~--~~~~ 410 (415) T protein:vir:94 349 LIIGNLKDAIVLFDRSQYQASWTD---------YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSER--GEGD 410 (415) T ss_pred EEEEehhccEEEEeecceEEEEec---------cccCceEEEEEEEeccEEeccccEEEEEEeccCC--CCCc Confidence 89999997 667889999998775 4567788999999999999999999999886665 9999 No 72 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=5.6e-51 Score=296.06 Aligned_cols=273 Identities=14% Similarity=0.138 Sum_probs=218.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCC----cceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATL----PGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~----~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ++..++++++.++|++++++||+.+++.++|++++++++++++..++|+.... ..+.|++|++..++. ..++ T Consensus 118 ~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~----~~~~ 193 (413) T protein:vir:81 118 STATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYM----RFAD 193 (413) T ss_pred hhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCCceeEEEeccccccccccceecCccccccc----Cccc Confidence 56667788999999999999999999999999999999999998999987643 468999999886653 2368 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCC---cccccccccccccccccceee Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKP---SSWVSPALLPAAVAANQDYTI 153 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~---~g~~~~~~~~~~~~~~~~~~~ 153 (302) |++|++.+||++++++||+|+++|+. .+++||.++|++++++++|++||+|+|+. .|+....... .... T Consensus 194 f~~i~~~~~k~~~~~~iS~ell~ds~-~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~-------~~~~ 265 (413) T protein:vir:81 194 FDIVTESLSKIAGLTKITDEMIEDYD-FLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQ-------TLAV 265 (413) T ss_pred ceeeEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccccc-------cccc Confidence 99999999999999999999999985 59999999999999999999999998853 3333221111 1111 Q ss_pred ccccchHHHHHHHhhhhhhhhhh-cccCccEEEecHHHHHHHHhhhcCCCceeeecc--------------cccCcceEe Q lcl|NC_011054. 154 VPGDANEDDLIGCINRASKAVAA-AGYMPDTLLASLGFRFDVANLRDANGNPIFRDE--------------SFNGFGTYF 218 (302) Q Consensus 154 ~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~--------------~~~g~p~~~ 218 (302) . +.+..++.+.+++..+.. ..+.+..|+||+++|..|++|||++|||||.++ ++.|+|+.+ T Consensus 266 ~----~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~ 341 (413) T protein:vir:81 266 S----NKDELADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQ 341 (413) T ss_pred c----ccchhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeEE Confidence 1 122334445455444432 345566799999999999999999999999643 366888866 Q ss_pred ecccccCCCcceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccccc Q lcl|NC_011054. 219 NANGAWPVGVAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAV 297 (302) Q Consensus 219 ~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~ 297 (302) .... ..+.+++|||++ |+++++++++++++++.. ++|++|++.||+++|+|+++.+|+||++++.+ .++ T Consensus 342 s~~~----~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~----~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~--~~~ 411 (413) T protein:vir:81 342 SQVV----PVGKPVVGAFRSAASVLRKGGVRIDSTNTNV----DDFENNLITVRAEERVGLMVTFPEAIVQLDVA--EVV 411 (413) T ss_pred cCCC----CcccEEEEecccEEEEEEecceEEEEecccc----chhhcCcEEEEEEEeeccEEecccceEEEEec--CCC Confidence 6543 356789999986 678889999999988754 46999999999999999999999999999876 455 Q ss_pred CC Q lcl|NC_011054. 298 VP 299 (302) Q Consensus 298 ~p 299 (302) +| T Consensus 412 ~p 413 (413) T protein:vir:81 412 TP 413 (413) T ss_pred CC Confidence 57 No 73 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=5.2e-51 Score=296.26 Aligned_cols=269 Identities=15% Similarity=0.157 Sum_probs=223.3 Q ss_pred CCCccCCCcceecchHH-HHHHHHHHHhhhhhhhh-cceeecCCCceEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAY-ANDLLASAKKGSTVLQA-FPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~-~~~ii~~~~~~s~l~~~-~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) |...++++||++||+++ .++||+.+++.++++++ ++.+|+..+.+++|+.++++.++|++|+++.++ ++++|+ T Consensus 357 ~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g~~~ip~~~~~~~a~wv~E~~~~~~-----s~~~f~ 431 (632) T protein:vir:96 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQD-----SDFDFT 431 (632) T ss_pred hhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCcceEEEEEeCCceeEeecCCccccc-----ccccee Confidence 67778888999999887 68999999999999998 688899888999999999999999999988654 678999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC---Ccccccccccccccccccceeecc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---PSSWVSPALLPAAVAANQDYTIVP 155 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---~~g~~~~~~~~~~~~~~~~~~~~~ 155 (302) ++++++||++++++||+|+++|+.++++++|.++|++++++++|.++|+|+|. |.|+++..... ...... T Consensus 432 ~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~-------~~~~~~ 504 (632) T protein:vir:96 432 TLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVP-------ALTYPA 504 (632) T ss_pred eEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeeccccc-------ceeccc Confidence 99999999999999999999999999999999999999999999999999884 55554433221 122223 Q ss_pred ccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHh--hhcCCCceeeecccccCcceEeecccccCCCcceEEE Q lcl|NC_011054. 156 GDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVAN--LRDANGNPIFRDESFNGFGTYFNANGAWPVGVAEALV 233 (302) Q Consensus 156 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~--l~d~~g~~i~~~~~~~g~p~~~~~~~~~~~~~~~~~~ 233 (302) +..+++.+.++..++.... ....++.|+||+.++..|++ ++|++|+|||+++.+.|+|+.+.... ..+.+++ T Consensus 505 ~~~~~~~i~~~~~~i~~~~--~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~~~l~G~pv~~s~~i----p~~~~~~ 578 (632) T protein:vir:96 505 GGVDWASVVDMETKISTFN--ADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNNEVNGYRAEASNQI----PADTWIF 578 (632) T ss_pred ccCCHHHHHHHHHHHhhcc--cccCccEEEEchhHHHHHHHHhccCCCCceeecCCeecccceEecccc----ccCcEEE Confidence 3445555555443333211 22456789999998888865 77999999999999999999887654 3455899 Q ss_pred EecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeec Q lcl|NC_011054. 234 VDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTP 293 (302) Q Consensus 234 gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~ 293 (302) |||+.++++++++++|.++++. ++.+|++.||+++|+|+++.+|++|+.+++.. T Consensus 579 gd~s~~~i~~~~~~~i~~~~~~------~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 579 GDWSQIVIAMWGVLDLKVDPYT------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred eecceEEEEEecceEEEEcccc------ccccCceEEEEEeecCceeechhhhhheeecC Confidence 9999999999999999988764 57899999999999999999999999888765 No 74 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=1.7e-50 Score=293.40 Aligned_cols=275 Identities=12% Similarity=0.090 Sum_probs=214.4 Q ss_pred CC-CccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MA-DISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma-~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) ++ ..++++||++||+++.++|++.+++.++|+++++++++.+ ..++|+....+.+.|+.+.++.. .++.++++|++ T Consensus 141 ~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~-~~~~p~~~~~~~a~~~~~~~e~~--~~~~~~~~f~~ 217 (434) T protein:vir:62 141 RALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTKE-NIKYPVLVKKAEAQGHKNERTNN--EMPETDIEFDE 217 (434) T ss_pred hhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceeccCC-ceEEEEEecCCcccceecccccc--cccccccceee Confidence 22 2345678999999999999999999999999999988765 58899998888899986654332 24567899999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCc---ccccccccccccccccceeeccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPS---SWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~---g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) |++++||++++++||+|+++|+.+++++||.++|++++++++|.+||+|+|+.. |++.... .....+ T Consensus 218 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~---------~~~~~~- 287 (434) T protein:vir:62 218 IELSPTEFDALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKA---------VEFKTD- 287 (434) T ss_pred EEeeheeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeeccc---------cccccc- Confidence 999999999999999999999999999999999999999999999999998632 2211110 111111 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc---------cccCcceEeecccccC-- Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE---------SFNGFGTYFNANGAWP-- 225 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~---------~~~g~p~~~~~~~~~~-- 225 (302) .....+++.++...+...+..++.|+||+.++..|++|||++|||||++. ++.|+|+.+......+ T Consensus 288 ---~~~~~d~l~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~ 364 (434) T protein:vir:62 288 ---EKNLYDALVKMKNTPVKEVRKKARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDS 364 (434) T ss_pred ---ccchhhHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCccC Confidence 22334566677777777788888999999999999999999999999752 4789999888665432 Q ss_pred CCcceEEEEecceEEEEeec-CcEEEEeecccccchhhhcCCcEEEEEEEEeccEEec-cccEEEEe--eecccccCCCC Q lcl|NC_011054. 226 VGVAEALVVDSSRVRIGVRQ-DITVKFLDQATVGSINLAERDMIALRLKARFAYVLGN-GATAVGDN--KTPVGAVVPDG 301 (302) Q Consensus 226 ~~~~~~~~gd~~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~-~~a~~~lt--~~~a~~~~p~~ 301 (302) .+...++||||++|+++++. .++++++.+ .+|.+|++.||++.|+|+++.+ |.++..++ .+++.. T Consensus 365 ~~~~~i~~Gdfs~~~i~~~~g~~~i~~~~~------~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~----- 433 (434) T protein:vir:62 365 PDTPVFYFGDFSKFYIQDVIGSLEVQKLVE------LFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTG----- 433 (434) T ss_pred CCceEEEEeeccceEEEEeeceeEEEeehh------hhcccCceEEEEEeeecceeecCcccceEEEEEeccCCC----- Confidence 23355789999999999875 477777754 3678999999999999999775 87765554 332222 Q ss_pred C Q lcl|NC_011054. 302 S 302 (302) Q Consensus 302 ~ 302 (302) + T Consensus 434 ~ 434 (434) T protein:vir:62 434 A 434 (434) T ss_pred C Confidence 2 No 75 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=1.2e-50 Score=294.27 Aligned_cols=285 Identities=14% Similarity=0.064 Sum_probs=218.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |...++++||++||+++.++|++.+++.||++++|++.++++. .++|+.++.+.|.|++|+++.++ +++++|+++ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~i~~~~~~~~a~w~~e~~~~~~----~~~~~f~~i 150 (381) T protein:vir:10 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKFLKSETSGVAVWGKIYGEIKG----QLDAAFSEE 150 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcc-eEEEEecCCcceeeecccccccc----cccccceee Confidence 7778888999999999999999999999999999999998764 78999999999999999877653 357899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccccccccccccccce-eeccc- Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAANQDY-TIVPG- 156 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~~~~-~~~~~- 156 (302) ++.+||++++++||+||++|+.+++++||+++|++++++++|.+|++|+|+ |.|++...........+... ....+ T Consensus 151 ~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t 230 (381) T protein:vir:10 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) T ss_pred eecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999875 66765432211111111000 00111 Q ss_pred --cchHHHHHHHhhhhhhhhh-------hcccCccEEEecHHHHHHHHhhh---cCCCceeeecccccCcceEeeccccc Q lcl|NC_011054. 157 --DANEDDLIGCINRASKAVA-------AAGYMPDTLLASLGFRFDVANLR---DANGNPIFRDESFNGFGTYFNANGAW 224 (302) Q Consensus 157 --~~~~~~~~~~i~~~~~~~~-------~~~~~~~~~v~~~~~~~~l~~l~---d~~g~~i~~~~~~~g~p~~~~~~~~~ 224 (302) ..+.....+.+.+++..+. ..+..+..|+||+.++..|+.++ +.+|+|+|. .++|..++.+.. T Consensus 231 ~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~----l~~g~~vv~s~~- 305 (381) T protein:vir:10 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA----LPFNLNVIESTV- 305 (381) T ss_pred cccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeec----CCCCceEEecCC- Confidence 1122222333444433332 13455678999999999998765 678999875 345554544332 Q ss_pred CCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeec--ccccCCCCC Q lcl|NC_011054. 225 PVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTP--VGAVVPDGS 302 (302) Q Consensus 225 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~--a~~~~p~~~ 302 (302) ..++.++||||++|++++|++++++++++. +|.+|++.||+..|+|+++.+++||+.++-+. +..++|..+ T Consensus 306 -~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~------~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~ 378 (381) T protein:vir:10 306 -QEAGKVLTYVKGLYDGYLAGGINVQKFKET------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) T ss_pred -CCcCcEEEEecccEEEEEecccEEEeechh------HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCccccc Confidence 345669999999999999999999999874 69999999999999999999999999877443 444444444 No 76 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=1.2e-50 Score=294.27 Aligned_cols=285 Identities=14% Similarity=0.064 Sum_probs=218.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |...++++||++||+++.++|++.+++.||++++|++.++++. .++|+.++.+.|.|++|+++.++ +++++|+++ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~i~~~~~~~~a~w~~e~~~~~~----~~~~~f~~i 150 (381) T protein:vir:95 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKFLKSETSGVAVWGKIYGEIKG----QLDAAFSEE 150 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcc-eEEEEecCCcceeeecccccccc----cccccceee Confidence 7778888999999999999999999999999999999998764 78999999999999999877653 357899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccccccccccccccce-eeccc- Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAANQDY-TIVPG- 156 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~~~~-~~~~~- 156 (302) ++.+||++++++||+||++|+.+++++||+++|++++++++|.+|++|+|+ |.|++...........+... ....+ T Consensus 151 ~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t 230 (381) T protein:vir:95 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) T ss_pred eecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999875 66765432211111111000 00111 Q ss_pred --cchHHHHHHHhhhhhhhhh-------hcccCccEEEecHHHHHHHHhhh---cCCCceeeecccccCcceEeeccccc Q lcl|NC_011054. 157 --DANEDDLIGCINRASKAVA-------AAGYMPDTLLASLGFRFDVANLR---DANGNPIFRDESFNGFGTYFNANGAW 224 (302) Q Consensus 157 --~~~~~~~~~~i~~~~~~~~-------~~~~~~~~~v~~~~~~~~l~~l~---d~~g~~i~~~~~~~g~p~~~~~~~~~ 224 (302) ..+.....+.+.+++..+. ..+..+..|+||+.++..|+.++ +.+|+|+|. .++|..++.+.. T Consensus 231 ~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~----l~~g~~vv~s~~- 305 (381) T protein:vir:95 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA----LPFNLNVIESTV- 305 (381) T ss_pred cccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeec----CCCCceEEecCC- Confidence 1122222333444433332 13455678999999999998765 678999875 345554544332 Q ss_pred CCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeec--ccccCCCCC Q lcl|NC_011054. 225 PVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTP--VGAVVPDGS 302 (302) Q Consensus 225 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~--a~~~~p~~~ 302 (302) ..++.++||||++|++++|++++++++++. +|.+|++.||+..|+|+++.+++||+.++-+. +..++|..+ T Consensus 306 -~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~------~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~ 378 (381) T protein:vir:95 306 -QEAGKVLTYVKGLYDGYLAGGINVQKFKET------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) T ss_pred -CCcCcEEEEecccEEEEEecccEEEeechh------HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCccccc Confidence 345669999999999999999999999874 69999999999999999999999999877443 444444444 No 77 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=2.8e-51 Score=297.69 Aligned_cols=276 Identities=12% Similarity=0.020 Sum_probs=222.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) +...+.++||++||+++.++|++.+.+.+|++++|++.++++. .++|+.++.+.+.|++|+++.++ +++++|+++ T Consensus 79 ~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~~~~~~~~~~a~w~~e~~~~~~----~~~~~f~~i 153 (377) T protein:vir:98 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLR-LKALTAETSGTAVWGDIFGEIKG----QLKQAFKEQ 153 (377) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCcc-eEEEEecCCcceeEeecccccCc----ccCccceeE Confidence 7888889999999999999999999999999999999998654 78999999999999999877543 367899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccccccccccccccceeeccccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) ++.+||++++++||+|||+||.+++++||++++++++++++|.+|++|+|+ |.|++......... .... .... T Consensus 154 ~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~--~~~~---~~~~ 228 (377) T protein:vir:98 154 DFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVD--QSTG---RDIT 228 (377) T ss_pred eecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccc--cccc---cccc Confidence 999999999999999999999999999999999999999999999999885 66665432111110 1111 1111 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc---------------------cccCcceE Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE---------------------SFNGFGTY 217 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~---------------------~~~g~p~~ 217 (302) +.....+.+.++...+...+...+.|+||+.++..++++||.+|+|+|..+ ++.|+|+. T Consensus 229 ~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~ 308 (377) T protein:vir:98 229 TYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGIT 308 (377) T ss_pred cccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCccccccCCCce Confidence 111122445556666666677778899999999999999999999999311 23444554 Q ss_pred eecccccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 218 FNANGAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 218 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) ++.+. ...++.++||||++|+++++++++|+++++. +|.+|++.||+..|+|+++.+++||++++-+-= T Consensus 309 vv~s~--~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~------~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 309 ILESL--AVETGKAIAFVANRYDAFMATASTIEEYDQT------FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEecC--CCCcccEEEEEecceeEEeecceEEEeechh------hhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 44432 2445678999999999999999999998864 589999999999999999999999998886533 No 78 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=1.4e-49 Score=288.43 Aligned_cols=267 Identities=18% Similarity=0.199 Sum_probs=218.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-Ccceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLAT-LPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) +...++++||++||+++.++|++.+++.++|+++++++|+++++.++|+... ...+.|++|+++.++ +++++|++ T Consensus 111 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~----~~~~~~~~ 186 (394) T protein:vir:10 111 AGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPA----LAEPEFEQ 186 (394) T ss_pred hcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEecCCCccccccccccccc----ccccccee Confidence 7778889999999999999999999999999999999999998899998764 577899999988664 25689999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccch Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDAN 159 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (302) |++++||++++++||+|+++||.+++++||.++|++++++++|.++++|+|++... ...+..+ T Consensus 187 v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~-----------------~~~~~~~ 249 (394) T protein:vir:10 187 VDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFTAK-----------------ATTTDTL 249 (394) T ss_pred EEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-----------------ccccccc Confidence 99999999999999999999999999999999999999999999999988753210 1112223 Q ss_pred HHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-----------cccCcceEeeccc--ccCC Q lcl|NC_011054. 160 EDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-----------SFNGFGTYFNANG--AWPV 226 (302) Q Consensus 160 ~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-----------~~~g~p~~~~~~~--~~~~ 226 (302) .+.+.+.+.. .+... + .+.|+||+++|..|++|+|++|||||+++ ++.|+|+.++.+. .... T Consensus 250 ~d~l~~~~~~---~~~~~-~-~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~ 324 (394) T protein:vir:10 250 VDSLKHILNV---DLDPA-Y-SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAA 324 (394) T ss_pred HHHHHHHHHh---hhhhh-c-cCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCcccccccceeEEecccccCCCC Confidence 3444333322 22222 2 46899999999999999999999999753 4788899887653 3334 Q ss_pred CcceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 227 GVAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 227 ~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) ++..+++|||++ |++++++++++.++++.. |. ..+|+++|+|+++.+|++|+.++.+++++-+++|- T Consensus 325 ~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~------~~---~~~~~~~r~d~~~~~~~ai~~~~~~~~~~~~~~~~ 392 (394) T protein:vir:10 325 GDQKAFVGDLKRGVLFADRQQVTLAWEDSKI------YG---RYLGAAFRFGVKQADSNAGYFVTNTDAASGSTSGT 392 (394) T ss_pred CceEEEEeeccccEEEEeecceEEEEecccc------cc---eeEEEEEEeccEEeccccEEEEEeecccCCCCCCC Confidence 566799999987 677889999999887543 22 35899999999999999999999998888777777 No 79 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=7.3e-50 Score=289.97 Aligned_cols=262 Identities=12% Similarity=0.047 Sum_probs=215.5 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC--Ccceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLAT--LPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~--~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) -+.+++++++.++|+++..+|++.++..++|+++++++++.++.+.||+.++ .+.+.|++|+++.++ ++++|+ T Consensus 107 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~-----~~~~f~ 181 (379) T protein:vir:10 107 GDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRENGAGEGAIGAQVEGATKGQ-----KDYDIS 181 (379) T ss_pred cccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEeecCCCcccccccCCccccc-----ccccee Confidence 2224455566789999999999999999999999999999999999999874 456788999987554 678999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +|++++||++++++||+|+++|+. ++++||.++|++++++++|.+++.|+|.... .. ....++.. T Consensus 182 ~i~~~~~k~~~~~~iS~ell~D~~-~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~-~~-------------~~~~~~~~ 246 (379) T protein:vir:10 182 MIDVNTDFIAGFTRYSKKMANNLP-FLTSFIPNALRRDYAKAENAAFNAVLAANAT-AS-------------TEIITNKN 246 (379) T ss_pred eeEeeeeeEEeeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHhcccccccc-cc-------------cccccCcc Confidence 999999999999999999999975 6999999999999999999999988774210 00 00011111 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc---------cccCcceEeecccccCCCcc Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE---------SFNGFGTYFNANGAWPVGVA 229 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~---------~~~g~p~~~~~~~~~~~~~~ 229 (302) . .+.+.+++..+...++.++.|+|||.+|..|++|||++|+|+|+++ .+.|+|+.+.... +.+ T Consensus 247 ~----~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~----~ag 318 (379) T protein:vir:10 247 K----VEMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVLRINGIPLFRATWL----AAN 318 (379) T ss_pred c----HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCcceecceeeEecCCC----CCC Confidence 2 2445666666677788889999999999999999999999999754 4667777665443 456 Q ss_pred eEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 230 EALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 230 ~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) .+++|||+++++.+++++.++++++.. ++|++|++.||+++|+|++|.||+||++++.+.+ T Consensus 319 ~~~~gdf~~~~~~~~~~~~i~~~~~~~----~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 319 KYYVGDWTRVTKVTTEGLSLEFSEVEG----TNFVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred ceEEeecccEEEEEEeceEEEEeeccc----ccccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 689999999999999999999887643 4699999999999999999999999999999877 No 80 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=7.4e-50 Score=289.94 Aligned_cols=268 Identities=10% Similarity=0.120 Sum_probs=219.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcc--eeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPG--ASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~--a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) .+..++++||++||+++..+|++.+++.++|+++++++++.++..++|+...... +.|++|+.+.+ .++++|+ T Consensus 114 ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~-----~s~~~f~ 188 (421) T protein:vir:13 114 RDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVRAGASVDKLANLAKDTELV-----KAMLKTQ 188 (421) T ss_pred hhccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEeecCCccceeecccccccc-----cccccee Confidence 4567778899999999999999999999999999999999999899998776654 56688887654 4689999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +|+++++|++++++||+|+++|+.+++++||.++|++++++++|.++++ .++|++.. ++.. T Consensus 189 ~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~---~~~g~~~~----------------~~~~ 249 (421) T protein:vir:13 189 PMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVK---QAKAVLAE----------------ETIN 249 (421) T ss_pred EEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhh---hhhhcccc----------------cccc Confidence 9999999999999999999999999999999999999999999998874 34443211 1112 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec------ccccCcceEeecccccC-CCcceE Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD------ESFNGFGTYFNANGAWP-VGVAEA 231 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~------~~~~g~p~~~~~~~~~~-~~~~~~ 231 (302) +++ .+.+++..+...+..++.|+||+.+|..|++|||++|||||++ .++.|+|+.++.+...+ .++..+ T Consensus 250 ~~d----~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~ 325 (421) T protein:vir:13 250 DYA----GLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKELSDGGDLVFKGRPVIELEESIFDVGDETKF 325 (421) T ss_pred chH----HHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecCcCCCCCceecceeeEEeccccccCCCceEE Confidence 333 4445555566677788999999999999999999999999975 35889999888765433 456778 Q ss_pred EEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccc-cC----CCCC Q lcl|NC_011054. 232 LVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGA-VV----PDGS 302 (302) Q Consensus 232 ~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~-~~----p~~~ 302 (302) ++|||+. |+++++++++++++++. +|++|++.||++.|+|+++.+++||+.+.....++ ++ |++| T Consensus 326 ~~gd~~~~~~~~~~~~~~v~~~~~~------~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~~~~ 396 (421) T protein:vir:13 326 IVSDFKTLIKFMDRKQYLIDQSKEA------GYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEVLKSS 396 (421) T ss_pred EEEeccccEEEEEecceEEEeeccc------ccccCeeEEEEEeeecceeecchhhheeeecccceeeccccccCCC Confidence 9999997 77899999999998764 59999999999999999999999987776554332 22 3333 No 81 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=1.9e-49 Score=287.64 Aligned_cols=266 Identities=18% Similarity=0.169 Sum_probs=217.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-Ccceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLAT-LPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) |+.+++++||++||+++..+|++.+++.++|+++++++|++++..++|+... ...+.|++|+++.++ .++++|++ T Consensus 109 ~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~----~~~~~~~~ 184 (389) T protein:vir:10 109 TSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPK----LAEPEFNK 184 (389) T ss_pred hcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEecCCCccccccccccccc----ccccccee Confidence 8889999999999999999999999999999999999999998899998764 566789999887654 35789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccch Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDAN 159 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (302) |++.+||+++++++|+|+++||.+++++||.++|++++++++|.+|++|+|+... ....+..+ T Consensus 185 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~-----------------~~~~~~~~ 247 (389) T protein:vir:10 185 VDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFTA-----------------KKTTTDTL 247 (389) T ss_pred eeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc-----------------cccccccc Confidence 9999999999999999999999999999999999999999999999998774321 11122233 Q ss_pred HHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-----------cccCcceEeecccc--cCC Q lcl|NC_011054. 160 EDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-----------SFNGFGTYFNANGA--WPV 226 (302) Q Consensus 160 ~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-----------~~~g~p~~~~~~~~--~~~ 226 (302) ++.+.+.+.. .+...+ .+.|+||+++|..|++|||++|||||+++ ++.|+|+.++.+.. ... T Consensus 248 ~d~l~~~~~~---~~~~~~--~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~ 322 (389) T protein:vir:10 248 VDSLKHILNV---DLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLA 322 (389) T ss_pred HHHHHHHHHh---hhhhhh--CcEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccccccceeEEecccccCCCC Confidence 4444443321 222222 46899999999999999999999999754 47899988876532 334 Q ss_pred CcceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCC Q lcl|NC_011054. 227 GVAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDG 301 (302) Q Consensus 227 ~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~ 301 (302) ++..+++|||++ |+++++++++++++++.. |. ..+|+.+|+|+++.+|+||++++.+++++.+|.= T Consensus 323 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~---~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 389 (389) T protein:vir:10 323 GDQKAFVGDLKRGVLFTDRQQVTLAWEDSKI------YG---KYLGAAFRFGVQKADSKAGYFVTNTDVPGSALGK 389 (389) T ss_pred CceEEEEeeccccEEEEeecceEEEeecccc------cc---ceEEEEEEeccEEecccceEEEEeeccCCCCCCC Confidence 566789999997 789999999999987643 22 3689999999999999999999988877766666 No 82 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=8e-50 Score=289.76 Aligned_cols=284 Identities=14% Similarity=0.072 Sum_probs=215.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |...+.++||++||+++.++|++.+++.||++++|+++++++ ..++|+.+..+.+.|++|..+.++ +++++|+++ T Consensus 76 ~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~-~~~i~~~~~~~~a~W~~e~~~~~~----~~~~~f~~i 150 (381) T protein:vir:10 76 INKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIKG----QLDAAFSEE 150 (381) T ss_pred HhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCc-ceEEEeecCCcceEEeeccccccc----ccCccceeE Confidence 777888899999999999999999999999999999999865 578999999999999999876543 357899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Cccccccccccccccccc-ceeecccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAANQ-DYTIVPGD 157 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~~-~~~~~~~~ 157 (302) ++.+||++++++||+|||+|+.+++++||+++|++++++++|.+|++|+|+ |.|++...........+. ......+. T Consensus 151 ~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~ 230 (381) T protein:vir:10 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGT 230 (381) T ss_pred eecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCCCceeeeecCCcccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999885 667654211111111110 00001111 Q ss_pred c---hHHHHHHHhhhhhhhhh-------hcccCccEEEecHHHHHHHHhhh---cCCCceeeecccccCcceEeeccccc Q lcl|NC_011054. 158 A---NEDDLIGCINRASKAVA-------AAGYMPDTLLASLGFRFDVANLR---DANGNPIFRDESFNGFGTYFNANGAW 224 (302) Q Consensus 158 ~---~~~~~~~~i~~~~~~~~-------~~~~~~~~~v~~~~~~~~l~~l~---d~~g~~i~~~~~~~g~p~~~~~~~~~ 224 (302) . +.....+.+..++..+. ..+..+..|+||+.++..++.++ +.+|+|+|..+ +|..++.+.. T Consensus 231 ~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~~lp----~g~~vv~~~~- 305 (381) T protein:vir:10 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP----FNLNVIESTV- 305 (381) T ss_pred ccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCCCCceeecCC----CCceeEEcCC- Confidence 1 11222222222222221 13455678999999999988655 88999997632 3443433322 Q ss_pred CCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 225 PVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 225 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) ..++.++||||++|++++|++++++++++. +|.+|++.||+..|+|+++.+++||+.++-+-.+ ..|+=+ T Consensus 306 -~p~~~i~fGDfs~Y~i~~r~~~~i~~~~~~------~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~-~~~~~~ 375 (381) T protein:vir:10 306 -QEAGKVLTYVKGLYDGYLAGGINVQKFKET------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG-HKPALE 375 (381) T ss_pred -CCcCcEEEEEcccEEEEEecccEEEeechh------hhhcCceEEEEEEEEcCEEecCCcEEEEEEeecC-Cccccc Confidence 345669999999999999999999999874 6999999999999999999999999987765444 344433 No 83 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=6e-49 Score=284.95 Aligned_cols=280 Identities=12% Similarity=0.070 Sum_probs=211.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |...+.++||++||+++.++|++.+++.++++++++++++++ ..++|+.+..+.+.|+.|..+.++ +++++|++| T Consensus 86 ~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~-~~~i~~~~~~~~a~w~~e~~~~~~----~~~~~f~~i 160 (395) T protein:vir:95 86 INYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGI-KTRVIKADPAGQAVWGKVFGEIKG----QLDAAFREE 160 (395) T ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceEEeecccccCc----cccccceee Confidence 777888999999999999999999999999999999999876 478999999999999988766542 367999999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC----Ccccccccccccccccccceeeccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK----PSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~----~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) ++.+||++++++||+||++|+.+++++||+++|++++++++|++||+|+|+ |.|++.......... ......+ T Consensus 161 ~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~---~~~~~~~ 237 (395) T protein:vir:95 161 NFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTNSGAV---TDKASSG 237 (395) T ss_pred eeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeeccccccccc---ccccccc Confidence 999999999999999999999999999999999999999999999999985 566654322111100 0111111 Q ss_pred cchH---HHHHHHhhhhhhhh-------hhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-----cccCcceEeecc Q lcl|NC_011054. 157 DANE---DDLIGCINRASKAV-------AAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-----SFNGFGTYFNAN 221 (302) Q Consensus 157 ~~~~---~~~~~~i~~~~~~~-------~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-----~~~g~p~~~~~~ 221 (302) ..+. +...+.+..++..+ ...+.....|+||+.++. |..|+|+|++. ++.|+|+++..+ T Consensus 238 ~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~------~~~g~~~~~~~~G~~~~~lg~g~~v~~~ 311 (395) T protein:vir:95 238 TLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSW------DVQARYTYLTANGGFVTVLPYNVTIITS 311 (395) T ss_pred hhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhh------hcCCcceeccCCCcceeccCCcceEEEc Confidence 1222 22223333333322 123345667999998765 55688888763 344556555443 Q ss_pred cccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeec--ccccCC Q lcl|NC_011054. 222 GAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTP--VGAVVP 299 (302) Q Consensus 222 ~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~--a~~~~p 299 (302) .. ..++.++||||++|++++|++++++++++. +|.+|++.||+.+|+|+++.|++||+.++-+. ++..++ T Consensus 312 ~~--~p~~~i~fgdfs~y~i~~r~~~~i~~~~~~------~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~~~~~~~~ 383 (395) T protein:vir:95 312 EF--VPEGKLVAFVTDRYNAVRGGGLTVKKFDQT------LALEDAVLFTAKTFAYGQPDDNKASAVYDLKVASAPRRQT 383 (395) T ss_pred CC--CCCCcEEEEecccEEEEEecceEEEeccch------hhhCCcEEEEEEEEECCEEeccccEEEEEeeccCCCCCCC Confidence 32 335568999999999999999999988763 58999999999999999999999998877652 222222 Q ss_pred CCC Q lcl|NC_011054. 300 DGS 302 (302) Q Consensus 300 ~~~ 302 (302) .+. T Consensus 384 ~~~ 386 (395) T protein:vir:95 384 SAG 386 (395) T ss_pred CCC Confidence 222 No 84 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=3.8e-49 Score=286.03 Aligned_cols=288 Identities=14% Similarity=0.089 Sum_probs=222.1 Q ss_pred CCCccCCCcceecchHH-HHHHHHHHHhhhhhhhhcceeecCC--CceEEEEEeCC-cceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAY-ANDLLASAKKGSTVLQAFPTVNMGT--KTTHLPVLATL-PGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~-~~~ii~~~~~~s~l~~~~~~~~~~~--~~~~~p~~~~~-~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) +. .+++.||.+||+++ .++|++.+++.++++++++.+++++ +.+++|+..++ ..+.|++|++..++..++.++++ T Consensus 157 ~~-~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~ 235 (477) T protein:vir:84 157 LD-RNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLT 235 (477) T ss_pred cc-ccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccc Confidence 33 34455777887775 6889999999999999999988754 46899987655 45789999998888888999999 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC---Ccccccccccccccccccceee Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---PSSWVSPALLPAAVAANQDYTI 153 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---~~g~~~~~~~~~~~~~~~~~~~ 153 (302) |+++++++||++++++||+|+++||.+++++||.++|++++++++|.+||+|+|+ |.|+++....+...... . T Consensus 236 f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~----~ 311 (477) T protein:vir:84 236 DGFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATS----A 311 (477) T ss_pred eeeEEEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccc----c Confidence 9999999999999999999999999999999999999999999999999999884 66776544332221111 1 Q ss_pred ccccchHHHHHHHhhhhhhhhhhcc-cCccEEEecHHHHHHHHhhhcCCCceeeecc--------------------ccc Q lcl|NC_011054. 154 VPGDANEDDLIGCINRASKAVAAAG-YMPDTLLASLGFRFDVANLRDANGNPIFRDE--------------------SFN 212 (302) Q Consensus 154 ~~~~~~~~~~~~~i~~~~~~~~~~~-~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~--------------------~~~ 212 (302) .......+..++.+.++...+...+ .+++.|+||++++..|++|||++|||||+++ .+. T Consensus 312 ~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~ 391 (477) T protein:vir:84 312 GSALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMH 391 (477) T ss_pred ccchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhc Confidence 1111223344455555555554444 4556899999999999999999999999764 567 Q ss_pred CcceEeecccccC----CCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEe-ccccEE Q lcl|NC_011054. 213 GFGTYFNANGAWP----VGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLG-NGATAV 287 (302) Q Consensus 213 g~p~~~~~~~~~~----~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~-~~~a~~ 287 (302) |+|+.++...... .....++||||+.++++. .++.++++++ .++.++++.||+..++++... +|+||+ T Consensus 392 G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~-~~~~~~~~~~------~~~~~~~~~~~v~~~~~~~~~r~~~afv 464 (477) T protein:vir:84 392 GLPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFE-SSVRMRALQE------TRAENLSVLLQVYGYLAFTAARFPQSVV 464 (477) T ss_pred ccceEecCcccccccccCCcceEEEEEeceEEEEe-eceeEEeccc------cccccceeeeeehhhhhhhhhccccceE Confidence 8898887665432 234578999999998887 4677777655 346678889999888887655 599999 Q ss_pred EEeeecccccCCC Q lcl|NC_011054. 288 GDNKTPVGAVVPD 300 (302) Q Consensus 288 ~lt~~~a~~~~p~ 300 (302) .+|++...+.|=+ T Consensus 465 ~~t~~~~~~~~~~ 477 (477) T protein:vir:84 465 EIGGTALTAPTFA 477 (477) T ss_pred EeecccccccccC Confidence 9999976665555 No 85 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=7.4e-49 Score=284.45 Aligned_cols=257 Identities=19% Similarity=0.195 Sum_probs=212.0 Q ss_pred CCC-ccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MAD-ISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLA-TLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~-~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) |.. .++++||++||+++.++|++.+++.++|++++++++++++..++|+.. ..+.+.|++|+++.++ +++++|+ T Consensus 133 ~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~----~~~~~f~ 208 (400) T protein:vir:38 133 VNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNPA----MAKPEFK 208 (400) T ss_pred HhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEEecCCCccccccccccccc----cccccce Confidence 333 467778999999999999999999999999999999999989999876 4577999999987664 3578999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +|++.++|++++++||+|+++||.+++++||.++|+++++.++|.++++|+|..+. .+.. T Consensus 209 ~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~--------------------~~~~ 268 (400) T protein:vir:38 209 PVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFTA--------------------KTIS 268 (400) T ss_pred eeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc--------------------cccc Confidence 99999999999999999999999999999999999999999999999998875321 0111 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccccC-CCcce Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAWP-VGVAE 230 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~~-~~~~~ 230 (302) +.+.+.+.+.... . ....+.|+||+++|..|++|||++|||||+++ ++.|+|+.++++...+ .++.. T Consensus 269 ~~~~~~~~~~~~~----~-~~~~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~ 343 (400) T protein:vir:38 269 SVDDLKHINNVDL----D-PAYSRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAH 343 (400) T ss_pred cHHHHHHHHHhhh----h-hhhCcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCccccccceeEEecccccCCCCceE Confidence 2333333332221 1 22357899999999999999999999999763 5789999888765433 55777 Q ss_pred EEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccc Q lcl|NC_011054. 231 ALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVG 295 (302) Q Consensus 231 ~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~ 295 (302) +++|||+. |+++++++++++++++.. +...+|+++|+|+++.+|++|++++.++++ T Consensus 344 ~~~gd~s~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 344 AFLGDIKRAILFANRADFMVRWVDDQI---------YGQFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred EEEEeccccEEEEeecceEEEEecccc---------cceeEEEEEEeccEEecccceEEEEeecCC Confidence 99999987 667889999999987642 335799999999999999999999998877 No 86 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=4.2e-49 Score=285.81 Aligned_cols=284 Identities=15% Similarity=0.084 Sum_probs=213.5 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |...++++||++||+++.++|++.+++.||++++++++++++. .++|+.+..+.+.|++|+.+.++ +++++|+++ T Consensus 83 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~~-~~i~~~~~~~~a~w~~e~~~~~~----~~~~~f~~i 157 (383) T protein:vir:78 83 INKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGLR-TKFLKSETSGVAVWGKIFGEIKG----QLDATFSDE 157 (383) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCCc-eEEEEEcCCcceEEeeccccccc----ccCcceeeE Confidence 8889999999999999999999999999999999999998765 78999999999999999876543 357899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccccccccccccccce-eecccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAANQDY-TIVPGD 157 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~~~~-~~~~~~ 157 (302) ++.+||++++++||+|||+|+.+++++||++++++++++++|.+|++|+|+ |.|++...........+... ....+. T Consensus 158 ~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 237 (383) T protein:vir:78 158 ESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATGT 237 (383) T ss_pred eecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccCCcccccccccccccccch Confidence 999999999999999999999999999999999999999999999999874 66665422111111111000 011112 Q ss_pred chHHHH---HHHhhhhhhhhh-------hcccCccEEEecHHHHHHHH---hhhcCCCceeeecccccCcceEeeccccc Q lcl|NC_011054. 158 ANEDDL---IGCINRASKAVA-------AAGYMPDTLLASLGFRFDVA---NLRDANGNPIFRDESFNGFGTYFNANGAW 224 (302) Q Consensus 158 ~~~~~~---~~~i~~~~~~~~-------~~~~~~~~~v~~~~~~~~l~---~l~d~~g~~i~~~~~~~g~p~~~~~~~~~ 224 (302) .+.+.+ .+.+..+..... .....+..|+||+.++..+. ...+.+|+|. +..|+|+.++.+.. T Consensus 238 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~----t~l~~~~~iv~s~~- 312 (383) T protein:vir:78 238 LTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYTSLNANGVYV----TALPFNLNIIESLF- 312 (383) T ss_pred hhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccchhccCCCCcee----eecCCCceEEecCC- Confidence 222222 122221111100 01123345888887654432 2346677765 45667766654432 Q ss_pred CCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEee-ecccccCCCC Q lcl|NC_011054. 225 PVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNK-TPVGAVVPDG 301 (302) Q Consensus 225 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~-~~a~~~~p~~ 301 (302) ..++.++||||++|+++++++++++++++. +|.+|++.||+.+|+|+++.+++||+.++- .....++|+| T Consensus 313 -~p~~~iifgdfs~Y~i~~r~~~~i~~~~~~------~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~~~~~~~~~ 383 (383) T protein:vir:78 313 -VPEKKAISYVAERYDALIGGPLDIGTYDQT------LAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNINPAEQTPEG 383 (383) T ss_pred -CCcccEEEeeccceEEEecccceEEecchh------hhhcCceEEEEEEEEcCEEecCCeEEEEEEEecCCCCCCCC Confidence 345668999999999999999999988764 699999999999999999999999998884 4677889999 No 87 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=8.7e-49 Score=284.06 Aligned_cols=277 Identities=14% Similarity=0.078 Sum_probs=217.5 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) +...+.++||++||+++.++|++.+.+.||++++|+++++++ ..++|+.++.+.|.|++|++++++ +++++|+++ T Consensus 79 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~i~~~~~~~~a~wv~e~~~~~~----~~~~~f~~i 153 (377) T protein:vir:96 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIFGEIKG----QLKQAFKEQ 153 (377) T ss_pred HhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceeEeeccccccc----ccCccceeE Confidence 677888899999999999999999999999999999999865 578999999999999999877653 357899999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccccccccccccccc-------- Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAANQD-------- 150 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~~~-------- 150 (302) ++.+||++++++||+|||+||.+++++||++++++++++++|.+|++|+|+ |.|++.............. T Consensus 154 ~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) T protein:vir:96 154 DFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) T ss_pred eeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccccccceeec Confidence 999999999999999999999999999999999999999999999999885 6677653322111111000 Q ss_pred --eeeccccchHHHHHHHhhhhhhhhhhc-------ccCccEEEecHHHHHHH---HhhhcCCCceeeecccccCcceEe Q lcl|NC_011054. 151 --YTIVPGDANEDDLIGCINRASKAVAAA-------GYMPDTLLASLGFRFDV---ANLRDANGNPIFRDESFNGFGTYF 218 (302) Q Consensus 151 --~~~~~~~~~~~~~~~~i~~~~~~~~~~-------~~~~~~~v~~~~~~~~l---~~l~d~~g~~i~~~~~~~g~p~~~ 218 (302) ........+.+.+.+.+.++...++.. ...+..|+||+.++..+ ...++++|+|. ++.|+|+.+ T Consensus 234 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~----~~l~~p~~v 309 (377) T protein:vir:96 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYV----TVLPHGITI 309 (377) T ss_pred cccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhccccccccCCCCCce----eccCCCceE Confidence 000111234455555555555444322 22455699999998776 34556777664 667788777 Q ss_pred ecccccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 219 NANGAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 219 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) ..+.. ..++.++||||++|+++++++++++.+++. +|.+|++.||+.+|+|+++.+++||+.++-+-- T Consensus 310 ~~s~~--~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~------~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 310 LESLA--VETGKAIAFVANRYDAFMATASTIEEYDQT------FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EecCC--CCcccEEEEEcCcEEEEEecccEEEeehhh------hhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 65533 334568999999999999999999999864 689999999999999999999999998886643 No 88 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=1.7e-48 Score=282.42 Aligned_cols=258 Identities=17% Similarity=0.177 Sum_probs=210.7 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCcceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLA-TLPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) ....+..+||++||+++.++|++.+++.++|+++++++++.++...+|+.. +...+.|++|+++.++ +++++|++ T Consensus 128 ~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~----~~~~~~~~ 203 (394) T protein:vir:97 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPA----LAKPDFKD 203 (394) T ss_pred ccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEecCCCccceecccccccc----ccccccee Confidence 445677779999999999999999999999999999999999889999876 4568899999988664 25689999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccch Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDAN 159 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (302) |++.+||++++++||+|+++|+.+++++||.++|++++++++|.++++|.++++. .+..+ T Consensus 204 v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~~--------------------~~~~~ 263 (394) T protein:vir:97 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTT--------------------KTVKN 263 (394) T ss_pred EEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--------------------ccccc Confidence 9999999999999999999999999999999999999999999999998764321 01122 Q ss_pred HHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccccCCCcceEE Q lcl|NC_011054. 160 EDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAWPVGVAEAL 232 (302) Q Consensus 160 ~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~~~~~~~~~ 232 (302) .+.+.+.+... ... ...+.|+||+++|..|++|+|++|||||+++ ++.|+|+.+..+. ..++..++ T Consensus 264 ~~~~~~~~~~~----~~~-~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~--~~~~~~~~ 336 (394) T protein:vir:97 264 LDEIKALLNGG----FDP-AYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDE--VLGANKAF 336 (394) T ss_pred HHHHHHHHHhh----hhh-hhCCEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCCceeccceeEEeccc--ccCCccEE Confidence 34443333322 222 2356799999999999999999999999764 5788888776543 45566789 Q ss_pred EEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCC Q lcl|NC_011054. 233 VVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPD 300 (302) Q Consensus 233 ~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~ 300 (302) +|||++ |+++++++++++++++. ++...+|+++|+|+++.+|++|++++.++++. |= T Consensus 337 ~gd~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~--p~ 394 (394) T protein:vir:97 337 IGDFKRGVLFADRKDLGLRWADNE---------IYGQYLQAVLRFGVSKVDDKAGYYVTFTPEPL--PL 394 (394) T ss_pred EeeccccEEEEEecceEEEEeccc---------ccceeEEEEEEEccEEecccceEEEEeccccc--CC Confidence 999987 67899999999887653 34467999999999999999999999886554 33 No 89 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=1.3e-48 Score=283.18 Aligned_cols=268 Identities=13% Similarity=0.046 Sum_probs=216.2 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCcceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLA-TLPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) ++..+++++|++||+++.+.| ..++..++++.++++++++++...+|+.. ..+.+.|++|++..++. ++++|++ T Consensus 156 ~~~~~~~~~g~lvp~~~~~~i-~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~----~~~~~~~ 230 (437) T protein:vir:10 156 VTGIALKDGKVIIPETILTPE-KEVHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTKN----ATPVITP 230 (437) T ss_pred hhhcccccccccchHHHHHHH-HHhhhhhhhhhcceeEeeccCceeeEEeecccccccccccccccccc----cccccee Confidence 677788899999999998765 55688889999999999998889999875 45789999999887642 5689999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccch Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDAN 159 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (302) |++.+||++++++||+|+++|+.+++++||.++|++++++++|.+|++|+|++... ..+..+ T Consensus 231 v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~------------------~~~~~~ 292 (437) T protein:vir:10 231 ILWDLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKK------------------TTSTYL 292 (437) T ss_pred eeeehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc------------------cccccc Confidence 99999999999999999999999999999999999999999999999998753210 011122 Q ss_pred HHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecccc---cCCCcc Q lcl|NC_011054. 160 EDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGA---WPVGVA 229 (302) Q Consensus 160 ~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~---~~~~~~ 229 (302) .+.+.+.+. ..+...+..++.|+||++++..|++|+|++|||||+++ ++.|+|+.++.+.. ...++. T Consensus 293 ~~~~~~~~~---~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~ 369 (437) T protein:vir:10 293 LGDLKKVLN---VTLKPQDSAAASIVMSQSAYNLFDMATDAMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDV 369 (437) T ss_pred hhhHHHHHH---hhhhhhhhcCCEEEEcHHHHHHHHHhhccCCCeeeccCccCCCCcccccceeEEecccccCCcCCCce Confidence 233333322 24556666778899999999999999999999999753 58899998876532 335667 Q ss_pred eEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeee-cccccCCCCC Q lcl|NC_011054. 230 EALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKT-PVGAVVPDGS 302 (302) Q Consensus 230 ~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~-~a~~~~p~~~ 302 (302) .++||||+. |.++++.+++++.++. +..+...+|+.+|+|+++.+|+||++|+++ ++-.++|.++ T Consensus 370 ~~~~gd~~~~~~~~~r~~~~~~~~~~--------~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~ 436 (437) T protein:vir:10 370 NIVVAPLKKAVINFKLTEITGQFQDT--------YDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVTVVQSTA 436 (437) T ss_pred EEEEeeccccEEEEeeeceEEEEecc--------cccccceeeEEEEEccEEecccceEEEEeeccccccCCCCC Confidence 799999986 5688899999987653 455667899999999999999999999976 4555555555 No 90 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=2e-48 Score=282.10 Aligned_cols=266 Identities=11% Similarity=0.108 Sum_probs=211.4 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCcceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLA-TLPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) |...++++||++||+++.++||+.++++++|+++++++++++ ..+|+.. ..+.+.|++|++..++ ++++|++ T Consensus 83 l~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~E~~~~~~-----~~~~f~~ 155 (352) T protein:vir:78 83 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKE-----LKLKGDT 155 (352) T ss_pred hccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC--ceEEEEecCCCccccccccccccc-----cccccee Confidence 888888999999999999999999999999999999988765 4567655 4578999999988665 5789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhh-cccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVI-FGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l-~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +++.+||++++++||+|+++||.+++++||.++|+++++++++..+| +|+| +|........ . .....++.. T Consensus 156 v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g--~~~~~g~l~~-~-----~~~~~t~~~ 227 (352) T protein:vir:78 156 VKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPK--SGLEHMSFYN-G-----SVKEVEGAN 227 (352) T ss_pred eeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCC--Ccccccceec-c-----ccccccccc Confidence 99999999999999999999999999999999999999988555333 4444 3322211111 0 111122222 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeee--cccccCcceEeecccccCCCcceEEEEec Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFR--DESFNGFGTYFNANGAWPVGVAEALVVDS 236 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~--~~~~~g~p~~~~~~~~~~~~~~~~~~gd~ 236 (302) . ++.+.+++..+...+..++.|+||+.++..|.+++|.+|+|+|. +.++.|+|+.+... ...++|||| T Consensus 228 ~----~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~~~~~~~~~llG~PV~~~~~------~~~~~~Gdf 297 (352) T protein:vir:78 228 M----YDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPAEKVFGKPVVFTDA------AVKPIVGDF 297 (352) T ss_pred h----HHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhccCCcccccCCccccccceEEecC------CCceeEeeh Confidence 2 45556666667777778889999999999999999999999985 45788999987753 345789999 Q ss_pred ceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCC Q lcl|NC_011054. 237 SRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPD 300 (302) Q Consensus 237 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~ 300 (302) ++++++ +.++.++..++ ..++++.|++.+|+|+++.+|+||+.++.++++...|. T Consensus 298 ~~~~~~-~~~~~~~~~~~--------~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~~~~~~ 352 (352) T protein:vir:78 298 NYFGIN-YDGTTYDTDKD--------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKESTGSLPS 352 (352) T ss_pred hhhhhh-hhhheeeeecc--------ccCCeeEEEEEeeeCceeechhheEEEEeecccCCCCC Confidence 988765 45566665554 34689999999999999999999999999998888998 No 91 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=3.7e-47 Score=275.17 Aligned_cols=277 Identities=13% Similarity=0.082 Sum_probs=219.5 Q ss_pred CCCcc-CCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC--------Ccceeeecccccccccccc Q lcl|NC_011054. 1 MADIS-RSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLAT--------LPGASWVSESATEPEGVKP 71 (302) Q Consensus 1 Ma~~t-~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~--------~~~a~~v~E~~~~~~~~~~ 71 (302) +...+ +..++.++|..+.+.|+...+..+.++++++.+++.++.+++|+.++ ...+.|++|+++.++ T Consensus 123 ~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~---- 198 (419) T protein:vir:94 123 APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ---- 198 (419) T ss_pred cccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccc---- Confidence 33333 34455567777777778888888899999999999998888887653 456889999987554 Q ss_pred ccccceeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Cccccccccccccccccc Q lcl|NC_011054. 72 TSEATWADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAANQ 149 (302) Q Consensus 72 ~s~~~f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~~ 149 (302) ++++|+++++++||++++++||+|+++|+ .++++||.++|++++++++|.+||+|+|+ |.|++......... T Consensus 199 -~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~---- 272 (419) T protein:vir:94 199 -STLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQ---- 272 (419) T ss_pred -cccceeeEEeeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccc---- Confidence 67899999999999999999999999987 57999999999999999999999999885 55655443322211 Q ss_pred ceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCce-eeecc-------cccCcceEeecc Q lcl|NC_011054. 150 DYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNP-IFRDE-------SFNGFGTYFNAN 221 (302) Q Consensus 150 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~-i~~~~-------~~~g~p~~~~~~ 221 (302) ........+....++++.+++..+...+..++.|+||+++|..|++++|++|++ +++++ .+.|+|+.++.. T Consensus 273 -~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~ 351 (419) T protein:vir:94 273 -QPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVA 351 (419) T ss_pred -ccccccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccCCCccccceeeEEcCC Confidence 111222334455667788888888888888899999999999999999987765 45433 677888877765 Q ss_pred cccCCCcceEEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccc Q lcl|NC_011054. 222 GAWPVGVAEALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGA 296 (302) Q Consensus 222 ~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~ 296 (302) . .++.+++|||++ |+++++++++++++++.. ++|++|++.||+++|+|+++.+|+||++++.+++.- T Consensus 352 ~----~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~----~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 352 I----AQGTALVGGFRQGATLWSRQGITVLMTDSHA----DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred C----CCccEEEeeccceEEEEEecceEEEEecccc----chhhcCcEEEEEEEeeccEEeccccEEEEEeccCCC Confidence 3 456789999987 567889999999887653 469999999999999999999999999999885444 No 92 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=1.4e-47 Score=277.39 Aligned_cols=266 Identities=11% Similarity=0.112 Sum_probs=209.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCcceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLA-TLPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) |..+++++||++||+++.++||+.+++.++|+++++++++++ ..+|+.. ....+.|++|++..++ ++++|++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~-----~~~~f~~ 190 (387) T protein:vir:26 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKE-----LKAKGDT 190 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCccccccccccccc-----cccccce Confidence 788888999999999999999999999999999999998865 4567655 5578999999988665 5789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhh-cccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVI-FGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l-~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +++.+||++++++||+|+++||.+++++||.++|+++++++++..+| +|+|+ |........ .....+++.. T Consensus 191 v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~--g~~~g~~~~------~~~~~~~~~~ 262 (387) T protein:vir:26 191 VKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS--GLEHMSFYN------GSVKEVEGAD 262 (387) T ss_pred eeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc--cccceeeec------cccccccccc Confidence 99999999999999999999999999999999999999999766544 44443 221111100 0111122222 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeee--cccccCcceEeecccccCCCcceEEEEec Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFR--DESFNGFGTYFNANGAWPVGVAEALVVDS 236 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~--~~~~~g~p~~~~~~~~~~~~~~~~~~gd~ 236 (302) .++.+.+++..+...+..++.|+||+.++..+.++++..|+++|. +.++.|+|+.+.+. ...++|||| T Consensus 263 ----~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~~~~~~llG~PV~~~~~------~~~~~~GDf 332 (387) T protein:vir:26 263 ----MYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPAEKVFGKPVVFTDA------AVKPIVGDF 332 (387) T ss_pred ----hHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCCccccccceEEecC------CCceeeech Confidence 345566666677777778889999999988887777777888875 45788999988764 345789999 Q ss_pred ceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCC Q lcl|NC_011054. 237 SRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPD 300 (302) Q Consensus 237 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~ 300 (302) +++++. +.++.+..+++ ...+++.|++..|+|+++.+|+||+.++.+.++.++|- T Consensus 333 ~~~~~~-~~~~~~~~~~~--------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:26 333 NYFGIN-YDGTTYDTDKD--------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred hhhhhh-hhhhhheeccc--------ccCCceEEEEEEEeCcEeechhheEEEEeecCCCCCCC Confidence 988664 34565655544 34689999999999999999999999999999888888 No 93 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=1.4e-47 Score=277.39 Aligned_cols=266 Identities=11% Similarity=0.112 Sum_probs=209.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCcceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLA-TLPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) |..+++++||++||+++.++||+.+++.++|+++++++++++ ..+|+.. ....+.|++|++..++ ++++|++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~-----~~~~f~~ 190 (387) T protein:vir:96 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKE-----LKAKGDT 190 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCccccccccccccc-----cccccce Confidence 788888999999999999999999999999999999998865 4567655 5578999999988665 5789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhh-cccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVI-FGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l-~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +++.+||++++++||+|+++||.+++++||.++|+++++++++..+| +|+|+ |........ .....+++.. T Consensus 191 v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~--g~~~g~~~~------~~~~~~~~~~ 262 (387) T protein:vir:96 191 VKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS--GLEHMSFYN------GSVKEVEGAD 262 (387) T ss_pred eeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc--cccceeeec------cccccccccc Confidence 99999999999999999999999999999999999999999766544 44443 221111100 0111122222 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeee--cccccCcceEeecccccCCCcceEEEEec Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFR--DESFNGFGTYFNANGAWPVGVAEALVVDS 236 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~--~~~~~g~p~~~~~~~~~~~~~~~~~~gd~ 236 (302) .++.+.+++..+...+..++.|+||+.++..+.++++..|+++|. +.++.|+|+.+.+. ...++|||| T Consensus 263 ----~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~~~~~~llG~PV~~~~~------~~~~~~GDf 332 (387) T protein:vir:96 263 ----MYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPAEKVFGKPVVFTDA------AVKPIVGDF 332 (387) T ss_pred ----hHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCCccccccceEEecC------CCceeeech Confidence 345566666677777778889999999988887777777888875 45788999988764 345789999 Q ss_pred ceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCC Q lcl|NC_011054. 237 SRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPD 300 (302) Q Consensus 237 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~ 300 (302) +++++. +.++.+..+++ ...+++.|++..|+|+++.+|+||+.++.+.++.++|- T Consensus 333 ~~~~~~-~~~~~~~~~~~--------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:96 333 NYFGIN-YDGTTYDTDKD--------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred hhhhhh-hhhhhheeccc--------ccCCceEEEEEEEeCcEeechhheEEEEeecCCCCCCC Confidence 988664 34565655544 34689999999999999999999999999999888888 No 94 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=1.4e-47 Score=277.39 Aligned_cols=266 Identities=11% Similarity=0.112 Sum_probs=209.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCcceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLA-TLPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) |..+++++||++||+++.++||+.+++.++|+++++++++++ ..+|+.. ....+.|++|++..++ ++++|++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~-----~~~~f~~ 190 (387) T protein:vir:94 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKE-----LKAKGDT 190 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCccccccccccccc-----cccccce Confidence 788888999999999999999999999999999999998865 4567655 5578999999988665 5789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhh-cccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVI-FGTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l-~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) +++.+||++++++||+|+++||.+++++||.++|+++++++++..+| +|+|+ |........ .....+++.. T Consensus 191 v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~--g~~~g~~~~------~~~~~~~~~~ 262 (387) T protein:vir:94 191 VKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS--GLEHMSFYN------GSVKEVEGAD 262 (387) T ss_pred eeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc--cccceeeec------cccccccccc Confidence 99999999999999999999999999999999999999999766544 44443 221111100 0111122222 Q ss_pred hHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeee--cccccCcceEeecccccCCCcceEEEEec Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFR--DESFNGFGTYFNANGAWPVGVAEALVVDS 236 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~--~~~~~g~p~~~~~~~~~~~~~~~~~~gd~ 236 (302) .++.+.+++..+...+..++.|+||+.++..+.++++..|+++|. +.++.|+|+.+.+. ...++|||| T Consensus 263 ----~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~~~~~~llG~PV~~~~~------~~~~~~GDf 332 (387) T protein:vir:94 263 ----MYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPAEKVFGKPVVFTDA------AVKPIVGDF 332 (387) T ss_pred ----hHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCCccccccceEEecC------CCceeeech Confidence 345566666677777778889999999988887777777888875 45788999988764 345789999 Q ss_pred ceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCC Q lcl|NC_011054. 237 SRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPD 300 (302) Q Consensus 237 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~ 300 (302) +++++. +.++.+..+++ ...+++.|++..|+|+++.+|+||+.++.+.++.++|- T Consensus 333 ~~~~~~-~~~~~~~~~~~--------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:94 333 NYFGIN-YDGTTYDTDKD--------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred hhhhhh-hhhhhheeccc--------ccCCceEEEEEEEeCcEeechhheEEEEeecCCCCCCC Confidence 988664 34565655544 34689999999999999999999999999999888888 No 95 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=1.5e-47 Score=277.22 Aligned_cols=264 Identities=11% Similarity=0.115 Sum_probs=208.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCcceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLA-TLPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) |...++++||++||+++.++||+.+++.++|+++++++++++ ..+|+.. ....+.|++|++..++ ++++|++ T Consensus 133 ~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~-----~~~~f~~ 205 (402) T protein:vir:93 133 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKE-----LKAKGDT 205 (402) T ss_pred hccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCC--ceeeeeeccCCccccccccccccc-----cccccce Confidence 788888899999999999999999999999999999998865 4577654 5678999999987654 5789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhh-cccCC--Ccccccccccccccccccceeeccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVI-FGTDK--PSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l-~G~g~--~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) +++.+||++++++||+|+++||.+++++||.++|+++++++++..+| .|+|. |.|++... ....+++ T Consensus 206 i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~----------~~~~~~~ 275 (402) T protein:vir:93 206 VKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNG----------SVKEVEG 275 (402) T ss_pred eeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecc----------ccccccc Confidence 99999999999999999999999999999999999999998766544 44443 22222111 1111222 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeee--cccccCcceEeecccccCCCcceEEEE Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFR--DESFNGFGTYFNANGAWPVGVAEALVV 234 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~--~~~~~g~p~~~~~~~~~~~~~~~~~~g 234 (302) . ..++.+.+++..+...+..++.|+||+.++..+.++++..|+++|. +.++.|+|+.+..+ ...++|| T Consensus 276 ~----~~~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~~~~~~~~llG~PV~~t~~------~~~i~~G 345 (402) T protein:vir:93 276 A----DMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPAEKVFGKPVVFTDA------AVKPIVG 345 (402) T ss_pred c----chHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCCccccccceEEecC------CCceeee Confidence 2 2345566676777777788889999999988877777667777774 55788999988764 3457899 Q ss_pred ecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCC Q lcl|NC_011054. 235 DSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPD 300 (302) Q Consensus 235 d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~ 300 (302) ||+++++.. .++.++.+++ ...+++.|++..|+|+++.+|+||+.++.+.++..||. T Consensus 346 Df~~~~~~~-~~~~~~~~~~--------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~~~~~~~~ 402 (402) T protein:vir:93 346 DFNYFGINY-DGTTYDTDKD--------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 402 (402) T ss_pred chhhhhhhh-hhhhhhhhhc--------ccCCceEEEEEEEeCcEEechhheEEEEeecCCCCCCC Confidence 999876543 3455555443 23589999999999999999999999999999999999 No 96 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=6.4e-47 Score=273.85 Aligned_cols=264 Identities=11% Similarity=0.095 Sum_probs=206.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCcceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLA-TLPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) |...++++||++||+++.++|++.+++.++|+++++++++++ ..+|+.. +...+.|++|++..++ ++++|++ T Consensus 118 l~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~E~~~~~~-----~~~~f~~ 190 (387) T protein:vir:93 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKE-----LKLKGDT 190 (387) T ss_pred hccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCC--ceEEEEeecCCccccccCcccccc-----cccccce Confidence 888888999999999999999999999999999999998865 4577654 5678999999987654 5789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhh-cccCCC--cccccccccccccccccceeeccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVI-FGTDKP--SSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l-~G~g~~--~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) +++.+||++++++||+|+++||.+++++||.++|+++++++++..+| +|+|++ .|++... ....+++ T Consensus 191 v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~----------~~~~v~~ 260 (387) T protein:vir:93 191 VKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNG----------SVKEVEG 260 (387) T ss_pred eeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecc----------ccccccc Confidence 99999999999999999999999999999999999999999776544 455432 2222111 1111222 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHH-HhhhcCCCceee-ecccccCcceEeecccccCCCcceEEEE Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDV-ANLRDANGNPIF-RDESFNGFGTYFNANGAWPVGVAEALVV 234 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l-~~l~d~~g~~i~-~~~~~~g~p~~~~~~~~~~~~~~~~~~g 234 (302) .. .++.+.+++..+...+..++.|+||+.++..+ ++++|.+|++++ .+.++.|+|+.+..+ ...++|| T Consensus 261 ~~----~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~~~~~~~~~~llG~PV~~~~~------~~~~~~G 330 (387) T protein:vir:93 261 AD----MYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPAEKVFGKPVVFTDA------AVKPIVG 330 (387) T ss_pred cc----hHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCCccccccceEEecC------CCceeee Confidence 22 34556666667777777888999999987665 455565555444 345788999988754 2357899 Q ss_pred ecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCC Q lcl|NC_011054. 235 DSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPD 300 (302) Q Consensus 235 d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~ 300 (302) ||++++++ +.++.+...++ +.++++.|+++.|+|+++.+|+||+.++.+++++.+|. T Consensus 331 Df~~~~~~-~~~~~~~~~~~--------~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~~~~~~~~ 387 (387) T protein:vir:93 331 DFNYFGIN-YDGTTYDTDKD--------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGSLPS 387 (387) T ss_pred ehhhhhee-hhhheeeeccc--------ccCCceeEEEEeeeCceeechhheEEEEeecCCCCCCC Confidence 99998765 44566655443 45789999999999999999999999999988888898 No 97 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=2.6e-46 Score=270.50 Aligned_cols=286 Identities=16% Similarity=0.097 Sum_probs=212.9 Q ss_pred CCCc-cCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADI-SRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~-t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) +... +.++++.+||+++.+.|++.+++.+++++++++.++++ ..++++....+.+.|++|++++++ ++++|++ T Consensus 148 ~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g-~~~~~~~~~~~~a~wv~E~~~~~~-----~~~~f~~ 221 (466) T protein:vir:80 148 AQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLKG-TARQNIAGAIPEGVWTEAVANLNE-----LSLSFSQ 221 (466) T ss_pred hhhhhhhccccccccHHHHHHHHHhhhhhhhhhhheeeeecCc-eeEeeeecCCcceeeccccccccc-----ccccccc Confidence 2222 23456678999999999999999999999999999875 468888888899999999988654 5799999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccccccccccccccceeecccc Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) |++.+||++++++||+|+++||.+++++||+++|++++++++|.+||+|+|+ |.|+++.................... T Consensus 222 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~ 301 (466) T protein:vir:80 222 IEVDGYKVGGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTN 301 (466) T ss_pred eeecceeeeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999875 56665432211111110000000000 Q ss_pred ch--------------HHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhh---cCCCceeeeccc---ccCcceE Q lcl|NC_011054. 158 AN--------------EDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLR---DANGNPIFRDES---FNGFGTY 217 (302) Q Consensus 158 ~~--------------~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~---d~~g~~i~~~~~---~~g~p~~ 217 (302) .+ ...+.+.+..+.........+...|+||+.++..|..++ +.+|.+++.+.. +.|.|+. T Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~~~~~i~G~pvv 381 (466) T protein:vir:80 302 LSTTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLNNTMPIVGGDIV 381 (466) T ss_pred cchhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccccccCCCccccccccee Confidence 00 111112222222222223334456999999999999988 677888876543 5688876 Q ss_pred eecccccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeec-ccc Q lcl|NC_011054. 218 FNANGAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTP-VGA 296 (302) Q Consensus 218 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~-a~~ 296 (302) +.... .++.+++|||+.|++++|+++++.++++. +|.+|++.||+.+|+|+++.+|+||++++.+. .++ T Consensus 382 ~s~~~----~~~~~~~g~~~~y~i~~r~~~~i~~~~~~------~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~ 451 (466) T protein:vir:80 382 ILDFI----PDNDIIGGYGSLYLLAERADIKLAQSEHV------RFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPT 451 (466) T ss_pred ecCcc----CccceeeeccccEEEEeecceEEEechhh------hhhcCcEEEEEEEEEccEEeccCceEEEEecCCCcc Confidence 66543 34558999999999999999999988763 58999999999999999999999999998654 344 Q ss_pred cCCCCC Q lcl|NC_011054. 297 VVPDGS 302 (302) Q Consensus 297 ~~p~~~ 302 (302) |++++- T Consensus 452 ~~~~~~ 457 (466) T protein:vir:80 452 TSITFA 457 (466) T ss_pred cceeee Confidence 444443 No 98 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=8.7e-46 Score=267.63 Aligned_cols=255 Identities=15% Similarity=0.140 Sum_probs=209.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe-CCcceeeeccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLA-TLPGASWVSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~~~~~~~s~~~f~~ 79 (302) ++..++.+++.++|+++.+.|++ ++..++++++++.++++++...+|+.. ....+.|++|+++.++ +++++|++ T Consensus 132 ~~~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~----~~~~~~~~ 206 (397) T protein:vir:96 132 RDGFTSVEGGALIPQELLQPQLE-PKDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKNPQ----LANPKMVE 206 (397) T ss_pred hhcccccccccchhHHHHHHHHH-hhhhhhHHHhhhhccccccceeEEEEeccCCccccccccccccc----cccccccc Confidence 77788889999999999999987 578889999999999998888888765 4577899999987654 35789999 Q ss_pred EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccch Q lcl|NC_011054. 80 RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDAN 159 (302) Q Consensus 80 i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (302) |++++|++++++++|+|+++|+.+++++||.++|++++++++|.++++|+|.... .+..+ T Consensus 207 i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~~--------------------~~~~~ 266 (397) T protein:vir:96 207 IDYSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTATA--------------------KSVVG 266 (397) T ss_pred eeecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--------------------ccccc Confidence 9999999999999999999999999999999999999999999999999875321 11223 Q ss_pred HHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeeccccc--CCCcce Q lcl|NC_011054. 160 EDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNANGAW--PVGVAE 230 (302) Q Consensus 160 ~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~~~~--~~~~~~ 230 (302) ++.+.+.+.... ..+ .++.|+||+++|..|++|+|++|||||+++ ++.|+|+.++.+... ..++.. T Consensus 267 ~d~~~~~~~~~~----~~~-~~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~ 341 (397) T protein:vir:96 267 VDGLKDLINKEI----KKV-YDVKLFISASMYSELDKLKDKNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVV 341 (397) T ss_pred hHHHHHHHHHhh----hhh-cCcEEEEcHHHHHHHHHhhccCCCeEeccCccCCCcccccccceEEecccccCCCCCceE Confidence 444444443322 222 357899999999999999999999999754 578899887765332 345667 Q ss_pred EEEEecce-EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 231 ALVVDSSR-VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 231 ~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) +++|||++ |++++++++++.++++.. +.+.+|+++|+|+++.+|+||++++.+.| T Consensus 342 ~~~gd~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 342 GFIGDAKAFASFFDRKQVSVSWVDNNI---------YGQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred EEEeehhcceEeEeecceEEEEecccc---------cceeEEEEEEEccEEecccceEEEEeecC Confidence 99999997 678999999999887532 34579999999999999999999998877 No 99 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=1.2e-37 Score=223.04 Aligned_cols=281 Identities=11% Similarity=0.087 Sum_probs=204.7 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceee-cCCCceEEEEEeCCc-ceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVN-MGTKTTHLPVLATLP-GASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~-~~~~~~~~p~~~~~~-~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) |. .+..+||+|+|++. +++|+.+++.+++++++++++ +.++...+|....+. ...|..|+++.++ .+.++++|+ T Consensus 14 it-~~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~~~--~~~~~~tf~ 89 (314) T protein:vir:41 14 ID-VPDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTKVA--PTADEVTVS 89 (314) T ss_pred cc-cccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCccceeecccccCcccccccccccCCcc--CCccccccc Confidence 75 34566899999887 579999999999999999985 567778888765332 2334333332221 234789999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchH--HHHHHHHHHHHHHHHHHHHHHhhcccCCC--cc---cccccccccccccccce Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDAST--SLLEEIAALGGQAIGKKLDQAVIFGTDKP--SS---WVSPALLPAAVAANQDY 151 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~--~~~~~i~~~l~~ai~~~~d~~~l~G~g~~--~g---~~~~~~~~~~~~~~~~~ 151 (302) ++++..||+...+.||+|+|+|+.. +|+++|.+.+++++++.++..+++|+|+. .+ ..+.+....+. ... T Consensus 90 ~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~---~~~ 166 (314) T protein:vir:41 90 TNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAG---NQY 166 (314) T ss_pred ceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhhhcc---cce Confidence 9999999999999999999999965 99999999999999999999999999842 11 11222222111 111 Q ss_pred eeccccchHHHHHHHhhhhhhhhhhcccC---ccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeecc Q lcl|NC_011054. 152 TIVPGDANEDDLIGCINRASKAVAAAGYM---PDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNAN 221 (302) Q Consensus 152 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~---~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~~ 221 (302) +..+ ..+.....+.+.+++..+...+++ ...|+||+.++.+++++++.+|+++|++. .+.|+|+..+.. T Consensus 167 ~~~~-~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~~~l~G~PV~~~~~ 245 (314) T protein:vir:41 167 TDAE-PEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIGATGLQYDGIPIQYVPA 245 (314) T ss_pred eecC-ccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhhCCCCceecceeeEeccc Confidence 1111 112234456667777777776654 45799999999999999999999998654 466777766554 Q ss_pred c-ccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccccc Q lcl|NC_011054. 222 G-AWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAV 297 (302) Q Consensus 222 ~-~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~ 297 (302) . ....++.+++||||++++++++..+.++...+ ..++++.|.+..|+|+.+.+.++.++..-..+++= T Consensus 246 ~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~~~~~--------a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 246 LDALGDDKARALLTVPTNLVYGFWRNIRIEPKRD--------AAMRRTEYIASLRADCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred ccccCCCCceEEEechhheEEEeeceeEEeeccc--------CcCCeEEEEEEEEeceEEEEcCcEEEEEeeccCCC Confidence 3 33457899999999999999988877765543 46789999999999999998877666554433331 No 100 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=4.5e-37 Score=219.85 Aligned_cols=275 Identities=14% Similarity=0.145 Sum_probs=192.7 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceee-cCCCceEEEEEeC----Ccceeeecccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVN-MGTKTTHLPVLAT----LPGASWVSESATEPEGVKPTSEA 75 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~-~~~~~~~~p~~~~----~~~a~~v~E~~~~~~~~~~~s~~ 75 (302) |. .++.+||+|+|++. +++|+.+++.|++++++++++ +.+....++.... .....|.+|..+. +++++ T Consensus 19 ~t-~~d~~Gg~l~P~~~-~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~~~~~~~~-----~~~~~ 91 (315) T protein:vir:41 19 ID-VPDLGRGVLSVDRF-GEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDETGQKLAP-----PESTA 91 (315) T ss_pred cC-CcCCCCceechHHH-HHHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccccccCcCCC-----CCCcc Confidence 43 45567888888776 569999999999999999864 5544445544321 1234455555443 34679 Q ss_pred ceeeEEeeeeeEEEeehhHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhcccCCCc-c--cccccccccccccccc Q lcl|NC_011054. 76 TWADRTLVAEEVAVIIPVHENVVDDAS--TSLLEEIAALGGQAIGKKLDQAVIFGTDKPS-S--WVSPALLPAAVAANQD 150 (302) Q Consensus 76 ~f~~i~l~~~ki~~~~~iS~ell~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~-g--~~~~~~~~~~~~~~~~ 150 (302) +|+++++..|++.+.+.||+|+|+|+. ++|+++|.+++++++++.++.++++|+|... . ..+.+....+.... . T Consensus 92 ~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~-~ 170 (315) T protein:vir:41 92 EVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLASEKL-T 170 (315) T ss_pred ccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCccccccccceecccccc-c Confidence 999999999999999999999999986 4999999999999999999999999987421 1 11122221111100 0 Q ss_pred eeeccccchHHHHHHHhhhhhhhhhhccc---CccEEEecHHHHHHHHhhhcCCCceeeecc-------cccCcceEeec Q lcl|NC_011054. 151 YTIVPGDANEDDLIGCINRASKAVAAAGY---MPDTLLASLGFRFDVANLRDANGNPIFRDE-------SFNGFGTYFNA 220 (302) Q Consensus 151 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~~---~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-------~~~g~p~~~~~ 220 (302) ....+ ........+.+.++...+...++ .+..|+||+.++..+++++|.+|+|+|++. ++.|+|+.... T Consensus 171 ~~~~~-~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~g~~~tl~G~PV~~~~ 249 (315) T protein:vir:41 171 ESDVD-PEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLGDQALTGANSILYDGRPVQYVP 249 (315) T ss_pred ccccc-cccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccccchhhcCCCceecccceEecc Confidence 00011 11112223455556666665554 356799999999999999999999999654 56777876665 Q ss_pred ccc-cCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEecccc-EEEEeee Q lcl|NC_011054. 221 NGA-WPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGAT-AVGDNKT 292 (302) Q Consensus 221 ~~~-~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a-~~~lt~~ 292 (302) .+. ...++..++||||++++++++.++.+++..++ .++.+.|.+..|+|+.+.++++ .+++.+. T Consensus 250 ~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a--------~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 250 ALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYDA--------EMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred cccccCCCCccEEEecccceEEEeccccEEEeeecC--------CCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 543 34567889999999999999999988876553 4566788999999998776554 2333333 No 101 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=1.8e-35 Score=211.12 Aligned_cols=273 Identities=16% Similarity=0.074 Sum_probs=191.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) +-..+...++++.|..+...+...++..+++++.++..+.. ...+|.......+.|+.||+.. |+++++|+.+ T Consensus 239 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~--~~~~~~~~~~~~a~~~~eG~~k-----p~s~~tf~~~ 311 (517) T protein:vir:97 239 AELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP--TLVVGGDNALTQGTGHTTGTDK-----TESNITLQTR 311 (517) T ss_pred eecccccccccccchHHHHHHHHhhhhhccceeeeeecccc--ceeeecccccceeeeeecCCcc-----cccccceeeE Confidence 22222334678899999999999999999998888765544 3566777777788999998764 4578999999 Q ss_pred EeeeeeEEEeehhHHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDASTS----LLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~~~----~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) ++.++++++++++|+++++|+..+ +++||.++|++++++++|.+||+|+|++.. ..++.+.+..... ....+ T Consensus 312 ~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~--~~gi~~~a~~~~~--~~~~~ 387 (517) T protein:vir:97 312 VLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVS--ETQIYPVVGDAWA--TNVTG 387 (517) T ss_pred EeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcc--ccccccccccccc--ccccc Confidence 999999999999999999998776 999999999999999999999999986422 2222222211111 11111 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecccccCcceEeecc--cccCCCcceEEEE Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDESFNGFGTYFNAN--GAWPVGVAEALVV 234 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~~~g~p~~~~~~--~~~~~~~~~~~~g 234 (302) + +.+.+.+..+...+.. ..++.|+||+.+|..|++|||++|||||++....+.+....+. .......+...++ T Consensus 388 ~---~~~~d~i~~l~~a~~~--a~~a~~vmn~~t~~~I~klKD~~G~Yl~~~~~~~~~~~~l~G~~~~~~~~~~~~~~~~ 462 (517) T protein:vir:97 388 T---TNIQELLEKLSVATPK--AADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAV 462 (517) T ss_pred c---chHHHHHHHHHHHhhh--ccCCEEEECHHHHHHHHHhhcCCCCeeccCcCCcccccccCCccccccccccCceeEe Confidence 2 2223333333333322 2356799999999999999999999999876544443332221 0001112223455 Q ss_pred ecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccccc Q lcl|NC_011054. 235 DSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAV 297 (302) Q Consensus 235 d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~ 297 (302) +.+.|.++.+.++.+.... .+.+|+..|+.++|+++.|..|++|+..+.+|..+= T Consensus 463 ~~~~y~i~~~~g~~~~~~f--------d~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 463 SLSGYVTNGSRGMEFEQGT--------ILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) T ss_pred eccccEEEeecceeeeeee--------ecccCceeEeeeeeeccccccccceEEEEEcCCCCC Confidence 6678888888776543221 145789999999999999999999999888864442 No 102 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=1.4e-33 Score=200.75 Aligned_cols=287 Identities=13% Similarity=0.131 Sum_probs=208.2 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) +....+.++|++||+++..+|++.+++.++++++++++++.+....+|....++.+.|+++....+ ...++++|+++ T Consensus 18 ~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~~~~~~e~~~~---~~~~~~~~~~~ 94 (321) T protein:vir:31 18 ALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERHRRPQDEGEWN---ENESDVSTGTI 94 (321) T ss_pred cccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCcccccccccccc---cccccceeeee Confidence 444566778899999999999999999999999999999999888999887777778887433222 23467899999 Q ss_pred EeeeeeEEEeehhHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhcccCCCccc---ccccccccccccccceeecc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDDAS--TSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSW---VSPALLPAAVAANQDYTIVP 155 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~---~~~~~~~~~~~~~~~~~~~~ 155 (302) ++..||+.+.+.||+|+|+|+. ++|+++|.+.+++++++.++.++++|+|..... .+.+..+.+........... T Consensus 95 ~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~~~~~~~ 174 (321) T protein:vir:31 95 DISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGFITVAEGDVETIDAAD 174 (321) T ss_pred eeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhhhhhhccccccccccc Confidence 9999999999999999999974 699999999999999999999999999853211 11122111111111122222 Q ss_pred ccchHHHHHHHhhhhhhhhhhccc--CccEEEecHHHHHHHHh-hhcCCCceeeec-------ccccCcceEeecccccC Q lcl|NC_011054. 156 GDANEDDLIGCINRASKAVAAAGY--MPDTLLASLGFRFDVAN-LRDANGNPIFRD-------ESFNGFGTYFNANGAWP 225 (302) Q Consensus 156 ~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~-l~d~~g~~i~~~-------~~~~g~p~~~~~~~~~~ 225 (302) +..+.+.+ .++...+...++ ....|+||+.++..++. |+|. +.++|++ .++.|+|+..+... T Consensus 175 ~~~~~d~l----~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~-~~~~~~~~l~~~~~~tl~G~pvv~~~~m--- 246 (321) T protein:vir:31 175 DILDNDLV----IRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDR-DTPLGDNVIMGEADVNPFSFPIIGSGLW--- 246 (321) T ss_pred cccCHHHH----HHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcC-CCccccchhhccccccccceeEEEcCCC--- Confidence 33344444 445555555443 24479999999887765 5554 4567654 35678888766553 Q ss_pred CCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeee--cccccCCCCC Q lcl|NC_011054. 226 VGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKT--PVGAVVPDGS 302 (302) Q Consensus 226 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~--~a~~~~p~~~ 302 (302) .+..++++||+.+.++.++++.+++..+.... ....+.+......++|+.|.+.++++.+++. |...+.|+-| T Consensus 247 -P~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~~~~~~~~~~ 321 (321) T protein:vir:31 247 -PDDKAMFTDPQNLIYALYRDLEIDVLTESDKV---SERDLHARYFMRGDDDFAIENTEAVVLAEGLGDPLEHLEEETS 321 (321) T ss_pred -CCCcEEEeccccEEEEEeeccEEEEeecCccc---cccceeeEeeeeeecceeEeccccEEEEecCCcchhcccCCCC Confidence 45679999999999999999998877654211 1123444445566799999999999999964 5677788888 No 103 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.96 E-value=1.2e-30 Score=184.68 Aligned_cols=258 Identities=14% Similarity=0.072 Sum_probs=201.0 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhccee----ecCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTV----NMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||+.+++.+..++|+.+++.+++.+++.+.+.+++... ...+..++||++...+.+.|++||+..+. ++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~-----~~~~ 75 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPM-----TQLG 75 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccc-----cccc Confidence 99999988999999999999999999999888877542 23456799999988899999999987653 6789 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccc Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) ++++++.+++++..+.+|+++..++..++.+++.+++++++++++|+.++..-.. .....++ T Consensus 76 ~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~------------------a~~~~~~ 137 (272) T protein:vir:98 76 FKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSK------------------STQTVEA 137 (272) T ss_pred cceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcc------------------ccccccc Confidence 9999999999999999999999999999999999999999999999999953110 0011122 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCC-------Cceee-ec--ccccCcceEeecccccCC Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDAN-------GNPIF-RD--ESFNGFGTYFNANGAWPV 226 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~-------g~~i~-~~--~~~~g~p~~~~~~~~~~~ 226 (302) ..+.+ .+.++...+...+....+|+|||.++..|++.+..+ |..+. +. ..+.|+|+.+.... T Consensus 138 ~~t~d----~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~---- 209 (272) T protein:vir:98 138 TATVD----GVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKC---- 209 (272) T ss_pred ccCHH----HHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCC---- Confidence 23334 444555556666677789999999999998764221 11111 11 35778888777653 Q ss_pred CcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccccc Q lcl|NC_011054. 227 GVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAV 297 (302) Q Consensus 227 ~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~ 297 (302) ++..+++.+...+.++.+++++++..++. .++...+++..||++++.+|+++++++.++|+-- T Consensus 210 p~~t~~~~~~~a~~~~~~~~~~ve~~r~~--------~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 210 PKGTAYMVRKGALRIMLKRNTMVETDRDI--------TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred CcceEEEEcCCeEEEEecCCceeeecccc--------ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 35567778888888888888888876653 3466789999999999999999999998877775 No 104 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.96 E-value=1.2e-30 Score=184.68 Aligned_cols=258 Identities=14% Similarity=0.072 Sum_probs=201.0 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhccee----ecCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTV----NMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||+.+++.+..++|+.+++.+++.+++.+.+.+++... ...+..++||++...+.+.|++||+..+. ++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~-----~~~~ 75 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPM-----TQLG 75 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccc-----cccc Confidence 99999988999999999999999999999888877542 23456799999988899999999987653 6789 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccc Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) ++++++.+++++..+.+|+++..++..++.+++.+++++++++++|+.++..-.. .....++ T Consensus 76 ~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~------------------a~~~~~~ 137 (272) T protein:vir:30 76 FKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSK------------------STQTVEA 137 (272) T ss_pred cceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcc------------------ccccccc Confidence 9999999999999999999999999999999999999999999999999953110 0011122 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCC-------Cceee-ec--ccccCcceEeecccccCC Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDAN-------GNPIF-RD--ESFNGFGTYFNANGAWPV 226 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~-------g~~i~-~~--~~~~g~p~~~~~~~~~~~ 226 (302) ..+.+ .+.++...+...+....+|+|||.++..|++.+..+ |..+. +. ..+.|+|+.+.... T Consensus 138 ~~t~d----~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~---- 209 (272) T protein:vir:30 138 TATVD----GVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKC---- 209 (272) T ss_pred ccCHH----HHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCC---- Confidence 23334 444555556666677789999999999998764221 11111 11 35778888777653 Q ss_pred CcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccccc Q lcl|NC_011054. 227 GVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAV 297 (302) Q Consensus 227 ~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~ 297 (302) ++..+++.+...+.++.+++++++..++. .++...+++..||++++.+|+++++++.++|+-- T Consensus 210 p~~t~~~~~~~a~~~~~~~~~~ve~~r~~--------~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 210 PKGTAYMVRKGALRIMLKRNTMVETDRDI--------TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred CcceEEEEcCCeEEEEecCCceeeecccc--------ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 35567778888888888888888876653 3466789999999999999999999998877775 No 105 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=99.94 E-value=1.4e-30 Score=184.32 Aligned_cols=260 Identities=14% Similarity=0.008 Sum_probs=157.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) -+..+....+..+|+.+.+.+.......++....++.. ..+.....|++|....++... ..++... T Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~g~~~~~~~~e~~~~~~~~~---~~~~~~~ 275 (480) T protein:vir:40 210 GADLNVVNSLGSITSKYARKSGIYDGAMKARFQGLTLA-----------EDGVDDTFISGTFKAGTDKNK---SQTATKR 275 (480) T ss_pred hccccccccccccccchhhheeechhhhhhhhhcceee-----------eccccceeeeeeeeccccccc---ccccccc Confidence 11112222233345555444444444444443333221 223445678888765544322 2234444 Q ss_pred Eee---eeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 81 TLV---AEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 81 ~l~---~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) .+. .++++.....|+++++|+. ++++||.++|++.++++++++||+|+|++... ..+...... ..+.. T Consensus 276 ~~~~~~v~~l~~~~k~t~~lLDDa~-~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~--~~g~~~~~~------~~~~~ 346 (480) T protein:vir:40 276 SLRPQMAEAYLQMDKATVRGVNDSG-ALSEYVMSEMVNRVIQKVEYNMILGSVDGSNG--FYGLKTATD------GWTKQ 346 (480) T ss_pred hhhHHHHHHHHHhHHHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCccc--cccceeecc------ccccc Confidence 554 4688888899999999976 79999999999999999999999997654221 111111111 11111 Q ss_pred chHHHHHHHhhhhhhhhhhcccCcc-EEEecHHHHHHHHhhhcCCCceeeeccc-------ccCcceEeecccccCCCcc Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPD-TLLASLGFRFDVANLRDANGNPIFRDES-------FNGFGTYFNANGAWPVGVA 229 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~-~~v~~~~~~~~l~~l~d~~g~~i~~~~~-------~~g~p~~~~~~~~~~~~~~ 229 (302) .+.+ +.+..++..+...+..++ +|+||+.+|..|++|||++|||||++.. +.|+|+.+.... ...+. T Consensus 347 ~~~~---d~id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD~~G~Yi~q~~~~~~~~~~llG~pvv~~~~~--~~~~~ 421 (480) T protein:vir:40 347 IEYT---DLFEGITDAVAECSISDAITIVMSPQTFAELRKAKGTDGHSRFNELATKEQIAQSFGAVNLETRVW--MPKDE 421 (480) T ss_pred chhH---HHHHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhcCCCCeeccCcccccCcceecccceeeeecc--ccCCc Confidence 2222 334445555555555555 7999999999999999999999998753 456665332211 11233 Q ss_pred eEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccccc Q lcl|NC_011054. 230 EALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAV 297 (302) Q Consensus 230 ~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~ 297 (302) +.+..+..++++++++ ++..++. .+..|+..|+++.|+++.+.+|.+++.++....=-| T Consensus 422 ~~~~~~~~~~~~~d~~---~~~~~~~------~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:40 422 VAVYNHDEYVLIGDLN---VENYNDF------DLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGSLGV 480 (480) T ss_pred ceeeeCCccEEEEecc---cceeccc------ccccchhhhhhhhhhceeeEccccEEEEEeccCcCC Confidence 3444444567788864 2333221 246788999999999999999999988776533333 No 106 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.86 E-value=3.5e-23 Score=143.66 Aligned_cols=277 Identities=14% Similarity=0.117 Sum_probs=200.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |+..|..+++.+.|+.....|||.+.+++++++++++.++.++.+++++++.-+.+.|...++..++. ...+|.++ T Consensus 25 m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~~lp~a~~r~~n~~~~~~----~~~Tf~q~ 100 (330) T protein:vir:94 25 MPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNRENVLGDVQFLAVGGTITAK----NPATFTKV 100 (330) T ss_pred hhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceeeeeecCCcceeeecccccccc----Ccceeeee Confidence 99999999999999999999999999999999999998888888999999999999999988776542 23589999 Q ss_pred EeeeeeEEEeehhHHHHHh--cchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceee--ccc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVD--DASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTI--VPG 156 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~--ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~--~~~ 156 (302) +...+.+.+.+.|.+++.+ .+..++..+-.+...+++.+++++++|+|++.+..+ .+...... ....... .++ T Consensus 101 t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~~~F--~GL~~~~~-~~q~i~tg~~gg 177 (330) T protein:vir:94 101 TSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTGNSF--QGMMGLVA-ASQTISAGANGG 177 (330) T ss_pred eechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccc--cchhhcCC-cccEEecCCCCC Confidence 9999999999999999965 345688889999999999999999999998765432 12222221 1111211 234 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeecc----------cccCcceEeecccccC- Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDE----------SFNGFGTYFNANGAWP- 225 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~----------~~~g~p~~~~~~~~~~- 225 (302) ..+.+. +..+...+......++.|+||+....+++.+....|++-..+. ...|.|+...+..... T Consensus 178 ~~T~d~----LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~ 253 (330) T protein:vir:94 178 TLTFEL----LDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQIPTYRGVPWFVNDFIPSNM 253 (330) T ss_pred CCCHHH----HHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEEeeeCCeEEEecccccCCC Confidence 455444 4444444444455688999999999999999877665533221 2345555444332221 Q ss_pred -----CCcceEEEEecc-----eEEEEee----cCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEee Q lcl|NC_011054. 226 -----VGVAEALVVDSS-----RVRIGVR----QDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNK 291 (302) Q Consensus 226 -----~~~~~~~~gd~~-----~~~~~~~----~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~ 291 (302) .+...|++..|. +.+.|.. .+++++...+ .-+++...++++++|+.++.+|+|+.+|.. T Consensus 254 ~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~-------~~~k~v~~~~v~~y~~~av~~~~a~~~L~~ 326 (330) T protein:vir:94 254 TQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGA-------KENADETITRVKMYCGFANFSQLGLAAIKG 326 (330) T ss_pred CcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCC-------ccccceeeEEEEEeeeeEEechhheeeecc Confidence 223445555542 4556653 3555543321 124567789999999999999999999998 Q ss_pred eccc Q lcl|NC_011054. 292 TPVG 295 (302) Q Consensus 292 ~~a~ 295 (302) ...+ T Consensus 327 V~~g 330 (330) T protein:vir:94 327 LIPG 330 (330) T ss_pred ccCC Confidence 8777 No 107 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.84 E-value=1.2e-22 Score=140.85 Aligned_cols=259 Identities=12% Similarity=0.058 Sum_probs=185.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeec----CCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNM----GTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~----~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||+..|.-...++|+.+...+.+.+.+...+.+++..-+. .+..+++|.+.....+.++.|+.+++. .+.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~-----~~lt 75 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISL-----DKIG 75 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccCh-----hhcC Confidence 9998888888899999999999999988888888755432 356799999987778889999987654 4567 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccc Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) .++.++..++.+..+.++++...++..++.+.+.++++.++++++|+.++..-.. .....+. T Consensus 76 ~~~~~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~------------------~~~~~~~ 137 (272) T protein:vir:36 76 TTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKT------------------TSQTVST 137 (272) T ss_pred CcceeEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc------------------ccccccc Confidence 7888889999999999999998888899999999999999999999999853110 0001122 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhc------CCCcee-eec--ccccCcceEeecccccCCC Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRD------ANGNPI-FRD--ESFNGFGTYFNANGAWPVG 227 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d------~~g~~i-~~~--~~~~g~p~~~~~~~~~~~~ 227 (302) ..+.+ .+.++...+........+++|||..+..|++... ..|..+ .+. +...|+++.+.+....... T Consensus 138 ~~~~d----~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~ 213 (272) T protein:vir:36 138 KANVD----GVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA 213 (272) T ss_pred cccHH----HHHHHHHHhhhcCCCceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCCCCCCce Confidence 23334 4445555555556667889999999999987542 222222 221 3567888877766443322 Q ss_pred cceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 228 VAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 228 ~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) ....++.-...+.++..++++++..++. .+....+++..+|+.++.+|+++++++.+.+ T Consensus 214 ~~~~~~~~~gA~~~~~~~~~~vE~~R~~--------~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 214 LMFKIVSNSPALKLVLKRGVQVETDRDI--------VTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred eEEEEEecccceeeeecCCcccccccch--------hhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 2222222223344556667777655543 2344578999999999999999999998877 No 108 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.84 E-value=3.2e-22 Score=138.41 Aligned_cols=260 Identities=15% Similarity=0.060 Sum_probs=192.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeec----CCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNM----GTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~----~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||+..|.-+..++|+.++..+.+.+.+...+.+++....- .+..+++|++...+.+.++.|+..++. ++.+ T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~-----~~it 75 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-----DILE 75 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccc-----cccc Confidence 9999999899999999999999999998888888755421 345789999987778899999887654 4677 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccc Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) +++.++..++.+..+.++++...++..++.+.+.+++++++++++|+.++..-.+.+ ...... T Consensus 76 ~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~-----------------~~~~~~ 138 (274) T protein:vir:93 76 TKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-----------------LTVNAD 138 (274) T ss_pred cceeEEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------------cccccc Confidence 888899999999899999999998889999999999999999999999995322110 001111 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhh------cC-CCceeeec---ccccCcceEeecccccCC Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLR------DA-NGNPIFRD---ESFNGFGTYFNANGAWPV 226 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~------d~-~g~~i~~~---~~~~g~p~~~~~~~~~~~ 226 (302) ..+.+.+.+ +...+........+++|||..+..|++.. ++ .|..+... +...|+++.+.+.. T Consensus 139 ~~~~d~i~d----A~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~---- 210 (274) T protein:vir:93 139 ITKLNGLQS----AIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKL---- 210 (274) T ss_pred ccCHHHHHH----HHHHhhhccCCccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEEEcCCC---- Confidence 223444444 44444444556788999999999998642 11 12222221 24678887776543 Q ss_pred CcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccC Q lcl|NC_011054. 227 GVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVV 298 (302) Q Consensus 227 ~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~ 298 (302) +....++.+...+.++...++.++..++. .+....+++..+|++++.+|+++++++++.++..- T Consensus 211 p~~t~~l~~~gai~~~~~~~~~vE~~Rd~--------~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 211 EAGTAILAKKGAVKLILKRDFFLEVARDA--------STKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred CcceEEEEeCCeEEEEecCCcccccccch--------hhcccEEEEEEEEEEEEEcCCceEEEeeCccccCC Confidence 35567777778887888788777766553 23456899999999999999999999987666654 No 109 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.79 E-value=1.9e-20 Score=128.70 Aligned_cols=263 Identities=13% Similarity=0.083 Sum_probs=182.0 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVN----MGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||+.+|.-+..++|+.|+..+.+.+++...+.+++.... -.+..+++|++.....+.++.|+..++. .+.+ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~-----~~lt 75 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDY-----SALE 75 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCcc-----cccc Confidence 999888888899999999999999998888877764432 2355789999986677888999877654 4567 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccc Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) +++.++..++.+..+.++++...++..++.+.+.+++++++++.+|+.++..-... ........+ T Consensus 76 ~~~~~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a-----------~~~~~~~~t---- 140 (278) T protein:vir:80 76 TESVKHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTT-----------TLEVKGAIN---- 140 (278) T ss_pred cceeeEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcc-----------ccccccccc---- Confidence 88888888888888999999999888899999999999999999999998642100 000000001 Q ss_pred cchHHHHHHHhhhhhhhhhhccc-CccEEEecHHHHHHHHhhhcC-------CCceeeec---ccccCcceEeecccccC Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGY-MPDTLLASLGFRFDVANLRDA-------NGNPIFRD---ESFNGFGTYFNANGAWP 225 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~-~~~~~v~~~~~~~~l~~l~d~-------~g~~i~~~---~~~~g~p~~~~~~~~~~ 225 (302) ....+..++.+.++...+..... ....++|||..+..|++.... .|..+... +...|+++.+.+... T Consensus 141 ~~~~~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p-- 218 (278) T protein:vir:80 141 IGLIDKIENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKKLA-- 218 (278) T ss_pred cchhhhHHHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCCCC-- Confidence 11122233444444444433322 234588999999999875311 12222221 246788887776542 Q ss_pred CCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccc Q lcl|NC_011054. 226 VGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVG 295 (302) Q Consensus 226 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~ 295 (302) ....++.....+.++..+++.++..++. .+....+++.++|+.++.+|++++++++...- T Consensus 219 --~~t~~l~~~gAi~~~~~~~~~vE~~Rd~--------~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 219 --DGNALAVKAGALKTFLKRNLLAESGRDM--------DHKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred --cceEEEEeccceeeeecCCcccccccch--------hhccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 3344455555666667777777655543 23455788899999999999999999987444 No 110 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.79 E-value=1.1e-20 Score=129.95 Aligned_cols=262 Identities=16% Similarity=0.106 Sum_probs=189.5 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVN----MGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||+.+|.-...++|+.+.+.+.+.+.+...+.+++..-+ .++..+++|.+.....+.++.|+.+++. .+.+ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~-----~~lt 75 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPV-----DKIE 75 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCc-----cccc Confidence 999888888888999999999999999988888876533 3566899999987788889999987654 4567 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccc Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) +++.....++.+..+.++++....+..++.+.+.++++.++++++|+.++.--... ....... T Consensus 76 ~~~~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~-----------------~~~~~~~ 138 (276) T protein:vir:10 76 TNRREAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGT-----------------KLTVSAD 138 (276) T ss_pred cceeeEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc-----------------ccccccc Confidence 78888888999999999999999888899999999999999999999998411000 0001112 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCC-------Cce-eeec--ccccCcceEeecccccCC Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDAN-------GNP-IFRD--ESFNGFGTYFNANGAWPV 226 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~-------g~~-i~~~--~~~~g~p~~~~~~~~~~~ 226 (302) ..+.+.+. ++...+........+++|||..+..|+++.+.+ |.. +.+. +...|+++.+.... T Consensus 139 ~~t~d~i~----~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~---- 210 (276) T protein:vir:10 139 IGTLAGLE----AAIDTFDDEDLEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGAVIVRSKKL---- 210 (276) T ss_pred ccCHHHHH----HHHHHhccccCcccEEEEcHHHHHHHHHhccccccccccccccceeccccceecceeEEEcCCC---- Confidence 23344444 444444444556778999999999998764211 211 2211 24567776665542 Q ss_pred CcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 227 GVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 227 ~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) +....++.....+.++..+++.++..++.. +....+++..+|+.++.+|+.++++++...+. |.|- T Consensus 211 p~~t~~l~~~gAi~~~~~~~~~vE~dRd~~--------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~--~~~~ 276 (276) T protein:vir:10 211 DEGEAILAKRGAVKLITKRDFFLETDRDPS--------TKTTALYSDKHYVAYLYDESKAVKVTKGAGTT--DSGA 276 (276) T ss_pred CcceEEEEeccceeeeecCCceeecccchh--------hcccEEEEeeEEEEEEEcCcceEEEecCCcCC--cCCC Confidence 344455555556666777788877766542 34567888999999999999999999877555 5555 No 111 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.78 E-value=2.7e-20 Score=127.84 Aligned_cols=260 Identities=16% Similarity=0.094 Sum_probs=187.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVN----MGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||+.+|.-...++|+.++..+.+.+.+...+.++++.-+ -.+..+++|++.....+..+.|+..++. .+.+ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~-----~~it 75 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPV-----DQIG 75 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCch-----hhcc Confidence 999998888889999999999999988888877765432 1355799999876677778888876654 4567 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccc Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) ++..++..++.+..+.++++...++..++.+.+.++++.++++.+|+.++.--...+ ...... T Consensus 76 ~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~-----------------~~~~~~ 138 (274) T protein:vir:96 76 TSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT-----------------LTVEAD 138 (274) T ss_pred cceeEEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-----------------CCcCcc Confidence 788888889988889999999888888999999999999999999999985321100 001111 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhh------cC-CCceeeec---ccccCcceEeecccccCC Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLR------DA-NGNPIFRD---ESFNGFGTYFNANGAWPV 226 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~------d~-~g~~i~~~---~~~~g~p~~~~~~~~~~~ 226 (302) ..+.+. +.++...+........+++|||..+..|+++. +. .|..+... +...|+++.+.+.. T Consensus 139 ~~~~d~----i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~---- 210 (274) T protein:vir:96 139 ITKLDG----LQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKL---- 210 (274) T ss_pred cccHHH----HHHHHHHhcccCCCceEEEeCHHHHHHHHhcccccccccccccccceeecccceecCeeEEEcCCC---- Confidence 223344 44454455555556788999999999998863 11 12222221 24667777666553 Q ss_pred CcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccC Q lcl|NC_011054. 227 GVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVV 298 (302) Q Consensus 227 ~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~ 298 (302) +....++.....+.++...++.++..++. .+....+++.++|+.++.+|++++++++..+-.+- T Consensus 211 p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~--------~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 211 NKGEALLAKKGAVKLITKRDFFLEKDRDA--------SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred CcceEEEEeCcceeeeecCCcccccccch--------hhcccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 23445555566666777777777655443 23456788999999999999999999998777766 No 112 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.78 E-value=2.1e-20 Score=128.41 Aligned_cols=260 Identities=15% Similarity=0.064 Sum_probs=183.5 Q ss_pred CCCccCCC-cceecchHHHHHHHHHHHhhhhhhhhcceeec----CCCceEEEEEeCCcceeeecccccccccccccccc Q lcl|NC_011054. 1 MADISRSE-VATLIQEAYANDLLASAKKGSTVLQAFPTVNM----GTKTTHLPVLATLPGASWVSESATEPEGVKPTSEA 75 (302) Q Consensus 1 Ma~~t~~~-~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~----~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~ 75 (302) ||..+.+. ...++|+.++..+.+.+.+...+.+++..-+. .+..+++|.+.....+.++.|+..++. .+. T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-----~~l 75 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPI-----DLI 75 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcch-----hhc Confidence 76655333 45567999999999999999888888765432 356799999987778888999887654 456 Q ss_pred ceeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecc Q lcl|NC_011054. 76 TWADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVP 155 (302) Q Consensus 76 ~f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~ 155 (302) ++++.+...++.+..+.++++....+..++.+.+.++++.++++++|+.++.--++.. ..... T Consensus 76 t~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~-----------------~~~~~ 138 (275) T protein:vir:96 76 ETKKRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGAT-----------------LKVEA 138 (275) T ss_pred ccceeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------------ccccc Confidence 7788888889999999999999888878899999999999999999999985322100 00111 Q ss_pred ccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhc-------CCCceeeec---ccccCcceEeecccccC Q lcl|NC_011054. 156 GDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRD-------ANGNPIFRD---ESFNGFGTYFNANGAWP 225 (302) Q Consensus 156 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d-------~~g~~i~~~---~~~~g~p~~~~~~~~~~ 225 (302) ...+.+.+.+ +...+........+++|||..+..|+++.. ..|..+... +...|+++.+.+.. T Consensus 139 ~~~~~d~i~d----A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~--- 211 (275) T protein:vir:96 139 DITKLAGLQT----AIDKFNDEDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNKI--- 211 (275) T ss_pred cccCHHHHHH----HHHHhccccCCccEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCCC--- Confidence 2234444444 444444444567789999999999987631 123222222 24667777666543 Q ss_pred CCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccC Q lcl|NC_011054. 226 VGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVV 298 (302) Q Consensus 226 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~ 298 (302) +....++.....+.++...++.++..++. .+....+++.++|+.++.+|+++++++++|++.=. T Consensus 212 -p~~t~~i~~~gA~~~~~~~~~~vE~~Rd~--------~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 212 -KEGEAILAKRGAVKLITKRDFFLETERHA--------SHKSTALFSDKHYVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred -CcceEEEEeccceeeeecCCcccccccch--------hhcCcEEEEeEEEEEEEEcCccEEEEEecccccCC Confidence 23344444455566667777777766553 23456788999999999999999999999887744 No 113 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.77 E-value=1e-19 Score=124.69 Aligned_cols=260 Identities=14% Similarity=0.059 Sum_probs=188.4 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVN----MGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||+..|.-...++|+.+...+.+.+++...+.+++..-+ .++..+++|++.....+..+.|+..++. .+.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~-----~~lt 75 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-----DILE 75 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccc-----cccc Confidence 999998888899999999999999988877777775532 2456799999886677888888877653 4567 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccc Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) .++.++..++.+..+.++++....+..++.+.+.++++.++++.+|+.++.--.+. ... .... T Consensus 76 ~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a---------------~~~--~~~~ 138 (274) T protein:vir:94 76 TKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA---------------KLT--VNAD 138 (274) T ss_pred cceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc---------------Ccc--cccc Confidence 78888889998889999999988888899999999999999999999998532110 000 0111 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhh------cC-CCceeeec---ccccCcceEeecccccCC Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLR------DA-NGNPIFRD---ESFNGFGTYFNANGAWPV 226 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~------d~-~g~~i~~~---~~~~g~p~~~~~~~~~~~ 226 (302) ..+.+. +.++...+........+++|||..+..|++.. ++ .|..+... +...|+++.+.+.. T Consensus 139 ~~~~d~----i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~---- 210 (274) T protein:vir:94 139 ITKLNG----LQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKL---- 210 (274) T ss_pred ccCHHH----HHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCC---- Confidence 223344 44444455555556788999999999998631 11 12332222 24677777766553 Q ss_pred CcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccC Q lcl|NC_011054. 227 GVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVV 298 (302) Q Consensus 227 ~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~ 298 (302) +....++.....+.++..+++.++..++.. +....+++..+|++++.+|.++++++++.|+..- T Consensus 211 p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~--------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 211 EAGTAILAKKGAVKLILKRDFFLEVARDAS--------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred CcceEEEEeCcceEeeecCCceeccccchh--------hcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 245556666667777777787777666542 2345788899999999999999999988777655 No 114 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.77 E-value=1e-19 Score=124.69 Aligned_cols=260 Identities=14% Similarity=0.059 Sum_probs=188.4 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVN----MGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||+..|.-...++|+.+...+.+.+++...+.+++..-+ .++..+++|++.....+..+.|+..++. .+.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~-----~~lt 75 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-----DILE 75 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccc-----cccc Confidence 999998888899999999999999988877777775532 2456799999886677888888877653 4567 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccc Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) .++.++..++.+..+.++++....+..++.+.+.++++.++++.+|+.++.--.+. ... .... T Consensus 76 ~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a---------------~~~--~~~~ 138 (274) T protein:vir:97 76 TKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA---------------KLT--VNAD 138 (274) T ss_pred cceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc---------------Ccc--cccc Confidence 78888889998889999999988888899999999999999999999998532110 000 0111 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhh------cC-CCceeeec---ccccCcceEeecccccCC Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLR------DA-NGNPIFRD---ESFNGFGTYFNANGAWPV 226 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~------d~-~g~~i~~~---~~~~g~p~~~~~~~~~~~ 226 (302) ..+.+. +.++...+........+++|||..+..|++.. ++ .|..+... +...|+++.+.+.. T Consensus 139 ~~~~d~----i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~---- 210 (274) T protein:vir:97 139 ITKLNG----LQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKL---- 210 (274) T ss_pred ccCHHH----HHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCC---- Confidence 223344 44444455555556788999999999998631 11 12332222 24677777766553 Q ss_pred CcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccC Q lcl|NC_011054. 227 GVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVV 298 (302) Q Consensus 227 ~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~ 298 (302) +....++.....+.++..+++.++..++.. +....+++..+|++++.+|.++++++++.|+..- T Consensus 211 p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~--------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 211 EAGTAILAKKGAVKLILKRDFFLEVARDAS--------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred CcceEEEEeCcceEeeecCCceeccccchh--------hcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 245556666667777777787777666542 2345788899999999999999999988777655 No 115 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.74 E-value=8.5e-19 Score=119.65 Aligned_cols=280 Identities=13% Similarity=0.063 Sum_probs=186.5 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeE Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADR 80 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i 80 (302) |..+|..+.+.+.+..+...|||.+.++|.|++.+++.++.++.+.+.++..-+.+.+.+.+.+......+.+..+|.++ T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~~t~~~~ 80 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAAATFTKV 80 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccccCCCcccccccccee Confidence 99999999999999999999999999999999999999998988999998877777665554433322234467899999 Q ss_pred EeeeeeEEEeehhHHHHHhc--c-hHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceee--cc Q lcl|NC_011054. 81 TLVAEEVAVIIPVHENVVDD--A-STSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTI--VP 155 (302) Q Consensus 81 ~l~~~ki~~~~~iS~ell~d--s-~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~--~~ 155 (302) +...+-+++.+.|.+.+.+- + ..+...+=.+...+++.++.+..+|||+.+..+ ..+........ ..... .. T Consensus 81 ~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~--F~GL~~~~~~~-q~i~~~~~g 157 (310) T protein:vir:97 81 NSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNE--FAGLIQLCASG-QKATTGATG 157 (310) T ss_pred eeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCc--ccchhhcCCcc-ceeecCCCC Confidence 99999999999999866552 2 334555556778899999999999999875432 11222222211 11111 22 Q ss_pred ccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhh-cCCCceeeec---------ccccCcceEeecccccC Q lcl|NC_011054. 156 GDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLR-DANGNPIFRD---------ESFNGFGTYFNANGAWP 225 (302) Q Consensus 156 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~-d~~g~~i~~~---------~~~~g~p~~~~~~~~~~ 225 (302) +..+.+ ++..+...+......++.|+|||.++.+++.+. ..+++-++.. ....|.|+...+..... T Consensus 158 g~~t~d----~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi~~~d~ip~~ 233 (310) T protein:vir:97 158 SAISFA----ILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPIFRNDYIPTN 233 (310) T ss_pred CCCCHH----HHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEEEEeCccCCC Confidence 444444 445555555555667889999999866665432 2222222221 24556676655443221 Q ss_pred ------CCcceEEEEecc-----eEEEEee----cCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEe Q lcl|NC_011054. 226 ------VGVAEALVVDSS-----RVRIGVR----QDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDN 290 (302) Q Consensus 226 ------~~~~~~~~gd~~-----~~~~~~~----~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt 290 (302) .+...+++..|. +-++|.. .+++++.... .-+++...+|++++|+.++..|+|+.+|. T Consensus 234 ~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~-------~~~~~v~~~~V~~Y~~~av~~~~A~a~L~ 306 (310) T protein:vir:97 234 QTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGE-------SEDSDEHIWRVKWYCGLALFSEKGLACAD 306 (310) T ss_pred ccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCc-------ccCCcceeEEEEEeeeEEEecccceeeec Confidence 233445555543 2344532 2455443321 12456678999999999999999999999 Q ss_pred eecc Q lcl|NC_011054. 291 KTPV 294 (302) Q Consensus 291 ~~~a 294 (302) ...- T Consensus 307 ~V~~ 310 (310) T protein:vir:97 307 GITN 310 (310) T ss_pred cccC Confidence 6544 No 116 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.74 E-value=1.1e-19 Score=124.62 Aligned_cols=292 Identities=13% Similarity=0.088 Sum_probs=181.6 Q ss_pred CC------------CccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeecccccccc Q lcl|NC_011054. 1 MA------------DISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMG-TKTTHLPVLATLPGASWVSESATEPE 67 (302) Q Consensus 1 Ma------------~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~~ 67 (302) |+ ..++.++..+||..+++-+.|....-....+++..+... +.++.+|- -+.-.++-++||++.++ T Consensus 59 m~G~~p~~eV~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~-~g~~Ra~~IgEGgE~~~ 137 (393) T protein:vir:79 59 MEGETPTNEVNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPS-IGIMRAYDVAEGQEIPE 137 (393) T ss_pred hcCCCchhheehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccc-hheeeeccccccccccc Confidence 11 134666889999999999999888887788888888774 33454443 34667888999999987 Q ss_pred ccccccccceeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccccc Q lcl|NC_011054. 68 GVKPTSEATWADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAA 147 (302) Q Consensus 68 ~~~~~s~~~f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~ 147 (302) .+. +..+++.|+++.+|++..+.+|+|+++||..++.+++.....+++++..|..++++.-+.......+ ......+ T Consensus 138 ~sl--d~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa-~st~t~a 214 (393) T protein:vir:79 138 DSI--DWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDN-YSTNKLA 214 (393) T ss_pred cch--hhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeec-cccCccc Confidence 543 4568899999999999999999999999999999999999999999999999999865432211111 1111111 Q ss_pred ccceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhh---cCC----Cce---ee------ecccc Q lcl|NC_011054. 148 NQDYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLR---DAN----GNP---IF------RDESF 211 (302) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~---d~~----g~~---i~------~~~~~ 211 (302) ..+.-...+.-......+++.++...+.+..+.++.|+|||-.|+.+.+-. ... |++ .+ .|+.+ T Consensus 215 hptGr~~~~~qNGTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~algp~~i 294 (393) T protein:vir:79 215 HTTGLDKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSMALGPDSI 294 (393) T ss_pred eeecCCccccccccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeeccccccCccccchhhhhchhhh Confidence 111111112223333445555666666777889999999999999997642 111 111 11 11122 Q ss_pred cC-cc----eEeecccccCCCcceEEEEecceEEEEeecCcEEEEee-cccccchhhhcCCcEEEEEEEEeccEEecc-c Q lcl|NC_011054. 212 NG-FG----TYFNANGAWPVGVAEALVVDSSRVRIGVRQDITVKFLD-QATVGSINLAERDMIALRLKARFAYVLGNG-A 284 (302) Q Consensus 212 ~g-~p----~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~-~ 284 (302) .| +| +.+....+........ +|+..++..+.|.... +.+..+.+....|...++...|+|++|++. . T Consensus 295 ~~~~~~nlnv~~sPfvp~d~k~~rF------d~~~Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~iKl~ERYG~gvLn~gk 368 (393) T protein:vir:79 295 QGRLPFNFNVNLSPFIPLDKKSRRF------DVYAVDRNNVGVLLVRDDLKTDQWDEKARGLQNIKMIERYGIGILNEGK 368 (393) T ss_pred ccccccceeEEEeccccccccccee------eEEEeecCCceEEEEecCcceeccccccccceeeeeeeeeceeeeeCCc Confidence 22 23 2222222333222211 3444555554444332 223333445667889999999999999885 4 Q ss_pred cEEEEeeecccccCCCCC Q lcl|NC_011054. 285 TAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 285 a~~~lt~~~a~~~~p~~~ 302 (302) +++.-.-......-|+-- T Consensus 369 aiavakNI~~~k~y~~P~ 386 (393) T protein:vir:79 369 AIAVAKNISMDKSYAEPM 386 (393) T ss_pred eEEEEecceeecccccch Confidence 443322222222222222 No 117 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.72 E-value=1e-18 Score=119.24 Aligned_cols=260 Identities=15% Similarity=0.070 Sum_probs=183.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVN----MGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||+..|.-...++|+.+...+.+.+.+...+.+++..-. ..+..+++|.+...+.+..+.|+..++. .+.+ T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-----~~lt 75 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-----DILE 75 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccch-----hhcc Confidence 999998888899999999999999988877777765432 2466899999886677888888877653 4567 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccc Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) .++.++..++.+..+.++++....+..++.+.+.++++.++++++|+.++.--.+.. ...... T Consensus 76 ~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~-----------------~~~~~~ 138 (274) T protein:vir:12 76 TKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-----------------LTVNAD 138 (274) T ss_pred cceeeEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------------cccccc Confidence 777788888888899999988888878899999999999999999999985322110 001112 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhh------cCC-Cceeeec---ccccCcceEeecccccCC Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLR------DAN-GNPIFRD---ESFNGFGTYFNANGAWPV 226 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~------d~~-g~~i~~~---~~~~g~p~~~~~~~~~~~ 226 (302) ..+.+.+.+ +...+........+++|||..+..|++.. +++ |..+... +...|+++.+.+... T Consensus 139 a~~~d~i~d----A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p--- 211 (274) T protein:vir:12 139 ITKLNGLQS----AIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRSNKLE--- 211 (274) T ss_pred ccCHHHHHH----HHHHhccccccccEEEeCHHHHHHHHhhhhhhccccccccccceecccceeecCeeEEEeCCCC--- Confidence 234444444 44444444456778999999999998741 222 2222222 246778777765432 Q ss_pred CcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccC Q lcl|NC_011054. 227 GVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVV 298 (302) Q Consensus 227 ~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~ 298 (302) ....++.....+.++..+++.++..++.. +....+++..+|++++.||++++++++..|+..- T Consensus 212 -~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~--------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:12 212 -AGTAILAKKGAVKLILKRDFFLEVARDAS--------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred -cceEEEEeccceeeeecCCceeccccchh--------hcccEEEeeeEEEEEEEcCCceEEEEcCCccccC Confidence 23334444455556667777887766542 3445788999999999999999999976666544 No 118 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.72 E-value=1.5e-18 Score=118.35 Aligned_cols=260 Identities=15% Similarity=0.059 Sum_probs=183.0 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVN----MGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||+..|.-...++|+.++..+.+.+.+...+.+++..-+ -.+..+++|.+.....+..+.|+..++. .+.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-----~~lt 75 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPT-----DILE 75 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccch-----hhcc Confidence 999888777888899999999999988888777764332 2466899999886677888888876654 3566 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccc Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) .+..++..++.+..+.++++....+..++.+.+.++++.++++.+|+.++.--.+.. ...... T Consensus 76 ~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~-----------------~~~~~~ 138 (274) T protein:vir:95 76 TKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK-----------------LTVEAD 138 (274) T ss_pred cceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------------cccccc Confidence 777788888888889999998888878999999999999999999999985221110 001112 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhh------cCC-Cceeeec---ccccCcceEeecccccCC Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLR------DAN-GNPIFRD---ESFNGFGTYFNANGAWPV 226 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~------d~~-g~~i~~~---~~~~g~p~~~~~~~~~~~ 226 (302) ..+++.+.+ +...+........+++|||..+..|++.. +++ |..+... +.+.|+++.+.++. T Consensus 139 ~~~~d~i~~----A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~---- 210 (274) T protein:vir:95 139 ITKLTGLQT----AIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKL---- 210 (274) T ss_pred ccCHHHHHH----HHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCC---- Confidence 233444444 44444444456778999999999998741 222 2222222 24678887766543 Q ss_pred CcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccC Q lcl|NC_011054. 227 GVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVV 298 (302) Q Consensus 227 ~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~ 298 (302) +....++.....+.++..+++.++..++. .+....+++.++|++++.+|++++++++..++..- T Consensus 211 ~~~t~~l~~~gA~~~~~~~~~~vE~~Rd~--------~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 211 EAGTAILAKKGAVKLITKRDFFLETDRDP--------STKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred CCceEEEEeccceeeeecCCccccccccc--------ccccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 23333444445555666777777766553 34556788999999999999999999977666644 No 119 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.72 E-value=1.5e-18 Score=118.35 Aligned_cols=260 Identities=15% Similarity=0.059 Sum_probs=183.0 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVN----MGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||+..|.-...++|+.++..+.+.+.+...+.+++..-+ -.+..+++|.+.....+..+.|+..++. .+.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-----~~lt 75 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPT-----DILE 75 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccch-----hhcc Confidence 999888777888899999999999988888777764332 2466899999886677888888876654 3566 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccc Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) .+..++..++.+..+.++++....+..++.+.+.++++.++++.+|+.++.--.+.. ...... T Consensus 76 ~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~-----------------~~~~~~ 138 (274) T protein:vir:96 76 TKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK-----------------LTVEAD 138 (274) T ss_pred cceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------------cccccc Confidence 777788888888889999998888878999999999999999999999985221110 001112 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhh------cCC-Cceeeec---ccccCcceEeecccccCC Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLR------DAN-GNPIFRD---ESFNGFGTYFNANGAWPV 226 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~------d~~-g~~i~~~---~~~~g~p~~~~~~~~~~~ 226 (302) ..+++.+.+ +...+........+++|||..+..|++.. +++ |..+... +.+.|+++.+.++. T Consensus 139 ~~~~d~i~~----A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~---- 210 (274) T protein:vir:96 139 ITKLTGLQT----AIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKL---- 210 (274) T ss_pred ccCHHHHHH----HHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCC---- Confidence 233444444 44444444456778999999999998741 222 2222222 24678887766543 Q ss_pred CcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccC Q lcl|NC_011054. 227 GVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVV 298 (302) Q Consensus 227 ~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~ 298 (302) +....++.....+.++..+++.++..++. .+....+++.++|++++.+|++++++++..++..- T Consensus 211 ~~~t~~l~~~gA~~~~~~~~~~vE~~Rd~--------~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 211 EAGTAILAKKGAVKLITKRDFFLETDRDP--------STKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred CCceEEEEeccceeeeecCCccccccccc--------ccccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 23333444445555666777777766553 34556788999999999999999999977666644 No 120 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.65 E-value=2e-17 Score=112.10 Aligned_cols=259 Identities=12% Similarity=0.058 Sum_probs=178.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVN----MGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||.++.++ .++|+.+.+-+.+.+.+...+.+++..-+ .++..+++|.+.-.+++.-+.|++.++. .+.+ T Consensus 1 Ma~T~~~d--~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~-----~~lt 73 (270) T protein:vir:95 1 MTQTKKAN--LINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDT-----TQMS 73 (270) T ss_pred CCceehhh--hcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccch-----hhcc Confidence 99877654 46899999999999988888888876543 2466899999987777777888877653 4567 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeeccc Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPG 156 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~ 156 (302) +++-....++.+..+.++++....+..+....+.++++..+++++|+.++.--. |. ....+. T Consensus 74 ~~~~~a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~---~a---------------~~~~~~ 135 (270) T protein:vir:95 74 MTTTKVTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELN---KS---------------KQTATV 135 (270) T ss_pred cchheeeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhc---cc---------------cccccc Confidence 778888889999999999998877777889999999999999999999883100 00 000112 Q ss_pred cchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcC----CCce-eeec--ccccCcceEeecccccCCCcc Q lcl|NC_011054. 157 DANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDA----NGNP-IFRD--ESFNGFGTYFNANGAWPVGVA 229 (302) Q Consensus 157 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~----~g~~-i~~~--~~~~g~p~~~~~~~~~~~~~~ 229 (302) ..+.+.+.+ +...+......+.+++|||..+..|++.... .+.- +.+. +...|.++.+.... ..+. T Consensus 136 ~~t~~~~~d----A~~~lgd~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig~~~G~~Viv~s~~---~~~~ 208 (270) T protein:vir:95 136 SADATGILD----AIEVFNSENDEDYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLVEIVGVSDIVKSKR---VSEN 208 (270) T ss_pred ccCHHHHHH----HHHHhccccCCCcEEEEcHHHHHHHHhhhcccccccccchhcccccceecceeEEEeCCC---CCce Confidence 234444444 4444555566678899999999999864311 1111 1111 23456666554432 2334 Q ss_pred eEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCC Q lcl|NC_011054. 230 EALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVP 299 (302) Q Consensus 230 ~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p 299 (302) ..++.....+-++..+++.++..++.. +....+.+..+|++++.+++.+++++..+++.+.- T Consensus 209 ~~~l~~~gAi~~~~~~~~~vEtdRd~~--------~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~~~~ 270 (270) T protein:vir:95 209 TAFLQRYGAMEIVNKKKPEAYTDFDIL--------KRTHLLSTNYHYSVNLKDETGVVKVTFKPSGSLEM 270 (270) T ss_pred eEEEEeccceeeeecCCceeeeccchh--------hcccEEEeeeEEEEEEEccceEEEEEecCCCCcCC Confidence 445555555667777788887766543 34457888899999999999999999864444333 No 121 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.42 E-value=6.1e-14 Score=93.02 Aligned_cols=283 Identities=12% Similarity=0.105 Sum_probs=166.0 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceee-eccccccccccccccccceee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASW-VSESATEPEGVKPTSEATWAD 79 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~-v~E~~~~~~~~~~~s~~~f~~ 79 (302) +..... ++.+++++...++++.+++.+++++.++++++.+....+++...+..-.. -.|+...++. .+.+... T Consensus 23 it~~~l--~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~kig~G~r~~r~~~e~~~~~~~----~~~~~~~ 96 (360) T protein:vir:99 23 IGLAEL--DGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQFGVPRLSGHTRDEEGSRTEN----SEAESGS 96 (360) T ss_pred cccccc--CceeecHHHHHHHHHHHhhccchhhhcceeecccccccccccccceeeccccccCCCCCcC----CcCcccc Confidence 222222 46788999999999999999999999999999988888776554332111 1122221111 2233344 Q ss_pred EEe-eeeeEEEeehhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHhhcccCCCc-----cc------cccccccc Q lcl|NC_011054. 80 RTL-VAEEVAVIIPVHENVVDDA----STSLLEEIAALGGQAIGKKLDQAVIFGTDKPS-----SW------VSPALLPA 143 (302) Q Consensus 80 i~l-~~~ki~~~~~iS~ell~ds----~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~-----g~------~~~~~~~~ 143 (302) +.+ ..+++.....+..+.+++. ...+++.|.+.+++++++-++.-.++|+.... |. .+.+.... T Consensus 97 v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~~~~~~d~fl~~~dGwlKk 176 (360) T protein:vir:99 97 VKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNLQSIGGAAELDNTFKGWIAR 176 (360) T ss_pred CccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhcccccCcccchhhhhhHHHHHH Confidence 444 2356666777777777664 33678999999999999999999999875421 00 11111111 Q ss_pred cc-------ccc-cc---------------eeec--cccc-hHHHHHHHhhhhhhhhhhcccC----ccEEEecHHHHHH Q lcl|NC_011054. 144 AV-------AAN-QD---------------YTIV--PGDA-NEDDLIGCINRASKAVAAAGYM----PDTLLASLGFRFD 193 (302) Q Consensus 144 ~~-------~~~-~~---------------~~~~--~~~~-~~~~~~~~i~~~~~~~~~~~~~----~~~~v~~~~~~~~ 193 (302) +. .++ .+ +..+ .+.. .......++.++...+...+++ +-.|+||+..+.. T Consensus 177 a~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~ 256 (360) T protein:vir:99 177 AEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLDSRYRESDAYSPVLMTSPNQVQS 256 (360) T ss_pred hhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcchhhhcCcccceEEEccCchHHH Confidence 10 000 00 0000 0000 0111233445566666665543 3379999887655 Q ss_pred HHh-hhcCC---Cce-eeecc--cccCcceEeecccccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCC Q lcl|NC_011054. 194 VAN-LRDAN---GNP-IFRDE--SFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERD 266 (302) Q Consensus 194 l~~-l~d~~---g~~-i~~~~--~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 266 (302) .+. |.+-+ |-- +.... ...|+|+..+.. ..++.++|.+++.+++|...+++++...+... +.++. T Consensus 257 yr~~L~~R~t~LGd~~l~g~~~~~~~Gipi~~v~~----~pd~~~mlT~p~NLi~g~~~~iri~~~~e~~~----~~~~~ 328 (360) T protein:vir:99 257 YTMSLTEREDPLGSAVIFGDSDITPFSYDLVGVNG----FPDEYMMFTDPNNLAFGLYEEMELDQSTDTDK----VHEQR 328 (360) T ss_pred HHHHHhccCcccchhheecccccccceeeeEEcCC----CCCCceEEeccCceeEEeeeeeEEeecccchh----hhhhc Confidence 544 43222 322 22222 234666544433 34567999999999999999999986655321 12222 Q ss_pred c-EEEEEEEEeccEEeccccEEEEeeecccccCCCC Q lcl|NC_011054. 267 M-IALRLKARFAYVLGNGATAVGDNKTPVGAVVPDG 301 (302) Q Consensus 267 ~-~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~ 301 (302) . +.+-.+..+|+.+.+++|+|.+++.+ +|++ T Consensus 329 ~~~~~~~~~~~D~~iee~~Av~~vt~~~----~~~~ 360 (360) T protein:vir:99 329 LHSRNWLEGQFDFQIKEQQAGVLVTDLE----TPTA 360 (360) T ss_pred eeeeEEEEEEeeEEEEecccEEEEecCC----CCCC Confidence 1 22334567999999999999999653 4555 No 122 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.39 E-value=8.7e-14 Score=92.19 Aligned_cols=253 Identities=13% Similarity=0.061 Sum_probs=152.0 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcce----eecCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPT----VNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~----~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||... ++|+.|+..+++.+++.+.+.+++.. ....++++++|+......+....++..+... +.+ T Consensus 1 MA~~~------~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~-----~~~ 69 (273) T protein:vir:79 1 MAFNN------FIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSAD-----AIS 69 (273) T ss_pred Ccchh------hhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCcc-----ccc Confidence 99743 68999999999999999988887633 2233568999997766666667677654432 344 Q ss_pred eeeEEeeeeeE-EEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc---ccCCCccccccccccccccccccee Q lcl|NC_011054. 77 WADRTLVAEEV-AVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF---GTDKPSSWVSPALLPAAVAANQDYT 152 (302) Q Consensus 77 f~~i~l~~~ki-~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~---G~g~~~g~~~~~~~~~~~~~~~~~~ 152 (302) ...++++..+. ...+.|++.-...+..++.+ +.+++.+++++++|+.++. +.+.. . T Consensus 70 ~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~vD~~i~~~~~~a~~~-------------------~ 129 (273) T protein:vir:79 70 DTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA-------------------L 129 (273) T ss_pred cceEEEEEeeecccceeeccHHHHhhcccHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc-------------------c Confidence 55566666543 33456666444445667877 4567889999999987763 11100 0 Q ss_pred eccccchHHHHHHHhhhhhhhhhhccc--CccEEEecHHHHHHHHhhhc------CCCc--eeeec--ccccCcceEeec Q lcl|NC_011054. 153 IVPGDANEDDLIGCINRASKAVAAAGY--MPDTLLASLGFRFDVANLRD------ANGN--PIFRD--ESFNGFGTYFNA 220 (302) Q Consensus 153 ~~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~l~d------~~g~--~i~~~--~~~~g~p~~~~~ 220 (302) ......+....++.+.++...+..... ...+++++|..+..|.+..+ ..|. .+.+. ..+.|++++... T Consensus 130 ~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~ 209 (273) T protein:vir:79 130 TGSAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN 209 (273) T ss_pred ccccccchhhHHHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecc Confidence 001112223445566666666655443 34578899999998865422 1121 12221 236677776665 Q ss_pred ccccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 221 NGAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 221 ~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) ...... ....+.+..+.+.+..+ ...++..+.. .+-...+++.+++|++++||++++.++++.+ T Consensus 210 ~lp~~~-~~~~~a~~~~A~~~a~~-~~~~e~~r~~--------~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 210 NLRDTD-DEQFVAFHPSAAAYVSQ-IDTVEALRDQ--------DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred cccccC-ceEEEEEeccceeeeee-hhhhhcccCc--------ccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 543222 23344555444433322 1223322221 1124468889999999999999999988755 No 123 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.36 E-value=2.2e-13 Score=89.96 Aligned_cols=253 Identities=13% Similarity=0.067 Sum_probs=150.5 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhccee----ecCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTV----NMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||.. .++|+.|+..+++.+++.+++..++..- ...++++++|+......+....++..+... +.+ T Consensus 1 MA~~------~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~-----~~~ 69 (273) T protein:vir:10 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSAD-----AIS 69 (273) T ss_pred Ccch------hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCcc-----ccc Confidence 9973 4689999999999999999888876431 223567999987665556566666544322 233 Q ss_pred eeeEEeeeeeE-EEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc---ccCCCccccccccccccccccccee Q lcl|NC_011054. 77 WADRTLVAEEV-AVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF---GTDKPSSWVSPALLPAAVAANQDYT 152 (302) Q Consensus 77 f~~i~l~~~ki-~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~---G~g~~~g~~~~~~~~~~~~~~~~~~ 152 (302) ..+++++..+. ...+.|++.-..++..++++ +.+++.++++.++|..++. +.+.. . T Consensus 70 ~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~-------------------~ 129 (273) T protein:vir:10 70 DTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA-------------------L 129 (273) T ss_pred cceEEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc-------------------c Confidence 34455554332 33345666333344567877 5567889999999988873 11100 0 Q ss_pred eccccchHHHHHHHhhhhhhhhhhccc--CccEEEecHHHHHHHHhhh----c--CCC-ceeee-c--ccccCcceEeec Q lcl|NC_011054. 153 IVPGDANEDDLIGCINRASKAVAAAGY--MPDTLLASLGFRFDVANLR----D--ANG-NPIFR-D--ESFNGFGTYFNA 220 (302) Q Consensus 153 ~~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~l~----d--~~g-~~i~~-~--~~~~g~p~~~~~ 220 (302) ..+...+...+++.+.++...+..... ...+++++|..+..|.+.. + ..| .-.+. . ..+.|++++... T Consensus 130 ~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~ 209 (273) T protein:vir:10 130 TGSAPTDADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN 209 (273) T ss_pred ccccccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEec Confidence 011122334456667777766665543 3467899999999996532 2 111 11222 2 246677777665 Q ss_pred ccccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 221 NGAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 221 ~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) +.... .....+.+..+.+.+..+- ..++..+.. .+-...+++.+.+|+++.||++++.++++.+ T Consensus 210 ~lp~~-~~~~~~~~~~~A~~~a~q~-~~~e~~r~~--------~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 210 NLRDT-DDEQFVAFHPSAAAYVSQI-DTVEALRDQ--------DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ccccC-CccEEEEEeccceeeeeee-ehhhcccCC--------CcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 54322 2334556655554433321 122222221 1113458889999999999999999988765 No 124 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.36 E-value=2.2e-13 Score=89.96 Aligned_cols=253 Identities=13% Similarity=0.067 Sum_probs=150.5 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhccee----ecCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTV----NMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||.. .++|+.|+..+++.+++.+++..++..- ...++++++|+......+....++..+... +.+ T Consensus 1 MA~~------~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~-----~~~ 69 (273) T protein:vir:10 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSAD-----AIS 69 (273) T ss_pred Ccch------hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCcc-----ccc Confidence 9973 4689999999999999999888876431 223567999987665556566666544322 233 Q ss_pred eeeEEeeeeeE-EEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc---ccCCCccccccccccccccccccee Q lcl|NC_011054. 77 WADRTLVAEEV-AVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF---GTDKPSSWVSPALLPAAVAANQDYT 152 (302) Q Consensus 77 f~~i~l~~~ki-~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~---G~g~~~g~~~~~~~~~~~~~~~~~~ 152 (302) ..+++++..+. ...+.|++.-..++..++++ +.+++.++++.++|..++. +.+.. . T Consensus 70 ~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~-------------------~ 129 (273) T protein:vir:10 70 DTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA-------------------L 129 (273) T ss_pred cceEEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc-------------------c Confidence 34455554332 33345666333344567877 5567889999999988873 11100 0 Q ss_pred eccccchHHHHHHHhhhhhhhhhhccc--CccEEEecHHHHHHHHhhh----c--CCC-ceeee-c--ccccCcceEeec Q lcl|NC_011054. 153 IVPGDANEDDLIGCINRASKAVAAAGY--MPDTLLASLGFRFDVANLR----D--ANG-NPIFR-D--ESFNGFGTYFNA 220 (302) Q Consensus 153 ~~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~l~----d--~~g-~~i~~-~--~~~~g~p~~~~~ 220 (302) ..+...+...+++.+.++...+..... ...+++++|..+..|.+.. + ..| .-.+. . ..+.|++++... T Consensus 130 ~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~ 209 (273) T protein:vir:10 130 TGSAPTDADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN 209 (273) T ss_pred ccccccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEec Confidence 011122334456667777766665543 3467899999999996532 2 111 11222 2 246677777665 Q ss_pred ccccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 221 NGAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 221 ~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) +.... .....+.+..+.+.+..+- ..++..+.. .+-...+++.+.+|+++.||++++.++++.+ T Consensus 210 ~lp~~-~~~~~~~~~~~A~~~a~q~-~~~e~~r~~--------~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 210 NLRDT-DDEQFVAFHPSAAAYVSQI-DTVEALRDQ--------DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ccccC-CccEEEEEeccceeeeeee-ehhhcccCC--------CcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 54322 2334556655554433321 122222221 1113458889999999999999999988765 No 125 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.36 E-value=4.9e-14 Score=93.53 Aligned_cols=222 Identities=13% Similarity=0.076 Sum_probs=148.9 Q ss_pred cceeecCCCceEEEEEeCCcceeeeccccccccccccccccceeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHH Q lcl|NC_011054. 35 FPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGG 114 (302) Q Consensus 35 ~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~ 114 (302) -+-++ .+..+++|.+ ...|.-+.||.+++ ..+.++++-+.+.++++..+.|++|....+..+......++++ T Consensus 1 ~~~~~-~Gdtit~P~~--iGda~~v~eG~~i~-----~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~ 72 (231) T protein:vir:73 1 ENGIN-LANLCEYPND--IGDAADVAEGGEIS-----LDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLG 72 (231) T ss_pred Ccccc-CCceEEeccc--ccchhhhcCCCcCC-----hhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHH Confidence 12122 3556888866 34556677777654 3457788889999999999999999888777889999999999 Q ss_pred HHHHHHHHHHhhcccCCCcccccccccccccccccceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHH Q lcl|NC_011054. 115 QAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDV 194 (302) Q Consensus 115 ~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l 194 (302) .+|++++|+.++.--.+ +. ...+...+.+ .+.++...+......+.+++|||..+..| T Consensus 73 ~~iA~kvD~di~~~~~~-----------a~-------l~~~~~~t~d----~i~~A~~~fgde~~~~~vivv~p~~~~~L 130 (231) T protein:vir:73 73 LSLANKVDDDLLKAAKT-----------TS-------QTVSTKANVD----GVQAALDIFNDEDAQAYVLIVNPKDAAKI 130 (231) T ss_pred HHHHHhhhHHHHHhhcc-----------cc-------ccccccccHH----HHHHHHHHhccccccceEEEEcchHHHhh Confidence 99999999999841110 00 0112223444 44445555555566777899999999999 Q ss_pred HhhhcCC------Cceeeec---ccccCcceEeecccccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcC Q lcl|NC_011054. 195 ANLRDAN------GNPIFRD---ESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAER 265 (302) Q Consensus 195 ~~l~d~~------g~~i~~~---~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 265 (302) ++..+.+ |..+... +.+.|+++.+.........-..-+++-...+.+...+++.++..++. .+ T Consensus 131 rk~~~~~~~~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~--------~~ 202 (231) T protein:vir:73 131 RKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI--------VT 202 (231) T ss_pred hhccchhhhhhhhccceeeecccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeeccccc--------cc Confidence 8855332 2223222 24567777666554332221112333334455677778888776654 34 Q ss_pred CcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 266 DMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 266 ~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) ....+++.++|+.++.+|+.+++++.+.+ T Consensus 203 k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 203 KTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred cccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 55678899999999999999999998877 No 126 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.27 E-value=5.3e-13 Score=87.88 Aligned_cols=281 Identities=12% Similarity=0.065 Sum_probs=153.9 Q ss_pred CCCcc----CCCccee-c------chHHHHHHHHHHHhhhhhhhhcceeec-CCCceEEEEEeC---Ccceeeecccccc Q lcl|NC_011054. 1 MADIS----RSEVATL-I------QEAYANDLLASAKKGSTVLQAFPTVNM-GTKTTHLPVLAT---LPGASWVSESATE 65 (302) Q Consensus 1 Ma~~t----~~~~g~l-i------P~~~~~~ii~~~~~~s~l~~~~~~~~~-~~~~~~~p~~~~---~~~a~~v~E~~~~ 65 (302) |..-+ ..+++.+ + |+.+-+.|.+.+.+.-..-.+++.... .++.+.+-.... ..++.-|.|++++ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEi 80 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADVAEFGEI 80 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhccCcccc Confidence 43221 1222222 1 555556777777666555556666543 355555544332 2456667888887 Q ss_pred ccccccccccceeeEEe-eeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccc Q lcl|NC_011054. 66 PEGVKPTSEATWADRTL-VAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAA 144 (302) Q Consensus 66 ~~~~~~~s~~~f~~i~l-~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~ 144 (302) |. +.+.++.-.+ ..+|.+..++||+|+++.+..+..+.....++.++++..|+.++.---++ +..... .. T Consensus 81 P~-----~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa-~t~~~~---~s 151 (318) T protein:vir:10 81 PV-----SAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSP-IVPTLA---VP 151 (318) T ss_pred cc-----cCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cccccc---CC Confidence 65 4455555555 45799999999999999999999999999999999999999887421000 000000 00 Q ss_pred cccccceeeccccch-HHHHHHHhhhhh-----hhhhhcccCccEEEecHHHHHHHHhhhc------CCCceeeeccccc Q lcl|NC_011054. 145 VAANQDYTIVPGDAN-EDDLIGCINRAS-----KAVAAAGYMPDTLLASLGFRFDVANLRD------ANGNPIFRDESFN 212 (302) Q Consensus 145 ~~~~~~~~~~~~~~~-~~~~~~~i~~~~-----~~~~~~~~~~~~~v~~~~~~~~l~~l~d------~~g~~i~~~~~~~ 212 (302) ............... .+.+.....++. ..-...++.++.++|||..|..|++-++ .++.+++...... T Consensus 152 ~~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~t 231 (318) T protein:vir:10 152 TAWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWT 231 (318) T ss_pred cCCCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhccccc Confidence 000000000000000 111111111111 1112446788999999999999955443 3444443222111 Q ss_pred C------cceEeecccccCCCcceEEEEecceE-EEEeecCcEEEEee-cccccchhhhcC-CcEEEEEEEEeccEEecc Q lcl|NC_011054. 213 G------FGTYFNANGAWPVGVAEALVVDSSRV-RIGVRQDITVKFLD-QATVGSINLAER-DMIALRLKARFAYVLGNG 283 (302) Q Consensus 213 g------~p~~~~~~~~~~~~~~~~~~gd~~~~-~~~~~~~~~i~~~~-~~~~~~~~~~~~-~~~~~r~~~r~d~~v~~~ 283 (302) | ++..++.++..+ .+.+++.+-..+ .+++.++++.+... +.... +.+. .....|+..+....|.+| T Consensus 232 g~~~g~~lGl~vi~s~~~p--~~~alvlq~g~vG~~~d~~pl~~t~~~~egg~~---~g~~~~s~~~~~~~~~~~~V~~P 306 (318) T protein:vir:10 232 GNFPGSVMGLNVIRSRTFP--IDRVLIMERGTVGFYSDTRPLQFTALYPEGNGP---NGGPTESYRADASHKRALAVDQP 306 (318) T ss_pred ccccceeeceEEeecCccC--CCeeEEEecCCcceeeccccceeeecccCCCCC---CCCcchhhheehheeeeeeeeCc Confidence 1 122333333333 344666665432 35566667665433 21111 1222 334568888889999999 Q ss_pred ccEEEEeeecccccCC Q lcl|NC_011054. 284 ATAVGDNKTPVGAVVP 299 (302) Q Consensus 284 ~a~~~lt~~~a~~~~p 299 (302) +|+++||+. ++| T Consensus 307 kA~~~itgi----~~~ 318 (318) T protein:vir:10 307 KAALWLTGI----VTP 318 (318) T ss_pred ceeEEEeec----cCC Confidence 999999954 455 No 127 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.14 E-value=2.9e-12 Score=83.87 Aligned_cols=260 Identities=15% Similarity=0.158 Sum_probs=162.5 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCccee-eecccccccc-cccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGAS-WVSESATEPE-GVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~-~v~E~~~~~~-~~~~~s~~~f~ 78 (302) ....++.+...+||+++..+.|+.+.+..++..++...|..+.++.||+.+...+.. ...++....| +..+..+.+|+ T Consensus 131 ~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~ 210 (410) T protein:vir:83 131 ADHQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVID 210 (410) T ss_pred hccCcccccccccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeecccccccccccccccccccccccccceeee Confidence 445556666678899999999999999999999998899999899998876655421 1112222222 22334567777 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHh---hcccCCCcccccccccccccccccceeecc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAV---IFGTDKPSSWVSPALLPAAVAANQDYTIVP 155 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~---l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~ 155 (302) .-+-..+.++++..+||+.++-|.+...+...+.|..+++++-+.++ |.++-+ . .... T Consensus 211 t~tA~ikTyGGyt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t----------~---------~~a~ 271 (410) T protein:vir:83 211 RLTVNAKTLGGYVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTST----------G---------AVGY 271 (410) T ss_pred eccceeehhcCcccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhh----------h---------hhhh Confidence 77778899999999999999999999999999999999988888654 322110 0 0122 Q ss_pred ccchHHHHHHHhhhhhhhhhhc--ccCccEEEecHHHHHHHHhh--------hcCCCc---eeee--cccccCcceEeec Q lcl|NC_011054. 156 GDANEDDLIGCINRASKAVAAA--GYMPDTLLASLGFRFDVANL--------RDANGN---PIFR--DESFNGFGTYFNA 220 (302) Q Consensus 156 ~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~v~~~~~~~~l~~l--------~d~~g~---~i~~--~~~~~g~p~~~~~ 220 (302) ...+.+.+...+.+....+... +..-..+..+|+.+..+..+ +|+.|- ++-. .+.+.+.|+.... T Consensus 272 ~~~Tad~~~~~i~da~~~v~da~~~~~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~ 351 (410) T protein:vir:83 272 GNATADNVASAIWQAAGAVYTAVKGMGRLVIAIAPDVLGDFGPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSA 351 (410) T ss_pred hhccHHHHHHHHHHHHHHHhhhhccceeeeEEechhhhhhccceeeccCCCCcccccccccccccchhhhhcccceEEec Confidence 2335566666666665555544 44445578899997666543 233330 0000 0112334443332 Q ss_pred ccccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeee Q lcl|NC_011054. 221 NGAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKT 292 (302) Q Consensus 221 ~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~ 292 (302) ....+.+.|-|...+-+-..++-.+...++.....-. .|- -||.+.+..+++++=+.+. T Consensus 352 ----~a~AgTA~f~~~~Ai~~~eS~~gp~qL~d~~i~nLt~-------~yS--gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 352 ----ALGSGDAYLFSTAAIECFEQRVGTLQVVEPSVFGLQV-------AYA--GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred ----CCCcCeeeEeccceeeeeecCCceeEeeCCchhhhhh-------hhe--eeeeeccccccceeeeccC Confidence 3455667777777665555543345555443222111 111 5667888888888755544 No 128 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.08 E-value=1.4e-11 Score=80.09 Aligned_cols=283 Identities=13% Similarity=0.016 Sum_probs=147.3 Q ss_pred CCCccCCCc--------ceecchHHHHHHHHHHHhhhhhhhhcceee---cCCCceEEEEEeCCcceeeecccccccccc Q lcl|NC_011054. 1 MADISRSEV--------ATLIQEAYANDLLASAKKGSTVLQAFPTVN---MGTKTTHLPVLATLPGASWVSESATEPEGV 69 (302) Q Consensus 1 Ma~~t~~~~--------g~liP~~~~~~ii~~~~~~s~l~~~~~~~~---~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~ 69 (302) ||.+++-.| ..+||+.|+.++++.+++.+.+.++++..+ ..++++++|+.. .+.+.-+.++..++-+. T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g-~~~~~d~~~~~~i~~~~ 79 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS-ELGVEDKATDVPVGVQP 79 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC-cceeeeecCCCcccccc Confidence 877766544 457999999999999999988888876543 235689999764 55666667776655433 Q ss_pred ccccccceeeEEeeeee-EEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccc Q lcl|NC_011054. 70 KPTSEATWADRTLVAEE-VAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAAN 148 (302) Q Consensus 70 ~~~s~~~f~~i~l~~~k-i~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~ 148 (302) . +-.++++...+ ....+.|+++-..++..++.+.+.++..+++++++|+.++.--...++..... ... T Consensus 80 ~-----~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~------~~~ 148 (341) T protein:vir:94 80 V-----NDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQN------VFS 148 (341) T ss_pred c-----cCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCc------ccc Confidence 2 22344445422 33446667755555678999999999999999999998874211000000000 000 Q ss_pred cceeeccccchHHHHHHHhhhhhhhhhhccc--CccEEEecHHHHHHHHhhh-----cCCCceeeec---ccccCcceEe Q lcl|NC_011054. 149 QDYTIVPGDANEDDLIGCINRASKAVAAAGY--MPDTLLASLGFRFDVANLR-----DANGNPIFRD---ESFNGFGTYF 218 (302) Q Consensus 149 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~l~-----d~~g~~i~~~---~~~~g~p~~~ 218 (302) ......+ .......++.+.++...+..... ...+++++|..+..|.+.. |..|.-.+.. ..+.|++++. T Consensus 149 ~~~~~~t-~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V~~ 227 (341) T protein:vir:94 149 SSNGAIT-GNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIR 227 (341) T ss_pred Ccccccc-CchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEEEE Confidence 0000011 11112223445555555555433 3456889999999997532 1222222222 1466777766 Q ss_pred ecccccCCCcc--------------e-----EEE----Eecce--EEEEeecCc-EEEEeec----------ccccchhh Q lcl|NC_011054. 219 NANGAWPVGVA--------------E-----ALV----VDSSR--VRIGVRQDI-TVKFLDQ----------ATVGSINL 262 (302) Q Consensus 219 ~~~~~~~~~~~--------------~-----~~~----gd~~~--~~~~~~~~~-~i~~~~~----------~~~~~~~~ 262 (302) ..+........ . ..+ ++++. .+++.++.+ .++..+- ...+.... T Consensus 228 Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~ 307 (341) T protein:vir:94 228 TSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFE 307 (341) T ss_pred eccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccch Confidence 55432211100 0 000 11111 111111111 0110000 00000000 Q ss_pred hcCCcEEEEEEEEeccEEeccccEEEEeeeccccc Q lcl|NC_011054. 263 AERDMIALRLKARFAYVLGNGATAVGDNKTPVGAV 297 (302) Q Consensus 263 ~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~ 297 (302) -.+-.-.+++.+-||.+++||++.+.+. +.+..| T Consensus 308 ~~~~~~~i~~~~~~G~~~lrp~~~v~~~-~~~~~~ 341 (341) T protein:vir:94 308 NREQVWLMVGRQAYGARLYRPLHAVNIH-TTGDTV 341 (341) T ss_pred hhhhhhhhhhhhhhcccccCcceeEEEe-cCcCCC Confidence 1122233566777999999999976444 333333 No 129 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.08 E-value=5.3e-12 Score=82.40 Aligned_cols=268 Identities=15% Similarity=0.069 Sum_probs=161.7 Q ss_pred CCCcc--CCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceee-eccccccccccccccccce Q lcl|NC_011054. 1 MADIS--RSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASW-VSESATEPEGVKPTSEATW 77 (302) Q Consensus 1 Ma~~t--~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~-v~E~~~~~~~~~~~s~~~f 77 (302) ++.-. -.+....+|.-+...|-..++.+.++++++++.+.++-....+-. ...-+| +--|.+ |..+..+| T Consensus 117 l~E~gvt~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p~l~V~~~~d--t~~qa~gHk~G~~-----K~eq~~tl 189 (400) T protein:vir:93 117 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFD--SANEAQVHKDGQT-----KTEQAATL 189 (400) T ss_pred hhhcccccCCchhhcchHHHHHHHHhhhccCCcccceeeecCCceeeecchh--hhcccceeccCCc-----ccceeeee Confidence 22222 144455789999999999999999999999888875432222222 222233 344443 34456688 Q ss_pred eeEEeeeeeEEEeehhHHHHHh--cchHHHHHHHHHHHHHHHHH-HHHHHhhcccCCC--ccccc-ccccccccccccce Q lcl|NC_011054. 78 ADRTLVAEEVAVIIPVHENVVD--DASTSLLEEIAALGGQAIGK-KLDQAVIFGTDKP--SSWVS-PALLPAAVAANQDY 151 (302) Q Consensus 78 ~~i~l~~~ki~~~~~iS~ell~--ds~~~~~~~i~~~l~~ai~~-~~d~~~l~G~g~~--~g~~~-~~~~~~~~~~~~~~ 151 (302) ..-++.|.-+..+..+.+-..+ .+...+..||.++|...+.. ..+.+++-|+|+. .+.-. ..+.+.+.. ... T Consensus 190 ~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~d--t~k 267 (400) T protein:vir:93 190 TIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKI--TTK 267 (400) T ss_pred eeeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHHHHHhhhheeecccccccCCCcchhhhhhhhhh--hhh Confidence 8888888766666666443333 23567899999999999996 5799999998863 12211 111121111 122 Q ss_pred eeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeeccc-------ccCcceEeeccccc Q lcl|NC_011054. 152 TIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDES-------FNGFGTYFNANGAW 224 (302) Q Consensus 152 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~-------~~g~p~~~~~~~~~ 224 (302) +..++......+.+. +....++.......++++|..|+.|+.|+|++|.+.|+-.. -.|..-.+.....+ T Consensus 268 t~~a~~~~~qdl~E~---~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~~~a~f~~~n~d~~IA~~fGv~~Lv~~Tr~~ 344 (400) T protein:vir:93 268 AKSAGKTPFADAIEE---AVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSK 344 (400) T ss_pred hhhcCCccHHHHHHH---HHhhhhhccCCceeEEeccchHHHHHHhcCCcceeeeeeccccchhhhhcccceeeeeccCC Confidence 233444445544443 33344444555667999999999999999999999885322 22444333333332 Q ss_pred CCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeee Q lcl|NC_011054. 225 PVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKT 292 (302) Q Consensus 225 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~ 292 (302) .. +.. ++.|-..++ +. .+ ++..+. ..+.+|+-.|.++...++.+.-+.+-+.++.. T Consensus 345 ~~-kp~-V~VDek~~i-~~-~~--~~t~~s------f~~~tNs~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 345 AL-KPT-VLVDQKYHI-DM-QD--LTKVDA------FEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred CC-Cce-eeeehhhhc-cc-cC--ceeccc------eeeeeccceEEeeeeeccceecccceeeEeeC Confidence 22 222 233533332 22 22 222221 13567777888899999999988777777755 No 130 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.08 E-value=1.8e-11 Score=79.45 Aligned_cols=283 Identities=12% Similarity=0.047 Sum_probs=153.0 Q ss_pred CCCccCCCc-----------c---eecchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeecccccc Q lcl|NC_011054. 1 MADISRSEV-----------A---TLIQEAYANDLLASAKKGSTVLQAFPTVNMG-TKTTHLPVLATLPGASWVSESATE 65 (302) Q Consensus 1 Ma~~t~~~~-----------g---~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~ 65 (302) ||+.+.... + .+.=+.+..++++..+..+.++++.+..+.. +++.++|+. +..++..+..+... T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~i-G~~~~~~~~~g~~l 79 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeee-cceeeeeeccccCC Confidence 776544322 1 2233788899999999999999999887765 557888865 45555666666554 Q ss_pred ccccccccccceeeEEeeeeeE-EEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----ccCCC--cccccc Q lcl|NC_011054. 66 PEGVKPTSEATWADRTLVAEEV-AVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF----GTDKP--SSWVSP 138 (302) Q Consensus 66 ~~~~~~~s~~~f~~i~l~~~ki-~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g~~--~g~~~~ 138 (302) ... ..++...++++...+. .....|.+-=.-++..++.+.+.++.++++++..|+.++. +...+ .+.... T Consensus 80 ~~~---~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~ 156 (347) T protein:vir:88 80 DDK---RKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) T ss_pred CCC---CCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccC Confidence 321 1134455555555443 2222333322223456889999999999999999998873 21111 000000 Q ss_pred cccccc-cccccceeeccccchHHHHHHHhhhhhhhhhhccc--CccEEEecHHHHHHHHhhhcC-CCcee-----eec- Q lcl|NC_011054. 139 ALLPAA-VAANQDYTIVPGDANEDDLIGCINRASKAVAAAGY--MPDTLLASLGFRFDVANLRDA-NGNPI-----FRD- 208 (302) Q Consensus 139 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~l~d~-~g~~i-----~~~- 208 (302) +..... ..................+++.|.++...+..... ...+++++|..|..|.+-... ...+. .+. T Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (347) T protein:vir:88 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGN 236 (347) T ss_pred CccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccchhcce Confidence 000000 01111111122233445566777777777765543 245788999999888653311 11111 111 Q ss_pred -ccccCcceEeecccccCC-Cc--------------------ceEEEEecceE----------EEEeecCcEEEEeeccc Q lcl|NC_011054. 209 -ESFNGFGTYFNANGAWPV-GV--------------------AEALVVDSSRV----------RIGVRQDITVKFLDQAT 256 (302) Q Consensus 209 -~~~~g~p~~~~~~~~~~~-~~--------------------~~~~~gd~~~~----------~~~~~~~~~i~~~~~~~ 256 (302) ..+.|+.++...+.+... +. ..-+.+|++.- ..+...++.++..++.. T Consensus 237 vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~~ 316 (347) T protein:vir:88 237 IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE 316 (347) T ss_pred eeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeechh Confidence 245677766665432210 00 00122333321 11122333444443321 Q ss_pred ccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccc Q lcl|NC_011054. 257 VGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVG 295 (302) Q Consensus 257 ~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~ 295 (302) +-.-.+++.+.+|.++.||++.+.+.-++++ T Consensus 317 --------~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 317 --------FQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred --------hHHHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 1122578888999999999998888877666 No 131 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.06 E-value=1.2e-11 Score=80.51 Aligned_cols=282 Identities=11% Similarity=0.028 Sum_probs=149.7 Q ss_pred CCCccCCCc------c-------eecchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeeccccccc Q lcl|NC_011054. 1 MADISRSEV------A-------TLIQEAYANDLLASAKKGSTVLQAFPTVNMG-TKTTHLPVLATLPGASWVSESATEP 66 (302) Q Consensus 1 Ma~~t~~~~------g-------~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~ 66 (302) ||+.+-+.. | .+.=+.+..+++......+.++++.+..++. +++.++|+. +..++..+..|+... T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~i-G~~tv~~~t~G~~l~ 79 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVM-GRTSGVYLAPGERLS 79 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecc-cceeeeeecCCCCcC Confidence 877654332 1 1222577888888888889999998888765 557888876 566666666666654 Q ss_pred cccccccccceee--EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----cc---CCCccccc Q lcl|NC_011054. 67 EGVKPTSEATWAD--RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF----GT---DKPSSWVS 137 (302) Q Consensus 67 ~~~~~~s~~~f~~--i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~---g~~~g~~~ 137 (302) ....+ .+=.+ +++...++..+ .|-+-=--++..++.+.+.++.++++++.+|+.++. .. +.+.+... T Consensus 80 ~~~~~---~~~~e~~itID~~~~~~~-~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~ 155 (347) T protein:vir:94 80 DKRKG---IKHTEKVITIDGLLTADV-MIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIA 155 (347) T ss_pred CCCCC---CCcceEEEEecchhhhhH-HhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccC Confidence 32111 12233 33333333222 222111112456889999999999999999998863 11 11111111 Q ss_pred ccccccccccccceeeccccchHHHHHHHhhhhhhhhhhcccC--ccEEEecHHHHHHHHhhhcCCC-cee----e-ec- Q lcl|NC_011054. 138 PALLPAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYM--PDTLLASLGFRFDVANLRDANG-NPI----F-RD- 208 (302) Q Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~l~d~~g-~~i----~-~~- 208 (302) ....+............+...+.+.+++.+.++...+...+.+ ..+++++|..|..|..-++.+. .+. . +. T Consensus 156 g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 235 (347) T protein:vir:94 156 GLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALIDPETGN 235 (347) T ss_pred CCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhccccccccccc Confidence 0001111111111122222334566677777777777665432 3578899999998854332111 111 1 11 Q ss_pred -ccccCcceEeecccccC------CCcc--------eE--------EEEecce----------EEEEeecCcEEEEeecc Q lcl|NC_011054. 209 -ESFNGFGTYFNANGAWP------VGVA--------EA--------LVVDSSR----------VRIGVRQDITVKFLDQA 255 (302) Q Consensus 209 -~~~~g~p~~~~~~~~~~------~~~~--------~~--------~~gd~~~----------~~~~~~~~~~i~~~~~~ 255 (302) ..+.|++++..++.+.. .+++ .. +-+||+. +..+...+++++..++. T Consensus 236 Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~ 315 (347) T protein:vir:94 236 IRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDV 315 (347) T ss_pred eEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhch Confidence 24567777666544321 0111 11 1122221 11122223333433222 Q ss_pred cccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccc Q lcl|NC_011054. 256 TVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVG 295 (302) Q Consensus 256 ~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~ 295 (302) . +| .-.+++.+.+|.++.||++.+.++.+.|. T Consensus 316 ~-----~~---~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 316 D-----AQ---GDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred h-----hH---HHHhhhhhhhcCcccccceeEEEEecCCC Confidence 1 11 12578888999999999999888776544 No 132 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.04 E-value=3e-11 Score=78.30 Aligned_cols=280 Identities=13% Similarity=0.068 Sum_probs=152.8 Q ss_pred CCCccCC------Ccce----------ecchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeecccc Q lcl|NC_011054. 1 MADISRS------EVAT----------LIQEAYANDLLASAKKGSTVLQAFPTVNMG-TKTTHLPVLATLPGASWVSESA 63 (302) Q Consensus 1 Ma~~t~~------~~g~----------liP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~ 63 (302) ||+.++. .++. +| +.+..++++.....+.++++++..++. +++.++|+. +..++..+..|+ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~i-e~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~i-G~~~~~~~~~G~ 78 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPGE 78 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHH-HHHHHHHHHHHHHHhhhcccceeeeecccceEEEEee-ceeEEEeeecCC Confidence 8876443 2222 34 788999999999999999999988877 457888876 666677777777 Q ss_pred ccccccccccccceeeEEee--eeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----ccC--CCccc Q lcl|NC_011054. 64 TEPEGVKPTSEATWADRTLV--AEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF----GTD--KPSSW 135 (302) Q Consensus 64 ~~~~~~~~~s~~~f~~i~l~--~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g--~~~g~ 135 (302) +...... ++.-.+++|. ..++..+. |.+-=--++..++.+.+.+++.+++++..|+.++. +.. .|... T Consensus 79 ~l~~t~~---~~~~~e~~l~ID~~~y~~~~-VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~ 154 (344) T protein:vir:10 79 NLDDIRK---DIKHTEKVITIDGLLTADVL-IYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNE 154 (344) T ss_pred CCCCCCC---CcccceEEEEEcchhhhhhh-hhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Confidence 6543211 1233443343 33343322 22211122457899999999999999999998863 111 11111 Q ss_pred ccccccccccc--cccceeeccccchHHHHHHHhhhhhhhhhhcccC--ccEEEecHHHHHHHHhhhcCC-Cce-----e Q lcl|NC_011054. 136 VSPALLPAAVA--ANQDYTIVPGDANEDDLIGCINRASKAVAAAGYM--PDTLLASLGFRFDVANLRDAN-GNP-----I 205 (302) Q Consensus 136 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~l~d~~-g~~-----i 205 (302) ...+....... ............+.+.+++.+.++...+...+.+ ..+++++|..|..|..-+.-+ ..+ + T Consensus 155 ~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~ 234 (344) T protein:vir:10 155 NITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAALIDP 234 (344) T ss_pred ccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccccccce Confidence 11110000000 0001111222334466777777777777665543 346789999999986533211 111 1 Q ss_pred eec--ccccCcceEeecccccC---------CC--------cceEEEEecce----------EEEEeecCcEEEEeeccc Q lcl|NC_011054. 206 FRD--ESFNGFGTYFNANGAWP---------VG--------VAEALVVDSSR----------VRIGVRQDITVKFLDQAT 256 (302) Q Consensus 206 ~~~--~~~~g~p~~~~~~~~~~---------~~--------~~~~~~gd~~~----------~~~~~~~~~~i~~~~~~~ 256 (302) .+. ..+.|++++..++.... ++ .+.....+++. ...+...++.++..++.. T Consensus 235 ~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~ 314 (344) T protein:vir:10 235 EKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 314 (344) T ss_pred eeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccchh Confidence 111 23567777666543211 01 11111122222 222233344555544321 Q ss_pred ccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 257 VGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 257 ~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) +|. -.+++.+-+|.++.||++...+.-++- T Consensus 315 -----~~~---d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 315 -----FQA---DQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred -----HHH---HHHHHHhhcccceecccceEEEEeecC Confidence 121 246778889999999998855554433 No 133 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.04 E-value=6.4e-11 Score=76.46 Aligned_cols=281 Identities=11% Similarity=0.022 Sum_probs=155.0 Q ss_pred CCCccCCCc-------c-------eecchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeecccccc Q lcl|NC_011054. 1 MADISRSEV-------A-------TLIQEAYANDLLASAKKGSTVLQAFPTVNMG-TKTTHLPVLATLPGASWVSESATE 65 (302) Q Consensus 1 Ma~~t~~~~-------g-------~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~ 65 (302) ||+..++.. | .+.=+.+..++.+.....+.++++.+...+. +++.++|+. +..++..+..|++. T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~i-G~~~~~~~~~G~~l 79 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVL-GRTKAAYLQPGENL 79 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeec-cceeEeeeecCcCC Confidence 886655441 1 0233888999999999999999999887765 557888864 56667777777765 Q ss_pred ccccccccccceeeEEeeeee--EEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----ccCC--Cccccc Q lcl|NC_011054. 66 PEGVKPTSEATWADRTLVAEE--VAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF----GTDK--PSSWVS 137 (302) Q Consensus 66 ~~~~~~~s~~~f~~i~l~~~k--i~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g~--~~g~~~ 137 (302) .... .++...+.++...+ +..+ .|.+-=.-++..++.+.+.++.++++++..|+.++. +... +..... T Consensus 80 ~~~~---~~~~~~e~~ltID~~~y~~~-~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~ 155 (347) T protein:vir:94 80 DDKR---KDMKHTEKTINIDGLLTADV-LIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENI 155 (347) T ss_pred CCCc---CCccccceEEEEcchhhhhh-hhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 4311 12344555554443 3332 233221223456899999999999999999998862 2211 100000 Q ss_pred cccccc--ccccccceeeccccchHHHHHHHhhhhhhhhhhcccC--ccEEEecHHHHHHHHhhhcC-CCceee-----e Q lcl|NC_011054. 138 PALLPA--AVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYM--PDTLLASLGFRFDVANLRDA-NGNPIF-----R 207 (302) Q Consensus 138 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~l~d~-~g~~i~-----~ 207 (302) .+.... ...........+...+...+++.+.++...+...+.+ ..+++.+|..|..|.+..+. .+.+-. . T Consensus 156 ~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~~~~ 235 (347) T protein:vir:94 156 AGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALIDPST 235 (347) T ss_pred ccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhccccccccccccccc Confidence 000000 0000001111122334566677787777777665543 34677899999888764332 111111 1 Q ss_pred c--ccccCcceEeecccccCC-Ccc--------------------eEEEEecce----------EEEEeecCcEEEEeec Q lcl|NC_011054. 208 D--ESFNGFGTYFNANGAWPV-GVA--------------------EALVVDSSR----------VRIGVRQDITVKFLDQ 254 (302) Q Consensus 208 ~--~~~~g~p~~~~~~~~~~~-~~~--------------------~~~~gd~~~----------~~~~~~~~~~i~~~~~ 254 (302) . ..+.|++++.+.+..... +.. .-+=+||+. ...+...++.+++.++ T Consensus 236 G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~ 315 (347) T protein:vir:94 236 GSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARR 315 (347) T ss_pred ceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeec Confidence 1 245677777665532211 000 001123322 2222333445555443 Q ss_pred ccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 255 ATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 255 ~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) .. +-...+.+.+-+|.++.||++.+.+.-+.| T Consensus 316 ~~--------~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 316 AN--------FQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hh--------hhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 32 112246777789999999999887776655 No 134 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.03 E-value=3.4e-10 Score=72.48 Aligned_cols=290 Identities=12% Similarity=0.048 Sum_probs=152.9 Q ss_pred CCCccCCCcc-----------------eecchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeeccc Q lcl|NC_011054. 1 MADISRSEVA-----------------TLIQEAYANDLLASAKKGSTVLQAFPTVNMG-TKTTHLPVLATLPGASWVSES 62 (302) Q Consensus 1 Ma~~t~~~~g-----------------~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~ 62 (302) |++.+-+--| .+.=+.+..++.+..+..+.++++.+..++. +++.++|+. +..+++.+.-| T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~i-G~~t~~~~t~G 79 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYT-GRMTSSFHTPG 79 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEee-eeeEEeeecCC Confidence 4433322211 2334778899999999999999999988877 557888876 56666666555 Q ss_pred cccccccccccccceee--EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----ccC--CCcc Q lcl|NC_011054. 63 ATEPEGVKPTSEATWAD--RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF----GTD--KPSS 134 (302) Q Consensus 63 ~~~~~~~~~~s~~~f~~--i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g--~~~g 134 (302) +++... +..+....+ +++...|+..+ .|.+-=--++..++.+.+.++.++++++.+|+.++. +.. .|.+ T Consensus 80 ~~i~~~--~~~d~~~te~~l~ID~~~y~~~-~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~ 156 (375) T protein:vir:10 80 TPILGN--ADKAPPVAEKTIVMDDLLISSA-FVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVS 156 (375) T ss_pred cCcCCc--cccCCCCCceEEEecchhhhhh-hHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 544321 011112222 44554444432 222211123457899999999999999999998873 211 1211 Q ss_pred cccccccccccccccceeeccccchHHHHHHHhhhhhhhhhhcccC--ccEEEecHHHHHHHHhhhcCC--------Cce Q lcl|NC_011054. 135 WVSPALLPAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYM--PDTLLASLGFRFDVANLRDAN--------GNP 204 (302) Q Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~l~d~~--------g~~ 204 (302) ..+.....................+...+++.+.++...+.....+ ..+++++|..|..|.+-+|.+ |.- T Consensus 157 ~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~~ 236 (375) T protein:vir:10 157 ATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGSA 236 (375) T ss_pred cccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeecccccc Confidence 1100000000000011122233345677778888888777665543 457889999999997655432 111 Q ss_pred eeec---ccccCcceEeecccccCCC----------------------------------------------cceEEEEe Q lcl|NC_011054. 205 IFRD---ESFNGFGTYFNANGAWPVG----------------------------------------------VAEALVVD 235 (302) Q Consensus 205 i~~~---~~~~g~p~~~~~~~~~~~~----------------------------------------------~~~~~~gd 235 (302) +... ..+.|++++...+...... +...++.. T Consensus 237 ~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~ 316 (375) T protein:vir:10 237 LQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQ 316 (375) T ss_pred eeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEc Confidence 1111 1244555554443221110 01112222 Q ss_pred cceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 236 SSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 236 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) .+....+...++.+++++.. + .-.+-...+.+.+-+|..+.||++.+.|... ++.|++= T Consensus 317 ~~A~g~v~~~~~~~~~~~~~----~-~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~---~~~~~~~ 375 (375) T protein:vir:10 317 KEAAGVVEAIGPQVQVTNGD----V-SVIYQGDVILGRMAMGADYLNPAAAVELYIG---ATAPSAF 375 (375) T ss_pred hhheeeeeeeccccccccch----h-hheeeeeeeeeeeeeccCccCceeEEEEecC---cCccccC Confidence 22222233344444443210 0 0122233466777889999999998777533 2334444 No 135 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.03 E-value=1.1e-10 Score=75.29 Aligned_cols=281 Identities=11% Similarity=-0.010 Sum_probs=155.7 Q ss_pred CCCccCCC---------cc-eecc-hHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeeccccccccc Q lcl|NC_011054. 1 MADISRSE---------VA-TLIQ-EAYANDLLASAKKGSTVLQAFPTVNMG-TKTTHLPVLATLPGASWVSESATEPEG 68 (302) Q Consensus 1 Ma~~t~~~---------~g-~liP-~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~~~ 68 (302) |++...+. ++ .-+. +.+..++.+.....+.++++.++.++. +++.++|+. +..+++...-+++.... T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~i-G~~~~~~~~~g~~l~~~ 79 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRV-GASTIAGRKAGEELVVQ 79 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeee-cceeeeeecCCCCCCCC Confidence 87762211 22 2344 888999999999999999999988877 557899876 67777777777766543 Q ss_pred cccccccceeeEEeeee--eEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----ccC--CCcccccccc Q lcl|NC_011054. 69 VKPTSEATWADRTLVAE--EVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF----GTD--KPSSWVSPAL 140 (302) Q Consensus 69 ~~~~s~~~f~~i~l~~~--ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g--~~~g~~~~~~ 140 (302) . .+-++.+|... +++. ..|.+----++..++.+.+.+++++++++..|++++. |.. .|.+...... T Consensus 80 ~-----~~~~~~~l~ID~~l~~~-~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~ 153 (334) T protein:vir:80 80 K-----NVSDKLNLTVDTVLYAR-HFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFH 153 (334) T ss_pred C-----cccCceEEEEeeeeehh-hhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Confidence 2 23344444443 3333 2333211123457899999999999999999998763 221 2222111111 Q ss_pred cccccccccceeeccccchHHHHHHHhhhhhhhhhhcccC-----ccEEEecHHHHHHHHhhhc---C------CCceee Q lcl|NC_011054. 141 LPAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYM-----PDTLLASLGFRFDVANLRD---A------NGNPIF 206 (302) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-----~~~~v~~~~~~~~l~~l~d---~------~g~~i~ 206 (302) .+.........+......+.+.+.+.+..+...+...+.. ..+++++|..|..|..-+. . ++..+- T Consensus 154 ~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~ 233 (334) T protein:vir:80 154 DGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFV 233 (334) T ss_pred CCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceecccccccccc Confidence 1111111111222233445666667676777666655444 3578899999999976421 1 112222 Q ss_pred ec--ccccCcceEeecccccCC-------CcceEEEEecceE----------EEEeecCcEEEEeecccccchhhhcCCc Q lcl|NC_011054. 207 RD--ESFNGFGTYFNANGAWPV-------GVAEALVVDSSRV----------RIGVRQDITVKFLDQATVGSINLAERDM 267 (302) Q Consensus 207 ~~--~~~~g~p~~~~~~~~~~~-------~~~~~~~gd~~~~----------~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 267 (302) +. ..+.|++++...+.+... .....+-|||+.. ..+...++..+..++.... .+ T Consensus 234 ~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~--~d----- 306 (334) T protein:vir:80 234 GGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDF--GH----- 306 (334) T ss_pred ceeEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechhhH--HH----- Confidence 21 246677777665543221 1112345555432 2222223333333322110 01 Q ss_pred EEEEEEEEeccEEeccccEEEEeeecccccCC Q lcl|NC_011054. 268 IALRLKARFAYVLGNGATAVGDNKTPVGAVVP 299 (302) Q Consensus 268 ~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p 299 (302) .+.+.+-+|.++.||++++.+.-+. +.| T Consensus 307 -~i~~~~a~G~g~lRPeaa~vv~~~~---~~~ 334 (334) T protein:vir:80 307 -YLDTFQSYNIGQRRPDAVAVHDITV---TNP 334 (334) T ss_pred -HHHHHHHcCCceeccceEEEEEEee---ecC Confidence 2344556899999998866555442 233 No 136 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.01 E-value=2.6e-10 Score=73.12 Aligned_cols=280 Identities=13% Similarity=0.028 Sum_probs=158.3 Q ss_pred CCCcc----------CCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCC-CceEEEEEeCCcceeeecccccccccc Q lcl|NC_011054. 1 MADIS----------RSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGT-KTTHLPVLATLPGASWVSESATEPEGV 69 (302) Q Consensus 1 Ma~~t----------~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~E~~~~~~~~ 69 (302) |++.. .++-..+| +.+..++.+.....+.++++.++.++.+ ++.++|+. +..+++.+.-|++..... T Consensus 1 ms~~~~~tr~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~~~ 78 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELERSR 78 (335) T ss_pred CCCcccchhhhcccccchhheeh-hhhhhhHHHHHHhhhhhccccceeeeccceeEEEeee-eeeeeecccCCcCcCCCC Confidence 66554 22233333 8889999999999999999998888765 47888886 667777777776654432 Q ss_pred ccccccceeeEEeeee--eEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhh----cccC--CCccccc---c Q lcl|NC_011054. 70 KPTSEATWADRTLVAE--EVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVI----FGTD--KPSSWVS---P 138 (302) Q Consensus 70 ~~~s~~~f~~i~l~~~--ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l----~G~g--~~~g~~~---~ 138 (302) +..++..+... +++.. .|-+----++..++.+.+.+++.+++++..|++++ .+.. .+.+... . T Consensus 79 -----~~~~k~~itVD~ll~a~~-~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~ 152 (335) T protein:vir:63 79 -----VVNDKWNLTVDTLLYLRH-QFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSP 152 (335) T ss_pred -----ccccceEEEecceeechh-hhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCC Confidence 23344444443 32222 22221111245789999999999999999999876 2322 1221111 1 Q ss_pred cccccccccccceeeccccchHHHHHHHhhhhhhhhhhcccC-----ccEEEecHHHHHHHHhhhcCCCc-ee------- Q lcl|NC_011054. 139 ALLPAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYM-----PDTLLASLGFRFDVANLRDANGN-PI------- 205 (302) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-----~~~~v~~~~~~~~l~~l~d~~g~-~i------- 205 (302) ++.... ..+..+...+.+.+.+.+..+...+...+.+ ....+++|..|..|..-+.--++ |. T Consensus 153 G~~~~~-----~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~ 227 (335) T protein:vir:63 153 GVLEKL-----DLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATND 227 (335) T ss_pred Ccceee-----eeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcccccccccccccccccc Confidence 111111 1111122234677777777777777665543 35688999999999764321111 11 Q ss_pred -eec--ccccCcceEeecccccCC-------CcceEEEEec----------ceEEEEeecCcEEEEeecccccchhhhcC Q lcl|NC_011054. 206 -FRD--ESFNGFGTYFNANGAWPV-------GVAEALVVDS----------SRVRIGVRQDITVKFLDQATVGSINLAER 265 (302) Q Consensus 206 -~~~--~~~~g~p~~~~~~~~~~~-------~~~~~~~gd~----------~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 265 (302) ... ..+.|.|++-..+.+... .....+-+|+ +....+...++..++.++... | T Consensus 228 ~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~-----~-- 300 (335) T protein:vir:63 228 YVKSRVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEK-----F-- 300 (335) T ss_pred ccCceeEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccch-----h-- Confidence 111 134566665554432211 1111223344 223333333444443333221 1 Q ss_pred CcEEEEEEEEeccEEeccccEEEEeeecccccCCCC Q lcl|NC_011054. 266 DMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDG 301 (302) Q Consensus 266 ~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~ 301 (302) +..+.+.+-+|.++.||++++.++-+..+++.-.+ T Consensus 301 -~~~i~~~~a~G~g~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:63 301 -SWVLDTFQMYNIGARRPDTAGAIELKGIGAFDITA 335 (335) T ss_pred -hHHhHHHHHcCCcccccceEEEEEEcCCCceeecC Confidence 11345556689999999998888877777766666 No 137 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=98.99 E-value=1.1e-10 Score=75.13 Aligned_cols=278 Identities=12% Similarity=0.058 Sum_probs=157.7 Q ss_pred CCCccCCC-------cc--------eecchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeeccccc Q lcl|NC_011054. 1 MADISRSE-------VA--------TLIQEAYANDLLASAKKGSTVLQAFPTVNMG-TKTTHLPVLATLPGASWVSESAT 64 (302) Q Consensus 1 Ma~~t~~~-------~g--------~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~ 64 (302) ||+.+... .| .+.=+.+..++++.....+.++++.+..++. +++.++|+. +..++..+..|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G~~~~~~~~~G~~ 79 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPGEN 79 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-cceEEEeeecCCC Confidence 66655411 11 2334778899999999999999999988877 457888876 6777788887876 Q ss_pred cccccccccccceee--EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----cc-------CC Q lcl|NC_011054. 65 EPEGVKPTSEATWAD--RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF----GT-------DK 131 (302) Q Consensus 65 ~~~~~~~~s~~~f~~--i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~-------g~ 131 (302) ..... .+++..+ |+++..++..+..---+- -++..++.+.+.+++++++++.+|+.++. +. +. T Consensus 80 l~~~~---~~~~~~e~~ltID~~~y~~~~VddiD~-~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~ 155 (345) T protein:vir:22 80 LDDKR---KDIKHTEKVITIDGLLTADVLIYDIED-AMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNEN 155 (345) T ss_pred CCCCC---CCcccceEEEEecchhhhhhhHhhHHH-HhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 54321 1234455 444444444432221111 23457899999999999999999998873 11 11 Q ss_pred CcccccccccccccccccceeeccccchHHHHHHHhhhhhhhhhhcccCc--cEEEecHHHHHHHHhhhcCC-Ccee--- Q lcl|NC_011054. 132 PSSWVSPALLPAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYMP--DTLLASLGFRFDVANLRDAN-GNPI--- 205 (302) Q Consensus 132 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~l~d~~-g~~i--- 205 (302) |.|........ ....+. .......+.+.+++.+.++...+...+.+. .+++++|..|..|..-+.-+ ..+. T Consensus 156 ~~~~~~~~~~~-~~~~g~--~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~ 232 (345) T protein:vir:22 156 IEGLGTATVIE-TTQNKA--ALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALI 232 (345) T ss_pred ccccccccccc-cccccc--cccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhcccccccccccccc Confidence 21111111111 111110 111112234667777777777776655433 57889999999996543221 1111 Q ss_pred --eec--ccccCcceEeecccccC----------------------------CCcceEEEEecceEEEEeecCcEEEEee Q lcl|NC_011054. 206 --FRD--ESFNGFGTYFNANGAWP----------------------------VGVAEALVVDSSRVRIGVRQDITVKFLD 253 (302) Q Consensus 206 --~~~--~~~~g~p~~~~~~~~~~----------------------------~~~~~~~~gd~~~~~~~~~~~~~i~~~~ 253 (302) .+. ..+.|++++...+.... .+....++...+....+...++.++..+ T Consensus 233 ~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r 312 (345) T protein:vir:22 233 DPEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 312 (345) T ss_pred ccccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeee Confidence 011 13456666554332110 0111223334444444555556666655 Q ss_pred cccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 254 QATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 254 ~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a 294 (302) +... | . -.+++.+-+|.++.||++.+.++-+-- T Consensus 313 ~~~~-----~-~--d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 313 RANF-----Q-A--DQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred chhH-----H-H--HHHHHHHhcCCcccccceeEEEEEeeC Confidence 4321 1 1 246777889999999999888875533 No 138 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.93 E-value=2.4e-10 Score=73.27 Aligned_cols=270 Identities=11% Similarity=0.021 Sum_probs=149.8 Q ss_pred CCCccCC-------Cc----ceecchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeeccccccccc Q lcl|NC_011054. 1 MADISRS-------EV----ATLIQEAYANDLLASAKKGSTVLQAFPTVNMG-TKTTHLPVLATLPGASWVSESATEPEG 68 (302) Q Consensus 1 Ma~~t~~-------~~----g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~~~ 68 (302) |+.-..+ ++ ..+| +.+..++++..+..+.++.+.+..+.. +++.++|+. +..+++....|...... T Consensus 7 ~~~~~~~~~~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~i-g~~~~~~~~~g~~l~~~ 84 (332) T protein:vir:78 7 FSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGTPIVGD 84 (332) T ss_pred ccCCccccCCccccccccchhhhh-hhhhhhHHHHHHHHhhhhhccccccccccceEEEEec-cceeEeeecCCCCCCCC Confidence 3332222 11 1334 888999999999999999999877765 557888886 45555555555544321 Q ss_pred cccccccceee--EEeeeeeEEEeehhHHHHHh-cchHHHHHHHHHHHHHHHHHHHHHHhhc----ccC--CCccccccc Q lcl|NC_011054. 69 VKPTSEATWAD--RTLVAEEVAVIIPVHENVVD-DASTSLLEEIAALGGQAIGKKLDQAVIF----GTD--KPSSWVSPA 139 (302) Q Consensus 69 ~~~~s~~~f~~--i~l~~~ki~~~~~iS~ell~-ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g--~~~g~~~~~ 139 (302) . +++-.+ +++...|+..+. |.+ +-+ ++..++.+.+.++.++++++.+|+.++. +.. .+.+....+ T Consensus 85 ~----~~~~~~~~l~ID~~ky~~~~-Vdd-iD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~ 158 (332) T protein:vir:78 85 A----GIKANEKTLVMDDLLVSSQF-VYS-LDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGG 158 (332) T ss_pred C----CCCCceEEEEEehhhhhHHH-HHh-HHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCcccccccc Confidence 1 122233 344443444332 222 222 3456899999999999999999988873 111 111111000 Q ss_pred ccccccccccceeeccccchHHHHHHHhhhhhhhhhhcccCc--cEEEecHHHHHHHHhhhcC----------CCceeee Q lcl|NC_011054. 140 LLPAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYMP--DTLLASLGFRFDVANLRDA----------NGNPIFR 207 (302) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~l~d~----------~g~~i~~ 207 (302) .......+...+.+.+++.|.++...+.....+. .+++++|..|..|.+.+|. +| .+.+ T Consensus 159 --------~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~-~~~~ 229 (332) T protein:vir:78 159 --------FHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQG-DMNS 229 (332) T ss_pred --------cccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeecccccc-ceec Confidence 0011112234456777888888888887766533 4677899999999764331 22 2222 Q ss_pred c---ccccCcceEeecccccCCC----------cceEEEEecce--EEEE--------eecCcEEEEeecccccchhhhc Q lcl|NC_011054. 208 D---ESFNGFGTYFNANGAWPVG----------VAEALVVDSSR--VRIG--------VRQDITVKFLDQATVGSINLAE 264 (302) Q Consensus 208 ~---~~~~g~p~~~~~~~~~~~~----------~~~~~~gd~~~--~~~~--------~~~~~~i~~~~~~~~~~~~~~~ 264 (302) . ..+.|++++..++...... ....+-|+|+. .++. ...++.+++.+.... ..+| T Consensus 230 g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~--~~~~- 306 (332) T protein:vir:78 230 GKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN--VQYQ- 306 (332) T ss_pred ceeeeEEeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccc--hhhh- Confidence 2 2466788776655432211 11123444433 1111 122223322211100 0111 Q ss_pred CCcEEEEEEEEeccEEeccccEEEEeee Q lcl|NC_011054. 265 RDMIALRLKARFAYVLGNGATAVGDNKT 292 (302) Q Consensus 265 ~~~~~~r~~~r~d~~v~~~~a~~~lt~~ 292 (302) .-.+++.+.+|.++.||++++.++.. T Consensus 307 --~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 307 --GDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred --HhhhhhhhhhcCceecccceEEEeeC Confidence 23467777899999999999988866 No 139 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.92 E-value=7.7e-10 Score=70.54 Aligned_cols=283 Identities=12% Similarity=0.000 Sum_probs=157.7 Q ss_pred CCCcc----------CCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCC-CceEEEEEeCCcceeeecccccccccc Q lcl|NC_011054. 1 MADIS----------RSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGT-KTTHLPVLATLPGASWVSESATEPEGV 69 (302) Q Consensus 1 Ma~~t----------~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~E~~~~~~~~ 69 (302) |++.. .++-..+| +.+..++.+.....+.++++.++.++.+ ++.++|+. +..+++...-|++..... T Consensus 1 ms~~~~~t~~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~~~ 78 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELERSR 78 (335) T ss_pred CCccccccccccccccchhhhhh-hhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-eeeeecccccCcccCCCC Confidence 66554 23333334 8889999999999999999998888765 47888876 666777776676654322 Q ss_pred ccccccceeeEEeeeee--EEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----ccC--CCccccccccc Q lcl|NC_011054. 70 KPTSEATWADRTLVAEE--VAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF----GTD--KPSSWVSPALL 141 (302) Q Consensus 70 ~~~s~~~f~~i~l~~~k--i~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g--~~~g~~~~~~~ 141 (302) +..++..+.... ++.. .|-+----++..++.+.+.+++.+++++..|+.++. +.. .|....+.... T Consensus 79 -----~~~~k~~itID~ll~a~~-~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~ 152 (335) T protein:vir:78 79 -----VVNDKWNLTVDTLLYLRH-QFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSP 152 (335) T ss_pred -----cccCCeEEEecceeechh-hHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCC Confidence 233444444433 2222 222211112457899999999999999999998762 222 22221111000 Q ss_pred ccccccccceeeccccchHHHHHHHhhhhhhhhhhcccC-----ccEEEecHHHHHHHHhhhcCCCc-ee--------ee Q lcl|NC_011054. 142 PAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYM-----PDTLLASLGFRFDVANLRDANGN-PI--------FR 207 (302) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-----~~~~v~~~~~~~~l~~l~d~~g~-~i--------~~ 207 (302) +.... ...+..+...+...+.+.+.++...+...+.+ ....+++|..|..|..-+.--.+ |. .. T Consensus 153 G~~~~--~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~ 230 (335) T protein:vir:78 153 GVLEK--LDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVK 230 (335) T ss_pred Cccee--eeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccccccccccccccccccccc Confidence 10000 01111222335667777777777776655443 24688999999999764321111 11 11 Q ss_pred c--ccccCcceEeecccccCCC-----------------cceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcE Q lcl|NC_011054. 208 D--ESFNGFGTYFNANGAWPVG-----------------VAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMI 268 (302) Q Consensus 208 ~--~~~~g~p~~~~~~~~~~~~-----------------~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 268 (302) . ..+.|.|++...+.+.... ....+++..+....+...++.-++.++... | +- T Consensus 231 g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~-----~---~~ 302 (335) T protein:vir:78 231 SRVAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQ-----F---SW 302 (335) T ss_pred ceeEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccch-----h---hH Confidence 1 1355666655544332211 112333333333334434444444433221 1 11 Q ss_pred EEEEEEEeccEEeccccEEEEeeecccccCCCC Q lcl|NC_011054. 269 ALRLKARFAYVLGNGATAVGDNKTPVGAVVPDG 301 (302) Q Consensus 269 ~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~ 301 (302) .+.+.+-+|.++.||++.+.++-+..+++.-.+ T Consensus 303 ~i~~~~a~G~g~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:78 303 VLDTFQMYNIGARRPDTAGAIELKGIEAFDITA 335 (335) T ss_pred hhhHHHHcCCcccCcceEEEEEecCCCcccccC Confidence 345556689999999998888877666665555 No 140 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.91 E-value=6.2e-10 Score=71.07 Aligned_cols=276 Identities=10% Similarity=0.044 Sum_probs=150.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhh---------cceeecCCCceEEEEEeCC-cceeeecccc-cccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQA---------FPTVNMGTKTTHLPVLATL-PGASWVSESA-TEPEGV 69 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~---------~~~~~~~~~~~~~p~~~~~-~~a~~v~E~~-~~~~~~ 69 (302) ||+.+|.-.-.++|+.+..-+.+...+.+.|++- ......++...++|.+..- ..+.-+.|++ .++. T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~-- 78 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALET-- 78 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccch-- Confidence 9987776677788999877777777666665432 1222235778899988633 5555566664 2432 Q ss_pred ccccccceeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccc--cccccc Q lcl|NC_011054. 70 KPTSEATWADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALL--PAAVAA 147 (302) Q Consensus 70 ~~~s~~~f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~--~~~~~~ 147 (302) .+.+-++-.-..++.+..+.++++...-+..+....+.+++++.+.+..++.+|.--. |++..... ...... T Consensus 79 ---~ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~---gvf~~~~~~~~~~~~~ 152 (330) T protein:vir:10 79 ---GKITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLN---GIFATGTAGEKGALEE 152 (330) T ss_pred ---hhcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHH---hhhhhhhcccchhhhh Confidence 2333344444556667778888887666778889999999999999999888774210 11100000 000000 Q ss_pred c--cceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhh------cCCCceeeecccccCcceEee Q lcl|NC_011054. 148 N--QDYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLR------DANGNPIFRDESFNGFGTYFN 219 (302) Q Consensus 148 ~--~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~------d~~g~~i~~~~~~~g~p~~~~ 219 (302) . ..........+. +.+.++...+......-..|+||+..+..|++.. ++++..- =+..+|.++.+. T Consensus 153 ~~~~~~~~~~a~~s~----~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~--i~~~~G~~Vivd 226 (330) T protein:vir:10 153 THVSDQSKASTGIDA----GMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLIQYIQPTTATIN--IPTYLGYRVIID 226 (330) T ss_pred hheecccccccccCH----HHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhhhhhcccccCcc--cccccceEEEEe Confidence 0 000111112223 3344455555555556778999999999998742 2332211 145677777666 Q ss_pred cccccCCCcceE-EEEecceEEEEe---ecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeee-cc Q lcl|NC_011054. 220 ANGAWPVGVAEA-LVVDSSRVRIGV---RQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKT-PV 294 (302) Q Consensus 220 ~~~~~~~~~~~~-~~gd~~~~~~~~---~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~-~a 294 (302) +.+....+.... +|+ ...+.+++ ...+.++..++. ...+..+..+.++. .+|..+.+-... .. T Consensus 227 D~~p~~~~~yt~yl~~-~GAi~~~~~~~~~~v~~EtdRd~--------~~g~~~l~~r~~~~---~hp~G~s~~~~~~~~ 294 (330) T protein:vir:10 227 DGIAPTGDIYTSYLFR-TGSIGLNTGNPSGLTTFETSREA--------AKGNDMIYTRRALV---MHPYGVKWTGAEVDA 294 (330) T ss_pred CCCCCCCCceeEEEEe-cCceeeecccCCccccccccCCc--------cccceEEEEeeEEE---eeeeeeeeccccccc Confidence 655433222222 222 12222332 112344444432 23444555555544 556666554432 23 Q ss_pred cccCCCCC Q lcl|NC_011054. 295 GAVVPDGS 302 (302) Q Consensus 295 ~~~~p~~~ 302 (302) +-..|.-+ T Consensus 295 ~~~sPt~~ 302 (330) T protein:vir:10 295 GNITPSNA 302 (330) T ss_pred CcCCcChH Confidence 34456655 No 141 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.91 E-value=2.9e-10 Score=72.87 Aligned_cols=225 Identities=11% Similarity=0.112 Sum_probs=148.4 Q ss_pred CCCccCCC-cceecchHHHHHHHHHHHhhhhhhhhcceeecCC-CceEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSE-VATLIQEAYANDLLASAKKGSTVLQAFPTVNMGT-KTTHLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~-~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) |...|..+ +..+-|......|||.+.+.++|++.+++.+... ....+.++++-|.+.|..=++..++ ++.++. T Consensus 6 ~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~fR~lN~g~~~-----s~~tt~ 80 (328) T protein:vir:95 6 LTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSATWRLLNYGVQP-----SKSTTV 80 (328) T ss_pred cccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCceeeecCCccCc-----ccceeE Confidence 45555555 4446677888899999999999999999998853 3477889999999999988887654 567999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhcccCC--Cccccccc----------ccccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDAS--TSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPA----------LLPAA 144 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~----------~~~~~ 144 (302) +++-..+-+++.+.|.+.+.+... .++...-.+...+++.+.+.+.||+|+.+ |.++.... ..+.. T Consensus 81 q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~a~qii 160 (328) T protein:vir:95 81 QVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGNAQNII 160 (328) T ss_pred EEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCcccccccccee Confidence 999999999999999998887653 34445456678899999999999999643 22110000 00000 Q ss_pred c---cccc--c--------------------------------------------------------------------- Q lcl|NC_011054. 145 V---AANQ--D--------------------------------------------------------------------- 150 (302) Q Consensus 145 ~---~~~~--~--------------------------------------------------------------------- 150 (302) . .+.. + T Consensus 161 daGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~NId 240 (328) T protein:vir:95 161 DAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVRIANID 240 (328) T ss_pred ecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEecCc Confidence 0 0000 0 Q ss_pred eeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhh-cCCCceeeec-------ccccCcceEeeccc Q lcl|NC_011054. 151 YTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLR-DANGNPIFRD-------ESFNGFGTYFNANG 222 (302) Q Consensus 151 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~-d~~g~~i~~~-------~~~~g~p~~~~~~~ 222 (302) ....++......+.+++..+...++........|.||++....|++.. +..+-.+-.. -.+.|.|+..++. T Consensus 241 ~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g~~~t~~~gipir~~da- 319 (328) T protein:vir:95 241 VSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEGEWWTSFRGVPIRETDA- 319 (328) T ss_pred ccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCCcceeEECCeEEEEEee- Confidence 000112334566777888888888777777888999999999998753 4443333211 1344555444432 Q ss_pred ccCCCcceEE Q lcl|NC_011054. 223 AWPVGVAEAL 232 (302) Q Consensus 223 ~~~~~~~~~~ 232 (302) ...++..++ T Consensus 320 -i~~tE~~vv 328 (328) T protein:vir:95 320 -LLETEARVV 328 (328) T ss_pred -eecCccccC Confidence 122222222 No 142 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.90 E-value=5.4e-10 Score=71.40 Aligned_cols=282 Identities=11% Similarity=0.036 Sum_probs=152.7 Q ss_pred CCCccCCC-------cc--------eecchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeeccccc Q lcl|NC_011054. 1 MADISRSE-------VA--------TLIQEAYANDLLASAKKGSTVLQAFPTVNMG-TKTTHLPVLATLPGASWVSESAT 64 (302) Q Consensus 1 Ma~~t~~~-------~g--------~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~ 64 (302) ||+..+.. .| .+| +.+..++.+..+..+.++++.+..+.. +++.++|+. +..++..+..++. T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~i-G~~t~~~~~~g~~ 78 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVI-GRTKAAYLKPGEN 78 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccccccccceeEeeec-cceeeeeecCCCC Confidence 88655543 11 245 888999999999999999999877655 557888875 4455555666665 Q ss_pred cccccccccccceeeEEee--eeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc-----ccCC--Cccc Q lcl|NC_011054. 65 EPEGVKPTSEATWADRTLV--AEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF-----GTDK--PSSW 135 (302) Q Consensus 65 ~~~~~~~~s~~~f~~i~l~--~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~-----G~g~--~~g~ 135 (302) +.... .+++..+.++. ..++..+ .|.+-=.-++..++.+.+.++.++++++..|+.++. +... +.+. T Consensus 79 l~~~~---~~~~~~e~~ltiD~~~y~~~-~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~ 154 (347) T protein:vir:33 79 LDDKR---KDIKHTEKVIHIDGLLTADV-LIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNEN 154 (347) T ss_pred CCCCC---CCCccceEEEEechhhhhhH-HHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Confidence 43211 12233444444 3333322 222221123456899999999999999999999872 1111 1110 Q ss_pred ccccccc--cccccccceeeccccchHHHHHHHhhhhhhhhhhcccC--ccEEEecHHHHHHHHhhhcC-CCcee----- Q lcl|NC_011054. 136 VSPALLP--AAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYM--PDTLLASLGFRFDVANLRDA-NGNPI----- 205 (302) Q Consensus 136 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~l~d~-~g~~i----- 205 (302) ....... .......+........+.+.+++.+.++...+...+.+ ..+++++|..|..|.+-..- +..+. T Consensus 155 ~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~~~ 234 (347) T protein:vir:33 155 IEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALLDP 234 (347) T ss_pred cccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccccccccccccccc Confidence 0000000 00011111112222334567778788787777765543 45788999999999764321 11221 Q ss_pred eec--ccccCcceEeecccccCCC------------------cceEEEEecce----------EEEEeecCcEEEEeecc Q lcl|NC_011054. 206 FRD--ESFNGFGTYFNANGAWPVG------------------VAEALVVDSSR----------VRIGVRQDITVKFLDQA 255 (302) Q Consensus 206 ~~~--~~~~g~p~~~~~~~~~~~~------------------~~~~~~gd~~~----------~~~~~~~~~~i~~~~~~ 255 (302) .+. ..+.|++++..++...... .....-++|+. +......++.++..++. T Consensus 235 ~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~ 314 (347) T protein:vir:33 235 ERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRA 314 (347) T ss_pred ccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccch Confidence 111 1467777776655332110 01111222211 11122223344444332 Q ss_pred cccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccc Q lcl|NC_011054. 256 TVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGA 296 (302) Q Consensus 256 ~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~ 296 (302) . +-.-.+++.+.+|.+++||++.+.+.....+. T Consensus 315 ~--------~~~d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 315 N--------YQADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred h--------hhhHhhhhhhhcCCceecccceEEEecCCCCC Confidence 1 11234677778899999999998887655555 No 143 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.88 E-value=1.6e-09 Score=68.75 Aligned_cols=271 Identities=9% Similarity=0.016 Sum_probs=149.7 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhh---------ccee--ecCCCceEEEEEeCC-cceeeeccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQA---------FPTV--NMGTKTTHLPVLATL-PGASWVSESATEPEG 68 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~---------~~~~--~~~~~~~~~p~~~~~-~~a~~v~E~~~~~~~ 68 (302) ||.+.. .-.++|+.+..-+.+...+.+.|++- .... ..++..+++|.+..- ..+.-+.|+..++. T Consensus 1 MA~T~l--sd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~- 77 (324) T protein:vir:59 1 MAYTKI--SDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVP- 77 (324) T ss_pred CCceee--eceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccch- Confidence 995443 34577888877777777777666432 1222 235667899988653 56666777776553 Q ss_pred cccccccceeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccc Q lcl|NC_011054. 69 VKPTSEATWADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAAN 148 (302) Q Consensus 69 ~~~~s~~~f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~ 148 (302) .+.+.++-.-..++.+..+.++++...-+..+....+.+++++.+++..++.+|.--. |........ ..... T Consensus 78 ----~~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~---g~~~~~~~~-~~~~d 149 (324) T protein:vir:59 78 ----QKINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELA---GVFSNDDMK-DNKLD 149 (324) T ss_pred ----hhcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHH---Hhhhccccc-cceee Confidence 3344444444555666677888877766777899999999999999999988874210 111000000 00000 Q ss_pred cceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhh------cCCCceeeecccccCcceEeeccc Q lcl|NC_011054. 149 QDYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLR------DANGNPIFRDESFNGFGTYFNANG 222 (302) Q Consensus 149 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~------d~~g~~i~~~~~~~g~p~~~~~~~ 222 (302) . ........+.+ .+.++...+-.....-..|+||+..+..|++.. .+++..- =+...|.++.+.+.. T Consensus 150 v-sa~~~~~~s~~----~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~--i~~~~G~~VivdD~~ 222 (324) T protein:vir:59 150 I-SGTADGIYSAE----TFVDASYKLGDHESLLTAIGMHSATMASAVKQDLIEFVKDSQSGIR--FPTYMNKRVIVDDSM 222 (324) T ss_pred e-eccccceecHH----HHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhhhhccccccCce--eeeecccEEEEeCCC Confidence 0 00111122333 344455555555566778999999999998753 3333211 134677777766543 Q ss_pred ccC--CC---cc-eEEEEecceEEEEe-ecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccc Q lcl|NC_011054. 223 AWP--VG---VA-EALVVDSSRVRIGV-RQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVG 295 (302) Q Consensus 223 ~~~--~~---~~-~~~~gd~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~ 295 (302) ... .+ .. ..+|+. ..+.++. ...+.++..++. ..++..+..+.++. .+|..+... .+..+ T Consensus 223 p~~~~~~~~~~y~s~l~~~-GAi~~~~~~~~v~vE~dRd~--------~~g~~~l~~r~~~~---~~p~G~s~~-~~~~~ 289 (324) T protein:vir:59 223 PVETLEDGTKVFTSYLFGA-GALGYAEGQPEVPTETARNA--------LGSQDILINRKHFV---LHPRGVKFT-ENAMA 289 (324) T ss_pred CccccCCCCceEEEEEEec-CeEEEeecCCCcceecccCc--------cccceEEEEeeEEE---eEeeeEEec-ccccC Confidence 321 11 11 123332 2233443 233555554442 34455666666655 555555432 33333 Q ss_pred ccCCCCC Q lcl|NC_011054. 296 AVVPDGS 302 (302) Q Consensus 296 ~~~p~~~ 302 (302) ...|.-+ T Consensus 290 ~~sPt~~ 296 (324) T protein:vir:59 290 GTTPTDE 296 (324) T ss_pred CCCCChh Confidence 3455544 No 144 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=98.87 E-value=1.1e-09 Score=69.76 Aligned_cols=282 Identities=10% Similarity=-0.010 Sum_probs=140.7 Q ss_pred CCCccCC-----------CcceecchHHHHHHHHHHHhhhhhhhhcceeec---CCCceEEEEEeCCcceeeeccccccc Q lcl|NC_011054. 1 MADISRS-----------EVATLIQEAYANDLLASAKKGSTVLQAFPTVNM---GTKTTHLPVLATLPGASWVSESATEP 66 (302) Q Consensus 1 Ma~~t~~-----------~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~---~~~~~~~p~~~~~~~a~~v~E~~~~~ 66 (302) ||.+..+ ....++|+.++.++++.+++.+.+..+++.... .+.++++|+.. .+.+..+.++.++. T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g-~~~a~d~~~g~~i~ 79 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS-RAAVYDKQPQTPVN 79 (381) T ss_pred CceecccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC-cceeeeecCCCccc Confidence 6655422 225689999999999999999888888765432 35678999864 56777788887665 Q ss_pred cccccccccceeeEEeeeeeE-EEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccc----CCCccccccccc Q lcl|NC_011054. 67 EGVKPTSEATWADRTLVAEEV-AVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGT----DKPSSWVSPALL 141 (302) Q Consensus 67 ~~~~~~s~~~f~~i~l~~~ki-~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~----g~~~g~~~~~~~ 141 (302) -+. .+..++++...+. .....|++.-..++..++.+.+.+.+..++++++|+.++.-- ..+.+...... T Consensus 80 ~~~-----~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~- 153 (381) T protein:vir:80 80 LQA-----RTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYD- 153 (381) T ss_pred ccc-----cCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc- Confidence 432 2334444444332 233566765555566789999999999999999999987421 11111100000 Q ss_pred ccccccccceeeccccchHHHHHHHhhhhhhhhhhccc--CccEEEecHHHHHHHHhhhc-----CCCceeeec---ccc Q lcl|NC_011054. 142 PAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGY--MPDTLLASLGFRFDVANLRD-----ANGNPIFRD---ESF 211 (302) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~l~d-----~~g~~i~~~---~~~ 211 (302) ..... ...............++.+.++...+..... ....++++|..+..|.+... ..+...++. ..+ T Consensus 154 ~~i~~--~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig~i 231 (381) T protein:vir:80 154 TTLGD--GTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVGTI 231 (381) T ss_pred ccccc--cccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeeeEE Confidence 00000 0011111122334455666777776665543 23468899999999976421 111111111 246 Q ss_pred cCcceEeecccccCCC-cceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEecc-ccEEEE Q lcl|NC_011054. 212 NGFGTYFNANGAWPVG-VAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNG-ATAVGD 289 (302) Q Consensus 212 ~g~p~~~~~~~~~~~~-~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~-~a~~~l 289 (302) .|++++.......... .....+|-..... ..+ ....+. .-|.++..+++....+|.++... ..+-.. T Consensus 232 ~G~~Vv~Sn~lp~~~~t~~~~~agap~~~~----~~~-----~~~~~~--g~~s~~a~av~~~k~yd~~~~~~~~~~~~~ 300 (381) T protein:vir:80 232 LGMEVIVTTQIGINSLTGYVNGQGAPTQPT----PGV-----LGSPYL--PDQAGTANVVNTGSASDLAVSLSYFGLPVF 300 (381) T ss_pred cceEEEeecccccccccceeeecccccccc----ccc-----cccccc--cccccceeeeeeeeeeceeeeeeeccceee Confidence 6777766654432211 1111111100000 000 000000 01223334555555555555322 111111 Q ss_pred eee----------------------------cccccCCC------CC Q lcl|NC_011054. 290 NKT----------------------------PVGAVVPD------GS 302 (302) Q Consensus 290 t~~----------------------------~a~~~~p~------~~ 302 (302) +++ .|.++-|+ +| T Consensus 301 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 347 (381) T protein:vir:80 301 SGAGATAADGGQTLGSFGGANRWATAVVCHPDWLAVGVQQNVKSESS 347 (381) T ss_pred ecceeeecCCCceeeeehhhhhhhhhcccccccccccceeEeecccc Confidence 111 11111110 11 No 145 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.85 E-value=1.2e-08 Score=64.10 Aligned_cols=285 Identities=11% Similarity=0.001 Sum_probs=151.1 Q ss_pred CCCccCCCcce---------ecchHHHHHHHHHHHhhhhhhhhcceeecCC-CceEEEEEeCCcceeeeccccccccccc Q lcl|NC_011054. 1 MADISRSEVAT---------LIQEAYANDLLASAKKGSTVLQAFPTVNMGT-KTTHLPVLATLPGASWVSESATEPEGVK 70 (302) Q Consensus 1 Ma~~t~~~~g~---------liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~E~~~~~~~~~ 70 (302) |+..+....+. +.=+.+..++.+.....+.++++.++.++.+ ++.++|+. +..+++...-|++.. .. T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~i-G~~~~~~~~~G~~ld-~~- 77 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYI-GETELQVLSPGKSPD-AS- 77 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeee-eeeEEeeeccCcccC-CC- Confidence 87766544322 2237778899999999999999998888765 47888886 555666665555532 22 Q ss_pred cccccceeeEEeeee--eEEEeehhHHHHHhcchHH-HHHHHHHHHHHHHHHHHHHHhhc----c-cCCCcccc-ccccc Q lcl|NC_011054. 71 PTSEATWADRTLVAE--EVAVIIPVHENVVDDASTS-LLEEIAALGGQAIGKKLDQAVIF----G-TDKPSSWV-SPALL 141 (302) Q Consensus 71 ~~s~~~f~~i~l~~~--ki~~~~~iS~ell~ds~~~-~~~~i~~~l~~ai~~~~d~~~l~----G-~g~~~g~~-~~~~~ 141 (302) .+.-++.+|... +++... |-+----++..+ +.+.+.+++.+++++.+|+.++. + -..-.+.. ..... T Consensus 78 ---~~~~~k~~itID~ll~a~~~-V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~ 153 (364) T protein:vir:10 78 ---PTEFDKNRLVVDTTVIARNT-VAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVA 153 (364) T ss_pred ---CcccCcEEEEecceeeechh-hhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCccc Confidence 233344444443 333222 222111123455 68899999999999999998862 1 00000000 00000 Q ss_pred ccccccccceeeccccchHHHHHHHhhhhhhhhhhcccC--ccEEEecHHHHHHHHhhhc---------CCCceeee-cc Q lcl|NC_011054. 142 PAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYM--PDTLLASLGFRFDVANLRD---------ANGNPIFR-DE 209 (302) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~l~d---------~~g~~i~~-~~ 209 (302) +............+.......+.+.+.++...+.+.+.+ ...++++|..|..|.+-.+ .+|-+... -. T Consensus 154 ~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~ 233 (364) T protein:vir:10 154 GHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVL 233 (364) T ss_pred CCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeE Confidence 100011111222333445567777777777777665543 3468899999999976321 11222111 01 Q ss_pred cccCcceEeecccccC---------C----------C----------cceEEEEecceEEEEeecCcEEEEeecccccch Q lcl|NC_011054. 210 SFNGFGTYFNANGAWP---------V----------G----------VAEALVVDSSRVRIGVRQDITVKFLDQATVGSI 260 (302) Q Consensus 210 ~~~g~p~~~~~~~~~~---------~----------~----------~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~ 260 (302) .+.|.+++...+.+.. . + ...+++...+....+...++..++.++... T Consensus 234 ~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~--- 310 (364) T protein:vir:10 234 KSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKE--- 310 (364) T ss_pred EEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccce--- Confidence 3456665544433210 0 0 111222222223333444555555443221 Q ss_pred hhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 261 NLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 261 ~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) -+..+.+.+-+|.++.||++++.++...+ .+|+-- T Consensus 311 -----~~~~ida~~a~G~g~lRPeaa~~i~~~~~--~~~~~~ 345 (364) T protein:vir:10 311 -----KTWYIDTFLAEGAIPDRWEAVAVVTAADT--AELATD 345 (364) T ss_pred -----eeeeeeeehcccCcccCccceEEEEecCC--CCCccc Confidence 11234556668999999999887764432 235444 No 146 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.79 E-value=1.8e-09 Score=68.53 Aligned_cols=274 Identities=14% Similarity=0.056 Sum_probs=161.7 Q ss_pred CCCccCCCcceecchHH---HHHHHHHHHhhhhhhhhcceee---cCCCceEEEEEeCCcceeeeccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAY---ANDLLASAKKGSTVLQAFPTVN---MGTKTTHLPVLATLPGASWVSESATEPEGVKPTSE 74 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~---~~~ii~~~~~~s~l~~~~~~~~---~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~ 74 (302) |...-..++|.++-.++ .+.+++.....-..++++.... ....+..+...+....+.|++.++. ..|..+ T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~----dip~v~ 76 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTD----DLPLVD 76 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCcc----ccceee Confidence 77664555667766444 5667777777767777666443 2233566666667778888876643 234455 Q ss_pred cceeeEEeeeeeEEEeehhHHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhcccCCC--ccccccccccccccccc Q lcl|NC_011054. 75 ATWADRTLVAEEVAVIIPVHENVVDDA---STSLLEEIAALGGQAIGKKLDQAVIFGTDKP--SSWVSPALLPAAVAANQ 149 (302) Q Consensus 75 ~~f~~i~l~~~ki~~~~~iS~ell~ds---~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~--~g~~~~~~~~~~~~~~~ 149 (302) ..+++.....+.++..+.++.+=++.+ ..+++.--....++++++.+|+.+|+|+..- .|+++....+..... T Consensus 77 ~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~-- 154 (296) T protein:vir:10 77 ALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSG-- 154 (296) T ss_pred ccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCcccccc-- Confidence 667777888888888888887544433 4578888888999999999999999997642 355544333222111 Q ss_pred ceeeccccchHHHHHHHhhhhhhhhhh---cccCccEEEecHHHHHHHHhhhcCCCceeeecccccCcc-----eEeecc Q lcl|NC_011054. 150 DYTIVPGDANEDDLIGCINRASKAVAA---AGYMPDTLLASLGFRFDVANLRDANGNPIFRDESFNGFG-----TYFNAN 221 (302) Q Consensus 150 ~~~~~~~~~~~~~~~~~i~~~~~~~~~---~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~~~g~p-----~~~~~~ 221 (302) .+. .+...+.+++.+++..+.. ....+..++++|..+..|.......|.-++.-=.....+ ++.... T Consensus 155 ----~~W-~~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~ik~~~~~l~i~~~~~l~~ 229 (296) T protein:vir:10 155 ----GSW-SQPTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGEFFRQNNSGVTVEFVQYLND 229 (296) T ss_pred ----CCc-cCHHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHHHHHHhcCCceEEEeeeecc Confidence 111 1233667777777765543 456677899999999999766555553332210111112 222211 Q ss_pred cccCCCcceEEEEec--ceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEec-cEEeccccEEEEeeeccc Q lcl|NC_011054. 222 GAWPVGVAEALVVDS--SRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFA-YVLGNGATAVGDNKTPVG 295 (302) Q Consensus 222 ~~~~~~~~~~~~gd~--~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d-~~v~~~~a~~~lt~~~a~ 295 (302) . ...++..+++.+. +.+-+....++....... ..-...++...|++ ..+.+|.+++++++..-+ T Consensus 230 a-~~~g~~~~v~~~~~~~~~~~~v~~~~~~~~~e~---------~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 230 Y-NGTGTSAAIAYEKDPNNMAIEIPEATNALPAQP---------KDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred C-CCCcceEEEEEEcCCceEEEEcCcceeeecccc---------cCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 1 1122333344332 233333333333221111 11223467788885 778899999999987665 No 147 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.77 E-value=5.4e-09 Score=65.90 Aligned_cols=274 Identities=14% Similarity=0.074 Sum_probs=155.7 Q ss_pred CccCCCcceecc---hHHHHHHHHHHHhhhhhhhhccee---ecCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 3 DISRSEVATLIQ---EAYANDLLASAKKGSTVLQAFPTV---NMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 3 ~~t~~~~g~liP---~~~~~~ii~~~~~~s~l~~~~~~~---~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ..+.++|. +.. +.+.+.+++.+.+....++++... +.......+...+....+.|++.++. ..|..+.. T Consensus 1 ~~~~~~g~-f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~----dip~~~~~ 75 (301) T protein:vir:80 1 MQGKITAT-IEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGAD----DLPLVDVD 75 (301) T ss_pred CCccccch-hhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCccc----cccccccc Confidence 34444443 333 233466788888887777776553 33334566666667778888877653 23445666 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccccccccccccccce Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDA---STSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAANQDY 151 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds---~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~~~~ 151 (302) ++......+.++.-+.++..=++.+ ..++..--....++++++.+|+.+|+|+.. -.|+++....+......... T Consensus 76 ~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~ 155 (301) T protein:vir:80 76 MVRKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGV 155 (301) T ss_pred ceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCccc Confidence 7777778888888888877444432 457788888999999999999999999764 24555554433322211111 Q ss_pred e--eccccchHHHHHHHhhhhhhhhhh---cccCccEEEecHHHHHHHHhhh--cCCCceeee---cc--cccCcceEee Q lcl|NC_011054. 152 T--IVPGDANEDDLIGCINRASKAVAA---AGYMPDTLLASLGFRFDVANLR--DANGNPIFR---DE--SFNGFGTYFN 219 (302) Q Consensus 152 ~--~~~~~~~~~~~~~~i~~~~~~~~~---~~~~~~~~v~~~~~~~~l~~l~--d~~g~~i~~---~~--~~~g~p~~~~ 219 (302) . ..-...+.+.+++++.++..++.. ....+..++++|+.+..|.... +..|.-+++ .. .+...+++.. T Consensus 156 ~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~~~I~~~p~L 235 (301) T protein:vir:80 156 GNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWFSAIVRVPDL 235 (301) T ss_pred ccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCcceEEEccee Confidence 0 111234667888888888888753 3346678999999999997543 444543322 11 1111122222 Q ss_pred cccccCCCcceEE-EEe-cceEEEEeecCcEEEEeecccccchhhhcCCc-EEEEEEEEe-ccEEeccccEEEEeee Q lcl|NC_011054. 220 ANGAWPVGVAEAL-VVD-SSRVRIGVRQDITVKFLDQATVGSINLAERDM-IALRLKARF-AYVLGNGATAVGDNKT 292 (302) Q Consensus 220 ~~~~~~~~~~~~~-~gd-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~-~~~r~~~r~-d~~v~~~~a~~~lt~~ 292 (302) ..... .++..++ +.+ ...+-+....+++.... -.+++ .....+.|+ +..+.+|.+++++++. T Consensus 236 ~~~g~-~g~~~~v~~~~~~d~~~~~v~~~~~~~~~----------e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 236 AGMGT-AGSDSFAVIHDSNETAELIIPMDITRHPE----------EYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred ccCCC-CcccEEEEEecCCcEEEEEecCceeeecc----------eecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 21111 1222222 222 12222222222221111 11222 122345666 4567899999999998 No 148 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=98.75 E-value=3.7e-09 Score=66.82 Aligned_cols=279 Identities=11% Similarity=0.053 Sum_probs=159.6 Q ss_pred CCCccC--CCcceecc---hHHHHHHHHHHHhhhhhhhhcceee---cCCCceEEEEEeCCcceeeeccccccccccccc Q lcl|NC_011054. 1 MADISR--SEVATLIQ---EAYANDLLASAKKGSTVLQAFPTVN---MGTKTTHLPVLATLPGASWVSESATEPEGVKPT 72 (302) Q Consensus 1 Ma~~t~--~~~g~liP---~~~~~~ii~~~~~~s~l~~~~~~~~---~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (302) |...+. .+.+.++- +.+...|++.....-..++++...+ ....+..+........+.|++.++. ..|. T Consensus 26 ~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~~G~a~~~~d~~~----dip~ 101 (329) T protein:vir:79 26 LRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDKVGHAKIIADYTD----DLST 101 (329) T ss_pred cccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeecceeeeeecCccc----ccce Confidence 333332 22344554 3345778888888777777776542 2334567777777788888876543 2334 Q ss_pred cccceeeEEeeeeeEEEeehhHHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Cccccccccccccccc Q lcl|NC_011054. 73 SEATWADRTLVAEEVAVIIPVHENVVDDA---STSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAA 147 (302) Q Consensus 73 s~~~f~~i~l~~~ki~~~~~iS~ell~ds---~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~ 147 (302) .+..+++-....+.++..+.++..=++.+ ..++..--....++++++.+|+.+|+|++. -.|+++....+..... T Consensus 102 vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~ 181 (329) T protein:vir:79 102 VDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKGSKPHKIISVFEHPNLTTINSA 181 (329) T ss_pred eecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEeecccccceeeecCCCccccccC Confidence 45566666667777888888876433322 457888888899999999999999999764 2466655554432221 Q ss_pred ccceeeccccchHHHHHHHhhhhhhhhhh---cccCccEEEecHHHHHHHHhhhcCCCceeeecccccCcceEeeccc-- Q lcl|NC_011054. 148 NQDYTIVPGDANEDDLIGCINRASKAVAA---AGYMPDTLLASLGFRFDVANLRDANGNPIFRDESFNGFGTYFNANG-- 222 (302) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~---~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~~~g~p~~~~~~~-- 222 (302) .. ....-...+.+.+++++.+++.++.. ....+..++++|+.+..|.......|.-+++-=..+..+..+...+ T Consensus 182 ~~-~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~~~~~tvl~~lk~~~~~l~I~~~~el 260 (329) T protein:vir:79 182 GW-NNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMPETTMSYLDYFKQQNGGITIESISEL 260 (329) T ss_pred CC-CCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccCCCCccHHHHHHHhCCCcEEEEcccc Confidence 11 11112234667888888888887764 2345678999999999997655555543322101111222222111 Q ss_pred --ccCCCcceEEEEecce--EEEEeecCcEEEEeecccccchhhhcCCc--EEEEEEEEec-cEEeccccEEEEeeeccc Q lcl|NC_011054. 223 --AWPVGVAEALVVDSSR--VRIGVRQDITVKFLDQATVGSINLAERDM--IALRLKARFA-YVLGNGATAVGDNKTPVG 295 (302) Q Consensus 223 --~~~~~~~~~~~gd~~~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~--~~~r~~~r~d-~~v~~~~a~~~lt~~~a~ 295 (302) ....+...+++.+.+. +-+.....++.... ++.. .....+.|++ ..+.+|.+|+++++..++ T Consensus 261 ~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~-----------q~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 261 EDIDGAGTKAALVYEKDPMNMSIEIPEAFNMLTA-----------QPKDLHFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred cccCCCCceEEEEEecCCceEEEecCcceeeeec-----------eecCceEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 1112223333333322 22222222222111 1111 2234566665 556889999999999888 No 149 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.75 E-value=3.1e-09 Score=67.24 Aligned_cols=274 Identities=9% Similarity=0.028 Sum_probs=156.7 Q ss_pred CC--CccCCCcceecchHH---HHHHHHHHHhhhhhhhhcceee---cCCCceEEEEEeCCcceeeeccccccccccccc Q lcl|NC_011054. 1 MA--DISRSEVATLIQEAY---ANDLLASAKKGSTVLQAFPTVN---MGTKTTHLPVLATLPGASWVSESATEPEGVKPT 72 (302) Q Consensus 1 Ma--~~t~~~~g~liP~~~---~~~ii~~~~~~s~l~~~~~~~~---~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (302) |- .-...+.|.+...++ .+.+++.....-..++++.... ....+..+........+.|++.+.. ..|. T Consensus 21 ~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~d~~~----dip~ 96 (319) T protein:vir:10 21 AGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDKVGTAQIIADYTD----DLPL 96 (319) T ss_pred ccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeeccccceeeecCccc----cccc Confidence 22 222223355555333 4567888777777777776542 2233456666666778888877643 1334 Q ss_pred cccceeeEEeeeeeEEEeehhHHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Cccccccccccccccc Q lcl|NC_011054. 73 SEATWADRTLVAEEVAVIIPVHENVVDDA---STSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAA 147 (302) Q Consensus 73 s~~~f~~i~l~~~ki~~~~~iS~ell~ds---~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~ 147 (302) .+..++......+.++..+.++..=++.+ ..++..--....++++++.+|+.+|+|+.. -.|+++....+..... T Consensus 97 v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~~~~~~~~ 176 (319) T protein:vir:10 97 VDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSAPHKIVSVFNHPNITKITSG 176 (319) T ss_pred eeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEEeCCCceeeecC Confidence 45566777777788888888876434332 457788888899999999999999999764 2355555443332211 Q ss_pred ccceeeccccchHHHHHHHhhhhhhhhhh---cccCccEEEecHHHHHHHHhhhcCCCceeeecc-----cccCcceEee Q lcl|NC_011054. 148 NQDYTIVPGDANEDDLIGCINRASKAVAA---AGYMPDTLLASLGFRFDVANLRDANGNPIFRDE-----SFNGFGTYFN 219 (302) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~---~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~-----~~~g~p~~~~ 219 (302) .. ...++.+.+.+.+++.+++..+.. ....+..++++|+.+..|.......|.-++.-= .+...+++.. T Consensus 177 ~~---~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~t~l~~lk~~~~~l~I~~~pel 253 (319) T protein:vir:10 177 KW---IDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMPETTMSYLDYFKSQNSGIEIDSIAEL 253 (319) T ss_pred CC---CCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccCCCCeeHHHHHHHhcCCceEEEeeee Confidence 11 111234567888888888887753 345677899999999999766555554433211 1111122222 Q ss_pred cccccCCCcceEEEEecc--eEEEEeecCcEEEEeecccccchhhhcCCc-EEEEEEEEec-cEEeccccEEEEeee Q lcl|NC_011054. 220 ANGAWPVGVAEALVVDSS--RVRIGVRQDITVKFLDQATVGSINLAERDM-IALRLKARFA-YVLGNGATAVGDNKT 292 (302) Q Consensus 220 ~~~~~~~~~~~~~~gd~~--~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~-~~~r~~~r~d-~~v~~~~a~~~lt~~ 292 (302) .... ..+...+++...+ .+-+.....++...... +++ .....+.|++ ..+.+|.+++++++. T Consensus 254 ~~ag-~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~e~----------~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 254 EDID-GAGTKGVLVYEKNPMNMSIEIPEAFNMLPAQP----------KDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred cccC-CCcceEEEEEecCCceEEEecCcceeeeeeee----------cCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 2111 1122333333322 22222222322211110 111 2334566665 557889999999998 No 150 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.71 E-value=8.3e-09 Score=64.90 Aligned_cols=283 Identities=11% Similarity=0.010 Sum_probs=149.3 Q ss_pred CCCccCCC-------cceecc-------hHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeecccccc Q lcl|NC_011054. 1 MADISRSE-------VATLIQ-------EAYANDLLASAKKGSTVLQAFPTVNMG-TKTTHLPVLATLPGASWVSESATE 65 (302) Q Consensus 1 Ma~~t~~~-------~g~liP-------~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~ 65 (302) ||++.+.. .|...+ +.+..++++..+..+.++.+++..+.. +++.++|+.. ..++.....+.+. T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig-~~t~~~~~~g~~l 79 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIG-RTKAAYLKPGENL 79 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeecc-ceeeeeeccCCCC Confidence 88766543 111233 455788888888899999999877755 5578888764 4556666666654 Q ss_pred ccccccccccceeeEEee--eeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----c-cCCC---ccc Q lcl|NC_011054. 66 PEGVKPTSEATWADRTLV--AEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF----G-TDKP---SSW 135 (302) Q Consensus 66 ~~~~~~~s~~~f~~i~l~--~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G-~g~~---~g~ 135 (302) +... .+.+..+.+|. ..++..+ .|.+-=..++..++.+.+.++.++++++..|+.++. + +-.+ .+. T Consensus 80 ~~~~---~~~~~~e~~ltID~~~~~~~-~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~ 155 (347) T protein:vir:15 80 DDKR---KDIKHTEKVIHIDGLLTADV-LIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENI 155 (347) T ss_pred CCCC---CCCccceEEEEechhhhhhH-HhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 3221 12334454444 3344332 222211123556899999999999999999998873 1 0000 000 Q ss_pred ccccccccccc-cccceeeccccchHHHHHHHhhhhhhhhhhccc--CccEEEecHHHHHHHHhhhcCC-----Cceeee Q lcl|NC_011054. 136 VSPALLPAAVA-ANQDYTIVPGDANEDDLIGCINRASKAVAAAGY--MPDTLLASLGFRFDVANLRDAN-----GNPIFR 207 (302) Q Consensus 136 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~l~d~~-----g~~i~~ 207 (302) ...+....... ..............+.+++.+.++...+..... ...+++++|..|..|.+-.+.. |.-.+. T Consensus 156 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~~~~ 235 (347) T protein:vir:15 156 EGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALIDHE 235 (347) T ss_pred cccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhccccccccccccccccc Confidence 00000000000 000011111222345667777766666655443 3456788999999997643221 111111 Q ss_pred c---ccccCcceEeecccccCCC----------cce------------------EEEEecceEEEEeecCcEEEEeeccc Q lcl|NC_011054. 208 D---ESFNGFGTYFNANGAWPVG----------VAE------------------ALVVDSSRVRIGVRQDITVKFLDQAT 256 (302) Q Consensus 208 ~---~~~~g~p~~~~~~~~~~~~----------~~~------------------~~~gd~~~~~~~~~~~~~i~~~~~~~ 256 (302) . ..+.|++++..++...... ... .++...+....+...++.++...+.. T Consensus 236 ~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~ 315 (347) T protein:vir:15 236 RGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN 315 (347) T ss_pred ceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccch Confidence 1 2456777776654432110 111 11111121112233344444443321 Q ss_pred ccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccc Q lcl|NC_011054. 257 VGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGA 296 (302) Q Consensus 257 ~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~ 296 (302) +-.-.+++.+.+|.+++||++.+.+.....+. T Consensus 316 --------~~~d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 316 --------YQADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred --------hhhhhhehhhhcCCceeccccEEEEecCCCCC Confidence 12234667778899999999998887555555 No 151 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.69 E-value=2.7e-09 Score=67.60 Aligned_cols=255 Identities=13% Similarity=0.013 Sum_probs=129.3 Q ss_pred hcceeecCCCceEEEEEeCCcceeeeccccccccccccccccceee--EEeeeeeEEEeehhHHHHHhcchHHHHHHHHH Q lcl|NC_011054. 34 AFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWAD--RTLVAEEVAVIIPVHENVVDDASTSLLEEIAA 111 (302) Q Consensus 34 ~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~~--i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~ 111 (302) +++.+. ++++.++|+. +..++..+.-|+++..... ++.=.+ |++...++..+..---+-. ++..++.+...+ T Consensus 1 ~vr~i~-~g~s~~~~~i-G~~~~~~~~~G~~l~~~~~---~~~~~e~~itID~~l~~~~~VdDiD~~-qa~~Dlr~e~s~ 74 (324) T protein:vir:99 1 MTRTIT-SGKSAQFPVM-GRTKARYLKQGQSLDDGRE---DIKHTEKVITIDGLLTTDVLIYDIEDA-MNHYDVRSEYST 74 (324) T ss_pred Ceeeee-cCceEEEeee-eeeEeccccCCCCcCCCcC---CcCcccEEEEecchhhhhhhhhhHHHH-hcCccchhHHHH Confidence 666553 3667899887 6666776666665532110 111122 4445555444322222222 245789999999 Q ss_pred HHHHHHHHHHHHHhhc----c--cCCCcccccccccccccccccceeeccccchHHHHHHHhhhhhhhhhhcccC--ccE Q lcl|NC_011054. 112 LGGQAIGKKLDQAVIF----G--TDKPSSWVSPALLPAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYM--PDT 183 (302) Q Consensus 112 ~l~~ai~~~~d~~~l~----G--~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~ 183 (302) ++++++++.+|+.++. + ..++....+....+..................+.+++.+.++...+...+.+ ..+ T Consensus 75 ~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~ 154 (324) T protein:vir:99 75 QMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRT 154 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCE Confidence 9999999999988862 1 1112211111111111111111222222344566777777777777655433 357 Q ss_pred EEecHHHHHHHHhhhcC-CCcee----e-ec--ccccCcceEeecccccCCC---------------------------- Q lcl|NC_011054. 184 LLASLGFRFDVANLRDA-NGNPI----F-RD--ESFNGFGTYFNANGAWPVG---------------------------- 227 (302) Q Consensus 184 ~v~~~~~~~~l~~l~d~-~g~~i----~-~~--~~~~g~p~~~~~~~~~~~~---------------------------- 227 (302) ++++|..+..|..-+.. ++.+. + +. ..+.|++++..++...... T Consensus 155 ~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~ 234 (324) T protein:vir:99 155 FYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTV 234 (324) T ss_pred EEeChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCcccccccccccccccccccccccccccccccccc Confidence 88999999988543221 12221 1 11 1356777765554322110 Q ss_pred ---cceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEE--eeecccccCCCCC Q lcl|NC_011054. 228 ---VAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGD--NKTPVGAVVPDGS 302 (302) Q Consensus 228 ---~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l--t~~~a~~~~p~~~ 302 (302) ....++...+....+...++..+..++.. +-.-.+++.+.+|.++.||++++.+ .+..+..|+|+=- T Consensus 235 d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~--------~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~~~~~~ 306 (324) T protein:vir:99 235 GADNVVGLFVHRSAVATLKLKDMALERARRPE--------YQADQIIAKYAMGHGGLRPEAVGAIIFEDGETPAVAPDVI 306 (324) T ss_pred ccCceeEEEEehhheEEEeeecceecceechh--------hHHHhhhhhhhhcCcccccceEEEEEEccCccccccchhh Confidence 01112222222223333344444443321 1223466777789999999987544 3444445666422 No 152 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.63 E-value=1.3e-08 Score=63.74 Aligned_cols=275 Identities=10% Similarity=-0.003 Sum_probs=141.7 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhh---------cceeecCCCceEEEEEeC-Ccceeeeccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQA---------FPTVNMGTKTTHLPVLAT-LPGASWVSESATEPEGVK 70 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~---------~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~~~~~ 70 (302) ||.+.. .-.++|+.+..-+.+...+.+.|++- .....-++..+++|.+.. +.++.-+.|+..++... T Consensus 1 MA~T~l--sd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~k- 77 (351) T protein:vir:15 1 MAETHL--SDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNN- 77 (351) T ss_pred CCceee--eeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchhe- Confidence 995443 34577888877777776666665442 112223566889998864 35666777877665432 Q ss_pred cccccceeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccccccc- Q lcl|NC_011054. 71 PTSEATWADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQ- 149 (302) Q Consensus 71 ~~s~~~f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~- 149 (302) .+-.+-.-..++.+..+.++++...-+..+..+.+.+++++.+++..++.+|.-- +|++............. T Consensus 78 ----itt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l---~gv~~~~~~~~~~~~d~t 150 (351) T protein:vir:15 78 ----LTSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVL---KGVMGVTKIANSKVYDQT 150 (351) T ss_pred ----ecccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHH---HHHhhchhhcccceeccc Confidence 2333333444556666888887766677789999999999999999999888521 01100000000000000 Q ss_pred ceeeccccchHHHHHHHhhhhhhhhhhcc-cCccEEEecHHHHHHHHhhh------cCCCceeeecccccCcceEeeccc Q lcl|NC_011054. 150 DYTIVPGDANEDDLIGCINRASKAVAAAG-YMPDTLLASLGFRFDVANLR------DANGNPIFRDESFNGFGTYFNANG 222 (302) Q Consensus 150 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~~-~~~~~~v~~~~~~~~l~~l~------d~~g~~i~~~~~~~g~p~~~~~~~ 222 (302) .........+.+ .+.++...+-... ..-..|+||+..+..|++.. .++|..-+ +...|.++.+.+.. T Consensus 151 ~~~~~~~~is~~----~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i--~t~~G~~VivdD~~ 224 (351) T protein:vir:15 151 KVSPSEPMFGAK----GFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIETIQPQNGATPF--EAYNGLRIVLDDDI 224 (351) T ss_pred cccccccccCHH----HHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhhhccccccCccc--ceecceEEEEcCCC Confidence 001112223333 3445554544432 23478999999999998643 33332111 45677777666543 Q ss_pred ccCC---Ccc---eEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeee-ccc Q lcl|NC_011054. 223 AWPV---GVA---EALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKT-PVG 295 (302) Q Consensus 223 ~~~~---~~~---~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~-~a~ 295 (302) .... ... ..+|+. ..+.++..+ ..+++.++... ..++-.+..+.++- .+|..+..-..+ +.+ T Consensus 225 p~~~~~~~~~~ytsyl~~~-GAi~~~~~~-~~ve~~rd~~~------~~g~d~l~~r~~~~---~hp~G~s~~~~~~~~~ 293 (351) T protein:vir:15 225 EIDLTDKTKPVSTSYIFAP-GAVRYSTNM-RSTETKYDPLI------NGGQDVIVQKRVGT---IHVAGTSIKASFSPSK 293 (351) T ss_pred ccccCCCCCceeEEEEEec-ceeeeecCC-cCcceeecccC------CCCceEEEEeeeee---eeeeeeeecccccccC Confidence 3221 111 122221 222233332 23444443221 12333344444433 556565443221 122 Q ss_pred ccCCCCC Q lcl|NC_011054. 296 AVVPDGS 302 (302) Q Consensus 296 ~~~p~~~ 302 (302) ...|.-+ T Consensus 294 ~~sPt~~ 300 (351) T protein:vir:15 294 ASFPTID 300 (351) T ss_pred cCCcChH Confidence 3335444 No 153 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.62 E-value=1.1e-08 Score=64.29 Aligned_cols=273 Identities=14% Similarity=0.051 Sum_probs=155.3 Q ss_pred CCCccCCCcceecch---HHHHHHHHHHHhhhhhhhhcceee---cCCCceEEEEEeCCcceeeeccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQE---AYANDLLASAKKGSTVLQAFPTVN---MGTKTTHLPVLATLPGASWVSESATEPEGVKPTSE 74 (302) Q Consensus 1 Ma~~t~~~~g~liP~---~~~~~ii~~~~~~s~l~~~~~~~~---~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~ 74 (302) |-..+..++|.++-. .+...|++.....-..+++++..+ ....+..+...+....+.|++.... ..|..+ T Consensus 19 ~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~G~a~~~~d~~~----dip~vd 94 (314) T protein:vir:10 19 MGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGVGIAQIIADYSD----DLPLVD 94 (314) T ss_pred hcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeeeccccceeeeCCccc----ccceee Confidence 333333334555553 344667777776666666665442 2223566666777788888887643 234456 Q ss_pred cceeeEEeeeeeEEEeehhHHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhcccCC--Cccccccccccccccccc Q lcl|NC_011054. 75 ATWADRTLVAEEVAVIIPVHENVVDDA---STSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSPALLPAAVAANQ 149 (302) Q Consensus 75 ~~f~~i~l~~~ki~~~~~iS~ell~ds---~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~~~~~~~~~~~~ 149 (302) ..+++.....+.++..+.++..=++.+ ..++..--....++++.+.+|+.+|+|+.. -.|+++....+..... T Consensus 95 ~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~-- 172 (314) T protein:vir:10 95 AFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSAPHGIVSVFDQPNINNVVAT-- 172 (314) T ss_pred cccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccCC-- Confidence 677778888888888888876433332 457788888899999999999999999753 2355554443322111 Q ss_pred ceeeccccchHHHHHHHhhhhhhhhhh---cccCccEEEecHHHHHHHHhhhcCCCceeeec-----ccccCcceEeecc Q lcl|NC_011054. 150 DYTIVPGDANEDDLIGCINRASKAVAA---AGYMPDTLLASLGFRFDVANLRDANGNPIFRD-----ESFNGFGTYFNAN 221 (302) Q Consensus 150 ~~~~~~~~~~~~~~~~~i~~~~~~~~~---~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~-----~~~~g~p~~~~~~ 221 (302) ... .+.+.+++++.+++.++.. ....+..++++|+.+..|....+.+|.-++.- +.+...+++.... T Consensus 173 ----~~W-aT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~tvl~~l~~n~~~l~I~~~~el~~ 247 (314) T protein:vir:10 173 ----PNW-SVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLSYGELFTRNNPGLTIRFLQFLDN 247 (314) T ss_pred ----CCc-ccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCccHHHHHHHhCCCcEEEEcccccc Confidence 112 3567888888888888764 34566789999999998865545445433221 1111112222221 Q ss_pred cccCCCcceEEEEec--ceEEEEeecCcEEEEeecccccchhhhcCCc-EEEEEEEEec-cEEeccccEEEEeeeccc Q lcl|NC_011054. 222 GAWPVGVAEALVVDS--SRVRIGVRQDITVKFLDQATVGSINLAERDM-IALRLKARFA-YVLGNGATAVGDNKTPVG 295 (302) Q Consensus 222 ~~~~~~~~~~~~gd~--~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~-~~~r~~~r~d-~~v~~~~a~~~lt~~~a~ 295 (302) ... .++..+++-.- ..+-+.....++.-... .+++ .......|++ ..+.+|.+++++++..-+ T Consensus 248 ag~-~g~~~~v~y~~~~~~~~~~vp~~~~~l~~e----------~~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 248 YDG-AGGKAALAFEKSPLNMSIEIPEVTNVLPAQ----------PKDLHFRYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred cCC-CcceEEEEEecCCcEEEEecCccceeecce----------ecCceEEEcceeeeEEEEEECcceeEeeeeeecC Confidence 111 11222222221 22222222222211110 0111 2234566764 567899999999988766 No 154 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=98.58 E-value=1.2e-08 Score=64.08 Aligned_cols=225 Identities=11% Similarity=0.140 Sum_probs=140.0 Q ss_pred CCCc-----cCCCcceec-chH-HHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccc Q lcl|NC_011054. 1 MADI-----SRSEVATLI-QEA-YANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPT 72 (302) Q Consensus 1 Ma~~-----t~~~~g~li-P~~-~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (302) |+.+ |..+....+ |.. +...|||.+.++++|++.++++....+. ..+.++++-|.+.|..=++..++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~----- 75 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQP----- 75 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCc----- Confidence 6654 333333222 332 4457999999999999999998754433 34567788999999988877654 Q ss_pred cccceeeEEeeeeeEEEeehhHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccc---------- Q lcl|NC_011054. 73 SEATWADRTLVAEEVAVIIPVHENVVDDAS--TSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSP---------- 138 (302) Q Consensus 73 s~~~f~~i~l~~~ki~~~~~iS~ell~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~---------- 138 (302) ++.++.+++-..+-+.+.+.|.+.+.+... .++...-.+...+++...+.+.||+|+.+ |.++... T Consensus 76 s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~ 155 (331) T protein:vir:10 76 EKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) T ss_pred ccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccc Confidence 567889999999999999999999888643 34455566778899999999999999632 2111000 Q ss_pred ---ccccccccccc--c--------------------------------------------------------------- Q lcl|NC_011054. 139 ---ALLPAAVAANQ--D--------------------------------------------------------------- 150 (302) Q Consensus 139 ---~~~~~~~~~~~--~--------------------------------------------------------------- 150 (302) ........++. + T Consensus 156 ~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ 235 (331) T protein:vir:10 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) T ss_pred cccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEE Confidence 00000000000 0 Q ss_pred -eeec------cccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhh-hcCCCceeeecc--------cccCc Q lcl|NC_011054. 151 -YTIV------PGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANL-RDANGNPIFRDE--------SFNGF 214 (302) Q Consensus 151 -~~~~------~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l-~d~~g~~i~~~~--------~~~g~ 214 (302) .... ....+..++.+++..+...++........|.||++....|++. .+.......... .+.|. T Consensus 236 ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gi 315 (331) T protein:vir:10 236 RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGI 315 (331) T ss_pred EEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCe Confidence 0000 0122335567778888888877777778899999999999875 344332222211 24555 Q ss_pred ceEeecccccCCCcceEE Q lcl|NC_011054. 215 GTYFNANGAWPVGVAEAL 232 (302) Q Consensus 215 p~~~~~~~~~~~~~~~~~ 232 (302) |+..++. ...++..++ T Consensus 316 pir~~da--i~~tE~~Vv 331 (331) T protein:vir:10 316 PCRRTDA--LLLTEARVV 331 (331) T ss_pred eEEEeee--eecCccccC Confidence 6544432 122222222 No 155 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=98.58 E-value=1.2e-08 Score=64.08 Aligned_cols=225 Identities=11% Similarity=0.140 Sum_probs=140.0 Q ss_pred CCCc-----cCCCcceec-chH-HHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccc Q lcl|NC_011054. 1 MADI-----SRSEVATLI-QEA-YANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPT 72 (302) Q Consensus 1 Ma~~-----t~~~~g~li-P~~-~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (302) |+.+ |..+....+ |.. +...|||.+.++++|++.++++....+. ..+.++++-|.+.|..=++..++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~----- 75 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQP----- 75 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCc----- Confidence 6654 333333222 332 4457999999999999999998754433 34567788999999988877654 Q ss_pred cccceeeEEeeeeeEEEeehhHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccc---------- Q lcl|NC_011054. 73 SEATWADRTLVAEEVAVIIPVHENVVDDAS--TSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSP---------- 138 (302) Q Consensus 73 s~~~f~~i~l~~~ki~~~~~iS~ell~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~---------- 138 (302) ++.++.+++-..+-+.+.+.|.+.+.+... .++...-.+...+++...+.+.||+|+.+ |.++... T Consensus 76 s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~ 155 (331) T protein:vir:98 76 EKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) T ss_pred ccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccc Confidence 567889999999999999999999888643 34455566778899999999999999632 2111000 Q ss_pred ---ccccccccccc--c--------------------------------------------------------------- Q lcl|NC_011054. 139 ---ALLPAAVAANQ--D--------------------------------------------------------------- 150 (302) Q Consensus 139 ---~~~~~~~~~~~--~--------------------------------------------------------------- 150 (302) ........++. + T Consensus 156 ~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ 235 (331) T protein:vir:98 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) T ss_pred cccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEE Confidence 00000000000 0 Q ss_pred -eeec------cccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhh-hcCCCceeeecc--------cccCc Q lcl|NC_011054. 151 -YTIV------PGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANL-RDANGNPIFRDE--------SFNGF 214 (302) Q Consensus 151 -~~~~------~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l-~d~~g~~i~~~~--------~~~g~ 214 (302) .... ....+..++.+++..+...++........|.||++....|++. .+.......... .+.|. T Consensus 236 ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gi 315 (331) T protein:vir:98 236 RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGI 315 (331) T ss_pred EEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCe Confidence 0000 0122335567778888888877777778899999999999875 344332222211 24555 Q ss_pred ceEeecccccCCCcceEE Q lcl|NC_011054. 215 GTYFNANGAWPVGVAEAL 232 (302) Q Consensus 215 p~~~~~~~~~~~~~~~~~ 232 (302) |+..++. ...++..++ T Consensus 316 pir~~da--i~~tE~~Vv 331 (331) T protein:vir:98 316 PCRRTDA--LLLTEARVV 331 (331) T ss_pred eEEEeee--eecCccccC Confidence 6544432 122222222 No 156 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=98.58 E-value=1.2e-08 Score=64.08 Aligned_cols=225 Identities=11% Similarity=0.140 Sum_probs=140.0 Q ss_pred CCCc-----cCCCcceec-chH-HHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccc Q lcl|NC_011054. 1 MADI-----SRSEVATLI-QEA-YANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPT 72 (302) Q Consensus 1 Ma~~-----t~~~~g~li-P~~-~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (302) |+.+ |..+....+ |.. +...|||.+.++++|++.++++....+. ..+.++++-|.+.|..=++..++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~----- 75 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQP----- 75 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCc----- Confidence 6654 333333222 332 4457999999999999999998754433 34567788999999988877654 Q ss_pred cccceeeEEeeeeeEEEeehhHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccc---------- Q lcl|NC_011054. 73 SEATWADRTLVAEEVAVIIPVHENVVDDAS--TSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSP---------- 138 (302) Q Consensus 73 s~~~f~~i~l~~~ki~~~~~iS~ell~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~---------- 138 (302) ++.++.+++-..+-+.+.+.|.+.+.+... .++...-.+...+++...+.+.||+|+.+ |.++... T Consensus 76 s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~ 155 (331) T protein:vir:10 76 EKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAE 155 (331) T ss_pred ccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccc Confidence 567889999999999999999999888643 34455566778899999999999999632 2111000 Q ss_pred ---ccccccccccc--c--------------------------------------------------------------- Q lcl|NC_011054. 139 ---ALLPAAVAANQ--D--------------------------------------------------------------- 150 (302) Q Consensus 139 ---~~~~~~~~~~~--~--------------------------------------------------------------- 150 (302) ........++. + T Consensus 156 ~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ 235 (331) T protein:vir:10 156 NGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVV 235 (331) T ss_pred cccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEE Confidence 00000000000 0 Q ss_pred -eeec------cccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhh-hcCCCceeeecc--------cccCc Q lcl|NC_011054. 151 -YTIV------PGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANL-RDANGNPIFRDE--------SFNGF 214 (302) Q Consensus 151 -~~~~------~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l-~d~~g~~i~~~~--------~~~g~ 214 (302) .... ....+..++.+++..+...++........|.||++....|++. .+.......... .+.|. T Consensus 236 ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gi 315 (331) T protein:vir:10 236 RIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGI 315 (331) T ss_pred EEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCe Confidence 0000 0122335567778888888877777778899999999999875 344332222211 24555 Q ss_pred ceEeecccccCCCcceEE Q lcl|NC_011054. 215 GTYFNANGAWPVGVAEAL 232 (302) Q Consensus 215 p~~~~~~~~~~~~~~~~~ 232 (302) |+..++. ...++..++ T Consensus 316 pir~~da--i~~tE~~Vv 331 (331) T protein:vir:10 316 PCRRTDA--LLLTEARVV 331 (331) T ss_pred eEEEeee--eecCccccC Confidence 6544432 122222222 No 157 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.47 E-value=2.7e-08 Score=62.12 Aligned_cols=225 Identities=14% Similarity=0.165 Sum_probs=140.9 Q ss_pred CCCc-----cCCC-cceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeecccccccccccccc Q lcl|NC_011054. 1 MADI-----SRSE-VATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPTS 73 (302) Q Consensus 1 Ma~~-----t~~~-~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~s 73 (302) |+.. |..+ +.-+-|......|||.+.+.++|++.+++....... -...+.++-|.+.|..=++..++ + T Consensus 1 m~~~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR~lN~g~~~-----s 75 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLP-----N 75 (330) T ss_pred CCcCCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhhhcCCcccc-----c Confidence 6654 3333 233556777788999999999999999987543322 12345677889999888777654 5 Q ss_pred ccceeeEEeeeeeEEEeehhHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccc----------- Q lcl|NC_011054. 74 EATWADRTLVAEEVAVIIPVHENVVDDAS--TSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSP----------- 138 (302) Q Consensus 74 ~~~f~~i~l~~~ki~~~~~iS~ell~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~----------- 138 (302) +.++.+++-..+-+.+.+.|-+.+.+... .++...-.+...+++...+.+.+|+|+.+ |.++... T Consensus 76 ~~tt~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~ 155 (330) T protein:vir:10 76 KSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAEN 155 (330) T ss_pred cceEEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCCc Confidence 67999999999999999999998887643 34556667788999999999999999532 2211100 Q ss_pred --ccccccccccc----------------------------------c-------------------------------- Q lcl|NC_011054. 139 --ALLPAAVAANQ----------------------------------D-------------------------------- 150 (302) Q Consensus 139 --~~~~~~~~~~~----------------------------------~-------------------------------- 150 (302) ..+.+....+. . T Consensus 156 ~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~v 235 (330) T protein:vir:10 156 KDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYV 235 (330) T ss_pred hhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccE Confidence 00000000000 0 Q ss_pred -------eeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhh-hcCCCcee-eec------ccccCcc Q lcl|NC_011054. 151 -------YTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANL-RDANGNPI-FRD------ESFNGFG 215 (302) Q Consensus 151 -------~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l-~d~~g~~i-~~~------~~~~g~p 215 (302) ............+.+++..+...++........|.||++....|++. .+.++-.+ +.. -.+.|.| T Consensus 236 vRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g~~~t~~~gip 315 (330) T protein:vir:10 236 ARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIP 315 (330) T ss_pred EEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCCeeeEEECCeE Confidence 00011122345677788888888887778888999999999999885 34433222 111 1344555 Q ss_pred eEeecccccCCCcceEE Q lcl|NC_011054. 216 TYFNANGAWPVGVAEAL 232 (302) Q Consensus 216 ~~~~~~~~~~~~~~~~~ 232 (302) +..++. ...++..++ T Consensus 316 ir~~Da--il~tE~~vv 330 (330) T protein:vir:10 316 VQRTDA--LLNTESRVV 330 (330) T ss_pred EEEEee--eecCccccC Confidence 544432 122222222 No 158 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.41 E-value=4.9e-08 Score=60.68 Aligned_cols=278 Identities=11% Similarity=0.019 Sum_probs=142.8 Q ss_pred CCCccCCC-cceec-chHHHHHHHHHHHhhhhhhhhcceeec-CCCceEEEEEeCCcceeeeccccccccccccccccce Q lcl|NC_011054. 1 MADISRSE-VATLI-QEAYANDLLASAKKGSTVLQAFPTVNM-GTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATW 77 (302) Q Consensus 1 Ma~~t~~~-~g~li-P~~~~~~ii~~~~~~s~l~~~~~~~~~-~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f 77 (302) |+.++.+. ...+| |+.|+.+|+.-+++......+.++... .+.+++||... .++..=..++..+.-+... ..++ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg-~~tV~dY~~~~~i~~d~lt--t~~~ 77 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVG-TPVVRSRPEQGDFTFDNLD--TGEI 77 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEecccc-ccccccccCCCCcccccCC--CceE Confidence 88766543 44455 999999999888888776666654432 46678888653 2332223333333222111 1122 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----ccCCCcccccccccccccccccceee Q lcl|NC_011054. 78 ADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF----GTDKPSSWVSPALLPAAVAANQDYTI 153 (302) Q Consensus 78 ~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g~~~g~~~~~~~~~~~~~~~~~~~ 153 (302) .+.++..|+.++. |+++.. +...++.+...++.+++++..+|+.+.. |..+-++........ ... .. . T Consensus 78 -~l~IDq~KYfaf~-VdDD~~-Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin---~~~-~~-i 149 (322) T protein:vir:31 78 -SIILRDEVYAGNA-ISKKLR-QDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVIN---GVP-HR-F 149 (322) T ss_pred -EEEEehhhhhccc-cchhHH-HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceec---CCc-cc-e Confidence 4556666676665 777554 4578999999999999999999988742 211100100000000 000 00 0 Q ss_pred ccccchHHHHHHHhhhhhhhhhhccc--CccEEEecHHHHHHHHhhh-----cCCCce--eeec---------ccccCcc Q lcl|NC_011054. 154 VPGDANEDDLIGCINRASKAVAAAGY--MPDTLLASLGFRFDVANLR-----DANGNP--IFRD---------ESFNGFG 215 (302) Q Consensus 154 ~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~l~-----d~~g~~--i~~~---------~~~~g~p 215 (302) ..........++.+.++..++..... ...++|.+|..+..|..+. -.++|+ +.+. ..+.|+- T Consensus 150 v~~gt~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg~~~GF~ 229 (322) T protein:vir:31 150 VGTGTDQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRSVYGID 229 (322) T ss_pred eccCCCchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHHHHhcee Confidence 11111222233455555555544332 2356788999988884431 123333 1111 2356667 Q ss_pred eEeecccccCCCcceEE---------EEecceEEE----------EeecCcEEEEeecccccchhhhcCCcEEEEEEEEe Q lcl|NC_011054. 216 TYFNANGAWPVGVAEAL---------VVDSSRVRI----------GVRQDITVKFLDQATVGSINLAERDMIALRLKARF 276 (302) Q Consensus 216 ~~~~~~~~~~~~~~~~~---------~gd~~~~~~----------~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~ 276 (302) ++..++.. .++..++ .|-++.+.. +-++.|. -.++- +.-.+..-.+|..+|+ T Consensus 230 V~~SN~l~--~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~---~~e~~----r~~~~~~d~~~~~~~~ 300 (322) T protein:vir:31 230 LFVSNLLA--DANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMP---TTKSF----IDDYNDDLNTATTARW 300 (322) T ss_pred eeeecccc--ccccccccCcccccccceeecccccccchhhhhhhhHhhhhh---hhhcc----cCccccccceeeeeee Confidence 76666532 1111111 222222211 1111110 00110 0112233457899999 Q ss_pred ccEEeccccEEEEeeecccccCC Q lcl|NC_011054. 277 AYVLGNGATAVGDNKTPVGAVVP 299 (302) Q Consensus 277 d~~v~~~~a~~~lt~~~a~~~~p 299 (302) |.++.+|+.++.+.... ..+|= T Consensus 301 g~g~~r~e~l~~~~a~~-~~~~~ 322 (322) T protein:vir:31 301 GNGLVRDENLVCVLANA-DKVTF 322 (322) T ss_pred cceeecccceEEEEecc-ccccC Confidence 99999999998777543 33333 No 159 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.37 E-value=1.3e-07 Score=58.29 Aligned_cols=291 Identities=10% Similarity=0.003 Sum_probs=145.9 Q ss_pred CCCccCCCcc---------eecchHHHHHHHHHHHhhhhhhhhcceeecCCC-ceEEEEEeCCcceeeeccccccccccc Q lcl|NC_011054. 1 MADISRSEVA---------TLIQEAYANDLLASAKKGSTVLQAFPTVNMGTK-TTHLPVLATLPGASWVSESATEPEGVK 70 (302) Q Consensus 1 Ma~~t~~~~g---------~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~ 70 (302) |+..+....+ .+.=+.+..++.+.....+.++++.++.++.++ +.++|+. +..+++.+.-|++... . T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G~s~a~y~~pG~~ldg-~- 77 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAA-T- 77 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEEeeecCCCCcCC-C- Confidence 7776554321 234467778889999999999999999988755 6788876 7777888877776432 2 Q ss_pred cccccceeeEEeeeee-EEEeehhHH--HHHhcchHH-HHHHHHHHHHHHHHHHHHHHhhc----c----cCCCcccccc Q lcl|NC_011054. 71 PTSEATWADRTLVAEE-VAVIIPVHE--NVVDDASTS-LLEEIAALGGQAIGKKLDQAVIF----G----TDKPSSWVSP 138 (302) Q Consensus 71 ~~s~~~f~~i~l~~~k-i~~~~~iS~--ell~ds~~~-~~~~i~~~l~~ai~~~~d~~~l~----G----~g~~~g~~~~ 138 (302) .+..++..+.... +.....|-+ |.+ +..+ +.+.+.+++.+++++.+|+.+|. + +..+.+.... T Consensus 78 ---~~~~dk~~ItIDtLL~a~~~V~dlDd~q--~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g 152 (400) T protein:vir:10 78 ---STQADKNQLVIDATVIARNTVAHLHDVQ--GDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRV 152 (400) T ss_pred ---CcccCcEEEEeCceeeecchhhhHHHHh--hccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCc Confidence 2333344444332 222233322 222 3455 78999999999999999998762 2 2222221111 Q ss_pred cccccccccccceeeccccchHHHHHHHhhhhhhhhhhcccCc--cEEEecHHHHHHHHhhh-----c----CCCceeee Q lcl|NC_011054. 139 ALLPAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYMP--DTLLASLGFRFDVANLR-----D----ANGNPIFR 207 (302) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~l~-----d----~~g~~i~~ 207 (302) ...+..... .........+...+.+.+..+...+...+... -.+++.|..|+.|.... | .+|-++.. T Consensus 153 ~~~g~s~~v--~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g 230 (400) T protein:vir:10 153 KGHGFSVNV--EVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQG 230 (400) T ss_pred cccccceee--cccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccc Confidence 111111000 11112222344566666666666665444332 35778889998886431 1 11222211 Q ss_pred c-ccccCcceEeecccccCC-------------CcceEEEEecceE--EEEeecCc-EEEEeecccccchhhhcCCcEEE Q lcl|NC_011054. 208 D-ESFNGFGTYFNANGAWPV-------------GVAEALVVDSSRV--RIGVRQDI-TVKFLDQATVGSINLAERDMIAL 270 (302) Q Consensus 208 ~-~~~~g~p~~~~~~~~~~~-------------~~~~~~~gd~~~~--~~~~~~~~-~i~~~~~~~~~~~~~~~~~~~~~ 270 (302) . -.+.|.|++-..+.+... +...-+-||++.- ++..++.+ .++..+ .+......-.+-...+ T Consensus 231 ~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~-lt~~~~~d~r~~~~~i 309 (400) T protein:vir:10 231 FVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSID-VIGDIFYEKKEKTYYI 309 (400) T ss_pred eEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeec-cccccccchhhHHHHH Confidence 1 135666665554432211 1111123555331 22222221 112111 0000000001111124 Q ss_pred EEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 271 RLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 271 r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) -+.+-++..+.||+++..++-..-+-..-++- T Consensus 310 d~~~a~G~g~~RPeaa~vv~~~~~~~~~~~~~ 341 (400) T protein:vir:10 310 DTFMSEGAIPDRWEAVSVVTTKRQSTGAVDSG 341 (400) T ss_pred HHHHHhCCcccchhheEEEEecCCcccccccC Confidence 45566899999999988777433221111111 No 160 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.36 E-value=5.5e-08 Score=60.39 Aligned_cols=255 Identities=13% Similarity=0.011 Sum_probs=136.2 Q ss_pred CCCccCCCcceecchHH---HHHHHHHHHhhhhhhhhcceeecCCC-ceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAY---ANDLLASAKKGSTVLQAFPTVNMGTK-TTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~---~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) ||....+....|.+..- .+.+-..+.+-..++...+.+|+..+ .+++|.+.-...+.=|+||+++|- ++.+ T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Ipl-----skvt 75 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIPL-----SKVT 75 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCcccch-----hhhe Confidence 99987877777764333 23343344444555666688998865 689999988888888999998764 4444 Q ss_pred ee---eEEeeeeeEEEeehhHHHHHhcch-HHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccccccccee Q lcl|NC_011054. 77 WA---DRTLVAEEVAVIIPVHENVVDDAS-TSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYT 152 (302) Q Consensus 77 f~---~i~l~~~ki~~~~~iS~ell~ds~-~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~ 152 (302) .. ..+++.+|++..+ |.|.++.+. .+....-.++|..++++++|+.+|.--.+.+ . T Consensus 76 ~~~~~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat------------------~ 135 (295) T protein:vir:99 76 RTKDKDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKP------------------T 135 (295) T ss_pred eeeeeeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCc------------------e Confidence 43 3667778877754 999986553 4667888899999999999999995221110 0 Q ss_pred eccccchHHHHHHHhhhh---hhhhhhcccCccEEEecHHHHHHHHhhhcCC-------C-ceeeecccccCcc-eEeec Q lcl|NC_011054. 153 IVPGDANEDDLIGCINRA---SKAVAAAGYMPDTLLASLGFRFDVANLRDAN-------G-NPIFRDESFNGFG-TYFNA 220 (302) Q Consensus 153 ~~~~~~~~~~~~~~i~~~---~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~-------g-~~i~~~~~~~g~p-~~~~~ 220 (302) ..+ .+.+...+..+ ....-..+..+...++||.....+++-..-+ | .|| . .+.|+. +.... T Consensus 136 t~t----g~~lq~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L-~--nfLG~q~II~S~ 208 (295) T protein:vir:99 136 KVK----GVGLQKALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLL-K--NFLGMQNVIVMP 208 (295) T ss_pred eee----hhhHHHHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhhhhhhhhh-h--hhhccceEEEcc Confidence 011 11111222222 2222233344556778999998887644222 1 122 1 245554 22222 Q ss_pred ccccCCCcceEEEEecceEEE--E-ee-cCcEEEEeecccccchhhhcCCcEEEEEEEE--------------eccE--E Q lcl|NC_011054. 221 NGAWPVGVAEALVVDSSRVRI--G-VR-QDITVKFLDQATVGSINLAERDMIALRLKAR--------------FAYV--L 280 (302) Q Consensus 221 ~~~~~~~~~~~~~gd~~~~~~--~-~~-~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r--------------~d~~--v 280 (302) . ..++.++.--...+.+ . .. +++. +...+..|.+.|....+ -++. . T Consensus 209 k----v~~G~~~aT~~~Ni~~ay~~~~~g~l~----------~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfp 274 (295) T protein:vir:99 209 S----VPEGKIYSTAVENLVFASLNVKGGDLG----------GLFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFA 274 (295) T ss_pred c----CCCceEEEeeccceEEEEecCCchhhh----------hhhhhccCcccceEEEeccccceeeehhhhHhHHHhcc Confidence 2 2334444333322221 1 11 1111 00111222222222111 1111 2 Q ss_pred eccccEEEEeeecccccCCCCC Q lcl|NC_011054. 281 GNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 281 ~~~~a~~~lt~~~a~~~~p~~~ 302 (302) .+++++++.+-..... +..|- T Consensus 275 E~~dgiv~~tI~~~~~-~~~~~ 295 (295) T protein:vir:99 275 EIPEGVVEATIEAAAV-PGIGG 295 (295) T ss_pred cccceEEEEEEecCcC-CCCCC Confidence 4667888877643222 22222 No 161 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.36 E-value=1.4e-07 Score=58.22 Aligned_cols=285 Identities=11% Similarity=-0.002 Sum_probs=146.1 Q ss_pred CCCccCCCcce---------ecchHHHHHHHHHHHhhhhhhhhcceeecCC-CceEEEEEeCCcceeeeccccccccccc Q lcl|NC_011054. 1 MADISRSEVAT---------LIQEAYANDLLASAKKGSTVLQAFPTVNMGT-KTTHLPVLATLPGASWVSESATEPEGVK 70 (302) Q Consensus 1 Ma~~t~~~~g~---------liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~E~~~~~~~~~ 70 (302) |+..+....+. +.=+.+..++.+.....+.++++.++.++.+ ++.++|+. +..+++.+.-|++.. +. T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~i-G~~~a~y~~~G~~ld-g~- 77 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPN-AT- 77 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEE-eeeEEeeeccccccC-CC- Confidence 87766644322 2237778889999999999999998888765 47888876 566666666555532 22 Q ss_pred cccccceeeEEeeee--eEEEeehhHHHHHhcchHH-HHHHHHHHHHHHHHHHHHHHhhc-----cc---CCCccccccc Q lcl|NC_011054. 71 PTSEATWADRTLVAE--EVAVIIPVHENVVDDASTS-LLEEIAALGGQAIGKKLDQAVIF-----GT---DKPSSWVSPA 139 (302) Q Consensus 71 ~~s~~~f~~i~l~~~--ki~~~~~iS~ell~ds~~~-~~~~i~~~l~~ai~~~~d~~~l~-----G~---g~~~g~~~~~ 139 (302) .+..++..+... .++.. .|-+----++..+ +.+.+.+++.+++++.+|+.++. +- ..+. ..+.. T Consensus 78 ---~~~~~k~~ItID~lL~a~~-~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~-~~~~~ 152 (402) T protein:vir:97 78 ---PTQADKNQLVIDTTVIARN-TVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAER-NKPRV 152 (402) T ss_pred ---CcccccEEEEeCceeechh-hhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-ccCcc Confidence 233344444443 33222 2221101123455 68899999999999999998863 11 1111 01111 Q ss_pred ccccccccccceeeccccchHHHHHHHhhhhhhhhhhcccCc--cEEEecHHHHHHHHhhhc---------CCCceeeec Q lcl|NC_011054. 140 LLPAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYMP--DTLLASLGFRFDVANLRD---------ANGNPIFRD 208 (302) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~l~d---------~~g~~i~~~ 208 (302) .........+.+......+...+.+.+.++...+...+... ..++++|..|..|.+-.+ .+|.+.... T Consensus 153 -~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~ 231 (402) T protein:vir:97 153 -KGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGF 231 (402) T ss_pred -cccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccce Confidence 11111111222222334566777777777776666544333 468899999999976421 122222111 Q ss_pred -ccccCcceEeecccccCC-------------CcceEEEEecce--EEEEeecC--------cEEEEeecccccchhhhc Q lcl|NC_011054. 209 -ESFNGFGTYFNANGAWPV-------------GVAEALVVDSSR--VRIGVRQD--------ITVKFLDQATVGSINLAE 264 (302) Q Consensus 209 -~~~~g~p~~~~~~~~~~~-------------~~~~~~~gd~~~--~~~~~~~~--------~~i~~~~~~~~~~~~~~~ 264 (302) ..+.|.+++...+.+... +...-+-+|+.. .++..+.- ++-++.++... T Consensus 232 v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~------- 304 (402) T protein:vir:97 232 VLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKE------- 304 (402) T ss_pred eEEEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhH------- Confidence 246677776665543211 111112244432 22222221 11111111110 Q ss_pred CCcEEEEEEEEeccEEeccccEEEEeeec--ccccCCC-----CC Q lcl|NC_011054. 265 RDMIALRLKARFAYVLGNGATAVGDNKTP--VGAVVPD-----GS 302 (302) Q Consensus 265 ~~~~~~r~~~r~d~~v~~~~a~~~lt~~~--a~~~~p~-----~~ 302 (302) -...+-+.+-+|..+.||++...++-.. ..+.+|. .| T Consensus 305 -~~~~id~~~a~G~g~~RPeaa~vv~~~~~~t~~~~~~~~~~~~~ 348 (402) T protein:vir:97 305 -KTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDHAT 348 (402) T ss_pred -HHHHHHHHHHhCCcccCccceEEEEEecccccccCCccccchhh Confidence 0011334455788899998877774322 2222222 11 No 162 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.33 E-value=1.3e-07 Score=58.32 Aligned_cols=292 Identities=10% Similarity=0.002 Sum_probs=145.0 Q ss_pred CCCccCCCcc---------eecchHHHHHHHHHHHhhhhhhhhcceeecCCC-ceEEEEEeCCcceeeeccccccccccc Q lcl|NC_011054. 1 MADISRSEVA---------TLIQEAYANDLLASAKKGSTVLQAFPTVNMGTK-TTHLPVLATLPGASWVSESATEPEGVK 70 (302) Q Consensus 1 Ma~~t~~~~g---------~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~ 70 (302) |+..+....+ .+.=+.+..++.+.....+.++++.++.++.++ +.++|+. +..+++.+.-|++.. .. T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G~s~~~~~~pG~~ld-~~- 77 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPA-AT- 77 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEeeeecCCCCcC-CC- Confidence 8776654422 234466778888999999999999999988755 6888876 667777777666543 22 Q ss_pred cccccceeeEEeeee--eEEEeehhHHHHHhcchHH-HHHHHHHHHHHHHHHHHHHHhhc-----ccC--CCcccccccc Q lcl|NC_011054. 71 PTSEATWADRTLVAE--EVAVIIPVHENVVDDASTS-LLEEIAALGGQAIGKKLDQAVIF-----GTD--KPSSWVSPAL 140 (302) Q Consensus 71 ~~s~~~f~~i~l~~~--ki~~~~~iS~ell~ds~~~-~~~~i~~~l~~ai~~~~d~~~l~-----G~g--~~~g~~~~~~ 140 (302) .+..++..|... +++. ..|-+----++..+ +.+.+.+++.+++++.+|+.++. |-. .+....+.+ T Consensus 78 ---~~~~dK~~ItID~lL~a~-~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~- 152 (401) T protein:vir:70 78 ---STQADKNQLVIDATVIAR-NTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRV- 152 (401) T ss_pred ---CcccccEEEEeCceeehh-hhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCc- Confidence 233344444433 2222 22221101123455 68899999999999999987742 211 000001100 Q ss_pred cccccccccceeeccccchHHHHHHHhhhhhhhhhhcccCcc--EEEecHHHHHHHHhh---hc------CCCceeeec- Q lcl|NC_011054. 141 LPAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYMPD--TLLASLGFRFDVANL---RD------ANGNPIFRD- 208 (302) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~l---~d------~~g~~i~~~- 208 (302) .+................+...+.+.+.++...+...+.... .+++.|..|+.|... -| .+|.++... T Consensus 153 ~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G~v 232 (401) T protein:vir:70 153 KGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQGFT 232 (401) T ss_pred CCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccchhhccccCCccccceE Confidence 000000011111122234556677777777777665554433 466788888888553 11 122222111 Q ss_pred ccccCcceEeecccccCC-------------CcceEEEEecceE--EEEeecCcE-EEEeecccccchhhhcCCcEEEEE Q lcl|NC_011054. 209 ESFNGFGTYFNANGAWPV-------------GVAEALVVDSSRV--RIGVRQDIT-VKFLDQATVGSINLAERDMIALRL 272 (302) Q Consensus 209 ~~~~g~p~~~~~~~~~~~-------------~~~~~~~gd~~~~--~~~~~~~~~-i~~~~~~~~~~~~~~~~~~~~~r~ 272 (302) -.+.|.|++...+.+... +...-+-||++.- ++..++.+- ++..+ .+........+-...+-+ T Consensus 233 ~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~-lt~~~~~d~r~~~~~id~ 311 (401) T protein:vir:70 233 LSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSID-VTGDIFYEKKEKTYYIDT 311 (401) T ss_pred EEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeec-cccchhhhhhhhHHHHHH Confidence 135677766655433211 1112223455331 222222211 12111 110000000001112335 Q ss_pred EEEeccEEeccccEEEEeeecccccCCC-----CC Q lcl|NC_011054. 273 KARFAYVLGNGATAVGDNKTPVGAVVPD-----GS 302 (302) Q Consensus 273 ~~r~d~~v~~~~a~~~lt~~~a~~~~p~-----~~ 302 (302) .+-+|..+.||++...++-+. ..+||+ || T Consensus 312 ~~a~g~g~~RPeaa~vv~~k~-~~~~~~~~~~~~~ 345 (401) T protein:vir:70 312 FMAEGAIPDRWEAVSVVTTKR-NTTTGAVEGTDGA 345 (401) T ss_pred HHHhCCcccchhheEEEeecC-cccccccccCCcc Confidence 566899999999986664221 223333 33 No 163 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=98.27 E-value=1.4e-07 Score=58.12 Aligned_cols=225 Identities=14% Similarity=0.096 Sum_probs=133.7 Q ss_pred CCCccC-----CCcce-ecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeecccccccccccccc Q lcl|NC_011054. 1 MADISR-----SEVAT-LIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPTS 73 (302) Q Consensus 1 Ma~~t~-----~~~g~-liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~s 73 (302) |+.... .+... +-|......|||.+.+.++|++.+++....... -...+.++-|.+.|..=++..++ + T Consensus 1 m~~~~~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~fR~lN~g~~~-----s 75 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPVWRRYNQGVQP-----T 75 (335) T ss_pred CCcCCCCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCchhhhcCCcccc-----c Confidence 665533 33222 446666777999999999999999987543322 12345677889999888777654 5 Q ss_pred ccceeeEEeeeeeEEEeehhHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhcccCC--Ccccccc----------- Q lcl|NC_011054. 74 EATWADRTLVAEEVAVIIPVHENVVDDAS--TSLLEEIAALGGQAIGKKLDQAVIFGTDK--PSSWVSP----------- 138 (302) Q Consensus 74 ~~~f~~i~l~~~ki~~~~~iS~ell~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~g~~~~----------- 138 (302) +.++.+++-..+-+.+.+.|-+.+.+... -++...-.+...+++...+.+.+|+|+.+ |.++... T Consensus 76 ~~tt~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~ 155 (335) T protein:vir:73 76 KTQTVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTSK 155 (335) T ss_pred cceEEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccccc Confidence 67999999999999999999998777543 34566666778999999999999999642 2221100 Q ss_pred --cccccccccccc------------------------------------------------------------------ Q lcl|NC_011054. 139 --ALLPAAVAANQD------------------------------------------------------------------ 150 (302) Q Consensus 139 --~~~~~~~~~~~~------------------------------------------------------------------ 150 (302) ...+....+++. T Consensus 156 a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~ 235 (335) T protein:vir:73 156 AASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRS 235 (335) T ss_pred cCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCccc Confidence 000000000000 Q ss_pred ------eeec---cccchHHHHHHHhhhhhh--hhhhcccCccEEEecHHHHHHHHhhh-cCCCceeeecc--------c Q lcl|NC_011054. 151 ------YTIV---PGDANEDDLIGCINRASK--AVAAAGYMPDTLLASLGFRFDVANLR-DANGNPIFRDE--------S 210 (302) Q Consensus 151 ------~~~~---~~~~~~~~~~~~i~~~~~--~~~~~~~~~~~~v~~~~~~~~l~~l~-d~~g~~i~~~~--------~ 210 (302) .... .-..+...+.+++..++. .++........|.||++....|++.. +..+..+ ... . T Consensus 236 vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l-~~~~~~g~~~t~ 314 (335) T protein:vir:73 236 ISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNL-TIEEYGGKKIVS 314 (335) T ss_pred EEEEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceee-eeeccCCceeEE Confidence 0000 001233455566655553 34555555678999999999998754 4433222 222 2 Q ss_pred ccCcceEeecccccCCCcceEEE Q lcl|NC_011054. 211 FNGFGTYFNANGAWPVGVAEALV 233 (302) Q Consensus 211 ~~g~p~~~~~~~~~~~~~~~~~~ 233 (302) +.|.|+..++. ...++..+.. T Consensus 315 ~~gipir~~Da--il~tE~~v~~ 335 (335) T protein:vir:73 315 FLGIPIRRVDA--ILNTESAVTA 335 (335) T ss_pred ECCeEEEEEee--eecCcccccC Confidence 33445444332 1122222211 No 164 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.11 E-value=1.3e-06 Score=52.81 Aligned_cols=279 Identities=10% Similarity=0.029 Sum_probs=138.0 Q ss_pred CCCccCC--------C-cceecchHHHHHHHHHHHh-hhhhhhhcceeecCCCceEEEEEeCCcceeeecccc---cccc Q lcl|NC_011054. 1 MADISRS--------E-VATLIQEAYANDLLASAKK-GSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESA---TEPE 67 (302) Q Consensus 1 Ma~~t~~--------~-~g~liP~~~~~~ii~~~~~-~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~---~~~~ 67 (302) |+.+..= + ...+| +++.+++....++ .+.|++-++..+-.++.-.+-.. +...+.-+.++. ...+ T Consensus 1 ~~~~~~~~~~~~Ms~~i~~~fv-~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~d 78 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDIDQAFV-QTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETL-ASMDPDAVKRKRSRQQSAD 78 (322) T ss_pred CcccceeeeeeeeechhhhHHH-HHHHHHHHHHHHHhhhhhhcccccccccccccceeec-ccccccccccccccccccC Confidence 2221111 1 11233 5666666544444 56676666644433332111111 111111111111 1111 Q ss_pred c--ccccccc--ceeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccc Q lcl|NC_011054. 68 G--VKPTSEA--TWADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPA 143 (302) Q Consensus 68 ~--~~~~s~~--~f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~ 143 (302) . ..|.... ....+.+..+ .....|.+.-..+...+..+...+..+.+++++.|+.++.+-=.+..... ..... T Consensus 79 ~~~dtp~~~~~~~~r~~~~~d~--~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~-~gt~v 155 (322) T protein:vir:10 79 GTYPTPVNNKPFAKRRTNVDTY--DTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKG-TGQPV 155 (322) T ss_pred cccCCCccccccceEEEeeccc--ccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccc-ccccc Confidence 1 1111122 2333444444 34467776655556778899999999999999999988864110000000 00000 Q ss_pred ccccccceeeccccchHHHHHHHhhhhhhhhhhcccC---ccEEEecHHHHHHHHhhhcCC-----C-ceeee---cccc Q lcl|NC_011054. 144 AVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYM---PDTLLASLGFRFDVANLRDAN-----G-NPIFR---DESF 211 (302) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~---~~~~v~~~~~~~~l~~l~d~~-----g-~~i~~---~~~~ 211 (302) ...........+...+.+. +.++...+.....+ +.+++.+|..|..|....... | ..+.. -..+ T Consensus 156 ~~~ss~~i~~g~~g~t~~k----l~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~ig~~ 231 (322) T protein:vir:10 156 EFLATQEIGDGTKPISFDY----VTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGIITNW 231 (322) T ss_pred ccCCCcccccCccchhHHH----HHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCeeeee Confidence 0000111111122333443 44455555544433 246889999999997644221 1 22322 1245 Q ss_pred cCcceEeeccccc--------------CCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEec Q lcl|NC_011054. 212 NGFGTYFNANGAW--------------PVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFA 277 (302) Q Consensus 212 ~g~p~~~~~~~~~--------------~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d 277 (302) .|+.+........ .......+++..+.+.++...++..++....... +...+++.+-+| T Consensus 232 lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~~-------~a~~I~~~~~~G 304 (322) T protein:vir:10 232 MGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSAS-------FAWRIYSAFTAD 304 (322) T ss_pred eeEEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCcc-------hhhhhhhhhhhC Confidence 6666655543321 1122345677777777887777777665433221 223466778899 Q ss_pred cEEeccccEEEEeeeccc Q lcl|NC_011054. 278 YVLGNGATAVGDNKTPVG 295 (302) Q Consensus 278 ~~v~~~~a~~~lt~~~a~ 295 (302) .++.+|+.++.+.-..+= T Consensus 305 a~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 305 CVRVEDEHIFKLRLKNSL 322 (322) T ss_pred ceEeccCcEEEEEEeccC Confidence 999999999999975443 No 165 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=98.09 E-value=1.7e-06 Score=52.19 Aligned_cols=269 Identities=12% Similarity=0.060 Sum_probs=114.4 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhccee---ec---CCCceEEEEEeCCcceeeeccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTV---NM---GTKTTHLPVLATLPGASWVSESATEPEGVKPTSE 74 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~---~~---~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~ 74 (302) ||+. .++|+.|+.++++.+++..++.+++..- .. .+..++||+... ..+.+...............+ T Consensus 1 Ma~~------~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) T protein:vir:99 1 MANA------FSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) T ss_pred Cccc------cccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccc-ccceeeeccccccCCcccccc Confidence 9953 3889999999999999999988887432 22 245688876543 333332211110000011112 Q ss_pred cceeeEEeee-eeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccc-CCCccccccccccccccccccee Q lcl|NC_011054. 75 ATWADRTLVA-EEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGT-DKPSSWVSPALLPAAVAANQDYT 152 (302) Q Consensus 75 ~~f~~i~l~~-~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~-g~~~g~~~~~~~~~~~~~~~~~~ 152 (302) .+-..+++.. +..+.-+.|+++-......++...+.+...+++++++|..++.-- +.+. ... T Consensus 74 ~~~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~------------~~~---- 137 (392) T protein:vir:99 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPY------------EAA---- 137 (392) T ss_pred cccceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------ccc---- Confidence 2233444444 223334556666555556788888888899999999999887310 1100 000 Q ss_pred eccccchHHHHHHHhhhhhhhhhhccc-CccEEEecHHHHHHHHhhh-----cCCC---ceeeec---ccccCcceEeec Q lcl|NC_011054. 153 IVPGDANEDDLIGCINRASKAVAAAGY-MPDTLLASLGFRFDVANLR-----DANG---NPIFRD---ESFNGFGTYFNA 220 (302) Q Consensus 153 ~~~~~~~~~~~~~~i~~~~~~~~~~~~-~~~~~v~~~~~~~~l~~l~-----d~~g---~~i~~~---~~~~g~p~~~~~ 220 (302) ......+.+..++.+.++...+..... ...+++++|..+..|.+.. +.-| ...+.. ..+.|+.++... T Consensus 138 ~~~~~~~~~~~~~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~ 217 (392) T protein:vir:99 138 GAVHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVEST 217 (392) T ss_pred ccccccChhhhHHHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeec Confidence 011112233344555556555544332 2346889999999987531 1111 111111 134555555444 Q ss_pred ccccCCCcceEEEEecceEEEEeec-----------------CcEEEEeecccccchhhhcCCcEEEEEEEEeccEEec- Q lcl|NC_011054. 221 NGAWPVGVAEALVVDSSRVRIGVRQ-----------------DITVKFLDQATVGSINLAERDMIALRLKARFAYVLGN- 282 (302) Q Consensus 221 ~~~~~~~~~~~~~gd~~~~~~~~~~-----------------~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~- 282 (302) +.... ..+.+..+.+...... .+...+..+... -+..+...+.. ..+..... T Consensus 218 ~~~~~----t~~a~~~~a~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~----t~~s~~~~v~~--~~g~~~v~~ 287 (392) T protein:vir:99 218 LIPHG----DAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDS----TITSNRSLIDT--YFGLKVVED 287 (392) T ss_pred ccccc----cceeeeccccccccccccccccccceeEEecccceecceeecccc----eeeccccccce--eEEEEEEee Confidence 32111 1122222211111100 011111100000 00011111111 11111111 Q ss_pred --cccEEEEeeecc-------cccCCCCC Q lcl|NC_011054. 283 --GATAVGDNKTPV-------GAVVPDGS 302 (302) Q Consensus 283 --~~a~~~lt~~~a-------~~~~p~~~ 302 (302) ..++........ ..+.+.-. T Consensus 288 ~~~~~~~~~~~~~~~~~~v~v~~v~~~~~ 316 (392) T protein:vir:99 288 PNGVGFVRARKIHLIPGSIEVAPEAGANA 316 (392) T ss_pred ccccceeeeeeeeeecceeeeeeeecccc Confidence 111111000000 01111111 No 166 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=97.95 E-value=9.3e-06 Score=48.16 Aligned_cols=277 Identities=10% Similarity=-0.022 Sum_probs=145.9 Q ss_pred CCCccCC---CcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCc-ceeeeccccccccccccc--cc Q lcl|NC_011054. 1 MADISRS---EVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLP-GASWVSESATEPEGVKPT--SE 74 (302) Q Consensus 1 Ma~~t~~---~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~-~a~~v~E~~~~~~~~~~~--s~ 74 (302) ||.-+.+ -.....-+.+.+.|...-....|++.++-.....+....+...+-.. ...-..||.+.+...... .. T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~~ 80 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFTTML 80 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccCCEEe Confidence 7755432 23344567778888888888899999887666555444444333222 222234666554332111 11 Q ss_pred cceeeEEeeeeeEEEeehhHHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhcccCC-Ccc-----ccccccccccc Q lcl|NC_011054. 75 ATWADRTLVAEEVAVIIPVHENVVDDA---STSLLEEIAALGGQAIGKKLDQAVIFGTDK-PSS-----WVSPALLPAAV 145 (302) Q Consensus 75 ~~f~~i~l~~~ki~~~~~iS~ell~ds---~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~-~~g-----~~~~~~~~~~~ 145 (302) ..+.+| +...+.||.-+..-+ ..+...|-...-...+.+.+|.++|+|.-. ..+ -...++..... T Consensus 81 ~N~tQI------f~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~ 154 (317) T protein:vir:88 81 NNYCQI------SDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYK 154 (317) T ss_pred ccEEEE------EEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhc Confidence 222222 233334443222211 124334444445567899999999998531 111 01111111110 Q ss_pred c--------------cccceee-ccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec-- Q lcl|NC_011054. 146 A--------------ANQDYTI-VPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD-- 208 (302) Q Consensus 146 ~--------------~~~~~~~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~-- 208 (302) . .....+. .....+.+ ++.++..++-..+..+..+++++.....+.++...++.++..+ T Consensus 155 t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~----~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~ 230 (317) T protein:vir:88 155 TNGSLGANGVAPVGDGSNTGTAGDLRLLTED----MLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDAS 230 (317) T ss_pred cCceeccCccccccCCCccccccccccccHH----HHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEccc Confidence 0 0000110 11123334 4445555555566777788999999999998865455555432 Q ss_pred ccccCcceE--eecc------cccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEE Q lcl|NC_011054. 209 ESFNGFGTY--FNAN------GAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVL 280 (302) Q Consensus 209 ~~~~g~p~~--~~~~------~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v 280 (302) +...|.-+. +.+. +......+.+++.|++++-+..-+++..+..... -+......+..+++.+ T Consensus 231 ~~~~g~~v~~~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~~e~laKt---------Gd~~k~~i~~E~tLe~ 301 (317) T protein:vir:88 231 DNRIAQTVDVYESDFGKYTIRANRWFHENTLFVFDPKMHSLCYLRPFFQHELAKT---------GDSEKRQLLVEYTFRV 301 (317) T ss_pred CeEEEEEEEEEEeCCeEEEEEeCCCCCCCeEEEEcccccceeecccceeeccCCC---------cccceeEEEEEEEEEE Confidence 222222211 1111 0112345677888888776655455544433221 1334566778889999 Q ss_pred eccccEEEEeeecccc Q lcl|NC_011054. 281 GNGATAVGDNKTPVGA 296 (302) Q Consensus 281 ~~~~a~~~lt~~~a~~ 296 (302) .++.|.++++...++. T Consensus 302 ~N~~a~a~i~~l~~~~ 317 (317) T protein:vir:88 302 NNEKSGALIRDVVAQL 317 (317) T ss_pred cCccceeEEEEecccC Confidence 9999999999887777 No 167 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=97.83 E-value=7.9e-06 Score=48.57 Aligned_cols=271 Identities=11% Similarity=0.048 Sum_probs=142.4 Q ss_pred CCC-ccCCCcceecchHHHHHHHHHHHhhh-hhhhhcceeecC-CCceEEEEEeCCcceeeeccccccccccccccccce Q lcl|NC_011054. 1 MAD-ISRSEVATLIQEAYANDLLASAKKGS-TVLQAFPTVNMG-TKTTHLPVLATLPGASWVSESATEPEGVKPTSEATW 77 (302) Q Consensus 1 Ma~-~t~~~~g~liP~~~~~~ii~~~~~~s-~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f 77 (302) +|. -+|+|-+.++-...-+.++..-+... .+.+.++..+++ -...+..+..+.+...-|.|+++...+...+ T Consensus 359 ~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e----- 433 (652) T protein:vir:79 359 AAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYVTTGD----- 433 (652) T ss_pred HHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccccCCCCccceeeecC----- Confidence 443 24566655554444444443333332 355556555443 1223344455677888889998887643322 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc---ccCC----Ccccc-ccccccccccccc Q lcl|NC_011054. 78 ADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF---GTDK----PSSWV-SPALLPAAVAANQ 149 (302) Q Consensus 78 ~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~---G~g~----~~g~~-~~~~~~~~~~~~~ 149 (302) +..++...+++..+.||++++-.-..++.+-|...+.++.++.+++.++. ++.+ ++.++ +....+. T Consensus 434 ~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl------ 507 (652) T protein:vir:79 434 KQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANV------ 507 (652) T ss_pred ccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceeecccccccc------ Confidence 34567889999999999998755467888889999999999999977763 2211 11112 1000000 Q ss_pred ceeeccccchHHHHHHHhhhhhhhhhh---cccCccEEEecHHHHHHHHhhhcCCCceee--e---cccccCcceEeecc Q lcl|NC_011054. 150 DYTIVPGDANEDDLIGCINRASKAVAA---AGYMPDTLLASLGFRFDVANLRDANGNPIF--R---DESFNGFGTYFNAN 221 (302) Q Consensus 150 ~~~~~~~~~~~~~~~~~i~~~~~~~~~---~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~--~---~~~~~g~p~~~~~~ 221 (302) . .++..+.+.+......+..+-.. -...|..|+..+.......++..+...+-- + .+.+.++...+. . T Consensus 508 -~--~~aa~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~v~~a~~~~~~~Np~~~~~~~i~-e 583 (652) T protein:vir:79 508 -L--ESAAMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSVKGADINAGIINPVKDFATVIA-E 583 (652) T ss_pred -c--ccccCCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCCCccccccccccccccccccccc-c Confidence 0 11223334444443333333222 236677889999888777776532211000 0 001111111111 1 Q ss_pred cccCC-CcceEEEEecceEEEEeecCcEEEEeecc---cccchhhhcCCcEEEEEEEEeccEEeccccEEEEee Q lcl|NC_011054. 222 GAWPV-GVAEALVVDSSRVRIGVRQDITVKFLDQA---TVGSINLAERDMIALRLKARFAYVLGNGATAVGDNK 291 (302) Q Consensus 222 ~~~~~-~~~~~~~gd~~~~~~~~~~~~~i~~~~~~---~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~ 291 (302) +-... ....-++++.... ..+++.+++.. ...+..-|..|-+.||++..||.++.|--+++|.|. T Consensus 584 prL~~~s~~~wylaa~~~~-----dtiev~yL~G~~~P~ie~~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 584 PRLDDNSQTTFYLAASKGS-----DTIEVAYLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred cccCCCCcccEEEecCCCC-----CeEEEEEecCCCCCeeeecCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 11111 1111112211110 01222222211 112223488999999999999999999999999987 No 168 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=97.80 E-value=8.8e-06 Score=48.29 Aligned_cols=277 Identities=12% Similarity=0.046 Sum_probs=146.3 Q ss_pred CC-----------CccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecC---CCceEEEEEeCCcceeeeccccccc Q lcl|NC_011054. 1 MA-----------DISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMG---TKTTHLPVLATLPGASWVSESATEP 66 (302) Q Consensus 1 Ma-----------~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~---~~~~~~p~~~~~~~a~~v~E~~~~~ 66 (302) || ....+.-.....+.+...|++...+.-..+.++...+.+ ...+.|+..+....|.+++.+...| T Consensus 35 ~a~d~~~~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~P 114 (339) T protein:vir:94 35 YAMDAVNLTPTLQTTANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQVATYSDWSANG 114 (339) T ss_pred hhccccccccccccccccchhhhhhhhhchhheeecccccchhhhcccccCCCCcccEEEEeeeecccceEEcccccCCC Confidence 21 111111111233334466777777777777777776654 2467888888889999998877654 Q ss_pred cccccccccceeeEEeeeeeEEEeehhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCCC--cccccccccc Q lcl|NC_011054. 67 EGVKPTSEATWADRTLVAEEVAVIIPVHENVVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDKP--SSWVSPALLP 142 (302) Q Consensus 67 ~~~~~~s~~~f~~i~l~~~ki~~~~~iS~ell~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~--~g~~~~~~~~ 142 (302) - ...+.+|.+.++....++-.+. ..|+.+. ...++.+.-.+..++++.+.+++..++|+..- .|+++....+ T Consensus 115 l---~~~~v~~~~~~v~~~~~g~~y~-~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~l~ 190 (339) T protein:vir:94 115 M---SKANVNFESRQNYRYQTWTEYG-DLEMATYGEAGIDYVARQEISASLVMAKFANSSYLLGVAGIANYGLMNDPSLP 190 (339) T ss_pred c---ccccceeeEEeEEEEEEEEeec-HHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEeeeecccceEEEEeCCCcc Confidence 3 2245667777766666555544 3344432 23577888888889999999999999997532 4555444333 Q ss_pred cccccccceeeccccchHHHHHHHhhhhhhhhhhccc------CccEEEecHHHHHHHHhhhcCCCceeeecccccCcc- Q lcl|NC_011054. 143 AAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGY------MPDTLLASLGFRFDVANLRDANGNPIFRDESFNGFG- 215 (302) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~------~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~~~g~p- 215 (302) ..... +..=...+.+.+++++.+++..+..... .+..+++.++.+..|..- +..|.-++.-= ..-+| T Consensus 191 ~~v~~----s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~-n~~~~Tvl~~l-k~n~pn 264 (339) T protein:vir:94 191 APVAA----TVNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRT-NNFGLSAGAKI-AQTYPN 264 (339) T ss_pred ccccC----CCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccC-CcCCccHHHHH-HHhcCC Confidence 22111 1111235678888999988888754332 244789999999988653 44443322100 00012 Q ss_pred eEeeccccc--CCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEecc-EEeccccEEEEeee Q lcl|NC_011054. 216 TYFNANGAW--PVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAY-VLGNGATAVGDNKT 292 (302) Q Consensus 216 ~~~~~~~~~--~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~-~v~~~~a~~~lt~~ 292 (302) +.+..-+-. ..++...++.+. ......+.+.+......-.+ ....-.....+..|+++ .+.+|.+|+++++. T Consensus 265 l~i~~~~el~~a~g~~~~~~~~~----~~~~~~~~~~~p~~~~~lpv-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 265 IQFVAVPEFDTASGRLVQLWVPE----VNGQPTGEVAFAEKLRSHSI-ERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred cEEEEccccccCCCceEEEEEEe----ccCCcceEEEcchhhhcccc-EEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 222211111 111111111110 00001111111111000000 00111233456667554 46789999999998 No 169 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=97.77 E-value=3.9e-06 Score=50.25 Aligned_cols=258 Identities=10% Similarity=-0.005 Sum_probs=128.2 Q ss_pred CCCc---------cCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-e-EEEEEeCCcceeeecccccccccc Q lcl|NC_011054. 1 MADI---------SRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-T-HLPVLATLPGASWVSESATEPEGV 69 (302) Q Consensus 1 Ma~~---------t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~-~~p~~~~~~~a~~v~E~~~~~~~~ 69 (302) |-.. ++.+-+...--++.+++-..+.+-.-++...+.+||..++ + .||.+.-...+.=|+||+++|- T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaEGe~Ipl-- 78 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPL-- 78 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceeeeeccccccCCcccch-- Confidence 3321 1122223233344555555555555566667999998764 5 4566787888888999998764 Q ss_pred cccccccee---eEEeeeeeEEEeehhHHHHHhcch-HHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccc Q lcl|NC_011054. 70 KPTSEATWA---DRTLVAEEVAVIIPVHENVVDDAS-TSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAV 145 (302) Q Consensus 70 ~~~s~~~f~---~i~l~~~ki~~~~~iS~ell~ds~-~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~ 145 (302) ++.+.. ..+++.+|++.-+ |.|.++.+. .+....-.++|...+++++|+.++.--.+.++ T Consensus 79 ---skvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~----------- 142 (296) T protein:vir:98 79 ---SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG----------- 142 (296) T ss_pred ---hhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccc----------- Confidence 444443 3667778877775 999986554 46677888999999999999999952211110 Q ss_pred ccccceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec---ccccCcceEeeccc Q lcl|NC_011054. 146 AANQDYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD---ESFNGFGTYFNANG 222 (302) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~---~~~~g~p~~~~~~~ 222 (302) ....++..-...+...+.++.......+..+...++||.....+++-..-.-+-.|.. ..+.|.- +.... T Consensus 143 -----t~~~t~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg~a~it~qt~fG~tyl~nfLG~~--II~S~ 215 (296) T protein:vir:98 143 -----TQDALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAFGLTYLVDFTGTV--IISTN 215 (296) T ss_pred -----eeeechhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhcCCccchhheechhhhhhccccE--EEEcC Confidence 0011111111222223333444444444445567788888877653221111111111 1234432 22211 Q ss_pred ccCCCcceEEEEecceEE--EEee--cCcEEEEeecccccchhhhcCCcEEEEEEEE--------------eccE--Eec Q lcl|NC_011054. 223 AWPVGVAEALVVDSSRVR--IGVR--QDITVKFLDQATVGSINLAERDMIALRLKAR--------------FAYV--LGN 282 (302) Q Consensus 223 ~~~~~~~~~~~gd~~~~~--~~~~--~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r--------------~d~~--v~~ 282 (302) ...++.++..-...+. +.+. +++.-.. .+..|.+.+.+..+ -++. ..+ T Consensus 216 --kV~~G~~~~T~~~Ni~~ay~~~~~~~l~~~f----------~~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~ 283 (296) T protein:vir:98 216 --DVTKGEIWATVPENIIFAYINPNNSELAKEF----------NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPER 283 (296) T ss_pred --cCCCceEEEeeecceEEEeecccccchhhhh----------ccccccccceEEEeccccceeeehhHhHhHHHhcccc Confidence 1234444444333322 1221 1111110 01112222221111 1111 245 Q ss_pred cccEEEEeeeccc Q lcl|NC_011054. 283 GATAVGDNKTPVG 295 (302) Q Consensus 283 ~~a~~~lt~~~a~ 295 (302) ++++++.+-+++- T Consensus 284 ~dgiv~~tI~~~~ 296 (296) T protein:vir:98 284 IDGIVKVTLTPGV 296 (296) T ss_pred cceEEEEEecCCC Confidence 6778887775333 No 170 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=97.73 E-value=9.7e-06 Score=48.06 Aligned_cols=273 Identities=10% Similarity=0.026 Sum_probs=149.9 Q ss_pred CCCccCCCcceecchHH---HHHHHHHHHhhhhhhhhccee---ecCCCceEEEEEeCCccee--eeccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAY---ANDLLASAKKGSTVLQAFPTV---NMGTKTTHLPVLATLPGAS--WVSESATEPEGVKPT 72 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~---~~~ii~~~~~~s~l~~~~~~~---~~~~~~~~~p~~~~~~~a~--~v~E~~~~~~~~~~~ 72 (302) |+ ..++|+ .++ ...|.+....+-..++++... +....+..+...+....+. |++-++. ..|. T Consensus 1 ~~-----~lafl~-~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~----dip~ 70 (304) T protein:vir:52 1 MS-----LLAYVK-NGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTS----TLDQ 70 (304) T ss_pred Cc-----hHHHHH-HHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCC----ccce Confidence 33 223333 333 234444444444444554433 3333456666665555666 8776642 3556 Q ss_pred cccceeeEEeeeeeEEEeehhHHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhcccCC---Ccccccccccccccc Q lcl|NC_011054. 73 SEATWADRTLVAEEVAVIIPVHENVVDDA---STSLLEEIAALGGQAIGKKLDQAVIFGTDK---PSSWVSPALLPAAVA 146 (302) Q Consensus 73 s~~~f~~i~l~~~ki~~~~~iS~ell~ds---~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---~~g~~~~~~~~~~~~ 146 (302) .+..+++-....+.++..+.+|.+=++.+ ..++..-=.+...+++...+++..+.|+.. -.|+++....+.... T Consensus 71 vd~~~~~~~~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~ 150 (304) T protein:vir:52 71 VEVGFTPTRSYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAI 150 (304) T ss_pred eecccceeEEEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeee Confidence 67777888888888888888776444332 346677777778889999999999999642 246666655553322 Q ss_pred cccceeeccccchHHHHHHHhhhhhhhhhhc---ccCccEEEecHHHHHHHHhhh-cCCCceee---ec--ccccCcceE Q lcl|NC_011054. 147 ANQDYTIVPGDANEDDLIGCINRASKAVAAA---GYMPDTLLASLGFRFDVANLR-DANGNPIF---RD--ESFNGFGTY 217 (302) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~---~~~~~~~v~~~~~~~~l~~l~-d~~g~~i~---~~--~~~~g~p~~ 217 (302) ........-.+.+.+++.+++..+..++... ...+..+++.++.+..|.... ...+.-++ .. ....|.|+. T Consensus 151 ~~~~a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~~~~g~~l~ 230 (304) T protein:vir:52 151 KGAAQNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLSAAAGRQVA 230 (304) T ss_pred cCCccCCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhcccccCCcce Confidence 2111111123347788889988888887532 245678999999999996542 22232222 11 112344433 Q ss_pred eecc-----cccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEE--EEEEEEeccE-EeccccEEEE Q lcl|NC_011054. 218 FNAN-----GAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIA--LRLKARFAYV-LGNGATAVGD 289 (302) Q Consensus 218 ~~~~-----~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~--~r~~~r~d~~-v~~~~a~~~l 289 (302) +..- .....++..+++.+.+.-.+...-.+.+.... -..++... .=++.|+++. +.+|.+++++ T Consensus 231 I~~v~~~~~~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~--------~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~ 302 (304) T protein:vir:52 231 IKALPSNYGTRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLD--------AQPKGLLAFESGLRMAFGGVTFMEPDSALYV 302 (304) T ss_pred EEEecccccccCCCCceEEEEEecChhheEEecCccccccc--------hhhcCCceEEecceeeeeeEEEEccceeeee Confidence 2211 11112334444554443333332222222221 12334332 3356777666 5679999999 Q ss_pred ee Q lcl|NC_011054. 290 NK 291 (302) Q Consensus 290 t~ 291 (302) .. T Consensus 303 D~ 304 (304) T protein:vir:52 303 DY 304 (304) T ss_pred cC Confidence 99 No 171 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=97.47 E-value=7.6e-06 Score=48.64 Aligned_cols=261 Identities=11% Similarity=0.041 Sum_probs=131.0 Q ss_pred CCCccC----CCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eE---EEEEeCCcceeeeccccccccccccc Q lcl|NC_011054. 1 MADISR----SEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-TH---LPVLATLPGASWVSESATEPEGVKPT 72 (302) Q Consensus 1 Ma~~t~----~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~---~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (302) |+.-.- .+-+..+--++.+++-..+.+-.-++...+.+||..++ ++ +|.++-...+.-|+||+.+|- T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Ipl----- 75 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPL----- 75 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCcccch----- Confidence 653222 22333344455566666666666677777888988663 44 444455577778999998764 Q ss_pred cccce---eeEEeeeeeEEEeehhHHHHHhcch-HHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccc Q lcl|NC_011054. 73 SEATW---ADRTLVAEEVAVIIPVHENVVDDAS-TSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAAN 148 (302) Q Consensus 73 s~~~f---~~i~l~~~ki~~~~~iS~ell~ds~-~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~ 148 (302) ++.+. ...+++.+|++..+ |.|.++.+. .+....-.++|..++++++++.||.=-.+.++. .. T Consensus 76 skvt~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t-----------~~ 142 (303) T protein:vir:10 76 TKVTREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIEN-----------GK 142 (303) T ss_pred hhheeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccc-----------cc Confidence 44443 24778888888855 999996553 456777888999999999999998521111100 00 Q ss_pred cceeeccccchHHHHHHHhhhhhhhh---hhcccCccEEEecHHHHHHHHhhhcCC------C-ceeeecccccCcceEe Q lcl|NC_011054. 149 QDYTIVPGDANEDDLIGCINRASKAV---AAAGYMPDTLLASLGFRFDVANLRDAN------G-NPIFRDESFNGFGTYF 218 (302) Q Consensus 149 ~~~~~~~~~~~~~~~~~~i~~~~~~~---~~~~~~~~~~v~~~~~~~~l~~l~d~~------g-~~i~~~~~~~g~p~~~ 218 (302) .......+.+.+.+++.....++ ...+ ....+++||.+...+++-..-+ | .|| + .+.|..+.. T Consensus 143 ---~t~~t~~s~~glq~Al~~~~~kl~~~~ed~-~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~L-~--nfLG~~II~ 215 (303) T protein:vir:10 143 ---RTNKTKLSAENLQGALSKGRANLSVLLDDE-ITPIAFVNPNDTAEYLANGFINSTGAQFGVNLL-T--PYVGVKIVE 215 (303) T ss_pred ---cccceeecHHHHHHHHHhhhhhcccccccc-ccEEEEEchHHHHHHhhcCCcchhhhhhhhhhh-h--hhhcceEEE Confidence 00111122333333333332222 1222 2346778999998886532111 1 122 1 255554422 Q ss_pred ecccccCCCcceEEEEecceEE--EEe-ecCcEEEEeecccccchhhhcCCcEEEEEEEE--------------eccE-- Q lcl|NC_011054. 219 NANGAWPVGVAEALVVDSSRVR--IGV-RQDITVKFLDQATVGSINLAERDMIALRLKAR--------------FAYV-- 279 (302) Q Consensus 219 ~~~~~~~~~~~~~~~gd~~~~~--~~~-~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r--------------~d~~-- 279 (302) ... ..++.++.--...+. +.+ ++++.- .+ -+..|.+.|.+..+ -++. T Consensus 216 S~k----v~~G~~~~T~~~Ni~~ay~~~~g~l~~-~f---------~~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lf 281 (303) T protein:vir:10 216 FAD----VPQGEVWMTVAENLNVAYANPRGELSR-AF---------AFATDATGFVGVLHDIQPQRLTSDTIYASAISMF 281 (303) T ss_pred ecc----CCCceEEEeeccceEEEEecCchhhhh-hh---------hhccccccceEEEeccccceeeehhHhHhHHHhc Confidence 222 333444333322221 111 111110 00 01112222211111 1111 Q ss_pred EeccccEEEEeeec-ccccCCC Q lcl|NC_011054. 280 LGNGATAVGDNKTP-VGAVVPD 300 (302) Q Consensus 280 v~~~~a~~~lt~~~-a~~~~p~ 300 (302) ..+.+++++.+-+. -+.-.|. T Consensus 282 pE~~dgiv~~ti~~~e~~~~~~ 303 (303) T protein:vir:10 282 PENIDAVIKVTIKKDEAGELPS 303 (303) T ss_pred ccccceEEEEEEeccccCCCCC Confidence 24667888888643 2334566 No 172 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=97.46 E-value=7.2e-06 Score=48.77 Aligned_cols=276 Identities=11% Similarity=-0.006 Sum_probs=138.4 Q ss_pred CCCcc-------CCCcceecchHHH----HHHHHHHHhhhhhhhhcceeecC---CCceEEEEEeCCcceeeeccccccc Q lcl|NC_011054. 1 MADIS-------RSEVATLIQEAYA----NDLLASAKKGSTVLQAFPTVNMG---TKTTHLPVLATLPGASWVSESATEP 66 (302) Q Consensus 1 Ma~~t-------~~~~g~liP~~~~----~~ii~~~~~~s~l~~~~~~~~~~---~~~~~~p~~~~~~~a~~v~E~~~~~ 66 (302) |+.-. ++.....||..+. ..+++.+...-....++.....+ .....+++......+.+.+-+...| T Consensus 31 ~~~da~d~~~~~~~~~~~~i~~~l~~~i~p~~~~~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P 110 (336) T protein:vir:10 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDG 110 (336) T ss_pred hhhhhhhccCccccCCCchhHHHHHhhcccceeeehhhhhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCc Confidence 22110 0011122343221 33445444444445555544432 1345666666677778877665543 Q ss_pred cccccccccceeeEEeeeeeEEEeehhHH-HHHhcc--hHHHHHHHHHHHHHHHHHHHHHHhhcccCCC--ccccccccc Q lcl|NC_011054. 67 EGVKPTSEATWADRTLVAEEVAVIIPVHE-NVVDDA--STSLLEEIAALGGQAIGKKLDQAVIFGTDKP--SSWVSPALL 141 (302) Q Consensus 67 ~~~~~~s~~~f~~i~l~~~ki~~~~~iS~-ell~ds--~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~--~g~~~~~~~ 141 (302) .++..-+..+-+.+.++..+.++. |+.+.. ..++.+.-+...++++.+.+++-.+.|+..- .|+++.... T Consensus 111 -----~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllN~P~l 185 (336) T protein:vir:10 111 -----DSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSL 185 (336) T ss_pred -----eeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEeCCCC Confidence 344344444556777888888884 554432 3577788888888899999999888887643 255554433 Q ss_pred ccccccccceeeccccchHHHHHHHhhhhhhhhhhcc------cCccEEEecHHHHHHHHhhhcCCCceeeecccccCcc Q lcl|NC_011054. 142 PAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAG------YMPDTLLASLGFRFDVANLRDANGNPIFRDESFNGFG 215 (302) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~------~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~~~g~p 215 (302) +...+. .+.-.+..+.+.+++++.+++..+.... ..+..++|.++.+..|.+ ++..|.-+++- ...-+| T Consensus 186 ~a~~t~---~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~-lk~n~P 260 (336) T protein:vir:10 186 SAPITA---TTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK-LKDIFP 260 (336) T ss_pred cccccc---CCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccC-CCccCccHHHH-HHHhcC Confidence 322211 1112223456888899998888887533 247789999999988864 23334322210 000022 Q ss_pred -eEeecccccC--CCcceEEEE-ecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccE-EeccccEEEEe Q lcl|NC_011054. 216 -TYFNANGAWP--VGVAEALVV-DSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYV-LGNGATAVGDN 290 (302) Q Consensus 216 -~~~~~~~~~~--~~~~~~~~g-d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~-v~~~~a~~~lt 290 (302) +.+...+-.. .+....++. +... ..-..+........-.+ -...-.....+..|+++. +.+|.+|++++ T Consensus 261 nl~i~t~pEl~~a~G~~~~l~~~~~~~-----~~t~~~~~p~~~~~l~v-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~ 334 (336) T protein:vir:10 261 KLEFVTIPEYDTASGRLVQLWAPRVEG-----KDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMI 334 (336) T ss_pred ccEEEEccccccCCCceEEEEEEecCC-----Ccceeeecchhhhccce-eecCceeEeccccceeeeeeeccchheeee Confidence 2222111111 111111111 1100 00011111111000000 011122334566676665 56799999999 Q ss_pred ee Q lcl|NC_011054. 291 KT 292 (302) Q Consensus 291 ~~ 292 (302) +. T Consensus 335 GI 336 (336) T protein:vir:10 335 GV 336 (336) T ss_pred cC Confidence 88 No 173 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=97.43 E-value=5.3e-05 Score=44.01 Aligned_cols=273 Identities=12% Similarity=0.067 Sum_probs=137.6 Q ss_pred CCC-ccCCCcceecchHHHHHHHHHHHh-hhhhhhhcceeecC-CCceEEEEEeCCcceeeeccccccccccccccccce Q lcl|NC_011054. 1 MAD-ISRSEVATLIQEAYANDLLASAKK-GSTVLQAFPTVNMG-TKTTHLPVLATLPGASWVSESATEPEGVKPTSEATW 77 (302) Q Consensus 1 Ma~-~t~~~~g~liP~~~~~~ii~~~~~-~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f 77 (302) ||. -+++|-+.++-...-+.+++.-+. ...+...++..+++ -...+.......+...-|.|+++..-+...++ T Consensus 394 ~a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~---- 469 (693) T protein:vir:95 394 LAFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGEFSSLRQVREGAEYKYVTLGER---- 469 (693) T ss_pred HHHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCCCCChhhcCCCCceeeeecCCc---- Confidence 444 345555544443333333322222 22344444433332 11222233445566667888887754332221 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc---ccC---CCcccccccccccccccccce Q lcl|NC_011054. 78 ADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF---GTD---KPSSWVSPALLPAAVAANQDY 151 (302) Q Consensus 78 ~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~---G~g---~~~g~~~~~~~~~~~~~~~~~ 151 (302) .-++...+++..+.||++++-.-..++.+-|...+.++.++.+++.++. ++. .++.++...-.+. . T Consensus 470 -~e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFhadH~Nl-------~ 541 (693) T protein:vir:95 470 -GEQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFHADHSNL-------L 541 (693) T ss_pred -cceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceeecccccc-------c Confidence 2355678899999999998865567888889999999999999987763 221 1122222111110 1 Q ss_pred eeccccchHHHHHHHhhhhhhhh--------hhcccCccEEEecHHHHHHHHhhhcCCCceee--e---cccccCcceEe Q lcl|NC_011054. 152 TIVPGDANEDDLIGCINRASKAV--------AAAGYMPDTLLASLGFRFDVANLRDANGNPIF--R---DESFNGFGTYF 218 (302) Q Consensus 152 ~~~~~~~~~~~~~~~i~~~~~~~--------~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~--~---~~~~~g~p~~~ 218 (302) +......+.+.+......+..+- ..-...|..|+..+.......++..+.-.+-- + .+.+.|+.- + T Consensus 542 tga~sals~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~~a~~~~~~~NP~~~~~~-v 620 (693) T protein:vir:95 542 TGAASALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVPGADVNSGIVNPIRAFAQ-V 620 (693) T ss_pred cccccccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhccccccccccccccccchhcccc-c Confidence 11122333444444434443321 11235678888888888887776543221100 0 011112111 1 Q ss_pred ecccccC--CCcceEEEEecceEEEEeecCcEEEEeecc---cccchhhhcCCcEEEEEEEEeccEEeccccEEEEeee Q lcl|NC_011054. 219 NANGAWP--VGVAEALVVDSSRVRIGVRQDITVKFLDQA---TVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKT 292 (302) Q Consensus 219 ~~~~~~~--~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~---~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~ 292 (302) +..+-.. .+..-.++.|... ..+++.+++.. ...+..-|..|-+.||++..||.++.|--+++|-.++ T Consensus 621 i~~prL~~~s~~~Wyl~a~~~~------dtie~~yL~G~~~P~ie~~~gf~~dG~~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 621 IGEPRLDDASATAWYMAAKKGS------DTIEVAYLDGVDTPYLEQQEGFTVDGVASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred cccceecCCCCCceEEecCCCC------CeEEEEEecCCCCCeEeecCCCCcceEEEEEEEeccCceeeccccccCCCC Confidence 1111111 1111122333221 01222222111 1122234889999999999999999998888877766 No 174 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=97.26 E-value=1.2e-05 Score=47.54 Aligned_cols=274 Identities=10% Similarity=-0.021 Sum_probs=137.7 Q ss_pred CCCccCCCcce-------ecchHHHH----HHHHHHHhhhhhhhhcceeecC---CCceEEEEEeCCcceeeeccccccc Q lcl|NC_011054. 1 MADISRSEVAT-------LIQEAYAN----DLLASAKKGSTVLQAFPTVNMG---TKTTHLPVLATLPGASWVSESATEP 66 (302) Q Consensus 1 Ma~~t~~~~g~-------liP~~~~~----~ii~~~~~~s~l~~~~~~~~~~---~~~~~~p~~~~~~~a~~v~E~~~~~ 66 (302) |+....-.++. -||..+.+ .+++.+...-....++.....+ .....+++......+.+.+-+...| T Consensus 31 ~~~da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P 110 (336) T protein:vir:36 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDG 110 (336) T ss_pred hhhhhhhccCccccCCCcchHHHHHHhhccceEeeecchhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCc Confidence 22211000111 13433322 3444444444445555544332 1245666666667778877665543 Q ss_pred cccccccccceeeEEeeeeeEEEeehhH-HHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHhhcccCCC--ccccccccc Q lcl|NC_011054. 67 EGVKPTSEATWADRTLVAEEVAVIIPVH-ENVVDDA--STSLLEEIAALGGQAIGKKLDQAVIFGTDKP--SSWVSPALL 141 (302) Q Consensus 67 ~~~~~~s~~~f~~i~l~~~ki~~~~~iS-~ell~ds--~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~--~g~~~~~~~ 141 (302) .++..-+..+-+.+.++..+.++ .|+.+.. ..++.+.-+...++++.+.+++-.+.|+..- .|+++.... T Consensus 111 -----~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllNdP~l 185 (336) T protein:vir:36 111 -----DSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSL 185 (336) T ss_pred -----eeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEecCCC Confidence 34434444455677788888887 4555432 3567777888888889999998888887643 255554433 Q ss_pred ccccccccceeeccccchHHHHHHHhhhhhhhhhhcc------cCccEEEecHHHHHHHHhhhcCCCceeeecccccCcc Q lcl|NC_011054. 142 PAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAG------YMPDTLLASLGFRFDVANLRDANGNPIFRDESFNGFG 215 (302) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~------~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~~~g~p 215 (302) +...+. .+.--+..+.+.+++++.+++..+.... ..+..++|.++.+..|.+ ++..|.-+++- ...-+| T Consensus 186 ~a~~t~---~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~-lk~n~P 260 (336) T protein:vir:36 186 SAPITA---TTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK-LKDIFP 260 (336) T ss_pred cccccc---CCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccC-CCccCccHHHH-HHHhcC Confidence 322211 1111223456888899998888887533 246789999999988864 23334322210 000022 Q ss_pred -eEeecccccC--CCcceEEEEecceEEEEeecC---cEEEEeecccccchhhhcCCcEEEEEEEEeccE-EeccccEEE Q lcl|NC_011054. 216 -TYFNANGAWP--VGVAEALVVDSSRVRIGVRQD---ITVKFLDQATVGSINLAERDMIALRLKARFAYV-LGNGATAVG 288 (302) Q Consensus 216 -~~~~~~~~~~--~~~~~~~~gd~~~~~~~~~~~---~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~-v~~~~a~~~ 288 (302) +.+...+-.. .+....++ +-...+ ..+........-.+ -...-.....+..|+++. +.+|.+|++ T Consensus 261 nl~i~t~pEl~~a~g~~~~l~-------~~~~~~~~t~~~~~p~~~~~l~v-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~ 332 (336) T protein:vir:36 261 KLEFVTIPEYDTASGRLVQLW-------APRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQ 332 (336) T ss_pred ccEEEEccccccCCCceEEEE-------EEecCCCcceeeecchhhhccce-eecCceeEeccccceeeeeeeccchhee Confidence 2222111111 11111111 111111 11111111000000 011122334566676665 567999999 Q ss_pred Eeee Q lcl|NC_011054. 289 DNKT 292 (302) Q Consensus 289 lt~~ 292 (302) +++. T Consensus 333 ~~GI 336 (336) T protein:vir:36 333 MIGV 336 (336) T ss_pred eecC Confidence 9988 No 175 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=97.02 E-value=0.00021 Score=40.76 Aligned_cols=274 Identities=14% Similarity=0.075 Sum_probs=124.2 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhh-hhhhhcceeecCCCceEEEEEeCCcce-eeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGS-TVLQAFPTVNMGTKTTHLPVLATLPGA-SWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s-~l~~~~~~~~~~~~~~~~p~~~~~~~a-~~v~E~~~~~~~~~~~s~~~f~ 78 (302) |..+... =.++-..+.+.+.+...... ...+.++..+.+....++.....-+.. .|.+|-.- ...+=. T Consensus 1 m~it~~~--l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~~Ge~~~--------~~l~~~ 70 (302) T protein:vir:10 1 MLINKQS--LNAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRWIGAKVV--------KNLKAY 70 (302) T ss_pred CcccHHH--HHHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCccccccceee--------cccccc Confidence 6644321 11222222233333333322 255667766655555566555544443 45544321 122334 Q ss_pred eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----ccCC----Ccccccccccccc----cc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF----GTDK----PSSWVSPALLPAA----VA 146 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g~----~~g~~~~~~~~~~----~~ 146 (302) ..+++.++++..+.||++.+.+=..++..-+.+.+.++.++..|+.++. |.+. +.-++........ +. T Consensus 71 ~~~i~~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~ 150 (302) T protein:vir:10 71 KYVVENEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNK 150 (302) T ss_pred ceeEEeecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccc Confidence 4678889999999999999987677888889999999999999988774 2221 1111111100000 00 Q ss_pred cccceeeccccchHHHHHHHhhhhhhhhh----hcccCccEEEecHHHHHHHHhhhcCCCceee-ecccccCcceEeecc Q lcl|NC_011054. 147 ANQDYTIVPGDANEDDLIGCINRASKAVA----AAGYMPDTLLASLGFRFDVANLRDANGNPIF-RDESFNGFGTYFNAN 221 (302) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~----~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~-~~~~~~g~p~~~~~~ 221 (302) ..............+.+.+....+...-. .-...|..++..+.....-+++-.. ++.-. ..+...|.-- ++.+ T Consensus 151 g~~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~-~~~~~g~~Np~~g~~~-~vv~ 228 (302) T protein:vir:10 151 GTAPLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTN-PKLADNTPNPYVGTAE-LVVD 228 (302) T ss_pred cchhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhc-cccCCCCcceeccceE-EEEe Confidence 00000111112223333333333322211 2235566777777776666655311 11100 0011112111 1111 Q ss_pred cccCCCcceEEEEecce---EEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEecc------EEeccccEEEEeee Q lcl|NC_011054. 222 GAWPVGVAEALVVDSSR---VRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAY------VLGNGATAVGDNKT 292 (302) Q Consensus 222 ~~~~~~~~~~~~gd~~~---~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~------~v~~~~a~~~lt~~ 292 (302) +-...+..=.++.|.+. +++.-+++..++..+. |..+.+.+|.+..|+. +...+....+-++ T Consensus 229 p~L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~--------~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g- 299 (302) T protein:vir:10 229 GRIESDTAWFLLDTTKPVKPFIFQPRKQPEFVSQVN--------LDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGSTG- 299 (302) T ss_pred eccCCCCceEEEecCCccceEEEcCccccEEEeccC--------CCCCceEEEEEEEEeeeeeeecchhhhhhhhccCc- Confidence 11122223334444433 2333444555544332 5556666666555553 3333333232332 Q ss_pred cccccCCCCC Q lcl|NC_011054. 293 PVGAVVPDGS 302 (302) Q Consensus 293 ~a~~~~p~~~ 302 (302) ++| T Consensus 300 -------~~~ 302 (302) T protein:vir:10 300 -------TGA 302 (302) T ss_pred -------cCC Confidence 333 No 176 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=96.96 E-value=5.3e-05 Score=44.02 Aligned_cols=276 Identities=10% Similarity=-0.016 Sum_probs=140.1 Q ss_pred CCCccCCCccee-------cchHHH----HHHHHHHHhhhhhhhhcceeecC---CCceEEEEEeCCcceeeeccccccc Q lcl|NC_011054. 1 MADISRSEVATL-------IQEAYA----NDLLASAKKGSTVLQAFPTVNMG---TKTTHLPVLATLPGASWVSESATEP 66 (302) Q Consensus 1 Ma~~t~~~~g~l-------iP~~~~----~~ii~~~~~~s~l~~~~~~~~~~---~~~~~~p~~~~~~~a~~v~E~~~~~ 66 (302) |+.-..-.++.+ ||..+. ..+++.+........++.+...+ .....++.......+.+.+-+... T Consensus 31 ~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~- 109 (336) T protein:vir:78 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSD- 109 (336) T ss_pred HHHhhhhhccccccCCCcchHHHHHHhcccceeeehhhhhhhhhhcccccCCCccccEEEEeeeecceeeEEeecccCC- Confidence 222110001111 333222 34445555544455555544432 134577777777888888766544 Q ss_pred cccccccccceeeEEeeeeeEEEeehhHH-HHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCCC--ccccccccc Q lcl|NC_011054. 67 EGVKPTSEATWADRTLVAEEVAVIIPVHE-NVVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDKP--SSWVSPALL 141 (302) Q Consensus 67 ~~~~~~s~~~f~~i~l~~~ki~~~~~iS~-ell~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~--~g~~~~~~~ 141 (302) |.++..-+..+-+.+.++..+.++. |+.+- ...++.+.-+...++++.+.+++-.++|+..- .|+++.... T Consensus 110 ----P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l 185 (336) T protein:vir:78 110 ----GDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSL 185 (336) T ss_pred ----CeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeccccceEEEEeCCCC Confidence 4455556666677788888888885 44432 23577788888888888999998888887642 355554433 Q ss_pred ccccccccceeeccccchHHHHHHHhhhhhhhhhhccc------CccEEEecHHHHHHHHhhhcCCCceeeecccccCcc Q lcl|NC_011054. 142 PAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGY------MPDTLLASLGFRFDVANLRDANGNPIFRDESFNGFG 215 (302) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~------~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~~~g~p 215 (302) +...+. .+..-+..+.+.+++++..++..+..... .+..+++.+..+..|.. ++..|.-+++- ....+| T Consensus 186 ~a~~t~---~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~~-lk~n~P 260 (336) T protein:vir:78 186 SAPITA---TTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFP 260 (336) T ss_pred Cccccc---CcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHHH-HHHhcC Confidence 322211 11111235678889999999888754331 24578999999999964 33334322110 000122 Q ss_pred -eEeecccccCCCcceEEEEecceEEEEeec---CcEEEEeecccccchhhhcCCcEEEEEEEEeccE-EeccccEEEEe Q lcl|NC_011054. 216 -TYFNANGAWPVGVAEALVVDSSRVRIGVRQ---DITVKFLDQATVGSINLAERDMIALRLKARFAYV-LGNGATAVGDN 290 (302) Q Consensus 216 -~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~---~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~-v~~~~a~~~lt 290 (302) +.+..-+-..... |+..+++..+.. -+.+.+......-.+ -.........+..|+++. +.+|.+|++++ T Consensus 261 nl~i~t~pel~~Ag-----g~~~~~~~~~~~~~~t~~~~~p~~f~~lpv-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~ 334 (336) T protein:vir:78 261 KLEFVTIPEYDTAS-----GRLVQLWAPRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMI 334 (336) T ss_pred ccEEEEcccccccC-----cceEEEEEeeccCCcceeeecchhhhccce-eecCceeEeccccceeeeeeeccchheeec Confidence 2222111111111 111111111110 011111111000000 011122334556666665 56799999999 Q ss_pred ee Q lcl|NC_011054. 291 KT 292 (302) Q Consensus 291 ~~ 292 (302) +. T Consensus 335 GI 336 (336) T protein:vir:78 335 GV 336 (336) T ss_pred cC Confidence 88 No 177 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=96.95 E-value=8.8e-05 Score=42.81 Aligned_cols=276 Identities=13% Similarity=0.043 Sum_probs=132.6 Q ss_pred CCCccCCC----ccee-------cch---HHHHHHHHHHHhhhhhhhhcceeecCC---CceEEEEEeCCcceeeecccc Q lcl|NC_011054. 1 MADISRSE----VATL-------IQE---AYANDLLASAKKGSTVLQAFPTVNMGT---KTTHLPVLATLPGASWVSESA 63 (302) Q Consensus 1 Ma~~t~~~----~g~l-------iP~---~~~~~ii~~~~~~s~l~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~E~~ 63 (302) |....++. ...+ +|. .+...+|+.+..-.....++.....+. ....+++.+....+.+++-+. T Consensus 56 md~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~~ 135 (379) T protein:vir:10 56 MDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTDGG 135 (379) T ss_pred hccccccccccccCccccccccchHHHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeEEecccc Confidence 33221110 0011 122 122556666655555555555544321 345666666677788887665 Q ss_pred ccccccccccccceeeEEeeeeeEEEeehhHH-HHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCC-C---cccc Q lcl|NC_011054. 64 TEPEGVKPTSEATWADRTLVAEEVAVIIPVHE-NVVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDK-P---SSWV 136 (302) Q Consensus 64 ~~~~~~~~~s~~~f~~i~l~~~ki~~~~~iS~-ell~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~-~---~g~~ 136 (302) ..|- ++..-+...-..+.+...+.++. |+.+- ...++.+.-....++++.+.+++-.|+|.+. . .|++ T Consensus 136 d~pl-----~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~yGll 210 (379) T protein:vir:10 136 NMAL-----MSWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRTFGFL 210 (379) T ss_pred CCCe-----eeeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcceEEEE Confidence 5432 33222333334455666666665 33332 2457888888999999999999999999532 1 2555 Q ss_pred ccccccccccccccee--eccccchHHHHHHHhhhhhhhhhhccc-------CccEEEecHHHHHHHHhhhcCCCceeee Q lcl|NC_011054. 137 SPALLPAAVAANQDYT--IVPGDANEDDLIGCINRASKAVAAAGY-------MPDTLLASLGFRFDVANLRDANGNPIFR 207 (302) Q Consensus 137 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~i~~~~~~~~~~~~-------~~~~~v~~~~~~~~l~~l~d~~g~~i~~ 207 (302) +...++...+..+... ..=...+.+.+++++..++..+..... .+..+++.+..+..|..- +..|.-+++ T Consensus 211 NdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~-n~~g~Tvl~ 289 (379) T protein:vir:10 211 NDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP-TELGYSVAQ 289 (379) T ss_pred eCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc-cccCccHHH Confidence 5554443222111111 111234678888888888887653221 233688999999999743 333322221 Q ss_pred cccccCcc-eEeecccc---cCC-CcceEEEEec-ce--------EEEEeecCcEEEEeecccccchhhhcCCcEEEEEE Q lcl|NC_011054. 208 DESFNGFG-TYFNANGA---WPV-GVAEALVVDS-SR--------VRIGVRQDITVKFLDQATVGSINLAERDMIALRLK 273 (302) Q Consensus 208 ~~~~~g~p-~~~~~~~~---~~~-~~~~~~~gd~-~~--------~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~ 273 (302) - ...-+| +.+..-+- .+. ++...++.+- .. +.......+.. . .. ....-.....+. T Consensus 290 ~-lk~n~Pnl~i~t~pEL~~aggg~~~~~~~~~~~~~~~t~~~~~~~~~~p~k~~~--l---~v----e~~~~~~~~~~~ 359 (379) T protein:vir:10 290 Y-MRESYPNVTFVSAPELNDANGGSSAIYYYADAVENNGTDDGRTWLQVVPTKMFT--L---GV----EKKIKGYAEGYT 359 (379) T ss_pred H-HHHhcCCcEEEEcccccccCCCccEEEEEeeccCCCccCCcceEEEecchhhhh--c---cc----eecCceeEeccc Confidence 0 000012 11111111 111 1112222210 00 00111111000 0 00 001111223455 Q ss_pred EEeccE-EeccccEEEEeee Q lcl|NC_011054. 274 ARFAYV-LGNGATAVGDNKT 292 (302) Q Consensus 274 ~r~d~~-v~~~~a~~~lt~~ 292 (302) .|+++. +.+|.||+++++. T Consensus 360 ~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 360 NATAGAMLKRPFATYRQTGA 379 (379) T ss_pred cceeeeeeecchhhheecCC Confidence 565555 5789999999988 No 178 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=96.73 E-value=9.6e-05 Score=42.61 Aligned_cols=282 Identities=10% Similarity=-0.015 Sum_probs=129.0 Q ss_pred CCCcc-----CCCcceecchHHHH----HHHHHHHhhhhhhhhcceeecC---CCceEEEEEeCCcceeeeccccccccc Q lcl|NC_011054. 1 MADIS-----RSEVATLIQEAYAN----DLLASAKKGSTVLQAFPTVNMG---TKTTHLPVLATLPGASWVSESATEPEG 68 (302) Q Consensus 1 Ma~~t-----~~~~g~liP~~~~~----~ii~~~~~~s~l~~~~~~~~~~---~~~~~~p~~~~~~~a~~v~E~~~~~~~ 68 (302) ||.-. .+.++.=||-.+.+ .|++-+..--....++.+...+ .....+++.+....+.+.+-+...|-. T Consensus 65 ~a~da~~~~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D~Pl~ 144 (388) T protein:vir:99 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLS 144 (388) T ss_pred cccCcccccccccCcccHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeeeecceeEEEeecccCCCce Confidence 22110 01111115655544 3344444433444444443322 235567777777788888776655431 Q ss_pred cccccccceeeEEeeeeeEEEeehhHHH-HHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCCC-----cccccccc Q lcl|NC_011054. 69 VKPTSEATWADRTLVAEEVAVIIPVHEN-VVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDKP-----SSWVSPAL 140 (302) Q Consensus 69 ~~~~s~~~f~~i~l~~~ki~~~~~iS~e-ll~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~-----~g~~~~~~ 140 (302) ..+.++.+.++ +.+...+.++.+ +-.- ...++...-+...++++.+.+++-.|+|.... .|+++... T Consensus 145 ---d~~~~~~~r~v--~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~ 219 (388) T protein:vir:99 145 ---SWNVNFERRTI--VRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPS 219 (388) T ss_pred ---eccceeeeeeE--EEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCC Confidence 22344444443 445555666653 3322 23577888888888899999999999995321 25555444 Q ss_pred ccccccccccee-eccccchHHHHHHHhhhhhhhhhhccc-------CccEEEecHHHHHHHHhhhcCCCceeeeccccc Q lcl|NC_011054. 141 LPAAVAANQDYT-IVPGDANEDDLIGCINRASKAVAAAGY-------MPDTLLASLGFRFDVANLRDANGNPIFRDESFN 212 (302) Q Consensus 141 ~~~~~~~~~~~~-~~~~~~~~~~~~~~i~~~~~~~~~~~~-------~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~~~ 212 (302) .+.......... ..-...+.+.+++++..++..+..... .+-.+++.+..+..|..- +..|.-+++- ... T Consensus 220 l~a~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tvl~~-lk~ 297 (388) T protein:vir:99 220 LLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDW-LKQ 297 (388) T ss_pred cccccccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhcccc-CcCCccHHHH-HHH Confidence 333222111111 111234778889999999988754332 112578899999998633 3333222110 000 Q ss_pred Ccc-eEee---ccccc-CCCcc-e-EEEEec-ceEEEEe-ecCcEEE--EeecccccchhhhcCC--cEEEEEEEEe-cc Q lcl|NC_011054. 213 GFG-TYFN---ANGAW-PVGVA-E-ALVVDS-SRVRIGV-RQDITVK--FLDQATVGSINLAERD--MIALRLKARF-AY 278 (302) Q Consensus 213 g~p-~~~~---~~~~~-~~~~~-~-~~~gd~-~~~~~~~-~~~~~i~--~~~~~~~~~~~~~~~~--~~~~r~~~r~-d~ 278 (302) -+| +.+. +.... ..+.. . .++.+- .....+. ....... +...... .. .+.. .....+..|+ |. T Consensus 298 n~Pnl~i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~--l~-vq~~~~~~~~~~~~rt~Gv 374 (388) T protein:vir:99 298 TYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVT--LG-VEKRVKNYVEAYSNATAGV 374 (388) T ss_pred hcCCcEEEEecccccccccCCceeEEEEecccccccccCccCcceeEEeccccccc--cc-ceecCceeEeccccceeee Confidence 122 1111 11111 11111 1 111110 0000000 0000000 0000000 00 0111 1122334454 44 Q ss_pred EEeccccEEEEeee Q lcl|NC_011054. 279 VLGNGATAVGDNKT 292 (302) Q Consensus 279 ~v~~~~a~~~lt~~ 292 (302) .+.+|.+|+++++. T Consensus 375 ~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 375 MLKRPWAVVRLIGL 388 (388) T ss_pred EEeccchhheeccC Confidence 56789999999988 No 179 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=96.69 E-value=0.00041 Score=39.16 Aligned_cols=270 Identities=10% Similarity=-0.020 Sum_probs=123.4 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhh-hcc--eeecCCCceEEEEEeCCcceeeeccccccccccccccccce Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQ-AFP--TVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATW 77 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~-~~~--~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f 77 (302) .|+-+-.-+-...-+-+...+-+.+...+--.. +++ .....+++++||......-..+- -+.....+. -..+. T Consensus 30 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkIp~i~~~gl~DY~-R~~g~~~g~---vt~~~ 105 (329) T protein:vir:10 30 FANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTVIKGDVTELKDYK-RNATNEFDH---PQIQE 105 (329) T ss_pred hcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEEeeeccccccccc-CCCCccccc---cccce Confidence 333222212122222233333333322221111 122 34456788999988654333321 111111111 12344 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcchH--HHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecc Q lcl|NC_011054. 78 ADRTLVAEEVAVIIPVHENVVDDAST--SLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVP 155 (302) Q Consensus 78 ~~i~l~~~ki~~~~~iS~ell~ds~~--~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~ 155 (302) ...+++..|.-.+..=..+. +++.. .+...+.+.....++..+|...+.---+..+ .... T Consensus 106 ~t~tidqdR~~~F~VD~~D~-dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~-----------------~~~~ 167 (329) T protein:vir:10 106 TTYFLDQEKYWGRFVDALDR-RDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKA-----------------KHLT 167 (329) T ss_pred eEEEeecccceeeecchhhH-hhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhcc-----------------cccc Confidence 45566666655544211111 22222 2344556667777777888776631000000 0111 Q ss_pred ccchHHHHHHHhhhhhhhhhhccc-CccEEEecHHHHHHHHhhh------cCCCceeeec--ccccCcceEeecccccCC Q lcl|NC_011054. 156 GDANEDDLIGCINRASKAVAAAGY-MPDTLLASLGFRFDVANLR------DANGNPIFRD--ESFNGFGTYFNANGAWPV 226 (302) Q Consensus 156 ~~~~~~~~~~~i~~~~~~~~~~~~-~~~~~v~~~~~~~~l~~l~------d~~g~~i~~~--~~~~g~p~~~~~~~~~~~ 226 (302) ...+.+..++.+.++...+..... .+.+++++|..+..|.+-. +.+...+.+. ..++|.++..+.. ... T Consensus 168 ~~~t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~~~~g~Vg~idG~~Ii~vps--~~~ 245 (329) T protein:vir:10 168 VGSGADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKFVIELPQGDNRQQVLGKGVQGELDGFTIVKVPS--KML 245 (329) T ss_pred cccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeeeeecCeEEEEecC--Ccc Confidence 223455666667777776665432 3456789999999997522 1122222222 3467777665432 122 Q ss_pred CcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 227 GVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 227 ~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) .+..++++..+.......-. .++..+.... ++.-.++.+.++|..|.+|++.........+..+.+++ T Consensus 246 k~in~ii~~~~A~~~~~K~~-~~~~~~p~~~-------~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a~~~~~~~ 313 (329) T protein:vir:10 246 QGVEAMAVIGEVMASPIQAN-EAKLNSNVPG-------MFGTLAEQMLYTGAFVPEHLQKYIFTIGGKEVETNRDG 313 (329) T ss_pred cceeEEEEcCCceeeeeeee-eeeeeCCCCc-------cchheeeeeeeeeeEEEccccCEEEEecccCcccCCCC Confidence 33345666655443332211 2333322111 12346888999999999998544333222222222222 No 180 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=96.64 E-value=0.00018 Score=41.16 Aligned_cols=274 Identities=8% Similarity=0.008 Sum_probs=130.0 Q ss_pred CC-----CccCCCcceecchHHH----HHHHHHHHhhhhhhhhcceeecC---CCceEEEEEeCCcceeeeccccccccc Q lcl|NC_011054. 1 MA-----DISRSEVATLIQEAYA----NDLLASAKKGSTVLQAFPTVNMG---TKTTHLPVLATLPGASWVSESATEPEG 68 (302) Q Consensus 1 Ma-----~~t~~~~g~liP~~~~----~~ii~~~~~~s~l~~~~~~~~~~---~~~~~~p~~~~~~~a~~v~E~~~~~~~ 68 (302) |= -.++.+.| ||-.+. ..+++-+...-....++.+...+ .....|++......|.+++-+...|- T Consensus 63 mDa~~~~~~t~~~~g--~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~Pl- 139 (382) T protein:vir:96 63 MDSNFTAPVTTPSIP--TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPL- 139 (382) T ss_pred cccccCCccccCCcc--HHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecccceEEeecccCCCc- Confidence 21 11122222 465554 44455555555555555554432 23557777777788888877765543 Q ss_pred cccccccceeeEEeeeeeEEEeehh-HHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHhhcccCC--C---cccccccc Q lcl|NC_011054. 69 VKPTSEATWADRTLVAEEVAVIIPV-HENVVDDA--STSLLEEIAALGGQAIGKKLDQAVIFGTDK--P---SSWVSPAL 140 (302) Q Consensus 69 ~~~~s~~~f~~i~l~~~ki~~~~~i-S~ell~ds--~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~---~g~~~~~~ 140 (302) ...+.++.+.++.. +...+.+ ..|+.+.+ ..++.+.-....++++.+.+++-.|+|+.. . .|+++... T Consensus 140 --~d~~~~~~~r~v~~--~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~ 215 (382) T protein:vir:96 140 --TSWNANFERRTIVR--GELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPN 215 (382) T ss_pred --cccccceeEEEEEE--EEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCC Confidence 22345565555444 4444555 45655532 356667777888889999999999999632 2 25555443 Q ss_pred cccccccccceeeccccchHHHHHHHhhhhhhhhhhccc-------CccEEEecHHHHHHHHhhhcCCCceeeecccccC Q lcl|NC_011054. 141 LPAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGY-------MPDTLLASLGFRFDVANLRDANGNPIFRDESFNG 213 (302) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~-------~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~~~g 213 (302) .+.... ..+..-...+.+.+++++..++..+..... .+..+++.++.+..|..- +..|.-+++- ...- T Consensus 216 l~a~~t---~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~-n~~g~Tvl~~-lk~n 290 (382) T protein:vir:96 216 LPPFQT---PPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVT-TPYGISVSDW-IEQT 290 (382) T ss_pred cccccc---cCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhcccc-CccCccHHHH-HHHh Confidence 332211 112223345778889999998888864332 122577899988888542 3333222110 0000 Q ss_pred cc-eEeeccccc---C-CCcceEEEEecceEEEEeecCcEE--EEeecc--cccc--------hhhhcC-CcEEEEEEEE Q lcl|NC_011054. 214 FG-TYFNANGAW---P-VGVAEALVVDSSRVRIGVRQDITV--KFLDQA--TVGS--------INLAER-DMIALRLKAR 275 (302) Q Consensus 214 ~p-~~~~~~~~~---~-~~~~~~~~gd~~~~~~~~~~~~~i--~~~~~~--~~~~--------~~~~~~-~~~~~r~~~r 275 (302) +| +.+..-+-. . .+++. ....+-....+.. ..+++. ...+ .....+ -........| T Consensus 291 ~Pnl~i~t~peL~~a~~~g~g~------~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~ 364 (382) T protein:vir:96 291 YPKMRIVSAPELSGVQMQGKTP------EDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNG 364 (382) T ss_pred cCCcEEEEccccccccCCCccc------eeEEEEecchhhhhcccccccCcceeccccceeeeccceeecceeEeccccc Confidence 12 111111100 0 00000 0000000000000 000000 0000 000000 0011122233 Q ss_pred -eccEEeccccEEEEeee Q lcl|NC_011054. 276 -FAYVLGNGATAVGDNKT 292 (302) Q Consensus 276 -~d~~v~~~~a~~~lt~~ 292 (302) .|..+.+|.+|+++++. T Consensus 365 t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 365 TAGALCKRPWAVVRYLGI 382 (382) T ss_pred eeeeEEEcchhhhhccCC Confidence 45556889999999988 No 181 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=96.41 E-value=0.00055 Score=38.45 Aligned_cols=285 Identities=12% Similarity=0.081 Sum_probs=144.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhc----ceeecCC-CceEEEEEeC-Ccceeeeccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAF----PTVNMGT-KTTHLPVLAT-LPGASWVSESATEPEGVKPTSE 74 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~----~~~~~~~-~~~~~p~~~~-~~~a~~v~E~~~~~~~~~~~s~ 74 (302) |-....++-..+-=...+..+.+.+-..++|+... ++.+..+ .++..|.+-. ..++.|..-.....-. -. T Consensus 1 mp~~~lsel~t~tl~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l~y~~~s~~~wy~Gyd~l~~~----p~ 76 (321) T protein:vir:34 1 MPFPNISDIITTTIESRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEELSFSGNSNGGWYSGYDVLPTA----PQ 76 (321) T ss_pred CCCchHHHHHHHHHHhhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEEeeccCcceeEEEeeeeeccc----hh Confidence 55433222222222222455666667777766543 3334444 4677776655 7889997544433321 13 Q ss_pred cceeeEEeeeeeEEEeehhHH-HHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhc-ccCCC----cccccc-ccccc Q lcl|NC_011054. 75 ATWADRTLVAEEVAVIIPVHE-NVVDDAS----TSLLEEIAALGGQAIGKKLDQAVIF-GTDKP----SSWVSP-ALLPA 143 (302) Q Consensus 75 ~~f~~i~l~~~ki~~~~~iS~-ell~ds~----~~~~~~i~~~l~~ai~~~~d~~~l~-G~g~~----~g~~~~-~~~~~ 143 (302) -.|+..++..+.+++-+.||- |+++.+. .+|...=.+...+.+...+|..+.. |++.+ .|+... ...+. T Consensus 77 d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~sdGTa~g~~~i~GL~~lv~~~p~ 156 (321) T protein:vir:34 77 DVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYGDGTAFGGRAINGLDGAVPVDPT 156 (321) T ss_pred hhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhccccccccchhhhhhhhcccCCC Confidence 478889999999999988886 5665543 3454444555667788888888875 55421 111100 00000 Q ss_pred ccccc-----------cceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeeccccc Q lcl|NC_011054. 144 AVAAN-----------QDYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDESFN 212 (302) Q Consensus 144 ~~~~~-----------~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~~~ 212 (302) +...+ +..+...+..+...+...+.++-.+.--....|+.|++....|...+.-.-...|+--....-. T Consensus 157 tGtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~PDlii~~~~~y~~y~~s~q~~qR~~~~~~a~~ 236 (321) T protein:vir:34 157 VGTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMPDLIMSGNDAWTTYSNSLQVLQRFTSAEEANL 236 (321) T ss_pred CceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCccEEEechHHHHHHHHhhheeeeecccccccc Confidence 10000 1122222333445555555555555444556789999999999988775444334332222222 Q ss_pred Cc------ceEeecccc--cCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccc Q lcl|NC_011054. 213 GF------GTYFNANGA--WPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGA 284 (302) Q Consensus 213 g~------p~~~~~~~~--~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~ 284 (302) |+ .+.++-++. ........+|-|.+++.+....+-.+...+...... .-+|.+.-....+....+-++. T Consensus 237 Gf~~Lky~~~div~D~~~g~~~pan~~yfiNT~yl~~r~h~~~~~~pi~p~r~~~---~NqdA~~q~I~~~GnL~~sn~~ 313 (321) T protein:vir:34 237 GFRSLKFLSTDVVLDGGIGGFAGANTMYFLNTKYLHFRPHKDRNMVPLSPSRRAA---FNQDAEAQILAWAGNLTCSGAQ 313 (321) T ss_pred cceeeeeeeEEEEEeCCCCCCccccceeeeecceEEEEEcCCCceeecCcccccc---cchhHHhhhhhhhheeeeeccc Confidence 22 122222222 224455678888888877765554444444332100 0111111122223344445555 Q ss_pred cEEEEeee Q lcl|NC_011054. 285 TAVGDNKT 292 (302) Q Consensus 285 a~~~lt~~ 292 (302) +=.++..- T Consensus 314 ~~~vL~~~ 321 (321) T protein:vir:34 314 FQGRLIAE 321 (321) T ss_pred ceeEEeeC Confidence 54444433 No 182 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=96.39 E-value=0.00022 Score=40.60 Aligned_cols=276 Identities=11% Similarity=-0.000 Sum_probs=134.4 Q ss_pred CCCccCCCccee-------cchHHH----HHHHHHHHhhhhhhhhcceeecC---CCceEEEEEeCCcceeeeccccccc Q lcl|NC_011054. 1 MADISRSEVATL-------IQEAYA----NDLLASAKKGSTVLQAFPTVNMG---TKTTHLPVLATLPGASWVSESATEP 66 (302) Q Consensus 1 Ma~~t~~~~g~l-------iP~~~~----~~ii~~~~~~s~l~~~~~~~~~~---~~~~~~p~~~~~~~a~~v~E~~~~~ 66 (302) |+.-..-.++.+ ||..+. ..+++.+........++.+.+.+ .....++.......+.+.+..... T Consensus 31 ~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~~G~a~~ygd~~d~- 109 (336) T protein:vir:10 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSD- 109 (336) T ss_pred HHHhhhhhccccccCCCcchHHHHHhhcCcceeeeeechhchhhhcccccCCCcceeeEEEEeeeeeeeEEEccccCCC- Confidence 222111001111 333222 23333333333334444433322 133555666666666776655443 Q ss_pred cccccccccceeeEEeeeeeEEEeehhHH-HHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCCC--ccccccccc Q lcl|NC_011054. 67 EGVKPTSEATWADRTLVAEEVAVIIPVHE-NVVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDKP--SSWVSPALL 141 (302) Q Consensus 67 ~~~~~~s~~~f~~i~l~~~ki~~~~~iS~-ell~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~--~g~~~~~~~ 141 (302) |.++..-+.-.-+.+.++..+.++. |+..- ...++.+.-+...++++.+.+++-.+.|+..- .|+++.... T Consensus 110 ----P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l 185 (336) T protein:vir:10 110 ----GDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSL 185 (336) T ss_pred ----cceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeecccceEEEeecCCC Confidence 4455445555666777888888885 44332 23577777888888888888898888887642 455554444 Q ss_pred ccccccccceeeccccchHHHHHHHhhhhhhhhhhccc------CccEEEecHHHHHHHHhhhcCCCceeeecccccCcc Q lcl|NC_011054. 142 PAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGY------MPDTLLASLGFRFDVANLRDANGNPIFRDESFNGFG 215 (302) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~------~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~~~g~p 215 (302) +...+. .+..-+..+.+.+++++..++..+..... .+..+++.++.+..|.. ++..|.-+++- ....+| T Consensus 186 ~a~~t~---~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~~-lk~n~P 260 (336) T protein:vir:10 186 SAPITA---TTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFP 260 (336) T ss_pred Cccccc---CcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHHH-HHHhCC Confidence 322211 11111235678899999999888754331 24578999999999964 33334322110 000122 Q ss_pred -eEeecccccC--CCcceEEEE-ecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccE-EeccccEEEEe Q lcl|NC_011054. 216 -TYFNANGAWP--VGVAEALVV-DSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYV-LGNGATAVGDN 290 (302) Q Consensus 216 -~~~~~~~~~~--~~~~~~~~g-d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~-v~~~~a~~~lt 290 (302) +.+..-+-.. .++...++. +... ..-+.+.+......-.+ ....-.....+..|+++. +.+|.+|++++ T Consensus 261 nl~i~t~pel~~Agg~~~~~~~~~~~~-----~~t~~~~~P~~f~~lpv-q~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~ 334 (336) T protein:vir:10 261 KLEFVTIPEYDTASGRLVQLWAPRVEG-----KDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQML 334 (336) T ss_pred ccEEEEcccccccCCceEEEEEecccC-----CcceeeecChhhhccce-eecCceeEeccccceeeeeeeccchheeec Confidence 2222211111 111111111 1100 00011111111000000 011122334556666665 46799999999 Q ss_pred ee Q lcl|NC_011054. 291 KT 292 (302) Q Consensus 291 ~~ 292 (302) +. T Consensus 335 GI 336 (336) T protein:vir:10 335 GV 336 (336) T ss_pred cC Confidence 88 No 183 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=96.12 E-value=0.00098 Score=37.08 Aligned_cols=274 Identities=10% Similarity=-0.031 Sum_probs=118.0 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhc-------ceeecCCCceEEEEEeCCc----ceeeecccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAF-------PTVNMGTKTTHLPVLATLP----GASWVSESATEPEGV 69 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~-------~~~~~~~~~~~~p~~~~~~----~a~~v~E~~~~~~~~ 69 (302) ||-...- +..+.+....++.+.+...+.... ...++.+...++|.+..-. +..-+.+...++... T Consensus 1 m~lsD~~----vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~k 76 (325) T protein:vir:95 1 MALSDLA----VYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKV 76 (325) T ss_pred Cchhhhh----hhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCceeccce Confidence 7644432 245555566677766654444432 2234456666788665321 111222222222111 Q ss_pred ccccccceeeEEeeeeeEEEeehhHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccc Q lcl|NC_011054. 70 KPTSEATWADRTLVAEEVAVIIPVHENVVD---DASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVA 146 (302) Q Consensus 70 ~~~s~~~f~~i~l~~~ki~~~~~iS~ell~---ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~ 146 (302) . .+..++..+..+-.+......+.+. +....+...|.+.++++..+.+-+.+|.+.....+... ..+.. T Consensus 77 i----tt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~----~~v~d 148 (325) T protein:vir:95 77 L----KHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVS----DVVYD 148 (325) T ss_pred e----ccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----cceee Confidence 1 1334444444443333333333222 22334555666666666655555555532211000000 00011 Q ss_pred cccceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeec------ccccCcceEeec Q lcl|NC_011054. 147 ANQDYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRD------ESFNGFGTYFNA 220 (302) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~------~~~~g~p~~~~~ 220 (302) +...........+. +.+.++..++-.....-..|+||...+..|.++.-.+...++.. ++..|.++.+.+ T Consensus 149 is~~~~~~~~~~s~----~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~~~i~t~~G~~VIVdD 224 (325) T protein:vir:95 149 ATANTDAADKLPTW----NNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTVNVVRDPFGKLLVMTD 224 (325) T ss_pred eecccCcccccccH----HHHHHHHHHhcccccceeEEEEchHHHHHHHHhhccccccccccCCcccccccCCcEEEEeC Confidence 11111111111222 34555666665666667789999999999987543333222221 256677887776 Q ss_pred ccccCCCc-c----eEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccc Q lcl|NC_011054. 221 NGAWPVGV-A----EALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVG 295 (302) Q Consensus 221 ~~~~~~~~-~----~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~ 295 (302) .+...... . ..+||. ..+.++...+......+.. .-+.....+|++.. -+++|..+.. +++ .+ T Consensus 225 ~~p~~~~g~~~~ytty~lg~-GAi~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~t---f~lhp~G~sw-~~s-~~ 292 (325) T protein:vir:95 225 SPNLFAAGTPNVYHILGLVP-GGVLIGQNNDFDANEETKN------GDENIIRTYQAEWS---YNIGVKGFAW-DKA-NG 292 (325) T ss_pred CCCCCCccCceeEEEEEEec-CeEEecCCCCccccccccC------cccceeeeeeeeee---EEeecceeee-ecc-cc Confidence 65433211 1 111221 1122222222221111111 01222233443322 3478888766 333 33 Q ss_pred ccCCCCC Q lcl|NC_011054. 296 AVVPDGS 302 (302) Q Consensus 296 ~~~p~~~ 302 (302) ..+|.-+ T Consensus 293 g~sPt~a 299 (325) T protein:vir:95 293 GKSPTDA 299 (325) T ss_pred cCCcChH Confidence 3467665 No 184 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=96.07 E-value=0.001 Score=36.92 Aligned_cols=285 Identities=10% Similarity=0.024 Sum_probs=126.5 Q ss_pred CCCccC--CCcceecchHHHHHHHHHHHhhhhhhhhcce---------eecCCCceEEEEEeCC-cceeeeccccccccc Q lcl|NC_011054. 1 MADISR--SEVATLIQEAYANDLLASAKKGSTVLQAFPT---------VNMGTKTTHLPVLATL-PGASWVSESATEPEG 68 (302) Q Consensus 1 Ma~~t~--~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~---------~~~~~~~~~~p~~~~~-~~a~~v~E~~~~~~~ 68 (302) |+.... .-.-.++|+.+..-+.+...+.+.|++-.=. ...++...++|.+..- ....-+.+.....+ T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~- 79 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVE- 79 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCccc- Confidence 986553 2233578888877676666666665543211 2245667899987532 22222222221111 Q ss_pred cccccccc-eeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc---c---cCCCccc--c--- Q lcl|NC_011054. 69 VKPTSEAT-WADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIF---G---TDKPSSW--V--- 136 (302) Q Consensus 69 ~~~~s~~~-f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~---G---~g~~~g~--~--- 136 (302) .+..+.+ ..++....+ .+.....++-.-.-+..+..+.|.+++++-..+...+.+|. | ....... . T Consensus 80 -~t~~kittg~~~a~v~~-r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~ 157 (367) T protein:vir:80 80 -APIDGLGSGEMKTTKTW-LNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) T ss_pred -ccccccccchheeeeeh-hcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhh Confidence 0001111 111111111 12222333322222445788999999998888777777664 1 1100000 0 Q ss_pred ---cccccc--cccccccc--eeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhh------hcCCCc Q lcl|NC_011054. 137 ---SPALLP--AAVAANQD--YTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANL------RDANGN 203 (302) Q Consensus 137 ---~~~~~~--~~~~~~~~--~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l------~d~~g~ 203 (302) ...... .......+ .......... +.+.++...+-.....-+.++||+..+..|+++ ++++|. T Consensus 158 ~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~----~~~~~A~~~lGD~~~~l~~i~mHS~V~~~L~~~~li~~i~~sd~~ 233 (367) T protein:vir:80 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNR----EAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ 233 (367) T ss_pred hccccccccccCceeeeeeccCCCccceecH----HHHHHHHHHhccccccccEEEEchHHHHHHHhccccccccCCCCc Confidence 000000 00000000 0011122233 334445555555555667899999999999875 355552 Q ss_pred eeeecccccCcceEeecccccCCC--cc---eEEEEecceEEEEeecC-cEEEEeecccccchhhhcCCcEEEEEEEEec Q lcl|NC_011054. 204 PIFRDESFNGFGTYFNANGAWPVG--VA---EALVVDSSRVRIGVRQD-ITVKFLDQATVGSINLAERDMIALRLKARFA 277 (302) Q Consensus 204 ~i~~~~~~~g~p~~~~~~~~~~~~--~~---~~~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d 277 (302) . .=++..|+.+.+.+.+..... .. ..+||. ..+.++.... ..+++.++.... -..++-.+..+.| T Consensus 234 ~--~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~-GAi~~~~~~~~~~~E~~Rd~~~~----~~gG~d~L~~Rr~-- 304 (367) T protein:vir:80 234 L--TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRRELRG----NGSGLEYILERKE-- 304 (367) T ss_pred c--ccceecceeEEEeCCCcccccCCCceEEEEEEec-ceeeecccCCccceecccchhhh----cCCceEEEEeeee-- Confidence 1 124566777766655443211 11 123332 1222333221 223444433210 0123334444444 Q ss_pred cEEeccccEEEEeeecc---cccCCCCC Q lcl|NC_011054. 278 YVLGNGATAVGDNKTPV---GAVVPDGS 302 (302) Q Consensus 278 ~~v~~~~a~~~lt~~~a---~~~~p~~~ 302 (302) .+.+|..+.......+ ...+|.|+ T Consensus 305 -~~~hP~G~s~~~~~v~~~~~~~~~~~~ 331 (367) T protein:vir:80 305 -WIVHPGGFNWLDADVTIPDNTGSPSGI 331 (367) T ss_pred -EEeecceeeeccccccccccccccccc Confidence 4678887755443221 12223332 No 185 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=95.68 E-value=0.0016 Score=35.87 Aligned_cols=280 Identities=9% Similarity=0.041 Sum_probs=125.1 Q ss_pred CCCccCCCcceecch--HHHHHHHHHHHhhhhhhhhcc---------eeecCCCceEEEEEeCC---cceeeecccc--c Q lcl|NC_011054. 1 MADISRSEVATLIQE--AYANDLLASAKKGSTVLQAFP---------TVNMGTKTTHLPVLATL---PGASWVSESA--T 64 (302) Q Consensus 1 Ma~~t~~~~g~liP~--~~~~~ii~~~~~~s~l~~~~~---------~~~~~~~~~~~p~~~~~---~~a~~v~E~~--~ 64 (302) ||.+..+|. .+|+ .+..-+.+...+.+.|++-.= ....++...++|.+..- .+..+...+. . T Consensus 1 Ma~T~l~D~--iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~ 78 (349) T protein:vir:78 1 MAITTIGDI--VTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CCceEEeee--eccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 996665543 5676 355555555555555544321 11234667899987531 2222211111 1 Q ss_pred cccccccccccceeeEEeeeeeEEEe--ehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccc- Q lcl|NC_011054. 65 EPEGVKPTSEATWADRTLVAEEVAVI--IPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALL- 141 (302) Q Consensus 65 ~~~~~~~~s~~~f~~i~l~~~ki~~~--~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~- 141 (302) .+.+.. .+..++....++-.++ ..++.++ |..+..+.|.+++++...+...+.+|.= -+|++..... T Consensus 79 ~t~~ki----tt~~~~a~~~~r~kaw~~~Dla~~l---sG~dpm~~Ia~~va~yW~r~~q~~Lia~---L~Gvf~~~~~a 148 (349) T protein:vir:78 79 ATPRAI----QTGEMMARVAYLNEGFGQADLTVEL---TSQNPLQSVASRLDNFWQRQAQRRLIAT---ALGLYNDNVSA 148 (349) T ss_pred cccccc----cccceeeeeeeeccccchhHHHHHh---hCchHHHHHHHHHHHHHhhHHHHHHHHH---HHHhhcccccc Confidence 111111 1233333333333332 2334333 3457788999999988888877777641 0111110000 Q ss_pred -cccccccc-cee-eccccchHHHHHHHhhhhhhhhhh-cccCccEEEecHHHHHHHHhhh------cCCCceeeecccc Q lcl|NC_011054. 142 -PAAVAANQ-DYT-IVPGDANEDDLIGCINRASKAVAA-AGYMPDTLLASLGFRFDVANLR------DANGNPIFRDESF 211 (302) Q Consensus 142 -~~~~~~~~-~~~-~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~~~v~~~~~~~~l~~l~------d~~g~~i~~~~~~ 211 (302) ......+. +.. ......+...+.+....+-..... ....-+.++||+..+..|++++ +.+|..- =++. T Consensus 149 ~~~~~~~~~~t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~~~~--i~ty 226 (349) T protein:vir:78 149 TDAYHEQNDMVVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAENNTM--FATY 226 (349) T ss_pred cchhhhcccceeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhhhccCcccCcc--ccee Confidence 00000000 000 111123344444443333332211 2334467999999999998753 4444311 1456 Q ss_pred cCcceEeecccccCCC-c----ceEEEEecceEEEEeecC-cEEEEeecccccchhhhcCCcEEEEEEEEeccEEecccc Q lcl|NC_011054. 212 NGFGTYFNANGAWPVG-V----AEALVVDSSRVRIGVRQD-ITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGAT 285 (302) Q Consensus 212 ~g~p~~~~~~~~~~~~-~----~~~~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a 285 (302) +|..+.+.+.++.... . ...+||. ..+.++.-+. ..++..++.... -..++..+..+.++- .+|.. T Consensus 227 ~G~~VivDD~~Pv~~~g~~~~yttylfg~-GAi~~~~~~~~~~~et~rd~~~g----~~~G~d~l~~R~~~~---~hp~G 298 (349) T protein:vir:78 227 QGYRVIVDDSMTVVGQGAQRKFISIIFGQ-GAIGYGEGNPVMPLEYEREASRA----NGGGVETLWTRKTWL---LHPFG 298 (349) T ss_pred cCeEEEEeCCCccccCCCCceEEEEEeec-ceEEEccCCCccceeeecccccC----CcceeEEEEEeeEEE---eeeee Confidence 7777766665543321 1 1124442 2233443221 234444443211 123455566666654 66766 Q ss_pred EEEEeeecccc------cCCCCC Q lcl|NC_011054. 286 AVGDNKTPVGA------VVPDGS 302 (302) Q Consensus 286 ~~~lt~~~a~~------~~p~~~ 302 (302) +.......+.. ..|.=+ T Consensus 299 ~s~~~a~v~~~~~~~~~~sPt~a 321 (349) T protein:vir:78 299 YRFTSAVITGNGTETIARSASWQ 321 (349) T ss_pred eeeccccccCCccccccCCCChH Confidence 65444322211 122211 No 186 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=95.60 E-value=0.0018 Score=35.66 Aligned_cols=264 Identities=10% Similarity=-0.048 Sum_probs=126.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceee-----cCCCceEEEEEeCCcceeeecccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVN-----MGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEA 75 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~-----~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~ 75 (302) ||... ...|-|+.|+.++++.+++++++.+++..-. -.++.++||+..... +.++....-+ +. T Consensus 1 m~~~~---N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~----v~dg~~~~~~-----~~ 68 (418) T protein:vir:10 1 MAVQD---NNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVK----SASGRTLVKQ-----PM 68 (418) T ss_pred CCccc---cccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCcee----ecccCCcccc-----cc Confidence 88533 3356699999999999999999888875421 124578888633211 2233333221 11 Q ss_pred ceee--EEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceee Q lcl|NC_011054. 76 TWAD--RTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTI 153 (302) Q Consensus 76 ~f~~--i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~ 153 (302) +=.. ++++.+|.. .+.++++-..++..++.+.+.+...++++..+|..++.-- . ......++. T Consensus 69 te~~v~l~id~~k~~-~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~-~----------~a~~~~gt~--- 133 (418) T protein:vir:10 69 VDQTIPFKIAYQEHV-GLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTL-K----------KAFHSSGTP--- 133 (418) T ss_pred ccceEEEEEeccccc-ceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH-h----------hcccccccC--- Confidence 2233 444444444 4455555444567788888889999999999999887310 0 000001100 Q ss_pred ccccchHHHHHHHhhhhhhhhhhcccC---ccEEEecHHHHHHHHhhh----cCCCc-eeeecc---cccCcceEeeccc Q lcl|NC_011054. 154 VPGDANEDDLIGCINRASKAVAAAGYM---PDTLLASLGFRFDVANLR----DANGN-PIFRDE---SFNGFGTYFNANG 222 (302) Q Consensus 154 ~~~~~~~~~~~~~i~~~~~~~~~~~~~---~~~~v~~~~~~~~l~~l~----d~~g~-~i~~~~---~~~g~p~~~~~~~ 222 (302) .+.... ++.+.++...+.....+ ..+.+++|..+..|.+-. +..+. -.++.. .+.|+.++...+. T Consensus 134 gt~~~~----~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G~IG~i~GF~V~~S~ni 209 (418) T protein:vir:10 134 GVRPGA----FIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMGYRGNVAAYEVYESQNL 209 (418) T ss_pred CcCcch----HHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhheeeeeeeeceEEEEecCC Confidence 011112 34455566556554443 245679999988875321 11100 000000 1111111111100 Q ss_pred c----------------------------cCCCcceEEEEe--------------------cceEEEE-ee-----cCcE Q lcl|NC_011054. 223 A----------------------------WPVGVAEALVVD--------------------SSRVRIG-VR-----QDIT 248 (302) Q Consensus 223 ~----------------------------~~~~~~~~~~gd--------------------~~~~~~~-~~-----~~~~ 248 (302) . .....+.+..|| ..+|.+- +. ++.. T Consensus 210 p~~tag~~~~t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~t 289 (418) T protein:vir:10 210 PKHTVGDHGGTPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGS 289 (418) T ss_pred CcccccccccceeeecccccceeEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcce Confidence 0 000001111111 1111110 00 0001 Q ss_pred EEEe---------------------------------------------------------------------------- Q lcl|NC_011054. 249 VKFL---------------------------------------------------------------------------- 252 (302) Q Consensus 249 i~~~---------------------------------------------------------------------------- 252 (302) |.++ T Consensus 290 v~i~p~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~~a~~l~~~~l~~p~g~~~~~~~ 369 (418) T protein:vir:10 290 IKISPSLNDGTATINNENGDPVSLTAYQNVTALPADNAPITVLGAANTTYEQNYLFHRDAIALAMIDLELPQSAVIKSRA 369 (418) T ss_pred eEeccccccccccccccccccccccCCCcccccccCcceeeeecccccceeeeeeeecceEEEEEeeccCCCCCCcceEE Confidence 1100 Q ss_pred -e-cccccc----hhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccc Q lcl|NC_011054. 253 -D-QATVGS----INLAERDMIALRLKARFAYVLGNGATAVGDNKTPVG 295 (302) Q Consensus 253 -~-~~~~~~----~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~ 295 (302) + +.+.+. .-...++...+|...-+|++.++|+-.+++-+..++ T Consensus 370 ~~~~~G~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~~~ 418 (418) T protein:vir:10 370 ADPETGLSLTLTGAYDINEQSEIHRIDAVWGADMIYGELALRLWGAASS 418 (418) T ss_pred EeccCCeEEEEEEcccccccceEEEEEeecCceeecccceEEEEeecCC Confidence 0 000000 001123445556677788999999999999988766 No 187 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=95.44 E-value=0.0021 Score=35.31 Aligned_cols=264 Identities=9% Similarity=-0.057 Sum_probs=105.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcce-------eecCCCceEEEEEe-CCc-ceeeecccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPT-------VNMGTKTTHLPVLA-TLP-GASWVSESATEPEGVKP 71 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~-------~~~~~~~~~~p~~~-~~~-~a~~v~E~~~~~~~~~~ 71 (302) ||.+-.+|-- +.-+.+..-.+|.+++...+++.... .++.+.-...+... ++. ...-+.....+..+.+ T Consensus 1 ~~~t~~sdl~-vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~ki- 78 (315) T protein:vir:96 1 MATTVNSDLV-IYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKI- 78 (315) T ss_pred Cceeeeccee-eehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceec- Confidence 9988888764 45566667778888776665554321 12233322222111 110 0000111111111111 Q ss_pred ccccceeeEEeeeeeEEEeehhHHHHHhcch---HHHHHHHHHHHHHHHHHHHHHHhhcccC-CCccccccccccccccc Q lcl|NC_011054. 72 TSEATWADRTLVAEEVAVIIPVHENVVDDAS---TSLLEEIAALGGQAIGKKLDQAVIFGTD-KPSSWVSPALLPAAVAA 147 (302) Q Consensus 72 ~s~~~f~~i~l~~~ki~~~~~iS~ell~ds~---~~~~~~i~~~l~~ai~~~~d~~~l~G~g-~~~g~~~~~~~~~~~~~ 147 (302) + +...+..+..--.+-+..+.+.+.... ..+..-|.+.+..++.+.+=...+.|.- .-.+ +.. T Consensus 79 -t--~~~dvaVk~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai~~----------~t~ 145 (315) T protein:vir:96 79 -A--ADEMVSVKVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAIGS----------NAG 145 (315) T ss_pred -c--cccceeEEEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcc----------ccc Confidence 0 111222221111122334444444322 3333344444444444444443333211 0000 000 Q ss_pred ccceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhh-----hcCCCce-eee-cccccCcceEeec Q lcl|NC_011054. 148 NQDYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANL-----RDANGNP-IFR-DESFNGFGTYFNA 220 (302) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l-----~d~~g~~-i~~-~~~~~g~p~~~~~ 220 (302) .. ........+.. .+.++..++-.....-..|+||...+..|.+- ..+++.- ++. ++...|.++.+.+ T Consensus 146 ~~-~~~~~a~~~~~----~l~dA~~klGD~~~~l~~~vMHS~v~~~L~~q~L~~~~~~~~~~~~~~~~~~~lGkrViVdD 220 (315) T protein:vir:96 146 MN-VSGELATEGKK----VLTKGLRTMGDKASSIAIWVMDSTSYFDIVDEAIDNKLYEEAGVVVYGGTPGTLGKPVLVTD 220 (315) T ss_pred cc-ccccccccCHH----HHHHHHHHhcccccCeeEEEEchHHHHHHHHhhhhhhcccccceeEecCcCcccccEEEEEC Confidence 00 01112223333 34455555656666677899999999999761 1122222 221 2334477776665 Q ss_pred ccccCCCcceEEEEec-ceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccE-EeccccEEEEeeecccccC Q lcl|NC_011054. 221 NGAWPVGVAEALVVDS-SRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYV-LGNGATAVGDNKTPVGAVV 298 (302) Q Consensus 221 ~~~~~~~~~~~~~gd~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~-v~~~~a~~~lt~~~a~~~~ 298 (302) .++. . -++|=- ..+.++...++.... +-..++-.+....|..|. ..+|..+.+-+ ++... T Consensus 221 ~~P~----~-~~~gl~~GAi~~~~~~~~~~~~----------~~~~g~e~l~~~~r~e~tf~l~p~G~sw~~---~~~~s 282 (315) T protein:vir:96 221 QCPA----T-KIFGLVAGAVMITESQAPGMRS----------YQIDDQENLAIGFRAEGTANVEVLGYKWKT---KTNVN 282 (315) T ss_pred CCCc----c-eeeeeecceeeecCCCcccccc----------ccCCCcceeEEEEeeeeEeeeeeeeEEeec---CCCcC Confidence 4321 1 111100 111122211110000 011122223334444442 46777776532 23345 Q ss_pred CCCC Q lcl|NC_011054. 299 PDGS 302 (302) Q Consensus 299 p~~~ 302 (302) |.-+ T Consensus 283 Pt~a 286 (315) T protein:vir:96 283 PASA 286 (315) T ss_pred CChH Confidence 6555 No 188 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=95.39 E-value=0.0022 Score=35.20 Aligned_cols=269 Identities=11% Similarity=-0.017 Sum_probs=127.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhh-h-cc--eeecCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQ-A-FP--TVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~-~-~~--~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) .|.-+-...-..+.+-++. +++.+.....+.. + ++ .....+++++||......-..+ .-+.....+. -..+ T Consensus 19 ~~~~~~~~nt~~l~~k~~~-~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY-~R~~g~~~g~---vt~~ 93 (319) T protein:vir:94 19 FANKSVEPGQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY-KRNATNEFDH---PKIE 93 (319) T ss_pred hhccCCCcchHHHHHHHHH-HHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccc-cCCCCcccCC---cccc Confidence 3433333222233333333 4444444444332 1 22 3345677899998875433222 1111111111 1223 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchH--HHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeec Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDAST--SLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIV 154 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~--~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~ 154 (302) ....+++..|.-.+..=..+ .+++.. .+...+.+...+.++-.+|...+.---+..+ ... T Consensus 94 ~~t~tidqdR~~~F~VD~~D-~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~-----------------~~~ 155 (319) T protein:vir:94 94 ETTYFLDQEKYWGRFVDALD-RKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA-----------------KHL 155 (319) T ss_pred eeEEEeecccccccccchhh-HhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcc-----------------ccc Confidence 44455555555444321111 122222 2334455666667777778766532110000 011 Q ss_pred cccchHHHHHHHhhhhhhhhhhccc-CccEEEecHHHHHHHHhhh----c--CCCceeeec--ccccCcceEeecccccC Q lcl|NC_011054. 155 PGDANEDDLIGCINRASKAVAAAGY-MPDTLLASLGFRFDVANLR----D--ANGNPIFRD--ESFNGFGTYFNANGAWP 225 (302) Q Consensus 155 ~~~~~~~~~~~~i~~~~~~~~~~~~-~~~~~v~~~~~~~~l~~l~----d--~~g~~i~~~--~~~~g~p~~~~~~~~~~ 225 (302) +...+.+..++.+.++...+..... .+.+++++|..+..|.+-. + .....+... ..++|.++..+.. .. T Consensus 156 ~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps--~~ 233 (319) T protein:vir:94 156 TVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPT--KL 233 (319) T ss_pred ccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecCeEEEEecc--cc Confidence 1223456667777777777766443 3456789999999996532 1 111222222 2466777654432 12 Q ss_pred CCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEE-eeecccccCCCCC Q lcl|NC_011054. 226 VGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGD-NKTPVGAVVPDGS 302 (302) Q Consensus 226 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l-t~~~a~~~~p~~~ 302 (302) ..+..+++|..+....... =-.+++++... .++.-.++.+.++|..|.+|++.... ......+..+.|| T Consensus 234 ~k~in~i~~h~~A~~~~~k-~~~~~~~~p~~-------~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~~~~~ 303 (319) T protein:vir:94 234 LQGLQAIAVVGEVLASPIQ-ADLAKTNSNIP-------GMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGV 303 (319) T ss_pred cccceEEEEcCCeeeeeee-eeeeeccCCCc-------cccceeeeeeeeeeeEEeccccceEEEeecCCcccCCCcc Confidence 2334456666544432221 11122222111 11224678889999999998743322 3344555567777 No 189 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=95.39 E-value=0.0022 Score=35.20 Aligned_cols=269 Identities=11% Similarity=-0.017 Sum_probs=127.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhh-h-cc--eeecCCCceEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQ-A-FP--TVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~-~-~~--~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) .|.-+-...-..+.+-++. +++.+.....+.. + ++ .....+++++||......-..+ .-+.....+. -..+ T Consensus 19 ~~~~~~~~nt~~l~~k~~~-~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY-~R~~g~~~g~---vt~~ 93 (319) T protein:vir:97 19 FANKSVEPGQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY-KRNATNEFDH---PKIE 93 (319) T ss_pred hhccCCCcchHHHHHHHHH-HHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccc-cCCCCcccCC---cccc Confidence 3433333222233333333 4444444444332 1 22 3345677899998875433222 1111111111 1223 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcchH--HHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeec Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDDAST--SLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIV 154 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~ds~~--~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~ 154 (302) ....+++..|.-.+..=..+ .+++.. .+...+.+...+.++-.+|...+.---+..+ ... T Consensus 94 ~~t~tidqdR~~~F~VD~~D-~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~-----------------~~~ 155 (319) T protein:vir:97 94 ETTYFLDQEKYWGRFVDALD-RKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA-----------------KHL 155 (319) T ss_pred eeEEEeecccccccccchhh-HhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcc-----------------ccc Confidence 44455555555444321111 122222 2334455666667777778766532110000 011 Q ss_pred cccchHHHHHHHhhhhhhhhhhccc-CccEEEecHHHHHHHHhhh----c--CCCceeeec--ccccCcceEeecccccC Q lcl|NC_011054. 155 PGDANEDDLIGCINRASKAVAAAGY-MPDTLLASLGFRFDVANLR----D--ANGNPIFRD--ESFNGFGTYFNANGAWP 225 (302) Q Consensus 155 ~~~~~~~~~~~~i~~~~~~~~~~~~-~~~~~v~~~~~~~~l~~l~----d--~~g~~i~~~--~~~~g~p~~~~~~~~~~ 225 (302) +...+.+..++.+.++...+..... .+.+++++|..+..|.+-. + .....+... ..++|.++..+.. .. T Consensus 156 ~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps--~~ 233 (319) T protein:vir:97 156 TVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPT--KL 233 (319) T ss_pred ccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecCeEEEEecc--cc Confidence 1223456667777777777766443 3456789999999996532 1 111222222 2466777654432 12 Q ss_pred CCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEE-eeecccccCCCCC Q lcl|NC_011054. 226 VGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGD-NKTPVGAVVPDGS 302 (302) Q Consensus 226 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l-t~~~a~~~~p~~~ 302 (302) ..+..+++|..+....... =-.+++++... .++.-.++.+.++|..|.+|++.... ......+..+.|| T Consensus 234 ~k~in~i~~h~~A~~~~~k-~~~~~~~~p~~-------~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~~~~~ 303 (319) T protein:vir:97 234 LQGLQAIAVVGEVLASPIQ-ADLAKTNSNIP-------GMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGV 303 (319) T ss_pred cccceEEEEcCCeeeeeee-eeeeeccCCCc-------cccceeeeeeeeeeeEEeccccceEEEeecCCcccCCCcc Confidence 2334456666544432221 11122222111 11224678889999999998743322 3344555567777 No 190 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=95.23 E-value=0.0025 Score=34.86 Aligned_cols=281 Identities=9% Similarity=0.063 Sum_probs=126.6 Q ss_pred CCCccCCCcceecch--HHHHHHHHHHHhhhhhhhhcce---------eecCCCceEEEEEeCC-c--ceeeeccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQE--AYANDLLASAKKGSTVLQAFPT---------VNMGTKTTHLPVLATL-P--GASWVSESATEP 66 (302) Q Consensus 1 Ma~~t~~~~g~liP~--~~~~~ii~~~~~~s~l~~~~~~---------~~~~~~~~~~p~~~~~-~--~a~~v~E~~~~~ 66 (302) ||.+..+| ..+|+ .+..-+.+...+.+.|++-.=. ...++...++|.+..- . +..+-+.. .. T Consensus 1 Ma~T~l~D--~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt-~~- 76 (349) T protein:vir:94 1 MAITTIGN--IVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDV-YQ- 76 (349) T ss_pred CCceEEee--eeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCC-cc- Confidence 99666554 35676 3555555555555665553211 1234667899977532 2 22111111 00 Q ss_pred cccccccc-cceeeEEeeeeeEEEe--ehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccc Q lcl|NC_011054. 67 EGVKPTSE-ATWADRTLVAEEVAVI--IPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPA 143 (302) Q Consensus 67 ~~~~~~s~-~~f~~i~l~~~ki~~~--~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~ 143 (302) +..+..+ .+..++....++-.++ ..++.++ +..+..+.|.+++++...+...+.+|.= -+|++....... T Consensus 77 -~~~t~~kit~~~~~a~~~~r~kaw~~~Dla~~l---sG~dpm~~Ia~~va~yW~r~~q~~Lia~---L~Gvf~~~~~~~ 149 (349) T protein:vir:94 77 -DIATPRAIQTGEMMARVAYLNEGFGQADLTVEL---TSQNPLQSVASRLDNFWQRQAQRRLIAT---ALGLYNDNVSAT 149 (349) T ss_pred -cccccccccccceeeeeeeeccccchhHHHHHh---hCchHHHHHHHHHHHHHhhHHHHHHHHH---HHhhhccccccc Confidence 0000011 1223333333332232 2334433 3447788999999998888888877741 011111100000 Q ss_pred --cccccc-c-eeeccccchHHHHHHHhhhhhhhhh-hcccCccEEEecHHHHHHHHhhh------cCCCceeeeccccc Q lcl|NC_011054. 144 --AVAANQ-D-YTIVPGDANEDDLIGCINRASKAVA-AAGYMPDTLLASLGFRFDVANLR------DANGNPIFRDESFN 212 (302) Q Consensus 144 --~~~~~~-~-~~~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~~~v~~~~~~~~l~~l~------d~~g~~i~~~~~~~ 212 (302) ...... . ........+...+.+....+..... .....-+.++||+..+..|++++ +.+|..-+ ++.+ T Consensus 150 ~~~~~~~~~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~~~~i--~ty~ 227 (349) T protein:vir:94 150 DAYHEQNDMVVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAENNTMF--ATYQ 227 (349) T ss_pred ccccccCceeEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhhhccCcccCccc--ceec Confidence 000000 0 0111222344444444444433221 12334467999999999998753 44443211 4667 Q ss_pred CcceEeecccccCCCc-----ceEEEEecceEEEEeec-CcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccE Q lcl|NC_011054. 213 GFGTYFNANGAWPVGV-----AEALVVDSSRVRIGVRQ-DITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATA 286 (302) Q Consensus 213 g~p~~~~~~~~~~~~~-----~~~~~gd~~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~ 286 (302) |..+.+.+.++..... ...+||. ..+.++... .+.++..++.... -..++..+..+.|+- .+|..+ T Consensus 228 G~~VivDD~~Pv~~~g~~~~yttylfg~-GAi~~~~~~~~~~~E~~rd~~~g----~~~G~d~L~~R~~~~---~hp~G~ 299 (349) T protein:vir:94 228 GYRVIVDDSMTVVGQDTSRKFISIIFGQ-GAIGYGEGNPEMPLEYEREASRA----NGGGVETLWTRKTWL---LHPFGY 299 (349) T ss_pred CcEEEEeCCCccccCCCCceEEEEEeec-ceEEeecCCCCcceeeecccccC----CcceeEEEEEeeEEE---eeeeee Confidence 7887776655432211 1123442 233344432 2334544443211 012344555555554 667676 Q ss_pred EEEeeeccc------ccCCCCC Q lcl|NC_011054. 287 VGDNKTPVG------AVVPDGS 302 (302) Q Consensus 287 ~~lt~~~a~------~~~p~~~ 302 (302) ........+ ...|.=+ T Consensus 300 s~~~a~v~~~~~~~~~~sPt~a 321 (349) T protein:vir:94 300 SFTSAVITGNGTETIARSASWQ 321 (349) T ss_pred eecccccCCCccccccCCCChH Confidence 544422221 1122211 No 191 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=95.02 E-value=0.00034 Score=39.61 Aligned_cols=265 Identities=15% Similarity=0.105 Sum_probs=115.5 Q ss_pred CCCccC--CCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISR--SEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~--~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) ++...- ++....+|+.+...|-..+....|+++.+.+...+.-..+... .....|.-+-.|.+-.+ ..|. T Consensus 35 L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~-~s~AeAq~HkdGqTK~e-------qa~~ 106 (318) T protein:vir:86 35 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF-DSSAEAQVHKDGQTKTE-------QAAT 106 (318) T ss_pred hhhcCceeeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhhhhhh-hhhhhhhhhccCCcccc-------ceee Confidence 443333 5566778999999999999999999997766555433222222 22355555555554333 2333 Q ss_pred eEEeeeeeEEEeehhHH-HHHhc---chHHHHHHHHHHHHHHHH-HHHHHHhhcccCCCc--ccccccccccccccccce Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHE-NVVDD---ASTSLLEEIAALGGQAIG-KKLDQAVIFGTDKPS--SWVSPALLPAAVAANQDY 151 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~-ell~d---s~~~~~~~i~~~l~~ai~-~~~d~~~l~G~g~~~--g~~~~~~~~~~~~~~~~~ 151 (302) -+......++.+...|- |+.++ +...+-.||..+|++++. +..|.+++-|+|+.. .+-............+.. T Consensus 107 ~~~~Tl~~~~VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttka 186 (318) T protein:vir:86 107 LTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGSNGFKSIDKEADVKKIKKITTKA 186 (318) T ss_pred eeeechhHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhheeecCCCCccchhhHHHHHHHHHHhhhh Confidence 33333333444444443 33333 445678999999999999 899999999999632 111111111111111111 Q ss_pred eeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCC--c-eeeecccc----cCcceEeeccccc Q lcl|NC_011054. 152 TIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANG--N-PIFRDESF----NGFGTYFNANGAW 224 (302) Q Consensus 152 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g--~-~i~~~~~~----~g~p~~~~~~~~~ 224 (302) .. .++.. +...+..+..-++....+.-.++-.......|..|+-+.. . .|-++++- .|..-.++-.... T Consensus 187 ks-agttp---fanaieeavdfvrptagrrylivkaedrkalldelrqatanahvriknddteiasevgvdeiivytgsk 262 (318) T protein:vir:86 187 KS-AGTTP---FANAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANAHVRIKNDDTEIASEVGVDEIIVYTGSK 262 (318) T ss_pred hc-cCCCc---hhhHHHHHHhhhccCCCceEEEEeecchHHHHHHHHhhcccceeEEeccchhhhhhcCcceeeeeeccc Confidence 11 11111 1223333333333332222233433444445555653322 1 23333321 1111111111100 Q ss_pred CCCcceEEEEecceEEEEeecCc-EEEEeecccccchhhhcCCcEEEEEEEEeccEE--eccccEEEEe Q lcl|NC_011054. 225 PVGVAEALVVDSSRVRIGVRQDI-TVKFLDQATVGSINLAERDMIALRLKARFAYVL--GNGATAVGDN 290 (302) Q Consensus 225 ~~~~~~~~~gd~~~~~~~~~~~~-~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v--~~~~a~~~lt 290 (302) . -+. -++.|-+ |.+... ++ .++.+ -|..|..-+.++..-.+.| .+..+++.+. T Consensus 263 a-lkp-tvlvdqk-yhidmq-dltkvdaf---------ewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:86 263 A-LKP-TVLVDQK-YHIDMQ-DLTKVDAF---------EWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred c-ccc-eeeeccc-eecchh-hhhhhhcc---------eeccCCceEEEeecccCcceeecCceeEEeC Confidence 0 011 1233321 222111 11 01111 1333332333333333333 3344443333 No 192 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=94.64 E-value=0.0039 Score=33.81 Aligned_cols=259 Identities=8% Similarity=-0.052 Sum_probs=108.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceee-----c--CCCceEEEEEeCCcc---eeeeccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVN-----M--GTKTTHLPVLATLPG---ASWVSESATEPEGVK 70 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~-----~--~~~~~~~p~~~~~~~---a~~v~E~~~~~~~~~ 70 (302) ||+.-+ .++|+.|++++++.++++.++.+++.+-- . .+.+++||+-..... +.+-..+... + T Consensus 1 MANsl~----~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~~~-~--- 72 (423) T protein:vir:10 1 MANNLD----ANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGKSK-N--- 72 (423) T ss_pred Cccccc----cccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCcccCcccc-c--- Confidence 995443 48999999999999999999988875421 1 245677765331111 1111111100 0 Q ss_pred cccccceeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccc Q lcl|NC_011054. 71 PTSEATWADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQD 150 (302) Q Consensus 71 ~~s~~~f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~ 150 (302) ..++.+ -.+++..+|...+--=..|+. ....++++++... .++++..+|..+...-... .....+.. T Consensus 73 ~l~e~~-v~l~id~~k~~a~~v~d~E~~-l~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~----------~~~~vgt~ 139 (423) T protein:vir:10 73 SLISAK-ATGEVGNYITVAVEYRQIEEA-LKLNQLDQILVPI-NERMVTDLETELALFMMKH----------GALSLGSP 139 (423) T ss_pred ccccce-EEEEecceeeeeeeeChHHHh-cChhHHHHHHHHH-HHHHHHHHHHHHHHHhhhc----------cccccccc Confidence 001111 245555556555443344544 5677887766544 6899999999886321100 00000110 Q ss_pred eeeccccchHHHHHHHhhhhhhhhhhcc--cCccEEEecHHHHHHHHh-h---hcCCC---ceeeec---ccccCcceEe Q lcl|NC_011054. 151 YTIVPGDANEDDLIGCINRASKAVAAAG--YMPDTLLASLGFRFDVAN-L---RDANG---NPIFRD---ESFNGFGTYF 218 (302) Q Consensus 151 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~--~~~~~~v~~~~~~~~l~~-l---~d~~g---~~i~~~---~~~~g~p~~~ 218 (302) .+ ....++ ++.++...+.... ....+.+++|..+..|.+ + ...++ ..+.+. ..+.|+.++. T Consensus 140 ~t---~~~a~~----~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~~~i~G~~~GFdi~~ 212 (423) T protein:vir:10 140 NT---PIKKWS----DVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLVRTAWENAQISGNFGGIRALM 212 (423) T ss_pred cc---ccccHH----HHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccchHHHHhcccceeecceEEEE Confidence 00 111233 3344444443322 234578899999988853 2 22111 111111 1344555544 Q ss_pred ecccccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccC Q lcl|NC_011054. 219 NANGAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVV 298 (302) Q Consensus 219 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~ 298 (302) ..+....+.......+..+... .+.. +.... .+..+-..+-+..-..+.+...+. ++...+-++. T Consensus 213 Sn~vp~~T~g~~~ga~~~~~~~-------~vt~---a~~~~--~~~~~~~~~~~T~s~~g~l~~GD~---~t~aGv~~v~ 277 (423) T protein:vir:10 213 SNGLASRTQGAFGGKLTVKGTP-------EVNY---DSVKD--SYAFTATLTGATASKKGFLKVGDQ---LQFDDTHWLN 277 (423) T ss_pred ecCCcccccccccceeeeeeee-------EEEe---ccccc--ccccccceeeccceeceeEEecce---Eeecceeeec Confidence 4433211111111011111110 0000 00000 000000111111111222333332 2222223344 Q ss_pred CCCC Q lcl|NC_011054. 299 PDGS 302 (302) Q Consensus 299 p~~~ 302 (302) |.-. T Consensus 278 ~~tk 281 (423) T protein:vir:10 278 QQSK 281 (423) T ss_pred cccc Confidence 4444 No 193 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=94.63 E-value=0.00037 Score=39.43 Aligned_cols=265 Identities=15% Similarity=0.080 Sum_probs=116.9 Q ss_pred CCCccC--CCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISR--SEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~--~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) ++...- ++....+|..+...|-..+....|+++.+.+...+.-..+.... ....|.-+-.|.+-.+ ..|. T Consensus 110 L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~~-s~~eAq~HkdGqTK~e-------qa~~ 181 (393) T protein:vir:16 110 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFD-SANEAQVHKDGQTKTE-------QAAT 181 (393) T ss_pred HhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhhhh-hhhhhhhhccCCcccc-------ceee Confidence 443333 55667789999999999999999999877665554322222111 2234555555554333 2333 Q ss_pred eEEeeeeeEEEeehhHH-HHHhc---chHHHHHHHHHHHHHHHH-HHHHHHhhcccCCCc--ccccccccccccccccce Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHE-NVVDD---ASTSLLEEIAALGGQAIG-KKLDQAVIFGTDKPS--SWVSPALLPAAVAANQDY 151 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~-ell~d---s~~~~~~~i~~~l~~ai~-~~~d~~~l~G~g~~~--g~~~~~~~~~~~~~~~~~ 151 (302) -+......++.+...|- |+.++ +...+..||..+|+.++. +..|.+++-|+|+.. .+-............+.. T Consensus 182 ~~~~Tl~~~~VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttka 261 (393) T protein:vir:16 182 LTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKA 261 (393) T ss_pred eeeechhHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhh Confidence 33333333444444443 33333 445678999999999999 999999999999632 111111111111111111 Q ss_pred eeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCC--C-ceeeecccc----cCcceEeeccccc Q lcl|NC_011054. 152 TIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDAN--G-NPIFRDESF----NGFGTYFNANGAW 224 (302) Q Consensus 152 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~--g-~~i~~~~~~----~g~p~~~~~~~~~ 224 (302) ..+.. ..+.+.+..+..-++....+.-.++-.......|..|+-+. . -.|-++++- .|..-.++-.+.. T Consensus 262 ksagk----tpfadaieeavdfvrptagrrylivktedrkalldelrqatananvriknddteiasevgvdeiivytgsk 337 (393) T protein:vir:16 262 KSAGK----TPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSK 337 (393) T ss_pred hhcCC----CchhHHHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhhccCceeeeccchhhhhhcCcceeeeeeccc Confidence 11111 22234444444444443333333444444445555565332 2 223333321 1111111111100 Q ss_pred CCCcceEEEEecceEEEEeecCcE-EEEeecccccchhhhcCCcEEEEEEEEeccEE--eccccEEEEe Q lcl|NC_011054. 225 PVGVAEALVVDSSRVRIGVRQDIT-VKFLDQATVGSINLAERDMIALRLKARFAYVL--GNGATAVGDN 290 (302) Q Consensus 225 ~~~~~~~~~gd~~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v--~~~~a~~~lt 290 (302) . -+. -++.|-+ |.+... +++ ++. .-|..|..-+.++..-.+.| .+..+++.+. T Consensus 338 a-lkp-tvlvdqk-yhidmq-dltkvda---------fewktnsnmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 338 A-LKP-TVLVDQK-YHIDMQ-DLTKVDA---------FEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred c-ccc-eeeeccc-cccchh-hhhhhhh---------heeccCCceEEEeecccCcceeeccceeEeeC Confidence 0 011 1223321 211111 110 111 11333332333333333333 3344443333 No 194 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=94.21 E-value=0.00044 Score=39.01 Aligned_cols=265 Identities=15% Similarity=0.085 Sum_probs=116.4 Q ss_pred CCCccC--CCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISR--SEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~--~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) ++...- ++....+|..+...|-..+....|+++.+.+...+.-..+... .....|.-+-.|.+-.+ ..|. T Consensus 117 L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~-~s~~~Aq~HkdGqTK~e-------qa~~ 188 (400) T protein:vir:93 117 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF-DSANEAQVHKDGQTKTE-------QAAT 188 (400) T ss_pred HhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhhh-hhhhhhhhhccCCcccc-------ceee Confidence 443333 5566778999999999999999999987766555432222211 12234555555554333 2333 Q ss_pred eEEeeeeeEEEeehhHH-HHHh---cchHHHHHHHHHHHHHHHH-HHHHHHhhcccCCCc--ccccccccccccccccce Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHE-NVVD---DASTSLLEEIAALGGQAIG-KKLDQAVIFGTDKPS--SWVSPALLPAAVAANQDY 151 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~-ell~---ds~~~~~~~i~~~l~~ai~-~~~d~~~l~G~g~~~--g~~~~~~~~~~~~~~~~~ 151 (302) -+......++.+...|- |+.+ .+...+..||..+|+.++. +..|.+++-|+|+.. .+-............+.. T Consensus 189 ~~~~Tl~~~~VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~~~Ttka 268 (400) T protein:vir:93 189 LTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKA 268 (400) T ss_pred eeeechhHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhh Confidence 33333333444444443 3333 3445678999999999999 899999999999632 111111111111111111 Q ss_pred eeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCC--c-eeeeccc----ccCcceEeeccccc Q lcl|NC_011054. 152 TIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANG--N-PIFRDES----FNGFGTYFNANGAW 224 (302) Q Consensus 152 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g--~-~i~~~~~----~~g~p~~~~~~~~~ 224 (302) ..+.. ..+.+.+..+..-++....+.-.++-.......|..|+-+.. . .|-+++. -.|..-.++-.+.. T Consensus 269 ksagk----tpfadaieeavdfvrptagrrylivktedrkalldelrqatanahvriknddaeiasevgvdeiivytgsk 344 (400) T protein:vir:93 269 KSAGK----TPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANAHVRIKNDDAEIASEVGVDEIIVYTGSK 344 (400) T ss_pred hhcCC----CchhHHHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhccccceEeecchhhhhhhcCcceeeeeeccc Confidence 11111 222344444444444433333334444444555566653332 2 2222221 11111111111100 Q ss_pred CCCcceEEEEecceEEEEeecCcE-EEEeecccccchhhhcCCcEEEEEEEEeccEE--eccccEEEEe Q lcl|NC_011054. 225 PVGVAEALVVDSSRVRIGVRQDIT-VKFLDQATVGSINLAERDMIALRLKARFAYVL--GNGATAVGDN 290 (302) Q Consensus 225 ~~~~~~~~~gd~~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v--~~~~a~~~lt 290 (302) . -+. -++.|-+ |.+... +++ ++. .-|..|..-+.++..-.+.| .+..+++.+. T Consensus 345 a-lkp-tvlvdqk-yhidmq-dltkvda---------fewktnsnmilvetltsghvetynagavitvs 400 (400) T protein:vir:93 345 A-LKP-TVLVDQK-YHIDMQ-DLTKVDA---------FEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred c-ccc-eeeeccc-cccchh-hhhhhhh---------heeccCCceEEEeecccCcceeeccceeEeeC Confidence 0 011 1223311 211111 110 111 11333332333333333333 3344443333 No 195 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=92.42 E-value=0.011 Score=31.22 Aligned_cols=274 Identities=15% Similarity=0.090 Sum_probs=128.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcc-eeecC-CCceEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFP-TVNMG-TKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~-~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) |- .|+..-...+.+.+++.|..-+.+...=-.+.+ +.-.+ +..+.||.. +.+...-..|..+..- .....+ T Consensus 1 ~~-~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~ti-Gs~~~~~~~E~~~~~~-----~~i~TG 73 (313) T protein:vir:95 1 MQ-LTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTI-GSVTLQEAEEDTPLIY-----NPIETG 73 (313) T ss_pred Cc-ccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEeccc-CceeeeccccCCCeee-----cccccc Confidence 54 444444444555556655555554422222333 22333 446777643 2333222222222221 224557 Q ss_pred eEEeeeeeEEEe-ehhHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhc-ccCCCccccccccccccccccccee-e Q lcl|NC_011054. 79 DRTLVAEEVAVI-IPVHENVVDDAS--TSLLEEIAALGGQAIGKKLDQAVIF-GTDKPSSWVSPALLPAAVAANQDYT-I 153 (302) Q Consensus 79 ~i~l~~~ki~~~-~~iS~ell~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~-G~g~~~g~~~~~~~~~~~~~~~~~~-~ 153 (302) +|++....+++- ..||++|.+|+- ..+...+..+-+++|....+.-+|. |...=.+...+.. .+..+. . T Consensus 74 EIt~~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~------vNG~PH~~ 147 (313) T protein:vir:95 74 EITFQITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHN------VNGFPHVI 147 (313) T ss_pred eEEEEEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcc------cccccceE Confidence 788888888885 489999999973 3555666667778888888887773 3211011111111 111111 1 Q ss_pred ccccchHHHHHHHhhhhhhhhhhc--ccCccEEEecHHHHHHHHhhhc------CCCceeeeccc---------ccCcce Q lcl|NC_011054. 154 VPGDANEDDLIGCINRASKAVAAA--GYMPDTLLASLGFRFDVANLRD------ANGNPIFRDES---------FNGFGT 216 (302) Q Consensus 154 ~~~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~v~~~~~~~~l~~l~d------~~g~~i~~~~~---------~~g~p~ 216 (302) ++...+...-..++..+....... ......++..|.....|..+.. .+||+|..... +.|+-+ T Consensus 148 V~~~T~~~~~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di 227 (313) T protein:vir:95 148 VSAETNGVFALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWDI 227 (313) T ss_pred EeccCCceehhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhhh Confidence 111111122223344444333322 2333468899999999988752 45788876543 233333 Q ss_pred Eeecc-------cccCCCcceEEEEecceE--------EEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEe Q lcl|NC_011054. 217 YFNAN-------GAWPVGVAEALVVDSSRV--------RIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLG 281 (302) Q Consensus 217 ~~~~~-------~~~~~~~~~~~~gd~~~~--------~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~ 281 (302) .+... ....++.+ .+|+.=.. +++-|+.|.-..+. . .++-..+.++.|+ |+|+++. T Consensus 228 ~~SN~L~~AN~~D~~tT~~G--~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~---~--~~~~~~~~~~~~~--R~G~Gi~ 298 (313) T protein:vir:95 228 LTSNRLHVANYNDGTTTGNG--YVGNLFMCILDDQTKPIMGAWRRMPKSEGE---R--NKDRARDEHVVRC--RYGFGIQ 298 (313) T ss_pred hhhhhhhhccccccccccCc--eeeeeeeeeecccccceeeeeccccccccc---c--ccccccccceeee--eecccce Confidence 22221 11112222 23332111 22333322211110 0 1123345555554 7888888 Q ss_pred ccccEEEEeeecccc Q lcl|NC_011054. 282 NGATAVGDNKTPVGA 296 (302) Q Consensus 282 ~~~a~~~lt~~~a~~ 296 (302) +.+....+..-..+- T Consensus 299 R~~~L~~~~~~A~~~ 313 (313) T protein:vir:95 299 RLDTLGLLATSATAY 313 (313) T ss_pred eecceeEEEeccccC Confidence 877765444221111 No 196 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=91.92 E-value=0.014 Score=30.81 Aligned_cols=288 Identities=10% Similarity=0.070 Sum_probs=139.9 Q ss_pred CCCccC-----CCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccccc Q lcl|NC_011054. 1 MADISR-----SEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPTSE 74 (302) Q Consensus 1 Ma~~t~-----~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~ 74 (302) +|.... ..-.+-|-+.+...+.+.+++.+-+++.++.++++... ..+-....++-++.+.-+... + -.+... T Consensus 16 ~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrtdT~~~~-~-R~~~~~ 93 (355) T protein:vir:98 16 VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSGDK-E-RQTADF 93 (355) T ss_pred HHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccCccccccccCCCCC-C-cccccc Confidence 333321 11345677888899999999999999999999887542 344444445555543221110 0 011111 Q ss_pred cceeeEEeeeeeEEEeehhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCC---------Cc------cccc Q lcl|NC_011054. 75 ATWADRTLVAEEVAVIIPVHENVVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---------PS------SWVS 137 (302) Q Consensus 75 ~~f~~i~l~~~ki~~~~~iS~ell~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---------~~------g~~~ 137 (302) ..++.-...-++.---+.|+-+.|+. ..++|...+++.+.++++.-+-.--++|+.- |. |.+. T Consensus 94 ~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~nPllqDVNkGWlQ 173 (355) T protein:vir:98 94 TALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQ 173 (355) T ss_pred cccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhhCcCccccchhHHH Confidence 23344444555555556677776663 2368999999999999988888888888641 11 1110 Q ss_pred cc-------ccc-ccccccc---ceeeccccchHHHHHHHhhhhhhhh-hhcccCc-c-EEEecHHHHH-HHHhhhcCCC Q lcl|NC_011054. 138 PA-------LLP-AAVAANQ---DYTIVPGDANEDDLIGCINRASKAV-AAAGYMP-D-TLLASLGFRF-DVANLRDANG 202 (302) Q Consensus 138 ~~-------~~~-~~~~~~~---~~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~~-~-~~v~~~~~~~-~l~~l~d~~g 202 (302) .. ... ....... .........++..+..++.++...+ .....+. . .+++.+.... .-..|-.... T Consensus 174 ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~ 253 (355) T protein:vir:98 174 KYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQ 253 (355) T ss_pred HHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHhhhHhhccC Confidence 00 000 0000000 0011122345666666666665433 3332221 2 3556655433 3333333333 Q ss_pred ce--------eeecccccCcceEeecccccCCCcceEEEEecceEEEEeecCcE-EEEeecccccchhhhcCCcEEEEEE Q lcl|NC_011054. 203 NP--------IFRDESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQDIT-VKFLDQATVGSINLAERDMIALRLK 273 (302) Q Consensus 203 ~~--------i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~~~~~~r~~ 273 (302) .| +.....+.|+|...+... ....+++--++..-+....+-. =.+.+......+..+++-. T Consensus 254 ~ptE~~Aa~~i~s~k~iGGlpa~~~Pff----P~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~N------ 323 (355) T protein:vir:98 254 ENSESLAADIIISQKRIGNLPAVRVPYF----PANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYESMN------ 323 (355) T ss_pred CcHHHHHHHHHHHhhhhCCceeEEcccc----CCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhhc------ Confidence 33 233456788998776553 3444667777765544433321 1122222222222122222 Q ss_pred EEeccEEeccccEEEEeeec----ccccCCCCC Q lcl|NC_011054. 274 ARFAYVLGNGATAVGDNKTP----VGAVVPDGS 302 (302) Q Consensus 274 ~r~d~~v~~~~a~~~lt~~~----a~~~~p~~~ 302 (302) -|+.|.+...++.+.... .++..|++- T Consensus 324 --e~YvVEd~~~~a~ienI~~~~~~~~~~~~~~ 354 (355) T protein:vir:98 324 --IDYVVEVYAAGCLLENITLGDFTAPAAPESG 354 (355) T ss_pred --ceeeeeccccEEEeeceeeeCCCCCcccccC Confidence 344444444444443222 112222222 No 197 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=91.41 E-value=0.016 Score=30.43 Aligned_cols=185 Identities=12% Similarity=-0.060 Sum_probs=83.1 Q ss_pred EEeehhHHHHHh-----cchHHHHHHHHHHHHHHHHHHHHHHhhc----ccCCCcccccccccccccccccceeeccccc Q lcl|NC_011054. 88 AVIIPVHENVVD-----DASTSLLEEIAALGGQAIGKKLDQAVIF----GTDKPSSWVSPALLPAAVAANQDYTIVPGDA 158 (302) Q Consensus 88 ~~~~~iS~ell~-----ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 158 (302) ---.-+|+-+++ ++..++.+...+++.+++++..|+.++. +.....+.. . ...........+.+. T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~--~----~~~g~~~~~~a~~t~ 74 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVT--G----QDGGFSVNIGAGNTN 74 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCccc--c----cccCcceeccccccC Confidence 111233444443 3567999999999999999999999874 211100000 0 001111111223334 Q ss_pred hHHHHHHHhhhhhhhhhhcccC--ccEEEecHHHHHHHHhhhcC---------CCceeeec---ccccCcceEeeccccc Q lcl|NC_011054. 159 NEDDLIGCINRASKAVAAAGYM--PDTLLASLGFRFDVANLRDA---------NGNPIFRD---ESFNGFGTYFNANGAW 224 (302) Q Consensus 159 ~~~~~~~~i~~~~~~~~~~~~~--~~~~v~~~~~~~~l~~l~d~---------~g~~i~~~---~~~~g~p~~~~~~~~~ 224 (302) +.+.+++.+.++...+...+.. ..+++++|..+..|.+..|. ++..+... ..+.|++++..++... T Consensus 75 ~~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~ 154 (221) T protein:vir:17 75 NAQAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLAS 154 (221) T ss_pred CHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCc Confidence 5677788888888887766544 34677899877777643221 11112221 1355677666655433 Q ss_pred CCCcc-eEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEE-eccEEeccccEEEEeeecccccCCCC- Q lcl|NC_011054. 225 PVGVA-EALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKAR-FAYVLGNGATAVGDNKTPVGAVVPDG- 301 (302) Q Consensus 225 ~~~~~-~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r-~d~~v~~~~a~~~lt~~~a~~~~p~~- 301 (302) ..+.. ....|+|. ...... ..+|+..- .-+.+.+|+|+..++-..-+.-.|-- T Consensus 155 ~~gt~~~~~ag~~~-~~~~~~-----------------------~~yr~~fs~~~glv~~~~Avgtvkl~~~~~~~~~~~ 210 (221) T protein:vir:17 155 LYGTNLVTDPGDAT-TSGENN-----------------------GSYRPAITDRAGLVFHKEAADTVEVLLPPSRPPLVI 210 (221) T ss_pred ccccccccCCcccc-cccccc-----------------------ccccccccceEEEEEcchheeeeeeecCCCCCceee Confidence 22211 11112111 000000 01111000 01334566665444322111111110 Q ss_pred C Q lcl|NC_011054. 302 S 302 (302) Q Consensus 302 ~ 302 (302) | T Consensus 211 ~ 211 (221) T protein:vir:17 211 S 211 (221) T ss_pred e Confidence 1 No 198 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=91.02 E-value=0.018 Score=30.16 Aligned_cols=281 Identities=10% Similarity=0.052 Sum_probs=142.8 Q ss_pred CC---CccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeecccccccccccccccc- Q lcl|NC_011054. 1 MA---DISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPTSEA- 75 (302) Q Consensus 1 Ma---~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~- 75 (302) +| ...+....+-|.+.+...+.+.+++.+-+++.++.+++..-. ..+-....++-++...-..... -.+ .++ T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdT~~~~~--R~~-~~~~ 92 (338) T protein:vir:11 16 LAKLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSGTIASRTDTTGDGV--RKP-RDVS 92 (338) T ss_pred HHHHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCccccccccCCCCCc--ccc-cccc Confidence 33 222344556788899999999999999999999999987543 3444444455554433111000 000 111 Q ss_pred ceeeEEeeeeeEEEeehhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCC---------Cc------cccc- Q lcl|NC_011054. 76 TWADRTLVAEEVAVIIPVHENVVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---------PS------SWVS- 137 (302) Q Consensus 76 ~f~~i~l~~~ki~~~~~iS~ell~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---------~~------g~~~- 137 (302) .++.-...-++.---+.|+-+.|+. ..++|...+++.+.++++.-+-.--++|+.- |. |.+. T Consensus 93 ~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~ 172 (338) T protein:vir:11 93 ALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRAANPLLQDVNIGWFQQ 172 (338) T ss_pred ccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHH Confidence 2333344445555556667676663 2468999999999999988888888888641 11 1110 Q ss_pred ------ccccccccccccceeeccccchHHHHHHHhhhhhhhh-hhcccC-cc-EEEecHHHHH-HHHhhhcCCCce--- Q lcl|NC_011054. 138 ------PALLPAAVAANQDYTIVPGDANEDDLIGCINRASKAV-AAAGYM-PD-TLLASLGFRF-DVANLRDANGNP--- 204 (302) Q Consensus 138 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~-~~-~~v~~~~~~~-~l~~l~d~~g~~--- 204 (302) .....................++..+..++.++...+ .....+ +. .+++.+.... +-..|-.....| T Consensus 173 ~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~~~ptE~ 252 (338) T protein:vir:11 173 YRNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGRELVHDKYFPMVNKDQPATEK 252 (338) T ss_pred HHhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHHhcCCChHHH Confidence 0000000000011111122234666666666666433 333322 22 3556665443 222333333333 Q ss_pred -----eeecccccCcceEeecccccCCCcceEEEEecceEEEEeecCcEE-EEeecccccchhhhcCCcEEEEEEEEecc Q lcl|NC_011054. 205 -----IFRDESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQDITV-KFLDQATVGSINLAERDMIALRLKARFAY 278 (302) Q Consensus 205 -----i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~r~~~r~d~ 278 (302) +.....+.|+|...+... ....+++--++..-+....+-.= .+.+.. ++|++.-.-..--|+ T Consensus 253 ~Aa~~~~s~k~iGGlpa~~~Pff----P~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p--------~r~rie~y~s~Ne~Y 320 (338) T protein:vir:11 253 IATDLILSQKRMGGLPPVEVPYV----PEKGLMVTTLKNLSLYWQIGGRRRYLKEVP--------EKNRIENYESSNDAY 320 (338) T ss_pred HHHHHHHHhhhhCCceeEEcccc----CCCceEEeeccccEEEEecCcEEEEEEecc--------ccccccchhhhccce Confidence 222346778888776553 34446677777655544433211 122222 122222222222455 Q ss_pred EEeccccEEEEeeecccc Q lcl|NC_011054. 279 VLGNGATAVGDNKTPVGA 296 (302) Q Consensus 279 ~v~~~~a~~~lt~~~a~~ 296 (302) .|.+...++.+.....+. T Consensus 321 vVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 321 VVEDYGLGCLVENIEVAE 338 (338) T ss_pred eeeccccEEEeecceecC Confidence 566666666555443333 No 199 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=89.03 E-value=0.029 Score=29.04 Aligned_cols=259 Identities=8% Similarity=-0.024 Sum_probs=108.6 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhccee-e--c----CCCceEEEEEeCCcceeeecc-ccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTV-N--M----GTKTTHLPVLATLPGASWVSE-SATEPEGVKPT 72 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~-~--~----~~~~~~~p~~~~~~~a~~v~E-~~~~~~~~~~~ 72 (302) ||+.=. ..||+.|+.+.++.++++.++.++++.- + . .+.+++|++........+-.. +..+..+. . T Consensus 1 MAN~ll----T~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~--~ 74 (423) T protein:vir:35 1 MANNLE----SNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNG--L 74 (423) T ss_pred Cccchh----hhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCccccc--c Confidence 995433 3589999999999999999998887542 1 1 145677776432211111111 11111000 0 Q ss_pred cccceeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccccccccee Q lcl|NC_011054. 73 SEATWADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYT 152 (302) Q Consensus 73 s~~~f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~ 152 (302) ++.+ -++++..+|... +.++++=..++..+|++++...+ ++++..+|..++.---. .. ....++.. T Consensus 75 ~e~~-v~l~id~~k~~a-~~v~d~e~~l~i~~~~~~l~~a~-~ala~~vd~~l~~~l~~-------~a---~~~vgt~~- 140 (423) T protein:vir:35 75 FSAK-ATGKVGKYITVA-VEWTQIEEALKLNQLDQILSPIH-ERMVTDLETELAHFMMN-------NG---ALSLGSPN- 140 (423) T ss_pred ccce-eeEEeccceecc-ceeCHHHHHhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHhh-------cc---cccccccc- Confidence 1111 124555555444 45555444445678887777664 77999999988741000 00 00001100 Q ss_pred eccccchHHHHHHHhhhhhhhhhhccc--CccEEEecHHHHHHHHh----hhcCCC---ceeeec---ccccCcceEeec Q lcl|NC_011054. 153 IVPGDANEDDLIGCINRASKAVAAAGY--MPDTLLASLGFRFDVAN----LRDANG---NPIFRD---ESFNGFGTYFNA 220 (302) Q Consensus 153 ~~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~----l~d~~g---~~i~~~---~~~~g~p~~~~~ 220 (302) +....+ +.+.++...+..... ...+.+++|..+..|.+ +...++ .-+.+. ..+.|+.++... T Consensus 141 --t~~~~~----~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Sn 214 (423) T protein:vir:35 141 --TAIKKW----ADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGNFGGIRALMSN 214 (423) T ss_pred --CCcchH----HHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccchhHHHhhccceeeecceEEEEcC Confidence 111123 334444444443322 24567899999888753 111111 111111 235566666555 Q ss_pred ccccCCCcceEEEEecceEEEEeecCcEEEE--eecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeeccccc- Q lcl|NC_011054. 221 NGAWPVGVAEALVVDSSRVRIGVRQDITVKF--LDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAV- 297 (302) Q Consensus 221 ~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~--~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~- 297 (302) +....+.. .+....... .+..+.. ..+... .++.+..+...-.| .+...+. ..+.+. -.+ T Consensus 215 nvp~~T~g------t~~~~~~v~-~a~~v~~~a~~~~~~------~~~~~~~~~~~~~g-~l~~GD~-~t~aGv--~~v~ 277 (423) T protein:vir:35 215 GLASRKQG------DFDGAITVK-TAPNVDYLSVKDSYQ------FTVALTGATPSKTG-FLKAGDQ-LKFTST--HWLN 277 (423) T ss_pred CCcccccc------ccccceeec-ccccccccccccccc------ceeeeeeeeeccCC-cEEecce-EEeeee--eecc Confidence 44322111 111111100 1111110 000000 01111111111111 1222222 122221 122 Q ss_pred --------------------CCCCC Q lcl|NC_011054. 298 --------------------VPDGS 302 (302) Q Consensus 298 --------------------~p~~~ 302 (302) +.+.+ T Consensus 278 ~~t~~~~~~~~t~~~~~~~V~~~~~ 302 (423) T protein:vir:35 278 QQSKQTLYNGSTAMSFTATVLEETN 302 (423) T ss_pred ccccceeecccCCceeEEEEecccc Confidence 22222 No 200 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=88.86 E-value=0.03 Score=28.95 Aligned_cols=289 Identities=10% Similarity=0.073 Sum_probs=133.1 Q ss_pred CCC------ccCCCcceecchHHHHHHHHHHHhhhh--hhhhcceeecCCCceEEE---EEeCCcceeeecccccccccc Q lcl|NC_011054. 1 MAD------ISRSEVATLIQEAYANDLLASAKKGST--VLQAFPTVNMGTKTTHLP---VLATLPGASWVSESATEPEGV 69 (302) Q Consensus 1 Ma~------~t~~~~g~liP~~~~~~ii~~~~~~s~--l~~~~~~~~~~~~~~~~p---~~~~~~~a~~v~E~~~~~~~~ 69 (302) |.. .+-.+++.|=-+.+.++|-.+...... +.+-+.+.+..+-...|- .......+.++.|+...+ T Consensus 26 ~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~--- 102 (463) T protein:vir:95 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP--- 102 (463) T ss_pred hhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccccc--- Confidence 222 223345555555555555443333322 344444455544322222 222336678888887643 Q ss_pred ccccccceeeEEeeeeeEEEeehhHHH-HHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC------Ccccccccccc Q lcl|NC_011054. 70 KPTSEATWADRTLVAEEVAVIIPVHEN-VVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK------PSSWVSPALLP 142 (302) Q Consensus 70 ~~~s~~~f~~i~l~~~ki~~~~~iS~e-ll~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~------~~g~~~~~~~~ 142 (302) .+++++.......|-++....+|.- -+.++..+.+....+.-...++..+|.++|+|+.. |.|..-.++.. T Consensus 103 --~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~ 180 (463) T protein:vir:95 103 --VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAK 180 (463) T ss_pred --cCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhh Confidence 4688999999999999988888873 34556778889999999999999999999999763 23333333332 Q ss_pred cccccccceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeeccc---ccCcceEee Q lcl|NC_011054. 143 AAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDES---FNGFGTYFN 219 (302) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~---~~g~p~~~~ 219 (302) .. ........-....+. +++..+...+..++..++-+.|+....+.|..---...|.+..++. ..|.++.-. T Consensus 181 lI-d~enviDarG~~Ls~----~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f 255 (463) T protein:vir:95 181 LI-DKNNVINAKGNQLTE----KHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGF 255 (463) T ss_pred hc-CCCCeeecCCCcccH----HHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEEcCCCCceeeeeeccce Confidence 22 222233332333332 3355565566677888888899999999887422221222222111 122221100 Q ss_pred --cc------cccCCCcceEEEEecceEEEEeecC--cEEEE--eecccccchhhhcCCcEEEEEEEEeccEEeccccEE Q lcl|NC_011054. 220 --AN------GAWPVGVAEALVVDSSRVRIGVRQD--ITVKF--LDQATVGSINLAERDMIALRLKARFAYVLGNGATAV 287 (302) Q Consensus 220 --~~------~~~~~~~~~~~~gd~~~~~~~~~~~--~~i~~--~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~ 287 (302) .. +.... +.+.+++--....-+.... ++..+ .+..+.. +-.......|++...-+..--.|+.++ T Consensus 256 ~s~~G~I~L~~s~~m-~~~~il~~~~~~~p~ap~~~~~tatv~~~~~~~~~--~~~~~a~~~Y~vv~~s~~geS~pS~iv 332 (463) T protein:vir:95 256 YSSRGFIKLHGSTVM-ENELILDESLQPLPNAPQPAKVTATVETKQKGAFE--NEEDRAGLSYKVVVNSDDAQSAPSEEV 332 (463) T ss_pred eeeeeeeeeCCceec-CCcccccchhhcCCCCccCceeEEEEeeccCCCCC--CcccccceEEEEEEECCCCCcccchhe Confidence 00 00000 0111111110000000000 11111 1111110 001112223444433333322333332 Q ss_pred EEeeec-----ccccCCCCC Q lcl|NC_011054. 288 GDNKTP-----VGAVVPDGS 302 (302) Q Consensus 288 ~lt~~~-----a~~~~p~~~ 302 (302) -.|.+. .=-++|.++ T Consensus 333 taT~a~~~~gv~l~It~~a~ 352 (463) T protein:vir:95 333 TATVSNVDDGVKLSINVNAM 352 (463) T ss_pred eeeeeeccceEEEEEEecCC Confidence 221110 001122222 No 201 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=88.86 E-value=0.03 Score=28.95 Aligned_cols=289 Identities=10% Similarity=0.073 Sum_probs=133.1 Q ss_pred CCC------ccCCCcceecchHHHHHHHHHHHhhhh--hhhhcceeecCCCceEEE---EEeCCcceeeecccccccccc Q lcl|NC_011054. 1 MAD------ISRSEVATLIQEAYANDLLASAKKGST--VLQAFPTVNMGTKTTHLP---VLATLPGASWVSESATEPEGV 69 (302) Q Consensus 1 Ma~------~t~~~~g~liP~~~~~~ii~~~~~~s~--l~~~~~~~~~~~~~~~~p---~~~~~~~a~~v~E~~~~~~~~ 69 (302) |.. .+-.+++.|=-+.+.++|-.+...... +.+-+.+.+..+-...|- .......+.++.|+...+ T Consensus 26 ~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~--- 102 (463) T protein:vir:99 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP--- 102 (463) T ss_pred hhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccccc--- Confidence 222 223345555555555555443333322 344444455544322222 222336678888887643 Q ss_pred ccccccceeeEEeeeeeEEEeehhHHH-HHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC------Ccccccccccc Q lcl|NC_011054. 70 KPTSEATWADRTLVAEEVAVIIPVHEN-VVDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK------PSSWVSPALLP 142 (302) Q Consensus 70 ~~~s~~~f~~i~l~~~ki~~~~~iS~e-ll~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~------~~g~~~~~~~~ 142 (302) .+++++.......|-++....+|.- -+.++..+.+....+.-...++..+|.++|+|+.. |.|..-.++.. T Consensus 103 --~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~ 180 (463) T protein:vir:99 103 --VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAK 180 (463) T ss_pred --cCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhh Confidence 4688999999999999988888873 34556778889999999999999999999999763 23333333332 Q ss_pred cccccccceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeeccc---ccCcceEee Q lcl|NC_011054. 143 AAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDES---FNGFGTYFN 219 (302) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~---~~g~p~~~~ 219 (302) .. ........-....+. +++..+...+..++..++-+.|+....+.|..---...|.+..++. ..|.++.-. T Consensus 181 lI-d~enviDarG~~Ls~----~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f 255 (463) T protein:vir:99 181 LI-DKNNVINAKGNQLTE----KHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGF 255 (463) T ss_pred hc-CCCCeeecCCCcccH----HHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEEcCCCCceeeeeeccce Confidence 22 222233332333332 3355565566677888888899999999887422221222222111 122221100 Q ss_pred --cc------cccCCCcceEEEEecceEEEEeecC--cEEEE--eecccccchhhhcCCcEEEEEEEEeccEEeccccEE Q lcl|NC_011054. 220 --AN------GAWPVGVAEALVVDSSRVRIGVRQD--ITVKF--LDQATVGSINLAERDMIALRLKARFAYVLGNGATAV 287 (302) Q Consensus 220 --~~------~~~~~~~~~~~~gd~~~~~~~~~~~--~~i~~--~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~ 287 (302) .. +.... +.+.+++--....-+.... ++..+ .+..+.. +-.......|++...-+..--.|+.++ T Consensus 256 ~s~~G~I~L~~s~~m-~~~~il~~~~~~~p~ap~~~~~tatv~~~~~~~~~--~~~~~a~~~Y~vv~~s~~geS~pS~iv 332 (463) T protein:vir:99 256 YSSRGFIKLHGSTVM-ENELILDESLQPLPNAPQPAKVTATVETKQKGAFE--NEEDRAGLSYKVVVNSDDAQSAPSEEV 332 (463) T ss_pred eeeeeeeeeCCceec-CCcccccchhhcCCCCccCceeEEEEeeccCCCCC--CcccccceEEEEEEECCCCCcccchhe Confidence 00 00000 0111111110000000000 11111 1111110 001112223444433333322333332 Q ss_pred EEeeec-----ccccCCCCC Q lcl|NC_011054. 288 GDNKTP-----VGAVVPDGS 302 (302) Q Consensus 288 ~lt~~~-----a~~~~p~~~ 302 (302) -.|.+. .=-++|.++ T Consensus 333 taT~a~~~~gv~l~It~~a~ 352 (463) T protein:vir:99 333 TATVSNVDDGVKLSINVNAM 352 (463) T ss_pred eeeeeeccceEEEEEEecCC Confidence 221110 001122222 No 202 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=87.95 E-value=0.035 Score=28.54 Aligned_cols=288 Identities=11% Similarity=0.075 Sum_probs=141.8 Q ss_pred CCCccC-----CCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccccc Q lcl|NC_011054. 1 MADISR-----SEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPTSE 74 (302) Q Consensus 1 Ma~~t~-----~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~ 74 (302) +|.... ..-.+-|-+.+...+.+.+++.+-++++++.++++... ..+-....++-++.+.-.... + -.+... T Consensus 16 ~A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrtdT~~~~-~-R~~~~~ 93 (355) T protein:vir:18 16 LAKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGVTGTIASTTDTSGDK-E-RQTADF 93 (355) T ss_pred HHHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeeccCcceeeccccCCCC-C-cccccc Confidence 332221 12345678888899999999999999999999887542 344444455555543321110 0 011112 Q ss_pred cceeeEEeeeeeEEEeehhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCC---------Cc------cccc Q lcl|NC_011054. 75 ATWADRTLVAEEVAVIIPVHENVVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---------PS------SWVS 137 (302) Q Consensus 75 ~~f~~i~l~~~ki~~~~~iS~ell~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---------~~------g~~~ 137 (302) ..++.-...-++.---+.|+-+.|+. ..++|...+++.+.++++.-+-.--++|+.- |. |.+. T Consensus 94 ~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~nPllqDVNkGWlQ 173 (355) T protein:vir:18 94 TALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTTRADTSDRVKNPMLQDVAVGWLQ 173 (355) T ss_pred cccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhhCcCccccchhHHH Confidence 23344445555555556677777763 2368999999999999988888888888641 11 1110 Q ss_pred cc-------cccc-ccccc---cceeeccccchHHHHHHHhhhhhhhh-hhcccCc-c-EEEecHHHHH-HHHhhhcCCC Q lcl|NC_011054. 138 PA-------LLPA-AVAAN---QDYTIVPGDANEDDLIGCINRASKAV-AAAGYMP-D-TLLASLGFRF-DVANLRDANG 202 (302) Q Consensus 138 ~~-------~~~~-~~~~~---~~~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~~-~-~~v~~~~~~~-~l~~l~d~~g 202 (302) .. .... ..... ..........++..+..++.++...+ .....+. . .+++.+.... +-..|-...+ T Consensus 174 ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~ 253 (355) T protein:vir:18 174 KYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDPKLVAIVGRKLLADKYFPLVNKQQ 253 (355) T ss_pred HHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHhhccC Confidence 00 0000 00000 00011122335666666666666433 3332222 2 3556655433 3333333333 Q ss_pred ce--------eeecccccCcceEeecccccCCCcceEEEEecceEEEEeecCcE-EEEeecccccchhhhcCCcEEEEEE Q lcl|NC_011054. 203 NP--------IFRDESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQDIT-VKFLDQATVGSINLAERDMIALRLK 273 (302) Q Consensus 203 ~~--------i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~~~~~~r~~ 273 (302) .| +.....+.|+|...+... ....+++--++..-+....+-. =.+.+......+..+++- T Consensus 254 ~ptE~~Aa~~i~s~k~iGGlpa~~~Pff----P~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~------- 322 (355) T protein:vir:18 254 ENTESLAADIIISQKRIGNLPAVRVPYF----PANAVFVTTLENLSIYFMDESHRRSIDENPKKDRVENYESM------- 322 (355) T ss_pred ChHHHHHHHHHHHHHhhCCceeEEcccc----CCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhh------- Confidence 33 222346788998776553 3444667777765544433321 112222222111112222 Q ss_pred EEeccEEeccccEEEEeee----cccccCCCCC Q lcl|NC_011054. 274 ARFAYVLGNGATAVGDNKT----PVGAVVPDGS 302 (302) Q Consensus 274 ~r~d~~v~~~~a~~~lt~~----~a~~~~p~~~ 302 (302) --|+.|.+...++.+... +.++..|+|- T Consensus 323 -Ne~YvVEd~~~~a~ieni~~~~~~~~~~~~~g 354 (355) T protein:vir:18 323 -NIDYVVEAYAAGCLLENITLGDFTAPAAPEGG 354 (355) T ss_pred -cceeeeeccccEEEEeeeeecCCCCcccccCC Confidence 234444555444444422 2223344444 No 203 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=86.17 E-value=0.047 Score=27.84 Aligned_cols=295 Identities=10% Similarity=0.062 Sum_probs=138.7 Q ss_pred CCCccC-----CCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccccc Q lcl|NC_011054. 1 MADISR-----SEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPTSE 74 (302) Q Consensus 1 Ma~~t~-----~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~ 74 (302) +|.... ..-.+-|-+.+...+.+.+++.+-++++++.+++.... ..+-....++-++.+.-+.. .+ -.+..- T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~-~~-R~~~~~ 93 (357) T protein:vir:56 16 VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGG-TE-RQPKDF 93 (357) T ss_pred HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccccCCCC-CC-cccccc Confidence 333321 12345688888899999999999999999999887532 33444344444443321110 00 001111 Q ss_pred cceeeEEeeeeeEEEeehhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCC---------C------ccccc Q lcl|NC_011054. 75 ATWADRTLVAEEVAVIIPVHENVVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---------P------SSWVS 137 (302) Q Consensus 75 ~~f~~i~l~~~ki~~~~~iS~ell~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---------~------~g~~~ 137 (302) ..++.-...-++.---+.|+-+.|+. ..++|...+++.+.++++.-+-.--++|+.- | .|.+. T Consensus 94 ~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ 173 (357) T protein:vir:56 94 SKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETSDRSSNPMLQDVAVGWLQ 173 (357) T ss_pred cccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhhCcCccccchhHHH Confidence 23333444444444455666666663 2368999999999998888877777888542 1 11110 Q ss_pred cc-------cccc-ccccccc---eeeccccchHHHHHHHhhhhhhhh-hhcccC-cc-EEEecHHHHH-HHHhhhcCCC Q lcl|NC_011054. 138 PA-------LLPA-AVAANQD---YTIVPGDANEDDLIGCINRASKAV-AAAGYM-PD-TLLASLGFRF-DVANLRDANG 202 (302) Q Consensus 138 ~~-------~~~~-~~~~~~~---~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~-~~-~~v~~~~~~~-~l~~l~d~~g 202 (302) .. .... ....+.. ........++..+..++.++...+ .....+ +. .++|.+.... .-..|-...+ T Consensus 174 ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~ 253 (357) T protein:vir:56 174 KYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQ 253 (357) T ss_pred HHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhccC Confidence 00 0000 0000000 111223345666666666666433 333332 22 3445555433 3333433333 Q ss_pred ce--------eeecccccCcceEeecccccCCCcceEEEEecceEEEEeecC-cEEEEeecccccchhhhcCCcEEEEEE Q lcl|NC_011054. 203 NP--------IFRDESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQD-ITVKFLDQATVGSINLAERDMIALRLK 273 (302) Q Consensus 203 ~~--------i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~r~~ 273 (302) .| +.....+.|+|...+... ....+++--++..-+-...+ ..=.+.+......+..+++-.-+|-++ T Consensus 254 ~pTE~~Aa~~i~s~k~iGGl~a~~~PfF----P~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~s~Ne~YvVE 329 (357) T protein:vir:56 254 DNSEMLAADVIISQKRIGNLPAVRVPYF----PADAMLITKLENLSIYYMDDSHRRVIEENPKLDRVENYESMNIDYVVE 329 (357) T ss_pred ChHHHHHHHHHHHhhhhCCceeEEcccc----CCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhhcceeeee Confidence 33 222346778888766553 33446666666654433332 222222222222222222222333333 Q ss_pred EEeccEEeccccEEEEeeecccccCCCC Q lcl|NC_011054. 274 ARFAYVLGNGATAVGDNKTPVGAVVPDG 301 (302) Q Consensus 274 ~r~d~~v~~~~a~~~lt~~~a~~~~p~~ 301 (302) .+--+...+.-.+......+.++..|++ T Consensus 330 d~~~~a~iE~i~i~~~~~~~~~~~~~~a 357 (357) T protein:vir:56 330 DYAAGCLVEKIKVGDFSTPAKATEEPGA 357 (357) T ss_pred ccccEEEeeeeeeccCCCCcccCCCCCC Confidence 3333333332222222222233333444 No 204 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=84.81 E-value=0.058 Score=27.38 Aligned_cols=266 Identities=11% Similarity=0.034 Sum_probs=124.3 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcc-----e-eecCCCceEEEEEeCCcceeeeccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFP-----T-VNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSE 74 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~-----~-~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~ 74 (302) ||... ..+.++..+.+.++..+....+.. . ...++++++||......-..+--.+.....+ ..+ T Consensus 1 MA~~n-------~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g---~~~ 70 (299) T protein:vir:79 1 MAALN-------YAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQR---NYD 70 (299) T ss_pred Cccch-------hHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCccccc---ccC Confidence 99443 247788888888888776554431 2 2244678999987654333221111001111 123 Q ss_pred cceeeEEeeeeeEEEeehhHHHHHh-cchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceee Q lcl|NC_011054. 75 ATWADRTLVAEEVAVIIPVHENVVD-DASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTI 153 (302) Q Consensus 75 ~~f~~i~l~~~ki~~~~~iS~ell~-ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~ 153 (302) .++...++...|.-.+..=..+.-+ .-...+...+.+...+.++-.+|...++.--+ .+...+. .. T Consensus 71 ~~~~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~-----------~a~~~g~--~~ 137 (299) T protein:vir:79 71 NAWEPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYA-----------DWTALGN--TA 137 (299) T ss_pred cceeEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHH-----------hhhhcCC--cc Confidence 3555666777665554321111111 01122344445555556666677765532100 0000000 11 Q ss_pred ccccchHHHHHHHhhhhhhhhhhccc--CccEEEecHHHHHHHHhhh------cCC-Cceeeec--ccccCcceEeeccc Q lcl|NC_011054. 154 VPGDANEDDLIGCINRASKAVAAAGY--MPDTLLASLGFRFDVANLR------DAN-GNPIFRD--ESFNGFGTYFNANG 222 (302) Q Consensus 154 ~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~v~~~~~~~~l~~l~------d~~-g~~i~~~--~~~~g~p~~~~~~~ 222 (302) .....+.+.+++.+.++...+..... .+.+++++|.++..|.+.. +.. ++..... ..++|.++..+... T Consensus 138 ~~~~~T~~n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~ 217 (299) T protein:vir:79 138 DTTVLTTTNVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSN 217 (299) T ss_pred cccccCHHHHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEechh Confidence 22234567778888888888876554 3467889999999997532 111 1112221 34677776643221 Q ss_pred -----------ccCC---CcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEecc-ccEE Q lcl|NC_011054. 223 -----------AWPV---GVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNG-ATAV 287 (302) Q Consensus 223 -----------~~~~---~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~-~a~~ 287 (302) .... .+-..+++..+.. +...+--.+++++-..-+..++ .+.-+.+.|.=|.+. ..-+ T Consensus 218 r~~t~~~~~~G~~~~~~ak~in~ii~~~~a~-~~~~K~~~~~~~~P~~~~~~~~------~~~~r~y~d~~v~~nk~~~i 290 (299) T protein:vir:79 218 LMKTAYDFTTGWKVGAGAKQIFMSLVHPSAI-ITPVSYQFSKLDEPTAVTEGKY------FYFEESFEDVFILNKKADAI 290 (299) T ss_pred hcCccceeccCccccCcccccceEEEcCCee-eeeEeeeeEEeecCCCCCccce------eeeeeeeeeeeeeccccCeE Confidence 0000 0112334433322 2222222233333222111111 233344556555542 3334 Q ss_pred EEeeecccc Q lcl|NC_011054. 288 GDNKTPVGA 296 (302) Q Consensus 288 ~lt~~~a~~ 296 (302) .+....|.+ T Consensus 291 ~~~~~~a~~ 299 (299) T protein:vir:79 291 QFVVEGAGA 299 (299) T ss_pred EEEeeecCC Confidence 566665555 No 205 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=84.19 E-value=0.062 Score=27.19 Aligned_cols=289 Identities=12% Similarity=0.074 Sum_probs=128.7 Q ss_pred CCCccCC--CcceecchHH-HHHHHHHHHhhhhhhhhcceeecCCC---ceEEEEEeCCcc-eeeecccccccccc---- Q lcl|NC_011054. 1 MADISRS--EVATLIQEAY-ANDLLASAKKGSTVLQAFPTVNMGTK---TTHLPVLATLPG-ASWVSESATEPEGV---- 69 (302) Q Consensus 1 Ma~~t~~--~~g~liP~~~-~~~ii~~~~~~s~l~~~~~~~~~~~~---~~~~p~~~~~~~-a~~v~E~~~~~~~~---- 69 (302) -+..++. +.+.-+...+ ..+.+..+++...+.+++...|++.+ .+++.+-..-+. -.-..||.+..... T Consensus 9 ~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a~G~~~~~g 88 (401) T protein:vir:95 9 DGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDASGATIVNG 88 (401) T ss_pred ccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCcccccccCc Confidence 1222222 1233344433 46667777777888999999998754 333332222121 11234444222110 Q ss_pred ------cccc-------------------ccceeeEEeeeeeEEEeehhHHHHHh-cchHHHHHHH-HHHHHHH---HHH Q lcl|NC_011054. 70 ------KPTS-------------------EATWADRTLVAEEVAVIIPVHENVVD-DASTSLLEEI-AALGGQA---IGK 119 (302) Q Consensus 70 ------~~~s-------------------~~~f~~i~l~~~ki~~~~~iS~ell~-ds~~~~~~~i-~~~l~~a---i~~ 119 (302) +..+ ..+-..+..+.++++.++.+|+++.+ ++...+.+-+ ++.|.-+ ... T Consensus 89 ~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~g~~~~t~d 168 (401) T protein:vir:95 89 NLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMNGATQITEA 168 (401) T ss_pred cccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhhhhhhhHHH Confidence 0000 11223455668899999999998776 3455666543 2333322 344 Q ss_pred HHHHHhhcccCCCcccccccccccccccccceeeccccchHHHHHHHhhhhhhh--------------hhhcccCcc-EE Q lcl|NC_011054. 120 KLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGDANEDDLIGCINRASKA--------------VAAAGYMPD-TL 184 (302) Q Consensus 120 ~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--------------~~~~~~~~~-~~ 184 (302) .+-+.+|++-+.- ...+.............+....+.+.+......|... .......++ +- T Consensus 169 ~i~~dll~ag~~v----iyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~va 244 (401) T protein:vir:95 169 VLQKDLLAAAGTV----LYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGATRVM 244 (401) T ss_pred HHHHHHHhhcCee----ecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCccccccceEE Confidence 4456666432210 0000001111111122233334455554443333320 000112223 35 Q ss_pred EecHHHHHHHHhhhcCCCceeeecc--------cccC-----cceEeeccc-----------ccCC------------Cc Q lcl|NC_011054. 185 LASLGFRFDVANLRDANGNPIFRDE--------SFNG-----FGTYFNANG-----------AWPV------------GV 228 (302) Q Consensus 185 v~~~~~~~~l~~l~d~~g~~i~~~~--------~~~g-----~p~~~~~~~-----------~~~~------------~~ 228 (302) +||+.....|+.++|-.|.|-|.+- ...| -.+.++..+ ...+ +. T Consensus 245 ~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~~gg~ 324 (401) T protein:vir:95 245 YVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMVSGQEH 324 (401) T ss_pred EEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCcccccccccccccccccCCCc Confidence 5899999999988887665544321 0000 012221111 0000 11 Q ss_pred ----ceEEEEecceEEEEeecCc---EEEEe-eccc----ccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccc Q lcl|NC_011054. 229 ----AEALVVDSSRVRIGVRQDI---TVKFL-DQAT----VGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGA 296 (302) Q Consensus 229 ----~~~~~gd~~~~~~~~~~~~---~i~~~-~~~~----~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~ 296 (302) ..+++|+-.+..++..++- .+++. +.-. .....|-|...+.|++ ++++.+++++-.+++... +.. T Consensus 325 ~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~--~~a~~vL~~e~m~~ies~-a~~ 401 (401) T protein:vir:95 325 YDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKW--YYGILVKRPERLALIKTV-APL 401 (401) T ss_pred ceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhh--hhhhheeccceeEEEEee-cCC Confidence 2245555444334333221 11221 1111 1223355566666655 568888998887776632 222 No 206 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=84.00 E-value=0.064 Score=27.13 Aligned_cols=269 Identities=8% Similarity=-0.059 Sum_probs=105.7 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceee-----c--CCCceEEEEEeCCccee-ee-cccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVN-----M--GTKTTHLPVLATLPGAS-WV-SESATEPEGVKP 71 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~-----~--~~~~~~~p~~~~~~~a~-~v-~E~~~~~~~~~~ 71 (302) ||+.=. .++|+.|+.+.++.++++.++.+++..-- . .+.+++|++-. ...+. +- ..+.....+.. T Consensus 1 MaN~ll----T~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~-~~~~~~~~~~~~~~~~~~~l- 74 (423) T protein:vir:17 1 MPNNLD----SNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPH-QFSSLRTPTGDISGQNKNNL- 74 (423) T ss_pred Cccchh----hhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCC-cceeecccCcccCCcccCcc- Confidence 996533 35899999999999999999988875421 1 24567777532 21111 10 01111111101 Q ss_pred ccccceeeEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcc-cCCCcccccccccccccccccc Q lcl|NC_011054. 72 TSEATWADRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFG-TDKPSSWVSPALLPAAVAANQD 150 (302) Q Consensus 72 ~s~~~f~~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G-~g~~~g~~~~~~~~~~~~~~~~ 150 (302) ++.+ -.+++..+|...+--=..|.. ....++++++... .++++..+|..++.- .+... ...+.. T Consensus 75 -~e~~-v~l~id~~k~va~~v~d~E~~-~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~a~-----------~~~gt~ 139 (423) T protein:vir:17 75 -ISGK-ATGRVGNYITVAVEYQQLEEA-IKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGA-----------LSLGSP 139 (423) T ss_pred -ccce-eEEEeeceeeeeeeecHHHHh-cChhHHHHHHHHH-HHHHHHHHHHHHHHHHhhccc-----------cccccC Confidence 1111 245666666655554444544 4566787766555 689999999988742 11100 000100 Q ss_pred eeeccccchHHHHHHHhhhhhhhhhhcc--cCccEEEecHHHHHHHHhh----hc-C-CCceeeec----ccccCcceEe Q lcl|NC_011054. 151 YTIVPGDANEDDLIGCINRASKAVAAAG--YMPDTLLASLGFRFDVANL----RD-A-NGNPIFRD----ESFNGFGTYF 218 (302) Q Consensus 151 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~--~~~~~~v~~~~~~~~l~~l----~d-~-~g~~i~~~----~~~~g~p~~~ 218 (302) . +....++ .+.++...+.... ....+.+++|..+..|.+- .. . -+.--+.. ..+.|+.++. T Consensus 140 ~---t~~~a~~----~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~ 212 (423) T protein:vir:17 140 N---TPITKWS----DVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALM 212 (423) T ss_pred C---cccccHH----HHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceEEEE Confidence 0 0111233 3444444443332 2345788999998877532 11 1 11111111 2355665555 Q ss_pred ecccccCCCcce--EE---EEec-ceEEEEe----ecCcEEEEe-ecccccchhhhcCCcEEEEE---EEEeccEE---- Q lcl|NC_011054. 219 NANGAWPVGVAE--AL---VVDS-SRVRIGV----RQDITVKFL-DQATVGSINLAERDMIALRL---KARFAYVL---- 280 (302) Q Consensus 219 ~~~~~~~~~~~~--~~---~gd~-~~~~~~~----~~~~~i~~~-~~~~~~~~~~~~~~~~~~r~---~~r~d~~v---- 280 (302) ..+....+.... .. .+.. .+..... ..++...+. +..... ..|.+.|-+ ..+....+ T Consensus 213 Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~-----~GD~~t~aGv~~v~~~tk~v~~~~ 287 (423) T protein:vir:17 213 SNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLK-----AGDQVKFTNTYWLQQQTKQALYNG 287 (423) T ss_pred eCCCccccccceeceeeecccccccccccccccceeeeeeeeeeeccCcee-----ecceEEecceeeeccccccccccc Confidence 443321111000 00 0000 0000000 000000000 000000 011111110 00000000 Q ss_pred --eccccEE-------------EEeeecccc------------cCCCCC Q lcl|NC_011054. 281 --GNGATAV-------------GDNKTPVGA------------VVPDGS 302 (302) Q Consensus 281 --~~~~a~~-------------~lt~~~a~~------------~~p~~~ 302 (302) .+...|. .++-.|+.. .+|+.+ T Consensus 288 ~t~~~~~~~v~~~~~~~a~~~~tv~i~p~~i~~~~~~~~~~v~a~~a~~ 336 (423) T protein:vir:17 288 ATPISFTATVTADANSDSSGDVTVTLSGVPIYDTTNPQYNSVSRQVAAG 336 (423) T ss_pred ccccceEEEEEecccccccCceEEEecCccccccCCcccccceecccCC Confidence 0001111 111111110 012222 No 207 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=82.16 E-value=0.063 Score=27.17 Aligned_cols=266 Identities=16% Similarity=0.120 Sum_probs=116.2 Q ss_pred CCCcc--CCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADIS--RSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t--~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) +|... -++.-.-+|..+...|-..+...+|+++.+.+.+++.--... -.....++.....|++..++. .++. T Consensus 35 laengvtitdttfqlprklvesintallntnpvfkvfhvtnvgallvsr-sfdssneaqvhkdgqtkteqa-----atlt 108 (318) T protein:vir:94 35 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSR-SFDSSNEAQVHKDGQTKTEQA-----ATLT 108 (318) T ss_pred hhhCCceeecchhhhHHHHHHhhhhhhccCCcceeeeeehhhhheeeec-cccccchhhhhcccccccccc-----eeee Confidence 33322 233345578777778888888889999888877765432221 123455666666776655532 2333 Q ss_pred eEEeeeeeEEEeehhHH--HHHhcchHHHHHHHHHHHHHHHHHHH-HHHhhcccCCCccccc---cccccccccccccee Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHE--NVVDDASTSLLEEIAALGGQAIGKKL-DQAVIFGTDKPSSWVS---PALLPAAVAANQDYT 152 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~--ell~ds~~~~~~~i~~~l~~ai~~~~-d~~~l~G~g~~~g~~~---~~~~~~~~~~~~~~~ 152 (302) -=++.|.-+.....+-. .-+++|...+...|..+|..++..++ |-+++-|+|+. |... .....-.....+... T Consensus 109 idtlepvmvyklqslaervkrlqmsyselynlivaeltqaivnkivdlalvegdgtn-gfksidkeadvkkikkittkak 187 (318) T protein:vir:94 109 IDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTN-GFKSIDKEADVKKIKKITTKAK 187 (318) T ss_pred ecccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHhhhhheeeeecCCcc-hhhhhchhhhHHHHHHhhhhhh Confidence 33344433333333332 23567777888999999999988776 56677788852 2111 111111111111111 Q ss_pred eccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCC--C-ceeeecccc----cCcceEeecccccC Q lcl|NC_011054. 153 IVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDAN--G-NPIFRDESF----NGFGTYFNANGAWP 225 (302) Q Consensus 153 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~--g-~~i~~~~~~----~g~p~~~~~~~~~~ 225 (302) .+.. ..+.+.+..+..-++....+.-.++-.......|..|+-+. . -.|-++++- .|..-.++-..... T Consensus 188 sagk----tpfadaieeavdfvrptagrrylivktedrkalldelrqatananvriknddteiasevgvdeiivytgska 263 (318) T protein:vir:94 188 SAGK----TPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKA 263 (318) T ss_pred hcCC----CchhHHHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhhcccceEEeccchhhhhhcCcceeEEeecccc Confidence 1111 22234444444444443333333444444445555565332 2 223333321 11111111111111 Q ss_pred CCcceEEEEecceEEEEeecCc-EEEEeecccccchhhhcCCcEEEEEEEEeccEE--eccccEEEEe Q lcl|NC_011054. 226 VGVAEALVVDSSRVRIGVRQDI-TVKFLDQATVGSINLAERDMIALRLKARFAYVL--GNGATAVGDN 290 (302) Q Consensus 226 ~~~~~~~~gd~~~~~~~~~~~~-~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v--~~~~a~~~lt 290 (302) -+. -++.|-+ |.+... ++ .++.+ -|..|..-+.++..-.+.+ .+..|++.+. T Consensus 264 -vkp-tvlvdqk-yhidmq-dltkvdaf---------ewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:94 264 -VKP-TVLVDQK-YHIDMQ-DLTKVDAF---------EWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred -ccc-eeEeccc-eecchh-hhhhhhce---------eeccCCceEEEEecccCcceeecCceeEEeC Confidence 111 2233322 222111 11 11111 1223322333333333333 3344443333 No 208 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=82.00 E-value=0.081 Score=26.58 Aligned_cols=278 Identities=8% Similarity=0.027 Sum_probs=125.6 Q ss_pred CCC------ccCCCcceecchHHHHHHHHHHHhhhh--hhhhcceeecCCCceEEE---EEeCCcceeeecccccccccc Q lcl|NC_011054. 1 MAD------ISRSEVATLIQEAYANDLLASAKKGST--VLQAFPTVNMGTKTTHLP---VLATLPGASWVSESATEPEGV 69 (302) Q Consensus 1 Ma~------~t~~~~g~liP~~~~~~ii~~~~~~s~--l~~~~~~~~~~~~~~~~p---~~~~~~~a~~v~E~~~~~~~~ 69 (302) |.. .+-.++|.|=-+.+.++|-.+...... +.+-+.+.+..+-...|- .......+.++.|+...+ T Consensus 26 ~~tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~~a~sTv~~y~~~~~~G~~g~~~f~~E~g~~~--- 102 (462) T protein:vir:96 26 YQTGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYREISRRPAQSTVQKYDVYLRHGNVGHSRFVREVGVAP--- 102 (462) T ss_pred HhcCCCcCCccccccchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccccc--- Confidence 222 223335555445555555444333322 344444445444322222 222336678888887643 Q ss_pred ccccccceeeEEeeeeeEEEeehhHHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--C----cccccccccc Q lcl|NC_011054. 70 KPTSEATWADRTLVAEEVAVIIPVHENVV-DDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--P----SSWVSPALLP 142 (302) Q Consensus 70 ~~~s~~~f~~i~l~~~ki~~~~~iS~ell-~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~----~g~~~~~~~~ 142 (302) .+++++.+.....|-++..-.+|...- ..+..+..+...+.-...++..+|.++|+|+.+ | .|..-.++.. T Consensus 103 --~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~dai~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~~ 180 (462) T protein:vir:96 103 --VSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTEDAIAVVAKTIEWASFYGDASLTADPTGQGLEFDGLAK 180 (462) T ss_pred --cCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccccccchhhhhh Confidence 478999999999998888777776433 345667788889999999999999999999864 2 2333333322 Q ss_pred cccccccceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeeccc---ccCcceEee Q lcl|NC_011054. 143 AAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDES---FNGFGTYFN 219 (302) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~---~~g~p~~~~ 219 (302) .. ...+....-....+ .+++..+...+..++..++-+.|+....+.|..---...|.+..++. ..|.++.-. T Consensus 181 lI-~~~NViDarG~~Ls----~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~l~~qrv~~~~n~g~~~~G~~v~~f 255 (462) T protein:vir:96 181 LI-DKDNVIDAKGESLT----ETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSVLGRQMQLMQDNSGNVNAGYNVQGF 255 (462) T ss_pred hc-CCCceeecCCCCcc----HHHHhhhhhhcccccCChhheecchHHHHHHHHhhcCceEEEEcCCCCceeeeeeccce Confidence 22 22333333333333 23444454555677788888889999988887433222333333222 223332110 Q ss_pred cccccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEec--cEEeccc--cEEEEeeecc- Q lcl|NC_011054. 220 ANGAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFA--YVLGNGA--TAVGDNKTPV- 294 (302) Q Consensus 220 ~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d--~~v~~~~--a~~~lt~~~a- 294 (302) - +..+.+ ++....+.+. ...+..+... ... --....+.+...-+ +...++. +-.....+.+ T Consensus 256 ~-----s~~G~I---~L~~s~~m~~-~~i~~~~~~~-~p~----ap~~~~vsaTv~t~~~g~f~~~~d~~~y~Y~V~avs 321 (462) T protein:vir:96 256 Y-----SSRGFI---KLHGSTVMEN-ELILDESLQP-LPN----APQPATVKATVETGKKGLFTDEHDRAELTYKVVVNS 321 (462) T ss_pred e-----eeeeee---eeCCceecCc-cccccccccc-CCC----CCCCCceeEEEEeCCCCCCCCccCceeEEEEEEEEC Confidence 0 000000 0000000000 0000000000 000 00001122221111 1122221 1111111100 Q ss_pred ----cccCCCCC Q lcl|NC_011054. 295 ----GAVVPDGS 302 (302) Q Consensus 295 ----~~~~p~~~ 302 (302) +...++-+ T Consensus 322 ~dgeS~PS~~Vt 333 (462) T protein:vir:96 322 DDAQSAPSEAVT 333 (462) T ss_pred CCCccccceeeE Confidence 00001001 No 209 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=80.19 E-value=0.097 Score=26.13 Aligned_cols=260 Identities=13% Similarity=0.050 Sum_probs=115.9 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhh--hhhhcceeecCCCceEEEE--E-eCCcceeeecccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGST--VLQAFPTVNMGTKTTHLPV--L-ATLPGASWVSESATEPEGVKPTSEA 75 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~--l~~~~~~~~~~~~~~~~p~--~-~~~~~a~~v~E~~~~~~~~~~~s~~ 75 (302) =+.+.+ |+.+=-+.+.+++......... +.+-..+.+..+-...|-. . -+........|++-. ..+++ T Consensus 18 ~~a~~~--g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~rhG~~g~s~~~E~~l~-----~~~d~ 90 (470) T protein:vir:10 18 NAAGQV--AESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYEHEYNVVTARHDKIGYAAFREGGLP-----RTVEV 90 (470) T ss_pred HHhhhc--chhhhhhhhccceeEeeecCccchhhhhcCCchhhhHhhhhhhhccccccccceeecccccC-----ccCCC Confidence 000000 1222111111111111111111 2222333333332222211 1 122333344666543 34689 Q ss_pred ceeeEEeeeeeEEEeehhHHHH---HhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--------Ccccccccccccc Q lcl|NC_011054. 76 TWADRTLVAEEVAVIIPVHENV---VDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK--------PSSWVSPALLPAA 144 (302) Q Consensus 76 ~f~~i~l~~~ki~~~~~iS~el---l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--------~~g~~~~~~~~~~ 144 (302) ++.+.....|-++....+|.-. ++....+++....++---.+++.+|.++|+||.. +.|..-.++...+ T Consensus 91 ~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~FyGDs~l~s~~~g~~~gleFDGl~~lI 170 (470) T protein:vir:10 91 NVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYGDNLLGDDVPGSPNNLQQDGIINII 170 (470) T ss_pred ceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhhhhhccccccccCcccCceeccchhhhc Confidence 9999999999999998999764 3344558888888888899999999999999652 2444444433322 Q ss_pred cc--cccceeeccccchHHHHHHHhhhhhhhhh--hcccCccEEEecHHHHHHHHhhhcCCCceeeeccc---ccCcceE Q lcl|NC_011054. 145 VA--ANQDYTIVPGDANEDDLIGCINRASKAVA--AAGYMPDTLLASLGFRFDVANLRDANGNPIFRDES---FNGFGTY 217 (302) Q Consensus 145 ~~--~~~~~~~~~~~~~~~~~~~~i~~~~~~~~--~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~---~~g~p~~ 217 (302) .. ..+....-...... +.|..+...+. .++..++-+.|+....+.|..--...-|.+..++. ..|.++ T Consensus 171 d~~~~~NViDarG~~Ls~----~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~~~~~qRv~~~~N~~~~~~G~~v- 245 (470) T protein:vir:10 171 KRGAPQNVLDAGGRPLSI----DLLWEAESRVVSTQAFANPTAVFISYVDKLNLQASFYQISRVMTTADRRAGLLGADA- 245 (470) T ss_pred cCCCCccccccCCCCccH----HHHHHHHhhhcccccccChhhhccchhHHHHHHHhhcCceEEEEecCCCceeeeeec- Confidence 21 12223222222222 34444544443 46777778889999999998766666666655432 223332 Q ss_pred eecccccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEE---eccccEEEEeeecc Q lcl|NC_011054. 218 FNANGAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVL---GNGATAVGDNKTPV 294 (302) Q Consensus 218 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v---~~~~a~~~lt~~~a 294 (302) .... +..+.+-+ +.+.+ +.+ ... .| ..+++-.+ .-|...+.+..+.- T Consensus 246 -~~f~---sa~G~I~L-~~s~~-m~~--------~~k----------~~------p~~l~~~v~~~aAP~~~~tv~~t~~ 295 (470) T protein:vir:10 246 -QSYI---GVRGEHSL-YPSQF-LGD--------FHK----------FN------PARFGAEVGDFAAPSNSWTVSTTDN 295 (470) T ss_pred -ccee---eeeeeeee-ccccc-ccc--------hhh----------cC------cccCCcccCCcccCceeEEeecCCC Confidence 1110 01111100 00000 000 000 00 00111111 12222222222211 Q ss_pred cccCCCCC Q lcl|NC_011054. 295 GAVVPDGS 302 (302) Q Consensus 295 ~~~~p~~~ 302 (302) ....|.+| T Consensus 296 ~~a~~~~s 303 (470) T protein:vir:10 296 FVTLPYNS 303 (470) T ss_pred ceeecccC Confidence 11222222 No 210 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=78.99 E-value=0.11 Score=25.86 Aligned_cols=288 Identities=11% Similarity=0.090 Sum_probs=137.9 Q ss_pred CCCccC-----CCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccccc Q lcl|NC_011054. 1 MADISR-----SEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPTSE 74 (302) Q Consensus 1 Ma~~t~-----~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~ 74 (302) +|.... ..-.+-|-+.+...+.+.+++.+-++++++.+++.... ..+-....++-++.+.-... .+ -.+..- T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~-~~-R~~~~~ 93 (357) T protein:vir:60 16 VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGG-TE-RQPKDF 93 (357) T ss_pred HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcccccccccCCC-CC-cccccc Confidence 333321 12345688888899999999999999999999887532 33444444444444321110 00 001111 Q ss_pred cceeeEEeeeeeEEEeehhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCC---------C------ccccc Q lcl|NC_011054. 75 ATWADRTLVAEEVAVIIPVHENVVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---------P------SSWVS 137 (302) Q Consensus 75 ~~f~~i~l~~~ki~~~~~iS~ell~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---------~------~g~~~ 137 (302) ..++.-...-++.---+.|+-+.|+. ..++|...+++.+.++++.-+-.--++|+.- | .|.+. T Consensus 94 ~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ 173 (357) T protein:vir:60 94 SKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETSDRSSNQMLQDVAVGWLQ 173 (357) T ss_pred cccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhhCcCccccchhHHH Confidence 23334444444544555666666663 2368999999999998888877777888542 1 11110 Q ss_pred cc-------cccc-ccccccc---eeeccccchHHHHHHHhhhhhhhh-hhcccC-cc-EEEecHHHHH-HHHhhhcCCC Q lcl|NC_011054. 138 PA-------LLPA-AVAANQD---YTIVPGDANEDDLIGCINRASKAV-AAAGYM-PD-TLLASLGFRF-DVANLRDANG 202 (302) Q Consensus 138 ~~-------~~~~-~~~~~~~---~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~-~~-~~v~~~~~~~-~l~~l~d~~g 202 (302) .. .... ....+.. ........++..+..++.++...+ .....+ +. .+++.+.... .-..|-...+ T Consensus 174 ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~ 253 (357) T protein:vir:60 174 KYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNREQ 253 (357) T ss_pred HHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhhcCC Confidence 00 0000 0000000 111223345666666666666443 333332 22 3445555433 3333433333 Q ss_pred ce--------eeecccccCcceEeecccccCCCcceEEEEecceEEEEeecC-cEEEEeecccccchhhhcCCcEEEEEE Q lcl|NC_011054. 203 NP--------IFRDESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQD-ITVKFLDQATVGSINLAERDMIALRLK 273 (302) Q Consensus 203 ~~--------i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~r~~ 273 (302) .| +.....+.|+|...+... ....+++--++..-+-...+ ..=.+.+......+..+++- T Consensus 254 ~pTE~~Aa~~i~s~k~iGGl~a~~~PfF----P~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~s~------- 322 (357) T protein:vir:60 254 DNSEMLAADVIISQKRIGNLPAVRVPYF----PADAMLITKLENLSIYYMDDSHRRVIEENPKLDRVENYESM------- 322 (357) T ss_pred ChHHHHHHHHHHHhhhhcCcceEEcccc----CCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhh------- Confidence 33 223446788988776553 33446666666654433333 22122222222111112222 Q ss_pred EEeccEEeccccEEEEeeeccc-ccCCCCC Q lcl|NC_011054. 274 ARFAYVLGNGATAVGDNKTPVG-AVVPDGS 302 (302) Q Consensus 274 ~r~d~~v~~~~a~~~lt~~~a~-~~~p~~~ 302 (302) --|+.|.+...++.+...... +..|++. T Consensus 323 -Ne~YvVEd~~~~a~iE~i~~~~~~~pa~~ 351 (357) T protein:vir:60 323 -NIDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred -cceeeeeccccEEEeeeeeeccCcccccC Confidence 234444555554444432211 1123222 No 211 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=77.09 E-value=0.13 Score=25.47 Aligned_cols=287 Identities=12% Similarity=0.104 Sum_probs=136.7 Q ss_pred CCCccC-----CCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccccc Q lcl|NC_011054. 1 MADISR-----SEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPTSE 74 (302) Q Consensus 1 Ma~~t~-----~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~ 74 (302) +|.... ..-.+-|-+.+...+.+.+++.+-++++++.+++.... ..+-....++-++.+.-... .+ -.+..- T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~-~~-R~~~~~ 93 (357) T protein:vir:20 16 VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGG-TE-RQPKDF 93 (357) T ss_pred HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccccCCCC-CC-cccccc Confidence 443321 12345688888899999999999999999999887532 33444444444444321110 00 001111 Q ss_pred cceeeEEeeeeeEEEeehhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCC---------C------ccccc Q lcl|NC_011054. 75 ATWADRTLVAEEVAVIIPVHENVVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---------P------SSWVS 137 (302) Q Consensus 75 ~~f~~i~l~~~ki~~~~~iS~ell~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---------~------~g~~~ 137 (302) ..++.-...-++.---+.|+-+.|+. ..++|...+++.+.++++.-+-.--++|+.- | .|.+. T Consensus 94 ~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ 173 (357) T protein:vir:20 94 SKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETSDRSSNPMLQDVAVGWLQ 173 (357) T ss_pred cccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhhCcCccccchhHHH Confidence 13333444444444455666666663 2368999999999998888877777888542 1 11110 Q ss_pred cc-------ccc-cccccccc---eeeccccchHHHHHHHhhhhhhhh-hhcccC-cc-EEEecHHHHH-HHHhhhcCCC Q lcl|NC_011054. 138 PA-------LLP-AAVAANQD---YTIVPGDANEDDLIGCINRASKAV-AAAGYM-PD-TLLASLGFRF-DVANLRDANG 202 (302) Q Consensus 138 ~~-------~~~-~~~~~~~~---~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~-~~-~~v~~~~~~~-~l~~l~d~~g 202 (302) .. ... .....+.. ........++..+..++.++...+ .....+ +. .++|.+.... .-..|-...+ T Consensus 174 ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~ 253 (357) T protein:vir:20 174 KYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQ 253 (357) T ss_pred HHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhccC Confidence 00 000 00000000 111222345666666666666433 333332 22 3445555433 3333433333 Q ss_pred ce--------eeecccccCcceEeecccccCCCcceEEEEecceEEEEeecC-cEEEEeecccccchhhhcCCcEEEEEE Q lcl|NC_011054. 203 NP--------IFRDESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQD-ITVKFLDQATVGSINLAERDMIALRLK 273 (302) Q Consensus 203 ~~--------i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~r~~ 273 (302) .| +.....+.|+|...+... ....+++--++..-+-...+ ..=.+.+......+..+++- T Consensus 254 ~ptE~~Aa~~i~s~k~iGGl~a~~~PfF----P~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~s~------- 322 (357) T protein:vir:20 254 DNSEMLAADVIISQKRIGNLPAVRVPYF----PADAMLITKLENLSIYYMDDSHRRVIEENPKLDRVENYESM------- 322 (357) T ss_pred ChHHHHHHHHHHHhhhhCCceeEEcccc----CCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhh------- Confidence 33 222346778888766553 33446666666654433333 21122222222111112222 Q ss_pred EEeccEEeccccEEEEeeeccc--------ccCCCC Q lcl|NC_011054. 274 ARFAYVLGNGATAVGDNKTPVG--------AVVPDG 301 (302) Q Consensus 274 ~r~d~~v~~~~a~~~lt~~~a~--------~~~p~~ 301 (302) --|+.|.+...++.+.....+ +..|++ T Consensus 323 -Ne~YvVEd~~~~a~iE~i~~~~~~~p~~~~~~~~a 357 (357) T protein:vir:20 323 -NIDYVVEDYAAGCLVEKIKVGDFSTPAKATAEPGA 357 (357) T ss_pred -cceeeeeccccEEEeeeeeeccccCCccCCCCCCC Confidence 234444555554444432211 111222 No 212 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=74.64 E-value=0.16 Score=25.01 Aligned_cols=284 Identities=10% Similarity=-0.042 Sum_probs=107.5 Q ss_pred CCCccCCC-cceecchH--HHHHH----HHHHHhhhhhhhhccee--------ecCCCceEEEEEeCCcceeeecccc-- Q lcl|NC_011054. 1 MADISRSE-VATLIQEA--YANDL----LASAKKGSTVLQAFPTV--------NMGTKTTHLPVLATLPGASWVSESA-- 63 (302) Q Consensus 1 Ma~~t~~~-~g~liP~~--~~~~i----i~~~~~~s~l~~~~~~~--------~~~~~~~~~p~~~~~~~a~~v~E~~-- 63 (302) |.....+. ++...+.. ..+.. -+.+..+ ......+.. +.......+....+...+ ..|.. T Consensus 188 ~~~q~itg~tga~fa~s~~~an~astAss~Al~gE-A~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~--~~~~~~~ 264 (523) T protein:vir:59 188 WQYDDASGDPENTVAYPLPRYNRIVGAVGSALYAR-LFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFE--DQSTDPD 264 (523) T ss_pred cccccccccccccccchhhcccccccccccccccc-ccccccccccccCCCcccccccccccccccccchh--hcccccc Confidence 11111100 00000000 00000 0000000 000000000 000000001100000000 01100 Q ss_pred ----ccccccccccccceeeEEeeeeeEEEeehhHHHHHhcc-----hHHHHHHHHHHHHHHHHHHHHHHhhcccC---- Q lcl|NC_011054. 64 ----TEPEGVKPTSEATWADRTLVAEEVAVIIPVHENVVDDA-----STSLLEEIAALGGQAIGKKLDQAVIFGTD---- 130 (302) Q Consensus 64 ----~~~~~~~~~s~~~f~~i~l~~~ki~~~~~iS~ell~ds-----~~~~~~~i~~~l~~ai~~~~d~~~l~G~g---- 130 (302) .......++...+++++++..+.-+-...+|-||.+|- ..|.++.|.+-|...|...|++.+|.=-- T Consensus 265 ~~~~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~ 344 (523) T protein:vir:59 265 YPDPGFQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHAR 344 (523) T ss_pred ccccccccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhhe Confidence 01111234455677777777777777788999999983 46789999999999999999999995211 Q ss_pred --CCcccccccccccccccccceeecccc---chHHHHHHHhhhh---hhhhh--hcccCccEEEecHHHHHHHHhh--- Q lcl|NC_011054. 131 --KPSSWVSPALLPAAVAANQDYTIVPGD---ANEDDLIGCINRA---SKAVA--AAGYMPDTLLASLGFRFDVANL--- 197 (302) Q Consensus 131 --~~~g~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~i~~~---~~~~~--~~~~~~~~~v~~~~~~~~l~~l--- 197 (302) +-.|....+.......... ....+. ...+....++..+ ...+. ..+.....+++++.....|... T Consensus 345 ~~~~~~~~~~g~~~~~~~~~~--~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~ 422 (523) T protein:vir:59 345 RTDNYGFWSEVVGEYYDETSG--NFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGF 422 (523) T ss_pred eeeeccccccceeeecccccc--hhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhcccc Confidence 1111111111110000000 000000 0012222222222 22121 2223566788999999998642 Q ss_pred hcCCC-cee-----eecccccCcceEeecccccCCCcceEEEEecceEEEEeecCcEEEEeecccccc---hhhhcCCcE Q lcl|NC_011054. 198 RDANG-NPI-----FRDESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGS---INLAERDMI 268 (302) Q Consensus 198 ~d~~g-~~i-----~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~---~~~~~~~~~ 268 (302) +..+. +.. +..-...++.+++..+ .....+++|-....---+ .++-+ ..+..... ...-.+-|= T Consensus 423 ~~~~~~~~~~~~~~~~g~l~~~~~vy~d~~----~~~dy~~~g~k~~~~~~~-~~~~y--~Py~~l~~~~~~~dp~s~qp 495 (523) T protein:vir:59 423 TPGNDNRDGGTGIFYVGMVQGRYRLYKNIY----QNQPVIIMGNQDLNTPWQ-TGAVY--APYVPLLFTPTIVDPVNFSY 495 (523) T ss_pred ccCCccccccccceeEEEecCceEEEecCC----CCcceEEEEecccCCccc-cccee--cccchhhcccccccCCcccc Confidence 21111 111 1111223345544433 333444444332110000 01111 00000000 000122233 Q ss_pred EEEEEEEeccEEeccccEEEEeeecccccCC Q lcl|NC_011054. 269 ALRLKARFAYVLGNGATAVGDNKTPVGAVVP 299 (302) Q Consensus 269 ~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p 299 (302) .+-...|++..|.+|-+...+-.+ ..+| T Consensus 496 ~~~~~tRY~l~v~nP~~~~~~~~~---~~~~ 523 (523) T protein:vir:59 496 RRGLMTRYALEVVRPEFYGLLYVK---LLQP 523 (523) T ss_pred eeeeeeehhheecchhHhhhhhhh---hcCC Confidence 455667999999999775433322 3344 No 213 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=74.63 E-value=0.16 Score=25.01 Aligned_cols=254 Identities=10% Similarity=0.102 Sum_probs=114.0 Q ss_pred CCC------ccCCCcceecchHHHHHHHHHHHhhhh--hhhhcceeecCCCceEEE---EEeCCcceeeecccccccccc Q lcl|NC_011054. 1 MAD------ISRSEVATLIQEAYANDLLASAKKGST--VLQAFPTVNMGTKTTHLP---VLATLPGASWVSESATEPEGV 69 (302) Q Consensus 1 Ma~------~t~~~~g~liP~~~~~~ii~~~~~~s~--l~~~~~~~~~~~~~~~~p---~~~~~~~a~~v~E~~~~~~~~ 69 (302) |.. .+-++|+.+=-+.+.+++......... +.+-..+.+..+-...|- .......+.++.|+.- T Consensus 45 ~t~gy~~~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi----- 119 (514) T protein:vir:10 45 FTAGHSITPDTQTDGAANRIESLNRDLKVTTWGERDFTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGI----- 119 (514) T ss_pred hccccccCCccccCccchhhhhhccceeEeeecCcchhhhhhcCCchhhHHHhhhhhhcccCccccccccccccc----- Confidence 111 111223333233333333222222222 333344444443222221 2223346777888864 Q ss_pred ccccccceeeEEeeeeeEEEeehhHHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC------Ccccccccccc Q lcl|NC_011054. 70 KPTSEATWADRTLVAEEVAVIIPVHENVV-DDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK------PSSWVSPALLP 142 (302) Q Consensus 70 ~~~s~~~f~~i~l~~~ki~~~~~iS~ell-~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~------~~g~~~~~~~~ 142 (302) ...+++.+....+..+-++....+|..+- .++..+......+.-...++..+|.++|+|+.. +.|+...++.. T Consensus 120 ~~~~d~~~~rk~~~~k~l~~~~~vS~~~~l~n~i~d~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~ 199 (514) T protein:vir:10 120 GDVNNPNERQRTINIKYIVDTHVTSIALQRANTIVDSLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFK 199 (514) T ss_pred CcCCCcceEEEEEeeeeeeeeeeeeehhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHH Confidence 23467888888888887777666655322 346778888888999999999999999999763 23343333333 Q ss_pred cccccccceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCCCceeeeccc---ccCcceEee Q lcl|NC_011054. 143 AAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDANGNPIFRDES---FNGFGTYFN 219 (302) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~g~~i~~~~~---~~g~p~~~~ 219 (302) ... ..+....-....+ .+++..+.......+..++-+.|+....+.|..-....-|.+...+. ..|.++ T Consensus 200 lI~-~~NvIDarG~~Ls----~~~ln~aA~~i~~gfGt~TD~ylp~~vka~f~~~~~~~qRV~~~~n~~~~~~G~~v--- 271 (514) T protein:vir:10 200 LIA-PENHIDLRGGRLS----PAALNMAARKIGEGFGTPTDAYMPIGIKADFVNQHLNGQRVMLPGQTGGMTTGLDI--- 271 (514) T ss_pred hhc-CCCeEecCCCCcc----HHHHhhhhhhhhcccCChhheeCchHHHHHHhhcccCcceEEeecCccceeeeeec--- Confidence 332 2223333333333 23344444444455777777888888888776544433333322111 111111 Q ss_pred cccccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEe-ccccEEEEeeecccccC Q lcl|NC_011054. 220 ANGAWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLG-NGATAVGDNKTPVGAVV 298 (302) Q Consensus 220 ~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~-~~~a~~~lt~~~a~~~~ 298 (302) +.+ +..++.+.+.-+ +...-+.+.++... .|.| .....-+..+| T Consensus 272 -----------------~~f-~s~~G~I~L~gs---------------~im~~~n~L~~~~~~~~~A--p~~~~va~svT 316 (514) T protein:vir:10 272 -----------------DKF-LSAHGSIRIQGS---------------TIMDSDNKLDFDRPVSPTA--PTAPQLSATVT 316 (514) T ss_pred -----------------cce-eEeccceeecCC---------------eeecccccCccCCccCCcC--CCCCcceEEEe Confidence 111 111111111100 01111111221111 0100 00000112223 Q ss_pred CCCC Q lcl|NC_011054. 299 PDGS 302 (302) Q Consensus 299 p~~~ 302 (302) |.++ T Consensus 317 ~~~~ 320 (514) T protein:vir:10 317 PDGG 320 (514) T ss_pred cCcc Confidence 3333 No 214 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=74.51 E-value=0.16 Score=24.99 Aligned_cols=281 Identities=11% Similarity=0.042 Sum_probs=114.2 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-----eEEEEEeCC--------------cceeeecc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-----THLPVLATL--------------PGASWVSE 61 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-----~~~p~~~~~--------------~~a~~v~E 61 (302) .+.++++..-.-.-+.+. .+.+++.......+++-+.||+++. ++..+.+.. +.+.|-+. T Consensus 87 ia~s~~s~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~s~~EAf~ne~~adt~fSG~ 165 (534) T protein:vir:10 87 IASGETSGSITNVGPAVM-GLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGGNSQDANAREAFHPTYGPDADFSGR 165 (534) T ss_pred ccccccccccccccchhh-hHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCCCcccccccccccccccccccc Confidence 343333332221222221 1333334445566777777877653 121111100 11111110 Q ss_pred cc------------------------------------------------------------------------------ Q lcl|NC_011054. 62 SA------------------------------------------------------------------------------ 63 (302) Q Consensus 62 ~~------------------------------------------------------------------------------ 63 (302) +. T Consensus 166 ~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~gm~Ta~ 245 (534) T protein:vir:10 166 GAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVETSSAMATAF 245 (534) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccCCccccccccccccccccceecccccchhh Confidence 00 Q ss_pred -cc-------ccccccccccceeeEEeeeeeEEEeehhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHhhcccC- Q lcl|NC_011054. 64 -TE-------PEGVKPTSEATWADRTLVAEEVAVIIPVHENVVDDA----STSLLEEIAALGGQAIGKKLDQAVIFGTD- 130 (302) Q Consensus 64 -~~-------~~~~~~~s~~~f~~i~l~~~ki~~~~~iS~ell~ds----~~~~~~~i~~~l~~ai~~~~d~~~l~G~g- 130 (302) +. .....++...++++++...+.-+-...+|-||.||- ..|.++.|.+-|+..|...|++.+|.=-- T Consensus 246 AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii~~l~~ 325 (534) T protein:vir:10 246 AELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMVLWINA 325 (534) T ss_pred HhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhh Confidence 00 000123334455555555555555678999999984 36889999999999999999999985211 Q ss_pred -----CCcccccccccccccccccceeeccccchHHHHHHHhhhhhhh---hh--hcccCccEEEecHHHHHHHHh---h Q lcl|NC_011054. 131 -----KPSSWVSPALLPAAVAANQDYTIVPGDANEDDLIGCINRASKA---VA--AAGYMPDTLLASLGFRFDVAN---L 197 (302) Q Consensus 131 -----~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~---~~--~~~~~~~~~v~~~~~~~~l~~---l 197 (302) +..+.....................+-...+.+..++..+-.. +. ........+++|+.....|.. | T Consensus 326 ~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~~~g~l 405 (534) T protein:vir:10 326 TAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALGHTDML 405 (534) T ss_pred hhheeecccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHhhccch Confidence 0011100000000000011111111222333333333333222 11 222356678899999999954 1 Q ss_pred h-------------cCCCceeeecccccCcceEeecccccCCCcceEEEEecce------EEEEe-ecCcEEEEeecccc Q lcl|NC_011054. 198 R-------------DANGNPIFRDESFNGFGTYFNANGAWPVGVAEALVVDSSR------VRIGV-RQDITVKFLDQATV 257 (302) Q Consensus 198 ~-------------d~~g~~i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~------~~~~~-~~~~~i~~~~~~~~ 257 (302) . |.++. .+..-...++++++..+ .....+++|-... .++.. .....+...|. T Consensus 406 ~~~~~~~~~~~~~~d~~~~-~~~G~l~~~~~vy~D~y----~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp--- 477 (534) T protein:vir:10 406 MTPAVMGANTTMNTDTTSS-LFAGVLAGKYRVYIDQY----AVEDYFTVGYKGASEMDAGLYYCPYVALTPLRGTDP--- 477 (534) T ss_pred hccccccccccccccCCCc-eEEEEecCceEEEecCC----CCcceEEEEEeCCcccccceeeccccccccccccCC--- Confidence 1 12111 11222233445544433 2233344443311 01100 00111111111 Q ss_pred cchhhhcCCcEEEEEEEEeccEEecccc-------EEEEeeecccccCCCCC Q lcl|NC_011054. 258 GSINLAERDMIALRLKARFAYVLGNGAT-------AVGDNKTPVGAVVPDGS 302 (302) Q Consensus 258 ~~~~~~~~~~~~~r~~~r~d~~v~~~~a-------~~~lt~~~a~~~~p~~~ 302 (302) .+-|=.+-...|++..+ +|=+ +.++... +|..+ T Consensus 478 ------~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~~i~~g-----~~~~~ 517 (534) T protein:vir:10 478 ------KNFQPVLGFKTRYGVKL-HPMADATQNKGFAKISNG-----MPQHT 517 (534) T ss_pred ------ccccceeeeeeeeceee-cCcccccCCccccccccC-----Ccchh Confidence 11222344455666554 3311 1122210 13322 No 215 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=73.54 E-value=0.17 Score=24.82 Aligned_cols=280 Identities=12% Similarity=0.085 Sum_probs=143.0 Q ss_pred CCCcc---CCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADIS---RSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t---~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) +|... ...-.+-|-+.+...+.+.+++.+-+++.++.++++... ..+-....++-++...-+... -+|..-.. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t~~~~---R~~~~~~~ 92 (337) T protein:vir:10 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAA---RQPIDPTA 92 (337) T ss_pred HHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecCCCCc---cccccccc Confidence 33222 223345577788899999999999999999999887532 344444445555444333211 01111123 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCC---------Cc------ccccc- Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---------PS------SWVSP- 138 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---------~~------g~~~~- 138 (302) ++.-...-++.---+.|+-+.|+. ..++|...+++.+.++++.-+-.--++|+.- |. |.+.. T Consensus 93 l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~ 172 (337) T protein:vir:10 93 LDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQY 172 (337) T ss_pred cCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHH Confidence 344444455555556677777763 2468999999999999988888888888641 11 11100 Q ss_pred ------cccc-cccccccceeeccccchHHHHHHHhhhhhhhh-hhcccC-cc-EEEecHHHHH-HHHhhhcCCCce--- Q lcl|NC_011054. 139 ------ALLP-AAVAANQDYTIVPGDANEDDLIGCINRASKAV-AAAGYM-PD-TLLASLGFRF-DVANLRDANGNP--- 204 (302) Q Consensus 139 ------~~~~-~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~-~~-~~v~~~~~~~-~l~~l~d~~g~~--- 204 (302) -... .+...+. .......++..+..++.++...+ .....+ +. .+++.+.... +-..|-...+.| T Consensus 173 Re~ap~rV~~~~~~~~~~--i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l~n~~~~ptE~ 250 (337) T protein:vir:10 173 RERAAQRVLHEGAKQAGK--VLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTER 250 (337) T ss_pred HhcchhhhhccccccCcc--eeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHHhccCCCcHHH Confidence 0000 0000000 11122335566666666655432 333222 22 3455655544 223333333333 Q ss_pred -----eeecccccCcceEeecccccCCCcceEEEEecceEEEEeecCcEE-EEeecccccchhhhcCCcEEEEEEEEecc Q lcl|NC_011054. 205 -----IFRDESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQDITV-KFLDQATVGSINLAERDMIALRLKARFAY 278 (302) Q Consensus 205 -----i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~r~~~r~d~ 278 (302) +.....+.|+|...+... ....+++--++..-+....+-.= .+.+.. ++|++.-.-..--|+ T Consensus 251 ~Aa~~i~s~k~iGGlpa~~~Pff----P~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p--------~r~rie~y~s~Ne~Y 318 (337) T protein:vir:10 251 LAADLIVSQKRIGNLPAVRVPFF----PKRALMVTKLSNLSIYYQEGARRRTLKEVP--------ERDRIENYESSNDAY 318 (337) T ss_pred HHHHHHHHhhhhCCceeEEcccc----CCCceEEeechhcEEEEecCcEEEEEEEcc--------ccccccchhhcccee Confidence 223346788888776553 34446677777655444433211 122221 222222222223466 Q ss_pred EEeccccEEEEeeeccccc Q lcl|NC_011054. 279 VLGNGATAVGDNKTPVGAV 297 (302) Q Consensus 279 ~v~~~~a~~~lt~~~a~~~ 297 (302) .|.+...++.+.....+.. T Consensus 319 vVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:10 319 VVEDFGCGCVAENIELAAA 337 (337) T ss_pred eeeccccEEEEeceeecCC Confidence 6677777666664332221 No 216 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=72.39 E-value=0.18 Score=24.62 Aligned_cols=280 Identities=12% Similarity=0.080 Sum_probs=142.8 Q ss_pred CCCcc---CCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADIS---RSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t---~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) +|... ...-.+-|-+.+...+.+.+++.+-+++.++.++++... ..+-....++-++...-+... -+|..-.. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t~~~~---R~~~~~~~ 92 (337) T protein:vir:79 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAA---RQPIDPTA 92 (337) T ss_pred HHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecCCCCc---cccccccc Confidence 33222 222345577788899999999999999999999887532 344444445555444333211 01111123 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCC---------Cc------ccccc- Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---------PS------SWVSP- 138 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---------~~------g~~~~- 138 (302) ++.-...-++.---+.|+-+.|+. ..++|...+++.+.++++.-+-.--++|+.- |. |.+.. T Consensus 93 l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~ 172 (337) T protein:vir:79 93 LDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQY 172 (337) T ss_pred cCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHH Confidence 344444455555556677777763 2468999999999999988888888888641 11 11100 Q ss_pred ------cccc-cccccccceeeccccchHHHHHHHhhhhhhhh-hhcccC-cc-EEEecHHHHH-HHHhhhcCCCce--- Q lcl|NC_011054. 139 ------ALLP-AAVAANQDYTIVPGDANEDDLIGCINRASKAV-AAAGYM-PD-TLLASLGFRF-DVANLRDANGNP--- 204 (302) Q Consensus 139 ------~~~~-~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~-~~-~~v~~~~~~~-~l~~l~d~~g~~--- 204 (302) -... .+...+ ........++..+..++.++...+ .....+ +. ..++.+.... +-..|-...+.| T Consensus 173 Re~ap~rV~~~~~~~~~--~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l~n~~~~ptE~ 250 (337) T protein:vir:79 173 RERAAQRVLHEGAKQAG--KVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGRELLHDKYFPIVNATQAPTER 250 (337) T ss_pred HhcchhhhhccccccCc--ceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHHhccCCCcHHH Confidence 0000 000000 011123335666666666655432 333222 22 3455655544 323333333333 Q ss_pred -----eeecccccCcceEeecccccCCCcceEEEEecceEEEEeecCcEE-EEeecccccchhhhcCCcEEEEEEEEecc Q lcl|NC_011054. 205 -----IFRDESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQDITV-KFLDQATVGSINLAERDMIALRLKARFAY 278 (302) Q Consensus 205 -----i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~r~~~r~d~ 278 (302) +.....+.|+|...+... ....+++--++..-+....+-.= .+.+.. ++|++.-.-..--|+ T Consensus 251 ~Aa~~i~s~k~iGGlpa~~~Pff----P~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p--------~r~rie~y~s~Ne~Y 318 (337) T protein:vir:79 251 LAADLIVSQKRIGNLPAVRVPFF----PKRALMVTKLSNLSIYYQEGARRRTLKEVP--------ERDRIENYESSNDAY 318 (337) T ss_pred HHHHHHHHhhhhCCceeEEcccc----CCCceEEeechhcEEEEecCcEEEEEEEcc--------ccccccchhhcccee Confidence 223346788888776553 34446677777655444333211 122221 222222222223466 Q ss_pred EEeccccEEEEeeeccccc Q lcl|NC_011054. 279 VLGNGATAVGDNKTPVGAV 297 (302) Q Consensus 279 ~v~~~~a~~~lt~~~a~~~ 297 (302) .|.+...++.+.....+.. T Consensus 319 vVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 319 VVEDFGCGCVAENIELAAA 337 (337) T ss_pred eeeccccEEEEeceeecCC Confidence 6677777666654332221 No 217 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=70.45 E-value=0.21 Score=24.31 Aligned_cols=282 Identities=14% Similarity=0.153 Sum_probs=144.6 Q ss_pred CCCccCC-----C--cceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccc Q lcl|NC_011054. 1 MADISRS-----E--VATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPT 72 (302) Q Consensus 1 Ma~~t~~-----~--~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (302) +|....- + --+-|-+.+...+.+.+++.+-++++++.+++.... ..+-....++-++.+.-.... + -.+. T Consensus 16 ~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~-~-R~~~ 93 (342) T protein:vir:10 16 QAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLDSAHTVASTTDTSGDG-E-RKTT 93 (342) T ss_pred HHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecccCcccccccccCCCC-C-cccc Confidence 4433221 2 236688888899999999999999999999887532 344444444555443211110 0 0111 Q ss_pred cccceeeEEeeeeeEEEeehhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCC---------C------ccc Q lcl|NC_011054. 73 SEATWADRTLVAEEVAVIIPVHENVVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---------P------SSW 135 (302) Q Consensus 73 s~~~f~~i~l~~~ki~~~~~iS~ell~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---------~------~g~ 135 (302) .-..++.-...-++.---+.|+-+.|+. ..++|...+++.+.++++.-.-.--++|+.- | .|. T Consensus 94 ~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GW 173 (342) T protein:vir:10 94 SIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAATSDRNSNPLLQDVAKGW 173 (342) T ss_pred cccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchHH Confidence 1123344444445555556667776663 2468999999999999888887778888542 1 111 Q ss_pred ccc-------cccccccccccceeeccccchHHHHHHHhhhhhhhh-hhcccC-cc-EEEecHHHHH-HHHhhhcCCCce Q lcl|NC_011054. 136 VSP-------ALLPAAVAANQDYTIVPGDANEDDLIGCINRASKAV-AAAGYM-PD-TLLASLGFRF-DVANLRDANGNP 204 (302) Q Consensus 136 ~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~-~~-~~v~~~~~~~-~l~~l~d~~g~~ 204 (302) +.. -........+ ........++..+..++.++...+ .....+ +. .+++.+.... +-..|-...+.| T Consensus 174 lQ~~Re~ap~rv~~~~~~~~--~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~~~p 251 (342) T protein:vir:10 174 LQKMREDAKERVMNGESTDN--QVLVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITGRKLLADKYFPIVNQQNAP 251 (342) T ss_pred HHHHHhhhhhhhcccceecc--ceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHHHhcCCCh Confidence 100 0000000000 011122335666666666666433 333332 22 3456655544 222332332232 Q ss_pred --------eeecccccCcceEeecccccCCCcceEEEEecceEEEEeecCc-EEEEeecccccchhhhcCCcEEEEEEEE Q lcl|NC_011054. 205 --------IFRDESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQDI-TVKFLDQATVGSINLAERDMIALRLKAR 275 (302) Q Consensus 205 --------i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~-~i~~~~~~~~~~~~~~~~~~~~~r~~~r 275 (302) +.....+.|+|...+... ....+++--++..-+-...+- .=.+.+.. .+|++.-.-..- T Consensus 252 tE~~Aa~~i~s~k~iGGl~a~~~PfF----P~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p--------~r~rie~y~s~N 319 (342) T protein:vir:10 252 TEELAADIVISQKRIGGLKAVRVPFF----PANAILITKLENLAIYVQEGTTRKHIENVP--------KKDRIETYESEN 319 (342) T ss_pred HHHHHHHHHHhhhhhcCceeEEcccc----CCCceEEeeccccEEEEecCcEEEEEEecc--------ccccccchhhhc Confidence 223346778888766553 334466666666544333331 11122221 223332222233 Q ss_pred eccEEeccccEEEEeeecccccCCC Q lcl|NC_011054. 276 FAYVLGNGATAVGDNKTPVGAVVPD 300 (302) Q Consensus 276 ~d~~v~~~~a~~~lt~~~a~~~~p~ 300 (302) -|+.|.+...++.+.....+ .|+ T Consensus 320 e~YvVEd~~~~a~iE~i~i~--~~~ 342 (342) T protein:vir:10 320 IDYVVEDYGCAALIENITLK--DKE 342 (342) T ss_pred cceeeeccccEEEeecceec--CCC Confidence 56777788888777765544 466 No 218 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=69.50 E-value=0.22 Score=24.17 Aligned_cols=282 Identities=15% Similarity=0.135 Sum_probs=141.3 Q ss_pred CCC---ccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MAD---ISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~---~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) +|. ..+.+-.+-|-+.+...+.+.+++.+-++++++.++++... ..+-....++-++...-... + -.+..-.. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~~~--~-R~~~~~~~ 92 (339) T protein:vir:79 16 IAKLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVSGPVASTTDTTQQ--D-RETSDIST 92 (339) T ss_pred HHHHhCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccCcceeecccCCCC--C-cccccccc Confidence 222 22344556788888899999999999999999999887532 34444444444444322110 0 01111123 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCC---------C------cccccc- Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---------P------SSWVSP- 138 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---------~------~g~~~~- 138 (302) ++.-...-++.---+.|+-+.|+. ..++|...+++.+.++++.-.-.--++|+.- | .|.+.. T Consensus 93 l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~ 172 (339) T protein:vir:79 93 MDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDRVANPMLQDVNKGWLQNL 172 (339) T ss_pred cCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeecCCChhhCcCccccchhHHHHH Confidence 344444444544455666666663 2468999999999999888887778888541 1 111100 Q ss_pred ------cccc-cccccccceeeccccchHHHHHHHhhhhhhhh-hhcccC-cc-EEEecHHHHH-HHHhhhcCCCce--- Q lcl|NC_011054. 139 ------ALLP-AAVAANQDYTIVPGDANEDDLIGCINRASKAV-AAAGYM-PD-TLLASLGFRF-DVANLRDANGNP--- 204 (302) Q Consensus 139 ------~~~~-~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~-~~-~~v~~~~~~~-~l~~l~d~~g~~--- 204 (302) -... .....+. ........++..+..++.++...+ .....+ +. .+++.+.... +-..|-.....| T Consensus 173 Re~ap~rV~~~g~~~s~~-i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~ 251 (339) T protein:vir:79 173 REQAPQRVMKEGKAAAGK-ITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGRNLLSDKYFPLVNRDRDPVQQ 251 (339) T ss_pred Hhhhhhhhhccceeccce-eEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhHhhhHhhcCCChHHH Confidence 0000 0000011 111122335666666666666433 333332 22 3445555533 333333333333 Q ss_pred -----eeecccccCcceEeecccccCCCcceEEEEecceEEEEeecC-cEEEEeecccccchhhhcCCcEEEEEEEEecc Q lcl|NC_011054. 205 -----IFRDESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQD-ITVKFLDQATVGSINLAERDMIALRLKARFAY 278 (302) Q Consensus 205 -----i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~ 278 (302) +.....+.|+|...+... ....+++--++..-+-...+ ..=.+.+.. .+|++.-.-..--|+ T Consensus 252 ~Aa~~i~s~k~iGGl~a~~~PfF----P~~~llVT~L~NLsIY~Q~gs~RR~~~d~p--------~r~rie~y~s~Ne~Y 319 (339) T protein:vir:79 252 IAADLIISQKRIGNLPAIRVPYF----PANGLLVTRLDNLSIYYQEGGRRRTILDNA--------KRDRIENYESSNDAY 319 (339) T ss_pred HHHHHHHHhhhhCCceeEEcccc----CCCceEEeechhcEEEEecCcEEEEEEecc--------ccccccchhhcccee Confidence 223356788888766553 33446666666654433333 111122222 122222222222456 Q ss_pred EEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 279 VLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 279 ~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) .|.+...++.+.....+. |+ T Consensus 320 vVEd~~~~a~iEni~~~~----aa 339 (339) T protein:vir:79 320 VIEDLACAAMAENIALAA----AA 339 (339) T ss_pred eeeccccEEEeeeeeccc----CC Confidence 666776666665332111 11 No 219 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=68.99 E-value=0.23 Score=24.09 Aligned_cols=289 Identities=13% Similarity=0.098 Sum_probs=119.7 Q ss_pred CC------CccCCCcceecchHHHHHHHHHHHhhh--hhhhhcceeecCCCceEEE---EEeCCcceeeecccccccccc Q lcl|NC_011054. 1 MA------DISRSEVATLIQEAYANDLLASAKKGS--TVLQAFPTVNMGTKTTHLP---VLATLPGASWVSESATEPEGV 69 (302) Q Consensus 1 Ma------~~t~~~~g~liP~~~~~~ii~~~~~~s--~l~~~~~~~~~~~~~~~~p---~~~~~~~a~~v~E~~~~~~~~ 69 (302) |. -.+-.+++.|=-+.+.++|-.+..... .+.+-+.+.+..+-...|- .......+.++.|+...+ T Consensus 22 ~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~--- 98 (464) T protein:vir:80 22 FTTGYGITPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITKRPATSTVAKYDVYLAHGRVGHTRFTREIGVAP--- 98 (464) T ss_pred HHhCCccCcccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhhheeeccCccccccccccccccc--- Confidence 22 112233455545555555543333322 2344445555544322222 222335677888887643 Q ss_pred ccccccceeeEEeeeeeEEEeehhHHHH-HhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC-------Cccccccccc Q lcl|NC_011054. 70 KPTSEATWADRTLVAEEVAVIIPVHENV-VDDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK-------PSSWVSPALL 141 (302) Q Consensus 70 ~~~s~~~f~~i~l~~~ki~~~~~iS~el-l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~-------~~g~~~~~~~ 141 (302) .+++++.+.....|=+...-.+|-.+ +.++..+-.....+.-...++..+|.++|+|+.. +.|+.-.++. T Consensus 99 --~~d~~~~Rr~~~~Kfl~~~r~vsia~~lvn~~~d~~~~~~~dai~~va~tiE~a~FyGds~l~~~~~~~~gleFDGl~ 176 (464) T protein:vir:80 99 --ISDPNLRQKTVNMKYVSDTKNMSIATGLVNNIEDPMRILTDDAISVVAKTIEWASFYGDSDLSENPDAGSGLEFDGLA 176 (464) T ss_pred --cCCCceEEEEEEeeeeecceeeeeehhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCCCccccchhhhH Confidence 46788888887766444433333322 2345667777888888889999999999999763 2333333333 Q ss_pred ccccccccceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHH-HhhhcCCCceeeecc---cccCcceE Q lcl|NC_011054. 142 PAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDV-ANLRDANGNPIFRDE---SFNGFGTY 217 (302) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l-~~l~d~~g~~i~~~~---~~~g~p~~ 217 (302) ... ...+....-....+ .+++..+...+..++..++-+.|+....+.+ ...-+..-+.+ .++ ...|.++. T Consensus 177 ~lI-~~~NViDarG~~Ls----~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~~q~~~~-~~n~~~~~~G~~v~ 250 (464) T protein:vir:80 177 KLI-DKHNVLDAKGASLT----EALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLDRQVQVI-SDNGQNATMGFNVK 250 (464) T ss_pred hhc-CCCceeecCCCCcC----HHHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcCceeEEE-cCCCCcceeeeecc Confidence 222 23333333333333 2445556556666788888888888888665 44333222222 111 11122211 Q ss_pred eeccc--ccCCCcceEEEEecc-----eEE-EEeecCcEEEEeecccccchhhhcCC---cEEEEEEEE----------- Q lcl|NC_011054. 218 FNANG--AWPVGVAEALVVDSS-----RVR-IGVRQDITVKFLDQATVGSINLAERD---MIALRLKAR----------- 275 (302) Q Consensus 218 ~~~~~--~~~~~~~~~~~gd~~-----~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~---~~~~r~~~r----------- 275 (302) -.... ..... +-.++.+.. ... -+.....++..--+.+ +......++ ...|++... T Consensus 251 ~f~sa~G~i~L~-~s~~m~~~~~ld~~~~~~~~apaapsvt~tv~~~-~~g~f~~~~~~~~~~Ykv~~vn~~GeS~ps~~ 328 (464) T protein:vir:80 251 GFNSARGFIRLH-GSTVMELEQILDENRMQLPNAPQKATVKATLEAG-TKGKFRDEDLTIDTEYKVVVVSDDAESAPSDV 328 (464) T ss_pred cccccccceecc-CccccCcccccccccccCCCCcCCceeEEEecCC-cccCCccccccceeEEEEEEECCCCcccccee Confidence 00000 00000 000001000 000 0000001111100000 000001111 011222211 Q ss_pred eccEEeccccEEEEeeeccccc--CCC-----------CC Q lcl|NC_011054. 276 FAYVLGNGATAVGDNKTPVGAV--VPD-----------GS 302 (302) Q Consensus 276 ~d~~v~~~~a~~~lt~~~a~~~--~p~-----------~~ 302 (302) .+..+...+.-++++-++-+-. +|+ |+ T Consensus 329 ~~~ti~~~~~~V~l~it~~~~~~~~p~yv~IYR~~~~~g~ 368 (464) T protein:vir:80 329 ASVVIDDKKKQVKLEITINNMYQARPQYVAIYRKGLETGL 368 (464) T ss_pred eeeeecCcccEEEEEEEeCCccccccceEEEEeecCCCCc Confidence 1111222222333333321111 111 11 No 220 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=64.22 E-value=0.3 Score=23.42 Aligned_cols=272 Identities=9% Similarity=0.047 Sum_probs=114.4 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhh--hhhcceeecCCCceEEE---EEeCCcceeeecccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTV--LQAFPTVNMGTKTTHLP---VLATLPGASWVSESATEPEGVKPTSEA 75 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l--~~~~~~~~~~~~~~~~p---~~~~~~~a~~v~E~~~~~~~~~~~s~~ 75 (302) -.-.+-.+++.+=-+.+.++|..+......+ .+-+.+.+..+-...|- .......+.++.|+...+ .+++ T Consensus 31 ~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~-----~~~~ 105 (467) T protein:vir:80 31 ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-----VSDP 105 (467) T ss_pred cCCccccCcchhhhhhhhhhhheeeccccchhhhhhcccchhhhhhhhheeeeccCccccccccccccccc-----cCCC Confidence 2222223344444555555554444333332 22233333333222222 222336677888887643 4689 Q ss_pred ceeeEEeeeeeEEEeehhHHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC-------Cccccccccccccccc Q lcl|NC_011054. 76 TWADRTLVAEEVAVIIPVHENVV-DDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK-------PSSWVSPALLPAAVAA 147 (302) Q Consensus 76 ~f~~i~l~~~ki~~~~~iS~ell-~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~-------~~g~~~~~~~~~~~~~ 147 (302) ++.......|-++....+|..+- ..+..+......+.-...++..+|.++|+|+.. +.|++..++.... .. T Consensus 106 ~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li-~~ 184 (467) T protein:vir:80 106 NIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLI-NQ 184 (467) T ss_pred ceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccceeEEe-cC Confidence 99999999999988777776432 334567778888888899999999999999763 2334444433322 22 Q ss_pred ccceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHH-HhhhcCCCceeee--cccccCcceEeeccccc Q lcl|NC_011054. 148 NQDYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDV-ANLRDANGNPIFR--DESFNGFGTYFNANGAW 224 (302) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l-~~l~d~~g~~i~~--~~~~~g~p~~~~~~~~~ 224 (302) ....+.-....+. +++..+.......+..+.-+.|+....+.| ...-...=+.... .....|.++. .. T Consensus 185 enviDa~G~~ls~----~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~~L~~q~~v~~~n~~~~~~G~~v~--g~--- 255 (467) T protein:vir:80 185 DNVHDARGASLTE----SLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFNIQ--GF--- 255 (467) T ss_pred CceeccCCCccCH----HHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCceEEEEcCCCCceeeeeccc--ce--- Confidence 3333333333332 223333333334566666677787777666 3222211111110 0112223321 00 Q ss_pred CCCcce------EEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEe-ccc--cEE--EEeeec Q lcl|NC_011054. 225 PVGVAE------ALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLG-NGA--TAV--GDNKTP 293 (302) Q Consensus 225 ~~~~~~------~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~-~~~--a~~--~lt~~~ 293 (302) -...+. .++++.... .-+...-... ..-..+ -+..-.+.+=. ..+ +.. +++.-. T Consensus 256 ~sa~G~I~l~gs~il~~~~~l--------~~~~~~~~~A-----psp~~v--saT~~~~~~g~~~~~~~a~y~Y~v~~vs 320 (467) T protein:vir:80 256 HSARGFIKLHGSTVMENEQIL--------DERILALPTA-----PQPAKV--TATQEAGKKGQFRAEDLAAHEYKVVVSS 320 (467) T ss_pred ecceeeeeecCceeeccccCC--------Cccccccccc-----ccCCcc--ceeeecccCCcccCCCcceEEEEEEEEC Confidence 001111 112221111 0000000000 000000 00111111100 000 111 111111 Q ss_pred ccccCCCCC Q lcl|NC_011054. 294 VGAVVPDGS 302 (302) Q Consensus 294 a~~~~p~~~ 302 (302) +...++.+. T Consensus 321 ~~GES~pS~ 329 (467) T protein:vir:80 321 DDAESIASE 329 (467) T ss_pred CCCcccccc Confidence 111111111 No 221 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=63.94 E-value=0.31 Score=23.38 Aligned_cols=261 Identities=14% Similarity=0.080 Sum_probs=120.3 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhc--ceeecCCCceEEEEEeCCcceeeecccccccccccccccccee Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAF--PTVNMGTKTTHLPVLATLPGASWVSESATEPEGVKPTSEATWA 78 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~--~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~f~ 78 (302) ||... -+.+...+.+.++..+.-..+. +..-.++++++||+.....-..+-- +.....+. -+.+.. T Consensus 1 Main~--------a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY~R-~~g~~~g~---v~~~~e 68 (290) T protein:vir:78 1 MAINY--------VDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAHTR-NKGYNEGS---ASNTNK 68 (290) T ss_pred CchhH--------HHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCccccccc-CCCcccCc---ccccee Confidence 88543 2456777777777765543332 3334566789999876543222211 11111111 123445 Q ss_pred eEEeeeeeEEEeehhHHHHHh-cchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccccccceeecccc Q lcl|NC_011054. 79 DRTLVAEEVAVIIPVHENVVD-DASTSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSWVSPALLPAAVAANQDYTIVPGD 157 (302) Q Consensus 79 ~i~l~~~ki~~~~~iS~ell~-ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~~~~~~~~~~~~~~~~~~~~~~~ 157 (302) ..+|...+.-.+..=....-+ +-...+...+.+...+.++-.+|...+.--- ..+...+ ...+.+ T Consensus 69 t~tl~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla-----------~~a~~~~---~~~~~t 134 (290) T protein:vir:78 69 SYTIDFDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLA-----------TAAKTNS---NSVAEE 134 (290) T ss_pred eEEeeccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHH-----------hhhhccC---cccccc Confidence 556666555444321111111 1124566777777788888888877663100 0000000 011123 Q ss_pred chHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHHHhhhcCC--------Cceeeec--ccccCcceEeeccc----- Q lcl|NC_011054. 158 ANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDVANLRDAN--------GNPIFRD--ESFNGFGTYFNANG----- 222 (302) Q Consensus 158 ~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l~d~~--------g~~i~~~--~~~~g~p~~~~~~~----- 222 (302) .+.+..++.+.++...+......+.+++++|..+..|.+.+.-. ++-.... ..++|.++.-+... T Consensus 135 ~t~~n~~~~i~~~~~~ldevp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t 214 (290) T protein:vir:78 135 ITKDNVFTKLKAAIRKVKKYGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYD 214 (290) T ss_pred cCHHHHHHHHHHHHHHHHhcCCCCeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCcEEEEecccchhhh Confidence 45566677777777777666666677889999999886532111 1111111 34667665443311 Q ss_pred ----------ccCCCcceEEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEecccc-EEEEee Q lcl|NC_011054. 223 ----------AWPVGVAEALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNGAT-AVGDNK 291 (302) Q Consensus 223 ----------~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a-~~~lt~ 291 (302) .....+-..++...+.. +...+--.+.+++-...+ +.|.-.+.-+.+.|.=|.+.+. -+.... T Consensus 215 ~~~f~~G~~~~~~ak~in~ii~~~~a~-i~~~K~~~~~~~~P~~~~-----~~d~~~~~~r~y~d~~v~~nk~~~i~~~~ 288 (290) T protein:vir:78 215 TFDFTDGYKPAAGAKKLNFLLVNKGSV-VGGAKHASIYLHAPGSVG-----QGDGWLYQYRVYHDIFVLDQQKDGVIAST 288 (290) T ss_pred hhhhcccccccCCccceeEEEEcCCce-eeeeeeeEEEeeCCCCCc-----CcceeeeeeeeeeeeeeeccccCeeEEEe Confidence 00111111233332222 222111123333222111 1132345555566766665422 222332 Q ss_pred ec Q lcl|NC_011054. 292 TP 293 (302) Q Consensus 292 ~~ 293 (302) +. T Consensus 289 ~~ 290 (290) T protein:vir:78 289 EV 290 (290) T ss_pred eC Confidence 22 No 222 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=62.98 E-value=0.33 Score=23.26 Aligned_cols=272 Identities=9% Similarity=0.045 Sum_probs=114.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhh--hhhhcceeecCCCceEEE---EEeCCcceeeecccccccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGST--VLQAFPTVNMGTKTTHLP---VLATLPGASWVSESATEPEGVKPTSEA 75 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~--l~~~~~~~~~~~~~~~~p---~~~~~~~a~~v~E~~~~~~~~~~~s~~ 75 (302) -.-.+-.+++.+=-+.+..+|..+...... +.+-+.+.+..+-...|- .......+.++.|+...+ .+++ T Consensus 32 ~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~~di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~-----~~~~ 106 (468) T protein:vir:63 32 ITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAP-----VSDP 106 (468) T ss_pred cCCccccCcchhhhhhhhhhhheeeecccchhhhhhcccchhhhhhhhheeeeccCccccccccccccccc-----cCCC Confidence 111222334444445555555444333333 223333333333222222 222336677888887643 4689 Q ss_pred ceeeEEeeeeeEEEeehhHHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC-------Cccccccccccccccc Q lcl|NC_011054. 76 TWADRTLVAEEVAVIIPVHENVV-DDASTSLLEEIAALGGQAIGKKLDQAVIFGTDK-------PSSWVSPALLPAAVAA 147 (302) Q Consensus 76 ~f~~i~l~~~ki~~~~~iS~ell-~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~-------~~g~~~~~~~~~~~~~ 147 (302) ++.......|-++....+|..+- ..+..+......+.-...++..+|.++|+|+.. +.|++..++.... .. T Consensus 107 ~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li-~~ 185 (468) T protein:vir:63 107 NIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLI-NQ 185 (468) T ss_pred ceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccceeEEe-cC Confidence 99999999999988777776432 334567778888888899999999999999763 2334444433322 22 Q ss_pred ccceeeccccchHHHHHHHhhhhhhhhhhcccCccEEEecHHHHHHH-HhhhcCCCceeee--cccccCcceEeeccccc Q lcl|NC_011054. 148 NQDYTIVPGDANEDDLIGCINRASKAVAAAGYMPDTLLASLGFRFDV-ANLRDANGNPIFR--DESFNGFGTYFNANGAW 224 (302) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~~~~~~~~l-~~l~d~~g~~i~~--~~~~~g~p~~~~~~~~~ 224 (302) ....+.-....+. +++..+.......+..+.-+.|+....+.| ...-...=+.... .....|.++. .. T Consensus 186 enviDa~G~~ls~----~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~~L~~q~~v~~~n~~~~~~G~~v~--g~--- 256 (468) T protein:vir:63 186 DNVHDARGASLTE----SLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLVRDNGNNVSVGFNIQ--GF--- 256 (468) T ss_pred CceeccCCCccCH----HHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCceEEEEcCCCCceeeeeccc--ce--- Confidence 3333333333332 223333333334566666677787777666 3222211111110 0112223321 00 Q ss_pred CCCcce------EEEEecceEEEEeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEe-ccc--cEE--EEeeec Q lcl|NC_011054. 225 PVGVAE------ALVVDSSRVRIGVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLG-NGA--TAV--GDNKTP 293 (302) Q Consensus 225 ~~~~~~------~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~-~~~--a~~--~lt~~~ 293 (302) -...+. .++++.... .-+...-... ..-..+ -+..-.+.+=. ..+ +.. +++.-. T Consensus 257 ~sa~G~I~l~gs~il~~~~~l--------~~~~~~~~~A-----psp~~v--saT~~~~~~g~~~~~~~a~y~Y~v~~vs 321 (468) T protein:vir:63 257 HSARGFIKLHGSTVMENEQIL--------DERILALPTA-----PQPAKV--TATQEAGKKGQFRAEDLAAHEYKVVVSS 321 (468) T ss_pred ecceeeeeecCceeeccccCC--------Cccccccccc-----ccCCcc--ceeeecccCCcccCCCcceEEEEEEEEC Confidence 001111 112221111 0000000000 000000 00111111100 000 111 111111 Q ss_pred ccccCCCCC Q lcl|NC_011054. 294 VGAVVPDGS 302 (302) Q Consensus 294 a~~~~p~~~ 302 (302) +...++.+. T Consensus 322 ~~GES~pS~ 330 (468) T protein:vir:63 322 DDAESIASE 330 (468) T ss_pred CCCcccccc Confidence 111111111 No 223 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=61.67 E-value=0.35 Score=23.09 Aligned_cols=255 Identities=9% Similarity=-0.002 Sum_probs=108.3 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcce-e---e-c--CCCceEEEEEeCCcceeeec-cccccccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPT-V---N-M--GTKTTHLPVLATLPGASWVS-ESATEPEGVKPT 72 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~-~---~-~--~~~~~~~p~~~~~~~a~~v~-E~~~~~~~~~~~ 72 (302) ||+.=.+ .+|+.|+.++++.+++..++.+++.. . . . .+.+++|++-.......+-. .+..+.. T Consensus 1 MaN~llT----~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~----- 71 (423) T protein:vir:10 1 MPNNLDS----NVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNK----- 71 (423) T ss_pred Cccchhh----hhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCcccccccc----- Confidence 9955332 47999999999999999999888754 2 1 1 25567776543222111111 1111100 Q ss_pred ccccee--eEEeeeeeEEEeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcc-cCCCccccccccccccccccc Q lcl|NC_011054. 73 SEATWA--DRTLVAEEVAVIIPVHENVVDDASTSLLEEIAALGGQAIGKKLDQAVIFG-TDKPSSWVSPALLPAAVAANQ 149 (302) Q Consensus 73 s~~~f~--~i~l~~~ki~~~~~iS~ell~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G-~g~~~g~~~~~~~~~~~~~~~ 149 (302) .+.+-. .+++..+|...+--=..|+. ....++++++... .++++..+|..++.- .+.+. ......+ T Consensus 72 ~dl~e~~v~l~id~~k~va~~v~d~E~~-~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~--------~~~gt~~- 140 (423) T protein:vir:10 72 NNLISGKATGRVGNYITVAVEYQQLEEA-IKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGA--------LSLGSPN- 140 (423) T ss_pred CccccceeEEEeeceeeeeeeechHHHh-cChhhHHHHHHHH-HHHHHHHHHHHHHHHHhhccc--------cccccCC- Confidence 111112 35666666655554444554 4556787766555 688999999998742 11110 0000000 Q ss_pred ceeeccccchHHHHHHHhhhhhhhhhhcc--cCccEEEecHHHHHHHHhh----hcCC--Cceeeec----ccccCcceE Q lcl|NC_011054. 150 DYTIVPGDANEDDLIGCINRASKAVAAAG--YMPDTLLASLGFRFDVANL----RDAN--GNPIFRD----ESFNGFGTY 217 (302) Q Consensus 150 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~~--~~~~~~v~~~~~~~~l~~l----~d~~--g~~i~~~----~~~~g~p~~ 217 (302) +....++ .+.++...+.... ....+.+++|..+..|.+- ...+ +.--+.. ..+.|+.++ T Consensus 141 -----t~~~a~~----~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~ 211 (423) T protein:vir:10 141 -----TPITKWS----DVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRAL 211 (423) T ss_pred -----cccchHH----HHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEE Confidence 0011233 3444444443322 2345788999998877532 1111 1111111 235566655 Q ss_pred eecccccCCCcceEEEEecceEEEEeecCcEEEEe--ecccccchhhhcCCcEEEE-EEEEeccEEeccccEEEEeeecc Q lcl|NC_011054. 218 FNANGAWPVGVAEALVVDSSRVRIGVRQDITVKFL--DQATVGSINLAERDMIALR-LKARFAYVLGNGATAVGDNKTPV 294 (302) Q Consensus 218 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~--~~~~~~~~~~~~~~~~~~r-~~~r~d~~v~~~~a~~~lt~~~a 294 (302) ...+....+ .+. ++. +... . .+..+... .+..... +.+. ...+.-..+...+. ++.+.+ T Consensus 212 ~Snnip~~T-~gt--~~~-t~~~--~-~~~~v~~~a~~~a~~~~--------~~~~~~~~~~~~~l~~GD~---~t~aGv 273 (423) T protein:vir:10 212 MSNGLASRT-QGA--FGG-TLTV--K-TQPTVTYNAVKDSYQFT--------VTLTGATASVTGFLKAGDQ---VKFTNT 273 (423) T ss_pred EeCCCcccc-ccc--ccc-ceee--e-ecceeccccccccceee--------eeeeeccccccCceeecce---EEecce Confidence 554433221 110 000 0000 0 01111000 0111000 0011 01111111111221 122222 Q ss_pred cccCCCCC Q lcl|NC_011054. 295 GAVVPDGS 302 (302) Q Consensus 295 ~~~~p~~~ 302 (302) -.+.|.-. T Consensus 274 ~~v~~~tk 281 (423) T protein:vir:10 274 YWLQQQTK 281 (423) T ss_pred eeeccccc Confidence 22222222 No 224 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=54.47 E-value=0.5 Score=22.22 Aligned_cols=280 Identities=12% Similarity=0.081 Sum_probs=140.8 Q ss_pred CCCcc---CCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADIS---RSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t---~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) +|... ...-.+-|-+.+...+.+.+++.+-++++++.++++... ..+-....++-++...-+.. + -+|..-.. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~~~--~-R~~~~~~~ 92 (337) T protein:vir:78 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKA--A-RQPIDPTA 92 (337) T ss_pred HHHhcChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcceeeeecCCCc--c-cccccccc Confidence 33222 233455688888899999999999999999999887432 33433344444444333221 0 01111122 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCC---------C------ccccc-- Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDD--ASTSLLEEIAALGGQAIGKKLDQAVIFGTDK---------P------SSWVS-- 137 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---------~------~g~~~-- 137 (302) ++.-...-++.---+.|+-+.|+. ..++|...+++.+.++++.-.-.--++|+.- | .|.+. T Consensus 93 l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~ 172 (337) T protein:vir:78 93 LDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQY 172 (337) T ss_pred cCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchHHHHHH Confidence 333344444444445666666663 2468999999999999888887778888542 1 11110 Q ss_pred -----ccccc-cccccccceeeccccchHHHHHHHhhhhhhh-hhhcccC-cc-EEEecHHHHH-HHHhhhcCCCce--- Q lcl|NC_011054. 138 -----PALLP-AAVAANQDYTIVPGDANEDDLIGCINRASKA-VAAAGYM-PD-TLLASLGFRF-DVANLRDANGNP--- 204 (302) Q Consensus 138 -----~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-~~~~~~~-~~-~~v~~~~~~~-~l~~l~d~~g~~--- 204 (302) .-... .+...+ ........++..+..++.++... +.....+ +. .+++.+.... +-..|-...+.| T Consensus 173 Re~ap~rVl~~~~~~~~--~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~~~ptE~ 250 (337) T protein:vir:78 173 RERAAQRVLHEGAKQAG--KVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTER 250 (337) T ss_pred HhcchhhhhccccccCC--ceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHHHhcCCCcHHH Confidence 00000 000000 01122333566666666666653 3333332 22 3455555543 223333333343 Q ss_pred -----eeecccccCcceEeecccccCCCcceEEEEecceEEEEeecC-cEEEEeecccccchhhhcCCcEEEEEEEEecc Q lcl|NC_011054. 205 -----IFRDESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQD-ITVKFLDQATVGSINLAERDMIALRLKARFAY 278 (302) Q Consensus 205 -----i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~ 278 (302) +.....+.|+|...+... ....+++--++..-+-...| ..=.+.+.. .+|++.-.-..--|+ T Consensus 251 ~Aa~~i~s~k~iGGl~a~~~PfF----P~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p--------~r~rie~y~s~Ne~Y 318 (337) T protein:vir:78 251 LAADLIVSQKRIGNLPAVRVPFF----PKRALMVTKLSNLSIYYQEGARRRTLKEVP--------ERDRIENYESSNDAY 318 (337) T ss_pred HHHHHHHHhhhhcCcceEEcccc----CCCceEEeechhcEEEEecCcEEEEEEecc--------ccccccchhhcccee Confidence 233456788998776553 33446666666654433333 111122221 222222222222466 Q ss_pred EEeccccEEEEeeecccccCCCC Q lcl|NC_011054. 279 VLGNGATAVGDNKTPVGAVVPDG 301 (302) Q Consensus 279 ~v~~~~a~~~lt~~~a~~~~p~~ 301 (302) .|.+...++.+.....+. | T Consensus 319 vVEd~~~~a~iEnI~~~~----a 337 (337) T protein:vir:78 319 VVEDFGCGCVAENIELAA----A 337 (337) T ss_pred eeeccccEEEEeceeecC----C Confidence 667777766665432222 1 No 225 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=53.27 E-value=0.53 Score=22.08 Aligned_cols=282 Identities=12% Similarity=0.061 Sum_probs=135.2 Q ss_pred CCCcc-----CCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccccc Q lcl|NC_011054. 1 MADIS-----RSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPTSE 74 (302) Q Consensus 1 Ma~~t-----~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~ 74 (302) +|... ..+-.+-|.+.+...+.+.+++.+-++++++.+++..-. ..+-....++-++....+.+ ... T Consensus 20 ~A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrt~tr~~-------~~~ 92 (358) T protein:vir:78 20 LAKAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVGVGQLYTGRKKGGRF-------KGK 92 (358) T ss_pred HHHHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeecCCcccceecCCCcc-------ccc Confidence 33322 223456788899999999999999999999999887532 23434444555544333221 122 Q ss_pred cceeeEEeeeeeEEEeehhHHHHHhc-c----hHHHHHHHHHHHHHHHHHHHHHHhhcccCC---------C------cc Q lcl|NC_011054. 75 ATWADRTLVAEEVAVIIPVHENVVDD-A----STSLLEEIAALGGQAIGKKLDQAVIFGTDK---------P------SS 134 (302) Q Consensus 75 ~~f~~i~l~~~ki~~~~~iS~ell~d-s----~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---------~------~g 134 (302) ..++.-...-++.---+.|+-+.|+. + ..+|...+++.+.++++.-.-.--++|+.- | .| T Consensus 93 ~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~G 172 (358) T protein:vir:78 93 VGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSAADDTDPTANPLGQDVNKG 172 (358) T ss_pred cccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchH Confidence 33344444455544555666666663 2 236999999999999888887778888542 1 11 Q ss_pred ccc-------ccccccccccccceeeccccchHHHHHHHhhhhhh-hhhhcccCc-c-EEEecHHHHH-HHHhhhcCCCc Q lcl|NC_011054. 135 WVS-------PALLPAAVAANQDYTIVPGDANEDDLIGCINRASK-AVAAAGYMP-D-TLLASLGFRF-DVANLRDANGN 203 (302) Q Consensus 135 ~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~-~-~~v~~~~~~~-~l~~l~d~~g~ 203 (302) .+. .....................++..+..++.++.. .+.....+. . .+++.+.... .-..|-...+. T Consensus 173 WlQ~~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ 252 (358) T protein:vir:78 173 WHQLAREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPRLVVLVGTDLVAAAQAKLYSEATK 252 (358) T ss_pred HHHHHHhhchhhhhccccccCceeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhhcCCC Confidence 110 00000000000000011122355666666666543 333333222 2 3455555543 33334333333 Q ss_pred ee---ee---cccccCcceEeecccccCCCcceEEEEecceEEEEeecC-cEEEEeecccccchhhhcCCcEEEEEEEEe Q lcl|NC_011054. 204 PI---FR---DESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQD-ITVKFLDQATVGSINLAERDMIALRLKARF 276 (302) Q Consensus 204 ~i---~~---~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~ 276 (302) |- -. -..+.|+|...+... ....+++--++..-+-...+ ..=.+.+......+..+++-. - T Consensus 253 pTE~~Aa~~i~k~iGGlpa~~~PfF----P~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~s~N--------e 320 (358) T protein:vir:78 253 PSEQIAAQQLAKSIAGRKAYIPPFF----PGKRMVVTTLDNLHCYTQRGTRKRKADDNQDSKSFDNQYWRM--------E 320 (358) T ss_pred cHHHHHHHHHHHHhCCCeEEEcccc----CCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhhc--------c Confidence 31 00 035678887766543 33446666666654433333 221222222222222222222 3 Q ss_pred ccEEeccccEEEEeee-------cccccCCCCC Q lcl|NC_011054. 277 AYVLGNGATAVGDNKT-------PVGAVVPDGS 302 (302) Q Consensus 277 d~~v~~~~a~~~lt~~-------~a~~~~p~~~ 302 (302) |+.|.+...++.+... |+. ...+++ T Consensus 321 ~YvVEd~~~~a~iE~i~v~~~~~pa~-~~~~~~ 352 (358) T protein:vir:78 321 GYALGEHKAYGGFEEADIEIGADPAV-LAVEAA 352 (358) T ss_pred eeeeeccccEEEEeeeeeeeCCCCCc-cccCCc Confidence 4444444444433322 211 111122 No 226 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=50.87 E-value=0.6 Score=21.81 Aligned_cols=282 Identities=10% Similarity=-0.020 Sum_probs=124.4 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhh-hcce--------e-ec---CCCceEEEEEeCCcceeeecccccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQ-AFPT--------V-NM---GTKTTHLPVLATLPGASWVSESATEPE 67 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~-~~~~--------~-~~---~~~~~~~p~~~~~~~a~~v~E~~~~~~ 67 (302) ||.+...-+.......++..+.....+.+++.. ++.. . .. .+..+++..... -...+|.+++..+. T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~-L~g~gv~Gd~~leG 79 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVH-LRGKPTYGDARVEG 79 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeee-cccCCcccCceeec Confidence 997777666666778888999988888888765 3311 0 01 122344443221 22344444443332 Q ss_pred ccccccccceeeEEeeeeeEEEeehhHHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHhhc-ccCCCccccccc------ Q lcl|NC_011054. 68 GVKPTSEATWADRTLVAEEVAVIIPVHENVV-DDASTSLLEEIAALGGQAIGKKLDQAVIF-GTDKPSSWVSPA------ 139 (302) Q Consensus 68 ~~~~~s~~~f~~i~l~~~ki~~~~~iS~ell-~ds~~~~~~~i~~~l~~ai~~~~d~~~l~-G~g~~~g~~~~~------ 139 (302) . +...+|.+-++....+..-+.....+- +-+..+|...-++.|..-+.+..|..+|. -.|. .|+.... T Consensus 80 n---ee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGa-rg~~~~~~~~~~~ 155 (364) T protein:vir:93 80 K---EESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGA-RGINLDFIETPDF 155 (364) T ss_pred c---ccceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-cccccccccccCc Confidence 1 234667766666666555554433333 33678999999999999999999997773 1111 0100000 Q ss_pred ---cccc--cc---------ccccceeecc-ccchHHHHHHHhhhhhhhhhhcc----------------cCccEEEecH Q lcl|NC_011054. 140 ---LLPA--AV---------AANQDYTIVP-GDANEDDLIGCINRASKAVAAAG----------------YMPDTLLASL 188 (302) Q Consensus 140 ---~~~~--~~---------~~~~~~~~~~-~~~~~~~~~~~i~~~~~~~~~~~----------------~~~~~~v~~~ 188 (302) ..+. +. .........+ -..+ .+.|.++...++... ...-.+++|| T Consensus 156 ~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~s----l~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p 231 (364) T protein:vir:93 156 TGYAGNPLDAPDVDHLLYGGVATSKASLAATDIMA----PLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSE 231 (364) T ss_pred ccccccccCCCCCCcEEeccccCchhhcccccccc----HHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcc Confidence 0000 00 0000000001 1112 233333333322211 1122577999 Q ss_pred HHHHHHHhhhc--------------CCCceeeecccccCcceEeeccc------ccCCCc----c-eEEEEecce--EEE Q lcl|NC_011054. 189 GFRFDVANLRD--------------ANGNPIFRDESFNGFGTYFNANG------AWPVGV----A-EALVVDSSR--VRI 241 (302) Q Consensus 189 ~~~~~l~~l~d--------------~~g~~i~~~~~~~g~p~~~~~~~------~~~~~~----~-~~~~gd~~~--~~~ 241 (302) ..+..|+.-.| ...+|||.+....--++.+.... ...... . -+++|- +. +.+ T Consensus 232 ~q~~~Lr~~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGa-QA~~~a~ 310 (364) T protein:vir:93 232 YQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGR-QAGVIAY 310 (364) T ss_pred hhhhhhhhcCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccccccCccccchhhheecc-eeeEEEe Confidence 99999975332 13367887653221122221110 011111 1 133442 22 223 Q ss_pred EeecCcEEEEeecccccchhhhcCCcEEEEEEEEeccEEecc--ccEEEEe-eecccccCCCCC Q lcl|NC_011054. 242 GVRQDITVKFLDQATVGSINLAERDMIALRLKARFAYVLGNG--ATAVGDN-KTPVGAVVPDGS 302 (302) Q Consensus 242 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~--~a~~~lt-~~~a~~~~p~~~ 302 (302) |-.+++...+.++...- .|...+-+...+|++..+- +-|-.+. .|.| ++=| T Consensus 311 g~~~g~~~~w~Ee~~D~------gn~~~i~~~~i~G~kK~rF~~~DfGvi~idtaa----~~~~ 364 (364) T protein:vir:93 311 GTANGLRFDWEETVKDY------GNEPAIAAGFIAGMKKARFNNKDFGVISIDTAA----KKHS 364 (364) T ss_pred ecCCCCCceeeecccCC------CCchhhhhhhHhhhhhcccCCccceEEEecccc----cccC Confidence 44456666555543110 1222233333333332221 1111111 0111 1111 No 227 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=46.62 E-value=0.73 Score=21.34 Aligned_cols=281 Identities=11% Similarity=0.030 Sum_probs=111.7 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-----eEEEEEeC---C---------cceeeecccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-----THLPVLAT---L---------PGASWVSESA 63 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-----~~~p~~~~---~---------~~a~~v~E~~ 63 (302) .|..+++..-.-+-+.+. .+++++.......+++-+.||+++. ++..+... . +.+.|-+.+. T Consensus 76 ia~s~~t~~v~~~~P~ll-~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~tg~EAf~~~nEadt~fSG~~~ 154 (514) T protein:vir:56 76 IAQGVTTGAVTNIGPTVM-GMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAA 154 (514) T ss_pred cccccccccccccchhHH-HHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCcccccccccccccCcCcccccc Confidence 444444333222222221 1333333444566777777877643 11111111 1 1111111000 Q ss_pred ---------------------------------------------------------------------------cc--- Q lcl|NC_011054. 64 ---------------------------------------------------------------------------TE--- 65 (302) Q Consensus 64 ---------------------------------------------------------------------------~~--- 65 (302) +. T Consensus 155 ~~~~~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta~aEal~~ 234 (514) T protein:vir:56 155 ASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQEN 234 (514) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhhhhhhhhhhhhhccc Confidence 00 Q ss_pred ----ccccccccccceeeEEeeeeeEEEeehhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHhh---cccC---- Q lcl|NC_011054. 66 ----PEGVKPTSEATWADRTLVAEEVAVIIPVHENVVDDA----STSLLEEIAALGGQAIGKKLDQAVI---FGTD---- 130 (302) Q Consensus 66 ----~~~~~~~s~~~f~~i~l~~~ki~~~~~iS~ell~ds----~~~~~~~i~~~l~~ai~~~~d~~~l---~G~g---- 130 (302) .....++...++++++...+.-+-...+|-||.+|- ..|.++.|.+-|+..|...|++.+| +-.- T Consensus 235 lggs~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~~~atv~~ 314 (514) T protein:vir:56 235 FNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGK 314 (514) T ss_pred CCCCcccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeehh Confidence 000122333344444444444455667999999984 3688999999999999999999996 2111 Q ss_pred --CCcccccccccccccccccceeeccccchHHHHHHHhhhhhhhh-----hhcccCccEEEecHHHHHHHHh---h--- Q lcl|NC_011054. 131 --KPSSWVSPALLPAAVAANQDYTIVPGDANEDDLIGCINRASKAV-----AAAGYMPDTLLASLGFRFDVAN---L--- 197 (302) Q Consensus 131 --~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~-----~~~~~~~~~~v~~~~~~~~l~~---l--- 197 (302) ...|....+..... .......+-...+.+..++..+.... .........+++++.....|.. | T Consensus 315 ~~~~~~~~~~G~~d~~----~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~ 390 (514) T protein:vir:56 315 SGWTQGAGAAGVFDFS----DAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGP 390 (514) T ss_pred cccccccccccccccc----cccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhhhhccc Confidence 01111111111100 00111111112233333332222111 1233456678899999999964 1 Q ss_pred -----hcCC-----CceeeecccccCcceEeecccccCCCcceEEEEecce------EEEEee-cCcEEEEeecccccch Q lcl|NC_011054. 198 -----RDAN-----GNPIFRDESFNGFGTYFNANGAWPVGVAEALVVDSSR------VRIGVR-QDITVKFLDQATVGSI 260 (302) Q Consensus 198 -----~d~~-----g~~i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~------~~~~~~-~~~~i~~~~~~~~~~~ 260 (302) .+.+ ...++..-...++.+++..+ .....+++|-... .++..- ....+...|. T Consensus 391 ~~~g~~~~~~~~d~~~~~~aG~l~~~~~vy~D~y----~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp------ 460 (514) T protein:vir:56 391 AAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQY----AVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDS------ 460 (514) T ss_pred cccCccccccccccCcceEEEEecCceEEEecCC----CCcceEEEEEecCcceecceeeccccccccccccCC------ Confidence 1111 12233322234455554443 2233344443210 001000 0000111111 Q ss_pred hhhcCCcEEEEEEEEeccEEeccccEEEEe--eecccccCCCCC Q lcl|NC_011054. 261 NLAERDMIALRLKARFAYVLGNGATAVGDN--KTPVGAVVPDGS 302 (302) Q Consensus 261 ~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt--~~~a~~~~p~~~ 302 (302) .+-|=.+-...|++..+ +| |...+ ....+--.|.+- T Consensus 461 ---~sfqP~~g~~tRY~l~~-NP--y~~~~~~~~~~~~~~~~~a 498 (514) T protein:vir:56 461 ---KNFQPVIGFKTRYGVQV-NP--FADPTASATKVGNGAPVAA 498 (514) T ss_pred ---ccccceeeeeeeeceee-CC--CCCccccccccCCcchhhh Confidence 11222334445666554 33 11000 000111111111 No 228 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=45.20 E-value=0.78 Score=21.18 Aligned_cols=279 Identities=11% Similarity=0.050 Sum_probs=133.2 Q ss_pred CCCcc---CCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccccccc Q lcl|NC_011054. 1 MADIS---RSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPTSEAT 76 (302) Q Consensus 1 Ma~~t---~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~s~~~ 76 (302) +|... .....+-|-+.+...+.+.+++.+-+++.++.+++..-. ..+-....++-++...-+.. + .++. T Consensus 20 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdt~R~------~-r~~~ 92 (341) T protein:vir:27 20 LAKSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGVSGLYTGRKAGGRF------T-KQVG 92 (341) T ss_pred HHHHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeecccccceeeccCCCce------e-cccc Confidence 22221 223445677788899999999999999999999887543 33434344454544432211 1 1233 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhc-c----hHHHHHHHHHHHHHHHHHHHHHHhhcccCC---------C------cccc Q lcl|NC_011054. 77 WADRTLVAEEVAVIIPVHENVVDD-A----STSLLEEIAALGGQAIGKKLDQAVIFGTDK---------P------SSWV 136 (302) Q Consensus 77 f~~i~l~~~ki~~~~~iS~ell~d-s----~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~---------~------~g~~ 136 (302) ++.....-++.---+.|+-+.|+. + .++|...+.+.+.++++.-+-.--++|+.- | .|.+ T Consensus 93 l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~~Td~~anPllqDVNkGWl 172 (341) T protein:vir:27 93 VGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEADTDPSANPLGQDVNEGWI 172 (341) T ss_pred cCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeeccCCChhhcccccccchhHH Confidence 444444555544455566666642 2 378999999999999988888888888641 1 1111 Q ss_pred cccccccc-cccccceeeccccchHHHHHHHhhhhhhhh-hhcccC-cc-EEEecHHHHH-HHHhhhcCCCcee---e-- Q lcl|NC_011054. 137 SPALLPAA-VAANQDYTIVPGDANEDDLIGCINRASKAV-AAAGYM-PD-TLLASLGFRF-DVANLRDANGNPI---F-- 206 (302) Q Consensus 137 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~-~~-~~v~~~~~~~-~l~~l~d~~g~~i---~-- 206 (302) ...-.... ..............++..+..++.++...+ .....+ +. .+++.+.... .-..|-.....|- - T Consensus 173 Q~~Re~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~~Aa~ 252 (341) T protein:vir:27 173 AFVKNRKASQVVDVDVYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVGSGLIGAAQAKLYDKADKPSEQIAAQ 252 (341) T ss_pred HHHHhhcccceeccceeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEchhhhhhhhhhhhccCCCCHHHHHHH Confidence 00000000 000000111122334555555555555432 332222 12 3456655543 3333333222220 0 Q ss_pred -ecccccCcceEeecccccCCCcceEEEEecceEEEEeecCcEE-EEeecccccchhhhcCCcEEEEEEEEeccEEeccc Q lcl|NC_011054. 207 -RDESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQDITV-KFLDQATVGSINLAERDMIALRLKARFAYVLGNGA 284 (302) Q Consensus 207 -~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~ 284 (302) -...+.|+|...+... ....+++--++...+....|-.= .+.+......+..+++ + +.|.+-. T Consensus 253 ~i~k~iGGlpa~~~Pff----P~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~yes---~--------YvVEdyg 317 (341) T protein:vir:27 253 KLDKTIAGRPAYVPPFL----PDNAMVVTIPENLQVLTQHGTAQRKAKHESDRKRSKTHTG---A--------WKVTQWV 317 (341) T ss_pred HHHHhhCCCeEEEcccc----CCCceEEeeccceEEEEecCcEEEEEEeccccccccchhh---h--------heeehhh Confidence 0235778887766543 34446677777655544433221 1222222111111222 2 3333333 Q ss_pred cEEEEeee----cccccCCCCC Q lcl|NC_011054. 285 TAVGDNKT----PVGAVVPDGS 302 (302) Q Consensus 285 a~~~lt~~----~a~~~~p~~~ 302 (302) +|..+..+ ++++. -..| T Consensus 318 ~~~~~~~~~vkl~~~~~-~~~~ 338 (341) T protein:vir:27 318 CWKRSPLTTQKKSTSAL-NHRS 338 (341) T ss_pred hhhhccccccccCcccc-cccc Confidence 33322222 22222 2223 No 229 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=38.52 E-value=1.1 Score=20.44 Aligned_cols=280 Identities=13% Similarity=0.089 Sum_probs=116.5 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhh-----hhcceeecCCCceEEEEEeCCcceeeecccc-------ccccc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVL-----QAFPTVNMGTKTTHLPVLATLPGASWVSESA-------TEPEG 68 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~-----~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~-------~~~~~ 68 (302) -|..+.+.....++..-. +.....+. -..++..+.+.++++-|..++..|.-++++. .++|+ T Consensus 69 ta~a~a~~T~l~ve~~~~------f~~~~l~~~~~~~Evirv~sVng~~lTV~Rg~~~t~aaaia~n~~~~~Ig~~~eEG 142 (418) T protein:vir:10 69 TAEAAADATVLTVENSDG------LTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRISAAIIAANTKLIVIGTAFEEG 142 (418) T ss_pred EEEEecCceEEEEcCcce------eccccEEEEccCCeEEEEEEEeCCEEEEEEecCCeeEEEEecCceEEEeccccccc Confidence 111111111112222111 12222211 1344555667778888777666655555544 33444 Q ss_pred cccccccceeeEEeeeeeEEE-------eehhHHHHHhc----chHH-HHHHHHHHHHHHHHHHHHHHhhccc----CCC Q lcl|NC_011054. 69 VKPTSEATWADRTLVAEEVAV-------IIPVHENVVDD----ASTS-LLEEIAALGGQAIGKKLDQAVIFGT----DKP 132 (302) Q Consensus 69 ~~~~s~~~f~~i~l~~~ki~~-------~~~iS~ell~d----s~~~-~~~~i~~~l~~ai~~~~d~~~l~G~----g~~ 132 (302) +. .++....++..+.- .+.||.-.... ...+ +++.....+-++ ..+|+++|+|. ++. T Consensus 143 sd-----~~ta~~~k~~~vsNvtQIF~~avsvSgTaqAs~~q~Gvsn~~ese~drk~~~a--v~iEkalI~G~~~~~~~~ 215 (418) T protein:vir:10 143 SQ-----RPTARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDFHA--TEQETAIFFGQAFMGTYN 215 (418) T ss_pred cc-----cCCcceecceeccchhhhhhhhhhhhhhhhhccccccCchHHHHHHHHHHHHH--HHHHHHHhcccccCCCcC Confidence 21 11112223333322 23344332221 0112 233333333333 47899999995 222 Q ss_pred ccc--cccccccccccc-ccceeecc--ccchHHHHHHHhhhhhhhhhhcccCc----cEEEecHHHHHHHHhhhcCCCc Q lcl|NC_011054. 133 SSW--VSPALLPAAVAA-NQDYTIVP--GDANEDDLIGCINRASKAVAAAGYMP----DTLLASLGFRFDVANLRDANGN 203 (302) Q Consensus 133 ~g~--~~~~~~~~~~~~-~~~~~~~~--~~~~~~~~~~~i~~~~~~~~~~~~~~----~~~v~~~~~~~~l~~l~d~~g~ 203 (302) .|. ...++....... ......+. +..+.+.+.+.+.++...-...+... -.+.++.....++.++- +. T Consensus 216 ~g~~R~m~GIl~~vr~~~~gnVv~a~~~t~~s~d~l~~a~~~af~~g~~~G~~~q~~~f~~~V~~~~k~~I~k~~---~~ 292 (418) T protein:vir:10 216 GQPLHTTQGIVDAVRQYAPDNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFF---GE 292 (418) T ss_pred CcchhhHHHHHHHHhhhcccceeccCCCCccCHHHHHHHHHHHhhccCCCcccccceeEEEEeChHHHHHhhhhh---hh Confidence 221 111221111110 01111111 23455666665555542211122221 23567888888887763 22 Q ss_pred eeeecc--ccc---------CcceE-eeccc---ccCCCcceEEEEecceEEEEee--cCcEEEEeeccccc----chh- Q lcl|NC_011054. 204 PIFRDE--SFN---------GFGTY-FNANG---AWPVGVAEALVVDSSRVRIGVR--QDITVKFLDQATVG----SIN- 261 (302) Q Consensus 204 ~i~~~~--~~~---------g~p~~-~~~~~---~~~~~~~~~~~gd~~~~~~~~~--~~~~i~~~~~~~~~----~~~- 261 (302) |-... ... +.+.. +...+ ......+.+++.|..++-+..- +++..+.....+.+ ..+ T Consensus 293 -I~~~~~e~~~G~vv~~~~~~~G~I~L~~~p~~~~~~lp~g~mlVvD~~~vkL~~L~~R~~~~E~l~k~G~~~~~~~~~~ 371 (418) T protein:vir:10 293 -VTVTQRETSYGMVFTEWKFFKGRLILKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGATDY 371 (418) T ss_pred -eeecccceeeeEEEEEEEcceEEEEeecccccccccCCCceEEEEccccceEEEeccccccchhcccCCCccccccccc Confidence 21111 111 11111 11111 1234667778888777655443 44555544322200 000 Q ss_pred ------hhcCCcEEEEEEEEeccEEeccccEEEEeee----c-ccccCCCC Q lcl|NC_011054. 262 ------LAERDMIALRLKARFAYVLGNGATAVGDNKT----P-VGAVVPDG 301 (302) Q Consensus 262 ------~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~----~-a~~~~p~~ 301 (302) ..++++ ....+...+.+|.+.+++++- + +..+.|+- T Consensus 372 ~~~~~~D~~kG~----iv~E~tLe~~N~~a~avitgl~~~~~~~~~t~p~~ 418 (418) T protein:vir:10 372 SYGHGVDAQGGS----LTSEWALELLNPQGCAVITGLQKAKERVYLTAPAP 418 (418) T ss_pred ccccccccccce----EEEEeeeeeecccceEEeeccceecccccCCCCCC Confidence 122232 334567888999999999863 2 33333333 No 230 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=38.23 E-value=1.1 Score=20.41 Aligned_cols=280 Identities=10% Similarity=0.028 Sum_probs=114.6 Q ss_pred CCCccCCCc-ceecchHHHHHHHHHHHhhhhhhhhcceeecCCCce-----EEEEEeC--------------Ccceeeec Q lcl|NC_011054. 1 MADISRSEV-ATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTT-----HLPVLAT--------------LPGASWVS 60 (302) Q Consensus 1 Ma~~t~~~~-g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-----~~p~~~~--------------~~~a~~v~ 60 (302) .+..+++.. ...=|.-+ .+++++.......+++-+.||+++.- +..+... .+.+.|-+ T Consensus 80 i~es~~t~~v~~~~P~li--~lvrRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~~~eaf~~~neadt~fSG 157 (522) T protein:vir:69 80 IAAGQTSGAVTQIGPAVM--GMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSG 157 (522) T ss_pred ccccccccccccccchHH--HHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCcccCcccccccccccccccccc Confidence 344444322 22222221 13333444455667777777766531 1111100 01111111 Q ss_pred ccc--------------------------------------------------------------------------cc- Q lcl|NC_011054. 61 ESA--------------------------------------------------------------------------TE- 65 (302) Q Consensus 61 E~~--------------------------------------------------------------------------~~- 65 (302) .+. +. T Consensus 158 ~~~~t~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal 237 (522) T protein:vir:69 158 QGAAKKFPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQ 237 (522) T ss_pred ccccccccccccccccccccccccccccccceeeecccCCcCCCCCcccccccchhccccccccceeeccccchhhhhhc Confidence 000 00 Q ss_pred ------ccccccccccceeeEEeeeeeEEEeehhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccc Q lcl|NC_011054. 66 ------PEGVKPTSEATWADRTLVAEEVAVIIPVHENVVDDA----STSLLEEIAALGGQAIGKKLDQAVIFGTDKPSSW 135 (302) Q Consensus 66 ------~~~~~~~s~~~f~~i~l~~~ki~~~~~iS~ell~ds----~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~g~ 135 (302) .....++...++++++...+.-+-...+|-||.||- ..|.++.|.+-|+..|...|++.+|.=-...+-. T Consensus 238 ~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~ 317 (522) T protein:vir:69 238 EGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQV 317 (522) T ss_pred ccCCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhee Confidence 000123344455555555555556678999999984 3688999999999999999999999421100000 Q ss_pred cccccc------ccccccccceeeccccchHHHHHHHhhhhhh---hh-h-hcccCccEEEecHHHHHHHHhh------- Q lcl|NC_011054. 136 VSPALL------PAAVAANQDYTIVPGDANEDDLIGCINRASK---AV-A-AAGYMPDTLLASLGFRFDVANL------- 197 (302) Q Consensus 136 ~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~---~~-~-~~~~~~~~~v~~~~~~~~l~~l------- 197 (302) ...+.. ..............+-...+.+..++..+.. .+ + ........+++|+.....|... T Consensus 318 ~~~g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~ 397 (522) T protein:vir:69 318 GKSGMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYA 397 (522) T ss_pred eccccccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhcccccccc Confidence 000000 0000000001111111122333333333322 22 1 2223466788999999999752 Q ss_pred ---------hcCCCceeeecccccCcceEeecccccCCCcceEEEEecce------EEEEe-ecCcEEEEeecccccchh Q lcl|NC_011054. 198 ---------RDANGNPIFRDESFNGFGTYFNANGAWPVGVAEALVVDSSR------VRIGV-RQDITVKFLDQATVGSIN 261 (302) Q Consensus 198 ---------~d~~g~~i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~------~~~~~-~~~~~i~~~~~~~~~~~~ 261 (302) .|.++ .++..-...++.+++..+ .....+++|-... .++.. .....+...|. T Consensus 398 ~~~~~~g~~~d~~~-~~~~G~l~~~~~vy~D~y----~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp------- 465 (522) T protein:vir:69 398 AQGLASGFNTDTTK-SVFAGVLGGKYRVYIDQY----AKQDYFTVGYKGANEMDAGIYYAPYVALTPLRGSDP------- 465 (522) T ss_pred cccccccccccCCC-ceEEEEecCceEEEecCC----CCcceEEEEEeCCcccccceeeccccccccccccCC------- Confidence 11111 112211223344444433 2334444443311 01111 01111111111 Q ss_pred hhcCCcEEEEEEEEeccEEecccc-------EEEEeeecccccCCCCC Q lcl|NC_011054. 262 LAERDMIALRLKARFAYVLGNGAT-------AVGDNKTPVGAVVPDGS 302 (302) Q Consensus 262 ~~~~~~~~~r~~~r~d~~v~~~~a-------~~~lt~~~a~~~~p~~~ 302 (302) .+-|=.+-...|++..+ +|=+ .+++. --.|+++ T Consensus 466 --~sfqP~~g~~tRY~l~v-NP~~~~~~~~~~~ri~-----~g~p~~~ 505 (522) T protein:vir:69 466 --KNFQPVMGFKTRYGIGV-NPFAESSLQAPGARIQ-----SGMPSIL 505 (522) T ss_pred --ccccceeeeeeeeceee-cCcccccCCcccceee-----cccchhh Confidence 11222344455776654 3311 11222 1234443 No 231 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=37.19 E-value=1.1 Score=20.29 Aligned_cols=283 Identities=12% Similarity=0.075 Sum_probs=134.7 Q ss_pred CCCccC-------CCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEE-EEEeCCcceeeeccccccccccccc Q lcl|NC_011054. 1 MADISR-------SEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHL-PVLATLPGASWVSESATEPEGVKPT 72 (302) Q Consensus 1 Ma~~t~-------~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~-p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (302) +|.... .+--+-|.+.+...+.+.+++.+-++++++.+++..-...+ ....++..+.-........+ ..+. T Consensus 16 ~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~~sg~~t~r~~t~~~~~~-~~~~ 94 (343) T protein:vir:98 16 AAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRSNRKRHYGAHDRRTPIQQ-RWTR 94 (343) T ss_pred HHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEeecCccccCccccCCCccc-cccC Confidence 332221 22236788888899999999999999999999886432222 22222222222111111000 0000 Q ss_pred cccceeeEEeeeeeEEEeehhHHHHHhc-c-hHH-HHHHHHHHHHHHHHHHHHHHhhcccCCC--c----------cccc Q lcl|NC_011054. 73 SEATWADRTLVAEEVAVIIPVHENVVDD-A-STS-LLEEIAALGGQAIGKKLDQAVIFGTDKP--S----------SWVS 137 (302) Q Consensus 73 s~~~f~~i~l~~~ki~~~~~iS~ell~d-s-~~~-~~~~i~~~l~~ai~~~~d~~~l~G~g~~--~----------g~~~ 137 (302) +.-...-++.---+.|+-+.|+. + .++ |...+++.+.++++.-.-.--++|+.-. + |.+. T Consensus 95 -----~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T~nPllqDVN~GWLQ 169 (343) T protein:vir:98 95 -----QVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDTSDPNLADVNKGWIQ 169 (343) T ss_pred -----CCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeeccCCCCcchhhcchHHHH Confidence 11122333333334555555653 1 355 8888888888888887777777885421 1 1100 Q ss_pred -------ccccccccccccceeeccccchHHHHHHHhhhhhhhhhhcccC-cc-EEEecHHHHHH-HHhhhcCCCc-e-- Q lcl|NC_011054. 138 -------PALLPAAVAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYM-PD-TLLASLGFRFD-VANLRDANGN-P-- 204 (302) Q Consensus 138 -------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-~~-~~v~~~~~~~~-l~~l~d~~g~-~-- 204 (302) .-.......... ........++..+..++.++...+.....+ +. .+++.+..... -..|-...++ | T Consensus 170 ~~Re~ap~rVm~~~~~~~~-~~~~G~ggdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~n~~~~~ptE 248 (343) T protein:vir:98 170 FVRENKATQILTQGATSGE-IRLFGEGADYVNLDELAYDLKQGLDARHRDAGDLVFLVGADLVAKEASLVYKGNGLIATE 248 (343) T ss_pred HHHhcchhhhhccceeccc-eeEecCCCCcccHHHHHHHHHhcCchHHhcCCCEEEEEchhhhhhhhhhhhhhcCCChHH Confidence 000000000000 111122234566666666665544443333 22 34455554332 2233333332 2 Q ss_pred ------eeecccccCcceEeecccccCCCcceEEEEecceEEEEeecC-cEEEEeecccccchhhhcCCcEEEEEEEEec Q lcl|NC_011054. 205 ------IFRDESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQD-ITVKFLDQATVGSINLAERDMIALRLKARFA 277 (302) Q Consensus 205 ------i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~r~~~r~d 277 (302) +.....+.|+|...+... ....+++--++..-+-...+ ..=.+.+.. .+|++.-.-..--| T Consensus 249 k~Aa~~~~~~k~iGGl~a~~~PfF----P~~~llVT~L~NLsIY~Q~gs~RR~~~d~p--------~r~rie~y~s~Ne~ 316 (343) T protein:vir:98 249 KAALNTHDLMKSFGGMPAMIVPNM----PPRAAIVTSLSNLSIYTQEGSMRRGMKDDD--------DKKAVRDSYYRNEA 316 (343) T ss_pred HHHHHHHHHHHhhCCCeeEEcccc----CCCceEEeeccccEEEEecCcEEEEEEecc--------ccccccchhhhcce Confidence 112245778888776553 33446666676654443333 211122222 22222222222346 Q ss_pred cEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 278 YVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 278 ~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) +.|.+...++.+..........+|- T Consensus 317 YvVEd~~~~a~iE~i~v~~~~~~g~ 341 (343) T protein:vir:98 317 YAVEDCGKFMAVDFTKVKLSSGKGT 341 (343) T ss_pred eeeeccccEEEeeeeeeeecCCCCC Confidence 6677777777777666555333333 No 232 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=35.44 E-value=1.2 Score=20.09 Aligned_cols=280 Identities=14% Similarity=0.048 Sum_probs=120.0 Q ss_pred CCCccCCCcc-eecchHHHHHHHHHHHhhhhh-----hhhcceeecCCCceEEEEEeCCcceeeecccc-------cccc Q lcl|NC_011054. 1 MADISRSEVA-TLIQEAYANDLLASAKKGSTV-----LQAFPTVNMGTKTTHLPVLATLPGASWVSESA-------TEPE 67 (302) Q Consensus 1 Ma~~t~~~~g-~liP~~~~~~ii~~~~~~s~l-----~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~-------~~~~ 67 (302) -+... +++. ..+++.- . +++...+ ....++..+.+..+++-|..++..|.-+..+. ..+| T Consensus 69 ta~~~-a~~T~i~V~~~~---~---f~~~~l~~~~~~~EvirVtsVng~~lTV~RG~~~t~aa~iaag~~~~~ig~~~eE 141 (418) T protein:vir:96 69 TAEAL-ADATVLTVENSD---G---LTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRIAAAIIAANTKLIVIGTAFEE 141 (418) T ss_pred EEEEe-cCceEEEecCCc---c---cccccEEEEecCCeEEEEEEEeCCEEEEEEccCCeeeeeeecCceEEEeecCccc Confidence 11111 1111 1222221 1 2222222 12345555667778877776665555555554 3344 Q ss_pred ccccccccceeeEEeeeeeEEEeehhHHHHHhcchH-----------HHHHHHHHHHHHHHHHHHHHHhhccc---CCCc Q lcl|NC_011054. 68 GVKPTSEATWADRTLVAEEVAVIIPVHENVVDDAST-----------SLLEEIAALGGQAIGKKLDQAVIFGT---DKPS 133 (302) Q Consensus 68 ~~~~~s~~~f~~i~l~~~ki~~~~~iS~ell~ds~~-----------~~~~~i~~~l~~ai~~~~d~~~l~G~---g~~~ 133 (302) +. -.++....++..+.-+..|-+|..+-|.. ++.....++|.+. ...+|.+++.|. +... T Consensus 142 Gs-----d~~ta~~~k~~~vsN~tQIf~e~vsVSgTAqA~v~qaGvsn~~~~e~d~l~~~-kv~iE~ali~g~~~~~~~n 215 (418) T protein:vir:96 142 GS-----QRPTARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDFH-ATEQETAIFFGQAFMGTYN 215 (418) T ss_pred cc-----ccCCcceecceeccchhheehhhhhhhhhhhhhhhhcCcchhHHHHHHHHHHH-HHHHHHhhhccccccCCCC Confidence 32 11122233444444555555554443322 2222233445554 457788999886 2222 Q ss_pred ccc-------cccccccccccccceee-ccccchHHHHHHHhhhhhhhhhhcccCc----cEEEecHHHHHHHHhhhcCC Q lcl|NC_011054. 134 SWV-------SPALLPAAVAANQDYTI-VPGDANEDDLIGCINRASKAVAAAGYMP----DTLLASLGFRFDVANLRDAN 201 (302) Q Consensus 134 g~~-------~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~----~~~v~~~~~~~~l~~l~d~~ 201 (302) |.. ..++...... ..... .....+.+.+.+.+.++...-...+... -.++++.+...++.++-. + T Consensus 216 g~p~~~t~R~m~gI~~f~~~--Nvi~ag~~~~~t~d~L~~~~~~a~~~g~n~G~~~~~~~y~~~V~a~~k~~I~k~~~-~ 292 (418) T protein:vir:96 216 GQPLHTTQGIVDAIRQYAPD--NVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFFG-E 292 (418) T ss_pred CcccccccchhHHHHhhccc--cccccCCCCcCCHHHHHHHHHHHHhhcCCCCCcccceEEEEEeChHHHHHHhhhhc-e Confidence 211 1111111110 11111 1123455666665555542111122222 235689999999988742 2 Q ss_pred CceeeecccccCc------------ceEeeccc-ccCCCcceEEEEecceEEEEee--cCcEEEEeeccccc----chh- Q lcl|NC_011054. 202 GNPIFRDESFNGF------------GTYFNANG-AWPVGVAEALVVDSSRVRIGVR--QDITVKFLDQATVG----SIN- 261 (302) Q Consensus 202 g~~i~~~~~~~g~------------p~~~~~~~-~~~~~~~~~~~gd~~~~~~~~~--~~~~i~~~~~~~~~----~~~- 261 (302) -++ -+.+...|. ++.+.... ......+.+++-|.+.+-+..- ++...+.....+.+ ..+ T Consensus 293 I~~-~~~en~~G~vv~~~~Td~G~v~ii~n~~~pad~I~~g~mlVvD~~~vkL~yL~~R~~~~E~l~k~G~~~~~~~~~~ 371 (418) T protein:vir:96 293 VTV-TQRETSYGMVFTEWKFFKGRLIIKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGATDY 371 (418) T ss_pred eEe-ccccceeceEEEEEEeeccEEEEEecCCCCccccCcceEEEEecCceEEEEecCCCccchhcccCCCccccccccc Confidence 222 122333332 22222211 1123445566777666543332 33333333222100 000 Q ss_pred ------hhcCCcEEEEEEEEeccEEeccccEEEEeeec-----ccccCCCC Q lcl|NC_011054. 262 ------LAERDMIALRLKARFAYVLGNGATAVGDNKTP-----VGAVVPDG 301 (302) Q Consensus 262 ------~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~-----a~~~~p~~ 301 (302) ..++++ ....+.+.+.+|++.+++++-. +.++.|+- T Consensus 372 ~~~~~~D~~~G~----l~~Eltle~~N~~a~a~itgl~~~~~~~~~~~~~~ 418 (418) T protein:vir:96 372 SYGHGVDAQGGS----LTSEWALELLNPQGCAVITGLQKAKERVYLTAPAP 418 (418) T ss_pred ccccccccccCE----EEEEEEEEeecccccEEeecccccccccccCCCCC Confidence 112222 3345677889999999999632 33444444 No 233 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=29.88 E-value=1.6 Score=19.43 Aligned_cols=279 Identities=11% Similarity=0.075 Sum_probs=115.5 Q ss_pred CC------C-ccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEE-----EEeCCcceeeecccc----- Q lcl|NC_011054. 1 MA------D-ISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTTHLP-----VLATLPGASWVSESA----- 63 (302) Q Consensus 1 Ma------~-~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p-----~~~~~~~a~~v~E~~----- 63 (302) |+ . .++++... +-+.+. .+.+.........+++-+.||+++.--|- +.+...+-.+..|-. T Consensus 63 ~~~~n~~~~~~~t~~v~~-~~P~Li-~l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~g~EAf~nEadt~fSg 140 (468) T protein:vir:10 63 IAPAGSALGSANTGGLAG-FDPVLI-SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTG 140 (468) T ss_pred cchhhhhhhhcccccccc-cCchhh-hhHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCCCCccceeccccccccc Confidence 22 1 11111111 122211 12333334445667778888876542221 111111111111100 Q ss_pred -------------------------------------------------cc--ccccccccccceeeEEeeeeeEEEeeh Q lcl|NC_011054. 64 -------------------------------------------------TE--PEGVKPTSEATWADRTLVAEEVAVIIP 92 (302) Q Consensus 64 -------------------------------------------------~~--~~~~~~~s~~~f~~i~l~~~ki~~~~~ 92 (302) .. .....++...++++++...+.-+-... T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~~~~~~g~gMsTa~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAe 220 (468) T protein:vir:10 141 GYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAE 220 (468) T ss_pred cccccccccccccccccccCCCCCcccccccccccccccccccchHHHhhcCCCCcccceeeeEEEEEEEeeeccceecc Confidence 00 000122233444445555555555678 Q ss_pred hHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHhhccc------CCCcccccccccccccccccceeeccccchHHH Q lcl|NC_011054. 93 VHENVVDDA----STSLLEEIAALGGQAIGKKLDQAVIFGT------DKPSSWVSPALLPAAVAANQDYTIVPGDANEDD 162 (302) Q Consensus 93 iS~ell~ds----~~~~~~~i~~~l~~ai~~~~d~~~l~G~------g~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 162 (302) +|-||.+|- ..|.++.|.+-|+..|...+++.+|.=- ++-.|....+..... +..++....+. T Consensus 221 YTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va~~~k~~g~~~~Gv~d~~-------~~~~~rw~~e~ 293 (468) T protein:vir:10 221 YTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLD-------VDSNGRWSVEK 293 (468) T ss_pred ccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhhheeccccccccccccc-------ccccchhHHHH Confidence 999999984 3688999999999999999999998521 111111111111110 11111112222 Q ss_pred HHHHhhhhhhh-----hhhcccCccEEEecHHHHHHHHh---hhcC---CCce------------eeecccccCcceEee Q lcl|NC_011054. 163 LIGCINRASKA-----VAAAGYMPDTLLASLGFRFDVAN---LRDA---NGNP------------IFRDESFNGFGTYFN 219 (302) Q Consensus 163 ~~~~i~~~~~~-----~~~~~~~~~~~v~~~~~~~~l~~---l~d~---~g~~------------i~~~~~~~g~p~~~~ 219 (302) ...++..+... .+........+++++.....|.. |... +++. ++.+-...++.+++. T Consensus 294 ~k~L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~sG~l~~~~~~~~~~~~~~~~~D~tg~~~~G~l~~r~~vy~D 373 (468) T protein:vir:10 294 FKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVD 373 (468) T ss_pred HHHHHHHHHHHHHHHHHhhccccccEEEechhHHHHHhhcCcceecccccccccccccccccCcceEEEEecCceEEEEc Confidence 22222222111 12334566778999999999986 3311 1111 111112234455444 Q ss_pred cccccCCCcceEEEEecce------EEEEeecCcEE-EEeecccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeee Q lcl|NC_011054. 220 ANGAWPVGVAEALVVDSSR------VRIGVRQDITV-KFLDQATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKT 292 (302) Q Consensus 220 ~~~~~~~~~~~~~~gd~~~------~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~ 292 (302) .+.........+++|-... .++..--.+.. ...|. .+-|=.+-...|++..+ +|=+ ...+. T Consensus 374 ~Ya~~~s~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp---------~sfqP~~g~~tRY~l~~-NP~~--~~~~~ 441 (468) T protein:vir:10 374 PYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDP---------NTFQPKIGFKTRYGMVS-NPFV--TTNGL 441 (468) T ss_pred cccccCCccceEEEEEecCcceeceeeeccccccccccccCC---------Ccccceeeeeeeeceee-cccc--eeccc Confidence 4333333444555554311 01111001111 11111 12222344455666543 5522 22222 Q ss_pred cccccCCCCC Q lcl|NC_011054. 293 PVGAVVPDGS 302 (302) Q Consensus 293 ~a~~~~p~~~ 302 (302) .-+ .|.|- T Consensus 442 ~~g--~~~~~ 449 (468) T protein:vir:10 442 YNG--TPDGE 449 (468) T ss_pred cCC--Ccccc Confidence 222 25543 No 234 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=27.88 E-value=1.8 Score=19.18 Aligned_cols=285 Identities=11% Similarity=0.048 Sum_probs=109.0 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCce-----EEEEEeCC---------------------- Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKTT-----HLPVLATL---------------------- 53 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-----~~p~~~~~---------------------- 53 (302) .+..+++..-.-.-+.+. .+.+++.......+++-+.||++++- +..+.+.. T Consensus 79 ia~s~~t~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~g~eaf~~~~e~dt~~SG~ 157 (529) T protein:vir:10 79 IAAGQSSGAITNIGPAVI-GMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSGL 157 (529) T ss_pred ccccccccccccccchhh-hhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCccccccccccccccccccc Confidence 344444332222222221 13333333444556666667665421 11110000 Q ss_pred ---------------------------cceeeecc-------------------------------------------cc Q lcl|NC_011054. 54 ---------------------------PGASWVSE-------------------------------------------SA 63 (302) Q Consensus 54 ---------------------------~~a~~v~E-------------------------------------------~~ 63 (302) ....|..| +. T Consensus 158 ~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~gm 237 (529) T protein:vir:10 158 AAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGELAEIAEGM 237 (529) T ss_pred ccccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCcccccccccccccccccccccc Confidence 00001000 00 Q ss_pred cc------------ccccccccccceeeEEeeeeeEEEeehhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011054. 64 TE------------PEGVKPTSEATWADRTLVAEEVAVIIPVHENVVDDA----STSLLEEIAALGGQAIGKKLDQAVIF 127 (302) Q Consensus 64 ~~------------~~~~~~~s~~~f~~i~l~~~ki~~~~~iS~ell~ds----~~~~~~~i~~~l~~ai~~~~d~~~l~ 127 (302) .. .....++...++++++...+.-+-...+|-||.+|- ..|.++.|.+-|+..|...|++.+|. T Consensus 238 sTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEINReii~ 317 (529) T protein:vir:10 238 ATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVID 317 (529) T ss_pred chhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHH Confidence 00 000122333444555555555555678999999984 36889999999999999999999996 Q ss_pred ccCCC-----cccccccc-cccccccccceeeccccchHHHHHHHhhhhh---hhhh--hcccCccEEEecHHHHHHHHh Q lcl|NC_011054. 128 GTDKP-----SSWVSPAL-LPAAVAANQDYTIVPGDANEDDLIGCINRAS---KAVA--AAGYMPDTLLASLGFRFDVAN 196 (302) Q Consensus 128 G~g~~-----~g~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~---~~~~--~~~~~~~~~v~~~~~~~~l~~ 196 (302) =.... .|+..... ...............+-...+.+..++..+- ..+. ..+.....+++++.....|.. T Consensus 318 ~i~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~ 397 (529) T protein:vir:10 318 WINYTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALAL 397 (529) T ss_pred HhhhhceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHhh Confidence 11000 01100000 0000000000111111112233333333322 2222 222345678899999999974 Q ss_pred h--hcC-------CC------ceeeecccccCcceEeecccccCCCcceEEEEecce------EEEEeecCcE-EEEeec Q lcl|NC_011054. 197 L--RDA-------NG------NPIFRDESFNGFGTYFNANGAWPVGVAEALVVDSSR------VRIGVRQDIT-VKFLDQ 254 (302) Q Consensus 197 l--~d~-------~g------~~i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~------~~~~~~~~~~-i~~~~~ 254 (302) . .+. .| ..++..-...++++++..+ .....+++|-... .++..--.+. +...+. T Consensus 398 ~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y----~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp 473 (529) T protein:vir:10 398 VDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQY----ARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDP 473 (529) T ss_pred hccccccccccccccceeecCCceEEEEecCceEEEecCC----CCcceEEEEEeCCcccccceeeccccccccccccCC Confidence 2 221 11 0111111223344444333 2333344443211 0111000011 111111 Q ss_pred ccccchhhhcCCcEEEEEEEEeccEEeccccEEEEeeecccccCCCCC Q lcl|NC_011054. 255 ATVGSINLAERDMIALRLKARFAYVLGNGATAVGDNKTPVGAVVPDGS 302 (302) Q Consensus 255 ~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~lt~~~a~~~~p~~~ 302 (302) .+-|=.+-...|++..+ +|=+- ..+.++ ..--+.|. T Consensus 474 ---------~sfqP~~g~~tRY~l~~-NP~~~-~~~~~~-~~r~~~g~ 509 (529) T protein:vir:10 474 ---------KNFQPVMGFKTRYAIGV-NPFAE-SRTQAP-TSRISNGM 509 (529) T ss_pred ---------Ccccceeeeeeeeceee-cCccc-cccccc-cccccCCc Confidence 11222334455666543 44111 111111 00112222 No 235 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=22.25 E-value=2.5 Score=18.43 Aligned_cols=285 Identities=11% Similarity=0.002 Sum_probs=107.1 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-----eEEEEEeCC---------------------- Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-----THLPVLATL---------------------- 53 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-----~~~p~~~~~---------------------- 53 (302) .+..+++..-.-.-+.+. .+++++.......+++-+.||++++ ++..+.... T Consensus 79 i~est~t~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~y~Pda~~sga 157 (529) T protein:vir:10 79 IAAGQSSGAITNIGPAVI-GMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSL 157 (529) T ss_pred cccccccccccccCchhh-hhHHHHHhhhhhheeeeeecCCchhhhhhhhheeecCCccccccccccccccccccccccc Confidence 233333322111111111 1223333334445556666665431 010000000 Q ss_pred ----------------------------------------------------------------------cceeeecccc Q lcl|NC_011054. 54 ----------------------------------------------------------------------PGASWVSESA 63 (302) Q Consensus 54 ----------------------------------------------------------------------~~a~~v~E~~ 63 (302) ....-++++. T Consensus 158 ~~~ga~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~Gm 237 (529) T protein:vir:10 158 ATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGM 237 (529) T ss_pred cccccccccCccccccccccccccccCcceeeeecccceecccccccccccCccccCccccccccccccccccccccccc Confidence 0000000010 Q ss_pred cc------------ccccccccccceeeEEeeeeeEEEeehhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011054. 64 TE------------PEGVKPTSEATWADRTLVAEEVAVIIPVHENVVDDA----STSLLEEIAALGGQAIGKKLDQAVIF 127 (302) Q Consensus 64 ~~------------~~~~~~~s~~~f~~i~l~~~ki~~~~~iS~ell~ds----~~~~~~~i~~~l~~ai~~~~d~~~l~ 127 (302) .. .....++...++++++...+.-+-...+|-||.+|- ..|.++.|.+-|+..|...|++.+|. T Consensus 238 ~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~ 317 (529) T protein:vir:10 238 ATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVID 317 (529) T ss_pred chhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHH Confidence 00 000122333444555555555555678999999984 36889999999999999999999985 Q ss_pred ccC------CCcccccccccccccccccceeeccccchHHHHHHHhhhhh---hhhh--hcccCccEEEecHHHHHHHHh Q lcl|NC_011054. 128 GTD------KPSSWVSPALLPAAVAANQDYTIVPGDANEDDLIGCINRAS---KAVA--AAGYMPDTLLASLGFRFDVAN 196 (302) Q Consensus 128 G~g------~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~---~~~~--~~~~~~~~~v~~~~~~~~l~~ 196 (302) =-- +..|....+................+-...+.+..++..+- ..+. ..+.....+++++.....|.. T Consensus 318 ~l~~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~ 397 (529) T protein:vir:10 318 WINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALAL 397 (529) T ss_pred hHhhhhhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHh Confidence 211 11111111000000011100111111122233333333322 2222 222345678899999999974 Q ss_pred h--hc-------CCC------ceeeecccccCcceEeecccccCCCcceEEEEecce------EEEEee-cCcEEEEeec Q lcl|NC_011054. 197 L--RD-------ANG------NPIFRDESFNGFGTYFNANGAWPVGVAEALVVDSSR------VRIGVR-QDITVKFLDQ 254 (302) Q Consensus 197 l--~d-------~~g------~~i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~------~~~~~~-~~~~i~~~~~ 254 (302) . .+ ..| ..++..-...++++++..+ .....+++|-... .++..- ....+...+. T Consensus 398 ~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y----~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp 473 (529) T protein:vir:10 398 IDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQY----ARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDP 473 (529) T ss_pred hhhhccccccccccccccccCCceEEEEecCceEEEecCC----CCcceEEEEEeCCcccccceeeccccccccccccCC Confidence 2 11 111 1112222233344444433 2333344443211 011000 0000111111 Q ss_pred ccccchhhhcCCcEEEEEEEEeccEEeccccE--------EEEeeecccccCCCCC Q lcl|NC_011054. 255 ATVGSINLAERDMIALRLKARFAYVLGNGATA--------VGDNKTPVGAVVPDGS 302 (302) Q Consensus 255 ~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~--------~~lt~~~a~~~~p~~~ 302 (302) .+-|=.+-...|++..+ +|=+. ..+.+.++.. =+|. T Consensus 474 ---------~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~--~ag~ 517 (529) T protein:vir:10 474 ---------KNFQPVMGFKTRYAIGV-NPFAESRTQAPQGRITSGMPGVN--SVGK 517 (529) T ss_pred ---------Ccccceeeeeeeeceee-cCccccccccccccccCCcchhh--hcCc Confidence 11222333445666543 33111 1111222211 1111 No 236 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=21.99 E-value=2.5 Score=18.39 Aligned_cols=277 Identities=13% Similarity=0.093 Sum_probs=126.9 Q ss_pred CCCccC-------CCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccccccccccccc Q lcl|NC_011054. 1 MADISR-------SEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-THLPVLATLPGASWVSESATEPEGVKPT 72 (302) Q Consensus 1 Ma~~t~-------~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~E~~~~~~~~~~~ 72 (302) +|.... .+--+-|.+.+...+.+.+++.+-+++.++.+++..-. ..+-....++-++...-+.... T Consensus 13 ~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~r~r~------ 86 (336) T protein:vir:37 13 LAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATEKGVTGRKQTGRNLA------ 86 (336) T ss_pred HHHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccCcccccccCCCCCcc------ Confidence 332221 12346788999999999999999999999999987432 3444444444444433322111 Q ss_pred cccceeeEEeeeeeEEEeehhHHHHHhc-c-hHHH-HHHHHHHHHHHHHHHHHHHhhcccCC------Cc------cccc Q lcl|NC_011054. 73 SEATWADRTLVAEEVAVIIPVHENVVDD-A-STSL-LEEIAALGGQAIGKKLDQAVIFGTDK------PS------SWVS 137 (302) Q Consensus 73 s~~~f~~i~l~~~ki~~~~~iS~ell~d-s-~~~~-~~~i~~~l~~ai~~~~d~~~l~G~g~------~~------g~~~ 137 (302) ....+.-...-++.---+.|+-+.|+. + .+++ ...+...+.+.++.-+-.--++|+.. |. |.+. T Consensus 87 -~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~TdnPllqDVNkGWlQ 165 (336) T protein:vir:37 87 -TLDHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVATNTTKTDLSDVNKGWLK 165 (336) T ss_pred -ccCCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHHhcchhhhcccceeeccCCCCccccccchhHHH Confidence 112333334444444455667777763 1 2342 23334444455555555555677431 11 1110 Q ss_pred -------ccccccc-cccccceeeccccchHHHHHHHhhhhhhhhhhcccCc-c-EEEecHHHHH-HHHhhhcCCC-ce- Q lcl|NC_011054. 138 -------PALLPAA-VAANQDYTIVPGDANEDDLIGCINRASKAVAAAGYMP-D-TLLASLGFRF-DVANLRDANG-NP- 204 (302) Q Consensus 138 -------~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~-~~v~~~~~~~-~l~~l~d~~g-~~- 204 (302) ....... ...+. ........++..+..++.++...+.....+. . .+++.+.... ....|-..++ +| T Consensus 166 ~~Re~a~~~v~~~~~~~~g~-i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~~~~~~~Pt 244 (336) T protein:vir:37 166 LLQEQRAANFMTESTKSSGK-ITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSKETKLIQQKHGLTPT 244 (336) T ss_pred HHHhccchhhcccccccCCc-eEEecCCCCcccHHHHHHHHHhccchHHhcCCCeEEEEchhhhhhhhhhhhhhcCCCHH Confidence 0000000 00010 1111223345666666666655454433322 2 3445554422 2222333332 22 Q ss_pred -------eeecccccCcceEeecccccCCCcceEEEEecceEEEEeecCcEE-EEeecccccchhhhcCCcEEEEEEEEe Q lcl|NC_011054. 205 -------IFRDESFNGFGTYFNANGAWPVGVAEALVVDSSRVRIGVRQDITV-KFLDQATVGSINLAERDMIALRLKARF 276 (302) Q Consensus 205 -------i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~r~~~r~ 276 (302) +.....+.|+|...+... ....+++--++..-+....+-.= .+.+... +|++.-.-..-- T Consensus 245 E~~Aa~~~~~~k~iGGlpa~~~Pff----P~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~--------r~rie~y~s~Ne 312 (336) T protein:vir:37 245 EKAALGSHNLMGSFGGMNAITPPNF----PARAAAVTTLKNLSVYTEAESVRRSLRNDED--------KKGLVTSYYRQE 312 (336) T ss_pred HHHHHHHHHHHHhhCCceEEEcccc----CCCceEEeeccccEEEEecCcEEEEEEEccc--------cccccchhhhcc Confidence 112345778888776553 34446677777655444333211 1222221 222221112223 Q ss_pred ccEEeccccEEEEeeecc---ccc Q lcl|NC_011054. 277 AYVLGNGATAVGDNKTPV---GAV 297 (302) Q Consensus 277 d~~v~~~~a~~~lt~~~a---~~~ 297 (302) |+.|.+...++.+..... +.+ T Consensus 313 ~YvVEd~~~~a~iE~i~v~~~~e~ 336 (336) T protein:vir:37 313 GYVVEDLGLMTAIDHTKVKLNGEV 336 (336) T ss_pred eeeeeccccEEEeeeeeeeccccC Confidence 455555555555443321 112 No 237 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=20.73 E-value=2.7 Score=18.20 Aligned_cols=287 Identities=11% Similarity=0.029 Sum_probs=111.8 Q ss_pred CCCccCCCcceecchHHHHHHHHHHHhhhhhhhhcceeecCCCc-----eEEEEEeC--------------Ccceeeecc Q lcl|NC_011054. 1 MADISRSEVATLIQEAYANDLLASAKKGSTVLQAFPTVNMGTKT-----THLPVLAT--------------LPGASWVSE 61 (302) Q Consensus 1 Ma~~t~~~~g~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-----~~~p~~~~--------------~~~a~~v~E 61 (302) .+..+++..-.-.-+.+. .+.+++.......+++-+.||+++. ++..+... .+.+.|-+. T Consensus 79 i~es~~t~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~ 157 (521) T protein:vir:10 79 IAAGQTSGAVTQIGPAVM-GMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQ 157 (521) T ss_pred ccccccccccccCCchhh-hHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccccccccchhcccccccccc Confidence 444444433222222221 1333334445566777888887653 11111110 011111111 Q ss_pred cccc---------------------------------------------------------------------------- Q lcl|NC_011054. 62 SATE---------------------------------------------------------------------------- 65 (302) Q Consensus 62 ~~~~---------------------------------------------------------------------------- 65 (302) +... T Consensus 158 ~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~ 237 (521) T protein:vir:10 158 GAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQE 237 (521) T ss_pred ccccccccccccccccccccccccccccccceecccccccCCCcccccccccccccccccccceeecccccchhhHhhhc Confidence 0000 Q ss_pred -----ccccccccccceeeEEeeeeeEEEeehhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHhhcccCCC---- Q lcl|NC_011054. 66 -----PEGVKPTSEATWADRTLVAEEVAVIIPVHENVVDDA----STSLLEEIAALGGQAIGKKLDQAVIFGTDKP---- 132 (302) Q Consensus 66 -----~~~~~~~s~~~f~~i~l~~~ki~~~~~iS~ell~ds----~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~---- 132 (302) .....++...++++++...+.-+-...+|-||.+|- ..|.++.|.+-|+..|...|++.+|.=-... T Consensus 238 ~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~ 317 (521) T protein:vir:10 238 SFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVG 317 (521) T ss_pred cCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeee Confidence 000122233344444444444455667999999984 3688999999999999999999999421100 Q ss_pred -ccccccc-ccccccccccceeeccccchHHHHHHHhhhhhhh---h-h-hcccCccEEEecHHHHHHHHhhh------- Q lcl|NC_011054. 133 -SSWVSPA-LLPAAVAANQDYTIVPGDANEDDLIGCINRASKA---V-A-AAGYMPDTLLASLGFRFDVANLR------- 198 (302) Q Consensus 133 -~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~---~-~-~~~~~~~~~v~~~~~~~~l~~l~------- 198 (302) .|+.... ................+-...+.+..++..+... + + ........+++|+.....|...- T Consensus 318 ~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~ 397 (521) T protein:vir:10 318 KSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAA 397 (521) T ss_pred eeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhccccccccc Confidence 0100000 0000000000011111112222323333333211 1 1 22355567889999999998531 Q ss_pred --cCCC------ceeeecccccCcceEeecccccCCCcceEEEEecce------EEEEe-ecCcEEEEeecccccchhhh Q lcl|NC_011054. 199 --DANG------NPIFRDESFNGFGTYFNANGAWPVGVAEALVVDSSR------VRIGV-RQDITVKFLDQATVGSINLA 263 (302) Q Consensus 199 --d~~g------~~i~~~~~~~g~p~~~~~~~~~~~~~~~~~~gd~~~------~~~~~-~~~~~i~~~~~~~~~~~~~~ 263 (302) ++.| ..++..-...++.+++..+ .....+++|-... .++.. .....+...|. T Consensus 398 ~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y----~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp--------- 464 (521) T protein:vir:10 398 QGLATGFNTDTTKSVFAGVLGGKYRVYIDQY----AKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDP--------- 464 (521) T ss_pred ccccccccccCCCceEEEEecCceEEEecCC----CCcceEEEEEeCCcccccceeeccccccccccccCC--------- Confidence 1111 1111111223344444333 2333344443211 01100 00111111111 Q ss_pred cCCcEEEEEEEEeccEEeccccE-------EEEeeecccccCCCCC Q lcl|NC_011054. 264 ERDMIALRLKARFAYVLGNGATA-------VGDNKTPVGAVVPDGS 302 (302) Q Consensus 264 ~~~~~~~r~~~r~d~~v~~~~a~-------~~lt~~~a~~~~p~~~ 302 (302) .+-|=.+-...|++..+ +|=+- ..|+...+..-.-.+. T Consensus 465 ~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~~i~~~~~~~~a~~~~ 509 (521) T protein:vir:10 465 KNFQPVMGFKTRYGIGI-NPFAESAAQAPASRIQSGMPSILNSLGK 509 (521) T ss_pred ccccceeeeeeeeceee-cCcccccCCccceeecccchhhhccccc Confidence 11222334445666543 44111 1111111111000011 Done!