Query lcl|NC_021307.1_cdsid_YP_008051783.1 [gene=14] [protein=major capsid protein] [protein_id=YP_008051783.1] [location=9361..10293] Match_columns 310 No_of_seqs 117 out of 1131 Neff 9.8 Searched_HMMs 1612 Date Thu Nov 7 18:04:09 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_11 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_11_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:104085 Length: 320 100.0 1.7E-68 1.1E-71 392.1 32.4 310 1-310 1-316 (320) 2 protein:vir:2430 Length: 318 # 100.0 9.1E-68 5.7E-71 388.1 32.4 310 1-310 1-312 (318) 3 protein:vir:4226 Length: 326 # 100.0 3.4E-66 2.1E-69 379.5 31.6 310 1-310 1-322 (326) 4 protein:vir:2344 Length: 397 # 100.0 3.1E-66 1.9E-69 379.7 30.9 305 5-310 1-305 (397) 5 protein:vir:7771 Length: 330 # 100.0 1.2E-62 7.7E-66 360.0 31.3 305 2-310 1-322 (330) 6 protein:vir:41 Length: 299 # N 100.0 4.8E-63 3E-66 362.2 28.6 295 7-310 1-297 (299) 7 protein:vir:9309 Length: 324 # 100.0 2.6E-61 1.6E-64 352.7 29.3 300 1-310 13-314 (324) 8 protein:vir:97148 Length: 324 100.0 3.6E-61 2.2E-64 351.9 29.7 300 1-310 9-314 (324) 9 protein:vir:99749 Length: 324 100.0 6.9E-61 4.3E-64 350.4 29.9 296 1-310 18-314 (324) 10 protein:vir:103955 Length: 324 100.0 5.9E-61 3.7E-64 350.8 29.5 296 1-310 18-314 (324) 11 protein:vir:96392 Length: 324 100.0 7.3E-61 4.5E-64 350.3 29.3 300 1-310 9-314 (324) 12 protein:vir:78830 Length: 324 100.0 7.3E-61 4.5E-64 350.3 29.3 300 1-310 9-314 (324) 13 protein:vir:96223 Length: 324 100.0 9.4E-61 5.8E-64 349.7 29.5 296 1-310 18-314 (324) 14 protein:vir:9574 Length: 300 # 100.0 2.2E-60 1.4E-63 347.7 29.8 285 15-310 1-299 (300) 15 protein:vir:95763 Length: 297 100.0 3.2E-60 2E-63 346.8 29.4 293 2-310 1-295 (297) 16 protein:vir:5739 Length: 366 # 100.0 5.2E-60 3.2E-63 345.6 28.4 298 1-310 39-365 (366) 17 protein:vir:94142 Length: 304 100.0 2.2E-59 1.4E-62 342.1 29.8 294 1-310 1-304 (304) 18 protein:vir:105905 Length: 304 100.0 2.2E-59 1.4E-62 342.1 29.8 294 1-310 1-304 (304) 19 protein:vir:8187 Length: 311 # 100.0 5.6E-59 3.5E-62 339.9 29.5 283 16-310 1-309 (311) 20 protein:vir:80684 Length: 315 100.0 6.1E-59 3.8E-62 339.7 28.9 288 15-310 1-305 (315) 21 protein:vir:105038 Length: 428 100.0 9.1E-59 5.6E-62 338.8 27.5 298 1-310 113-427 (428) 22 protein:vir:78523 Length: 338 100.0 3.4E-58 2.1E-61 335.6 30.3 305 1-310 1-334 (338) 23 protein:vir:9759 Length: 303 # 100.0 4.1E-58 2.5E-61 335.2 29.9 285 15-310 1-302 (303) 24 protein:vir:4456 Length: 401 # 100.0 5E-58 3.1E-61 334.7 27.8 288 1-310 91-400 (401) 25 protein:vir:1638 Length: 298 # 100.0 1.3E-57 7.9E-61 332.5 30.0 282 18-310 1-298 (298) 26 protein:vir:100247 Length: 425 100.0 5E-58 3.1E-61 334.7 27.2 288 1-310 108-423 (425) 27 protein:vir:485 Length: 407 # 100.0 1.1E-57 6.6E-61 332.9 28.7 288 1-310 90-399 (407) 28 protein:vir:1433 Length: 435 # 100.0 1.7E-57 1E-60 331.9 28.2 298 1-310 118-432 (435) 29 protein:vir:93616 Length: 645 100.0 3.9E-57 2.4E-60 329.8 29.4 299 1-310 302-638 (645) 30 protein:vir:7855 Length: 497 # 100.0 2.4E-57 1.5E-60 331.0 28.1 295 1-310 138-492 (497) 31 protein:vir:101650 Length: 497 100.0 2.4E-57 1.5E-60 331.0 28.1 295 1-310 138-492 (497) 32 protein:vir:80376 Length: 435 100.0 3.2E-57 2E-60 330.3 28.5 298 1-310 105-432 (435) 33 protein:vir:94771 Length: 298 100.0 6.5E-57 4E-60 328.6 29.6 282 18-310 1-298 (298) 34 protein:vir:1328 Length: 392 # 100.0 6.7E-57 4.2E-60 328.5 28.9 288 1-310 93-390 (392) 35 protein:vir:78223 Length: 333 100.0 7.5E-57 4.7E-60 328.3 29.1 301 1-310 1-331 (333) 36 protein:vir:1886 Length: 385 # 100.0 8.7E-57 5.4E-60 327.9 28.9 289 1-310 91-383 (385) 37 protein:vir:191 Length: 385 # 100.0 8.7E-57 5.4E-60 327.9 28.9 289 1-310 91-383 (385) 38 protein:vir:2504 Length: 305 # 100.0 6.8E-57 4.2E-60 328.5 27.6 281 14-310 1-297 (305) 39 protein:vir:99920 Length: 311 100.0 1.9E-56 1.2E-59 326.1 29.0 283 15-310 1-310 (311) 40 protein:vir:6242 Length: 390 # 100.0 1.7E-56 1.1E-59 326.3 27.6 286 1-310 93-388 (390) 41 protein:vir:10364 Length: 390 100.0 8.5E-56 5.2E-59 322.5 29.5 287 1-309 95-390 (390) 42 protein:vir:4339 Length: 395 # 100.0 7.2E-56 4.4E-59 322.9 28.8 288 1-310 101-394 (395) 43 protein:vir:100135 Length: 418 100.0 8.4E-56 5.2E-59 322.5 28.9 288 1-310 116-414 (418) 44 protein:vir:8102 Length: 543 # 100.0 5.9E-56 3.7E-59 323.4 27.7 292 1-310 237-541 (543) 45 protein:vir:97053 Length: 390 100.0 1.5E-55 9.5E-59 321.1 29.1 287 1-309 95-390 (390) 46 protein:vir:81070 Length: 390 100.0 1.7E-55 1E-58 320.9 29.3 287 1-309 95-390 (390) 47 protein:vir:104256 Length: 458 100.0 1.2E-54 7.5E-58 316.2 28.7 292 1-310 143-457 (458) 48 protein:vir:81227 Length: 413 100.0 1.3E-54 8.2E-58 316.0 28.9 295 1-310 105-409 (413) 49 protein:vir:4511 Length: 409 # 100.0 8.6E-55 5.3E-58 317.0 27.4 290 1-310 96-405 (409) 50 protein:vir:102119 Length: 404 100.0 3.3E-54 2.1E-57 313.8 29.1 293 1-310 92-399 (404) 51 protein:vir:81160 Length: 371 100.0 3.3E-54 2E-57 313.8 28.0 279 1-310 76-370 (371) 52 protein:vir:6212 Length: 434 # 100.0 2.9E-54 1.8E-57 314.1 26.9 290 1-310 130-430 (434) 53 protein:vir:1268 Length: 397 # 100.0 5.4E-54 3.3E-57 312.6 27.3 279 1-310 101-396 (397) 54 protein:vir:95376 Length: 425 100.0 4.8E-54 3E-57 312.9 27.0 286 1-310 119-420 (425) 55 protein:vir:3845 Length: 395 # 100.0 6.3E-54 3.9E-57 312.2 27.5 280 1-310 93-382 (395) 56 protein:vir:1025 Length: 408 # 100.0 1.4E-53 8.7E-57 310.4 29.0 280 1-310 101-392 (408) 57 protein:vir:96762 Length: 632 100.0 4.8E-54 3E-57 312.9 25.9 281 1-310 341-632 (632) 58 protein:vir:101607 Length: 379 100.0 1.2E-53 7.3E-57 310.8 27.2 283 1-310 92-378 (379) 59 protein:vir:4953 Length: 397 # 100.0 5.6E-53 3.5E-56 307.0 28.0 277 1-310 97-384 (397) 60 protein:vir:4997 Length: 397 # 100.0 8.8E-53 5.5E-56 306.0 28.9 276 1-310 98-384 (397) 61 protein:vir:4830 Length: 397 # 100.0 6.1E-53 3.8E-56 306.8 27.9 278 1-310 97-384 (397) 62 protein:vir:7409 Length: 408 # 100.0 9.3E-53 5.7E-56 305.8 28.7 280 1-310 101-392 (408) 63 protein:vir:4856 Length: 293 # 100.0 7E-53 4.3E-56 306.5 27.6 269 11-310 1-280 (293) 64 protein:vir:3991 Length: 404 # 100.0 1.9E-52 1.2E-55 304.1 29.0 280 1-310 101-392 (404) 65 protein:vir:102082 Length: 392 100.0 1.1E-52 6.9E-56 305.4 27.2 279 1-310 84-383 (392) 66 protein:vir:107593 Length: 392 100.0 1.1E-52 6.9E-56 305.4 27.2 279 1-310 84-383 (392) 67 protein:vir:102873 Length: 392 100.0 1.1E-52 6.9E-56 305.4 27.2 279 1-310 84-383 (392) 68 protein:vir:105004 Length: 392 100.0 1.1E-52 6.9E-56 305.4 27.2 279 1-310 84-383 (392) 69 protein:vir:4700 Length: 415 # 100.0 2.1E-52 1.3E-55 303.9 28.0 287 1-310 101-403 (415) 70 protein:vir:4600 Length: 415 # 100.0 2.1E-52 1.3E-55 303.9 28.0 287 1-310 101-403 (415) 71 protein:vir:4092 Length: 390 # 100.0 2.8E-52 1.7E-55 303.2 26.9 281 1-310 65-367 (390) 72 protein:vir:81100 Length: 415 100.0 8.6E-52 5.3E-55 300.6 27.8 287 1-310 96-403 (415) 73 protein:vir:79987 Length: 415 100.0 8.6E-52 5.3E-55 300.6 27.8 287 1-310 96-403 (415) 74 protein:vir:98339 Length: 415 100.0 8.6E-52 5.3E-55 300.6 27.8 287 1-310 96-403 (415) 75 protein:vir:9410 Length: 415 # 100.0 1.5E-51 9.5E-55 299.2 27.7 287 1-310 101-403 (415) 76 protein:vir:98635 Length: 377 100.0 2.6E-52 1.6E-55 303.4 21.2 293 1-310 60-376 (377) 77 protein:vir:94673 Length: 419 100.0 5.7E-51 3.6E-54 296.0 26.4 290 1-310 105-416 (419) 78 protein:vir:1383 Length: 421 # 100.0 1.4E-50 9E-54 293.8 25.7 274 1-310 103-382 (421) 79 protein:vir:9704 Length: 394 # 100.0 2E-50 1.2E-53 293.1 25.9 272 1-310 115-389 (394) 80 protein:vir:3870 Length: 400 # 100.0 5.6E-50 3.4E-53 290.6 26.5 272 1-310 119-398 (400) 81 protein:vir:101291 Length: 381 100.0 3.6E-50 2.3E-53 291.6 24.6 279 1-310 58-367 (381) 82 protein:vir:9509 Length: 381 # 100.0 3.6E-50 2.3E-53 291.6 24.6 279 1-310 58-367 (381) 83 protein:vir:100172 Length: 394 100.0 1.8E-49 1.1E-52 287.9 27.2 276 1-310 100-383 (394) 84 protein:vir:100632 Length: 381 100.0 3.1E-50 1.9E-53 292.0 22.8 279 1-310 58-367 (381) 85 protein:vir:8420 Length: 477 # 100.0 1.2E-49 7.6E-53 288.8 24.3 295 1-310 140-470 (477) 86 protein:vir:100884 Length: 389 100.0 5.7E-49 3.5E-52 285.1 26.9 274 1-310 99-381 (389) 87 protein:vir:1084 Length: 437 # 100.0 4.9E-49 3E-52 285.4 24.9 276 1-310 141-426 (437) 88 protein:vir:80128 Length: 466 100.0 2.8E-49 1.7E-52 286.8 23.3 284 1-310 123-447 (466) 89 protein:vir:78640 Length: 352 100.0 8.9E-49 5.5E-52 284.0 23.7 274 1-310 69-345 (352) 90 protein:vir:95963 Length: 395 100.0 2.6E-48 1.6E-51 281.4 25.6 280 1-310 68-375 (395) 91 protein:vir:9643 Length: 377 # 100.0 2.6E-48 1.6E-51 281.4 24.7 279 1-310 60-376 (377) 92 protein:vir:2685 Length: 387 # 100.0 4.7E-48 2.9E-51 280.1 22.1 274 1-310 104-380 (387) 93 protein:vir:96978 Length: 387 100.0 4.7E-48 2.9E-51 280.1 22.1 274 1-310 104-380 (387) 94 protein:vir:94424 Length: 387 100.0 4.7E-48 2.9E-51 280.1 22.1 274 1-310 104-380 (387) 95 protein:vir:9361 Length: 402 # 100.0 6.4E-48 4E-51 279.3 21.6 274 1-310 114-395 (402) 96 protein:vir:93881 Length: 387 100.0 2.4E-47 1.5E-50 276.2 23.8 273 1-310 104-380 (387) 97 protein:vir:78350 Length: 383 100.0 4.1E-47 2.5E-50 274.9 22.0 291 1-310 65-374 (383) 98 protein:vir:962 Length: 397 # 100.0 1.3E-46 7.8E-50 272.2 23.1 271 1-310 113-396 (397) 99 protein:vir:4197 Length: 314 # 100.0 3.8E-40 2.3E-43 236.7 25.5 287 1-310 1-311 (314) 100 protein:vir:4159 Length: 315 # 100.0 3.2E-40 2E-43 237.1 23.8 284 1-308 7-315 (315) 101 protein:vir:3158 Length: 321 # 100.0 3.4E-36 2.1E-39 215.1 24.9 291 1-310 1-310 (321) 102 protein:vir:97397 Length: 517 100.0 6.1E-35 3.8E-38 208.1 19.1 280 1-310 200-513 (517) 103 protein:vir:9820 Length: 272 # 100.0 3E-33 1.8E-36 198.9 24.4 263 15-310 1-268 (272) 104 protein:vir:3033 Length: 272 # 100.0 3E-33 1.8E-36 198.9 24.4 263 15-310 1-268 (272) 105 protein:vir:4074 Length: 480 # 100.0 3.1E-33 1.9E-36 198.8 15.0 272 1-310 198-476 (480) 106 protein:vir:93742 Length: 274 99.9 2.8E-26 1.7E-29 160.7 22.5 264 15-310 1-269 (274) 107 protein:vir:3613 Length: 272 # 99.9 1.4E-24 8.6E-28 151.4 21.1 264 15-310 1-271 (272) 108 protein:vir:105334 Length: 276 99.9 1.7E-24 1E-27 150.9 21.4 264 15-310 1-269 (276) 109 protein:vir:96123 Length: 274 99.9 3.6E-24 2.2E-27 149.1 22.3 264 15-310 1-269 (274) 110 protein:vir:80930 Length: 278 99.9 9.4E-24 5.8E-27 146.8 21.5 270 15-310 1-276 (278) 111 protein:vir:94494 Length: 274 99.9 1.6E-23 1E-26 145.5 22.6 264 15-310 1-269 (274) 112 protein:vir:97433 Length: 274 99.9 1.6E-23 1E-26 145.5 22.6 264 15-310 1-269 (274) 113 protein:vir:96833 Length: 275 99.9 1.5E-23 9.1E-27 145.8 21.0 265 14-310 1-270 (275) 114 protein:vir:95898 Length: 274 99.8 3.8E-22 2.4E-25 138.0 22.3 264 15-310 1-269 (274) 115 protein:vir:96262 Length: 274 99.8 3.8E-22 2.4E-25 138.0 22.3 264 15-310 1-269 (274) 116 protein:vir:1239 Length: 274 # 99.8 3.8E-22 2.4E-25 138.0 21.4 264 15-310 1-269 (274) 117 protein:vir:79928 Length: 393 99.8 1.8E-22 1.1E-25 139.8 18.3 297 1-310 58-377 (393) 118 protein:vir:94933 Length: 330 99.8 2.7E-21 1.7E-24 133.3 22.4 291 1-310 1-328 (330) 119 protein:vir:95107 Length: 270 99.8 2.1E-20 1.3E-23 128.5 20.9 259 16-310 1-264 (270) 120 protein:vir:739 Length: 231 # 99.7 2.5E-19 1.5E-22 122.6 17.6 228 48-310 1-230 (231) 121 protein:vir:97255 Length: 310 99.7 1.8E-17 1.1E-20 112.4 22.2 283 1-310 1-309 (310) 122 protein:vir:8324 Length: 410 # 99.7 1.7E-18 1.1E-21 117.9 14.5 279 1-309 113-410 (410) 123 protein:vir:99424 Length: 360 99.6 7E-17 4.3E-20 109.2 21.4 289 1-310 10-356 (360) 124 protein:vir:7990 Length: 273 # 99.5 2.5E-15 1.6E-18 100.6 19.7 264 15-310 1-272 (273) 125 protein:vir:102605 Length: 273 99.5 1E-14 6.5E-18 97.2 20.0 264 15-310 1-272 (273) 126 protein:vir:105822 Length: 273 99.5 1E-14 6.5E-18 97.2 20.0 264 15-310 1-272 (273) 127 protein:vir:108211 Length: 318 99.5 5.3E-15 3.3E-18 98.8 16.8 286 1-310 1-316 (318) 128 protein:vir:94622 Length: 341 99.4 8.9E-14 5.5E-17 92.1 18.3 300 1-310 1-338 (341) 129 protein:vir:93858 Length: 400 99.3 1.4E-13 8.5E-17 91.1 14.6 282 1-309 101-400 (400) 130 protein:vir:2201 Length: 345 # 99.3 1.1E-12 6.6E-16 86.2 17.5 287 1-310 1-344 (345) 131 protein:vir:5974 Length: 324 # 99.2 5.4E-12 3.4E-15 82.3 19.1 268 15-310 1-302 (324) 132 protein:vir:6324 Length: 335 # 99.2 6.5E-12 4E-15 81.9 19.5 286 1-310 1-327 (335) 133 protein:vir:80213 Length: 334 99.2 3.7E-12 2.3E-15 83.3 17.8 300 1-310 1-331 (334) 134 protein:vir:95318 Length: 328 99.2 5.8E-12 3.6E-15 82.2 18.5 230 1-241 1-328 (328) 135 protein:vir:78935 Length: 335 99.2 7.3E-12 4.5E-15 81.6 18.7 288 1-310 1-327 (335) 136 protein:vir:94576 Length: 347 99.2 4.2E-12 2.6E-15 83.0 17.3 285 1-310 1-346 (347) 137 protein:vir:80180 Length: 381 99.2 7.3E-12 4.6E-15 81.6 17.7 286 1-310 1-304 (381) 138 protein:vir:100057 Length: 375 99.1 3.2E-11 2E-14 78.1 20.3 291 1-310 1-369 (375) 139 protein:vir:10450 Length: 344 99.1 6.1E-12 3.8E-15 82.1 16.1 288 1-310 1-343 (344) 140 protein:vir:78739 Length: 332 99.1 1E-11 6.3E-15 80.9 17.1 301 1-309 1-332 (332) 141 protein:vir:103323 Length: 364 99.1 1E-10 6.2E-14 75.4 22.0 285 1-310 1-338 (364) 142 protein:vir:102944 Length: 330 99.1 6E-11 3.7E-14 76.6 19.5 267 15-310 1-298 (330) 143 protein:vir:8885 Length: 347 # 99.1 2.7E-11 1.7E-14 78.5 16.5 287 1-310 1-347 (347) 144 protein:vir:94711 Length: 347 99.0 8.5E-12 5.2E-15 81.3 12.5 283 1-310 1-345 (347) 145 protein:vir:9927 Length: 295 # 99.0 2.9E-11 1.8E-14 78.3 15.4 271 1-310 1-287 (295) 146 protein:vir:3364 Length: 347 # 99.0 8.1E-11 5E-14 75.9 17.8 285 1-310 1-344 (347) 147 protein:vir:1541 Length: 347 # 99.0 1.5E-10 9.3E-14 74.4 18.6 288 1-310 1-344 (347) 148 protein:vir:1583 Length: 351 # 99.0 1.3E-10 8.1E-14 74.8 17.3 267 15-310 1-328 (351) 149 protein:vir:3136 Length: 322 # 98.9 9.5E-11 5.9E-14 75.5 15.0 287 15-310 1-317 (322) 150 protein:vir:107388 Length: 331 98.9 3.3E-10 2E-13 72.6 17.6 230 1-241 1-331 (331) 151 protein:vir:98525 Length: 331 98.9 3.3E-10 2E-13 72.6 17.6 230 1-241 1-331 (331) 152 protein:vir:107826 Length: 331 98.9 3.3E-10 2E-13 72.6 17.6 230 1-241 1-331 (331) 153 protein:vir:99675 Length: 324 98.8 4.6E-10 2.9E-13 71.7 15.8 241 47-310 1-295 (324) 154 protein:vir:103759 Length: 330 98.8 5.6E-10 3.4E-13 71.3 16.2 229 1-241 1-330 (330) 155 protein:vir:9875 Length: 296 # 98.7 1.8E-09 1.1E-12 68.5 16.1 270 1-310 1-294 (296) 156 protein:vir:7324 Length: 335 # 98.7 4.1E-09 2.5E-12 66.6 17.5 231 1-242 1-335 (335) 157 protein:vir:106647 Length: 303 98.7 1.1E-09 6.8E-13 69.7 14.2 278 1-310 1-295 (303) 158 protein:vir:97031 Length: 402 98.7 2.9E-09 1.8E-12 67.4 16.3 295 1-310 1-332 (402) 159 protein:vir:105645 Length: 400 98.7 2.5E-09 1.5E-12 67.8 14.8 292 1-310 1-332 (400) 160 protein:vir:7019 Length: 401 # 98.6 4.5E-09 2.8E-12 66.3 13.3 292 1-310 1-332 (401) 161 protein:vir:8843 Length: 317 # 98.5 2.1E-08 1.3E-11 62.6 16.2 281 1-310 1-314 (317) 162 protein:vir:80068 Length: 301 98.5 1.1E-07 7E-11 58.7 19.5 269 17-309 1-301 (301) 163 protein:vir:102655 Length: 322 98.5 4.6E-08 2.9E-11 60.8 16.4 284 1-310 1-320 (322) 164 protein:vir:99075 Length: 392 98.4 2.6E-07 1.6E-10 56.7 19.4 273 21-310 1-303 (392) 165 protein:vir:107687 Length: 319 98.4 2.3E-07 1.4E-10 57.0 18.4 285 1-309 1-319 (319) 166 protein:vir:103285 Length: 296 98.3 3.7E-07 2.3E-10 55.8 18.1 271 15-310 1-294 (296) 167 protein:vir:79548 Length: 652 98.2 1.4E-07 8.4E-11 58.2 14.2 290 1-308 342-652 (652) 168 protein:vir:104342 Length: 314 98.1 8.8E-07 5.4E-10 53.8 16.8 285 1-310 1-312 (314) 169 protein:vir:108303 Length: 418 98.1 2.8E-06 1.8E-09 51.0 19.3 272 18-310 1-315 (418) 170 protein:vir:95512 Length: 693 97.9 2.1E-06 1.3E-09 51.7 14.8 288 1-309 371-693 (693) 171 protein:vir:79642 Length: 329 97.8 1.9E-05 1.2E-08 46.5 18.6 284 2-310 1-327 (329) 172 protein:vir:105374 Length: 423 97.6 3.3E-05 2E-08 45.2 19.6 276 21-310 1-335 (423) 173 protein:vir:95875 Length: 401 97.6 2.3E-05 1.4E-08 46.0 16.6 299 1-310 1-399 (401) 174 protein:vir:95131 Length: 325 97.5 5.8E-05 3.6E-08 43.8 18.1 275 16-310 1-295 (325) 175 protein:vir:174 Length: 423 # 97.3 9.8E-05 6.1E-08 42.6 19.7 273 15-310 1-305 (423) 176 protein:vir:3525 Length: 423 # 97.3 9.9E-05 6.1E-08 42.5 19.2 259 15-310 1-317 (423) 177 protein:vir:105522 Length: 423 97.0 0.00023 1.4E-07 40.5 19.5 263 15-310 1-305 (423) 178 protein:vir:5255 Length: 304 # 96.9 0.0002 1.2E-07 40.9 14.8 272 20-308 1-304 (304) 179 protein:vir:95451 Length: 313 96.6 0.00039 2.4E-07 39.3 14.2 275 15-310 1-310 (313) 180 protein:vir:1781 Length: 221 # 96.5 0.00028 1.8E-07 40.0 13.1 185 96-310 1-201 (221) 181 protein:vir:103886 Length: 302 96.5 0.00022 1.4E-07 40.6 12.3 264 15-310 1-301 (302) 182 protein:vir:96792 Length: 315 96.5 0.00059 3.6E-07 38.3 17.1 264 15-310 1-282 (315) 183 protein:vir:94070 Length: 339 96.4 0.00048 3E-07 38.8 13.5 280 1-309 21-339 (339) 184 protein:vir:1153 Length: 338 # 96.4 0.00066 4.1E-07 38.0 16.2 288 1-310 1-335 (338) 185 protein:vir:80446 Length: 367 96.3 0.00077 4.8E-07 37.6 19.1 273 1-310 1-347 (367) 186 protein:vir:100331 Length: 342 96.2 0.00074 4.6E-07 37.7 13.5 288 1-310 1-337 (342) 187 protein:vir:78387 Length: 349 96.1 0.001 6.4E-07 36.9 20.0 272 15-310 1-327 (349) 188 protein:vir:98856 Length: 343 96.1 0.00088 5.5E-07 37.3 13.3 288 1-310 1-332 (343) 189 protein:vir:3643 Length: 336 # 95.9 0.0012 7.2E-07 36.7 13.4 284 1-309 17-336 (336) 190 protein:vir:79171 Length: 337 95.9 0.0013 8.3E-07 36.3 16.5 288 1-310 1-333 (337) 191 protein:vir:104011 Length: 337 95.9 0.0013 8.4E-07 36.3 16.6 288 1-310 1-333 (337) 192 protein:vir:78558 Length: 336 95.8 0.0013 8.4E-07 36.3 13.3 283 1-309 17-336 (336) 193 protein:vir:94989 Length: 349 95.7 0.0015 9.4E-07 36.0 21.0 268 15-310 1-327 (349) 194 protein:vir:107732 Length: 379 95.5 0.0013 8.2E-07 36.4 12.0 294 1-309 34-379 (379) 195 protein:vir:101557 Length: 336 95.4 0.0019 1.2E-06 35.5 12.6 284 1-309 17-336 (336) 196 protein:vir:98566 Length: 355 95.1 0.0027 1.7E-06 34.7 14.4 288 1-310 1-341 (355) 197 protein:vir:270 Length: 341 # 95.1 0.0027 1.7E-06 34.7 14.9 286 1-310 1-331 (341) 198 protein:vir:79157 Length: 339 95.0 0.0029 1.8E-06 34.5 15.2 288 1-310 1-334 (339) 199 protein:vir:106734 Length: 336 95.0 0.0023 1.4E-06 35.0 11.9 283 1-309 17-336 (336) 200 protein:vir:1829 Length: 355 # 94.8 0.0036 2.2E-06 34.0 15.9 288 1-310 1-341 (355) 201 protein:vir:6061 Length: 357 # 94.7 0.0037 2.3E-06 33.9 13.5 288 1-310 1-341 (357) 202 protein:vir:99576 Length: 388 94.6 0.0026 1.6E-06 34.8 11.1 290 1-309 21-388 (388) 203 protein:vir:78186 Length: 337 94.5 0.0041 2.6E-06 33.6 15.1 288 1-310 1-333 (337) 204 protein:vir:3783 Length: 336 # 94.5 0.0042 2.6E-06 33.6 14.5 286 1-310 1-329 (336) 205 protein:vir:3746 Length: 336 # 94.4 0.0045 2.8E-06 33.5 16.5 286 1-310 1-329 (336) 206 protein:vir:5694 Length: 357 # 94.3 0.0047 2.9E-06 33.3 13.5 288 1-310 1-341 (357) 207 protein:vir:78777 Length: 358 94.2 0.0051 3.2E-06 33.1 15.6 284 1-310 1-337 (358) 208 protein:vir:107120 Length: 329 94.2 0.0052 3.2E-06 33.1 20.7 281 1-310 6-306 (329) 209 protein:vir:2016 Length: 357 # 94.1 0.0053 3.3E-06 33.1 13.7 288 1-310 1-341 (357) 210 protein:vir:94800 Length: 319 93.9 0.0059 3.7E-06 32.8 20.3 278 1-310 1-295 (319) 211 protein:vir:97331 Length: 319 93.9 0.0059 3.7E-06 32.8 20.3 278 1-310 1-295 (319) 212 protein:vir:5942 Length: 523 # 93.8 0.0065 4E-06 32.6 11.7 288 1-310 175-520 (523) 213 protein:vir:1663 Length: 393 # 93.5 0.0042 2.6E-06 33.6 10.2 283 1-309 94-393 (393) 214 protein:vir:96079 Length: 382 93.4 0.0062 3.8E-06 32.7 11.1 294 1-309 51-382 (382) 215 protein:vir:93966 Length: 400 93.3 0.0046 2.9E-06 33.4 10.1 283 1-309 101-400 (400) 216 protein:vir:861 Length: 318 # 93.0 0.0072 4.4E-06 32.3 10.7 283 1-309 19-318 (318) 217 protein:vir:78920 Length: 290 86.7 0.044 2.7E-05 28.0 19.9 276 15-310 1-289 (290) 218 protein:vir:348 Length: 321 # 84.2 0.062 3.9E-05 27.2 15.4 278 7-309 1-321 (321) 219 protein:vir:104915 Length: 470 82.5 0.076 4.7E-05 26.7 16.0 291 1-310 45-468 (470) 220 protein:vir:95603 Length: 463 80.8 0.091 5.7E-05 26.3 16.3 289 1-310 3-338 (463) 221 protein:vir:99311 Length: 463 80.8 0.091 5.7E-05 26.3 16.3 289 1-310 3-338 (463) 222 protein:vir:100603 Length: 529 80.4 0.095 5.9E-05 26.2 14.5 283 1-310 44-509 (529) 223 protein:vir:103463 Length: 521 77.5 0.12 7.7E-05 25.5 16.8 285 1-310 49-520 (521) 224 protein:vir:96666 Length: 462 74.2 0.16 0.0001 24.9 17.7 286 1-310 12-338 (462) 225 protein:vir:79008 Length: 299 71.9 0.19 0.00012 24.5 20.1 276 21-310 1-297 (299) 226 protein:vir:94870 Length: 318 68.2 0.24 0.00015 24.0 10.8 284 1-309 1-318 (318) 227 protein:vir:106998 Length: 468 62.6 0.33 0.00021 23.2 18.4 289 1-310 43-445 (468) 228 protein:vir:96442 Length: 418 61.6 0.35 0.00022 23.1 13.8 298 1-310 41-407 (418) 229 protein:vir:6901 Length: 522 # 61.5 0.35 0.00022 23.1 16.8 286 1-310 45-501 (522) 230 protein:vir:7214 Length: 521 # 56.3 0.46 0.00028 22.4 16.4 285 1-310 49-520 (521) 231 protein:vir:79712 Length: 285 54.0 0.51 0.00032 22.2 18.5 270 22-310 1-284 (285) 232 protein:vir:106286 Length: 534 50.4 0.61 0.00038 21.8 17.8 284 1-310 56-533 (534) 233 protein:vir:100851 Length: 514 45.8 0.76 0.00047 21.2 9.1 280 1-310 20-351 (514) 234 protein:vir:103370 Length: 418 44.0 0.82 0.00051 21.1 14.0 295 1-310 41-405 (418) 235 protein:vir:98143 Length: 524 42.4 0.89 0.00055 20.9 15.8 288 1-310 49-523 (524) 236 protein:vir:101039 Length: 529 42.3 0.89 0.00055 20.9 13.3 290 1-310 49-509 (529) 237 protein:vir:107947 Length: 519 42.0 0.9 0.00056 20.8 15.9 285 1-310 54-518 (519) 238 protein:vir:5670 Length: 514 # 41.3 0.93 0.00058 20.8 15.8 286 1-310 41-498 (514) 239 protein:vir:80986 Length: 528 38.5 1.1 0.00066 20.4 17.8 286 1-310 43-527 (528) 240 protein:vir:104549 Length: 462 35.9 1.2 0.00075 20.1 13.8 286 1-310 116-460 (462) 241 protein:vir:63741 Length: 468 32.4 1.4 0.00088 19.7 15.5 289 1-310 12-338 (468) 242 protein:vir:101811 Length: 529 30.9 1.5 0.00095 19.6 14.7 289 1-310 49-509 (529) 243 protein:vir:80491 Length: 467 26.0 2 0.0012 18.9 15.4 289 1-310 11-337 (467) 244 protein:vir:6601 Length: 528 # 25.3 2.1 0.0013 18.8 17.6 286 1-310 43-527 (528) 245 protein:vir:93696 Length: 364 22.6 2.4 0.0015 18.5 15.1 277 15-310 1-360 (364) 246 protein:vir:102823 Length: 470 21.9 2.5 0.0016 18.4 14.6 295 2-310 1-366 (470) No 1 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=1.7e-68 Score=392.06 Aligned_cols=310 Identities=77% Similarity=1.194 Sum_probs=288.4 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPI 80 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~ 80 (310) ||+|+.++.+....+.++++.+|++||+++++++++.+++.++|+++|+++|+.++.++||+.++++++.|++|++.+|+ T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~ 80 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGEGDMKPI 80 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecCCccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccc Q lcl|NC_021307. 81 TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGT 160 (310) Q Consensus 81 ~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~ 160 (310) ++++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|++||+|+|++.+..+.+............. T Consensus 81 ~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~ 160 (320) T protein:vir:10 81 TKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPGGA 160 (320) T ss_pred cccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceecccc Confidence 99999999999999999999999999999999999999999999999999999999999988888776665555443332 Q ss_pred h------HHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCC Q lcl|NC_021307. 161 T------YDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVA 234 (310) Q Consensus 161 ~------~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~ 234 (310) + .++.+.++...+...+..+++|+||++++.+|+++||++|+++|++....+......+++++|+||++++++| T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~ 240 (320) T protein:vir:10 161 TASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDHVA 240 (320) T ss_pred cccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeeEecCCCC Confidence 2 3344556777788889999999999999999999999999999998888877777788899999999999999 Q ss_pred CCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 235 SGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 235 ~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +++..+++|||++++++++++++++++++.+++..+++...++++|++|+++||++.|+||++.+++||++|++++ T Consensus 241 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ 316 (320) T protein:vir:10 241 DGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVV 316 (320) T ss_pred CCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEecc Confidence 9998899999999999999999999999999999999999999999999999999999999999999999999877 No 2 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=9.1e-68 Score=388.12 Aligned_cols=310 Identities=74% Similarity=1.184 Sum_probs=289.0 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPI 80 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~ 80 (310) |+||..++.+......++++.++++||+++..+|++.+++.++|+++|+++|++++.++||+.+++++++|++|++++|+ T Consensus 1 ~~~~~~~~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~ 80 (318) T protein:vir:24 1 MAAGTAFAVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVSAQWIGEGDMKPI 80 (318) T ss_pred CCCCCCCCHHHHHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceEEecCCccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceec--c Q lcl|NC_021307. 81 TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPA--T 158 (310) Q Consensus 81 ~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~--~ 158 (310) ++++|+++++++||+++++++|+|+++|+.++++++|.++|++++++++|+++++|+|++.+.+....+........ . T Consensus 81 ~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~ 160 (318) T protein:vir:24 81 TKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADTTGA 160 (318) T ss_pred cccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999888777665544433322 3 Q ss_pred cchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCce Q lcl|NC_021307. 159 GTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTT 238 (310) Q Consensus 159 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~ 238 (310) .+..++...++...+...+..+++|+||++++..|+++||++|+|+|++....+......+++++|+|++++++++.++. T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~ 240 (318) T protein:vir:24 161 TTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSDHVVEGTT 240 (318) T ss_pred cchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEEeCCCCCCcc Confidence 33445556677888888999999999999999999999999999999999988888888889999999999999999999 Q ss_pred eEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 239 VGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 239 ~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .+++|||++++++++++++++++++.+++...+....++++|++|++.||+++|+||++.+++||++|++++ T Consensus 241 ~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~ 312 (318) T protein:vir:24 241 VGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVV 312 (318) T ss_pred EEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeec Confidence 899999999999999999999999999999999999999999999999999999999999999999999988 No 3 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=3.4e-66 Score=379.53 Aligned_cols=310 Identities=76% Similarity=1.160 Sum_probs=278.3 Q ss_pred Cccch-----hhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccc Q lcl|NC_021307. 1 MAAGT-----AFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEG 75 (310) Q Consensus 1 ~aa~~-----~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (310) |+..+ ++..++.++..++++.+|++||++++++|++.+++.++++++|+++|++++.+++|+.++++.++|++|| T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg 80 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGDVSASWIGEG 80 (326) T ss_pred CCCCccchhhhcCcchhhheeccccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCCcceEEecCC Confidence 65553 3445566777788888899999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccce Q lcl|NC_021307. 76 DMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLT 155 (310) Q Consensus 76 ~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~ 155 (310) +.+|+++++|+++++.++|+++++++|+|+++||.++++++|.++|++++++++|+++|+|+|++.+.++.......... T Consensus 81 ~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~~~~~~ 160 (326) T protein:vir:42 81 DMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTTKEVSLV 160 (326) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccee Confidence 99999999999999999999999999999999999999999999999999999999999999998887766544332222 Q ss_pred ec------c-cchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEE Q lcl|NC_021307. 156 PA------T-GTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTI 228 (310) Q Consensus 156 ~~------~-~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~ 228 (310) .. . ....+..+.+....+...+..+++|+||++++.+|+++||++|+|+|++....+.......++++|+||+ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~ 240 (326) T protein:vir:42 161 DPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTI 240 (326) T ss_pred ecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeEE Confidence 11 1 1222333455667778888899999999999999999999999999999988888877888999999999 Q ss_pred EeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEee Q lcl|NC_021307. 229 LSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTN 308 (310) Q Consensus 229 ~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~ 308 (310) +++++|+++..+++|||++++++++++++++++++.++++.+++...++++|++|++.||++.|+||++.+++||++|++ T Consensus 241 ~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~ 320 (326) T protein:vir:42 241 LSDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTN 320 (326) T ss_pred EcCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cC Q lcl|NC_021307. 309 AA 310 (310) Q Consensus 309 aa 310 (310) ++ T Consensus 321 ~~ 322 (326) T protein:vir:42 321 VD 322 (326) T ss_pred cc Confidence 99 No 4 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=3.1e-66 Score=379.70 Aligned_cols=305 Identities=73% Similarity=1.126 Sum_probs=277.4 Q ss_pred hhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecccccccccccc Q lcl|NC_021307. 5 TAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPITKGD 84 (310) Q Consensus 5 ~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~ 84 (310) =-+..+....+.++++.++++|||+++.+|++.+++.++|++++++++|+++.++||+.+..++++|++|++.+|+++++ T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~ 80 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWIGEGDMKPITKGN 80 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEecCCccccccccc Confidence 34456666777888888899999999999999999999999999999999988999999999999999999999999999 Q ss_pred eeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHH Q lcl|NC_021307. 85 MSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDA 164 (310) Q Consensus 85 ~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (310) |+++++++||++++++||+|+++|+.++++++|+++|++++++++|+++|+|+|++.+................... .+ T Consensus 81 f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~~~~-~~ 159 (397) T protein:vir:23 81 MTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISPNAY-QG 159 (397) T ss_pred eeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeecccch-hH Confidence 99999999999999999999999999999999999999999999999999999998876655444433333333333 34 Q ss_pred HHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeec Q lcl|NC_021307. 165 IGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGD 244 (310) Q Consensus 165 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd 244 (310) ...++...+...+..+++|+||++++.+|+++||++|+|+|++....+......+++|+|+||++++++|.++..+++|| T Consensus 160 ~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gD 239 (397) T protein:vir:23 160 LGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEGDVVGYAGD 239 (397) T ss_pred HHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCCceEEEEee Confidence 44567778888899999999999999999999999999999999988888888889999999999999999998889999 Q ss_pred ceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 245 FSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 245 ~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) |+++++++++++.++++++.++++..+....++++|++|+++||++.|+||++++++||++++..+ T Consensus 240 fs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~ 305 (397) T protein:vir:23 240 FSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDP 305 (397) T ss_pred cceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeecc Confidence 999999999999999999999999999999999999999999999999999999999999999988 No 5 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=1.2e-62 Score=359.97 Aligned_cols=305 Identities=59% Similarity=0.937 Sum_probs=261.7 Q ss_pred ccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccccccccc Q lcl|NC_021307. 2 AAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPIT 81 (310) Q Consensus 2 aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~ 81 (310) .+++.+... ..++++.+|+++||+++++|++.+++.++|++++++++++++.+++|+.+++++++|++|++.+|++ T Consensus 1 m~~~~~~a~----~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~ 76 (330) T protein:vir:77 1 MAGSTVPST----QVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVSASWTGEAERKPIT 76 (330) T ss_pred Ccccccchh----hccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceeEecCCCccccc Confidence 334333332 2445667888999999999999999999999999999999988999999999999999999999999 Q ss_pred ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccc-ccc------ccc Q lcl|NC_021307. 82 KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDET-TKS------VDL 154 (310) Q Consensus 82 ~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~-~~~------~~~ 154 (310) +++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|++||+|+|++.++.+... ... ... T Consensus 77 ~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~ 156 (330) T protein:vir:77 77 KGSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNL 156 (330) T ss_pred cceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecccc Confidence 9999999999999999999999999999999999999999999999999999999999887543221 111 011 Q ss_pred ee--cccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCC Q lcl|NC_021307. 155 TP--ATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDH 232 (310) Q Consensus 155 ~~--~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~ 232 (310) .. .......+.+.+++..+...+..+++|+||++++..|+++||++|+|+|++....+......+++|+|+||+++++ T Consensus 157 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~ 236 (330) T protein:vir:77 157 TTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADN 236 (330) T ss_pred cccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEecc Confidence 11 1111223444567777888889999999999999999999999999999999888877777889999999999999 Q ss_pred CCCC----ceeEeeecceeeeEEeecccEEEEeecceeeecccc----cccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|NC_021307. 233 VASG----TTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQ----APNFVSLWQHNLVAVRVEAEYGLLINDVEAFV 304 (310) Q Consensus 233 ~~~~----~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~ 304 (310) +|.+ +..+++|||++++++++++++++++++.++.+..+. ...++++|++|+++||++.|+||++.+|+||+ T Consensus 237 ~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~ 316 (330) T protein:vir:77 237 VVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFV 316 (330) T ss_pred ccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceE Confidence 9864 467889999999999999999999999998877654 34568999999999999999999999999999 Q ss_pred EEeecC Q lcl|NC_021307. 305 KLTNAA 310 (310) Q Consensus 305 ~l~~aa 310 (310) +|+.++ T Consensus 317 ~i~~~~ 322 (330) T protein:vir:77 317 KLTDQV 322 (330) T ss_pred EEEecc Confidence 999999 No 6 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=4.8e-63 Score=362.22 Aligned_cols=295 Identities=19% Similarity=0.298 Sum_probs=261.2 Q ss_pred hhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccccccccccccee Q lcl|NC_021307. 7 FPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMS 86 (310) Q Consensus 7 ~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~ 86 (310) +... +...++++.++++||++++++|++.+++.++++++|+++|++++..++|+.+ ++.++|++|++.+|+++++|+ T Consensus 1 ~g~~--a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~f~ 77 (299) T protein:vir:41 1 MGFN--PDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMS-GVGAFWVDEAERIQTSKPTFT 77 (299) T ss_pred CCcC--CCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEc-CCceeeeecCcccccccccee Confidence 3322 2334445566778999999999999999999999999999999999999875 578999999999999999999 Q ss_pred eeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHH Q lcl|NC_021307. 87 VQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIG 166 (310) Q Consensus 87 ~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (310) ++++.++|++++++||+|+++|+.++++++|.++|++++++++|+++++|+|++.+.++.............+....+.+ T Consensus 78 ~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~~~~~~~~~l 157 (299) T protein:vir:41 78 KAKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVEETANKYDDL 157 (299) T ss_pred EEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceeeccccccHHHH Confidence 99999999999999999999999999999999999999999999999999999988887765444433333333444555 Q ss_pred HHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCc--eeEeeec Q lcl|NC_021307. 167 VNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGT--TVGYLGD 244 (310) Q Consensus 167 ~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~--~~~~~gd 244 (310) .+++.++...+..+++|+||++++.+|++++|.+|+|++++....+ .++|+|+||++++++|.++ ..+++|| T Consensus 158 ~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~------~~~l~G~PV~~~~~~~~~~~~~~~~~gd 231 (299) T protein:vir:41 158 NEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNG------VDDVLGLPIAYTPKYTFGDKDISELVGD 231 (299) T ss_pred HHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCC------CceecceeeEEecccCCCCCceEEEEEe Confidence 6888889999999999999999999999999999999999876544 2479999999999999654 5688999 Q ss_pred ceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 245 FSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 245 ~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) |++++++++++++++++++.++....+.+..++++|++|+++||++.|+||++.+++||++|+.+| T Consensus 232 fs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~a 297 (299) T protein:vir:41 232 WNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKA 297 (299) T ss_pred cccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Confidence 999999999999999999999999999999999999999999999999999999999999999999 No 7 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=2.6e-61 Score=352.70 Aligned_cols=300 Identities=17% Similarity=0.227 Sum_probs=256.9 Q ss_pred CccchhhhHHHHHhhc-cccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQ-TGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~-~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) +.+......+...+.. +.++.++++||++++++|++.+++.++++++|+++|++++.++||+.++.++++|++||+.+| T Consensus 13 ~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~ 92 (324) T protein:vir:93 13 HFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIE 92 (324) T ss_pred HHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeeecCCcccc Confidence 2222222222222222 334445678999999999999999999999999999999989999999999999999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccc-ccccccccccceecc Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDK-NLDETTKSVDLTPAT 158 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~-~~~~~~~~~~~~~~~ 158 (310) +++++|+++++.++|++++++||+|+++||.++++++|.++|++++++++|+++|+|+|++... ++............+ T Consensus 93 ~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~ 172 (324) T protein:vir:93 93 TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKG 172 (324) T ss_pred ccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCccccccccccceeccc Confidence 9999999999999999999999999999999999999999999999999999999999877543 333333333333333 Q ss_pred cchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCce Q lcl|NC_021307. 159 GTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTT 238 (310) Q Consensus 159 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~ 238 (310) . ...+.+.++...++..+..+++|+||++++.+|++++|++|+++++.. .+++|+|+||++++..+.++. T Consensus 173 ~-~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~~~~~~---------~~~~l~G~PVv~~~~~~~~~~ 242 (324) T protein:vir:93 173 D-FTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR---------NSDSLDGLPVVNLKSSNLKRG 242 (324) T ss_pred c-ccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecCC---------CCCcccceeeEeecCCCCCcc Confidence 3 334555688889999999999999999999999999999999998642 245799999999998888888 Q ss_pred eEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 239 VGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 239 ~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .+++|||++++++++++++++++++..+....+.+..+|++|++|+++||++.|+||++.+++||++|++|. T Consensus 243 ~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~ 314 (324) T protein:vir:93 243 ELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred eEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeccc Confidence 889999999999999999999999999999999999999999999999999999999999999999999888 No 8 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=3.6e-61 Score=351.95 Aligned_cols=300 Identities=18% Similarity=0.241 Sum_probs=257.6 Q ss_pred Cccch----hhhHHHHHhhcc-ccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccc Q lcl|NC_021307. 1 MAAGT----AFPVNHTQIAQT-GDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEG 75 (310) Q Consensus 1 ~aa~~----~~~~~~~~~~~~-~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (310) ++.+. ....+...+... .++.++++||++++++|++.+++.++++++++++|++++.+++|+.++.+.+.|++|| T Consensus 9 ~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 88 (324) T protein:vir:97 9 LNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEG 88 (324) T ss_pred HHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCceEEEEEecCcceeEeccC Confidence 22221 111111222222 3345666889999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccc-ccccccccccc Q lcl|NC_021307. 76 DMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDK-NLDETTKSVDL 154 (310) Q Consensus 76 ~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~-~~~~~~~~~~~ 154 (310) +.+|+++++|++++++++|+++++++|+|+++|+.++++++|.++|++++++++|+++|+|+|++... ++......... T Consensus 89 ~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~gi~~~~~~~~~ 168 (324) T protein:vir:97 89 QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNK 168 (324) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccCccccccccccce Confidence 99999999999999999999999999999999999999999999999999999999999999987643 33333344343 Q ss_pred eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCC Q lcl|NC_021307. 155 TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVA 234 (310) Q Consensus 155 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~ 234 (310) ...+..++++ +.++..++...++.+++|+||++++..|++++|++|++++++. .+++|+|+||++++..+ T Consensus 169 ~~~~~~~~~~-i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~~~~~~---------~~~tl~G~PV~~~~~~~ 238 (324) T protein:vir:97 169 VIKGDFTQDN-IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR---------NSDTLDGLPVVNLKSSN 238 (324) T ss_pred eccccCCHHH-HHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecCC---------CCccccceeeEeecCCC Confidence 4444445544 4578889999999999999999999999999999999998642 24579999999999988 Q ss_pred CCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 235 SGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 235 ~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .++..+++|||++++++++++++++++++..+....+.+..+|++|++|+++||++.|+|+++.+++||++|+++- T Consensus 239 ~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~ 314 (324) T protein:vir:97 239 LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred CCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Confidence 8888889999999999999999999999999999999999999999999999999999999999999999999988 No 9 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=6.9e-61 Score=350.42 Aligned_cols=296 Identities=18% Similarity=0.242 Sum_probs=257.0 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPI 80 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~ 80 (310) +.+++.+.. ...+.+..++++||++++++|++.+++.++|+++|+++|+.+++++||+.++.+++.|++|++.+|+ T Consensus 18 ~~~~~~~~a----~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~ 93 (324) T protein:vir:99 18 NVKPQVFNP----DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIET 93 (324) T ss_pred hhhhhhccc----cceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEeccCccccc Confidence 333333321 1122334456789999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccc-cccccccccceeccc Q lcl|NC_021307. 81 TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKN-LDETTKSVDLTPATG 159 (310) Q Consensus 81 ~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~-~~~~~~~~~~~~~~~ 159 (310) ++++|+++++.++|+++++++|+|+++|+.++++++|.++|++++++++|+++|+|+|++.... +............+. T Consensus 94 ~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~ 173 (324) T protein:vir:99 94 SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGD 173 (324) T ss_pred cccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceecccc Confidence 9999999999999999999999999999999999999999999999999999999999875333 333333333333333 Q ss_pred chHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCcee Q lcl|NC_021307. 160 TTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTV 239 (310) Q Consensus 160 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~ 239 (310) .+ .+.+.++...+...+..+++|+||++++..|++++|++|++++... .+++|+|+||++++.++.++.. T Consensus 174 ~~-~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~---------~~~~l~G~PVv~~~~~~~~~~~ 243 (324) T protein:vir:99 174 FT-QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR---------NSDTLDGLPVVNLKSSNLKRGE 243 (324) T ss_pred CC-HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecCC---------CCccccceeEEeecCCCCCcce Confidence 33 4555688899999999999999999999999999999999998642 2457999999999999988888 Q ss_pred EeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 240 GYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 240 ~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +++|||++++++++++++++++++..+....+.+..+|++|++|+++||++.|+||++.+++||++|+++. T Consensus 244 ~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~ 314 (324) T protein:vir:99 244 LITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred EEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999988 No 10 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=5.9e-61 Score=350.77 Aligned_cols=296 Identities=18% Similarity=0.235 Sum_probs=257.9 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPI 80 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~ 80 (310) |.+++.+.. ...+.+...+++||++++++|++.+++.++|+++|+++|++++.+++|+.++.+.+.|++|++.+|+ T Consensus 18 ~~~~~~~~a----~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~ 93 (324) T protein:vir:10 18 NVKPQVFNP----DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIET 93 (324) T ss_pred hhccceecc----cceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceeEeccCccccc Confidence 444444432 1123344556789999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccc-ccccccccccceeccc Q lcl|NC_021307. 81 TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDK-NLDETTKSVDLTPATG 159 (310) Q Consensus 81 ~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~-~~~~~~~~~~~~~~~~ 159 (310) ++++|+++++.++|+++++++|+|+++|+.++++++|.++|++++++++|+++|+|+|++... ++............+. T Consensus 94 ~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~~~~~i~~~~~~~~~~~~~~ 173 (324) T protein:vir:10 94 SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGD 173 (324) T ss_pred cccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceecccc Confidence 999999999999999999999999999999999999999999999999999999999987533 3333333333333333 Q ss_pred chHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCcee Q lcl|NC_021307. 160 TTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTV 239 (310) Q Consensus 160 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~ 239 (310) .++ +.+.++...+...+..+++|+||++++..|++++|++|++++++. .+++|+|+||++++.++.++.. T Consensus 174 ~t~-~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~---------~~~~l~G~PV~~~~~~~~~~~~ 243 (324) T protein:vir:10 174 FTQ-DNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR---------NSDTLDGLPVVNLKSSNLKRGE 243 (324) T ss_pred CCH-HHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeecCC---------CCccccceeEEeecCCCCCcce Confidence 344 445578889999999999999999999999999999999998642 2457999999999998888888 Q ss_pred EeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 240 GYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 240 ~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +++|||+++++++++++++++++++.+....+.+..++++|++|+++||++.|+||++.+++||++|+++. T Consensus 244 ~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~ 314 (324) T protein:vir:10 244 LITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred EEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999998 No 11 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=7.3e-61 Score=350.29 Aligned_cols=300 Identities=18% Similarity=0.226 Sum_probs=256.1 Q ss_pred Cccch----hhhHHHHHh-hccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccc Q lcl|NC_021307. 1 MAAGT----AFPVNHTQI-AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEG 75 (310) Q Consensus 1 ~aa~~----~~~~~~~~~-~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (310) +..+. ....+...+ ..+.++.++++||+++..+|++.+++.++++++++++|++++.+++|+.++++.++|++|+ T Consensus 9 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg 88 (324) T protein:vir:96 9 LNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEG 88 (324) T ss_pred HHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEecCC Confidence 22221 111111112 2333455667899999999999999999999999999999988999999999999999999 Q ss_pred ccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccc-ccccccccc Q lcl|NC_021307. 76 DMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNL-DETTKSVDL 154 (310) Q Consensus 76 ~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~-~~~~~~~~~ 154 (310) +.+|+++++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++....+ ......... T Consensus 89 ~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~ 168 (324) T protein:vir:96 89 QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNK 168 (324) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccce Confidence 9999999999999999999999999999999999999999999999999999999999999998764433 333333333 Q ss_pred eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCC Q lcl|NC_021307. 155 TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVA 234 (310) Q Consensus 155 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~ 234 (310) ...+..+ .+.+.++...+...+..+++|+||++++.+|++++|.+|++++... .+++|+|+||++++.++ T Consensus 169 ~~~~~~t-~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~---------~~~~l~G~PV~~~~~~~ 238 (324) T protein:vir:96 169 VIKGDFT-QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR---------NSDSLDGLPVVNLKSSN 238 (324) T ss_pred ecccccc-HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecCC---------CCCcccceeeEeeCCCC Confidence 3333334 4445678889999999999999999999999999999999998632 24579999999999888 Q ss_pred CCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 235 SGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 235 ~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .++..+++|||+++++++++++++++++++.+....+.+..+|++|++|++.||++.|+||++.+|+||++|+++= T Consensus 239 ~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~ 314 (324) T protein:vir:96 239 LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred CCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeccc Confidence 8888889999999999999999999999999999999999999999999999999999999999999999999865 No 12 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=7.3e-61 Score=350.29 Aligned_cols=300 Identities=18% Similarity=0.226 Sum_probs=256.1 Q ss_pred Cccch----hhhHHHHHh-hccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccc Q lcl|NC_021307. 1 MAAGT----AFPVNHTQI-AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEG 75 (310) Q Consensus 1 ~aa~~----~~~~~~~~~-~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (310) +..+. ....+...+ ..+.++.++++||+++..+|++.+++.++++++++++|++++.+++|+.++++.++|++|+ T Consensus 9 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg 88 (324) T protein:vir:78 9 LNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEG 88 (324) T ss_pred HHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEecCC Confidence 22221 111111112 2333455667899999999999999999999999999999988999999999999999999 Q ss_pred ccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccc-ccccccccc Q lcl|NC_021307. 76 DMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNL-DETTKSVDL 154 (310) Q Consensus 76 ~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~-~~~~~~~~~ 154 (310) +.+|+++++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++....+ ......... T Consensus 89 ~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~ 168 (324) T protein:vir:78 89 QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNK 168 (324) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccce Confidence 9999999999999999999999999999999999999999999999999999999999999998764433 333333333 Q ss_pred eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCC Q lcl|NC_021307. 155 TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVA 234 (310) Q Consensus 155 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~ 234 (310) ...+..+ .+.+.++...+...+..+++|+||++++.+|++++|.+|++++... .+++|+|+||++++.++ T Consensus 169 ~~~~~~t-~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~---------~~~~l~G~PV~~~~~~~ 238 (324) T protein:vir:78 169 VIKGDFT-QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR---------NSDSLDGLPVVNLKSSN 238 (324) T ss_pred ecccccc-HHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecCC---------CCCcccceeeEeeCCCC Confidence 3333334 4445678889999999999999999999999999999999998632 24579999999999888 Q ss_pred CCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 235 SGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 235 ~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .++..+++|||+++++++++++++++++++.+....+.+..+|++|++|++.||++.|+||++.+|+||++|+++= T Consensus 239 ~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~ 314 (324) T protein:vir:78 239 LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred CCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeccc Confidence 8888889999999999999999999999999999999999999999999999999999999999999999999865 No 13 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=9.4e-61 Score=349.68 Aligned_cols=296 Identities=19% Similarity=0.234 Sum_probs=255.5 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPI 80 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~ 80 (310) +.+++.+.. ...+.+..++++||++++++|++.+++.++++++++++|++++.++||+.++.++++|++|++.+|+ T Consensus 18 ~~~~~~~~a----~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~ 93 (324) T protein:vir:96 18 NVKPQVFNP----DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIET 93 (324) T ss_pred hhhhhhccc----ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeeecCCccccc Confidence 222222211 1122334566799999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccc-cccccccceeccc Q lcl|NC_021307. 81 TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLD-ETTKSVDLTPATG 159 (310) Q Consensus 81 ~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~-~~~~~~~~~~~~~ 159 (310) ++++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++....+. ..+.......... T Consensus 94 ~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~ 173 (324) T protein:vir:96 94 SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIKKTNKVIKGD 173 (324) T ss_pred cccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCcCccccccccccceecccc Confidence 999999999999999999999999999999999999999999999999999999999987644333 3333333333333 Q ss_pred chHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCcee Q lcl|NC_021307. 160 TTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTV 239 (310) Q Consensus 160 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~ 239 (310) .+++ .+.++..+++..+..+++|+||++++.+|++++|++|+++++.. .+++|+|+||++++..+.++.. T Consensus 174 ~~~~-~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~~~~~~~---------~~~~l~G~PV~~~~~~~~~~~~ 243 (324) T protein:vir:96 174 FTQD-NIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR---------NSDSLDGLPVVNLKSSNLKRGE 243 (324) T ss_pred cchH-HHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecCC---------CCCcccceeeEeecCCCCCcce Confidence 4444 45578888999999999999999999999999999999998632 2457999999999988888888 Q ss_pred EeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 240 GYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 240 ~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +++|||+++++++++++++++++++.+....+.+..+|++|++|+++||++.|+||++.+++||++|++|- T Consensus 244 ~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~ 314 (324) T protein:vir:96 244 LITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred EEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999888 No 14 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=2.2e-60 Score=347.66 Aligned_cols=285 Identities=16% Similarity=0.213 Sum_probs=243.5 Q ss_pred hccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecccccccccccceeeeEeeeee Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQVEPHK 94 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k 94 (310) |..+++.+|.++|++++.+|++.+++.|+++++|+++|++++.+++|+.+++++++|++|++.+|+++++|+++++++|| T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~k 80 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVTIVPLK 80 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEeeCCcccccccccceeeEeeeEE Confidence 88888888999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEeeehhhHHHhh---cChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccc--ccccc--cccc--cceecccchHHHH Q lcl|NC_021307. 95 IATIFVASAETVR---ANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDK--NLDET--TKSV--DLTPATGTTYDAI 165 (310) Q Consensus 95 ~~~~~~is~ell~---~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~--~~~~~--~~~~--~~~~~~~~~~~~~ 165 (310) ++++++||+|+++ ++.++++++|.++|++++++++|+++|+|++++.+. .+.+. .... ......+...++. T Consensus 81 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (300) T protein:vir:95 81 VEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDTNPDES 160 (300) T ss_pred EEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeecccccchHHH Confidence 9999999999994 667899999999999999999999999996543322 22211 1111 1122334555666 Q ss_pred HHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCC----ceeEe Q lcl|NC_021307. 166 GVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASG----TTVGY 241 (310) Q Consensus 166 ~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~----~~~~~ 241 (310) +.++...+...++.+++|+|||+++.+|+++||++|+|+|++...++ .+++|+|+||++++.+|.+ +..++ T Consensus 161 i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~-----~~~~l~G~Pv~~s~~v~~~~~~~~~~~~ 235 (300) T protein:vir:95 161 MEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGG-----VPDAINGLAVDKNRTVSYSQTDPKNTAI 235 (300) T ss_pred HHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccC-----CCceecceeeEEecCCCCCCCCCccEEE Confidence 77888888888899999999999999999999999999997655433 3578999999999999854 34567 Q ss_pred eecceeee-EEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 242 LGDFSQIV-WGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 242 ~gd~~~~~-~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +|||++++ ++.|++++++++++.. .+...+++|++|++.+|+++|+||++.+|+||++|+++| T Consensus 236 ~GDf~~~~~~~~~~~~~~~v~~~~~------~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~ 299 (300) T protein:vir:95 236 VGDFETMFKWGYAKEVPMEIIKYGD------PDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTG 299 (300) T ss_pred EeeccceEEEEEecccEEEEeeccC------CCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCC Confidence 89999865 8899999999986553 344567899999999999999999999999999999999 No 15 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=3.2e-60 Score=346.76 Aligned_cols=293 Identities=20% Similarity=0.303 Sum_probs=259.1 Q ss_pred ccchhhhHHHHHhhc-cccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 2 AAGTAFPVNHTQIAQ-TGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGST-GVKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 2 aa~~~~~~~~~~~~~-~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) +..+.+ .+.+ +.++.++++||++++++|++.+++.++++++|+++|+++. ...+|+..+++.++|++||+.+| T Consensus 1 m~~~~~-----~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~ 75 (297) T protein:vir:95 1 MTVQTF-----NPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKIK 75 (297) T ss_pred CCcccc-----ccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCcccc Confidence 222222 2222 3345566789999999999999999999999999999765 46788888899999999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceeccc Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATG 159 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~ 159 (310) +++++|++++++++|+++++++|+|+++|+.++++++|.++|++++++++|+++|+|+|++.+.++...........+.. T Consensus 76 ~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~~ 155 (297) T protein:vir:95 76 TDKPEVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIGGP 155 (297) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecccc Confidence 99999999999999999999999999999999999999999999999999999999999988887776655555555555 Q ss_pred chHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCcee Q lcl|NC_021307. 160 TTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTV 239 (310) Q Consensus 160 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~ 239 (310) .+++++ .++...+...+..+++|+||++++.+|++++|.+|+|++++. +++|+|+||+++...+.++.. T Consensus 156 ~t~~~i-~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~~----------~~~l~G~Pv~~~~~~~~~~~~ 224 (297) T protein:vir:95 156 INYDNI-LKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDKA----------ANTIDGITTVDLKSARFEKGD 224 (297) T ss_pred cCHHHH-HHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecCC----------CCcccceeeEeecCCCCCCce Confidence 566555 578888999999999999999999999999999999998643 356999999999888877778 Q ss_pred EeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 240 GYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 240 ~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +++|||++++++++++++++++++.++....+....++++|++|+++||++.|+||++.+|+||++|+.|. T Consensus 225 ~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at 295 (297) T protein:vir:95 225 LLAGDFDNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAE 295 (297) T ss_pred EEEEecccEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecC Confidence 89999999999999999999999999999999999999999999999999999999999999999999999 No 16 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=5.2e-60 Score=345.61 Aligned_cols=298 Identities=19% Similarity=0.228 Sum_probs=238.2 Q ss_pred Ccc--ch-----h-----hhHHHH-HhhccccCCCCceechhhHHHHHHHHHhhchhhhh-cceeecCCCceEEEEEcCC Q lcl|NC_021307. 1 MAA--GT-----A-----FPVNHT-QIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRV-ARKIPMGSTGVKIPHWTGD 66 (310) Q Consensus 1 ~aa--~~-----~-----~~~~~~-~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~-~~~~~~~~~~~~ip~~~~~ 66 (310) +++ |. . +..+.. ....++++.+|.+||+++.++|++.+++.++++++ ++.+|+.++.+++|+.+++ T Consensus 39 ~a~~~g~~~~a~~~a~~~~~~~~~~~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~ 118 (366) T protein:vir:57 39 IAAGKGNLADAAKFAATELGDTGLSMAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGG 118 (366) T ss_pred HHhcccchhHHHHHHHHhhcchhhhhhccccccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCC Confidence 111 11 1 111111 22334444555567888999999999999999998 8889998889999999999 Q ss_pred ceeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcc-ccccc Q lcl|NC_021307. 67 VSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSP-FDKNL 145 (310) Q Consensus 67 ~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~-~~~~~ 145 (310) ++++|++|++.+|+++++|+++++.++|+++++++|+|+++|+.++++++|+++|++++++++|++||+|+|++ .|.++ T Consensus 119 ~~a~wv~E~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi 198 (366) T protein:vir:57 119 ATAGYVGEGKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGM 198 (366) T ss_pred cceeeeccCccccccccceeEEEEeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999986 45555 Q ss_pred cccccccccee---cccch---HHHHH--HHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCcccccccccccccccc Q lcl|NC_021307. 146 DETTKSVDLTP---ATGTT---YDAIG--VNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPY 217 (310) Q Consensus 146 ~~~~~~~~~~~---~~~~~---~~~~~--~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~ 217 (310) ...+....... .+..+ .+.+. ..........+..+++|+||+.++..|++++|++|+|+|++. T Consensus 199 ~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~~~~--------- 269 (366) T protein:vir:57 199 KAVATAANRLVAWTGTAINLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRDGNGNKVYPEM--------- 269 (366) T ss_pred eeccccccceeeccccccchhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhccCCceeccCC--------- Confidence 43332222111 11111 22221 122223345567899999999999999999999999999632 Q ss_pred CCceeeeeeEEEeCCCCC------CceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEE Q lcl|NC_021307. 218 REGRILGRPTILSDHVAS------GTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEA 291 (310) Q Consensus 218 ~~~~l~G~pv~~t~~~~~------~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~ 291 (310) .+++|+|+||++++++|+ +...++||||+++++++|+++++++++++++.. .....+++|++|+++||+++ T Consensus 270 ~~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~---~~g~~~~~f~~~~~~iR~~~ 346 (366) T protein:vir:57 270 SQGILKGYPIQRTSAIPANLGDDGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKD---ADGQLVSAFARNQSLIRVVT 346 (366) T ss_pred CCCeecceeeEEccccccccccCCCccEEEEEecceEEEEEecceEEEEeecccccc---ccccchhhhhcCceeEEeee Confidence 235799999999999986 345688999999999999999999999987553 44567789999999999999 Q ss_pred EeccEEeccCceEEEeecC Q lcl|NC_021307. 292 EYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 292 ~~d~~v~~~~a~~~l~~aa 310 (310) |+||+++||+||++|++.= T Consensus 347 ~~d~~v~~~~a~~~lt~~~ 365 (366) T protein:vir:57 347 EHDIGFRHPEGLVLGTGVI 365 (366) T ss_pred eeCcEeeccccEEEEeccc Confidence 9999999999999998888 No 17 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=2.2e-59 Score=342.14 Aligned_cols=294 Identities=21% Similarity=0.302 Sum_probs=250.6 Q ss_pred CccchhhhHHHHHhhccc-cCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTG-DSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~-~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) ||.++. .+.+++ ++.+|.+||+++.++|++.+++.++++++|+++|++++.++||+.++.+.+.|++|++++| T Consensus 1 ma~~~~------~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~ 74 (304) T protein:vir:94 1 MATPTY------TPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQ 74 (304) T ss_pred Cccccc------ccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcccc Confidence 554443 333344 4445567889999999999999999999999999999999999999999999999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccc--c---ccccc Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDET--T---KSVDL 154 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~--~---~~~~~ 154 (310) +++++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|+++++|+|++.+.+.... . ..... T Consensus 75 ~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~ 154 (304) T protein:vir:94 75 TSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGN 154 (304) T ss_pred cccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999999999998876654321 1 11111 Q ss_pred eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCC Q lcl|NC_021307. 155 TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVA 234 (310) Q Consensus 155 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~ 234 (310) .........+.+.++..++...+..+++|+||++++.+|++++|++|+|+|+++ +++|+|+||++++++| T Consensus 155 ~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~----------~~~l~G~PV~~~~~~~ 224 (304) T protein:vir:94 155 VVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN----------GNEIMGLPLSYTGADV 224 (304) T ss_pred ccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC----------CccccceeeEEecccc Confidence 122223334555678889999999999999999999999999999999999753 2579999999999998 Q ss_pred CC--ceeEeeecceeeeEEeecccEEEEeecceeeeccc--ccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 235 SG--TTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTP--QAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 235 ~~--~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .+ +..+++|||+++++++++++++++++++.+....+ .+..++++|++|+++||++.|+|+++.+|+||++||.|= T Consensus 225 ~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 225 YDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred cCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 53 45678999999999999999999999988765544 455678999999999999999999999999999998888 No 18 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=2.2e-59 Score=342.14 Aligned_cols=294 Identities=21% Similarity=0.302 Sum_probs=250.6 Q ss_pred CccchhhhHHHHHhhccc-cCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTG-DSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~-~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) ||.++. .+.+++ ++.+|.+||+++.++|++.+++.++++++|+++|++++.++||+.++.+.+.|++|++++| T Consensus 1 ma~~~~------~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~ 74 (304) T protein:vir:10 1 MATPTY------TPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQ 74 (304) T ss_pred Cccccc------ccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcccc Confidence 554443 333344 4445567889999999999999999999999999999999999999999999999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccc--c---ccccc Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDET--T---KSVDL 154 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~--~---~~~~~ 154 (310) +++++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|+++++|+|++.+.+.... . ..... T Consensus 75 ~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~ 154 (304) T protein:vir:10 75 TSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGN 154 (304) T ss_pred cccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999999999998876654321 1 11111 Q ss_pred eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCC Q lcl|NC_021307. 155 TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVA 234 (310) Q Consensus 155 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~ 234 (310) .........+.+.++..++...+..+++|+||++++.+|++++|++|+|+|+++ +++|+|+||++++++| T Consensus 155 ~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~----------~~~l~G~PV~~~~~~~ 224 (304) T protein:vir:10 155 VVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN----------GNEIMGLPLSYTGADV 224 (304) T ss_pred ccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC----------CccccceeeEEecccc Confidence 122223334555678889999999999999999999999999999999999753 2579999999999998 Q ss_pred CC--ceeEeeecceeeeEEeecccEEEEeecceeeeccc--ccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 235 SG--TTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTP--QAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 235 ~~--~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .+ +..+++|||+++++++++++++++++++.+....+ .+..++++|++|+++||++.|+|+++.+|+||++||.|= T Consensus 225 ~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 225 YDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred cCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 53 45678999999999999999999999988765544 455678999999999999999999999999999998888 No 19 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=5.6e-59 Score=339.95 Aligned_cols=283 Identities=22% Similarity=0.264 Sum_probs=238.8 Q ss_pred ccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecccccccccccceeeeEeeeeee Q lcl|NC_021307. 16 QTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQVEPHKI 95 (310) Q Consensus 16 ~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~ 95 (310) +.+.+.+|.++|+++.++|++.+++.|+++++|+++|++++.+++|+.+++++++|++||+.+|+++++|+++++.++|+ T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~kl 80 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeEEE Confidence 44444567788999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeehhhHHHhh---cChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccc-------ccccceecccchHHHH Q lcl|NC_021307. 96 ATIFVASAETVR---ANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETT-------KSVDLTPATGTTYDAI 165 (310) Q Consensus 96 ~~~~~is~ell~---~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~-------~~~~~~~~~~~~~~~~ 165 (310) ++++++|+|+++ ++..+++++|.++|++++++++|.++++|++++.+..+.+.. .....+.......+.. T Consensus 81 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) T protein:vir:81 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLA 160 (311) T ss_pred EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccchHHHH Confidence 999999999996 556789999999999999999999999998765544332211 1112222233344555 Q ss_pred HHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCC--------- Q lcl|NC_021307. 166 GVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASG--------- 236 (310) Q Consensus 166 ~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~--------- 236 (310) +.++..++...+...++|+||++++.+|+++||++|+|+|++.... ..+++|+|+||++++.||.+ T Consensus 161 i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~-----~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~ 235 (311) T protein:vir:81 161 VEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFG-----TDVASFAGLNAAVSDTVRGGPEAVTASTG 235 (311) T ss_pred HHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCcccc-----CCCceecceeEEecccccccccccccccc Confidence 6667777777778888999999999999999999999999875543 34578999999999998753 Q ss_pred -------ceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeec Q lcl|NC_021307. 237 -------TTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNA 309 (310) Q Consensus 237 -------~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~a 309 (310) +..+++|||++++++.+++++++++++.. ....+++|++|+++||++.|+||++.+|+||++|++| T Consensus 236 ~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~-------~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a 308 (311) T protein:vir:81 236 VYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGD-------PDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) T ss_pred hhcccCCccEEEEEecccEEEEEeccceEEEeccCC-------CCcchhhhhcCcEEEEEEEEeccEeecccceEEEEee Confidence 33568999999999999999999987752 2335689999999999999999999999999999999 Q ss_pred C Q lcl|NC_021307. 310 A 310 (310) Q Consensus 310 a 310 (310) . T Consensus 309 ~ 309 (311) T protein:vir:81 309 D 309 (311) T ss_pred c Confidence 9 No 20 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=6.1e-59 Score=339.73 Aligned_cols=288 Identities=22% Similarity=0.221 Sum_probs=232.6 Q ss_pred hccccC-CCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecccccccccccceeeeEeeee Q lcl|NC_021307. 15 AQTGDS-MFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQVEPH 93 (310) Q Consensus 15 ~~~~~~-~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l~~~ 93 (310) |..+++ .+|+++|+++..+||+.+++.|+++++|+++|++++.++||+.++++.++|++||+.+|+++++|+++++.+| T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~ 80 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeCCccccccccceeeeEeeee Confidence 655554 4556788899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEeeehhhHHHhhcChhH----HHHHHHHHHHHHHHHHHHHHHHcccCccccccccccccc----ccceecccchHHHH Q lcl|NC_021307. 94 KIATIFVASAETVRANPGN----YLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKS----VDLTPATGTTYDAI 165 (310) Q Consensus 94 k~~~~~~is~ell~~s~~~----~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~----~~~~~~~~~~~~~~ 165 (310) |++++++||+|+++++..+ ++++|.++|++++++++|.++|+|++.+.+..+.+.... ......++..++++ T Consensus 81 kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 160 (315) T protein:vir:80 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDSATADL 160 (315) T ss_pred eEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeeccccchHHH Confidence 9999999999999988765 789999999999999999999999875544333222211 11222233334443 Q ss_pred HHHHHHHhhh-hcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCC-------c Q lcl|NC_021307. 166 GVNALSLLVN-AGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASG-------T 237 (310) Q Consensus 166 ~~~~~~~l~~-~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~-------~ 237 (310) .+++.++.. .+..+++|+||++++..|++++|.+|++.+-.....+ .....+++|+|+||+++++||.+ + T Consensus 161 -~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~-~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~ 238 (315) T protein:vir:80 161 -VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPA-AGFAGLDNWRGLNVGASSTVSGAPEMSPASG 238 (315) T ss_pred -HHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccc-cccCCCceecceeeEecCcCCcccccccccc Confidence 456666644 3556778999999999999999887765543222211 11223568999999999999854 3 Q ss_pred eeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 238 TVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 238 ~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ..+++|||++++++.+++++++++++.. .+..++++|++|+++||++.|+||++++++||++|+.++ T Consensus 239 ~~~~~GDfs~~~~g~~~~~~i~i~~~~~------~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~ 305 (315) T protein:vir:80 239 VKAIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA 305 (315) T ss_pred cEEEEeecccEEEEEecCeeEEEecccc------ccCcccchhhcCcEEEEEEEEecceeecccceEEEeecc Confidence 4678999999999999999999997753 344467899999999999999999999999999999988 No 21 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=9.1e-59 Score=338.79 Aligned_cols=298 Identities=17% Similarity=0.225 Sum_probs=237.4 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhh-cceeecCCCceEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRV-ARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~-~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) ++..............++++.+|.+||+++.++||+.+++.++++++ ++.+|+.++.+++|+.++++.++|++||+.+| T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~ 192 (428) T protein:vir:10 113 FASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNGNMSLPRLAGGATASYTGENQDAK 192 (428) T ss_pred HhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCcceEEEEEeCCcceeeeccCcccc Confidence 22222222222222344444455567888899999999999999999 67899988889999999999999999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccc-ccccccccccccc---- Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPF-DKNLDETTKSVDL---- 154 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~-~~~~~~~~~~~~~---- 154 (310) +++++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|++||+|+|++. |.++......... T Consensus 193 ~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~ 272 (428) T protein:vir:10 193 VSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPW 272 (428) T ss_pred ccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999874 5555433221111 Q ss_pred eecccchHHHHH--HH---HHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEE Q lcl|NC_021307. 155 TPATGTTYDAIG--VN---ALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTIL 229 (310) Q Consensus 155 ~~~~~~~~~~~~--~~---~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~ 229 (310) ......+.+... .+ ........+..+++|+||+.++..|++++|++|+|+|++. .+++|+|+||++ T Consensus 273 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~---------~~g~l~G~pv~~ 343 (428) T protein:vir:10 273 AADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRDGNGNKVYPEM---------AQGMLKGYPIQR 343 (428) T ss_pred cccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhccCCceeccCC---------CCCeeeceeeEE Confidence 111222222211 11 1222344566789999999999999999999999999642 234799999999 Q ss_pred eCCCCCC------ceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCce Q lcl|NC_021307. 230 SDHVASG------TTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAF 303 (310) Q Consensus 230 t~~~~~~------~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~ 303 (310) ++++|.+ ...++||||++++++++++++++++++..+.. .....+++|++|+++||++.|+||++.+|+|| T Consensus 344 ~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~~~~---~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~ 420 (428) T protein:vir:10 344 TSAIPANLGEGGKESEIYFADFNDVVIGEDGNMKVDFSKEASYID---TDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGL 420 (428) T ss_pred eccccccccCCCccceEEEEecceEEEEEecceEEEeeccccccc---ccccccchhhcchhheeeeeeeCceeeccceE Confidence 9999864 35689999999999999999999999886543 33446688999999999999999999999999 Q ss_pred EEEeecC Q lcl|NC_021307. 304 VKLTNAA 310 (310) Q Consensus 304 ~~l~~aa 310 (310) +.+++.. T Consensus 421 ~~~t~~~ 427 (428) T protein:vir:10 421 VLGTGVL 427 (428) T ss_pred EEEeccC Confidence 9999999 No 22 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=3.4e-58 Score=335.62 Aligned_cols=305 Identities=26% Similarity=0.338 Sum_probs=244.9 Q ss_pred CccchhhhHHHHHhh--ccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCce--------ee Q lcl|NC_021307. 1 MAAGTAFPVNHTQIA--QTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVS--------AA 70 (310) Q Consensus 1 ~aa~~~~~~~~~~~~--~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~--------a~ 70 (310) ||.-+.+........ ...++..+++||++++++|++.+++.++|+++|+++||+++.+++|+....+. +. T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~ 80 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeecccccc Confidence 444333322211111 11122344579999999999999999999999999999999999999876544 56 Q ss_pred eecccccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccc Q lcl|NC_021307. 71 WIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTK 150 (310) Q Consensus 71 ~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~ 150 (310) |++|++.+|+++++|+++++.++|+++++++|+|+++|+.++++++|.++|++++++++|++||+|+|++.+..+.+... T Consensus 81 ~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~ 160 (338) T protein:vir:78 81 EQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDT 160 (338) T ss_pred cccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccc Confidence 67799999999999999999999999999999999999999999999999999999999999999999877655543322 Q ss_pred cccc---e-----ecccchHHHHHHHHHHHh-hhhcCCCCEEEEehHHHHHHH---HhhhccCccccccccccccccccC Q lcl|NC_021307. 151 SVDL---T-----PATGTTYDAIGVNALSLL-VNAGKKWGATLLDDVAEPILN---GAKDANGRPLFVESTYEAVTTPYR 218 (310) Q Consensus 151 ~~~~---~-----~~~~~~~~~~~~~~~~~l-~~~~~~~~~~~~~~~~~~~l~---~l~d~~g~~~~~~~~~~~~~~~~~ 218 (310) .... + ........+.+.++...+ .......++|+||++++..|+ +++|.+|+|+|++....+ . T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~-----~ 235 (338) T protein:vir:78 161 NNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLAA-----S 235 (338) T ss_pred ccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeecccccCC-----C Confidence 1111 1 111111223333444444 335566778999999988774 578999999998765544 3 Q ss_pred CceeeeeeEEEeCCCCCC-------ceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEE Q lcl|NC_021307. 219 EGRILGRPTILSDHVASG-------TTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEA 291 (310) Q Consensus 219 ~~~l~G~pv~~t~~~~~~-------~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~ 291 (310) +++|+|+||+++++||++ +..+++|||+++++++++++++++++++++++..++...++++|++|++++|++. T Consensus 236 ~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~ 315 (338) T protein:vir:78 236 AGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEV 315 (338) T ss_pred CceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEE Confidence 578999999999999852 3567899999999999999999999999999999999999999999999999999 Q ss_pred EeccEEeccCceEEEeecC Q lcl|NC_021307. 292 EYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 292 ~~d~~v~~~~a~~~l~~aa 310 (310) |+||++.|++||++|++++ T Consensus 316 r~d~~v~~~~a~~~l~~~~ 334 (338) T protein:vir:78 316 TFGWLLGDKQAFVKFVDDE 334 (338) T ss_pred EeccEeecccceEEEeccc Confidence 9999999999999999998 No 23 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=4.1e-58 Score=335.20 Aligned_cols=285 Identities=17% Similarity=0.168 Sum_probs=238.8 Q ss_pred hccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecccccccccccceeeeEeeeee Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQVEPHK 94 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k 94 (310) |. +++.+|.+||++++.+|++.+++.|+++++|+++||+++..++|+.++++.++|++|++.+|+++++|+++++.+|| T Consensus 1 m~-t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~l~~~k 79 (303) T protein:vir:97 1 MG-TETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVTIVPIK 79 (303) T ss_pred Cc-ccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEeecCccccccccceeeEEeeeEE Confidence 33 44566778999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEeeehhhHHHhh---cChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccc----cccccc-cc--eecccchHHH Q lcl|NC_021307. 95 IATIFVASAETVR---ANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLD----ETTKSV-DL--TPATGTTYDA 164 (310) Q Consensus 95 ~~~~~~is~ell~---~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~----~~~~~~-~~--~~~~~~~~~~ 164 (310) +++++++|+|+++ ++.++++++|.++|++++++++|+++++|++++.+.... ...... +. .........+ T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (303) T protein:vir:97 80 VEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESEDADA 159 (303) T ss_pred EEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccccchHH Confidence 9999999999994 667899999999999999999999999997654443221 111111 11 1112223345 Q ss_pred HHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCC------Cce Q lcl|NC_021307. 165 IGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVAS------GTT 238 (310) Q Consensus 165 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~------~~~ 238 (310) .+.+++.++...+..+++|+||++++.+|+++||++|+|+++++...+. .+++|+|+||+++++||. ++. T Consensus 160 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~----~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 235 (303) T protein:vir:97 160 NIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGA----NPDSINGLKSSVNTTVGAGADEAESKD 235 (303) T ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCC----CCceecceeeEEecccCCccccCCCcc Confidence 5567888888888999999999999999999999999999988765432 346899999999999985 345 Q ss_pred eEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 239 VGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 239 ~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .+++|||+. +.++.+++++++++++. +.+..++++|++|+++||+++|+||++.+|+||++||++= T Consensus 236 ~~~~Gdf~~~~~~~~~~~~~~~~~~~~------~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~ 302 (303) T protein:vir:97 236 LVIIGDFESMFKWGYAKQIPMEIIKYG------DPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGE 302 (303) T ss_pred EEEEeeccccEEEEEecCcEEEEeecc------CCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCC Confidence 678999965 67999999999998643 2445578899999999999999999999999999998888 No 24 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=5e-58 Score=334.70 Aligned_cols=288 Identities=16% Similarity=0.127 Sum_probs=238.7 Q ss_pred Cccch--hhhHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccccc Q lcl|NC_021307. 1 MAAGT--AFPVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDM 77 (310) Q Consensus 1 ~aa~~--~~~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~ 77 (310) |..+. .+.....+++.++++..|| +||+++..+|++.+++.++|+++|+++|++++.+++|+..+++.+.|++|++. T Consensus 91 lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~ 170 (401) T protein:vir:44 91 LRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGGTASGWVGETDT 170 (401) T ss_pred HhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCccceeeccccc Confidence 32221 2222234455566555544 67888899999999999999999999999999999999999999999999999 Q ss_pred cccc-ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccc-- Q lcl|NC_021307. 78 KPIT-KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDL-- 154 (310) Q Consensus 78 ~~~~-~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~-- 154 (310) +|++ .++|+++++.++|+++++++|+|+++|+.++++++|.++|++++++++|.+||+|+|++.|.+++........ T Consensus 171 ~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~ 250 (401) T protein:vir:44 171 RSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDK 250 (401) T ss_pred cCccccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeecccccccccc Confidence 9975 4899999999999999999999999999999999999999999999999999999999888776543221111 Q ss_pred ------------eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCcee Q lcl|NC_021307. 155 ------------TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRI 222 (310) Q Consensus 155 ------------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l 222 (310) ......++++ +.+++..+...+..+++|+||++++..|++++|++|+|+|+++...+. +++| T Consensus 251 ~~~~~~~~~~~t~~~~~~~~d~-i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~-----~~~l 324 (401) T protein:vir:44 251 ARAFGKLQHIVSGEATAVTADA-IIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQ-----PSSL 324 (401) T ss_pred ccccccccccccccccccCHHH-HHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCC-----Ccee Confidence 1122233444 457888899999999999999999999999999999999998776543 4689 Q ss_pred eeeeEEEeCCCCC---CceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|NC_021307. 223 LGRPTILSDHVAS---GTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIN 298 (310) Q Consensus 223 ~G~pv~~t~~~~~---~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~ 298 (310) +|+||+++++||. ++..++||||++ |.++++.++++..+ ..|++|+++||++.|+|+++. T Consensus 325 ~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~----------------~~~~~~~v~~~a~~r~d~~~~ 388 (401) T protein:vir:44 325 AGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRD----------------PYTNKPFVGFYTTKRTGGMLV 388 (401) T ss_pred cceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEeee----------------ccccCCcEEEEEEEEeccEEe Confidence 9999999999874 456678899986 66888988877543 236799999999999999999 Q ss_pred ccCceEEEeecC Q lcl|NC_021307. 299 DVEAFVKLTNAA 310 (310) Q Consensus 299 ~~~a~~~l~~aa 310 (310) +++||++|+.+| T Consensus 389 ~~~a~~~l~~~a 400 (401) T protein:vir:44 389 DSQAIKLLKIAA 400 (401) T ss_pred cccceEEEEeec Confidence 999999999999 No 25 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=1.3e-57 Score=332.50 Aligned_cols=282 Identities=21% Similarity=0.217 Sum_probs=237.2 Q ss_pred ccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecccccccccccceeeeEeeeeeeEe Q lcl|NC_021307. 18 GDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQVEPHKIAT 97 (310) Q Consensus 18 ~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~ 97 (310) ....+|.++||+++.+|++.+++.++++++|+++|++++.+++|+.++.++++|++|++.+|+++++|+++++.+||+++ T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~l~~~k~a~ 80 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEecCCccccccccceeEEEEeeeeEEE Confidence 34556778999999999999999999999999999998889999999999999999999999999999999999999999 Q ss_pred eehhhHHHhh---cChhHHHHHHHHHHHHHHHHHHHHHHHcccCccc--cccccccccc---cc---ceecccchHHHHH Q lcl|NC_021307. 98 IFVASAETVR---ANPGNYLGTMRTKVATAIALAFDEAALHGTDSPF--DKNLDETTKS---VD---LTPATGTTYDAIG 166 (310) Q Consensus 98 ~~~is~ell~---~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~--~~~~~~~~~~---~~---~~~~~~~~~~~~~ 166 (310) ++++|+|+++ ++..+++++|.++|++++++++|.++++|.+.+. +..+.+.... .. .........++.+ T Consensus 81 ~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) T protein:vir:16 81 GARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) T ss_pred eehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccccccHHHHH Confidence 9999999995 5568999999999999999999999999965333 2222221111 11 1112223344556 Q ss_pred HHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCC----ceeEee Q lcl|NC_021307. 167 VNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASG----TTVGYL 242 (310) Q Consensus 167 ~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~----~~~~~~ 242 (310) .+++.++...+..+++|+||++++.+|+++||++|+|+|++....+. +++|+|+||++++++|.+ +..+++ T Consensus 161 ~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~-----~~~l~G~PV~~~~~v~~~~~~~~~~~~~ 235 (298) T protein:vir:16 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGAT-----PDTINGLPVDVNKTVSDMSLTQRDRAII 235 (298) T ss_pred HHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCC-----CceecceeeEEecccccccCCCccEEEE Confidence 67888888888999999999999999999999999999987665543 468999999999999853 456788 Q ss_pred ecceee-eEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 243 GDFSQI-VWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 243 gd~~~~-~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) |||+++ .++.+++++++++++.. .+..++++|++|+++||+++|+||++.+|+||++|+++= T Consensus 236 GDfs~~~~~~~~~~~~~~~~~~~~------~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 236 GDFANGFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred eeccceEEEEEecCceEEEeeccC------CcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 999985 48899999999987652 344578899999999999999999999999999998888 No 26 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=5e-58 Score=334.75 Aligned_cols=288 Identities=17% Similarity=0.144 Sum_probs=236.4 Q ss_pred Cccch---hh-----hHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeee Q lcl|NC_021307. 1 MAAGT---AF-----PVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAW 71 (310) Q Consensus 1 ~aa~~---~~-----~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~ 71 (310) +...+ .| ..+..+....+++..|| ++|+++..+|++.+++.++|+++|+++|+.++..++|+..+++.+.| T Consensus 108 ~~~~~~~~af~~~l~~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~~~~~a~w 187 (425) T protein:vir:10 108 LRDPEYTEAFKAHVKRGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNMGGTTSGW 187 (425) T ss_pred cccHHHHHHHHHHhhhhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEcCCcceee Confidence 00000 00 01122344555555555 67888899999999999999999999999999999999999999999 Q ss_pred ecccccccccc-cceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccc Q lcl|NC_021307. 72 IGEGDMKPITK-GDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTK 150 (310) Q Consensus 72 v~Eg~~~~~~~-~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~ 150 (310) ++|++.+|+++ ++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|.+|++|+|++.|.++..... T Consensus 188 v~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~ 267 (425) T protein:vir:10 188 VGEASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIA 267 (425) T ss_pred eccccccccccccccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeeccc Confidence 99999999876 79999999999999999999999999999999999999999999999999999999988877655333 Q ss_pred cccc--------------eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccc Q lcl|NC_021307. 151 SVDL--------------TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTP 216 (310) Q Consensus 151 ~~~~--------------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~ 216 (310) .... ......+++++ .+++..+...+..+++|+||++++..|++++|++|+|+|+++...+ T Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~l-~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g---- 342 (425) T protein:vir:10 268 GGANAAKHPFGAIEVVNSGAAADITSDGI-IDLVYDLPSAFTGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAG---- 342 (425) T ss_pred cccccccccccccccccccccccccHHHH-HHHHhhhhhhhccCCEEEEchHHHHHHHHhhcCCCceeeccCccCC---- Confidence 2211 11222334444 5788889999999999999999999999999999999999876554 Q ss_pred cCCceeeeeeEEEeCCCCC---CceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEE Q lcl|NC_021307. 217 YREGRILGRPTILSDHVAS---GTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAE 292 (310) Q Consensus 217 ~~~~~l~G~pv~~t~~~~~---~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~ 292 (310) .+++|+|+||+++++||. +...++||||++ |.++++.++++..+ . .|.+|++.||++.| T Consensus 343 -~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d--~--------------~~~~~~~~~~~~~r 405 (425) T protein:vir:10 343 -QPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVLRD--P--------------YTAKPYVLFYTTKR 405 (425) T ss_pred -CCceecceeeEEecCcCCccCCccEEEEEehhccEEEEEecceEEEec--c--------------cccCCcEEEEEEEE Confidence 346899999999999984 446688999998 56888888766432 2 26789999999999 Q ss_pred eccEEeccCceEEEeecC Q lcl|NC_021307. 293 YGLLINDVEAFVKLTNAA 310 (310) Q Consensus 293 ~d~~v~~~~a~~~l~~aa 310 (310) +|+++.+++||++|+.+| T Consensus 406 ~d~~v~~~~A~~~l~~~a 423 (425) T protein:vir:10 406 VGGGLLNPEPMRAMKVAA 423 (425) T ss_pred eccEeecccceEEEEeec Confidence 999999999999999999 No 27 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=1.1e-57 Score=332.93 Aligned_cols=288 Identities=15% Similarity=0.130 Sum_probs=241.1 Q ss_pred Cccchhh--hHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccccc Q lcl|NC_021307. 1 MAAGTAF--PVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDM 77 (310) Q Consensus 1 ~aa~~~~--~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~ 77 (310) |..|... .....++..++++..|| +||+++.++|++.+++.++|+++|+++|+.++.+++|+..+++.+.|++|++. T Consensus 90 l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~ 169 (407) T protein:vir:48 90 MRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNLGGTTSGWVGETDA 169 (407) T ss_pred HhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCcceeeeccccc Confidence 5544332 23344556666665555 67888899999999999999999999999999999999999999999999999 Q ss_pred ccccc-cceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccccccc--- Q lcl|NC_021307. 78 KPITK-GDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVD--- 153 (310) Q Consensus 78 ~~~~~-~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~--- 153 (310) +|+++ ++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|.+|++|+|++.|.+++....... T Consensus 170 ~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~ 249 (407) T protein:vir:48 170 RPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDK 249 (407) T ss_pred ccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeeccccccccc Confidence 99865 79999999999999999999999999999999999999999999999999999999988877653322111 Q ss_pred -----------ceecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCcee Q lcl|NC_021307. 154 -----------LTPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRI 222 (310) Q Consensus 154 -----------~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l 222 (310) ...+...++ +.+.++...+...+..+++|+||++++..|++++|.+|||+|+++...+. +++| T Consensus 250 ~~~~~~~~~~~~~~~~~~~~-d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g~-----~~~l 323 (407) T protein:vir:48 250 TRAFGKLQHIASGAASGVTA-DAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDNDGNYLWRPGIELGQ-----PSSL 323 (407) T ss_pred ccccccccccccccccccCh-HHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCC-----Ccee Confidence 111222334 44458888899999999999999999999999999999999998766543 4689 Q ss_pred eeeeEEEeCCCCC---CceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|NC_021307. 223 LGRPTILSDHVAS---GTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIN 298 (310) Q Consensus 223 ~G~pv~~t~~~~~---~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~ 298 (310) +|+||+++++||. ++..++||||+. |.++++.++++..+ ..|++|++.||++.|+|+++. T Consensus 324 ~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d----------------~~~~~~~~~~~~~~r~d~~v~ 387 (407) T protein:vir:48 324 AGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRD----------------PYTNKPFVGFYTTKRTGGMLV 387 (407) T ss_pred cceeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEee----------------ccccCCcEEEEEEEEeccEEe Confidence 9999999999984 456778999986 67888988877543 236789999999999999999 Q ss_pred ccCceEEEeecC Q lcl|NC_021307. 299 DVEAFVKLTNAA 310 (310) Q Consensus 299 ~~~a~~~l~~aa 310 (310) +++||++|+.+| T Consensus 388 ~~~a~~~l~~~a 399 (407) T protein:vir:48 388 DSQAIKLMKIGA 399 (407) T ss_pred cccceEEEEeec Confidence 999999999999 No 28 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=1.7e-57 Score=331.88 Aligned_cols=298 Identities=19% Similarity=0.248 Sum_probs=238.0 Q ss_pred CccchhhhHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhh-cceeecCCCceEEEEEcCCceeeeecccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRV-ARKIPMGSTGVKIPHWTGDVSAAWIGEGDMK 78 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~-~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~ 78 (310) ++..............++++..|| ++|+++..+|++.+++.++++++ ++.+|+.++.+++|+.++++.++|++|++.+ T Consensus 118 ~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~ 197 (435) T protein:vir:14 118 LAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDI 197 (435) T ss_pred HHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCCceEEEEEeCCcceeeeccCccc Confidence 111111112222334455555454 67778889999999999999998 7789998888999999999999999999999 Q ss_pred cccccceeeeEeeeeeeEeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHHcccCccc-ccccccccccccce Q lcl|NC_021307. 79 PITKGDMSVQQVEPHKIATIFVASAETVRANP--GNYLGTMRTKVATAIALAFDEAALHGTDSPF-DKNLDETTKSVDLT 155 (310) Q Consensus 79 ~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~--~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~-~~~~~~~~~~~~~~ 155 (310) |+++++|+++++.++|+++++++|+|+++|+. ++++++|.++|++++++++|++|++|+|++. |.++.......... T Consensus 198 ~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~ 277 (435) T protein:vir:14 198 PTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVI 277 (435) T ss_pred cccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccee Confidence 99999999999999999999999999999985 5699999999999999999999999999875 55544332222111 Q ss_pred -ecccchH---HHHHHHHHHHhhhh--cCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEE Q lcl|NC_021307. 156 -PATGTTY---DAIGVNALSLLVNA--GKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTIL 229 (310) Q Consensus 156 -~~~~~~~---~~~~~~~~~~l~~~--~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~ 229 (310) .....+. ...+.++...+... ++.+++|+||+.++..|++++|++|+|+|+.. .+++|+|+||++ T Consensus 278 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~---------~~g~l~G~Pv~~ 348 (435) T protein:vir:14 278 TASDASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPEL---------ANGMLKGYPVGK 348 (435) T ss_pred ccccccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhccCCceeccCC---------CCCeeecceeEe Confidence 1122222 23334455555443 56688999999999999999999999999532 235799999999 Q ss_pred eCCCCCC------ceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCce Q lcl|NC_021307. 230 SDHVASG------TTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAF 303 (310) Q Consensus 230 t~~~~~~------~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~ 303 (310) ++.||.+ ...+++|||+++++++|++++++++++..+.. .....+.+|++|++.||++.|+||++.+|+|| T Consensus 349 ~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~~~~~~~~~~~~---~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~ 425 (435) T protein:vir:14 349 TTQVPINLGETGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKD---ADGHMVSAFQRDQTLIRVIAKNDFGPRHVESI 425 (435) T ss_pred eccccccccCCCccceEEEeecccEEEEEecccEEEEeccccccc---cccchhhhhhcChhheeeeeeeCceeecccce Confidence 9999863 34688999999999999999999999987654 34566789999999999999999999999999 Q ss_pred EEEeecC Q lcl|NC_021307. 304 VKLTNAA 310 (310) Q Consensus 304 ~~l~~aa 310 (310) ++|++++ T Consensus 426 ~~l~~~~ 432 (435) T protein:vir:14 426 AVLAGVA 432 (435) T ss_pred EEEecCC Confidence 9999999 No 29 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=3.9e-57 Score=329.85 Aligned_cols=299 Identities=22% Similarity=0.256 Sum_probs=233.3 Q ss_pred Ccc--ch----------------hh---hHHHHHhhc-cccCCCCce-echhhHHHHHHHHHhhchhhhhcceeecC--- Q lcl|NC_021307. 1 MAA--GT----------------AF---PVNHTQIAQ-TGDSMFQGY-LEPEQAQDYFAEAEKTSIVQRVARKIPMG--- 54 (310) Q Consensus 1 ~aa--~~----------------~~---~~~~~~~~~-~~~~~~g~~-i~~~~~~~ii~~~~~~s~l~~~~~~~~~~--- 54 (310) +++ |. .. ......+.. ++++.+|++ +|+++..+||+.+++.+++++++....+. T Consensus 302 l~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~ 381 (645) T protein:vir:93 302 LAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQ 381 (645) T ss_pred HHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccCchhhHHHHHHhhhhhhhHHhhcccccccccc Confidence 111 00 00 011111122 222334555 55667899999999999999997754332 Q ss_pred -CCceEEEEEcCCceeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021307. 55 -STGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAA 133 (310) Q Consensus 55 -~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~ 133 (310) ...+++|+.++++.++|++||+.+|+++++|+++++.+||+++++++|+||++|+.++++++|.++|++++++++|++| T Consensus 382 ~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~ 461 (645) T protein:vir:93 382 VPFNIRVHAQVSGGAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDF 461 (645) T ss_pred ccCceeeeeeecCcceEEeccCccccccccceeEEEEeeEEEEEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHh Confidence 2458999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HcccCccc-ccccccccccccceecccchHHHHHHHHHHHhhhh--cCCCCEEEEehHHHHHHHHhhhccCccccccccc Q lcl|NC_021307. 134 LHGTDSPF-DKNLDETTKSVDLTPATGTTYDAIGVNALSLLVNA--GKKWGATLLDDVAEPILNGAKDANGRPLFVESTY 210 (310) Q Consensus 134 l~G~g~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~ 210 (310) |+|+|++. +..+.+.........+.+....+ +..++..+... ...+++|+|||.++.+|+++||++|+++|.. .. T Consensus 462 l~g~g~~~~~~~p~gi~~~~~~~~~~~~~~~d-~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~~~-~~ 539 (645) T protein:vir:93 462 VDPKKAAVADVSPASITHDVKGTASSGNPDAD-AEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNALGQKEYPD-MT 539 (645) T ss_pred hcCCCcccCCccccceeccccccccccchHHH-HHHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccCCceeecC-CC Confidence 99988763 33344444444444444444433 34555555433 3456899999999999999999999999843 21 Q ss_pred cccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccc--------cccchhhhhc Q lcl|NC_021307. 211 EAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQ--------APNFVSLWQH 282 (310) Q Consensus 211 ~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~--------~~~~~~~~~~ 282 (310) ..+++|+|+||++++++|++ +++|||+++++++++++.+..++++.+...... ...++++|++ T Consensus 540 ------~~~~tL~G~PV~~s~~vp~~---~~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~ 610 (645) T protein:vir:93 540 ------LLGGSFQGLPVIVSQYVGDQ---LVLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQT 610 (645) T ss_pred ------CCCceeeceeeEEeccCCcc---eeEeccccEEEEEecceEEEeecceeEEEeecccccccccccccchhHhhc Confidence 12358999999999999975 468999999999999999999999998765433 3346899999 Q ss_pred CcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 283 NLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 283 ~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) |+++||+++|+||+++||+||++|+++= T Consensus 611 d~vaira~~r~d~~~~~p~a~~~lt~~~ 638 (645) T protein:vir:93 611 GSVAIRAERWINWRRRRTAAVAVITGVN 638 (645) T ss_pred CceEEEEEEEEcceeeCccceEEEeccc Confidence 9999999999999999999999999766 No 30 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=2.4e-57 Score=330.99 Aligned_cols=295 Identities=18% Similarity=0.179 Sum_probs=231.7 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCC-ceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGD-VSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~-~~a~~v~Eg~~~~ 79 (310) ...+.....+.......+++.+|+++|+++..+||+.+++.++|+++++++|++++.++||+.+++ +.++|++||+.+| T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~ 217 (497) T protein:vir:78 138 FADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP 217 (497) T ss_pred HhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccc Confidence 122222222222333445566777899999999999999999999999999999999999998764 6899999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceeccc Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATG 159 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~ 159 (310) +++++|+++++.+||++++++||+|+++|+. +++++|.++|++++++++|.+||+|+|++.|.++.............. T Consensus 218 ~s~~~f~~i~~~~~k~a~~~~iS~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~ 296 (497) T protein:vir:78 218 FSSEEFARVYEQVGKVANALTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASS 296 (497) T ss_pred cccccceeeEeeeeeeEeecHhHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCccccccccccccccccccccc Confidence 9999999999999999999999999999874 799999999999999999999999999998877664332221111000 Q ss_pred c----------------------------------------------------hHHHH---HHHHHHHh-hhhcCCCCEE Q lcl|NC_021307. 160 T----------------------------------------------------TYDAI---GVNALSLL-VNAGKKWGAT 183 (310) Q Consensus 160 ~----------------------------------------------------~~~~~---~~~~~~~l-~~~~~~~~~~ 183 (310) . +..+. +......+ ...+..+++| T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 376 (497) T protein:vir:78 297 LFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAV 376 (497) T ss_pred chhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeE Confidence 0 00001 11111222 2234556789 Q ss_pred EEehHHHHHHHHhhhccCccccccccccc-cccccCCceeeeeeEEEeCCCCCCceeEeeeccee--eeEEeecccEEEE Q lcl|NC_021307. 184 LLDDVAEPILNGAKDANGRPLFVESTYEA-VTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQ--IVWGQVGGLSFDV 260 (310) Q Consensus 184 ~~~~~~~~~l~~l~d~~g~~~~~~~~~~~-~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~--~~~~~~~~~~v~~ 260 (310) +||+.+|..|+++||++|+|+|++..... ......+.+|+|+||+++++||.++ +++|||+. +.++++.++++.+ T Consensus 377 vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~--~~~Gd~~~~~~~i~~r~~~~v~~ 454 (497) T protein:vir:78 377 VMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--ILVGHFAPSVIQTARREGVTMQM 454 (497) T ss_pred EEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCc--eEEeecccceEEEEEecccEEEe Confidence 99999999999999999999998765432 2233456789999999999999886 46899987 4478899999998 Q ss_pred eecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 261 SDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +++.. ..|++|+++||++.|+||.|.+|+||++|+.++ T Consensus 455 ~~~~~------------~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~ 492 (497) T protein:vir:78 455 TNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) T ss_pred ecccc------------hhhhcCcEEEEEEEeecceeeccccEEEEEecC Confidence 86533 359999999999999999999999999999999 No 31 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=2.4e-57 Score=330.99 Aligned_cols=295 Identities=18% Similarity=0.179 Sum_probs=231.7 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCC-ceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGD-VSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~-~~a~~v~Eg~~~~ 79 (310) ...+.....+.......+++.+|+++|+++..+||+.+++.++|+++++++|++++.++||+.+++ +.++|++||+.+| T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~ 217 (497) T protein:vir:10 138 FADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP 217 (497) T ss_pred HhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccc Confidence 122222222222333445566777899999999999999999999999999999999999998764 6899999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceeccc Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATG 159 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~ 159 (310) +++++|+++++.+||++++++||+|+++|+. +++++|.++|++++++++|.+||+|+|++.|.++.............. T Consensus 218 ~s~~~f~~i~~~~~k~a~~~~iS~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~ 296 (497) T protein:vir:10 218 FSSEEFARVYEQVGKVANALTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASS 296 (497) T ss_pred cccccceeeEeeeeeeEeecHhHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCccccccccccccccccccccc Confidence 9999999999999999999999999999874 799999999999999999999999999998877664332221111000 Q ss_pred c----------------------------------------------------hHHHH---HHHHHHHh-hhhcCCCCEE Q lcl|NC_021307. 160 T----------------------------------------------------TYDAI---GVNALSLL-VNAGKKWGAT 183 (310) Q Consensus 160 ~----------------------------------------------------~~~~~---~~~~~~~l-~~~~~~~~~~ 183 (310) . +..+. +......+ ...+..+++| T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 376 (497) T protein:vir:10 297 LFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAV 376 (497) T ss_pred chhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeE Confidence 0 00001 11111222 2234556789 Q ss_pred EEehHHHHHHHHhhhccCccccccccccc-cccccCCceeeeeeEEEeCCCCCCceeEeeeccee--eeEEeecccEEEE Q lcl|NC_021307. 184 LLDDVAEPILNGAKDANGRPLFVESTYEA-VTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQ--IVWGQVGGLSFDV 260 (310) Q Consensus 184 ~~~~~~~~~l~~l~d~~g~~~~~~~~~~~-~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~--~~~~~~~~~~v~~ 260 (310) +||+.+|..|+++||++|+|+|++..... ......+.+|+|+||+++++||.++ +++|||+. +.++++.++++.+ T Consensus 377 vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~--~~~Gd~~~~~~~i~~r~~~~v~~ 454 (497) T protein:vir:10 377 VMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--ILVGHFAPSVIQTARREGVTMQM 454 (497) T ss_pred EEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCc--eEEeecccceEEEEEecccEEEe Confidence 99999999999999999999998765432 2233456789999999999999886 46899987 4478899999998 Q ss_pred eecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 261 SDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +++.. ..|++|+++||++.|+||.|.+|+||++|+.++ T Consensus 455 ~~~~~------------~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~ 492 (497) T protein:vir:10 455 TNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) T ss_pred ecccc------------hhhhcCcEEEEEEEeecceeeccccEEEEEecC Confidence 86533 359999999999999999999999999999999 No 32 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=3.2e-57 Score=330.28 Aligned_cols=298 Identities=21% Similarity=0.265 Sum_probs=238.1 Q ss_pred Cccc-------------hhhhHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhh-cceeecCCCceEEEEEcC Q lcl|NC_021307. 1 MAAG-------------TAFPVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRV-ARKIPMGSTGVKIPHWTG 65 (310) Q Consensus 1 ~aa~-------------~~~~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~-~~~~~~~~~~~~ip~~~~ 65 (310) |++. ..+..+.....+++++..|| ++|+++.++|++.+++.++++++ ++++|+.++.+++|+.++ T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~~~~p~~~~ 184 (435) T protein:vir:80 105 LAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKG 184 (435) T ss_pred HHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCceEEEEEeC Confidence 1111 11122222334455555555 56778889999999999999998 788999998999999999 Q ss_pred CceeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHHcccCccc-c Q lcl|NC_021307. 66 DVSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANP--GNYLGTMRTKVATAIALAFDEAALHGTDSPF-D 142 (310) Q Consensus 66 ~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~--~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~-~ 142 (310) ++.+.|++|++.+|+++++|+++++.++|++++++||+|+++|+. ++++++|.++|++++++++|++|++|+|++. | T Consensus 185 ~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p 264 (435) T protein:vir:80 185 GAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTP 264 (435) T ss_pred CcceeeeccCccccccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcc Confidence 999999999999999999999999999999999999999999984 5799999999999999999999999999875 4 Q ss_pred cccccccccccc-eecccchHHH---HHHHHHHHhhh--hcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccc Q lcl|NC_021307. 143 KNLDETTKSVDL-TPATGTTYDA---IGVNALSLLVN--AGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTP 216 (310) Q Consensus 143 ~~~~~~~~~~~~-~~~~~~~~~~---~~~~~~~~l~~--~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~ 216 (310) .++......... ....+.+.+. .+.++...+.. .+..+++|+||+.++..|++++|++|+|+|+.. T Consensus 265 ~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~-------- 336 (435) T protein:vir:80 265 KGLRFWALPGNVITASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPEL-------- 336 (435) T ss_pred cceeecccccceeecccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCceeccCC-------- Confidence 444433222221 2222333332 22334444433 356789999999999999999999999999532 Q ss_pred cCCceeeeeeEEEeCCCCCC------ceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEE Q lcl|NC_021307. 217 YREGRILGRPTILSDHVASG------TTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVE 290 (310) Q Consensus 217 ~~~~~l~G~pv~~t~~~~~~------~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~ 290 (310) .+++|+|+||+++++||.+ ...+++|||+++++++|++++++++++..+. +.....+++|++|+++||++ T Consensus 337 -~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~~---~~~~~~~~~f~~n~~~~r~~ 412 (435) T protein:vir:80 337 -ANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFGDVFIGEEETLEIDYSKEATYK---DADGHMVSAFQRDQTLIRVI 412 (435) T ss_pred -CCCeEeeeeeEEeccccccccCCCCcceEEEEEcccEEEEeecceEEEEecccccc---ccccchhhhhhcCcceeeee Confidence 1347999999999999863 3468899999999999999999999998765 34456678899999999999 Q ss_pred EEeccEEeccCceEEEeecC Q lcl|NC_021307. 291 AEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 291 ~~~d~~v~~~~a~~~l~~aa 310 (310) .|+||++.+|+||++|++.+ T Consensus 413 ~r~d~~~~~~~a~~~l~~~~ 432 (435) T protein:vir:80 413 AKNDFGPRHVESIAVLSGVA 432 (435) T ss_pred eeeCcEeecccceEEEeccC Confidence 99999999999999999999 No 33 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=6.5e-57 Score=328.62 Aligned_cols=282 Identities=22% Similarity=0.235 Sum_probs=237.0 Q ss_pred ccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecccccccccccceeeeEeeeeeeEe Q lcl|NC_021307. 18 GDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQVEPHKIAT 97 (310) Q Consensus 18 ~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~ 97 (310) .+..+|.++|+++..+|++.+++.|+++++|++++++++.+++|+.+++++++|++||+++|+++++|+++++.++|+++ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~ 80 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCccccccccceeEEEEeeeEEEE Confidence 33356778999999999999999999999999999999989999999999999999999999999999999999999999 Q ss_pred eehhhHHHhh---cChhHHHHHHHHHHHHHHHHHHHHHHHcccCccc--cccccccc---cccc---ceecccchHHHHH Q lcl|NC_021307. 98 IFVASAETVR---ANPGNYLGTMRTKVATAIALAFDEAALHGTDSPF--DKNLDETT---KSVD---LTPATGTTYDAIG 166 (310) Q Consensus 98 ~~~is~ell~---~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~--~~~~~~~~---~~~~---~~~~~~~~~~~~~ 166 (310) ++++|+|+++ ++..+++++|.++|++++++++|.++++|.+.+. +..+.... .... .........++.+ T Consensus 81 ~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) T protein:vir:94 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) T ss_pred eeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccccccHHHHH Confidence 9999999996 4567899999999999999999999999964332 22221111 1111 1112233345566 Q ss_pred HHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCC----ceeEee Q lcl|NC_021307. 167 VNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASG----TTVGYL 242 (310) Q Consensus 167 ~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~----~~~~~~ 242 (310) .+++.++...+..+++|+||++++.+|+++||++|+|+|++....+. +++|+|+||++++.+|.+ +..+++ T Consensus 161 ~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~-----~~tl~G~PV~~~~~v~~~~~~~~~~~~~ 235 (298) T protein:vir:94 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGAT-----PDTINGLPVDVNKTVSDMSLTQRDRAII 235 (298) T ss_pred HHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCC-----CceecceeeEEecccccccCCCccEEEE Confidence 78888888888999999999999999999999999999987665543 468999999999999853 456788 Q ss_pred ecceeee-EEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 243 GDFSQIV-WGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 243 gd~~~~~-~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) |||+..+ ++.+++++++++++.. .+..++++|++|+++||++.|+||++.+|+||++|+++= T Consensus 236 Gdfs~~~~~~~~~~~~~~~~~~~~------~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 236 GDFANGFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred eeccceEEEEEecCceEEEeecCC------CcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 9999864 8999999999987553 345577899999999999999999999999999998888 No 34 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=6.7e-57 Score=328.53 Aligned_cols=288 Identities=18% Similarity=0.165 Sum_probs=236.1 Q ss_pred Cccc---hhhhHHHHHhh-ccccCCCCceechhhHHHHHHHHHhh-chhhhhcceeecCCC-ceEEEEEcCCceeeeecc Q lcl|NC_021307. 1 MAAG---TAFPVNHTQIA-QTGDSMFQGYLEPEQAQDYFAEAEKT-SIVQRVARKIPMGST-GVKIPHWTGDVSAAWIGE 74 (310) Q Consensus 1 ~aa~---~~~~~~~~~~~-~~~~~~~g~~i~~~~~~~ii~~~~~~-s~l~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~E 74 (310) +.+| +....+..... ..+++.+|+++||++..++|..+.+. ++++++++++++.++ .+.+|+.++.+.++|++| T Consensus 93 ~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 172 (392) T protein:vir:13 93 LRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGE 172 (392) T ss_pred HhccchhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecc Confidence 2222 22222222222 23444556788999988877766554 567788899988654 589999999999999999 Q ss_pred cccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccccccc- Q lcl|NC_021307. 75 GDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVD- 153 (310) Q Consensus 75 g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~- 153 (310) ++.+|+++++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|.+||+|+|++.|.++........ T Consensus 173 ~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~ 252 (392) T protein:vir:13 173 TAEIPESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANA 252 (392) T ss_pred cccccccccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999999999999999999988877764332221 Q ss_pred ---ceecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEe Q lcl|NC_021307. 154 ---LTPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILS 230 (310) Q Consensus 154 ---~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t 230 (310) ...+...+++++ .++...+...+..+++|+||++++..|++++|++|+|+|+++...+. +.+|+|+||+++ T Consensus 253 ~~~~~~~~~~~~d~l-~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~-----~~~l~G~Pv~~~ 326 (392) T protein:vir:13 253 AFGEADADSKVSDAL-IDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGA-----PDTFNGKVVETD 326 (392) T ss_pred cccccccccccHHHH-HHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCC-----CceecceeeEEc Confidence 122233444444 57888888889999999999999999999999999999998776653 458999999999 Q ss_pred CCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 231 DHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 231 ~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +++|+++ ++||||++|+++++++++++.+.+. .|++|++.||++.|+|+++.+++||+.++.++ T Consensus 327 ~~~~~~~--i~~Gdf~~~~i~~~~~~~i~~~~~~--------------~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~ 390 (392) T protein:vir:13 327 DGMPADK--VLFADLSKYRVRFAGSLRVDRSVDA--------------KFSTDQIVYRFLQRADGLLVDARGAKVLTVTP 390 (392) T ss_pred CCCCCCc--EEEeeccceeEEeecceEEEeeccc--------------cccCCcEEEEEEEEeccEEecccceEEEEeec Confidence 9999875 5789999999999999999887654 38899999999999999999999999888777 No 35 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=7.5e-57 Score=328.28 Aligned_cols=301 Identities=26% Similarity=0.354 Sum_probs=241.0 Q ss_pred CccchhhhHHHHHh--hccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccc--- Q lcl|NC_021307. 1 MAAGTAFPVNHTQI--AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEG--- 75 (310) Q Consensus 1 ~aa~~~~~~~~~~~--~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg--- 75 (310) ||.-+.+....... ....+...++++|+++.++|++.+++.++++++|+++|++++.+++|+.++.+.++|++|| T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg~~~ 80 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEeecCcccc Confidence 66555443221111 1111223444889999999999999999999999999999999999999999999998766 Q ss_pred -----ccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccc Q lcl|NC_021307. 76 -----DMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTK 150 (310) Q Consensus 76 -----~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~ 150 (310) +.+|+++++|+++++.++|+++++++|+|+++|+.++++++|+++|++++++++|.+||+|+|++.+.++.+..+ T Consensus 81 ~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~ 160 (333) T protein:vir:78 81 EQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDT 160 (333) T ss_pred cccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcccccccc Confidence 456889999999999999999999999999999999999999999999999999999999999887765544332 Q ss_pred cccc---------eecccchHHHHHHHHHHHhhh-hcCCCCEEEEehHHHHHHHH---hhhccCcccccccccccccccc Q lcl|NC_021307. 151 SVDL---------TPATGTTYDAIGVNALSLLVN-AGKKWGATLLDDVAEPILNG---AKDANGRPLFVESTYEAVTTPY 217 (310) Q Consensus 151 ~~~~---------~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~~~~~~~~l~~---l~d~~g~~~~~~~~~~~~~~~~ 217 (310) .... ......+++++ .+++..+.. .+...+.|+|||.++..|++ ++|.+|+|+|++....+. T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~---- 235 (333) T protein:vir:78 161 DNVIANTTNVDYLQETGDPLLDRL-LDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQ---- 235 (333) T ss_pred cccccccccccccccccchhHHHH-HHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCC---- Confidence 1111 11122233333 355555543 45566789999999987764 679999999987665543 Q ss_pred CCceeeeeeEEEeCCCCCC-------ceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEE Q lcl|NC_021307. 218 REGRILGRPTILSDHVASG-------TTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVE 290 (310) Q Consensus 218 ~~~~l~G~pv~~t~~~~~~-------~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~ 290 (310) +++|+|+||++++++|.+ +..+++|||++++++++++++++++++..+. +....++++|++|++.||++ T Consensus 236 -~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~---~~~~~~~~~~~~~~v~~r~~ 311 (333) T protein:vir:78 236 -TGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLT---DSGSATVSMWQTNQIAILIE 311 (333) T ss_pred -CceeeceeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEecccccc---ccccceeehhhcCcEEEEEE Confidence 468999999999999865 3468899999999999999999999887543 44566778999999999999 Q ss_pred EEeccEEeccCceEEEeecC Q lcl|NC_021307. 291 AEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 291 ~~~d~~v~~~~a~~~l~~aa 310 (310) .|+||++.+++||++|++++ T Consensus 312 ~r~d~~v~~~~a~~~l~~~~ 331 (333) T protein:vir:78 312 VTFGWLLGDKQAFVKFVDDE 331 (333) T ss_pred EEEccEEecccceEEEeccC Confidence 99999999999999999999 No 36 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=8.7e-57 Score=327.93 Aligned_cols=289 Identities=16% Similarity=0.122 Sum_probs=242.8 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcC-Cceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTG-DVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~ 79 (310) ...+.....+......++++.+|+++||++..+|++.+++.++|+++|+++|++++.+++|+..+ .+.+.|++||+.+| T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~ 170 (385) T protein:vir:18 91 GKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKP 170 (385) T ss_pred HhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCcccc Confidence 11112222333455677777788899999999999999999999999999999988899999875 56889999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccce--ec Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLT--PA 157 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~--~~ 157 (310) +++++|+++++.++|++++++||+|+++++ ++++++|.++|++++++++|.+||+|+|++.++.+.......... .. T Consensus 171 ~~~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~ 249 (385) T protein:vir:18 171 ESDITFSKQTANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNA 249 (385) T ss_pred ccccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccc Confidence 999999999999999999999999999986 579999999999999999999999999999876544332222221 12 Q ss_pred ccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCc Q lcl|NC_021307. 158 TGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGT 237 (310) Q Consensus 158 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~ 237 (310) .+....+.+.+++..+...+..+++|+|||+++.+|++++|++|+|+|++.. .+.+++|+|+||++++++|.++ T Consensus 250 ~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~------~~~~~~l~G~pV~~~~~~p~~~ 323 (385) T protein:vir:18 250 TGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQ------AFTSNIMWGLPVVPTKAQAAGT 323 (385) T ss_pred cccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcc------cCCCceecceeeEEcCcCCCCc Confidence 2333445566888889999999999999999999999999999999997533 2335789999999999999876 Q ss_pred eeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 238 TVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 238 ~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +++|||+. +.++++++++++++++.. .+|++|++.||++.|+|+++.+|+||++++.+| T Consensus 324 --~~~gd~~~~~~~~~~~~~~v~~~~~~~------------~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~a 383 (385) T protein:vir:18 324 --FTVGGFDMASQVWDRMDATVEVSREDR------------DNFVKNMLTILCEERLALAHYRPTAIIKGTFSS 383 (385) T ss_pred --EEEeecccEEEEEEecceEEEEecccc------------chhhcCcEEEEEEEeeccEEecccceEEEEecc Confidence 57899987 678999999998876653 468999999999999999999999999999999 No 37 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=8.7e-57 Score=327.93 Aligned_cols=289 Identities=16% Similarity=0.122 Sum_probs=242.8 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcC-Cceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTG-DVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~ 79 (310) ...+.....+......++++.+|+++||++..+|++.+++.++|+++|+++|++++.+++|+..+ .+.+.|++||+.+| T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~ 170 (385) T protein:vir:19 91 GKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKP 170 (385) T ss_pred HhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCcccc Confidence 11112222333455677777788899999999999999999999999999999988899999875 56889999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccce--ec Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLT--PA 157 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~--~~ 157 (310) +++++|+++++.++|++++++||+|+++++ ++++++|.++|++++++++|.+||+|+|++.++.+.......... .. T Consensus 171 ~~~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~ 249 (385) T protein:vir:19 171 ESDITFSKQTANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNA 249 (385) T ss_pred ccccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccc Confidence 999999999999999999999999999986 579999999999999999999999999999876544332222221 12 Q ss_pred ccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCc Q lcl|NC_021307. 158 TGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGT 237 (310) Q Consensus 158 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~ 237 (310) .+....+.+.+++..+...+..+++|+|||+++.+|++++|++|+|+|++.. .+.+++|+|+||++++++|.++ T Consensus 250 ~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~------~~~~~~l~G~pV~~~~~~p~~~ 323 (385) T protein:vir:19 250 TGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQ------AFTSNIMWGLPVVPTKAQAAGT 323 (385) T ss_pred cccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcc------cCCCceecceeeEEcCcCCCCc Confidence 2333445566888889999999999999999999999999999999997533 2335789999999999999876 Q ss_pred eeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 238 TVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 238 ~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +++|||+. +.++++++++++++++.. .+|++|++.||++.|+|+++.+|+||++++.+| T Consensus 324 --~~~gd~~~~~~~~~~~~~~v~~~~~~~------------~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~a 383 (385) T protein:vir:19 324 --FTVGGFDMASQVWDRMDATVEVSREDR------------DNFVKNMLTILCEERLALAHYRPTAIIKGTFSS 383 (385) T ss_pred --EEEeecccEEEEEEecceEEEEecccc------------chhhcCcEEEEEEEeeccEEecccceEEEEecc Confidence 57899987 678999999998876653 468999999999999999999999999999999 No 38 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=6.8e-57 Score=328.53 Aligned_cols=281 Identities=20% Similarity=0.255 Sum_probs=233.4 Q ss_pred hhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecccccc-----cccccceeee Q lcl|NC_021307. 14 IAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMK-----PITKGDMSVQ 88 (310) Q Consensus 14 ~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~-----~~~~~~~~~i 88 (310) -+.++++.+|.+||+++..+|++.+++.++|+++++++++.++.+++|+.+..+.+.|++|++.. |.++++|+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~i 80 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceeeE Confidence 33344445566788999999999999999999999999999999999999999999999999864 5578999999 Q ss_pred EeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccc-----eecccchH- Q lcl|NC_021307. 89 QVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDL-----TPATGTTY- 162 (310) Q Consensus 89 ~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~-----~~~~~~~~- 162 (310) ++.+||++++++||+|+++|+.++++++|+++|++++++++|++||+|+|++.+..+......... ........ T Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) T protein:vir:25 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) T ss_pred EeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccccchhh Confidence 999999999999999999999999999999999999999999999999998776554432222111 11111111 Q ss_pred ---HHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCC--Cc Q lcl|NC_021307. 163 ---DAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVAS--GT 237 (310) Q Consensus 163 ---~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~--~~ 237 (310) .+.+..+...+....+..+.|+||+.++..|+++||++|+|+|++ .+|+|+||++++.+|. ++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~------------~~l~G~Pv~~~~~~~~~~~~ 228 (305) T protein:vir:25 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD------------DSFAGFRTFFNRNGAWDADA 228 (305) T ss_pred hHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecC------------CcccccceEEcCccCCCCCc Confidence 122233444455556677789999999999999999999999975 3699999999999874 45 Q ss_pred eeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 238 TVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 238 ~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ..+++|||++++++++++++++++++..+... ..++++|++|+++||++.|+||.+.||+||+++++.- T Consensus 229 ~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~----~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~ 297 (305) T protein:vir:25 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTG----ENQINLAERDMVALRLKARFAYVLGVSATAQGANKTP 297 (305) T ss_pred cEEEEEecceEEEEEecCeEEEEeeeeeeecC----CceeeeeecCcEEEEEEEeecceeeCcccEEEEcccc Confidence 67889999999999999999999999887653 4467899999999999999999999999999998853 No 39 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=1.9e-56 Score=326.12 Aligned_cols=283 Identities=23% Similarity=0.227 Sum_probs=231.4 Q ss_pred hccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecccccccccccceeeeEeeeee Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQVEPHK 94 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k 94 (310) |.+.++.+|++||+++.++|++.+++.++++++|+++|++++..+||+.+++++++|++|++.+|+++++|+++++.+|| T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~k 80 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVTSTPKK 80 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeEE Confidence 77888888889999999999999999999999999999998889999999999999999999999999999999999999 Q ss_pred eEeeehhhHHHhh---cChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccc-------cccceecccchHHH Q lcl|NC_021307. 95 IATIFVASAETVR---ANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTK-------SVDLTPATGTTYDA 164 (310) Q Consensus 95 ~~~~~~is~ell~---~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~-------~~~~~~~~~~~~~~ 164 (310) +++++++|+|+++ |+.++++++|.++|++++++++|+++|+|+|++.+..+.+..+ .++.........+. T Consensus 81 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) T protein:vir:99 81 AQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIANPDL 160 (311) T ss_pred EEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccchhHH Confidence 9999999999994 6789999999999999999999999999999776654433222 22222233344455 Q ss_pred HHHHHHHHhhhh--cCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCC------- Q lcl|NC_021307. 165 IGVNALSLLVNA--GKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVAS------- 235 (310) Q Consensus 165 ~~~~~~~~l~~~--~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~------- 235 (310) ++.+++..+... ....++|+||++++..|+++||++|+|+|++....+. +++|+|+||++++++|. T Consensus 161 ~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~-----~~~l~G~Pv~~s~~i~~~~~~~~~ 235 (311) T protein:vir:99 161 AIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIG-----VSSFEGIDASVSDTVNGGDEADPD 235 (311) T ss_pred HHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCC-----CceecceeeEeecccccccccccc Confidence 555666665544 3445679999999999999999999999987665543 46899999999998763 Q ss_pred -------CceeEeeecceee-eEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEe Q lcl|NC_021307. 236 -------GTTVGYLGDFSQI-VWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLT 307 (310) Q Consensus 236 -------~~~~~~~gd~~~~-~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~ 307 (310) +...+++|||+.. .++.++++++++++++. ...++++|++|+++||+++|+||++.|+ +|++++ T Consensus 236 ~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~ 307 (311) T protein:vir:99 236 DEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGD-------PDGQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIE 307 (311) T ss_pred cchhhccCcceEEEeeccccEEEEEecCceEEEeecCC-------CCcchhhhhcCcEEEEEEEeecceecCh-hHeeee Confidence 2334567888764 47777778777776542 2346789999999999999999999996 566666 Q ss_pred ecC Q lcl|NC_021307. 308 NAA 310 (310) Q Consensus 308 ~aa 310 (310) .++ T Consensus 308 ~~~ 310 (311) T protein:vir:99 308 NAV 310 (311) T ss_pred ccc Confidence 666 No 40 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=1.7e-56 Score=326.31 Aligned_cols=286 Identities=17% Similarity=0.165 Sum_probs=231.7 Q ss_pred Cccchhh---hHH-HHHhhccccCCCCceechhhHHHHHH-HHHhhchhhhhcceeecCCC-ceEEEEEcCCceeeeecc Q lcl|NC_021307. 1 MAAGTAF---PVN-HTQIAQTGDSMFQGYLEPEQAQDYFA-EAEKTSIVQRVARKIPMGST-GVKIPHWTGDVSAAWIGE 74 (310) Q Consensus 1 ~aa~~~~---~~~-~~~~~~~~~~~~g~~i~~~~~~~ii~-~~~~~s~l~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~E 74 (310) +.+|... ..+ .......+++.+|+++||++..++|. .+++.++++++|+++++.++ .+++|+.++.+.+.|++| T Consensus 93 ~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~wv~E 172 (390) T protein:vir:62 93 LRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSASIVGE 172 (390) T ss_pred HhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecc Confidence 2222111 111 11112334455667888888777554 56677778889999998764 489999999999999999 Q ss_pred cccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccccc--- Q lcl|NC_021307. 75 GDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKS--- 151 (310) Q Consensus 75 g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~--- 151 (310) ++.+|+++++|+++++.+||++++++||+|+++|+.++++++|.++|+++++.++|.+|++|+|.+ .++...... T Consensus 173 ~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~p--~Gi~~~~~~~~~ 250 (390) T protein:vir:62 173 TAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQP--RGILTDASPATA 250 (390) T ss_pred cccccccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCCcc--cccccccccccc Confidence 999999999999999999999999999999999999999999999999999999999999999854 444332221 Q ss_pred -ccceecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEe Q lcl|NC_021307. 152 -VDLTPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILS 230 (310) Q Consensus 152 -~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t 230 (310) ...+.....+++++ .++...+...+..+++|+||++++..|+++||.+|+|+|+++...+. +.+|+|+||+++ T Consensus 251 ~~~~~~~~~~~~~~l-~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~g~-----~~~l~G~Pv~~~ 324 (390) T protein:vir:62 251 TFLATDTDSKVSDAL-IDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQYLWQSGLTVGA-----PSLFNGKVVETD 324 (390) T ss_pred ceecccccccchHHH-HHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCCeeecCCcCCCc-----cceecccceEEe Confidence 11122233344444 47777888888899999999999999999999999999998876543 458999999999 Q ss_pred CCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 231 DHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 231 ~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +++|.+. ++||||+++++++++++.+..+.+. .|++|++.||++.|+|+++.+++||++|+.+| T Consensus 325 ~~~p~~~--i~~gd~s~~~i~~~~~~~v~~~~~~--------------~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~ 388 (390) T protein:vir:62 325 DGMPADK--ILFADLSKYRVRFAGSLRVDRSVDA--------------KFSTDQIVYRFLQRADGLLVDARGAKVLTVTP 388 (390) T ss_pred cCCCCcc--EEEeeccceeEEeecceEEEeeccc--------------cccCCcEEEEEEEEeCcEeechhheEEEEeec Confidence 9999875 5789999999999999999988653 48999999999999999999999999999888 No 41 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=8.5e-56 Score=322.51 Aligned_cols=287 Identities=17% Similarity=0.153 Sum_probs=236.1 Q ss_pred Ccc-----chhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCC-ceeeeecc Q lcl|NC_021307. 1 MAA-----GTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGD-VSAAWIGE 74 (310) Q Consensus 1 ~aa-----~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~-~~a~~v~E 74 (310) ... ..............+++.+|+++||+++.+||+.+++.++|+++|+++|++++.+++|+.++. +.+.|++| T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 174 (390) T protein:vir:10 95 WNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAE 174 (390) T ss_pred hhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCCcceeeecC Confidence 100 111111112233445666788999999999999999999999999999999999999998764 67999999 Q ss_pred cccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccc-ccccccccccc Q lcl|NC_021307. 75 GDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFD-KNLDETTKSVD 153 (310) Q Consensus 75 g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~-~~~~~~~~~~~ 153 (310) |+.+|+++++|+++++.++|+++++++|+|+++|+. +++++|.++|++++++++|++|++|+|++.. .++........ T Consensus 175 g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~ 253 (390) T protein:vir:10 175 GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYA 253 (390) T ss_pred CccccccccceeEEEEeeEEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCCcccccccccccccc Confidence 999999999999999999999999999999999875 8999999999999999999999999998874 33332221111 Q ss_pred c-eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCC Q lcl|NC_021307. 154 L-TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDH 232 (310) Q Consensus 154 ~-~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~ 232 (310) . ....+.+..+.+.++...+...++.+++|+|||+++.+|++++|++|+|+|++.... .+++|+|+||++++. T Consensus 254 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~~~------~~~~l~G~pv~~~~~ 327 (390) T protein:vir:10 254 APTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARGT------LTPTLWGLPVVATQA 327 (390) T ss_pred ccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCcCc------CCceecceeeEEcCC Confidence 1 122233344556688888999999999999999999999999999999999876533 245899999999999 Q ss_pred CCCCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeec Q lcl|NC_021307. 233 VASGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNA 309 (310) Q Consensus 233 ~~~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~a 309 (310) +|.++ +++|||++ +.++++.+++++.+++. ..|++|++.||++.|+||++.+|+||++++.| T Consensus 328 ~p~~~--~~~gdf~~~~~~~~~~~~~i~~~~~~-------------~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 328 MAPGE--FLVGAFDLAAQIFDQWDARVEIGYVN-------------DDFQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred CCCCc--EEEEeccceEEEEEecceEEEEeecc-------------cccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 99876 57899997 55889999999987654 24889999999999999999999999999999 No 42 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=7.2e-56 Score=322.91 Aligned_cols=288 Identities=15% Similarity=0.127 Sum_probs=237.3 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcC-Cceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTG-DVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~ 79 (310) +..+..... ..+...++++.+|+++||+++.+|++.+++.++|+++|+++|++++.+++|+.++ .+.+.|++|++.+| T Consensus 101 ~~~~~~~~~-~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~ 179 (395) T protein:vir:43 101 LRGSHRVSM-PRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYVRETGFVNNAAPVSEGTQKP 179 (395) T ss_pred hhhhhhhhh-hhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEEEEEecCCCceeeecCCcccc Confidence 322222222 2234445566677889999999999999999999999999999998899999866 46899999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccc-cccccccccee-- Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNL-DETTKSVDLTP-- 156 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~-~~~~~~~~~~~-- 156 (310) +++++|+++++.++|++++++||+|+++++ .+++++|.++|++++++++|.+||+|+|++.++.+ ........... T Consensus 180 ~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~ 258 (395) T protein:vir:43 180 YSDLTFELENAPVRTIAHLFKASRQILDDA-SALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGV 258 (395) T ss_pred ccccceeEEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccc Confidence 999999999999999999999999999986 47999999999999999999999999999887533 32222212111 Q ss_pred -cccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCC Q lcl|NC_021307. 157 -ATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVAS 235 (310) Q Consensus 157 -~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~ 235 (310) .......+.+.++...+...+..+++|+|||+++..|++++|++|+|+|++.. . ..+++|+|+||++++++|. T Consensus 259 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~-~-----~~~~~l~G~pVv~~~~~~~ 332 (395) T protein:vir:43 259 VVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIELNKDAENRYIIGSPQ-N-----GTTPTLWRLPVVETQAITQ 332 (395) T ss_pred ccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhccCCceeccccc-c-----CCCceecceeeEEcCCCCC Confidence 11222334456778888889999999999999999999999999999996532 2 2346899999999999998 Q ss_pred CceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 236 GTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 236 ~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++ +++|||++ +.++++.+++++++++.. .+|++|++.||++.|+||++.+++||++++.+| T Consensus 333 ~~--~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~ta 394 (395) T protein:vir:43 333 DE--FLTGAFSLGAQIFDRMDIEVLVSTEND------------KDFENNMVTIRAEERLAFAVYRPEAFVTGSLTA 394 (395) T ss_pred Cc--EEEEeccceEEEEEecceEEEEecccc------------chhhcCcEEEEEEEeeccEEecccceEEEEecc Confidence 86 57899998 558888999999886543 469999999999999999999999999999999 No 43 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=8.4e-56 Score=322.53 Aligned_cols=288 Identities=16% Similarity=0.140 Sum_probs=236.5 Q ss_pred Cc-----cchh-hhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcC-Cceeeeec Q lcl|NC_021307. 1 MA-----AGTA-FPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTG-DVSAAWIG 73 (310) Q Consensus 1 ~a-----a~~~-~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~ 73 (310) +. ..+. ...+......++++.+|++||+++..+|++.+++.++|+++++++|++++.+++|+... ++.+.|++ T Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~ 195 (418) T protein:vir:10 116 ARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTVETGFTNNAAAVA 195 (418) T ss_pred HhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeEEEEecCCCceeeec Confidence 11 1111 11223344455666677789999999999999999999999999999998899999866 57899999 Q ss_pred ccccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccc-ccccccccccc Q lcl|NC_021307. 74 EGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPF-DKNLDETTKSV 152 (310) Q Consensus 74 Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~-~~~~~~~~~~~ 152 (310) |++.+|+++++|+++++.++|++++++||+|+++++ .+++++|.++|++++++++|.+||+|+|++. |.++....... T Consensus 196 E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~ 274 (418) T protein:vir:10 196 EGAQKPTSDLKFNLKNQPVRTIAHLFKASRQILDDA-PALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAF 274 (418) T ss_pred cCccccccccceeeEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccc Confidence 999999999999999999999999999999999987 5899999999999999999999999999876 44444332222 Q ss_pred cce--ecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEe Q lcl|NC_021307. 153 DLT--PATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILS 230 (310) Q Consensus 153 ~~~--~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t 230 (310) ... ..+..+++ .+.+++..+...+..+++|+||+.++..|++++|++|+|+|++. .. ..+++|+|+||+++ T Consensus 275 ~~~~~~~~~~~~~-~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~-~~-----~~~~~l~G~pV~~~ 347 (418) T protein:vir:10 275 MPSITLANATPID-KIRLALLQAVLAEFPATGIVLNPIDWASIELTKDSQGRYIVGNP-VN-----GTTPRLWNLPVVET 347 (418) T ss_pred cccccccccccHH-HHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceecccc-cc-----CCCceecceeeEEc Confidence 221 12223334 44567778888899999999999999999999999999999642 22 23468999999999 Q ss_pred CCCCCCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeec Q lcl|NC_021307. 231 DHVASGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNA 309 (310) Q Consensus 231 ~~~~~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~a 309 (310) ++||.++ +++|||++ ++++++++++++++++.. .+|++|++.||++.|+||++.+|+||++++.+ T Consensus 348 ~~~p~~~--~~~gd~s~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~ 413 (418) T protein:vir:10 348 QAMTANE--FLVGAFSMAAQIFDRMEIEVLLSTENV------------DDFEKNMVSIRAEERLALAVYRPESFVTGALV 413 (418) T ss_pred CCCCCCc--EEEeeccceEEEEEecceEEEEecccc------------hhhhcCceEEEEEEeeccEEecccceEEEEec Confidence 9999886 57899997 668899999999876643 46899999999999999999999999999998 Q ss_pred C Q lcl|NC_021307. 310 A 310 (310) Q Consensus 310 a 310 (310) + T Consensus 414 ~ 414 (418) T protein:vir:10 414 E 414 (418) T ss_pred c Confidence 8 No 44 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=5.9e-56 Score=323.37 Aligned_cols=292 Identities=13% Similarity=0.054 Sum_probs=236.5 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHH-HHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYF-AEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii-~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) +...+...........++++.+|.+||+++..++| +.+++.++++++++++++ ++.+++|+.++++.++|++||+.+| T Consensus 237 l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~-~g~~~~~~~~~~~~a~~v~Eg~~~~ 315 (543) T protein:vir:81 237 LTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA-TGDVWHGVSSAAVQWSWDAEFEEVS 315 (543) T ss_pred hhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccC-CcceEEEEecCCcceeecccCcccc Confidence 22223333333334445566667778889888865 667888999999998776 5668999999999999999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccc-ccccccccccccc---e Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPF-DKNLDETTKSVDL---T 155 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~-~~~~~~~~~~~~~---~ 155 (310) +++++|+++++.++|++++++||+|+++|+ ++++++|.+.|++++++++|.+||+|+|++. |.++......... + T Consensus 316 ~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~ 394 (543) T protein:vir:81 316 DDSPEFGQPEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEIAP 394 (543) T ss_pred ccccccceeeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhcccccccccc Confidence 999999999999999999999999999987 6999999999999999999999999999874 5555433222111 1 Q ss_pred ecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCC Q lcl|NC_021307. 156 PATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVAS 235 (310) Q Consensus 156 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~ 235 (310) ........+.+.++...+...+..+++|+||++++..|++++|++|+|+|.+... +.+++|+|+||+++++||. T Consensus 395 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~------g~~~~l~G~pv~~~~~~~~ 468 (543) T protein:vir:81 395 VTAETFALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGGAGLWTTIGN------GEPSQLLGRPVGEAEAMDA 468 (543) T ss_pred cccccccHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCCCceeccCcCC------CCCccccceeeEEeccccc Confidence 1222333455567888899999999999999999999999999999999976432 2246899999999999875 Q ss_pred --------CceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEe Q lcl|NC_021307. 236 --------GTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLT 307 (310) Q Consensus 236 --------~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~ 307 (310) +...++||||++++++++++++++++.+... .+.|.+|++.|+++.|+||++.+++||++|+ T Consensus 469 ~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~----------~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~ 538 (543) T protein:vir:81 469 NWNTSASADNFVLLYGNFQNYVIADRIGMTVEFIPHLFG----------TNRRPNGSRGWFAYYRMGADVVNPNAFRLLN 538 (543) T ss_pred cccccccCCcceEEEeeccceeEEeecccEEEEeccccc----------cchhhcCceEEEEEEeeccEeecccceEEEE Confidence 3456889999999999999999998865431 1357899999999999999999999999999 Q ss_pred ecC Q lcl|NC_021307. 308 NAA 310 (310) Q Consensus 308 ~aa 310 (310) .++ T Consensus 539 ~~~ 541 (543) T protein:vir:81 539 VET 541 (543) T ss_pred ecc Confidence 999 No 45 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=1.5e-55 Score=321.11 Aligned_cols=287 Identities=16% Similarity=0.140 Sum_probs=236.0 Q ss_pred Ccc-----chhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCC-ceeeeecc Q lcl|NC_021307. 1 MAA-----GTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGD-VSAAWIGE 74 (310) Q Consensus 1 ~aa-----~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~-~~a~~v~E 74 (310) +.. .............++++.+|+++|++++.+|++.+++.++|+++++++|++++.+++|+.++. +.+.|++| T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 174 (390) T protein:vir:97 95 WNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAE 174 (390) T ss_pred hhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEEEecCCcceeeecC Confidence 000 011111222233345666777899999999999999999999999999999999999999764 68999999 Q ss_pred cccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccc-ccccccccccc Q lcl|NC_021307. 75 GDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFD-KNLDETTKSVD 153 (310) Q Consensus 75 g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~-~~~~~~~~~~~ 153 (310) |+.+|+++++|+++++.++|+++++++|+|+++|+ .+++++|.++|++++++++|++||+|+|++.. .++........ T Consensus 175 g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~ 253 (390) T protein:vir:97 175 GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYA 253 (390) T ss_pred CccccccccceeEEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeecccccc Confidence 99999999999999999999999999999999987 58999999999999999999999999998874 34332221111 Q ss_pred c-eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCC Q lcl|NC_021307. 154 L-TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDH 232 (310) Q Consensus 154 ~-~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~ 232 (310) . ...++....+.+.++...++..+..+++|+|||+++.+|++++|++|+|+|++.... .+++|+|+||++++. T Consensus 254 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~------~~~~l~G~pV~~~~~ 327 (390) T protein:vir:97 254 APTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARGT------LTPTLWGLPVVATQA 327 (390) T ss_pred ccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCccCC------CCceecceeeEEcCC Confidence 1 122333444556678889999999999999999999999999999999999864422 246899999999999 Q ss_pred CCCCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeec Q lcl|NC_021307. 233 VASGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNA 309 (310) Q Consensus 233 ~~~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~a 309 (310) +|.++ +++|||++ +.++++++++++.+++. ..|++|+++||++.|+||++.+|+||++++.| T Consensus 328 ~~~~~--~~~gd~~~~~~~~~~~~~~i~~~~~~-------------~~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 328 MAPGE--FLVGAFDLAAQIFDQWDARVEIGYVN-------------DDFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred CCCCc--EEEEeccceEEEEEecceEEEEeecc-------------cccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 99876 57899997 66889999999987654 24899999999999999999999999999999 No 46 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=1.7e-55 Score=320.90 Aligned_cols=287 Identities=16% Similarity=0.141 Sum_probs=235.2 Q ss_pred Cccc---hhhh-HHHHH-hhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCC-ceeeeecc Q lcl|NC_021307. 1 MAAG---TAFP-VNHTQ-IAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGD-VSAAWIGE 74 (310) Q Consensus 1 ~aa~---~~~~-~~~~~-~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~-~~a~~v~E 74 (310) +..+ .... ..... ...++++.+|+++||+++.+|++.+++.++|+++++++|++++.+++|+.++. +.+.|++| T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 174 (390) T protein:vir:81 95 WNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAE 174 (390) T ss_pred HhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEEEEEecCCcceeeecC Confidence 0000 1111 11112 22345667778999999999999999999999999999999999999998764 57999999 Q ss_pred cccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccc-cccccccccc Q lcl|NC_021307. 75 GDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDK-NLDETTKSVD 153 (310) Q Consensus 75 g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~-~~~~~~~~~~ 153 (310) |+.+|+++++|+++++.++|+++++++|+|+++|+ ++++++|.++|++++++++|++|++|+|++..+ ++........ T Consensus 175 g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~ 253 (390) T protein:vir:81 175 GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYA 253 (390) T ss_pred CcccccccceeeEEEEeeeEEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeecccccc Confidence 99999999999999999999999999999999997 589999999999999999999999999988743 3332222111 Q ss_pred c-eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCC Q lcl|NC_021307. 154 L-TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDH 232 (310) Q Consensus 154 ~-~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~ 232 (310) . ....+....+.+.++...+...+..+++|+|||+++..|++++|++|+|+|++.... .+++|+|+||+++++ T Consensus 254 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~------~~~~l~G~pv~~~~~ 327 (390) T protein:vir:81 254 APTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELAKDANNQYLIGNARGT------LTPTLWGLPVVATQA 327 (390) T ss_pred cccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcccc------cCceecceeeEEcCC Confidence 1 112233334556688889999999999999999999999999999999999864432 245899999999999 Q ss_pred CCCCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeec Q lcl|NC_021307. 233 VASGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNA 309 (310) Q Consensus 233 ~~~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~a 309 (310) +|.++ +++|||++ +.++++++++++.+++. .+|++|++.||++.|+||++.+|+||++++.| T Consensus 328 ~p~~~--~~~gd~~~~~~~~~~~~~~v~~~~~~-------------~~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 328 MAPGE--FLVGAFDLAAQIFDQWDARVEIGYVG-------------EDFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred CCCCc--EEEEehhceEEEEEecceEEEEeccc-------------chhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 99886 57899998 56888999999887653 25899999999999999999999999999999 No 47 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=1.2e-54 Score=316.17 Aligned_cols=292 Identities=16% Similarity=0.105 Sum_probs=236.7 Q ss_pred CccchhhhH----HHHHh--hccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecc Q lcl|NC_021307. 1 MAAGTAFPV----NHTQI--AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGE 74 (310) Q Consensus 1 ~aa~~~~~~----~~~~~--~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E 74 (310) +..+..... ..... ..++.+.++.++|+++..+|++.+++.++++++|+++|++++...+|+...++.+.|++| T Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~e 222 (458) T protein:vir:10 143 VMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPDAGKATWVAA 222 (458) T ss_pred HHhhccchhhhhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecCCcceeeccc Confidence 111111111 11111 122334566688999999999999999999999999999999999999999999999999 Q ss_pred ccccccc------ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccc Q lcl|NC_021307. 75 GDMKPIT------KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDET 148 (310) Q Consensus 75 g~~~~~~------~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~ 148 (310) ++.++++ +++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|.+||+|+|++.|.++... T Consensus 223 ~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~ 302 (458) T protein:vir:10 223 STYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTL 302 (458) T ss_pred ccccccccccccccccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeec Confidence 9998854 5789999999999999999999999999999999999999999999999999999999888776654 Q ss_pred cccccc--------eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCc Q lcl|NC_021307. 149 TKSVDL--------TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREG 220 (310) Q Consensus 149 ~~~~~~--------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~ 220 (310) ...... ......++++ +.+++..+...+..+++|+||+++|..|++++|++|+|++.+....+ ...+.+. T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~-i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~-~~~~~~~ 380 (458) T protein:vir:10 303 ASEDSAKVVTEAKADGSVLVTAKT-ISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSV-KLQGQVG 380 (458) T ss_pred ccccccceeecccccccccccHHH-HHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccc-cccCcCc Confidence 332211 1112223444 45788889999999999999999999999999999999998765543 3334567 Q ss_pred eeeeeeEEEeCCCCCC--ceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEE Q lcl|NC_021307. 221 RILGRPTILSDHVASG--TTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLI 297 (310) Q Consensus 221 ~l~G~pv~~t~~~~~~--~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v 297 (310) +|+|+||+++++||++ ...+++|||++ +.++++.++++..+ +.+.+|++.||++.|+|+.+ T Consensus 381 ~l~G~pv~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~~~~v~~d----------------~~~~~~~~~~~~~~r~~~~v 444 (458) T protein:vir:10 381 RIYGLPVVVSEYFPAKANSAEFAVIVYKDNFVMPRQRAVTVERE----------------RQAGKQRDAYYVTQRVNLQR 444 (458) T ss_pred eecceeeEEccccccccCCcceEEEEecccEEEEEeeceEEEee----------------cccCCCceEEEEEEEecceE Confidence 8999999999999974 35667899975 67999999888653 22568999999999999999 Q ss_pred eccCceEEEeecC Q lcl|NC_021307. 298 NDVEAFVKLTNAA 310 (310) Q Consensus 298 ~~~~a~~~l~~aa 310 (310) .+|+||++.+.|| T Consensus 445 ~~~~a~v~~~~aa 457 (458) T protein:vir:10 445 YFANGVVSGTYAA 457 (458) T ss_pred ecccceEEEeecc Confidence 9999999999999 No 48 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=1.3e-54 Score=315.96 Aligned_cols=295 Identities=18% Similarity=0.174 Sum_probs=235.3 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCC----ceeeeecccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGD----VSAAWIGEGD 76 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~----~~a~~v~Eg~ 76 (310) +.+.+............+++.+++++|+++..+|++.+++.++|+++++++|++++.+++|+.... ..+.|++||+ T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~ 184 (413) T protein:vir:81 105 YVAPRVKAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGG 184 (413) T ss_pred hhhhHHHhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCCceeEEEeccccccccccceecCcc Confidence 222222222223333445566777899999999999999999999999999999998999998653 4579999999 Q ss_pred cccccc-cceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccc-cccccccccc Q lcl|NC_021307. 77 MKPITK-GDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKN-LDETTKSVDL 154 (310) Q Consensus 77 ~~~~~~-~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~-~~~~~~~~~~ 154 (310) .+|+++ ++|+++++.++|++++++||+|+++|+. .++++|.+.|++++++++|++||+|+|++.++. +......... T Consensus 185 ~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~-~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~ 263 (413) T protein:vir:81 185 KKPYMRFADFDIVTESLSKIAGLTKITDEMIEDYD-FLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTL 263 (413) T ss_pred cccccCcccceeeEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccccccc Confidence 999987 6899999999999999999999999985 599999999999999999999999999988754 3333333333 Q ss_pred eecccchHHHHHHHHHHHhhh-hcCCCCEEEEehHHHHHHHHhhhccCcccccccccccc--ccccCCceeeeeeEEEeC Q lcl|NC_021307. 155 TPATGTTYDAIGVNALSLLVN-AGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAV--TTPYREGRILGRPTILSD 231 (310) Q Consensus 155 ~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~--~~~~~~~~l~G~pv~~t~ 231 (310) ......+..+.+.++...+.. ..+..++|+||++++.+|+++||++|+|+|.+....+. ......++|+|+||++++ T Consensus 264 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~ 343 (413) T protein:vir:81 264 AVSNKDELADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQ 343 (413) T ss_pred cccccchhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeEEcC Confidence 333333344444445544433 34556679999999999999999999999987665432 223345689999999999 Q ss_pred CCCCCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 232 HVASGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 232 ~~~~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|.++ ++||||+. +.++++++++++++++.. ..|++|++.||++.|+|+++.+++||++|+.++ T Consensus 344 ~~~~~~--~~~gd~~~~~~~~~~~~~~v~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~ 409 (413) T protein:vir:81 344 VVPVGK--PVVGAFRSAASVLRKGGVRIDSTNTNV------------DDFENNLITVRAEERVGLMVTFPEAIVQLDVAE 409 (413) T ss_pred CCCccc--EEEEecccEEEEEEecceEEEEecccc------------chhhcCcEEEEEEEeeccEEecccceEEEEecC Confidence 999875 57899997 668888999999987654 358999999999999999999999999999998 No 49 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=8.6e-55 Score=317.00 Aligned_cols=290 Identities=12% Similarity=0.148 Sum_probs=231.1 Q ss_pred Cc-------cchhhhHHHHHhhccccCCC-CceechhhHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEcCC-ceee Q lcl|NC_021307. 1 MA-------AGTAFPVNHTQIAQTGDSMF-QGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG-VKIPHWTGD-VSAA 70 (310) Q Consensus 1 ~a-------a~~~~~~~~~~~~~~~~~~~-g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~ip~~~~~-~~a~ 70 (310) |. ..+.-.....+...++++.. |.+||+++..+|++.+++.++|+++|+++|++++. ..+|+..+. ..+. T Consensus 96 l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 175 (409) T protein:vir:45 96 MRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEVGV 175 (409) T ss_pred HHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCccccc Confidence 11 11111112233344445444 44688889999999999999999999999997765 445555443 4578 Q ss_pred eecccccccccccceeeeEeeeeeeE-eeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccc Q lcl|NC_021307. 71 WIGEGDMKPITKGDMSVQQVEPHKIA-TIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETT 149 (310) Q Consensus 71 ~v~Eg~~~~~~~~~~~~i~l~~~k~~-~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~ 149 (310) |++|++.+|+++++|+++++.++|++ +++++|+|+++|+.++++++|.++|+++++.++|.+|++|+|++.+..+.++. T Consensus 176 ~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil 255 (409) T protein:vir:45 176 LLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLA 255 (409) T ss_pred cccccccccccccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceee Confidence 99999999999999999999999985 67899999999999999999999999999999999999999987554433332 Q ss_pred c---ccccee-cccchHHHHHHHHHHHhhhhcCCCCEE--EEehHHHHHHHHhhhccCccccccccccccccccCCceee Q lcl|NC_021307. 150 K---SVDLTP-ATGTTYDAIGVNALSLLVNAGKKWGAT--LLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRIL 223 (310) Q Consensus 150 ~---~~~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~--~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~ 223 (310) + ...... ....+++ .+.++...+...+..++.| +||+.++.+|++++|++|+|+|+++...+. +.+|+ T Consensus 256 ~~~~~~~~~~~~~~~~~d-~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~-----~~~l~ 329 (409) T protein:vir:45 256 ASVTGTTQTAAANAVKWQ-EILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVA-----PASVL 329 (409) T ss_pred eccccccccccccccchH-HHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCCCceeeccCcCCCC-----Cceec Confidence 2 222222 2233444 4457888898888888876 679999999999999999999988776543 46899 Q ss_pred eeeEEEeCCCCC---CceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecc Q lcl|NC_021307. 224 GRPTILSDHVAS---GTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDV 300 (310) Q Consensus 224 G~pv~~t~~~~~---~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~ 300 (310) |+||+++++||. +...++||||+++++++++++.++.+++.+ |++|++.||++.|+|+++.++ T Consensus 330 G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~~~~~~~d~~--------------~~~~~~~~~~~~r~d~~~~~~ 395 (409) T protein:vir:45 330 NVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVERY--------------AEYDQTGFLAFHRFDCILEDT 395 (409) T ss_pred ceeeEEecCcCCccCCccEEEEeehhhhheeeccceEEEEeeccc--------------ccCCcEEEEEEEEeccEeech Confidence 999999999985 445678899999999999999999887643 688999999999999999999 Q ss_pred CceEEEeecC Q lcl|NC_021307. 301 EAFVKLTNAA 310 (310) Q Consensus 301 ~a~~~l~~aa 310 (310) +||++|+.|+ T Consensus 396 ~A~~~l~~k~ 405 (409) T protein:vir:45 396 SAIKALVGKG 405 (409) T ss_pred hheEEEEecc Confidence 9999999988 No 50 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=3.3e-54 Score=313.77 Aligned_cols=293 Identities=11% Similarity=0.095 Sum_probs=237.2 Q ss_pred Cccc----hhhhHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCC--CceEEEEEcCCceeeeec Q lcl|NC_021307. 1 MAAG----TAFPVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGS--TGVKIPHWTGDVSAAWIG 73 (310) Q Consensus 1 ~aa~----~~~~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~--~~~~ip~~~~~~~a~~v~ 73 (310) +..+ .........+..++++..|| ++|+++..+|++.+++.++|+++++++|+++ +.+.+|+..+.+.++|++ T Consensus 92 ~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~ 171 (404) T protein:vir:10 92 LKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLS 171 (404) T ss_pred HHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCcceeecc Confidence 2211 12223344455555555444 5788889999999999999999999999864 457788888889999999 Q ss_pred cccccccc--ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccccc Q lcl|NC_021307. 74 EGDMKPIT--KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKS 151 (310) Q Consensus 74 Eg~~~~~~--~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~ 151 (310) |++.++++ +++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.++.+...... T Consensus 172 e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~ 251 (404) T protein:vir:10 172 ENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANK 251 (404) T ss_pred ccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccc Confidence 99999875 5889999999999999999999999999999999999999999999999999999999887665544444 Q ss_pred cccee-cccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEe Q lcl|NC_021307. 152 VDLTP-ATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILS 230 (310) Q Consensus 152 ~~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t 230 (310) ..... .+..+++++...+...+...+..+++|+|||++|..|+++||++|+|+|.++...+ .+++|+|+||++. T Consensus 252 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~-----~~~~l~G~PV~~~ 326 (404) T protein:vir:10 252 FKKITLPKSPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDP-----TQYRFLGLPVIEL 326 (404) T ss_pred cceeeccccccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCC-----CCccccceeeEEe Confidence 33333 33444555544344467888888999999999999999999999999999876544 3468999999865 Q ss_pred CC-CC---CCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEE Q lcl|NC_021307. 231 DH-VA---SGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVK 305 (310) Q Consensus 231 ~~-~~---~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~ 305 (310) +. ++ .++..+++|||++ +.++++++++++++++.. ..|++|++.||++.|+|+++.+++||++ T Consensus 327 ~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~r~d~~v~~~~a~~~ 394 (404) T protein:vir:10 327 PNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIGA------------GAFETNTTKARIIMRIDGNVKDSEALLI 394 (404) T ss_pred cccccCCCCCccEEEEEeccccEEEEEecceEEEEecccc------------chhhcCceEEEEEEeeccEEecccceEE Confidence 44 33 3456678999997 668899999999886643 6799999999999999999999999999 Q ss_pred EeecC Q lcl|NC_021307. 306 LTNAA 310 (310) Q Consensus 306 l~~aa 310 (310) ++.++ T Consensus 395 ~~~~~ 399 (404) T protein:vir:10 395 AEIPV 399 (404) T ss_pred EEeec Confidence 99988 No 51 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=3.3e-54 Score=313.79 Aligned_cols=279 Identities=14% Similarity=0.110 Sum_probs=230.1 Q ss_pred Cccchhh-hHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCceE--EEEEcCCceeeeecccc Q lcl|NC_021307. 1 MAAGTAF-PVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVK--IPHWTGDVSAAWIGEGD 76 (310) Q Consensus 1 ~aa~~~~-~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~--ip~~~~~~~a~~v~Eg~ 76 (310) +.+..+. ......+..++++..|| ++|+++..+|++.+++.++|+++++++||+++... +++..+.+.++|++||+ T Consensus 76 ~~~~~~~l~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~ 155 (371) T protein:vir:81 76 VEAFVNHIRTRFRNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGA 155 (371) T ss_pred HHHHHHHHHHHHHHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeecccc Confidence 1111111 01123344555555444 67788899999999999999999999999876544 55566678899999999 Q ss_pred cccc-cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccce Q lcl|NC_021307. 77 MKPI-TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLT 155 (310) Q Consensus 77 ~~~~-~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~ 155 (310) .+|+ ++++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|.+|++|+|++.+.+. T Consensus 156 ~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~~~~---------- 225 (371) T protein:vir:81 156 AIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKAKTAI---------- 225 (371) T ss_pred ccccccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc---------- Confidence 9996 56999999999999999999999999999999999999999999999999999999998765432 Q ss_pred ecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCC Q lcl|NC_021307. 156 PATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVAS 235 (310) Q Consensus 156 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~ 235 (310) .+++++...+...+...+..+++|+||++++.+|++++|++|+|+|+++...+ .+++|+|+||++++++|. T Consensus 226 ----~~~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~-----~~~~l~G~pV~~~~~~~~ 296 (371) T protein:vir:81 226 ----ADLDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSP-----TGRQLLGLPVVIVSNKVL 296 (371) T ss_pred ----ccHHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCeeeecccCCC-----CCceecceeEEEeccccc Confidence 23445554455678888889999999999999999999999999998876543 357899999999999873 Q ss_pred ----------CceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|NC_021307. 236 ----------GTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFV 304 (310) Q Consensus 236 ----------~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~ 304 (310) +...+++|||++ +.++++.+++++++++.. +.|++|++.||++.|+||++.+|+||+ T Consensus 297 ~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~v~~~~~~r~d~~~~~~~a~~ 364 (371) T protein:vir:81 297 ANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAM------------DAFETDATLWRAIERMDVKMRDDEAFV 364 (371) T ss_pred CccccccccCCcceEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEEEEeeccEEecccceE Confidence 345678999998 568899999999886643 569999999999999999999999999 Q ss_pred EEeecC Q lcl|NC_021307. 305 KLTNAA 310 (310) Q Consensus 305 ~l~~aa 310 (310) +++.++ T Consensus 365 ~~~~~~ 370 (371) T protein:vir:81 365 FGEVQL 370 (371) T ss_pred EEEEec Confidence 999999 No 52 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=2.9e-54 Score=314.10 Aligned_cols=290 Identities=14% Similarity=0.101 Sum_probs=230.0 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeee---ccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWI---GEGDM 77 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v---~Eg~~ 77 (310) +.++.. ......+..++++.+|.+||+++.++|++.+++.++|+++|+++++.+ ..++|+....+.+.|. +|++. T Consensus 130 ~l~~~~-~~~e~~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~-~~~~p~~~~~~~a~~~~~~~e~~~ 207 (434) T protein:vir:62 130 YIVGNI-DEKEARALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTKE-NIKYPVLVKKAEAQGHKNERTNNE 207 (434) T ss_pred Hhcccc-chhhhhhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceeccCC-ceEEEEEecCCcccceeccccccc Confidence 001111 111223334455555556788889999999999999999999999865 5899999887777765 56888 Q ss_pred ccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceec Q lcl|NC_021307. 78 KPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPA 157 (310) Q Consensus 78 ~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~ 157 (310) +|+++++|+++++.+||+++++++|+|+++|+.++++++|.++|++++++++|++|++|+|++.++.+......+..... T Consensus 208 ~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~ 287 (434) T protein:vir:62 208 MPETDIEFDEIELSPTEFDALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTD 287 (434) T ss_pred ccccccceeeEEeeheeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeeccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999988766655555444433 Q ss_pred ccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCC- Q lcl|NC_021307. 158 TGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASG- 236 (310) Q Consensus 158 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~- 236 (310) ...+++ .+.++...+...+..+++|+||+.++..|+++||++|+|+|++..... .+.+.+|+|+||++++++|.+ T Consensus 288 ~~~~~d-~l~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~---~g~~~tl~G~pV~~~~~~~~~~ 363 (434) T protein:vir:62 288 EKNLYD-ALVKMKNTPVKEVRKKARWVLNTAALTKIETMKTDDGFPLLRPFNQAE---GGIGYTLLGFPVEEEDAIDIPD 363 (434) T ss_pred ccchhh-HHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCEeeccCCCcc---CCCCceecceeeEEecCccCcc Confidence 333444 445788889999999999999999999999999999999998754321 234568999999999999743 Q ss_pred ---ceeEeeecceeeeEEeecc-cEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEec-cCceEEE--eec Q lcl|NC_021307. 237 ---TTVGYLGDFSQIVWGQVGG-LSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIND-VEAFVKL--TNA 309 (310) Q Consensus 237 ---~~~~~~gd~~~~~~~~~~~-~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~-~~a~~~l--~~a 309 (310) ...++||||++++++++.+ ++++.+.+. +|.+|+|+||++.|+|+++.+ |.+++++ +++ T Consensus 364 ~~~~~~i~~Gdfs~~~i~~~~g~~~i~~~~~~--------------~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~ 429 (434) T protein:vir:62 364 SPDTPVFYFGDFSKFYIQDVIGSLEVQKLVEL--------------FSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLK 429 (434) T ss_pred CCCceEEEEeeccceEEEEeeceeEEEeehhh--------------hcccCceEEEEEeeecceeecCcccceEEEEEec Confidence 3557899999999998864 677766543 478999999999999999875 7765544 434 Q ss_pred C Q lcl|NC_021307. 310 A 310 (310) Q Consensus 310 a 310 (310) + T Consensus 430 ~ 430 (434) T protein:vir:62 430 A 430 (434) T ss_pred c Confidence 4 No 53 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=5.4e-54 Score=312.64 Aligned_cols=279 Identities=13% Similarity=0.034 Sum_probs=228.8 Q ss_pred Cccchhhh--------HHHHHhhccccC-CCCceechhhHHHHHHHHHhhchhhhhcceeecCC--CceEEEEEcCCcee Q lcl|NC_021307. 1 MAAGTAFP--------VNHTQIAQTGDS-MFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGS--TGVKIPHWTGDVSA 69 (310) Q Consensus 1 ~aa~~~~~--------~~~~~~~~~~~~-~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~--~~~~ip~~~~~~~a 69 (310) ...+.... ....+++.++++ .+|.+||+++..+|++.+++.++|+++|+++|+++ +.+.+|+..+.+.+ T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 180 (397) T protein:vir:12 101 GLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPF 180 (397) T ss_pred HHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCcce Confidence 11111111 111223334443 44556788889999999999999999999999875 44667777888899 Q ss_pred eeeccccccccc-ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccc Q lcl|NC_021307. 70 AWIGEGDMKPIT-KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDET 148 (310) Q Consensus 70 ~~v~Eg~~~~~~-~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~ 148 (310) +|++||+.+|++ .++|+++++.++|+++++++|+|+++|+.++++++|.++|++++++++|.+|++|+|++.+.+.. T Consensus 181 ~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~~~~g~~-- 258 (397) T protein:vir:12 181 SPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIASLKKVDID-- 258 (397) T ss_pred eeecccccccccccccceeEEeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc-- Confidence 999999999975 69999999999999999999999999999999999999999999999999999999987654332 Q ss_pred cccccceecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEE Q lcl|NC_021307. 149 TKSVDLTPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTI 228 (310) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~ 228 (310) +++++...+...++..+..+++|+||++++.+|++++|++|+|+|+++...+. +++|+|+||+ T Consensus 259 ------------~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~~g~-----~~~l~G~pv~ 321 (397) T protein:vir:12 259 ------------GLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGRYLLQPDPTNPT-----KKLLDGRPVV 321 (397) T ss_pred ------------cHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccCCceeecccccCCC-----CccccceeeE Confidence 24455444556788899999999999999999999999999999988765543 4689999999 Q ss_pred EeCCC-C---CCceeEeeecceee-eEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCce Q lcl|NC_021307. 229 LSDHV-A---SGTTVGYLGDFSQI-VWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAF 303 (310) Q Consensus 229 ~t~~~-~---~~~~~~~~gd~~~~-~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~ 303 (310) +++++ + .++..+++|||+++ .++++++++++++++.. ..|++|++.||++.|+|+++.+++|| T Consensus 322 ~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~~~~~~~a~ 389 (397) T protein:vir:12 322 PFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDTGA------------GAFETNSTKVRGIEREDVRKWDEDAV 389 (397) T ss_pred EecccccccCCCccEEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEEEEeeccEEecccce Confidence 87764 2 35566889999985 58889999999876643 56999999999999999999999999 Q ss_pred EEEeecC Q lcl|NC_021307. 304 VKLTNAA 310 (310) Q Consensus 304 ~~l~~aa 310 (310) ++++.+| T Consensus 390 ~~~~~t~ 396 (397) T protein:vir:12 390 VFGQITV 396 (397) T ss_pred EEEEEee Confidence 9999999 No 54 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=4.8e-54 Score=312.88 Aligned_cols=286 Identities=15% Similarity=0.144 Sum_probs=228.5 Q ss_pred Cccchhh------hHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecc Q lcl|NC_021307. 1 MAAGTAF------PVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGE 74 (310) Q Consensus 1 ~aa~~~~------~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E 74 (310) +.++..+ .........++++.+|.++|+++.++|++.+++.++++++|+++|+++ ..++|+..+.+.+.|++| T Consensus 119 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~g-~~~ip~~~~~~~a~~v~E 197 (425) T protein:vir:95 119 LKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVKG-TTRILVDTDTSPATWIEQ 197 (425) T ss_pred HhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecCc-eeEEEEecCCcccccccc Confidence 2222221 111222233444455567788889999999999999999999999865 579999999999999999 Q ss_pred cccccccc-cceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcc--ccccccccccc Q lcl|NC_021307. 75 GDMKPITK-GDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSP--FDKNLDETTKS 151 (310) Q Consensus 75 g~~~~~~~-~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~--~~~~~~~~~~~ 151 (310) ++.+|+++ ++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|++||+|+|++ .|.++...... T Consensus 198 ~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~ 277 (425) T protein:vir:95 198 SGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPP 277 (425) T ss_pred ccccccccccccceeeeeheeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccc Confidence 99999887 6899999999999999999999999999999999999999999999999999999976 45555533222 Q ss_pred cc-ceecccchHHHHHHHHHHHhhhhc--CCCCEEEEehHHH----HHHHHhhhccCccccccccccccccccCCceeee Q lcl|NC_021307. 152 VD-LTPATGTTYDAIGVNALSLLVNAG--KKWGATLLDDVAE----PILNGAKDANGRPLFVESTYEAVTTPYREGRILG 224 (310) Q Consensus 152 ~~-~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~~~----~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G 224 (310) .. .....+....+.+.++...+...+ ..+++|+||+.++ ..|+.++|++|+|+++.... ..++|+| T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~~~-------~~~~l~G 350 (425) T protein:vir:95 278 ENQVTVEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLPNL-------RTPDLLG 350 (425) T ss_pred ccccccccccchHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeeccCCC-------CCccccc Confidence 21 122222333344456776666554 3567899999985 34677899999999875432 2357999 Q ss_pred eeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|NC_021307. 225 RPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFV 304 (310) Q Consensus 225 ~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~ 304 (310) +||++++++|.+. ++||||+++++++++++++.++++.. |.+|++.||++.|+|+++.+|+||+ T Consensus 351 ~pvv~~~~~~~~~--i~~Gd~~~~~~~~~~~~~i~~~~~~~--------------f~~~~~~~~~~~r~d~~~~~~~a~~ 414 (425) T protein:vir:95 351 LRVVFNNFLDDDT--VLFGEFEQYTLVERENITIDSSTHVK--------------FTEDQTAFRGKGRFDGKPVKPEAFV 414 (425) T ss_pred eeeEEcCcCCCcc--EEEEecccEEEEeecceEEEeecccc--------------cccCceEEEEEEeeCcEeecccceE Confidence 9999999999875 67899999999999999999987643 8899999999999999999999999 Q ss_pred EEeecC Q lcl|NC_021307. 305 KLTNAA 310 (310) Q Consensus 305 ~l~~aa 310 (310) +++.+. T Consensus 415 ~~~i~~ 420 (425) T protein:vir:95 415 LVTITD 420 (425) T ss_pred EEEecC Confidence 999999 No 55 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=6.3e-54 Score=312.24 Aligned_cols=280 Identities=15% Similarity=0.096 Sum_probs=229.3 Q ss_pred Cccch-hhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEE--EcC-Cceeeeecccc Q lcl|NC_021307. 1 MAAGT-AFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPH--WTG-DVSAAWIGEGD 76 (310) Q Consensus 1 ~aa~~-~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~--~~~-~~~a~~v~Eg~ 76 (310) +.+.. +..........++++.+|.+||+++..+|++.+++.++|+++|+++||++....+++ ... .+.+.|++|++ T Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 172 (395) T protein:vir:38 93 MKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESA 172 (395) T ss_pred HHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCcccccccccc Confidence 22221 112222333444555666778889999999999999999999999999876555544 433 45679999999 Q ss_pred ccccc-ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccce Q lcl|NC_021307. 77 MKPIT-KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLT 155 (310) Q Consensus 77 ~~~~~-~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~ 155 (310) .+|++ +++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|.+|++|+|++.+... T Consensus 173 ~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~---------- 242 (395) T protein:vir:38 173 LIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPKKPT---------- 242 (395) T ss_pred ccccccccceeeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc---------- Confidence 99976 5999999999999999999999999999999999999999999999999999999998765332 Q ss_pred ecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCC- Q lcl|NC_021307. 156 PATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVA- 234 (310) Q Consensus 156 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~- 234 (310) ..+++++...+...+...+..+++|+||++++..|++++|++|+|+|+++...+ .+.+|+|+||+++++++ T Consensus 243 ---~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~-----~~~~l~G~pV~~~~~~~~ 314 (395) T protein:vir:38 243 ---ISQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDADGRYLMQPDVTSP-----DKYLIDGKPVIRIADKWL 314 (395) T ss_pred ---cccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCC-----CcceeccceeEEeccccc Confidence 123445543344578888899999999999999999999999999998866543 35689999999998764 Q ss_pred ---CCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 235 ---SGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 235 ---~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .++..++||||++ +.++++++++++++++.. ..|++|++.||++.|+|+++.+|+||++++.++ T Consensus 315 ~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 382 (395) T protein:vir:38 315 PDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGA------------GSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKT 382 (395) T ss_pred CcCCCcceEEEEeccccEEEEEecceEEEEecccc------------chhhcCceEEEEEEeeccEEecccceEEEEeec Confidence 2455688999997 678999999999987653 469999999999999999999999999999998 No 56 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=1.4e-53 Score=310.35 Aligned_cols=280 Identities=16% Similarity=0.134 Sum_probs=229.1 Q ss_pred Cccchhh-hHHHHHhhccccCCCC-ceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEE--Ec-CCceeeeeccc Q lcl|NC_021307. 1 MAAGTAF-PVNHTQIAQTGDSMFQ-GYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPH--WT-GDVSAAWIGEG 75 (310) Q Consensus 1 ~aa~~~~-~~~~~~~~~~~~~~~g-~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~--~~-~~~~a~~v~Eg 75 (310) +..+... .....++...+++..| .+||+++..+|++.+++.++|+++|+++|+++....+|. .. ....+.|++|+ T Consensus 101 ~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~ 180 (408) T protein:vir:10 101 VRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAED 180 (408) T ss_pred hhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeecCc Confidence 2222211 2223344445555544 467778889999999999999999999999876655554 33 34678999999 Q ss_pred cccccc-ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccc Q lcl|NC_021307. 76 DMKPIT-KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDL 154 (310) Q Consensus 76 ~~~~~~-~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 154 (310) +.+|++ .++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.+.. T Consensus 181 ~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~~~---------- 250 (408) T protein:vir:10 181 GKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKP---------- 250 (408) T ss_pred cccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc---------- Confidence 999985 589999999999999999999999999999999999999999999999999999999875432 Q ss_pred eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCC-- Q lcl|NC_021307. 155 TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDH-- 232 (310) Q Consensus 155 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~-- 232 (310) +..++++++..+...+...+..+++|+||++++..|++++|++|+|+|+++...+. +.+|+|+||+++++ T Consensus 251 ---~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~-----~~~l~G~PV~~~~~~~ 322 (408) T protein:vir:10 251 ---TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPN-----SYLIKGKQVIVVADRW 322 (408) T ss_pred ---ccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCcCCCC-----CceecceeeEEecccc Confidence 12345566555557788888999999999999999999999999999988765543 46899999998663 Q ss_pred CC---CCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEee Q lcl|NC_021307. 233 VA---SGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTN 308 (310) Q Consensus 233 ~~---~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~ 308 (310) +| ++...+++|||+. +.++++++++++.+++.+ ..|++|++.||++.|+|+++.+++||++++. T Consensus 323 ~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~------------~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~ 390 (408) T protein:vir:10 323 LPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDVKATDSEALVAGSF 390 (408) T ss_pred cCccCCCceEEEEEehhccEEEEEecceEEEEccccc------------chhhcCceEEEEEEeeccEEeccccEEEEEe Confidence 44 3456689999998 568999999999887653 5689999999999999999999999999998 Q ss_pred cC Q lcl|NC_021307. 309 AA 310 (310) Q Consensus 309 aa 310 (310) ++ T Consensus 391 ~~ 392 (408) T protein:vir:10 391 SA 392 (408) T ss_pred ec Confidence 88 No 57 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=4.8e-54 Score=312.88 Aligned_cols=281 Identities=20% Similarity=0.204 Sum_probs=227.5 Q ss_pred CccchhhhHH--HHHhhccccC-CCCceechhh-HHHHHHHHHhhchhhhh-cceeecCCCceEEEEEcCCceeeeeccc Q lcl|NC_021307. 1 MAAGTAFPVN--HTQIAQTGDS-MFQGYLEPEQ-AQDYFAEAEKTSIVQRV-ARKIPMGSTGVKIPHWTGDVSAAWIGEG 75 (310) Q Consensus 1 ~aa~~~~~~~--~~~~~~~~~~-~~g~~i~~~~-~~~ii~~~~~~s~l~~~-~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (310) .++|..++.+ ...+..++++ .+|.++|+++ ..+||+.+++.++++++ ++.+|+.++.++||+.+++++++|++|+ T Consensus 341 ~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g~~~ip~~~~~~~a~wv~E~ 420 (632) T protein:vir:96 341 EARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGED 420 (632) T ss_pred hhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCcceEEEEEeCCceeEeecCC Confidence 2222222221 1223344444 4455677776 57899999999999998 6789988889999999999999999999 Q ss_pred ccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccc-ccccccccccccc Q lcl|NC_021307. 76 DMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPF-DKNLDETTKSVDL 154 (310) Q Consensus 76 ~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~-~~~~~~~~~~~~~ 154 (310) +.+++++++|+++++.++|++++++||+|+|+|+.++++++|.++|.++++.++|++||+|+|+++ |.++...+..... T Consensus 421 ~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~ 500 (632) T protein:vir:96 421 EDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPAL 500 (632) T ss_pred ccccccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccce Confidence 999999999999999999999999999999999999999999999999999999999999999754 5555433332222 Q ss_pred eec-ccchHHHHHHHHHHHhhhhc--CCCCEEEEehHHHHHHHH--hhhccCccccccccccccccccCCceeeeeeEEE Q lcl|NC_021307. 155 TPA-TGTTYDAIGVNALSLLVNAG--KKWGATLLDDVAEPILNG--AKDANGRPLFVESTYEAVTTPYREGRILGRPTIL 229 (310) Q Consensus 155 ~~~-~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~~~~~l~~--l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~ 229 (310) ... ...+++ .+.++..++...+ ..+++|+||+.++..|++ ++|.+|+|+|.+ ++|+|+||++ T Consensus 501 ~~~~~~~~~~-~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~------------~~l~G~pv~~ 567 (632) T protein:vir:96 501 TYPAGGVDWA-SVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN------------NEVNGYRAEA 567 (632) T ss_pred ecccccCCHH-HHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeecC------------CeecccceEe Confidence 222 233333 3456777776654 457789999998877765 779999999863 3689999999 Q ss_pred eCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeec Q lcl|NC_021307. 230 SDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNA 309 (310) Q Consensus 230 t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~a 309 (310) ++.+|.++ +++|||+++++++++++.+.++++. .|.+|++.||++.|+|++++++++|+.++.+ T Consensus 568 s~~ip~~~--~~~gd~s~~~i~~~~~~~i~~~~~~--------------~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~ 631 (632) T protein:vir:96 568 SNQIPADT--WIFGDWSQIVIAMWGVLDLKVDPYT--------------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKG 631 (632) T ss_pred ccccccCc--EEEeecceEEEEEecceEEEEcccc--------------ccccCceEEEEEeecCceeechhhhhheeec Confidence 99999886 5789999999999999999987553 4789999999999999999999999999999 Q ss_pred C Q lcl|NC_021307. 310 A 310 (310) Q Consensus 310 a 310 (310) | T Consensus 632 A 632 (632) T protein:vir:96 632 A 632 (632) T ss_pred C Confidence 9 No 58 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=1.2e-53 Score=310.75 Aligned_cols=283 Identities=15% Similarity=0.066 Sum_probs=231.7 Q ss_pred Cccchhh--hHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCC--ceeeeecccc Q lcl|NC_021307. 1 MAAGTAF--PVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGD--VSAAWIGEGD 76 (310) Q Consensus 1 ~aa~~~~--~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~--~~a~~v~Eg~ 76 (310) +.+.... .........+.++..++.+|+++..+|++.+++.++++++|+++++.++.++||+.++. ..+.|++||+ T Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~ 171 (379) T protein:vir:10 92 IKEVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRENGAGEGAIGAQVEGA 171 (379) T ss_pred HHHHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEeecCCCcccccccCCc Confidence 0000000 01112223344445556789999999999999999999999999999999999998753 4568999999 Q ss_pred cccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccccccccee Q lcl|NC_021307. 77 MKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTP 156 (310) Q Consensus 77 ~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~ 156 (310) .+|+++++|+++++.++|++++++||+|+++|+. ++++||.++|++++++++|.+|+.|.++....+.... T Consensus 172 ~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~-~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~~~-------- 242 (379) T protein:vir:10 172 TKGQKDYDISMIDVNTDFIAGFTRYSKKMANNLP-FLTSFIPNALRRDYAKAENAAFNAVLAANATASTEII-------- 242 (379) T ss_pred cccccccceeeeEeeeeeEEeeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHhcccccccccccccc-------- Confidence 9999999999999999999999999999999975 6999999999999999999999999887543222211 Q ss_pred cccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCC Q lcl|NC_021307. 157 ATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASG 236 (310) Q Consensus 157 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~ 236 (310) ...... +.+.+++..+...++.+++|+|||.+|..|+++||++|+|+++++.... .+.+.+|+|+||++++.||.+ T Consensus 243 ~~~~~~-d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~---~~~~~~l~G~pvv~s~~~~ag 318 (379) T protein:vir:10 243 TNKNKV-EMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQ---DNGVLRINGIPLFRATWLAAN 318 (379) T ss_pred cCcccH-HHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccCC---CCCcceecceeeEecCCCCCC Confidence 112223 3445777888889999999999999999999999999999998766432 123458999999999999988 Q ss_pred ceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 237 TTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 237 ~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) + +++|||+++++..+.++.++++++.. ..|++|++.||++.|+|++|.+|+||++++.+| T Consensus 319 ~--~~~gdf~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~ 378 (379) T protein:vir:10 319 K--YYVGDWTRVTKVTTEGLSLEFSEVEG------------TNFVKNNITARIEAQVALAVEQPAALIFGDFTA 378 (379) T ss_pred c--eEEeecccEEEEEEeceEEEEeeccc------------ccccCCcEEEEEEEEeccEEecCccEEEEEecC Confidence 6 57899999999999999999886643 459999999999999999999999999999999 No 59 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=5.6e-53 Score=307.05 Aligned_cols=277 Identities=16% Similarity=0.105 Sum_probs=226.7 Q ss_pred CccchhhhHHHHHhhccccCC-CCceechhhHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEcC-Cceeeeecccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSM-FQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG--VKIPHWTG-DVSAAWIGEGD 76 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~-~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~ip~~~~-~~~a~~v~Eg~ 76 (310) +..+.. ........++++. +|.++|+++..+|++.+++.++|+++|+++|+++.. +.+|+... .+.+.|++||+ T Consensus 97 ~l~~~~--~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 174 (397) T protein:vir:49 97 LVRGRY--QNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAG 174 (397) T ss_pred HHhcch--hHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeeecCcc Confidence 111111 1122233444444 445678888999999999999999999999987544 55666543 46799999999 Q ss_pred cccc-cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccce Q lcl|NC_021307. 77 MKPI-TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLT 155 (310) Q Consensus 77 ~~~~-~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~ 155 (310) .+|+ ++++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|.+|++|+|++.+.... T Consensus 175 ~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~~~~~--------- 245 (397) T protein:vir:49 175 KIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAALPTKPTL--------- 245 (397) T ss_pred ccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc--------- Confidence 9997 579999999999999999999999999999999999999999999999999999999987653321 Q ss_pred ecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCC--C Q lcl|NC_021307. 156 PATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDH--V 233 (310) Q Consensus 156 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~--~ 233 (310) .+++ .+.++...+...+..+++|+||++++..|++++|++|+|+|+++...+ .+++|+|+||+++++ + T Consensus 246 ----~~~d-~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~-----~~~~l~G~PV~~~~~~~~ 315 (397) T protein:vir:49 246 ----TKWD-DIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSP-----TGYSIDGFAVKEVADRWL 315 (397) T ss_pred ----ccHH-HHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCC-----CCceecceeeEEeccccc Confidence 2334 445788899999999999999999999999999999999998876554 346899999998654 3 Q ss_pred CC---CceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeec Q lcl|NC_021307. 234 AS---GTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNA 309 (310) Q Consensus 234 ~~---~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~a 309 (310) |. ++..+++|||++ +.++++++++++++++.. +.|++|++.||++.|+|+++.+++||++++.+ T Consensus 316 ~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 316 ANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGG------------GAFETDTTKVRVIDRFDVVATDTEAFVPASFK 383 (397) T ss_pred ccccCCceeEEEeeccceEEEEeecceEEEEecccc------------chhhcCceeEEEEeeeCcEEecccceEEEEee Confidence 43 456688999997 668999999999876543 46899999999999999999999999999988 Q ss_pred C Q lcl|NC_021307. 310 A 310 (310) Q Consensus 310 a 310 (310) + T Consensus 384 ~ 384 (397) T protein:vir:49 384 A 384 (397) T ss_pred c Confidence 8 No 60 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=8.8e-53 Score=305.96 Aligned_cols=276 Identities=16% Similarity=0.114 Sum_probs=227.0 Q ss_pred CccchhhhHHHHHhhccccCC-CCceechhhHHHHHHHHHhhchhhhhcceeecCCCce--EEEEEcC-Cceeeeecccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSM-FQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGV--KIPHWTG-DVSAAWIGEGD 76 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~-~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~--~ip~~~~-~~~a~~v~Eg~ 76 (310) |..+.. ........+++. +|.+||+++..+|++.+++.++|+++++++|++++.. .+|+... .+.+.|++|++ T Consensus 98 l~~~~~---~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 174 (397) T protein:vir:49 98 VRGRYQ---NLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGG 174 (397) T ss_pred hhcchh---hHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeecccc Confidence 222211 122333344444 4456788889999999999999999999999986654 4555543 46789999999 Q ss_pred cccccc-cceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccce Q lcl|NC_021307. 77 MKPITK-GDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLT 155 (310) Q Consensus 77 ~~~~~~-~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~ 155 (310) .+|+++ ++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.+.. T Consensus 175 ~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~~~~----------- 243 (397) T protein:vir:49 175 QIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGTLPNKP----------- 243 (397) T ss_pred ccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc----------- Confidence 999875 79999999999999999999999999999999999999999999999999999999876532 Q ss_pred ecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCC--C Q lcl|NC_021307. 156 PATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDH--V 233 (310) Q Consensus 156 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~--~ 233 (310) +..++++ +.++...++..+..+++|+||++++..|++++|++|+|+|.++...+. +++|+|+||+++++ + T Consensus 244 --~~~~~d~-i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~-----~~~l~G~pV~~~~~~~~ 315 (397) T protein:vir:49 244 --TLAKWDD-IIDLQAKVDPAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPT-----GYSIDGFVVKEISDRFL 315 (397) T ss_pred --cccCHHH-HHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeecccccCCC-----CceecceeeEEeccccc Confidence 1223444 457888999999999999999999999999999999999988765543 46899999988654 3 Q ss_pred C---CCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeec Q lcl|NC_021307. 234 A---SGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNA 309 (310) Q Consensus 234 ~---~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~a 309 (310) | .++..++||||++ +.++++++++++++++.. ++|++|++.||++.|+|+++.+++||++++.+ T Consensus 316 ~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 316 PNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGG------------GAFETDTTKVRVIDRFDVVSTDTEAFVPASFK 383 (397) T ss_pred ccccCCceeEEEeeccceEEEEeecccEEEEecccc------------chhhcCeeeEEEEEeeccEEecccceEEEEec Confidence 4 3456788999997 678999999999886543 56999999999999999999999999999988 Q ss_pred C Q lcl|NC_021307. 310 A 310 (310) Q Consensus 310 a 310 (310) | T Consensus 384 ~ 384 (397) T protein:vir:49 384 A 384 (397) T ss_pred c Confidence 8 No 61 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=6.1e-53 Score=306.84 Aligned_cols=278 Identities=14% Similarity=0.064 Sum_probs=227.8 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEE---cCCceeeeeccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHW---TGDVSAAWIGEGDM 77 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~---~~~~~a~~v~Eg~~ 77 (310) +.++...... ......+++.+|.+||++++.+|++.+++.++|+++|+++|+++....+|+. +..+.++|++|++. T Consensus 97 ~~~~~~~~~~-~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~ 175 (397) T protein:vir:48 97 LVRGRYQNLL-DSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGS 175 (397) T ss_pred HHhhhhhHHH-HHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccccc Confidence 2222211111 1122223334556788999999999999999999999999998776666544 34456899999999 Q ss_pred cccc-ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccccccccee Q lcl|NC_021307. 78 KPIT-KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTP 156 (310) Q Consensus 78 ~~~~-~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~ 156 (310) ++++ +++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.+... T Consensus 176 ~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~~~~----------- 244 (397) T protein:vir:48 176 IGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIATLPTKPT----------- 244 (397) T ss_pred cccccccceeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc----------- Confidence 9987 5899999999999999999999999999999999999999999999999999999998765321 Q ss_pred cccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCC--CC Q lcl|NC_021307. 157 ATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDH--VA 234 (310) Q Consensus 157 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~--~~ 234 (310) ..++++ +.++...+...+..+++|+||++++..|+++||++|+|+|+++...+ .+++|+|+||+++++ ++ T Consensus 245 --~~~~d~-i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~-----~~~~l~G~PV~~~~~~~~~ 316 (397) T protein:vir:48 245 --LTKWDD-IIDLQAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSP-----TGYSIDGFAVKEVADRWLA 316 (397) T ss_pred --cccHHH-HHHHHHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCceeeccCcCCC-----CCceeccceeEEecccccC Confidence 123344 45788899999999999999999999999999999999998876554 346899999988654 33 Q ss_pred ---CCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 235 ---SGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 235 ---~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .++..+++|||+. +.++++++++++++++.. +.|++|++.||++.|+|+++.+|+||++++.++ T Consensus 317 ~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 384 (397) T protein:vir:48 317 NASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGG------------GAFETDTTKIRVIDRFDVVATDTESFVPASFKA 384 (397) T ss_pred CcCCCceEEEEEeccceEEEEeecceEEEEeccch------------hhhhcCceeEEEEeeeccEEecccceEEEEecc Confidence 3456788999997 568999999999886643 568999999999999999999999999999988 No 62 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=9.3e-53 Score=305.85 Aligned_cols=280 Identities=16% Similarity=0.154 Sum_probs=227.6 Q ss_pred Cccchhh-hHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCceE--EEEEcC-Cceeeeeccc Q lcl|NC_021307. 1 MAAGTAF-PVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVK--IPHWTG-DVSAAWIGEG 75 (310) Q Consensus 1 ~aa~~~~-~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~--ip~~~~-~~~a~~v~Eg 75 (310) +..+... ......+...+++..|| +||+++..+|++.+++.++|+++|+++|++++... +++..+ +..+.|++|+ T Consensus 101 ~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~ 180 (408) T protein:vir:74 101 VRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEED 180 (408) T ss_pred HhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccccccc Confidence 2222221 22233334445555444 67888889999999999999999999999876554 445444 4567899999 Q ss_pred ccccc-cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccc Q lcl|NC_021307. 76 DMKPI-TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDL 154 (310) Q Consensus 76 ~~~~~-~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 154 (310) +.+++ ++++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.+.. T Consensus 181 ~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~~~---------- 250 (408) T protein:vir:74 181 GKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKKP---------- 250 (408) T ss_pred cccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc---------- Confidence 99997 5699999999999999999999999999999999999999999999999999999999876532 Q ss_pred eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCC-- Q lcl|NC_021307. 155 TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDH-- 232 (310) Q Consensus 155 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~-- 232 (310) +..++++++..+...+...+..+++|+||++++.+|+++||++|+|+|+++...+. +++|+|+||+++++ T Consensus 251 ---~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~-----~~~l~G~pV~~~~~~~ 322 (408) T protein:vir:74 251 ---TIANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPN-----SYLIKGKQVIVVADRW 322 (408) T ss_pred ---ccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCCC-----CceecceeeEEecCcc Confidence 12234555544557888899999999999999999999999999999998765543 46899999998764 Q ss_pred CC---CCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEee Q lcl|NC_021307. 233 VA---SGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTN 308 (310) Q Consensus 233 ~~---~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~ 308 (310) +| .++..+++|||+. +.++++++++++++++.. ..|++|++.||++.|+|+++.+++||++++. T Consensus 323 ~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~ 390 (408) T protein:vir:74 323 LPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDVKATDSEALVAGSF 390 (408) T ss_pred cccccCCcceEEEEehhccEEEEEecceEEEEecccc------------chhhcceeeEEEEEeeCcEEecccceEEEEe Confidence 44 3456789999997 568999999999886533 5689999999999999999999999999998 Q ss_pred cC Q lcl|NC_021307. 309 AA 310 (310) Q Consensus 309 aa 310 (310) ++ T Consensus 391 ~~ 392 (408) T protein:vir:74 391 TA 392 (408) T ss_pred ec Confidence 87 No 63 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=7e-53 Score=306.53 Aligned_cols=269 Identities=17% Similarity=0.109 Sum_probs=226.3 Q ss_pred HHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEc-CCceeeeecccccccc-cccce Q lcl|NC_021307. 11 HTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG--VKIPHWT-GDVSAAWIGEGDMKPI-TKGDM 85 (310) Q Consensus 11 ~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~ip~~~-~~~~a~~v~Eg~~~~~-~~~~~ 85 (310) ..+.+.++++.+|+ +||+++..+|++.+++.++|+++|+++|+++.. +.+|+.. ..+.+.|++||+.+|+ ++++| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 44445555555555 678888999999999999999999999987654 5566664 4577999999999997 46999 Q ss_pred eeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHH Q lcl|NC_021307. 86 SVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAI 165 (310) Q Consensus 86 ~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (310) +++++.+||+++++++|+|+++|+.++++++|.+++++++++++|++|++|.++.... ....++++ T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~-------------~~~~~~d~- 146 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTK-------------PTLTKWDD- 146 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccccccc-------------ccccCHHH- Confidence 9999999999999999999999999999999999999999999999999998864421 12233444 Q ss_pred HHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCC--CC---CCceeE Q lcl|NC_021307. 166 GVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDH--VA---SGTTVG 240 (310) Q Consensus 166 ~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~--~~---~~~~~~ 240 (310) +.+++.+++..+..+++|+||++++..|+++||++|+|+|+++...+. +++|+|+||+++++ +| .++..+ T Consensus 147 i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~-----~~~l~G~Pv~~~~~~~~~~~~~~~~~~ 221 (293) T protein:vir:48 147 IIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPT-----GYSIAGFAVKEISDRWLPNASSGVMPL 221 (293) T ss_pred HHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCC-----CceecceeeEEecccccCCccCCceEE Confidence 457888999999999999999999999999999999999998765543 56899999987654 33 345678 Q ss_pred eeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 241 YLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 241 ~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|||++ +.++++++++++++++.. +.|++|++.||++.|+|+++.+++||++++.++ T Consensus 222 ~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~ 280 (293) T protein:vir:48 222 YFGDLKQAVTLFDRQQMSLLSTNIGG------------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKA 280 (293) T ss_pred EEEeccceEEEEEecceEEEEecccc------------hhhhcCeEEEEEEEeeCcEEecccceEEEEeec Confidence 9999998 568899999999887643 568999999999999999999999999999877 No 64 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=1.9e-52 Score=304.11 Aligned_cols=280 Identities=16% Similarity=0.145 Sum_probs=226.7 Q ss_pred Cccchhh-hHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEE--Ec-CCceeeeeccc Q lcl|NC_021307. 1 MAAGTAF-PVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPH--WT-GDVSAAWIGEG 75 (310) Q Consensus 1 ~aa~~~~-~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~--~~-~~~~a~~v~Eg 75 (310) +..+... .....++...+++..|| ++|+++..+|++.+++.++|+++|+++|++++...+|. .. ..+.+.|++|| T Consensus 101 ~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg 180 (404) T protein:vir:39 101 VRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAED 180 (404) T ss_pred HhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCc Confidence 3222222 22233334445544444 67888899999999999999999999999876655554 33 34678999999 Q ss_pred ccccc-cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccc Q lcl|NC_021307. 76 DMKPI-TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDL 154 (310) Q Consensus 76 ~~~~~-~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 154 (310) +.+|+ ++++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.+.. T Consensus 181 ~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~~~~---------- 250 (404) T protein:vir:39 181 GKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKKP---------- 250 (404) T ss_pred cccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc---------- Confidence 99997 5799999999999999999999999999999999999999999999999999999999875432 Q ss_pred eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCC-- Q lcl|NC_021307. 155 TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDH-- 232 (310) Q Consensus 155 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~-- 232 (310) ...+++++...+...+...+..+++|+||++++..|++++|++|+|+|+++...+ .+.+|+|+||+++++ T Consensus 251 ---~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~-----~~~~l~G~pV~~~~~~~ 322 (404) T protein:vir:39 251 ---TIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKP-----NSYLIKGKKVIVVADRW 322 (404) T ss_pred ---ccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceeeccCcCCC-----CcceecceeEEEecccc Confidence 1223445544445577788888999999999999999999999999998876544 346899999999765 Q ss_pred CC---CCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEee Q lcl|NC_021307. 233 VA---SGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTN 308 (310) Q Consensus 233 ~~---~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~ 308 (310) +| .+...+++|||+. +.++++++++++++++.. +.|++|++.||++.|+|+.+.+|+||++++. T Consensus 323 ~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~ 390 (404) T protein:vir:39 323 LPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDVKTTDSEALVAGSF 390 (404) T ss_pred cCccCCCccEEEEEeccccEEEEeecceEEEEeccch------------hhhhhceeeEEEEeeeccEEecccceEEEEe Confidence 33 2345688999997 568899999999886543 5689999999999999999999999999998 Q ss_pred cC Q lcl|NC_021307. 309 AA 310 (310) Q Consensus 309 aa 310 (310) ++ T Consensus 391 ~~ 392 (404) T protein:vir:39 391 TA 392 (404) T ss_pred ec Confidence 88 No 65 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=1.1e-52 Score=305.40 Aligned_cols=279 Identities=13% Similarity=0.027 Sum_probs=225.5 Q ss_pred Cccch--------hhhHHHHHhhccccCCCC-ceechhhHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEcCCcee Q lcl|NC_021307. 1 MAAGT--------AFPVNHTQIAQTGDSMFQ-GYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG--VKIPHWTGDVSA 69 (310) Q Consensus 1 ~aa~~--------~~~~~~~~~~~~~~~~~g-~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~ip~~~~~~~a 69 (310) |..+. .........+..+++..| .++|+++..+|++.+++.++|+++|++++++++. +.+|+..+++.+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 11111 011112233444454444 4678888999999999999999999999997655 456777778889 Q ss_pred eeeccccccccc-ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccc Q lcl|NC_021307. 70 AWIGEGDMKPIT-KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDET 148 (310) Q Consensus 70 ~~v~Eg~~~~~~-~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~ 148 (310) .|++|++.++++ .++|+++++.++|++++++||+|+++|+.++++++|.+.|++++++++|.+|++|+|++.+.+ T Consensus 164 ~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~---- 239 (392) T protein:vir:10 164 AEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQA---- 239 (392) T ss_pred eeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC---- Confidence 999999999986 589999999999999999999999999999999999999999999999999999999765422 Q ss_pred cccccceecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEE Q lcl|NC_021307. 149 TKSVDLTPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTI 228 (310) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~ 228 (310) ..+++++...+...+...+..+++|+||++++..|+++||++|+|+|+++...+. +++|+|+|++ T Consensus 240 ----------~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~-----~~tllG~~~v 304 (392) T protein:vir:10 240 ----------IKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN-----KKLFAGTNPV 304 (392) T ss_pred ----------ccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCc-----cccccCcccE Confidence 2334555433446788899999999999999999999999999999988765543 4679998766 Q ss_pred Ee-CC-------CCCCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|NC_021307. 229 LS-DH-------VASGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIND 299 (310) Q Consensus 229 ~t-~~-------~~~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~ 299 (310) ++ ++ ...++..+++|||++ +.+++|++++++++++.. ..|++|++.||++.|+||++.+ T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------------~~f~~~~~~~r~~~r~d~~v~~ 372 (392) T protein:vir:10 305 VVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRAIQRDDVQMWD 372 (392) T ss_pred EEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEEEEeeccEEec Confidence 53 22 234667789999998 568999999999876543 4689999999999999999999 Q ss_pred cCceEEEeecC Q lcl|NC_021307. 300 VEAFVKLTNAA 310 (310) Q Consensus 300 ~~a~~~l~~aa 310 (310) ++||++|+.+. T Consensus 373 ~~a~~~l~~~~ 383 (392) T protein:vir:10 373 NEAAVYGEIDL 383 (392) T ss_pred ccceEEEEecc Confidence 99999988866 No 66 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=1.1e-52 Score=305.40 Aligned_cols=279 Identities=13% Similarity=0.027 Sum_probs=225.5 Q ss_pred Cccch--------hhhHHHHHhhccccCCCC-ceechhhHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEcCCcee Q lcl|NC_021307. 1 MAAGT--------AFPVNHTQIAQTGDSMFQ-GYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG--VKIPHWTGDVSA 69 (310) Q Consensus 1 ~aa~~--------~~~~~~~~~~~~~~~~~g-~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~ip~~~~~~~a 69 (310) |..+. .........+..+++..| .++|+++..+|++.+++.++|+++|++++++++. +.+|+..+++.+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 11111 011112233444454444 4678888999999999999999999999997655 456777778889 Q ss_pred eeeccccccccc-ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccc Q lcl|NC_021307. 70 AWIGEGDMKPIT-KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDET 148 (310) Q Consensus 70 ~~v~Eg~~~~~~-~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~ 148 (310) .|++|++.++++ .++|+++++.++|++++++||+|+++|+.++++++|.+.|++++++++|.+|++|+|++.+.+ T Consensus 164 ~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~---- 239 (392) T protein:vir:10 164 AEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQA---- 239 (392) T ss_pred eeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC---- Confidence 999999999986 589999999999999999999999999999999999999999999999999999999765422 Q ss_pred cccccceecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEE Q lcl|NC_021307. 149 TKSVDLTPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTI 228 (310) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~ 228 (310) ..+++++...+...+...+..+++|+||++++..|+++||++|+|+|+++...+. +++|+|+|++ T Consensus 240 ----------~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~-----~~tllG~~~v 304 (392) T protein:vir:10 240 ----------IKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN-----KKLFAGTNPV 304 (392) T ss_pred ----------ccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCc-----cccccCcccE Confidence 2334555433446788899999999999999999999999999999988765543 4679998766 Q ss_pred Ee-CC-------CCCCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|NC_021307. 229 LS-DH-------VASGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIND 299 (310) Q Consensus 229 ~t-~~-------~~~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~ 299 (310) ++ ++ ...++..+++|||++ +.+++|++++++++++.. ..|++|++.||++.|+||++.+ T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------------~~f~~~~~~~r~~~r~d~~v~~ 372 (392) T protein:vir:10 305 VVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRAIQRDDVQMWD 372 (392) T ss_pred EEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEEEEeeccEEec Confidence 53 22 234667789999998 568999999999876543 4689999999999999999999 Q ss_pred cCceEEEeecC Q lcl|NC_021307. 300 VEAFVKLTNAA 310 (310) Q Consensus 300 ~~a~~~l~~aa 310 (310) ++||++|+.+. T Consensus 373 ~~a~~~l~~~~ 383 (392) T protein:vir:10 373 NEAAVYGEIDL 383 (392) T ss_pred ccceEEEEecc Confidence 99999988866 No 67 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=1.1e-52 Score=305.40 Aligned_cols=279 Identities=13% Similarity=0.027 Sum_probs=225.5 Q ss_pred Cccch--------hhhHHHHHhhccccCCCC-ceechhhHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEcCCcee Q lcl|NC_021307. 1 MAAGT--------AFPVNHTQIAQTGDSMFQ-GYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG--VKIPHWTGDVSA 69 (310) Q Consensus 1 ~aa~~--------~~~~~~~~~~~~~~~~~g-~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~ip~~~~~~~a 69 (310) |..+. .........+..+++..| .++|+++..+|++.+++.++|+++|++++++++. +.+|+..+++.+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 11111 011112233444454444 4678888999999999999999999999997655 456777778889 Q ss_pred eeeccccccccc-ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccc Q lcl|NC_021307. 70 AWIGEGDMKPIT-KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDET 148 (310) Q Consensus 70 ~~v~Eg~~~~~~-~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~ 148 (310) .|++|++.++++ .++|+++++.++|++++++||+|+++|+.++++++|.+.|++++++++|.+|++|+|++.+.+ T Consensus 164 ~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~---- 239 (392) T protein:vir:10 164 AEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQA---- 239 (392) T ss_pred eeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC---- Confidence 999999999986 589999999999999999999999999999999999999999999999999999999765422 Q ss_pred cccccceecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEE Q lcl|NC_021307. 149 TKSVDLTPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTI 228 (310) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~ 228 (310) ..+++++...+...+...+..+++|+||++++..|+++||++|+|+|+++...+. +++|+|+|++ T Consensus 240 ----------~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~-----~~tllG~~~v 304 (392) T protein:vir:10 240 ----------IKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN-----KKLFAGTNPV 304 (392) T ss_pred ----------ccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCc-----cccccCcccE Confidence 2334555433446788899999999999999999999999999999988765543 4679998766 Q ss_pred Ee-CC-------CCCCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|NC_021307. 229 LS-DH-------VASGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIND 299 (310) Q Consensus 229 ~t-~~-------~~~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~ 299 (310) ++ ++ ...++..+++|||++ +.+++|++++++++++.. ..|++|++.||++.|+||++.+ T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------------~~f~~~~~~~r~~~r~d~~v~~ 372 (392) T protein:vir:10 305 VVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRAIQRDDVQMWD 372 (392) T ss_pred EEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEEEEeeccEEec Confidence 53 22 234667789999998 568999999999876543 4689999999999999999999 Q ss_pred cCceEEEeecC Q lcl|NC_021307. 300 VEAFVKLTNAA 310 (310) Q Consensus 300 ~~a~~~l~~aa 310 (310) ++||++|+.+. T Consensus 373 ~~a~~~l~~~~ 383 (392) T protein:vir:10 373 NEAAVYGEIDL 383 (392) T ss_pred ccceEEEEecc Confidence 99999988866 No 68 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=1.1e-52 Score=305.40 Aligned_cols=279 Identities=13% Similarity=0.027 Sum_probs=225.5 Q ss_pred Cccch--------hhhHHHHHhhccccCCCC-ceechhhHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEcCCcee Q lcl|NC_021307. 1 MAAGT--------AFPVNHTQIAQTGDSMFQ-GYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG--VKIPHWTGDVSA 69 (310) Q Consensus 1 ~aa~~--------~~~~~~~~~~~~~~~~~g-~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~ip~~~~~~~a 69 (310) |..+. .........+..+++..| .++|+++..+|++.+++.++|+++|++++++++. +.+|+..+++.+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 11111 011112233444454444 4678888999999999999999999999997655 456777778889 Q ss_pred eeeccccccccc-ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccc Q lcl|NC_021307. 70 AWIGEGDMKPIT-KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDET 148 (310) Q Consensus 70 ~~v~Eg~~~~~~-~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~ 148 (310) .|++|++.++++ .++|+++++.++|++++++||+|+++|+.++++++|.+.|++++++++|.+|++|+|++.+.+ T Consensus 164 ~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~---- 239 (392) T protein:vir:10 164 AEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQA---- 239 (392) T ss_pred eeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC---- Confidence 999999999986 589999999999999999999999999999999999999999999999999999999765422 Q ss_pred cccccceecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEE Q lcl|NC_021307. 149 TKSVDLTPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTI 228 (310) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~ 228 (310) ..+++++...+...+...+..+++|+||++++..|+++||++|+|+|+++...+. +++|+|+|++ T Consensus 240 ----------~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~-----~~tllG~~~v 304 (392) T protein:vir:10 240 ----------IKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN-----KKLFAGTNPV 304 (392) T ss_pred ----------ccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCc-----cccccCcccE Confidence 2334555433446788899999999999999999999999999999988765543 4679998766 Q ss_pred Ee-CC-------CCCCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|NC_021307. 229 LS-DH-------VASGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIND 299 (310) Q Consensus 229 ~t-~~-------~~~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~ 299 (310) ++ ++ ...++..+++|||++ +.+++|++++++++++.. ..|++|++.||++.|+||++.+ T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------------~~f~~~~~~~r~~~r~d~~v~~ 372 (392) T protein:vir:10 305 VVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRAIQRDDVQMWD 372 (392) T ss_pred EEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEEEEeeccEEec Confidence 53 22 234667789999998 568999999999876543 4689999999999999999999 Q ss_pred cCceEEEeecC Q lcl|NC_021307. 300 VEAFVKLTNAA 310 (310) Q Consensus 300 ~~a~~~l~~aa 310 (310) ++||++|+.+. T Consensus 373 ~~a~~~l~~~~ 383 (392) T protein:vir:10 373 NEAAVYGEIDL 383 (392) T ss_pred ccceEEEEecc Confidence 99999988866 No 69 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=2.1e-52 Score=303.87 Aligned_cols=287 Identities=13% Similarity=0.057 Sum_probs=230.7 Q ss_pred Cccchhhh-------HHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEE--cCCceeee Q lcl|NC_021307. 1 MAAGTAFP-------VNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHW--TGDVSAAW 71 (310) Q Consensus 1 ~aa~~~~~-------~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~--~~~~~a~~ 71 (310) +...+.-. ........++++.++.++|+++..+|++.+++.++|+++|+++|++++..++|+. .....+.| T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (415) T protein:vir:47 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceee Confidence 01000000 0111122334555666788899999999999999999999999998887777765 55678899 Q ss_pred eccccccccc-ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccc Q lcl|NC_021307. 72 IGEGDMKPIT-KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTK 150 (310) Q Consensus 72 v~Eg~~~~~~-~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~ 150 (310) ++|++.+|+. .++|+++++.+++++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.+.+...... T Consensus 181 v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~ 260 (415) T protein:vir:47 181 VEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFE 260 (415) T ss_pred cccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccc Confidence 9999999974 689999999999999999999999999999999999999999999999999999999987765544333 Q ss_pred cccc-ee-cccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEE Q lcl|NC_021307. 151 SVDL-TP-ATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTI 228 (310) Q Consensus 151 ~~~~-~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~ 228 (310) .... .. .+..++ +.+.+++..+...++.+++|+||+++|.+|++++|++|+|+|+++...+ .+++|+|+||+ T Consensus 261 ~~~~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~-----~~~~l~G~pV~ 334 (415) T protein:vir:47 261 KEGKKLEVKKAKSL-DDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEK-----TQQRLLGAKIE 334 (415) T ss_pred cccceeccccccch-HHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCC-----CCccccceeeE Confidence 2222 22 223334 4445888888888899999999999999999999999999998876544 34689999999 Q ss_pred EeCCCCC---CceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|NC_021307. 229 LSDHVAS---GTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFV 304 (310) Q Consensus 229 ~t~~~~~---~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~ 304 (310) +++++|. ++..++||||++ +.++++++++++.++ |.++++.+|++.|+|+++.+++||+ T Consensus 335 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:47 335 ILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred EeccccccCCCccEEEEEehhccEEEEeecceEEEeec-----------------cccCceEEEEEEEeccEEeccccEE Confidence 9998874 345689999998 567888999887653 4567788999999999999999999 Q ss_pred EEeecC Q lcl|NC_021307. 305 KLTNAA 310 (310) Q Consensus 305 ~l~~aa 310 (310) +++..+ T Consensus 398 ~~~~~~ 403 (415) T protein:vir:47 398 VIEYDD 403 (415) T ss_pred EEEeec Confidence 999888 No 70 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=2.1e-52 Score=303.87 Aligned_cols=287 Identities=13% Similarity=0.057 Sum_probs=230.7 Q ss_pred Cccchhhh-------HHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEE--cCCceeee Q lcl|NC_021307. 1 MAAGTAFP-------VNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHW--TGDVSAAW 71 (310) Q Consensus 1 ~aa~~~~~-------~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~--~~~~~a~~ 71 (310) +...+.-. ........++++.++.++|+++..+|++.+++.++|+++|+++|++++..++|+. .....+.| T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (415) T protein:vir:46 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceee Confidence 01000000 0111122334555666788899999999999999999999999998887777765 55678899 Q ss_pred eccccccccc-ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccc Q lcl|NC_021307. 72 IGEGDMKPIT-KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTK 150 (310) Q Consensus 72 v~Eg~~~~~~-~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~ 150 (310) ++|++.+|+. .++|+++++.+++++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.+.+...... T Consensus 181 v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~ 260 (415) T protein:vir:46 181 VEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFE 260 (415) T ss_pred cccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccc Confidence 9999999974 689999999999999999999999999999999999999999999999999999999987765544333 Q ss_pred cccc-ee-cccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEE Q lcl|NC_021307. 151 SVDL-TP-ATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTI 228 (310) Q Consensus 151 ~~~~-~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~ 228 (310) .... .. .+..++ +.+.+++..+...++.+++|+||+++|.+|++++|++|+|+|+++...+ .+++|+|+||+ T Consensus 261 ~~~~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~-----~~~~l~G~pV~ 334 (415) T protein:vir:46 261 KEGKKLEVKKAKSL-DDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEK-----TQQRLLGAKIE 334 (415) T ss_pred cccceeccccccch-HHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCC-----CCccccceeeE Confidence 2222 22 223334 4445888888888899999999999999999999999999998876544 34689999999 Q ss_pred EeCCCCC---CceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|NC_021307. 229 LSDHVAS---GTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFV 304 (310) Q Consensus 229 ~t~~~~~---~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~ 304 (310) +++++|. ++..++||||++ +.++++++++++.++ |.++++.+|++.|+|+++.+++||+ T Consensus 335 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:46 335 ILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred EeccccccCCCccEEEEEehhccEEEEeecceEEEeec-----------------cccCceEEEEEEEeccEEeccccEE Confidence 9998874 345689999998 567888999887653 4567788999999999999999999 Q ss_pred EEeecC Q lcl|NC_021307. 305 KLTNAA 310 (310) Q Consensus 305 ~l~~aa 310 (310) +++..+ T Consensus 398 ~~~~~~ 403 (415) T protein:vir:46 398 VIEYDD 403 (415) T ss_pred EEEeec Confidence 999888 No 71 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=2.8e-52 Score=303.23 Aligned_cols=281 Identities=11% Similarity=0.018 Sum_probs=221.5 Q ss_pred Cccch-hhhHHH-----HHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecc Q lcl|NC_021307. 1 MAAGT-AFPVNH-----TQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGE 74 (310) Q Consensus 1 ~aa~~-~~~~~~-----~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E 74 (310) ..++. .+..+. ......+++.+|.+||+++.++|++.+++.++|+++|+++|++++...+|+.++.+.+.|++| T Consensus 65 ~~~~~~~l~~~~r~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~~~~~a~~~~E 144 (390) T protein:vir:40 65 ASRGANALTSDESKYYNEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVGDVATAWWGPL 144 (390) T ss_pred HhcCchhccHHHHHHHHHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEcCCcceeeecc Confidence 11110 011111 112233455566678999999999999999999999999999999999999999999999999 Q ss_pred cccccc-cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccccccc Q lcl|NC_021307. 75 GDMKPI-TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVD 153 (310) Q Consensus 75 g~~~~~-~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~ 153 (310) ++.+++ ++++|+++++.+||++++++||+|+++|+.++++++|+++|++++++++|++|++|+|++.|.++........ T Consensus 145 ~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~ 224 (390) T protein:vir:40 145 CAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVT 224 (390) T ss_pred ccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeecccccc Confidence 998875 6899999999999999999999999999999999999999999999999999999999988877665433222 Q ss_pred cee----cccchHHHHHHHHHHHhhh-------hcCCCCEEEEehHHHH----HHHHhhhccCccccccccccccccccC Q lcl|NC_021307. 154 LTP----ATGTTYDAIGVNALSLLVN-------AGKKWGATLLDDVAEP----ILNGAKDANGRPLFVESTYEAVTTPYR 218 (310) Q Consensus 154 ~~~----~~~~~~~~~~~~~~~~l~~-------~~~~~~~~~~~~~~~~----~l~~l~d~~g~~~~~~~~~~~~~~~~~ 218 (310) ... ......+....++...+.. ....+++|+||+.++. .++.++|.+|+|++.. T Consensus 225 ~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~v~~~----------- 293 (390) T protein:vir:40 225 AGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMTPQGVWVTGI----------- 293 (390) T ss_pred ccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccCCCCcccccc----------- Confidence 211 1111111112222222222 2356889999998842 4457899999998642 Q ss_pred CceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|NC_021307. 219 EGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIN 298 (310) Q Consensus 219 ~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~ 298 (310) .++|+||+++++||.++ ++||||++++++++++++++++++. .|.+|++.||++.|+|+++. T Consensus 294 --~~~g~pvv~~~~~p~~~--i~~Gd~s~~~i~~~~~~~v~~~~~~--------------~f~~~~~~~r~~~r~dg~v~ 355 (390) T protein:vir:40 294 --LPVPLEIVQSVAVPVGK--AVAGRAKDYFMGIGSEQVIRTSTEY--------------RLLDDETLYYAKQYANGRPK 355 (390) T ss_pred --CCCceeEEEcCCCCCCc--EEEEeeceEEEEeecceEEEecchh--------------hhhcCcEEEEEEEEeCCEEe Confidence 24799999999999886 5789999999999999999988654 48899999999999999999 Q ss_pred ccCceEEEeecC Q lcl|NC_021307. 299 DVEAFVKLTNAA 310 (310) Q Consensus 299 ~~~a~~~l~~aa 310 (310) +++||++|+.+| T Consensus 356 ~~~A~~~l~~~~ 367 (390) T protein:vir:40 356 DNSSFLVFDITG 367 (390) T ss_pred cccceEEEEeec Confidence 999999999888 No 72 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=8.6e-52 Score=300.55 Aligned_cols=287 Identities=14% Similarity=0.068 Sum_probs=228.3 Q ss_pred Cccchhhh------------HHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEE--EEEcCC Q lcl|NC_021307. 1 MAAGTAFP------------VNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKI--PHWTGD 66 (310) Q Consensus 1 ~aa~~~~~------------~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~i--p~~~~~ 66 (310) +....... ........++++.+|.++|.++...|++.+++.++|+++++++||+++..++ |+..+. T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 175 (415) T protein:vir:81 96 IQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEV 175 (415) T ss_pred hhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCC Confidence 10000000 0111112334444555778888999999999999999999999998765554 555667 Q ss_pred ceeeeeccccccccc-ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccc Q lcl|NC_021307. 67 VSAAWIGEGDMKPIT-KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNL 145 (310) Q Consensus 67 ~~a~~v~Eg~~~~~~-~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~ 145 (310) ..++|++|++.+|+. .++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.+... T Consensus 176 ~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~ 255 (415) T protein:vir:81 176 AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST 255 (415) T ss_pred ccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc Confidence 789999999999975 5899999999999999999999999999999999999999999999999999999998877655 Q ss_pred ccccccc--cceecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceee Q lcl|NC_021307. 146 DETTKSV--DLTPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRIL 223 (310) Q Consensus 146 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~ 223 (310) ....... ..+.....++++ +.+++..+...++.+++|+||+++|..|+++||++|+|+|.++...+ .+++|+ T Consensus 256 ~~~~~~~~~~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~-----~~~~l~ 329 (415) T protein:vir:81 256 SSGFEKEGKKLEVKKAKSLDD-IKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEK-----TQQRLL 329 (415) T ss_pred cccccccccccccccccchhH-HHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCC-----CCceec Confidence 4433222 222223344444 45788888888899999999999999999999999999998876543 356899 Q ss_pred eeeEEEeCCCCC---CceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|NC_021307. 224 GRPTILSDHVAS---GTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIND 299 (310) Q Consensus 224 G~pv~~t~~~~~---~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~ 299 (310) |+||++++++|. ++..++||||++ +.++++++++++.++ |.++++.+|++.|+|+++.+ T Consensus 330 G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~~~~~r~d~~v~~ 392 (415) T protein:vir:81 330 GAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILD 392 (415) T ss_pred ceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----------------cccCceEEEEEEEeccEEec Confidence 999999999874 456689999998 558889999998753 34566789999999999999 Q ss_pred cCceEEEeecC Q lcl|NC_021307. 300 VEAFVKLTNAA 310 (310) Q Consensus 300 ~~a~~~l~~aa 310 (310) ++||++++..+ T Consensus 393 ~~a~~~~~~~~ 403 (415) T protein:vir:81 393 YKSAIVIEYDD 403 (415) T ss_pred cccEEEEEEec Confidence 99999999988 No 73 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=8.6e-52 Score=300.55 Aligned_cols=287 Identities=14% Similarity=0.068 Sum_probs=228.3 Q ss_pred Cccchhhh------------HHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEE--EEEcCC Q lcl|NC_021307. 1 MAAGTAFP------------VNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKI--PHWTGD 66 (310) Q Consensus 1 ~aa~~~~~------------~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~i--p~~~~~ 66 (310) +....... ........++++.+|.++|.++...|++.+++.++|+++++++||+++..++ |+..+. T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 175 (415) T protein:vir:79 96 IQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEV 175 (415) T ss_pred hhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCC Confidence 10000000 0111112334444555778888999999999999999999999998765554 555667 Q ss_pred ceeeeeccccccccc-ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccc Q lcl|NC_021307. 67 VSAAWIGEGDMKPIT-KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNL 145 (310) Q Consensus 67 ~~a~~v~Eg~~~~~~-~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~ 145 (310) ..++|++|++.+|+. .++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.+... T Consensus 176 ~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~ 255 (415) T protein:vir:79 176 AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST 255 (415) T ss_pred ccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc Confidence 789999999999975 5899999999999999999999999999999999999999999999999999999998877655 Q ss_pred ccccccc--cceecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceee Q lcl|NC_021307. 146 DETTKSV--DLTPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRIL 223 (310) Q Consensus 146 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~ 223 (310) ....... ..+.....++++ +.+++..+...++.+++|+||+++|..|+++||++|+|+|.++...+ .+++|+ T Consensus 256 ~~~~~~~~~~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~-----~~~~l~ 329 (415) T protein:vir:79 256 SSGFEKEGKKLEVKKAKSLDD-IKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEK-----TQQRLL 329 (415) T ss_pred cccccccccccccccccchhH-HHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCC-----CCceec Confidence 4433222 222223344444 45788888888899999999999999999999999999998876543 356899 Q ss_pred eeeEEEeCCCCC---CceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|NC_021307. 224 GRPTILSDHVAS---GTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIND 299 (310) Q Consensus 224 G~pv~~t~~~~~---~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~ 299 (310) |+||++++++|. ++..++||||++ +.++++++++++.++ |.++++.+|++.|+|+++.+ T Consensus 330 G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~~~~~r~d~~v~~ 392 (415) T protein:vir:79 330 GAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILD 392 (415) T ss_pred ceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----------------cccCceEEEEEEEeccEEec Confidence 999999999874 456689999998 558889999998753 34566789999999999999 Q ss_pred cCceEEEeecC Q lcl|NC_021307. 300 VEAFVKLTNAA 310 (310) Q Consensus 300 ~~a~~~l~~aa 310 (310) ++||++++..+ T Consensus 393 ~~a~~~~~~~~ 403 (415) T protein:vir:79 393 YKSAIVIEYDD 403 (415) T ss_pred cccEEEEEEec Confidence 99999999988 No 74 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=8.6e-52 Score=300.55 Aligned_cols=287 Identities=14% Similarity=0.068 Sum_probs=228.3 Q ss_pred Cccchhhh------------HHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEE--EEEcCC Q lcl|NC_021307. 1 MAAGTAFP------------VNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKI--PHWTGD 66 (310) Q Consensus 1 ~aa~~~~~------------~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~i--p~~~~~ 66 (310) +....... ........++++.+|.++|.++...|++.+++.++|+++++++||+++..++ |+..+. T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 175 (415) T protein:vir:98 96 IQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEV 175 (415) T ss_pred hhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCC Confidence 10000000 0111112334444555778888999999999999999999999998765554 555667 Q ss_pred ceeeeeccccccccc-ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccc Q lcl|NC_021307. 67 VSAAWIGEGDMKPIT-KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNL 145 (310) Q Consensus 67 ~~a~~v~Eg~~~~~~-~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~ 145 (310) ..++|++|++.+|+. .++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.+... T Consensus 176 ~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~ 255 (415) T protein:vir:98 176 AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST 255 (415) T ss_pred ccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc Confidence 789999999999975 5899999999999999999999999999999999999999999999999999999998877655 Q ss_pred ccccccc--cceecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceee Q lcl|NC_021307. 146 DETTKSV--DLTPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRIL 223 (310) Q Consensus 146 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~ 223 (310) ....... ..+.....++++ +.+++..+...++.+++|+||+++|..|+++||++|+|+|.++...+ .+++|+ T Consensus 256 ~~~~~~~~~~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~-----~~~~l~ 329 (415) T protein:vir:98 256 SSGFEKEGKKLEVKKAKSLDD-IKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEK-----TQQRLL 329 (415) T ss_pred cccccccccccccccccchhH-HHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCC-----CCceec Confidence 4433222 222223344444 45788888888899999999999999999999999999998876543 356899 Q ss_pred eeeEEEeCCCCC---CceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|NC_021307. 224 GRPTILSDHVAS---GTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIND 299 (310) Q Consensus 224 G~pv~~t~~~~~---~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~ 299 (310) |+||++++++|. ++..++||||++ +.++++++++++.++ |.++++.+|++.|+|+++.+ T Consensus 330 G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~~~~~r~d~~v~~ 392 (415) T protein:vir:98 330 GAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILD 392 (415) T ss_pred ceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----------------cccCceEEEEEEEeccEEec Confidence 999999999874 456689999998 558889999998753 34566789999999999999 Q ss_pred cCceEEEeecC Q lcl|NC_021307. 300 VEAFVKLTNAA 310 (310) Q Consensus 300 ~~a~~~l~~aa 310 (310) ++||++++..+ T Consensus 393 ~~a~~~~~~~~ 403 (415) T protein:vir:98 393 YKSAIVIEYDD 403 (415) T ss_pred cccEEEEEEec Confidence 99999999988 No 75 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=1.5e-51 Score=299.18 Aligned_cols=287 Identities=13% Similarity=0.070 Sum_probs=229.7 Q ss_pred Cccchh------h-hHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceE--EEEEcCCceeee Q lcl|NC_021307. 1 MAAGTA------F-PVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVK--IPHWTGDVSAAW 71 (310) Q Consensus 1 ~aa~~~------~-~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~--ip~~~~~~~a~~ 71 (310) +...+. . .........++++.+|.++|+++..+|++.+++.++|+++|+++||+++..+ +++..+...+.| T Consensus 101 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (415) T protein:vir:94 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCcccee Confidence 111110 0 0111222333455566678888999999999999999999999999876555 455567778999 Q ss_pred eccccccccc-ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccc Q lcl|NC_021307. 72 IGEGDMKPIT-KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTK 150 (310) Q Consensus 72 v~Eg~~~~~~-~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~ 150 (310) ++|++.+|+. .++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|++|++|+|++.+........ T Consensus 181 v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~ 260 (415) T protein:vir:94 181 VEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFE 260 (415) T ss_pred ccccccccccccccceeeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccc Confidence 9999999975 689999999999999999999999999999999999999999999999999999999988765544333 Q ss_pred cccc--eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEE Q lcl|NC_021307. 151 SVDL--TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTI 228 (310) Q Consensus 151 ~~~~--~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~ 228 (310) .... ......++++ +.+++..+...++.+++|+||+++|.+|+++||++|+|+|.++...+ .+++|+|+||+ T Consensus 261 ~~~~~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~-----~~~~l~G~pV~ 334 (415) T protein:vir:94 261 KEGKKLEVKKAKSLDD-IKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEK-----TQQRLLGAKIE 334 (415) T ss_pred ccccccccccccchHH-HHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCC-----CCceecceeeE Confidence 2222 2223334444 45788888888889999999999999999999999999998876543 35689999999 Q ss_pred EeCCCCCC---ceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|NC_021307. 229 LSDHVASG---TTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFV 304 (310) Q Consensus 229 ~t~~~~~~---~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~ 304 (310) +++++|.+ +..+++|||++ ++++++++++++.++ |.++++.+|++.|+|+++.+++||+ T Consensus 335 ~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~r~~~r~d~~~~~~~a~~ 397 (415) T protein:vir:94 335 ILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred EecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----------------cccCceEEEEEEEeccEEeccccEE Confidence 99998753 45678999998 567888999887653 4567789999999999999999999 Q ss_pred EEeecC Q lcl|NC_021307. 305 KLTNAA 310 (310) Q Consensus 305 ~l~~aa 310 (310) +++..+ T Consensus 398 ~~~~~~ 403 (415) T protein:vir:94 398 VIEYDD 403 (415) T ss_pred EEEEec Confidence 999888 No 76 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=2.6e-52 Score=303.42 Aligned_cols=293 Identities=12% Similarity=0.003 Sum_probs=232.8 Q ss_pred Cccchhh-hHH---HH-Hhhccc-cCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecc Q lcl|NC_021307. 1 MAAGTAF-PVN---HT-QIAQTG-DSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGE 74 (310) Q Consensus 1 ~aa~~~~-~~~---~~-~~~~~~-~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E 74 (310) ...+... ..+ .. .....+ .+.+|.+||++++++|++.+.+.|+++++|+++++++. .++|+.++.+.+.|++| T Consensus 60 ~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~~~~~~~~~~a~w~~e 138 (377) T protein:vir:98 60 LRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLR-LKALTAETSGTAVWGDI 138 (377) T ss_pred hccCCcccCHHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCcc-eEEEEecCCcceeEeec Confidence 1111111 111 11 122233 44455678889999999999999999999999998654 79999999999999999 Q ss_pred ccccc-ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccccccc Q lcl|NC_021307. 75 GDMKP-ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVD 153 (310) Q Consensus 75 g~~~~-~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~ 153 (310) +++++ +++++|+++++.+||++++++||+|+|+|+.+++++||.+++++++++++|.+|++|+|++.|.+++....... T Consensus 139 ~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~ 218 (377) T protein:vir:98 139 FGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPT 218 (377) T ss_pred ccccCcccCccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccc Confidence 88776 57899999999999999999999999999999999999999999999999999999999999988875432222 Q ss_pred ceec------ccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccc---------cccccC Q lcl|NC_021307. 154 LTPA------TGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEA---------VTTPYR 218 (310) Q Consensus 154 ~~~~------~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~---------~~~~~~ 218 (310) .... +..+..+.+.++...+...+..+++|+||+.++..++++||.+|+++|..+.... ....+. T Consensus 219 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~ 298 (377) T protein:vir:98 219 VDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGE 298 (377) T ss_pred cccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCc Confidence 1111 1111223455677777888899999999999999999999999999995443210 011233 Q ss_pred Cceeeeee--EEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccE Q lcl|NC_021307. 219 EGRILGRP--TILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLL 296 (310) Q Consensus 219 ~~~l~G~p--v~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~ 296 (310) +.+++|+| ++.++++|+++ ++||||++|.++++++++++.+++. .|.+|++.||+..|+|++ T Consensus 299 ~~t~lg~p~~vv~s~~~p~~~--i~fgdf~~Y~i~~r~~~~i~~~~~~--------------~~~~d~~~f~~~~r~dg~ 362 (377) T protein:vir:98 299 YVTVLPHGITILESLAVETGK--AIAFVANRYDAFMATASTIEEYDQT--------------FAMEDLQLYLTKNYFYGK 362 (377) T ss_pred cccccCCCceEEecCCCCccc--EEEEEecceeEEeecceEEEeechh--------------hhhcCceEEEEEEEEcCE Confidence 45788888 56777899876 5799999999999999999988764 488999999999999999 Q ss_pred EeccCceEEEeecC Q lcl|NC_021307. 297 INDVEAFVKLTNAA 310 (310) Q Consensus 297 v~~~~a~~~l~~aa 310 (310) +.+++||++|+.+. T Consensus 363 ~~~~~a~~vl~i~~ 376 (377) T protein:vir:98 363 AKDNHTAALLTLAG 376 (377) T ss_pred EeccCcEEEEEEec Confidence 99999999999999 No 77 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=5.7e-51 Score=296.02 Aligned_cols=290 Identities=15% Similarity=0.103 Sum_probs=226.1 Q ss_pred CccchhhhHHHH----H-hhccccCCCCceechhhHHH-HHHHHHhhchhhhhcceeecCCCceEEEEEcC--------C Q lcl|NC_021307. 1 MAAGTAFPVNHT----Q-IAQTGDSMFQGYLEPEQAQD-YFAEAEKTSIVQRVARKIPMGSTGVKIPHWTG--------D 66 (310) Q Consensus 1 ~aa~~~~~~~~~----~-~~~~~~~~~g~~i~~~~~~~-ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~--------~ 66 (310) ....+.-..... . ....+....+..++|+...+ |+...+..+.++++|+++|+.++.+++|+.++ . T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 184 (419) T protein:vir:94 105 QFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTW 184 (419) T ss_pred hhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceeeeeeccccccccccC Confidence 000000000000 0 11112234445666666655 55566777789999999999999899988654 3 Q ss_pred ceeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccc Q lcl|NC_021307. 67 VSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLD 146 (310) Q Consensus 67 ~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~ 146 (310) ..++|++||+.+|+++++|+++++.++|++++++||+|+++|+ .+++++|.++|++++++++|++||+|+|++.|.++. T Consensus 185 ~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~ 263 (419) T protein:vir:94 185 NKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGIL 263 (419) T ss_pred cccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCccccccee Confidence 4578999999999999999999999999999999999999986 579999999999999999999999999999888776 Q ss_pred cccccccce------ecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCcc-ccccccccccccccCC Q lcl|NC_021307. 147 ETTKSVDLT------PATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRP-LFVESTYEAVTTPYRE 219 (310) Q Consensus 147 ~~~~~~~~~------~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~-~~~~~~~~~~~~~~~~ 219 (310) ......... ..+.....+.+.+++..+...+..+++|+||++++..|++++|++|++ +++++..++ .+ T Consensus 264 ~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~-----~~ 338 (419) T protein:vir:94 264 TTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGE-----AT 338 (419) T ss_pred cccccccccccccccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccC-----CC Confidence 543322211 122333455666888888888899999999999999999999987665 455544333 35 Q ss_pred ceeeeeeEEEeCCCCCCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|NC_021307. 220 GRILGRPTILSDHVASGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIN 298 (310) Q Consensus 220 ~~l~G~pv~~t~~~~~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~ 298 (310) ++|+|+||++++++|.++ +++|||++ ++++++++++++++++.. ++|++|++.||++.|+|+++. T Consensus 339 ~~l~G~pV~~~~~~~~~~--~~~gd~~~~~~~~~~~~~~v~~~~~~~------------~~~~~~~~~~r~~~r~d~~v~ 404 (419) T protein:vir:94 339 PRIWGLNVVSTVAIAQGT--ALVGGFRQGATLWSRQGITVLMTDSHA------------DFFTANTLVILAEFRANLAVY 404 (419) T ss_pred ccccceeeEEcCCCCCcc--EEEeeccceEEEEEecceEEEEecccc------------chhhcCcEEEEEEEeeccEEe Confidence 689999999999999876 57899998 568889999999876643 468999999999999999999 Q ss_pred ccCceEEEeecC Q lcl|NC_021307. 299 DVEAFVKLTNAA 310 (310) Q Consensus 299 ~~~a~~~l~~aa 310 (310) +++||++++.+| T Consensus 405 ~~~a~~~~~~~a 416 (419) T protein:vir:94 405 QPKAFVRVTFAA 416 (419) T ss_pred ccccEEEEEecc Confidence 999999999999 No 78 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=1.4e-50 Score=293.82 Aligned_cols=274 Identities=15% Similarity=0.117 Sum_probs=224.5 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCce--eeeecccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVS--AAWIGEGDMK 78 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~--a~~v~Eg~~~ 78 (310) ..++..+ ....+...+++.+|.+||+++..+|++.+++.++|+++|+++||+++..++|+...... ++|++|+..+ T Consensus 103 ~~~~~~~--~~~~ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~ 180 (421) T protein:vir:13 103 TIRGIQL--SEEERDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVRAGASVDKLANLAKDTEL 180 (421) T ss_pred hhhccch--hHHHhhccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEeecCCccceeeccccccc Confidence 1112222 22234445555667788889999999999999999999999999999999998876544 5779999999 Q ss_pred cccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecc Q lcl|NC_021307. 79 PITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPAT 158 (310) Q Consensus 79 ~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~ 158 (310) ++++++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|..+++. +.++.. ..+ T Consensus 181 ~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~-----~~g~~~--------~~~ 247 (421) T protein:vir:13 181 VKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQ-----AKAVLA--------EET 247 (421) T ss_pred cccccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhh-----hhhccc--------ccc Confidence 9999999999999999999999999999999999999999999999999999998742 212111 112 Q ss_pred cchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCC--- Q lcl|NC_021307. 159 GTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVAS--- 235 (310) Q Consensus 159 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~--- 235 (310) ..++++ +.+++..+...++.+++|+||+.+|..|++++|++|+|+|++.. . +.+++|+|+||++++++|. T Consensus 248 ~~~~d~-i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~-~-----~~~~tl~G~pV~~~~~~~~~~~ 320 (421) T protein:vir:13 248 INDYAG-LVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKELS-D-----GGDLVFKGRPVIELEESIFDVG 320 (421) T ss_pred ccchHH-HHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecCcC-C-----CCCceecceeeEEeccccccCC Confidence 233444 45788888889999999999999999999999999999997632 2 2356899999999999874 Q ss_pred CceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 236 GTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 236 ~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +...+++|||++ +.++++++++++++++. .|++|++.||++.|+|+++.+++||++++..- T Consensus 321 ~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~--------------~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 382 (421) T protein:vir:13 321 DETKFIVSDFKTLIKFMDRKQYLIDQSKEA--------------GYTKNETIARIIERFDVNSPLDKSSDAEKIRK 382 (421) T ss_pred CceEEEEEeccccEEEEEecceEEEeeccc--------------ccccCeeEEEEEeeecceeecchhhheeeecc Confidence 346788999998 66899999999998764 38999999999999999999999987776654 No 79 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=2e-50 Score=293.06 Aligned_cols=272 Identities=14% Similarity=0.099 Sum_probs=218.7 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcC-Cceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTG-DVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~ 79 (310) ++++.............+...+|.++|+++...|++.+++.++|+++|+++|++++..++|+... +..+.|++|++.+| T Consensus 115 ~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~ 194 (394) T protein:vir:97 115 LMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNP 194 (394) T ss_pred HHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEecCCCccceeccccccc Confidence 22222222222223333444455678888999999999999999999999999998899998764 56789999999999 Q ss_pred c-cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecc Q lcl|NC_021307. 80 I-TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPAT 158 (310) Q Consensus 80 ~-~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~ 158 (310) + ++++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|.+|++|.+++.+.+ T Consensus 195 ~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~-------------- 260 (394) T protein:vir:97 195 ALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKT-------------- 260 (394) T ss_pred ccccccceeEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-------------- Confidence 7 5699999999999999999999999999999999999999999999999999999887654321 Q ss_pred cchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCce Q lcl|NC_021307. 159 GTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTT 238 (310) Q Consensus 159 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~ 238 (310) ..+++++. +++..... ...+++|+||+++|..|++++|++|+|+|+++..++. +++|+|+||+++++.+.+.. T Consensus 261 ~~~~~~~~-~~~~~~~~-~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~-----~~~l~G~pv~~~~~~~~~~~ 333 (394) T protein:vir:97 261 VKNLDEIK-ALLNGGFD-PAYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVS-----GKVLLGKPVFVLSDEVLGAN 333 (394) T ss_pred cccHHHHH-HHHHhhhh-hhhCCEEEEcHHHHHHHHHhhccCCCeeeecCcCCCC-----CceeccceeEEecccccCCc Confidence 22344444 33333222 2347899999999999999999999999988765543 46899999999887777777 Q ss_pred eEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 239 VGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 239 ~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .+++|||++ +.++++++++++.+++. .+...||++.|+|+++.+|+||++|+..+ T Consensus 334 ~~~~gd~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 389 (394) T protein:vir:97 334 KAFIGDFKRGVLFADRKDLGLRWADNE-----------------IYGQYLQAVLRFGVSKVDDKAGYYVTFTP 389 (394) T ss_pred cEEEeeccccEEEEEecceEEEEeccc-----------------ccceeEEEEEEEccEEecccceEEEEecc Confidence 789999998 56889999999876543 33467999999999999999999998888 No 80 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=5.6e-50 Score=290.62 Aligned_cols=272 Identities=13% Similarity=0.136 Sum_probs=217.3 Q ss_pred CccchhhhHHHHHhh--ccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcC-Cceeeeeccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIA--QTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTG-DVSAAWIGEGDM 77 (310) Q Consensus 1 ~aa~~~~~~~~~~~~--~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~ 77 (310) .........+..... .++++.+|.++|+++..+|++.+++.++|+++++++|++++..++|+... .+.+.|++|++. T Consensus 119 ~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~ 198 (400) T protein:vir:38 119 FAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEK 198 (400) T ss_pred HhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEEecCCCcccccccccc Confidence 111111122222222 23444455678888999999999999999999999999998899999864 466899999999 Q ss_pred ccc-cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccccccccee Q lcl|NC_021307. 78 KPI-TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTP 156 (310) Q Consensus 78 ~~~-~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~ 156 (310) +++ ++++|+++++.++|++++++||+|+++|+.++++++|.+.|+++++.++|.++++|+|++.+.+. T Consensus 199 ~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~~~~----------- 267 (400) T protein:vir:38 199 NPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFTAKTI----------- 267 (400) T ss_pred ccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc----------- Confidence 986 57999999999999999999999999999999999999999999999999999999987654221 Q ss_pred cccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCC- Q lcl|NC_021307. 157 ATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVAS- 235 (310) Q Consensus 157 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~- 235 (310) .+++++. ++.....+. ..+++|+|||+++..|++++|++|+|+|+++...+. +++|+|+||++++++|. T Consensus 268 ---~~~~~~~-~~~~~~~~~-~~~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~-----~~~l~G~pv~~~~~~~~~ 337 (400) T protein:vir:38 268 ---SSVDDLK-HINNVDLDP-AYSRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPS-----GKSVLGMPIAVVSDDTLG 337 (400) T ss_pred ---ccHHHHH-HHHHhhhhh-hhCcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCC-----ccccccceeEEecccccC Confidence 1233333 343332222 347899999999999999999999999988765543 46899999999998874 Q ss_pred --CceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 236 --GTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 236 --~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++..+++|||++ +.+++++++++..+++.+ +...||++.|+|+++.+++||++|+.++ T Consensus 338 ~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~r~d~~~~~~~a~~~l~~~~ 398 (400) T protein:vir:38 338 AAGEAHAFLGDIKRAILFANRADFMVRWVDDQI-----------------YGQFLQAGMRFGVSVADEKAGYFLTYTP 398 (400) T ss_pred CCCceEEEEEeccccEEEEeecceEEEEecccc-----------------cceeEEEEEEeccEEecccceEEEEeec Confidence 466789999998 567889999998876532 2357999999999999999999999888 No 81 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=3.6e-50 Score=291.63 Aligned_cols=279 Identities=12% Similarity=-0.007 Sum_probs=217.1 Q ss_pred Cccc-hh---hhHHHHHhhccccC-CCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccc Q lcl|NC_021307. 1 MAAG-TA---FPVNHTQIAQTGDS-MFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEG 75 (310) Q Consensus 1 ~aa~-~~---~~~~~~~~~~~~~~-~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (310) ..++ .. -+.....+..++++ .+|.++|+++.++|++.+++.|+++++|+++++++. .++|+.++.+.+.|++|+ T Consensus 58 ~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~i~~~~~~~~a~w~~e~ 136 (381) T protein:vir:10 58 LPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKFLKSETSGVAVWGKIY 136 (381) T ss_pred hccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcc-eEEEEecCCcceeeeccc Confidence 1111 11 11222334444544 455578889999999999999999999999998654 799999999999999999 Q ss_pred cccc-ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccc Q lcl|NC_021307. 76 DMKP-ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDL 154 (310) Q Consensus 76 ~~~~-~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 154 (310) ++++ +++++|+++++.+||++++++||+|||+|+.+++++||.++|++++++++|++|++|+|++.|.+++........ T Consensus 137 ~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~ 216 (381) T protein:vir:10 137 GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVS 216 (381) T ss_pred ccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccc Confidence 8876 568999999999999999999999999999999999999999999999999999999999999887643221110 Q ss_pred e--------ecccc-------hHHHHHHHHHHHhhh-------hcCCCCEEEEehHHHHHHHHhh---hccCcccccccc Q lcl|NC_021307. 155 T--------PATGT-------TYDAIGVNALSLLVN-------AGKKWGATLLDDVAEPILNGAK---DANGRPLFVEST 209 (310) Q Consensus 155 ~--------~~~~~-------~~~~~~~~~~~~l~~-------~~~~~~~~~~~~~~~~~l~~l~---d~~g~~~~~~~~ 209 (310) . .+.++ +..+.+.++...+.. .+..++.|+||+.++..++.++ +++|+|++.. T Consensus 217 ~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l-- 294 (381) T protein:vir:10 217 VTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL-- 294 (381) T ss_pred cccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecC-- Confidence 0 01111 111222233333321 3566788999999999988665 5567666431 Q ss_pred ccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEE Q lcl|NC_021307. 210 YEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRV 289 (310) Q Consensus 210 ~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ 289 (310) ..|++|+.++.||.++ ++||||++|.+++|++++++.+++. .|.+|++.||+ T Consensus 295 ------------~~g~~vv~s~~~p~~~--iifgDfs~Y~i~~r~~~~i~~~~~~--------------~~~~d~~~f~a 346 (381) T protein:vir:10 295 ------------PFNLNVIESTVQEAGK--VLTYVKGLYDGYLAGGINVQKFKET--------------LALDDMDLYTA 346 (381) T ss_pred ------------CCCceEEecCCCCcCc--EEEEecccEEEEEecccEEEeechh--------------HhhcCCeEEEE Confidence 2467899999999876 6799999999999999999998764 48999999999 Q ss_pred EEEeccEEeccCceEEEeecC Q lcl|NC_021307. 290 EAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 290 ~~~~d~~v~~~~a~~~l~~aa 310 (310) ..|+|+++.+++||++++.+- T Consensus 347 ~~r~dg~~~~~~A~~v~~l~~ 367 (381) T protein:vir:10 347 KQFAYGKAKDNKVAAVWKLDL 367 (381) T ss_pred EEEEcCEEecCceEEEEEEEe Confidence 999999999999999977766 No 82 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=3.6e-50 Score=291.63 Aligned_cols=279 Identities=12% Similarity=-0.007 Sum_probs=217.1 Q ss_pred Cccc-hh---hhHHHHHhhccccC-CCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccc Q lcl|NC_021307. 1 MAAG-TA---FPVNHTQIAQTGDS-MFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEG 75 (310) Q Consensus 1 ~aa~-~~---~~~~~~~~~~~~~~-~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (310) ..++ .. -+.....+..++++ .+|.++|+++.++|++.+++.|+++++|+++++++. .++|+.++.+.+.|++|+ T Consensus 58 ~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~i~~~~~~~~a~w~~e~ 136 (381) T protein:vir:95 58 LPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKFLKSETSGVAVWGKIY 136 (381) T ss_pred hccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcc-eEEEEecCCcceeeeccc Confidence 1111 11 11222334444544 455578889999999999999999999999998654 799999999999999999 Q ss_pred cccc-ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccc Q lcl|NC_021307. 76 DMKP-ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDL 154 (310) Q Consensus 76 ~~~~-~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 154 (310) ++++ +++++|+++++.+||++++++||+|||+|+.+++++||.++|++++++++|++|++|+|++.|.+++........ T Consensus 137 ~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~ 216 (381) T protein:vir:95 137 GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVS 216 (381) T ss_pred ccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccc Confidence 8876 568999999999999999999999999999999999999999999999999999999999999887643221110 Q ss_pred e--------ecccc-------hHHHHHHHHHHHhhh-------hcCCCCEEEEehHHHHHHHHhh---hccCcccccccc Q lcl|NC_021307. 155 T--------PATGT-------TYDAIGVNALSLLVN-------AGKKWGATLLDDVAEPILNGAK---DANGRPLFVEST 209 (310) Q Consensus 155 ~--------~~~~~-------~~~~~~~~~~~~l~~-------~~~~~~~~~~~~~~~~~l~~l~---d~~g~~~~~~~~ 209 (310) . .+.++ +..+.+.++...+.. .+..++.|+||+.++..++.++ +++|+|++.. T Consensus 217 ~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l-- 294 (381) T protein:vir:95 217 VTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL-- 294 (381) T ss_pred cccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecC-- Confidence 0 01111 111222233333321 3566788999999999988665 5567666431 Q ss_pred ccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEE Q lcl|NC_021307. 210 YEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRV 289 (310) Q Consensus 210 ~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ 289 (310) ..|++|+.++.||.++ ++||||++|.+++|++++++.+++. .|.+|++.||+ T Consensus 295 ------------~~g~~vv~s~~~p~~~--iifgDfs~Y~i~~r~~~~i~~~~~~--------------~~~~d~~~f~a 346 (381) T protein:vir:95 295 ------------PFNLNVIESTVQEAGK--VLTYVKGLYDGYLAGGINVQKFKET--------------LALDDMDLYTA 346 (381) T ss_pred ------------CCCceEEecCCCCcCc--EEEEecccEEEEEecccEEEeechh--------------HhhcCCeEEEE Confidence 2467899999999876 6799999999999999999998764 48999999999 Q ss_pred EEEeccEEeccCceEEEeecC Q lcl|NC_021307. 290 EAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 290 ~~~~d~~v~~~~a~~~l~~aa 310 (310) ..|+|+++.+++||++++.+- T Consensus 347 ~~r~dg~~~~~~A~~v~~l~~ 367 (381) T protein:vir:95 347 KQFAYGKAKDNKVAAVWKLDL 367 (381) T ss_pred EEEEcCEEecCceEEEEEEEe Confidence 999999999999999977766 No 83 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=1.8e-49 Score=287.87 Aligned_cols=276 Identities=15% Similarity=0.125 Sum_probs=217.6 Q ss_pred CccchhhhHHHHHhhccccCC-CCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcC-Cceeeeecccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSM-FQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTG-DVSAAWIGEGDMK 78 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~-~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~ 78 (310) |..+... ...++.++++. +|.++|+++..+|++.+++.++|+++|+++|+++++.++|+... ...+.|++|++.+ T Consensus 100 l~~~~~~---~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~ 176 (394) T protein:vir:10 100 IHSHGKV---IDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAEN 176 (394) T ss_pred Hhccchh---hhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEecCCCccccccccccc Confidence 1111111 11233334444 44567888899999999999999999999999998899998764 4668999999999 Q ss_pred cc-cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceec Q lcl|NC_021307. 79 PI-TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPA 157 (310) Q Consensus 79 ~~-~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~ 157 (310) |+ ++++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|++|++|.|++.+.... T Consensus 177 ~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~~----------- 245 (394) T protein:vir:10 177 PALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFTAKATT----------- 245 (394) T ss_pred cccccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc----------- Confidence 97 679999999999999999999999999999999999999999999999999999999876543221 Q ss_pred ccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCC--C- Q lcl|NC_021307. 158 TGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHV--A- 234 (310) Q Consensus 158 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~--~- 234 (310) +..+++++.......+...+ +++|+||+++|.+|++++|++|||+|+++.... .....+++|+|+||++++++ + T Consensus 246 ~~~~~d~l~~~~~~~~~~~~--~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~-~~~~~~~~L~G~PV~~~~~~~~~~ 322 (394) T protein:vir:10 246 TDTLVDSLKHILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSI-TDGTAKGTVLGVPVYVVGDALLGS 322 (394) T ss_pred ccccHHHHHHHHHhhhhhhc--cCEEEecHHHHHHHHHhhccCCCeeeecccccc-ccCCcccccccceeEEecccccCC Confidence 22334444432333444443 689999999999999999999999999877553 33455678999999987754 3 Q ss_pred -CCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 235 -SGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 235 -~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .++..+++|||++ +++++++++++..+++.. |. ..+|++.|+|+++.+++||+.|+.++ T Consensus 323 ~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~--------------~~---~~~~~~~r~d~~~~~~~ai~~~~~~~ 383 (394) T protein:vir:10 323 AAGDQKAFVGDLKRGVLFADRQQVTLAWEDSKI--------------YG---RYLGAAFRFGVKQADSNAGYFVTNTD 383 (394) T ss_pred CCCceEEEEeeccccEEEEeecceEEEEecccc--------------cc---eeEEEEEEeccEEeccccEEEEEeec Confidence 2456789999998 668888999998775532 23 45899999999999999999999887 No 84 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=3.1e-50 Score=291.98 Aligned_cols=279 Identities=13% Similarity=0.007 Sum_probs=216.9 Q ss_pred Cccchhh----hHHHHHhhccccCC-CCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccc Q lcl|NC_021307. 1 MAAGTAF----PVNHTQIAQTGDSM-FQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEG 75 (310) Q Consensus 1 ~aa~~~~----~~~~~~~~~~~~~~-~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (310) +.++... ..+...+...+++. +|.++|+++.++|++.+++.|+++++|+++++++ ..++|+.++.+.+.|++|+ T Consensus 58 ~~~~~~~l~~~e~~~~~~~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~-~~~i~~~~~~~~a~W~~e~ 136 (381) T protein:vir:10 58 LPKSAQTLSANQRNFFMDINKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIY 136 (381) T ss_pred hcccccccCHHHHHHHHHHhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCc-ceEEEeecCCcceEEeecc Confidence 2222211 11222244455544 4556788899999999999999999999999865 5789999999999999998 Q ss_pred cccc-ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccc Q lcl|NC_021307. 76 DMKP-ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDL 154 (310) Q Consensus 76 ~~~~-~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 154 (310) ++++ +++++|+++++.+||+++++++|+|||+|+.+++++||.++|++++++++|++|++|+|++.|.++......... T Consensus 137 ~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP~Gil~~~~~~~~ 216 (381) T protein:vir:10 137 GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVS 216 (381) T ss_pred cccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCCCceeeeecCCcccc Confidence 7765 668999999999999999999999999999999999999999999999999999999999999887643222111 Q ss_pred ee--------cccc-hHHH------HHHHHHHHh-------hhhcCCCCEEEEehHHHHHHHHhh---hccCcccccccc Q lcl|NC_021307. 155 TP--------ATGT-TYDA------IGVNALSLL-------VNAGKKWGATLLDDVAEPILNGAK---DANGRPLFVEST 209 (310) Q Consensus 155 ~~--------~~~~-~~~~------~~~~~~~~l-------~~~~~~~~~~~~~~~~~~~l~~l~---d~~g~~~~~~~~ 209 (310) .. +.++ +..+ .+..+...+ ...+..++.|+||+.++..++.++ +++|+|++.. T Consensus 217 ~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~~l-- 294 (381) T protein:vir:10 217 VTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL-- 294 (381) T ss_pred ccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCCCCceeecC-- Confidence 00 0001 1111 111111111 113456788999999999887644 7788877642 Q ss_pred ccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEE Q lcl|NC_021307. 210 YEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRV 289 (310) Q Consensus 210 ~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ 289 (310) ..|+||+.+++||+++ ++||||++|.+++|++++++.+++. .|.+|++.||+ T Consensus 295 ------------p~g~~vv~~~~~p~~~--i~fGDfs~Y~i~~r~~~~i~~~~~~--------------~~~~d~~~f~a 346 (381) T protein:vir:10 295 ------------PFNLNVIESTVQEAGK--VLTYVKGLYDGYLAGGINVQKFKET--------------LALDDMDLYTA 346 (381) T ss_pred ------------CCCceeEEcCCCCcCc--EEEEEcccEEEEEecccEEEeechh--------------hhhcCceEEEE Confidence 2578899999999876 6799999999999999999998764 48999999999 Q ss_pred EEEeccEEeccCceEEEeecC Q lcl|NC_021307. 290 EAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 290 ~~~~d~~v~~~~a~~~l~~aa 310 (310) ..|+|+++.+++||++++.+. T Consensus 347 ~~r~dG~~~~~~A~~v~~l~~ 367 (381) T protein:vir:10 347 KQFAYGKAKDNKVAAVWKLDL 367 (381) T ss_pred EEEEcCEEecCCcEEEEEEee Confidence 999999999999999999987 No 85 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=1.2e-49 Score=288.75 Aligned_cols=295 Identities=11% Similarity=0.042 Sum_probs=218.1 Q ss_pred Cccchhh---hHHHHHhhccccCCCCceechhh-HHHHHHHHHhhchhhhhcceeecCC--CceEEEEEcCCc-eeeeec Q lcl|NC_021307. 1 MAAGTAF---PVNHTQIAQTGDSMFQGYLEPEQ-AQDYFAEAEKTSIVQRVARKIPMGS--TGVKIPHWTGDV-SAAWIG 73 (310) Q Consensus 1 ~aa~~~~---~~~~~~~~~~~~~~~g~~i~~~~-~~~ii~~~~~~s~l~~~~~~~~~~~--~~~~ip~~~~~~-~a~~v~ 73 (310) ....... ..+......++++.+|+++||++ .++|++.+++.++++++++.+++++ +.++||+..+++ .+.|++ T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~ 219 (477) T protein:vir:84 140 SDKEIRKIAKVGEEYRDLDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAA 219 (477) T ss_pred hhhhHHHHHHhhhhhccccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeec Confidence 0000000 01112233445555667788886 5789999999999999999988764 468999976655 467999 Q ss_pred cccc-----ccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcc-ccccccc Q lcl|NC_021307. 74 EGDM-----KPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSP-FDKNLDE 147 (310) Q Consensus 74 Eg~~-----~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~-~~~~~~~ 147 (310) ||+. +|+++++|+++++.++|++++++||+|+|+|+.++++++|.++|++++++++|.+||+|+|++ .|.++.. T Consensus 220 Eg~~~~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~ 299 (477) T protein:vir:84 220 DNAALTAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRA 299 (477) T ss_pred cCcccccccccccccceeeEEEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeee Confidence 9864 468889999999999999999999999999999999999999999999999999999999975 4655554 Q ss_pred ccccccceec-ccch---HH---HHHHHHHHHhhhhcCC-CCEEEEehHHHHHHHHhhhccCcccccccccc-------- Q lcl|NC_021307. 148 TTKSVDLTPA-TGTT---YD---AIGVNALSLLVNAGKK-WGATLLDDVAEPILNGAKDANGRPLFVESTYE-------- 211 (310) Q Consensus 148 ~~~~~~~~~~-~~~~---~~---~~~~~~~~~l~~~~~~-~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~-------- 211 (310) .......... ...+ .+ ..+.++...+...+.. ...|+||++++..|++++|.+|+|+|+++... T Consensus 300 ~~~~~~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~ 379 (477) T protein:vir:84 300 TAGITQVTATSAGSALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLT 379 (477) T ss_pred ccccccccccccccchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCccccccccccc Confidence 3322111111 1111 11 1123334444444444 45799999999999999999999999886533 Q ss_pred ccccccCCceeeeeeEEEeCCCCCC------ceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcE Q lcl|NC_021307. 212 AVTTPYREGRILGRPTILSDHVASG------TTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLV 285 (310) Q Consensus 212 ~~~~~~~~~~l~G~pv~~t~~~~~~------~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (310) +.......++|+|+||++++.+|.+ ...++||||++++++. +++.++++++. .+.++++ T Consensus 380 ~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~-~~~~~~~~~~~--------------~~~~~~~ 444 (477) T protein:vir:84 380 EVASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFE-SSVRMRALQET--------------RAENLSV 444 (477) T ss_pred ccccccccchhcccceEecCcccccccccCCcceEEEEEeceEEEEe-eceeEEecccc--------------cccccee Confidence 2233455678999999999999964 3467899999999886 56777776543 2456788 Q ss_pred EEEEEEEecc-EEeccCceEEEeecC Q lcl|NC_021307. 286 AVRVEAEYGL-LINDVEAFVKLTNAA 310 (310) Q Consensus 286 ~~r~~~~~d~-~v~~~~a~~~l~~aa 310 (310) .||+..++++ .+++|+||+.++++| T Consensus 445 ~~~v~~~~~~~~~r~~~afv~~t~~~ 470 (477) T protein:vir:84 445 LLQVYGYLAFTAARFPQSVVEIGGTA 470 (477) T ss_pred eeeehhhhhhhhhccccceEEeeccc Confidence 8988888887 556799999999999 No 86 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=5.7e-49 Score=285.07 Aligned_cols=274 Identities=15% Similarity=0.143 Sum_probs=216.1 Q ss_pred CccchhhhHHHHHhhccc-cCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcC-Cceeeeecccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTG-DSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTG-DVSAAWIGEGDMK 78 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~-~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~ 78 (310) |..+. ........+ ++.+|.+||+++..+|++.+++.++|+++|+++|++++..++|+... ...+.|++|++.+ T Consensus 99 lr~~~----~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~ 174 (389) T protein:vir:10 99 IHSHG----KVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAEN 174 (389) T ss_pred hhcch----hhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEecCCCccccccccccc Confidence 11111 111222233 34455578888899999999999999999999999998899998865 4556899999999 Q ss_pred cc-cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceec Q lcl|NC_021307. 79 PI-TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPA 157 (310) Q Consensus 79 ~~-~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~ 157 (310) ++ ++++|+++++.++|+++++++|+|+++|+.++++++|.+.|++++++++|.+|++|.+++.+.+ .. T Consensus 175 ~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~-----------~~ 243 (389) T protein:vir:10 175 PKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFTAKK-----------TT 243 (389) T ss_pred cccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc-----------cc Confidence 85 7899999999999999999999999999999999999999999999999999999988764322 12 Q ss_pred ccchHHHHHHHHHH-HhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCC--C Q lcl|NC_021307. 158 TGTTYDAIGVNALS-LLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHV--A 234 (310) Q Consensus 158 ~~~~~~~~~~~~~~-~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~--~ 234 (310) +..+++++. +++. .+...+ +++|+||++++..|+++||++|+|+|+++..... ..+.+++|+|+||++++++ + T Consensus 244 ~~~~~d~l~-~~~~~~~~~~~--~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~-~~~~~~~l~G~pV~~~~~~~~~ 319 (389) T protein:vir:10 244 TDTLVDSLK-HILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSIT-DGTAKGTILGVPVYVVGDTLLG 319 (389) T ss_pred ccccHHHHH-HHHHhhhhhhh--CcEEEecHHHHHHHHHhhccCCCeeeecCccccc-ccccccccccceeEEecccccC Confidence 233444443 4443 444433 6899999999999999999999999988765432 2345678999999887654 2 Q ss_pred --CCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 235 --SGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 235 --~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .++..++||||++ +.+++++++++..+++.. |. ..+|+..|+|+++.+++||++++.++ T Consensus 320 ~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~~---~~~~~~~r~d~~~~~~~a~~~~~~~~ 381 (389) T protein:vir:10 320 SLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDSKI--------------YG---KYLGAAFRFGVQKADSKAGYFVTNTD 381 (389) T ss_pred CCCCceEEEEeeccccEEEEeecceEEEeecccc--------------cc---ceEEEEEEeccEEecccceEEEEeec Confidence 2456689999998 679999999999886543 22 46899999999999999999999776 No 87 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=4.9e-49 Score=285.45 Aligned_cols=276 Identities=11% Similarity=0.065 Sum_probs=214.3 Q ss_pred CccchhhhH-HHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEc-CCceeeeeccccc Q lcl|NC_021307. 1 MAAGTAFPV-NHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWT-GDVSAAWIGEGDM 77 (310) Q Consensus 1 ~aa~~~~~~-~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~-~~~~a~~v~Eg~~ 77 (310) ..+...... ........+++..+| ++|+++. .++..+++.+++++++++++++++...+|+.. ..+.+.|++|++. T Consensus 141 ~~~~~~~~~~~e~~~~~~~~~~~~g~lvp~~~~-~~i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 219 (437) T protein:vir:10 141 VTAFADYLKTGEVRDVTGIALKDGKVIIPETIL-TPEKEVHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQ 219 (437) T ss_pred hhhhHHHHHhhhhhhhhhcccccccccchHHHH-HHHHHhhhhhhhhhcceeEeeccCceeeEEeecccccccccccccc Confidence 111111111 111223333444444 5555554 45566788999999999999998889999885 4567899999999 Q ss_pred ccc-cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccccccccee Q lcl|NC_021307. 78 KPI-TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTP 156 (310) Q Consensus 78 ~~~-~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~ 156 (310) +++ ++++|+++++.++|++++++||+|+++|+.++++++|.++|+++++.++|.+|++|+|++.+... T Consensus 220 ~~e~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~----------- 288 (437) T protein:vir:10 220 TTKNATPVITPILWDLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKKTT----------- 288 (437) T ss_pred ccccccccceeeeeehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc----------- Confidence 996 55899999999999999999999999999999999999999999999999999999997654211 Q ss_pred cccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCC--C Q lcl|NC_021307. 157 ATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHV--A 234 (310) Q Consensus 157 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~--~ 234 (310) ...+.+++...+...+...+..+++|+||++++..|++++|++|+|+|+++...+. +++|+|+||++++++ | T Consensus 289 -~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~~~~-----~~~l~G~pv~~~~~~~~~ 362 (437) T protein:vir:10 289 -STYLLGDLKKVLNVTLKPQDSAAASIVMSQSAYNLFDMATDAMGRPLLQPNVTAAT-----GYTLLGKTVVIVDDKLFP 362 (437) T ss_pred -cccchhhHHHHHHhhhhhhhhcCCEEEEcHHHHHHHHHhhccCCCeeeccCccCCC-----CcccccceeEEecccccC Confidence 11123333322334678888899999999999999999999999999998776543 468999999998764 3 Q ss_pred ---CCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 235 ---SGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 235 ---~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .++..++||||++ +.++++.+++++.++. |..+.+.+|+..|+|+++.+++||++|+.+. T Consensus 363 ~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~----------------~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~ 426 (437) T protein:vir:10 363 SASAGDVNIVVAPLKKAVINFKLTEITGQFQDT----------------YDIWYKQLGIFLRQNVVQASKDLIVNLTGKL 426 (437) T ss_pred CcCCCceEEEEeeccccEEEEeeeceEEEEecc----------------cccccceeeEEEEEccEEecccceEEEEeec Confidence 3566789999997 5588899999876532 4455678899999999999999999999775 No 88 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=2.8e-49 Score=286.76 Aligned_cols=284 Identities=14% Similarity=0.077 Sum_probs=216.2 Q ss_pred Cc--------cch---hhhHHHH--HhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCc Q lcl|NC_021307. 1 MA--------AGT---AFPVNHT--QIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDV 67 (310) Q Consensus 1 ~a--------a~~---~~~~~~~--~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~ 67 (310) +. ... .+-.+.. .....+.+.++.++|.+++..|++.+++.++++++++++|+++. .++|+....+ T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g~-~~~~~~~~~~ 201 (466) T protein:vir:80 123 MPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLKGT-ARQNIAGAIP 201 (466) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhhhheeeeecCce-eEeeeecCCc Confidence 00 000 0001111 11122333333467778889999999999999999999998654 7899988888 Q ss_pred eeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccc Q lcl|NC_021307. 68 SAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDE 147 (310) Q Consensus 68 ~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~ 147 (310) .+.|++|++.+++++++|+++++.+||++++++||+|+++|+.+++++||.++|+++++.++|.+|++|+|++.|.+++. T Consensus 202 ~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~ 281 (466) T protein:vir:80 202 EGVWTEAVANLNELSLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVT 281 (466) T ss_pred ceeecccccccccccccccceeecceeeeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998887654 Q ss_pred ccccccceeccc--------chH-------------HHHHHH---HHHHhhhhc-CCCCEEEEehHHHHHHHHhh---hc Q lcl|NC_021307. 148 TTKSVDLTPATG--------TTY-------------DAIGVN---ALSLLVNAG-KKWGATLLDDVAEPILNGAK---DA 199 (310) Q Consensus 148 ~~~~~~~~~~~~--------~~~-------------~~~~~~---~~~~l~~~~-~~~~~~~~~~~~~~~l~~l~---d~ 199 (310) ............ .+. .....+ ....+...+ .....|+||+.++..|..++ +. T Consensus 282 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~ 361 (466) T protein:vir:80 282 RLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNS 361 (466) T ss_pred cccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccC Confidence 322221111100 000 000111 112222333 34456999999999998887 56 Q ss_pred cCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhh Q lcl|NC_021307. 200 NGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSL 279 (310) Q Consensus 200 ~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~ 279 (310) +|.+++.+.. ...++|+||+++++||.++ +++|||++|++++|+++++..+++. . T Consensus 362 ~g~~~~~~~~---------~~~i~G~pvv~s~~~~~~~--~~~g~~~~y~i~~r~~~~i~~~~~~--------------~ 416 (466) T protein:vir:80 362 AGALVASLNN---------TMPIVGGDIVILDFIPDND--IIGGYGSLYLLAERADIKLAQSEHV--------------R 416 (466) T ss_pred CccccccCCC---------cccccccceeecCccCccc--eeeeccccEEEEeecceEEEechhh--------------h Confidence 6776665432 2248999999999999886 5789999999999999999988653 4 Q ss_pred hhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 280 WQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 280 ~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) |.+|++.||++.|+|+++.+++||++++.+. T Consensus 417 f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~ 447 (466) T protein:vir:80 417 FIEDQTVFKGTARYDGKPVFGEGFVAVNIAN 447 (466) T ss_pred hhcCcEEEEEEEEEccEEeccCceEEEEecC Confidence 8899999999999999999999999998887 No 89 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=8.9e-49 Score=284.01 Aligned_cols=274 Identities=12% Similarity=0.098 Sum_probs=215.2 Q ss_pred CccchhhhHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcC-Cceeeeecccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTG-DVSAAWIGEGDMK 78 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~ 78 (310) .............+...+++.+|| +||+++..+|++.+++.++|+++++++++++ ..+|+... .+++.|++|++.+ T Consensus 69 ~~~~~~~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~E~~~~ 146 (352) T protein:vir:78 69 FEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETA 146 (352) T ss_pred HHHHHhhHHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC--ceEEEEecCCCccccccccccc Confidence 111111112223445555655555 5677889999999999999999999998754 46677554 4689999999999 Q ss_pred cccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHH-HHHcccCcccccccccccccccceec Q lcl|NC_021307. 79 PITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDE-AALHGTDSPFDKNLDETTKSVDLTPA 157 (310) Q Consensus 79 ~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~-~~l~G~g~~~~~~~~~~~~~~~~~~~ 157 (310) ++++++|+++++.+||++++++||+|+|+|+.+++++||.++|+++++++++. .|.+|+|++.+.++........ .. T Consensus 147 ~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~--~t 224 (352) T protein:vir:78 147 KELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKE--VE 224 (352) T ss_pred ccccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceecccccc--cc Confidence 99999999999999999999999999999999999999999999999998655 5667777777766554322222 12 Q ss_pred ccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCc Q lcl|NC_021307. 158 TGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGT 237 (310) Q Consensus 158 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~ 237 (310) +..++ +.+.++...+...+..+++|+||+.++..|.++++.+|++++.. .+.+|+|+||++++.++. T Consensus 225 ~~~~~-d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~~~~~----------~~~~llG~PV~~~~~~~~-- 291 (352) T protein:vir:78 225 GANMY-DAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDT----------PAEKVFGKPVVFTDAAVK-- 291 (352) T ss_pred ccchH-HHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhccCCccccc----------CCccccccceEEecCCCc-- Confidence 22334 44557888999999999999999999999999999999988742 235799999999987653 Q ss_pred eeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 238 TVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 238 ~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++||||+++++. +.++.++.+++ ..++++.|++..|+|+++.+++||+.++.+| T Consensus 292 --~~~Gdf~~~~~~-~~~~~~~~~~~----------------~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a 345 (352) T protein:vir:78 292 --PIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 345 (352) T ss_pred --eeEeehhhhhhh-hhhheeeeecc----------------ccCCeeEEEEEeeeCceeechhheEEEEeec Confidence 578999998775 45666665544 2368999999999999999999999999999 No 90 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=2.6e-48 Score=281.44 Aligned_cols=280 Identities=13% Similarity=0.020 Sum_probs=213.0 Q ss_pred Cccchh-hh---HHHHHhhccccC-CCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccc Q lcl|NC_021307. 1 MAAGTA-FP---VNHTQIAQTGDS-MFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEG 75 (310) Q Consensus 1 ~aa~~~-~~---~~~~~~~~~~~~-~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (310) ..++.+ +. .........+++ .+|.+||++++++|++.+++.|+++++|+++|+++ ..++|+.++.+.+.|++|. T Consensus 68 ~~r~~~~l~~ee~~~~~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~-~~~i~~~~~~~~a~w~~e~ 146 (395) T protein:vir:95 68 AKRSQDPLTSEERKFFNDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGI-KTRVIKADPAGQAVWGKVF 146 (395) T ss_pred hhcCccccchHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceEEeecc Confidence 122211 11 111223334444 44556788899999999999999999999999965 4799999999999999887 Q ss_pred ccc-cccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccc--ccccccccccc Q lcl|NC_021307. 76 DMK-PITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPF--DKNLDETTKSV 152 (310) Q Consensus 76 ~~~-~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~--~~~~~~~~~~~ 152 (310) +++ ++++++|+++++.+||++++++||+|||+|+.+++++||.+.|++++++++|++|++|+|++. |.++....... T Consensus 147 ~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~ 226 (395) T protein:vir:95 147 GEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTN 226 (395) T ss_pred cccCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeeccccc Confidence 666 568899999999999999999999999999999999999999999999999999999999974 66665433332 Q ss_pred ccee----cccchHHHHHHHHHHHh--------------hhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccc Q lcl|NC_021307. 153 DLTP----ATGTTYDAIGVNALSLL--------------VNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVT 214 (310) Q Consensus 153 ~~~~----~~~~~~~~~~~~~~~~l--------------~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~ 214 (310) .... ..+....+........+ ...+..+.+|+||++++. |..|+|+|++.. | T Consensus 227 ~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~------~~~g~~~~~~~~--G-- 296 (395) T protein:vir:95 227 SGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSW------DVQARYTYLTAN--G-- 296 (395) T ss_pred ccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhh------hcCCcceeccCC--C-- Confidence 2211 11111222221222211 113445778999999875 456899998732 2 Q ss_pred cccCCcee--eeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEE Q lcl|NC_021307. 215 TPYREGRI--LGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAE 292 (310) Q Consensus 215 ~~~~~~~l--~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~ 292 (310) .+.++ +|+||+.++.||+++ ++||||++|++++|++++++++++. .|.+|++.||+..| T Consensus 297 ---~~~~~lg~g~~v~~~~~~p~~~--i~fgdfs~y~i~~r~~~~i~~~~~~--------------~~~~d~~~f~~~~r 357 (395) T protein:vir:95 297 ---GFVTVLPYNVTIITSEFVPEGK--LVAFVTDRYNAVRGGGLTVKKFDQT--------------LALEDAVLFTAKTF 357 (395) T ss_pred ---cceeccCCcceEEEcCCCCCCc--EEEEecccEEEEEecceEEEeccch--------------hhhCCcEEEEEEEE Confidence 22345 466789999999886 5789999999999999999988764 38899999999999 Q ss_pred eccEEeccCceEEEeecC Q lcl|NC_021307. 293 YGLLINDVEAFVKLTNAA 310 (310) Q Consensus 293 ~d~~v~~~~a~~~l~~aa 310 (310) +|+++.+++||++|+... T Consensus 358 ~dg~~~~~~A~~~l~i~~ 375 (395) T protein:vir:95 358 AYGQPDDNKASAVYDLKV 375 (395) T ss_pred ECCEEeccccEEEEEeec Confidence 999999999999998875 No 91 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=2.6e-48 Score=281.43 Aligned_cols=279 Identities=14% Similarity=0.027 Sum_probs=211.4 Q ss_pred Cccchhh-hHHH---HH-hh-ccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecc Q lcl|NC_021307. 1 MAAGTAF-PVNH---TQ-IA-QTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGE 74 (310) Q Consensus 1 ~aa~~~~-~~~~---~~-~~-~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E 74 (310) ..++... ..+. .. .. ..+...+|.+||+++++.|++.+.+.|+++++|+++++++ ..++|+.++.+.+.|++| T Consensus 60 ~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~i~~~~~~~~a~wv~e 138 (377) T protein:vir:96 60 LRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDI 138 (377) T ss_pred hccCCcccCHHHHHHHHHHHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceeEeec Confidence 1111111 1111 11 11 2334444557888899999999999999999999999865 589999999999999999 Q ss_pred ccccc-ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccccccc Q lcl|NC_021307. 75 GDMKP-ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVD 153 (310) Q Consensus 75 g~~~~-~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~ 153 (310) +++++ +++++|+++++.+||++++++||+|||+|+.+++++||.+++++++++++|++|++|+|++.|.+++....... T Consensus 139 ~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~ 218 (377) T protein:vir:96 139 FGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPT 218 (377) T ss_pred ccccccccCccceeEeeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccc Confidence 98876 56899999999999999999999999999999999999999999999999999999999999988875333222 Q ss_pred ceec------------------ccchHHHHHHHHHHHhhhhc-----------CCCCEEEEehHHHHHHHHhhhccCccc Q lcl|NC_021307. 154 LTPA------------------TGTTYDAIGVNALSLLVNAG-----------KKWGATLLDDVAEPILNGAKDANGRPL 204 (310) Q Consensus 154 ~~~~------------------~~~~~~~~~~~~~~~l~~~~-----------~~~~~~~~~~~~~~~l~~l~d~~g~~~ 204 (310) .... ...+.+. +.++.+.+...+ ..+++|+||+.++..+ .|++. T Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~------~~~~~ 291 (377) T protein:vir:96 219 VDQSTGRDITTYKTDKEAIADLSDLDPDT-AVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL------EAKFT 291 (377) T ss_pred ccccccccccceeeccccccccccCChhH-HHHHHHHHHHhhccccccccccccCceEEEEchhhHHhc------ccccc Confidence 1110 0111122 223333332222 2466799999998765 34555 Q ss_pred cccccccccccccCCceeeeee--EEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhc Q lcl|NC_021307. 205 FVESTYEAVTTPYREGRILGRP--TILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQH 282 (310) Q Consensus 205 ~~~~~~~~~~~~~~~~~l~G~p--v~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 282 (310) +++.. | .+.+++|+| ++.++.+|+++ ++||||++|.++++++++++.+++. .|.+ T Consensus 292 ~~~~~--G-----~~~~~l~~p~~v~~s~~~p~~~--i~fgdf~~Y~i~~r~~~~i~~~~~~--------------~~~~ 348 (377) T protein:vir:96 292 SRNQF--G-----EYVTVLPHGITILESLAVETGK--AIAFVANRYDAFMATASTIEEYDQT--------------FAME 348 (377) T ss_pred ccCCC--C-----CceeccCCCceEEecCCCCccc--EEEEEcCcEEEEEecccEEEeehhh--------------hhhc Confidence 55421 1 233566666 56778899875 6799999999999999999998764 4899 Q ss_pred CcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 283 NLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 283 ~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) |++.||+..|+|+++.+++||++|+.+= T Consensus 349 d~~~f~~~~r~dG~~~d~~a~~vl~l~~ 376 (377) T protein:vir:96 349 DLQLYLTKNYFYGKAKDNHTAALLTLAG 376 (377) T ss_pred CCeEEEEEEEEcCEEecCCcEEEEEEec Confidence 9999999999999999999999999888 No 92 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=4.7e-48 Score=280.06 Aligned_cols=274 Identities=12% Similarity=0.106 Sum_probs=211.8 Q ss_pred CccchhhhHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEc-CCceeeeecccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWT-GDVSAAWIGEGDMK 78 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~-~~~~a~~v~Eg~~~ 78 (310) ......-.........++++.+|| +||+++..+|++.+++.++|+++++++++++ ..+|+.. ....+.|++||+.+ T Consensus 104 ~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~ 181 (387) T protein:vir:26 104 FEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETA 181 (387) T ss_pred HHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCccccccccccc Confidence 011111112223344555555555 5777889999999999999999999999865 4567754 45679999999999 Q ss_pred cccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHH-HHHcccCcccccccccccccccceec Q lcl|NC_021307. 79 PITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDE-AALHGTDSPFDKNLDETTKSVDLTPA 157 (310) Q Consensus 79 ~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~-~~l~G~g~~~~~~~~~~~~~~~~~~~ 157 (310) ++++++|+++++.++|++++++||+|+|+||.+++++||.++|+++++++++. .|.+|+|++.+.++...... . ..+ T Consensus 182 ~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~-~-~~~ 259 (387) T protein:vir:26 182 KELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSV-K-EVE 259 (387) T ss_pred cccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccc-c-ccc Confidence 99999999999999999999999999999999999999999999999999765 56677777776655433221 1 112 Q ss_pred ccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCc Q lcl|NC_021307. 158 TGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGT 237 (310) Q Consensus 158 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~ 237 (310) +..++ +.+.+++..+...+..+++|+||+.++..+..+++..|++++.. .+.+|+|+||++++.++. T Consensus 260 ~~~~~-d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~~----------~~~~llG~PV~~~~~~~~-- 326 (387) T protein:vir:26 260 GADMY-DAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDT----------PAEKVFGKPVVFTDAAVK-- 326 (387) T ss_pred ccchH-HHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCccccc----------CCccccccceEEecCCCc-- Confidence 23334 44557888999999999999999999988877777777777742 245799999999998653 Q ss_pred eeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 238 TVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 238 ~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++||||+++++. +.++.+..+++ ..+|++.|+++.|+|+++.+++||++|+.+| T Consensus 327 --~~~GDf~~~~~~-~~~~~~~~~~~----------------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka 380 (387) T protein:vir:26 327 --PIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred --eeeechhhhhhh-hhhhhheeccc----------------ccCCceEEEEEEEeCcEeechhheEEEEeec Confidence 679999998765 45666554433 2468999999999999999999999999988 No 93 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=4.7e-48 Score=280.06 Aligned_cols=274 Identities=12% Similarity=0.106 Sum_probs=211.8 Q ss_pred CccchhhhHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEc-CCceeeeecccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWT-GDVSAAWIGEGDMK 78 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~-~~~~a~~v~Eg~~~ 78 (310) ......-.........++++.+|| +||+++..+|++.+++.++|+++++++++++ ..+|+.. ....+.|++||+.+ T Consensus 104 ~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~ 181 (387) T protein:vir:96 104 FEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETA 181 (387) T ss_pred HHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCccccccccccc Confidence 011111112223344555555555 5777889999999999999999999999865 4567754 45679999999999 Q ss_pred cccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHH-HHHcccCcccccccccccccccceec Q lcl|NC_021307. 79 PITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDE-AALHGTDSPFDKNLDETTKSVDLTPA 157 (310) Q Consensus 79 ~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~-~~l~G~g~~~~~~~~~~~~~~~~~~~ 157 (310) ++++++|+++++.++|++++++||+|+|+||.+++++||.++|+++++++++. .|.+|+|++.+.++...... . ..+ T Consensus 182 ~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~-~-~~~ 259 (387) T protein:vir:96 182 KELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSV-K-EVE 259 (387) T ss_pred cccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccc-c-ccc Confidence 99999999999999999999999999999999999999999999999999765 56677777776655433221 1 112 Q ss_pred ccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCc Q lcl|NC_021307. 158 TGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGT 237 (310) Q Consensus 158 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~ 237 (310) +..++ +.+.+++..+...+..+++|+||+.++..+..+++..|++++.. .+.+|+|+||++++.++. T Consensus 260 ~~~~~-d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~~----------~~~~llG~PV~~~~~~~~-- 326 (387) T protein:vir:96 260 GADMY-DAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDT----------PAEKVFGKPVVFTDAAVK-- 326 (387) T ss_pred ccchH-HHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCccccc----------CCccccccceEEecCCCc-- Confidence 23334 44557888999999999999999999988877777777777742 245799999999998653 Q ss_pred eeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 238 TVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 238 ~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++||||+++++. +.++.+..+++ ..+|++.|+++.|+|+++.+++||++|+.+| T Consensus 327 --~~~GDf~~~~~~-~~~~~~~~~~~----------------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka 380 (387) T protein:vir:96 327 --PIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred --eeeechhhhhhh-hhhhhheeccc----------------ccCCceEEEEEEEeCcEeechhheEEEEeec Confidence 679999998765 45666554433 2468999999999999999999999999988 No 94 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=4.7e-48 Score=280.06 Aligned_cols=274 Identities=12% Similarity=0.106 Sum_probs=211.8 Q ss_pred CccchhhhHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEc-CCceeeeecccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWT-GDVSAAWIGEGDMK 78 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~-~~~~a~~v~Eg~~~ 78 (310) ......-.........++++.+|| +||+++..+|++.+++.++|+++++++++++ ..+|+.. ....+.|++||+.+ T Consensus 104 ~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~ 181 (387) T protein:vir:94 104 FEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETA 181 (387) T ss_pred HHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCccccccccccc Confidence 011111112223344555555555 5777889999999999999999999999865 4567754 45679999999999 Q ss_pred cccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHH-HHHcccCcccccccccccccccceec Q lcl|NC_021307. 79 PITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDE-AALHGTDSPFDKNLDETTKSVDLTPA 157 (310) Q Consensus 79 ~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~-~~l~G~g~~~~~~~~~~~~~~~~~~~ 157 (310) ++++++|+++++.++|++++++||+|+|+||.+++++||.++|+++++++++. .|.+|+|++.+.++...... . ..+ T Consensus 182 ~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~-~-~~~ 259 (387) T protein:vir:94 182 KELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSV-K-EVE 259 (387) T ss_pred cccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccc-c-ccc Confidence 99999999999999999999999999999999999999999999999999765 56677777776655433221 1 112 Q ss_pred ccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCc Q lcl|NC_021307. 158 TGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGT 237 (310) Q Consensus 158 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~ 237 (310) +..++ +.+.+++..+...+..+++|+||+.++..+..+++..|++++.. .+.+|+|+||++++.++. T Consensus 260 ~~~~~-d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~~----------~~~~llG~PV~~~~~~~~-- 326 (387) T protein:vir:94 260 GADMY-DAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDT----------PAEKVFGKPVVFTDAAVK-- 326 (387) T ss_pred ccchH-HHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCccccc----------CCccccccceEEecCCCc-- Confidence 23334 44557888999999999999999999988877777777777742 245799999999998653 Q ss_pred eeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 238 TVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 238 ~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++||||+++++. +.++.+..+++ ..+|++.|+++.|+|+++.+++||++|+.+| T Consensus 327 --~~~GDf~~~~~~-~~~~~~~~~~~----------------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka 380 (387) T protein:vir:94 327 --PIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred --eeeechhhhhhh-hhhhhheeccc----------------ccCCceEEEEEEEeCcEeechhheEEEEeec Confidence 679999998765 45666554433 2468999999999999999999999999988 No 95 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=6.4e-48 Score=279.31 Aligned_cols=274 Identities=12% Similarity=0.104 Sum_probs=210.6 Q ss_pred Cc-----cchhhhHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEc-CCceeeeec Q lcl|NC_021307. 1 MA-----AGTAFPVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWT-GDVSAAWIG 73 (310) Q Consensus 1 ~a-----a~~~~~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~-~~~~a~~v~ 73 (310) +. ....-.........++++.+|| +||+++..+|++.+++.++|+++|+++++++ ..+|+.. ....+.|++ T Consensus 114 ~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~ 191 (402) T protein:vir:93 114 ILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFIT 191 (402) T ss_pred HhhhhHHHHHHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCC--ceeeeeeccCCcccccc Confidence 00 0000112223344555555555 6778889999999999999999999999854 4677764 456789999 Q ss_pred ccccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHH-HHHcccCcccccccccccccc Q lcl|NC_021307. 74 EGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDE-AALHGTDSPFDKNLDETTKSV 152 (310) Q Consensus 74 Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~-~~l~G~g~~~~~~~~~~~~~~ 152 (310) ||+.+++++++|+++++.++|++++++||+|+++||.+++++||.++|+++++++++. .|..|+|++.+.++...... T Consensus 192 Eg~~~~~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~- 270 (402) T protein:vir:93 192 DVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSV- 270 (402) T ss_pred ccccccccccccceeeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccc- Confidence 9999999999999999999999999999999999999999999999999999999765 56677887777665533221 Q ss_pred cceecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCC Q lcl|NC_021307. 153 DLTPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDH 232 (310) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~ 232 (310) ...++.+..+.+.+++..+...+..+++|+||+.++..+..+++..|++++.. .+.+|+|+||++++. T Consensus 271 --~~~~~~~~~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~~~~----------~~~~llG~PV~~t~~ 338 (402) T protein:vir:93 271 --KEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDT----------PAEKVFGKPVVFTDA 338 (402) T ss_pred --ccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCccccc----------CCccccccceEEecC Confidence 11222233345567888999999999999999999988777766677777642 245799999999998 Q ss_pred CCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 233 VASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 233 ~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++. ++||||+++++. +.++.+..+++ ..++++.|++..|+|+++.+++||+.|+.+| T Consensus 339 ~~~----i~~GDf~~~~~~-~~~~~~~~~~~----------------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~ 395 (402) T protein:vir:93 339 AVK----PIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 395 (402) T ss_pred CCc----eeeechhhhhhh-hhhhhhhhhhc----------------ccCCceEEEEEEEeCcEEechhheEEEEeec Confidence 653 678999987765 34555544433 2368999999999999999999999999988 No 96 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=2.4e-47 Score=276.21 Aligned_cols=273 Identities=12% Similarity=0.099 Sum_probs=209.1 Q ss_pred CccchhhhHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEc-CCceeeeecccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWT-GDVSAAWIGEGDMK 78 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~-~~~~a~~v~Eg~~~ 78 (310) .+..............++++.+|| +||+++..+|++.+++.++|+++|+++++++ ..+|+.. ...++.|++|++.+ T Consensus 104 ~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~E~~~~ 181 (387) T protein:vir:93 104 FEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETA 181 (387) T ss_pred hhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCC--ceEEEEeecCCccccccCcccc Confidence 111111112233444556655555 5777888999999999999999999999864 4677754 45779999999999 Q ss_pred cccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHH-HHHcccCcccccccccccccccceec Q lcl|NC_021307. 79 PITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDE-AALHGTDSPFDKNLDETTKSVDLTPA 157 (310) Q Consensus 79 ~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~-~~l~G~g~~~~~~~~~~~~~~~~~~~ 157 (310) ++++++|+++++.++|++++++||+|+++||.+++++||.++|+++++++++. .|.+|+|++.+.++...... ... T Consensus 182 ~~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~---~~v 258 (387) T protein:vir:93 182 KELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSV---KEV 258 (387) T ss_pred cccccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccc---ccc Confidence 99999999999999999999999999999999999999999999999999776 56677888777665533221 112 Q ss_pred ccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHH-HhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCC Q lcl|NC_021307. 158 TGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILN-GAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASG 236 (310) Q Consensus 158 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~-~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~ 236 (310) ++....+.+.+++..+...++.+++|+||+.++..+. +++|.+| +++. +.+.+|+|+||++++.++. T Consensus 259 ~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~-~~~~----------~~~~~llG~PV~~~~~~~~- 326 (387) T protein:vir:93 259 EGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTT-NFFD----------TPAEKVFGKPVVFTDAAVK- 326 (387) T ss_pred cccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCC-cccc----------cCCccccccceEEecCCCc- Confidence 2223334456788899999999999999999987765 4555544 4442 1245799999999988653 Q ss_pred ceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 237 TTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 237 ~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++||||+++++. +.++.+..+++ +.++++.|++..|+|+++.+++||+.++.++ T Consensus 327 ---~~~GDf~~~~~~-~~~~~~~~~~~----------------~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~ 380 (387) T protein:vir:93 327 ---PIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred ---eeeeehhhhhee-hhhheeeeccc----------------ccCCceeEEEEeeeCceeechhheEEEEeec Confidence 578999998775 55666654432 3578999999999999999999999998877 No 97 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=4.1e-47 Score=274.90 Aligned_cols=291 Identities=13% Similarity=-0.038 Sum_probs=205.8 Q ss_pred Cccchh-hhH---HHHHhhcccc-CCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccc Q lcl|NC_021307. 1 MAAGTA-FPV---NHTQIAQTGD-SMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEG 75 (310) Q Consensus 1 ~aa~~~-~~~---~~~~~~~~~~-~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (310) ..+|.. +.. .......+++ +.+|.+||++++++|++.+++.|+++++|+++|+++. .++|+.++.+.+.|++|+ T Consensus 65 ~~~g~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~~-~~i~~~~~~~~a~w~~e~ 143 (383) T protein:vir:78 65 ASRTDKNITNEEIKFFNDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGLR-TKFLKSETSGVAVWGKIF 143 (383) T ss_pred hcCChhhhhHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCCc-eEEEEEcCCcceEEeecc Confidence 222221 111 1122333444 4455578888999999999999999999999998765 799999999999999998 Q ss_pred cccc-ccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccc Q lcl|NC_021307. 76 DMKP-ITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDL 154 (310) Q Consensus 76 ~~~~-~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 154 (310) ++++ +++++|+++++.+||++++++||+|||+|+.+++++||.+++++++++++|++|++|+|++.|.++...+..... T Consensus 144 ~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~ 223 (383) T protein:vir:78 144 GEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVGKGST 223 (383) T ss_pred cccccccCcceeeEeecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccCCccc Confidence 8765 578999999999999999999999999999999999999999999999999999999999999887653322111 Q ss_pred e--------ecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhh---hccCccccccccccccccccCCceee Q lcl|NC_021307. 155 T--------PATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAK---DANGRPLFVESTYEAVTTPYREGRIL 223 (310) Q Consensus 155 ~--------~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~---d~~g~~~~~~~~~~~~~~~~~~~~l~ 223 (310) . ...+...++....+...+. .++.+..|+||..++..+++++ +..+.+.+++....... .+.+.+++ T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~-~G~~~t~l 301 (383) T protein:vir:78 224 VVDGVYAEKAATGTLTFANPKTTVNELT-DVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYTSLNA-NGVYVTAL 301 (383) T ss_pred ccccccccccccchhhhhhhHHHHHHHH-HHHhccchhcccchhhhcCceEEEEcCcchhhhccchhccCC-CCceeeec Confidence 1 1112222223323333333 3334444555544444444433 11111111111110000 11223455 Q ss_pred eee--EEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccC Q lcl|NC_021307. 224 GRP--TILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVE 301 (310) Q Consensus 224 G~p--v~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~ 301 (310) |+| ++.++++|+++ ++||||++|.++++++++++.+++. .|.+|++.||+..|+|+++.+++ T Consensus 302 ~~~~~iv~s~~~p~~~--iifgdfs~Y~i~~r~~~~i~~~~~~--------------~f~~d~~~f~~~~r~dG~~~~~~ 365 (383) T protein:vir:78 302 PFNLNIIESLFVPEKK--AISYVAERYDALIGGPLDIGTYDQT--------------LAIEDLNLYAAKQFAYGKAKDDK 365 (383) T ss_pred CCCceEEecCCCCccc--EEEeeccceEEEecccceEEecchh--------------hhhcCceEEEEEEEEcCEEecCC Confidence 555 67789999876 5789999999999999999988664 48999999999999999999999 Q ss_pred ceEEEeecC Q lcl|NC_021307. 302 AFVKLTNAA 310 (310) Q Consensus 302 a~~~l~~aa 310 (310) ||++|+.+= T Consensus 366 A~~vl~~~~ 374 (383) T protein:vir:78 366 AAAVWTLNI 374 (383) T ss_pred eEEEEEEEe Confidence 999866544 No 98 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=1.3e-46 Score=272.22 Aligned_cols=271 Identities=15% Similarity=0.115 Sum_probs=211.6 Q ss_pred Cc----cchhhh--HHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcC-Cceeeeec Q lcl|NC_021307. 1 MA----AGTAFP--VNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTG-DVSAAWIG 73 (310) Q Consensus 1 ~a----a~~~~~--~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~ 73 (310) .+ +..... .........++..++..+|+++...|++ +++.++++++|+.+|++++...+|+... ...++|++ T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 191 (397) T protein:vir:96 113 LAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PKDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQ 191 (397) T ss_pred HHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHH-hhhhhhHHHhhhhccccccceeEEEEeccCCcccccc Confidence 00 000000 0111112233445556778888888887 5778889999999999988888888754 45679999 Q ss_pred ccccccc-cccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccc Q lcl|NC_021307. 74 EGDMKPI-TKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSV 152 (310) Q Consensus 74 Eg~~~~~-~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~ 152 (310) |++..|+ ++++|+++++.++++++++++|+|+++|+.++++++|.++|+++++.++|.+|++|+|.+.+.+ T Consensus 192 E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~-------- 263 (397) T protein:vir:96 192 QLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTATAKS-------- 263 (397) T ss_pred ccccccccccccccceeecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-------- Confidence 9999996 6799999999999999999999999999999999999999999999999999999998765432 Q ss_pred cceecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCC Q lcl|NC_021307. 153 DLTPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDH 232 (310) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~ 232 (310) ..+++++. +++......+ .+++|+||+++|..|++++|++|+|+|+++...+. +++|+|+||+++++ T Consensus 264 ------~~~~d~~~-~~~~~~~~~~-~~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~~~~~-----~~~l~G~pv~~~~~ 330 (397) T protein:vir:96 264 ------VVGVDGLK-DLINKEIKKV-YDVKLFISASMYSELDKLKDKNGRYLLQDSITAAS-----GKQLLGKEVVVLDD 330 (397) T ss_pred ------ccchHHHH-HHHHHhhhhh-cCcEEEEcHHHHHHHHHhhccCCCeEeccCccCCC-----cccccccceEEecc Confidence 12344443 4554433333 47899999999999999999999999988765543 46899999998775 Q ss_pred CC----CCceeEeeeccee-eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEe Q lcl|NC_021307. 233 VA----SGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLT 307 (310) Q Consensus 233 ~~----~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~ 307 (310) +. .++..++||||++ +.+++++++++..+++.. ..+.+|++.|+|+++.+|+||++|+ T Consensus 331 ~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~r~d~~~~~~~a~~~~~ 393 (397) T protein:vir:96 331 DVIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDNNI-----------------YGQLLAGIIRYDVKATDKKAGFYVT 393 (397) T ss_pred cccCCCCCceEEEEeehhcceEeEeecceEEEEecccc-----------------cceeEEEEEEEccEEecccceEEEE Confidence 43 3456788999997 568999999998875432 3467899999999999999999999 Q ss_pred ecC Q lcl|NC_021307. 308 NAA 310 (310) Q Consensus 308 ~aa 310 (310) .++ T Consensus 394 ~~~ 396 (397) T protein:vir:96 394 FTI 396 (397) T ss_pred eec Confidence 877 No 99 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=3.8e-40 Score=236.72 Aligned_cols=287 Identities=13% Similarity=0.048 Sum_probs=221.8 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceee-cCCCceEEEEEcCCc----eeeeeccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIP-MGSTGVKIPHWTGDV----SAAWIGEG 75 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~-~~~~~~~ip~~~~~~----~a~~v~Eg 75 (310) |--- +-.-+..+. .+.++.+||++.|+..+++++.+++.+++++++++++ +++....||....+. ...|.+|. T Consensus 1 ~~~~-~~~~~~~k~-it~~d~~gG~L~P~~~~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~ 78 (314) T protein:vir:41 1 MDFL-NKPFQITPK-IDVPDLGKGILAVQRFGEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTK 78 (314) T ss_pred Cchh-hhHHHhhcc-cccccCCCceeChHHHHHHHHHHHhccchhhheeeecccCccceeecccccCcccccccccccCC Confidence 2222 222223333 3445566778888878899999999999999999986 577778898865432 34677888 Q ss_pred ccccccccceeeeEeeeeeeEeeehhhHHHhhcChh--HHHHHHHHHHHHHHHHHHHHHHHcccCccc--------cccc Q lcl|NC_021307. 76 DMKPITKGDMSVQQVEPHKIATIFVASAETVRANPG--NYLGTMRTKVATAIALAFDEAALHGTDSPF--------DKNL 145 (310) Q Consensus 76 ~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~--~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~--------~~~~ 145 (310) .+.++++++|+++++.+||+...+.||+|+|+|+.. +|+++|.+.|++++++.++..+++|+|+.. +.+. T Consensus 79 ~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~ 158 (314) T protein:vir:41 79 VAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGW 158 (314) T ss_pred ccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhh Confidence 888999999999999999999999999999999965 999999999999999999999999999642 2232 Q ss_pred cccccc--ccceecccchHHHHHHHHHHHhhhhcCC---CCEEEEehHHHHHHHHhhhccCccccccccccccccccCCc Q lcl|NC_021307. 146 DETTKS--VDLTPATGTTYDAIGVNALSLLVNAGKK---WGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREG 220 (310) Q Consensus 146 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~l~~~~~~---~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~ 220 (310) ...... ...........++.+.+++..+...|++ +.+|+||+.++.+++++++..+++++++....+. +. T Consensus 159 l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~-----~~ 233 (314) T protein:vir:41 159 MKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIGAT-----GL 233 (314) T ss_pred hhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhhCCC-----Cc Confidence 221111 1111223345566677888889888764 5689999999999999999999999988765543 55 Q ss_pred eeeeeeEEEeCCCCC---CceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEE Q lcl|NC_021307. 221 RILGRPTILSDHVAS---GTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLI 297 (310) Q Consensus 221 ~l~G~pv~~t~~~~~---~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v 297 (310) +++|+||+..+.+|. ++..++||||++++++.+..+.++..++ ..++++.|.+..|+|+.+ T Consensus 234 ~l~G~PV~~~~~~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~~~~~----------------a~~~~~~~~~~~r~d~~~ 297 (314) T protein:vir:41 234 QYDGIPIQYVPALDALGDDKARALLTVPTNLVYGFWRNIRIEPKRD----------------AAMRRTEYIASLRADCNY 297 (314) T ss_pred eecceeeEecccccccCCCCceEEEechhheEEEeeceeEEeeccc----------------CcCCeEEEEEEEEeceEE Confidence 799999999998863 6788999999999999988887776543 357899999999999999 Q ss_pred eccCc-eEEEeecC Q lcl|NC_021307. 298 NDVEA-FVKLTNAA 310 (310) Q Consensus 298 ~~~~a-~~~l~~aa 310 (310) ...+| ++.+.++| T Consensus 298 ~~~~aa~~~~~~~~ 311 (314) T protein:vir:41 298 EDENAAVAAVIDMS 311 (314) T ss_pred EEcCcEEEEEeecc Confidence 87644 44455555 No 100 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=3.2e-40 Score=237.14 Aligned_cols=284 Identities=13% Similarity=0.083 Sum_probs=217.6 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceee-cCCCceEEEEEcCC----ceeeeeccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIP-MGSTGVKIPHWTGD----VSAAWIGEG 75 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~-~~~~~~~ip~~~~~----~~a~~v~Eg 75 (310) |.-+.-+... + +.+.++.+||+++|+..+++++.+.+.|+++++|++++ +.+....++....+ ....|.+|+ T Consensus 7 ~~~~~~~~~~--k-~~t~~d~~Gg~l~P~~~~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~~~~~ 83 (315) T protein:vir:41 7 IRGGKPFEIV--P-KIDVPDLGRGVLSVDRFGEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDETGQK 83 (315) T ss_pred hhcCChhhhh--h-hcCCcCCCCceechHHHHHHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccccccCc Confidence 5544444332 2 33455667888999999999999999999999999864 55555556553221 234688899 Q ss_pred ccccccccceeeeEeeeeeeEeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHHcccCcccc------ccccc Q lcl|NC_021307. 76 DMKPITKGDMSVQQVEPHKIATIFVASAETVRANP--GNYLGTMRTKVATAIALAFDEAALHGTDSPFD------KNLDE 147 (310) Q Consensus 76 ~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~--~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~------~~~~~ 147 (310) .+.++++++|+++++.++++.+.+.||+|+|+|+. ++++++|.+++++++++.++.+|++|+|+... .+.+. T Consensus 84 ~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~ 163 (315) T protein:vir:41 84 LAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLK 163 (315) T ss_pred CCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCcccccccccee Confidence 99999999999999999999999999999999986 49999999999999999999999999986432 23322 Q ss_pred cccc-cc---ceecccchHHHHHHHHHHHhhhhcC---CCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCc Q lcl|NC_021307. 148 TTKS-VD---LTPATGTTYDAIGVNALSLLVNAGK---KWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREG 220 (310) Q Consensus 148 ~~~~-~~---~~~~~~~~~~~~~~~~~~~l~~~~~---~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~ 220 (310) .... +. ..........+.+.++...+...|+ .+++|+||+.++.++++++|++|+++|++....+. +. T Consensus 164 ~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~g~-----~~ 238 (315) T protein:vir:41 164 LASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLGDQALTGAN-----SI 238 (315) T ss_pred cccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccccchhhcCC-----Cc Confidence 1111 11 1111112223555678888888776 46689999999999999999999999998776654 45 Q ss_pred eeeeeeEEEeCCCCC---CceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEE Q lcl|NC_021307. 221 RILGRPTILSDHVAS---GTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLI 297 (310) Q Consensus 221 ~l~G~pv~~t~~~~~---~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v 297 (310) +|+|+||+..++||+ ++..++||||++++++.+.++.++..+++ .++.+.|.+..|+|+.+ T Consensus 239 tl~G~PV~~~~~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a----------------~~~~~~~~~~~r~d~~~ 302 (315) T protein:vir:41 239 LYDGRPVQYVPALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYDA----------------EMRLTKYVASLRTDNHY 302 (315) T ss_pred eecccceEecccccccCCCCccEEEecccceEEEeccccEEEeeecC----------------CCCceEEEEEEEeceeE Confidence 899999999999874 56778999999999999999988877553 35678899999999976 Q ss_pred eccCc--eEEEee Q lcl|NC_021307. 298 NDVEA--FVKLTN 308 (310) Q Consensus 298 ~~~~a--~~~l~~ 308 (310) ...++ ++.++. T Consensus 303 ~~~~~~a~~~~~v 315 (315) T protein:vir:41 303 EDEEGAVSATITV 315 (315) T ss_pred EeccceeEeeeeC Confidence 65544 566666 No 101 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=3.4e-36 Score=215.06 Aligned_cols=291 Identities=13% Similarity=0.096 Sum_probs=217.6 Q ss_pred Cccchhhh--HHHHH--hhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeec-cc Q lcl|NC_021307. 1 MAAGTAFP--VNHTQ--IAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIG-EG 75 (310) Q Consensus 1 ~aa~~~~~--~~~~~--~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~-Eg 75 (310) |++..--. .+..+ ...+++..+|+.|||++..++++.+.+.++++++++++++.+...++|....++.+.|++ |+ T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~~~~~~e~ 80 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERHRRPQDEG 80 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCccccccccc Confidence 55442221 12222 223344556678999999999999999999999999999999999999987777778875 33 Q ss_pred -ccccccccceeeeEeeeeeeEeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHHcccCcccccc------cc Q lcl|NC_021307. 76 -DMKPITKGDMSVQQVEPHKIATIFVASAETVRANP--GNYLGTMRTKVATAIALAFDEAALHGTDSPFDKN------LD 146 (310) Q Consensus 76 -~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~--~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~------~~ 146 (310) ...+.++++|+++++.++|+.+.+.||+|+|+|+. ++++++|.+.+++++++.++..+++|++++.+.+ .. T Consensus 81 ~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l 160 (321) T protein:vir:31 81 EWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGFI 160 (321) T ss_pred ccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhhh Confidence 35567889999999999999999999999999985 6999999999999999999999999999877642 21 Q ss_pred ccc--ccccceecccchHHHHHHHHHHHhhhhcCC--CCEEEEehHHHHHHHH-hhhccCccccccccccccccccCCce Q lcl|NC_021307. 147 ETT--KSVDLTPATGTTYDAIGVNALSLLVNAGKK--WGATLLDDVAEPILNG-AKDANGRPLFVESTYEAVTTPYREGR 221 (310) Q Consensus 147 ~~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~-l~d~~g~~~~~~~~~~~~~~~~~~~~ 221 (310) ... .........+....+.+.++...+.+.++. +.+|+||+.++.+++. +++. +.+++.+....+ .+.+ T Consensus 161 ~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~-~~~~~~~~l~~~-----~~~t 234 (321) T protein:vir:31 161 TVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDR-DTPLGDNVIMGE-----ADVN 234 (321) T ss_pred hhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcC-CCccccchhhcc-----cccc Confidence 111 111111222223334556888888887764 5689999999988776 5554 446776654433 3457 Q ss_pred eeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccC Q lcl|NC_021307. 222 ILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVE 301 (310) Q Consensus 222 l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~ 301 (310) |.|+||+.++++|++. ++++||++++++.++++.++..++.... ..+++.+......++|+.|.+.+ T Consensus 235 l~G~pvv~~~~mP~~~--il~t~~~nl~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~ve~~~ 301 (321) T protein:vir:31 235 PFSFPIIGSGLWPDDK--AMFTDPQNLIYALYRDLEIDVLTESDKV-----------SERDLHARYFMRGDDDFAIENTE 301 (321) T ss_pred ccceeEEEcCCCCCCc--EEEeccccEEEEEeeccEEEEeecCccc-----------cccceeeEeeeeeecceeEeccc Confidence 9999999999999875 6789999999999999988877654211 01234455555668999999999 Q ss_pred ceEEEeecC Q lcl|NC_021307. 302 AFVKLTNAA 310 (310) Q Consensus 302 a~~~l~~aa 310 (310) |++.+++.- T Consensus 302 a~a~~~~i~ 310 (321) T protein:vir:31 302 AVVLAEGLG 310 (321) T ss_pred cEEEEecCC Confidence 999999877 No 102 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=6.1e-35 Score=208.15 Aligned_cols=280 Identities=11% Similarity=0.054 Sum_probs=190.8 Q ss_pred Cc---c-----c----hhhhHHHHHhh--cc------------ccCCCCceechhhHHHHHHHHHhhchhhhhcceeecC Q lcl|NC_021307. 1 MA---A-----G----TAFPVNHTQIA--QT------------GDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMG 54 (310) Q Consensus 1 ~a---a-----~----~~~~~~~~~~~--~~------------~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~ 54 (310) +. . + ........... .. ....++...|+.+...+...+...+++.++++..+.+ T Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~ 279 (517) T protein:vir:97 200 LGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP 279 (517) T ss_pred cccccccccchhhHHHHHHHHHHHHHHhcccccccceeeeecccccccccccchHHHHHHHHhhhhhccceeeeeecccc Confidence 00 0 0 00000000000 00 1122344567788888989999999888887765543 Q ss_pred CCceEEEEEcCCceeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhcChhH----HHHHHHHHHHHHHHHHHH Q lcl|NC_021307. 55 STGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGN----YLGTMRTKVATAIALAFD 130 (310) Q Consensus 55 ~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~----~~~~v~~~l~~a~~~~~d 130 (310) ...+|.......+.|+.||+.+|+++++|+++++.++++++++++|+++|+|+..+ +++||.++|++++++++| T Consensus 280 --~~~~~~~~~~~~a~~~~eG~~kp~s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee 357 (517) T protein:vir:97 280 --TLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVN 357 (517) T ss_pred --ceeeecccccceeeeeecCCcccccccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHH Confidence 35677777777889999999999999999999999999999999999999988776 999999999999999999 Q ss_pred HHHHcccCccccc-ccccccccccceeccc-chHHHHHHHHHHHhhhhc--CCCCEEEEehHHHHHHHHhhhccCccccc Q lcl|NC_021307. 131 EAALHGTDSPFDK-NLDETTKSVDLTPATG-TTYDAIGVNALSLLVNAG--KKWGATLLDDVAEPILNGAKDANGRPLFV 206 (310) Q Consensus 131 ~~~l~G~g~~~~~-~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~~~~~l~~l~d~~g~~~~~ 206 (310) ++|++|+|++... ++.............. .+..+ ++..+...+ ..+++|+||+.+|.+|+++||++|||+|+ T Consensus 358 ~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~~~~~~d----~i~~l~~a~~~a~~a~~vmn~~t~~~I~klKD~~G~Yl~~ 433 (517) T protein:vir:97 358 RAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQE----LLEKLSVATPKAADSTLVIHRNDLAAIRFLKDKNGNYVFP 433 (517) T ss_pred HHHhcccCCCcccccccccccccccccccccchHHH----HHHHHHHHhhhccCCEEEECHHHHHHHHHhhcCCCCeecc Confidence 9999999987543 3333222222222121 22222 233333222 24788999999999999999999999998 Q ss_pred cccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEE Q lcl|NC_021307. 207 ESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVA 286 (310) Q Consensus 207 ~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 286 (310) +....+.. .+++|..-+. +.++.+...+ ++.+.++++.+.++.+.-+ + .+.+|+.. T Consensus 434 ~~~~~~~~-----~~l~G~~~~~-~~~~~~~~~~--~~~~~y~i~~~~g~~~~~~----------f------d~~~n~~~ 489 (517) T protein:vir:97 434 VGVSNQTI-----ATHFGFNRLV-QSVAVDEKTA--VSLSGYVTNGSRGMEFEQG----------T------ILVENNKE 489 (517) T ss_pred CcCCcccc-----cccCCccccc-cccccCceeE--eeccccEEEeecceeeeee----------e------ecccCcee Confidence 76655433 3566632222 2233344333 3456777777766653211 0 13578899 Q ss_pred EEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 287 VRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 287 ~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) |+.+.|+++.|+.+++|+..+..- T Consensus 490 f~~~~~~~g~i~~~~r~a~~~~~p 513 (517) T protein:vir:97 490 YLFEMPISGSLEYKGTTAYGTYTP 513 (517) T ss_pred EeeeeeeccccccccceEEEEEcC Confidence 999999999999999998877665 No 103 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=3e-33 Score=198.91 Aligned_cols=263 Identities=16% Similarity=0.150 Sum_probs=202.0 Q ss_pred hccccCCCCcee-chhhHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEcCCceeeeecccccccccccceeeeE Q lcl|NC_021307. 15 AQTGDSMFQGYL-EPEQAQDYFAEAEKTSIVQRVARKI----PMGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQ 89 (310) Q Consensus 15 ~~~~~~~~g~~i-~~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 89 (310) |..+++..+..+ |..+...+++.+++.+++.+++... ..++.+++||++...+++.|++||+.++.+++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 544444444455 4455676888999988888887652 23466799999988889999999999999999999999 Q ss_pred eeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHH Q lcl|NC_021307. 90 VEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNA 169 (310) Q Consensus 90 l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (310) +.+++++..+++|++++.++.+++++.+.+++++++++++|+.++....... .......+++. +.++ T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~------------~~~~~~~t~d~-i~da 147 (272) T protein:vir:98 81 MTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKST------------QTVEATATVDG-VSKA 147 (272) T ss_pred EEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------cccccccCHHH-HHHH Confidence 9999999999999999999999999999999999999999999996432211 11112233444 4577 Q ss_pred HHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeee Q lcl|NC_021307. 170 LSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIV 249 (310) Q Consensus 170 ~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~ 249 (310) ...+...+.....|+|||.++..|++.+..+.... .....+....+..++++|+||++++++|.++. ++.+...+. T Consensus 148 ~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~--~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t~--~~~~~~a~~ 223 (272) T protein:vir:98 148 LDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGA--TEVGANRVVSGVYGEVLGVQIVRSRKCPKGTA--YMVRKGALR 223 (272) T ss_pred HHHHhccCCCccEEEEcHHHHHHHHHhcccccccc--ccccccccccccchhhcCeeEEEcCCCCcceE--EEEcCCeEE Confidence 88888888888999999999999987542221100 11111222233446899999999999998874 345677888 Q ss_pred EEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 250 WGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 250 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++.+++++++.+++.. ++...++...|+++++.+|+++++++.++ T Consensus 224 ~~~~~~~~ve~~r~~~----------------~~~~~i~~~~~~~~~v~~~~~vv~~t~~~ 268 (272) T protein:vir:98 224 IMLKRNTMVETDRDIT----------------KAINQIVANKHYGVYLYKAEKAVKITLKD 268 (272) T ss_pred EEecCCceeeeccccc----------------cceeEEEEEEEEEEEEEcCCceEEEEecc Confidence 8888888888876643 45688999999999999999999999999 No 104 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=3e-33 Score=198.91 Aligned_cols=263 Identities=16% Similarity=0.150 Sum_probs=202.0 Q ss_pred hccccCCCCcee-chhhHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEcCCceeeeecccccccccccceeeeE Q lcl|NC_021307. 15 AQTGDSMFQGYL-EPEQAQDYFAEAEKTSIVQRVARKI----PMGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQ 89 (310) Q Consensus 15 ~~~~~~~~g~~i-~~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 89 (310) |..+++..+..+ |..+...+++.+++.+++.+++... ..++.+++||++...+++.|++||+.++.+++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 544444444455 4455676888999988888887652 23466799999988889999999999999999999999 Q ss_pred eeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHH Q lcl|NC_021307. 90 VEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNA 169 (310) Q Consensus 90 l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (310) +.+++++..+++|++++.++.+++++.+.+++++++++++|+.++....... .......+++. +.++ T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~------------~~~~~~~t~d~-i~da 147 (272) T protein:vir:30 81 MTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKST------------QTVEATATVDG-VSKA 147 (272) T ss_pred EEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------cccccccCHHH-HHHH Confidence 9999999999999999999999999999999999999999999996432211 11112233444 4577 Q ss_pred HHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeee Q lcl|NC_021307. 170 LSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIV 249 (310) Q Consensus 170 ~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~ 249 (310) ...+...+.....|+|||.++..|++.+..+.... .....+....+..++++|+||++++++|.++. ++.+...+. T Consensus 148 ~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~--~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t~--~~~~~~a~~ 223 (272) T protein:vir:30 148 LDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGA--TEVGANRVVSGVYGEVLGVQIVRSRKCPKGTA--YMVRKGALR 223 (272) T ss_pred HHHHhccCCCccEEEEcHHHHHHHHHhcccccccc--ccccccccccccchhhcCeeEEEcCCCCcceE--EEEcCCeEE Confidence 88888888888999999999999987542221100 11111222233446899999999999998874 345677888 Q ss_pred EEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 250 WGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 250 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++.+++++++.+++.. ++...++...|+++++.+|+++++++.++ T Consensus 224 ~~~~~~~~ve~~r~~~----------------~~~~~i~~~~~~~~~v~~~~~vv~~t~~~ 268 (272) T protein:vir:30 224 IMLKRNTMVETDRDIT----------------KAINQIVANKHYGVYLYKAEKAVKITLKD 268 (272) T ss_pred EEecCCceeeeccccc----------------cceeEEEEEEEEEEEEEcCCceEEEEecc Confidence 8888888888876643 45688999999999999999999999999 No 105 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=99.96 E-value=3.1e-33 Score=198.78 Aligned_cols=272 Identities=9% Similarity=0.000 Sum_probs=175.0 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPI 80 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~ 80 (310) +..+. +-.....+...+....++.+++++...+.......+++...++.. ..+.....|++|+...++ T Consensus 198 ~~e~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~g~~~~~~~~e~~~~~~ 265 (480) T protein:vir:40 198 MPEQG-FLREFANGADLNVVNSLGSITSKYARKSGIYDGAMKARFQGLTLA-----------EDGVDDTFISGTFKAGTD 265 (480) T ss_pred chhhh-hhhhhhhhccccccccccccccchhhheeechhhhhhhhhcceee-----------eccccceeeeeeeecccc Confidence 11110 001111122223333444555555544444444444433333321 123334567776655443 Q ss_pred c--ccceeeeEee---eeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccce Q lcl|NC_021307. 81 T--KGDMSVQQVE---PHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLT 155 (310) Q Consensus 81 ~--~~~~~~i~l~---~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~ 155 (310) . ..++.+.++. .++++.....|+++++|+. ++++||.++|++.++++++++|++|+|++... +.+...... . T Consensus 266 ~~~~~~~~~~~~~~~~v~~l~~~~k~t~~lLDDa~-~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~-~~g~~~~~~-~ 342 (480) T protein:vir:40 266 KNKSQTATKRSLRPQMAEAYLQMDKATVRGVNDSG-ALSEYVMSEMVNRVIQKVEYNMILGSVDGSNG-FYGLKTATD-G 342 (480) T ss_pred cccccccccchhhHHHHHHHHHhHHHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCccc-cccceeecc-c Confidence 2 2234455554 4678888899999999876 89999999999999999999999997665321 111111111 1 Q ss_pred ecccchHHHHHHHHHHHhhhhcCCCC-EEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeC-CC Q lcl|NC_021307. 156 PATGTTYDAIGVNALSLLVNAGKKWG-ATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSD-HV 233 (310) Q Consensus 156 ~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~-~~ 233 (310) .+...+.++.+.++...+...++.++ .|+||+.+|.+|+++||++|+|||++....+. +.+|+|+||+++. .+ T Consensus 343 ~~~~~~~~d~id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD~~G~Yi~q~~~~~~~-----~~~llG~pvv~~~~~~ 417 (480) T protein:vir:40 343 WTKQIEYTDLFEGITDAVAECSISDAITIVMSPQTFAELRKAKGTDGHSRFNELATKEQ-----IAQSFGAVNLETRVWM 417 (480) T ss_pred ccccchhHHHHHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhcCCCCeeccCcccccC-----cceecccceeeeeccc Confidence 12233445666668888888888878 59999999999999999999999998766653 5689999987764 45 Q ss_pred CCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 234 ASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 234 ~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) |.+.. .+.++..++.+++++ + +..++ ..++.++..++.+.|+++++.+|+|+..|+.++ T Consensus 418 ~~~~~-~~~~~~~~~~~~d~~-~--~~~~~--------------~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~ 476 (480) T protein:vir:40 418 PKDEV-AVYNHDEYVLIGDLN-V--ENYND--------------FDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKG 476 (480) T ss_pred cCCcc-eeeeCCccEEEEecc-c--ceecc--------------cccccchhhhhhhhhhceeeEccccEEEEEecc Confidence 65553 333444556778764 2 22221 124678889999999999999999999999999 No 106 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.92 E-value=2.8e-26 Score=160.67 Aligned_cols=264 Identities=17% Similarity=0.112 Sum_probs=199.2 Q ss_pred hccccCCCCceechh-hHHHHHHHHHhhchhhhhcceeec----CCCceEEEEEcCCceeeeecccccccccccceeeeE Q lcl|NC_021307. 15 AQTGDSMFQGYLEPE-QAQDYFAEAEKTSIVQRVARKIPM----GSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQ 89 (310) Q Consensus 15 ~~~~~~~~g~~i~~~-~~~~ii~~~~~~s~l~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 89 (310) |..+.+.-+..|.|+ +...+.+.+++...+.+++..... ++..++||++...+++.++.||+.++.++.++++.+ T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~it~~~~~ 80 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccccccccceeE Confidence 444444555555555 556678888888888888776432 355799999987788999999999999999999999 Q ss_pred eeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHH Q lcl|NC_021307. 90 VEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNA 169 (310) Q Consensus 90 l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (310) +..++.+..+.++++...++..++.+.+.++++.++++++|+.++..-.+.... + .....+++ .+.++ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~--------~---~~~~~~~d-~i~dA 148 (274) T protein:vir:93 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT--------V---NADITKLN-GLQSA 148 (274) T ss_pred EEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------c---cccccCHH-HHHHH Confidence 999999999999999999999999999999999999999999999754332211 1 11112333 44577 Q ss_pred HHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeee Q lcl|NC_021307. 170 LSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIV 249 (310) Q Consensus 170 ~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~ 249 (310) ..++.+.......++|||..+..|++. ..-+++-......+....+.-++++|+||++++++|.++. ++.....+. T Consensus 149 ~~~l~d~~~~~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~--~l~~~gai~ 224 (274) T protein:vir:93 149 IDKFNDEDLEPMVLFINPLDAGKLRGD--ASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTA--ILAKKGAVK 224 (274) T ss_pred HHHhhhccCCccEEEeCHHHHHHHHhh--hhhcccccccccccceeecccceecCeeEEEcCCCCcceE--EEEeCCeEE Confidence 778887777888999999999999753 2222221222222233344567899999999999998764 445667777 Q ss_pred EEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 250 WGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 250 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++..+++.++..|+.. +....+++..+++.++.++++++++++++ T Consensus 225 ~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~~v~~t~~~ 269 (274) T protein:vir:93 225 LILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred EEecCCcccccccchh----------------hcccEEEEEEEEEEEEEcCCceEEEeeCc Confidence 7777888888777653 33567889999999999999999999999 No 107 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.89 E-value=1.4e-24 Score=151.39 Aligned_cols=264 Identities=14% Similarity=0.130 Sum_probs=191.7 Q ss_pred hccccCCCCceechhhH-HHHHHHHHhhchhhhhcceeec----CCCceEEEEEcCCceeeeecccccccccccceeeeE Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQA-QDYFAEAEKTSIVQRVARKIPM----GSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQ 89 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~-~~ii~~~~~~s~l~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 89 (310) |..+.+.-..+|.||+. .-+.+.+.+...+.+++..-+. ++..++||.+....++.++.||..++..+.+.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~lt~~~~~ 80 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTKS 80 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhcCCccee Confidence 54444555556666655 5566788888888888776443 356799999988788899999999999999999999 Q ss_pred eeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHH Q lcl|NC_021307. 90 VEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNA 169 (310) Q Consensus 90 l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (310) +..++.+..+.++++...++..++.+.+.++++.++++++|+.++....+.. . ......++ +.+.++ T Consensus 81 ~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~--------~----~~~~~~~~-d~i~~A 147 (272) T protein:vir:36 81 VTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTS--------Q----TVSTKANV-DGVQAA 147 (272) T ss_pred EeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------c----cccccccH-HHHHHH Confidence 9999999999999999999999999999999999999999999985432211 0 11112233 344577 Q ss_pred HHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeE--eeeccee Q lcl|NC_021307. 170 LSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVG--YLGDFSQ 247 (310) Q Consensus 170 ~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~--~~gd~~~ 247 (310) ...+.+.......++|||..+..|++..+.... . .....+....+.-++++|+||++++++|.++... ++.-... T Consensus 148 ~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~--~-~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~gA 224 (272) T protein:vir:36 148 LDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNI--G-SEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPA 224 (272) T ss_pred HHHhhhcCCCceEEEEcHHHHHHHhcccccccc--c-ccccccceeeeccceecCeeEEEeCCCCCCceeEEEEEecccc Confidence 888887777888999999999999864322211 1 1111122222334689999999999999887532 1111233 Q ss_pred eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 248 IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 248 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +-++..+++++|..|+.. +....+++..+|+.++.+|+++++++.+= T Consensus 225 ~~~~~~~~~~vE~~R~~~----------------~~~d~i~~~~~y~~~v~~~~~vv~~t~~g 271 (272) T protein:vir:36 225 LKLVLKRGVQVETDRDIV----------------TKTTVITADEHYAAYLYDLTKVVNITFTG 271 (272) T ss_pred eeeeecCCcccccccchh----------------hcCcEEEEEEEEEEEEEcCccEEEEeecC Confidence 334455677777666543 23456888999999999999999999999 No 108 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.89 E-value=1.7e-24 Score=150.93 Aligned_cols=264 Identities=16% Similarity=0.101 Sum_probs=198.9 Q ss_pred hccccCCCCceechhhH-HHHHHHHHhhchhhhhcceee----cCCCceEEEEEcCCceeeeecccccccccccceeeeE Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQA-QDYFAEAEKTSIVQRVARKIP----MGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQ 89 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~-~~ii~~~~~~s~l~~~~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 89 (310) |..+.+.-+.+|.||+. .-+.+.+.+...+.+++..-+ .++..++||.+...+++.++.||+.++..+.+.++.+ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~lt~~~~~ 80 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKIETNRRE 80 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCccccccceee Confidence 44444555566766665 557778888888888887533 3577799999988788899999999999999999999 Q ss_pred eeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHH Q lcl|NC_021307. 90 VEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNA 169 (310) Q Consensus 90 l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (310) ...++.+..+.++++....+..|..+.+.++++.++++++|+.++.-..+... .........+.+.++ T Consensus 81 a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~------------~~~~~~~t~d~i~~A 148 (276) T protein:vir:10 81 AKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKL------------TVSADIGTLAGLEAA 148 (276) T ss_pred EEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------cccccccCHHHHHHH Confidence 99999999999999999999899999999999999999999998853221110 111112223445578 Q ss_pred HHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeee Q lcl|NC_021307. 170 LSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIV 249 (310) Q Consensus 170 ~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~ 249 (310) ..++.+......+++|||..+..|+++.+.. ++.......+....+.-+++.|++|++++++|.++.. ++ ....+- T Consensus 149 ~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~--f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~-l~-~~gAi~ 224 (276) T protein:vir:10 149 IDTFDDEDLEPMVLFINPKDAGKLRSSASDN--FTRATELGDNIIVKGAFGEALGAVIVRSKKLDEGEAI-LA-KRGAVK 224 (276) T ss_pred HHHhccccCcccEEEEcHHHHHHHHHhcccc--ccccccccccceeccccceecceeEEEcCCCCcceEE-EE-ecccee Confidence 8888877778889999999999998754332 2222222222333444568999999999999988653 33 345566 Q ss_pred EEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 250 WGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 250 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++..+++.+|.+|+.. +..-.+++..+|+.++.+++.++++++++ T Consensus 225 ~~~~~~~~vE~dRd~~----------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (276) T protein:vir:10 225 LITKRDFFLETDRDPS----------------TKTTALYSDKHYVAYLYDESKAVKVTKGA 269 (276) T ss_pred eeecCCceeecccchh----------------hcccEEEEeeEEEEEEEcCcceEEEecCC Confidence 6677888888887754 23566788899999999999999999888 No 109 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.88 E-value=3.6e-24 Score=149.10 Aligned_cols=264 Identities=16% Similarity=0.115 Sum_probs=194.9 Q ss_pred hccccCCCCceechhhH-HHHHHHHHhhchhhhhcceeec----CCCceEEEEEcCCceeeeecccccccccccceeeeE Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQA-QDYFAEAEKTSIVQRVARKIPM----GSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQ 89 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~-~~ii~~~~~~s~l~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 89 (310) |...++.-+.+|.|++. ..+.+.+.+...+.++++..+. ++..++||++...+++..+.||+.++..+.+.++.+ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~it~~~~~ 80 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchhhcccceeE Confidence 55445555567767665 5567777777777777765332 366799999987778888999999999999999999 Q ss_pred eeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHH Q lcl|NC_021307. 90 VEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNA 169 (310) Q Consensus 90 l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (310) +..++.+..+.++++...++..++.+.+.++++.++++++|+.+++-..+... . ..+...++ +.+.++ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~--------~---~~~~~~~~-d~i~dA 148 (274) T protein:vir:96 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL--------T---VEADITKL-DGLQTA 148 (274) T ss_pred EEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--------C---cCcccccH-HHHHHH Confidence 99999999999999999999899999999999999999999998865432211 0 11111223 444577 Q ss_pred HHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeee Q lcl|NC_021307. 170 LSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIV 249 (310) Q Consensus 170 ~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~ 249 (310) ..++.+.......++|||..+..|++... .+++-......+....+.-+++.|++|++++++|.++.. ++ ....+. T Consensus 149 ~~~l~d~~~~~~~ivv~p~~~~~L~k~~~--~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~~t~~-l~-~~gA~~ 224 (274) T protein:vir:96 149 IDKFNDEDLEPMVLFVNPLDAGGLRTSAS--DNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEAL-LA-KKGAVK 224 (274) T ss_pred HHHhcccCCCceEEEeCHHHHHHHHhccc--ccccccccccccceeecccceecCeeEEEcCCCCcceEE-EE-eCccee Confidence 77787777788899999999999987531 122212222223333445678999999999999988753 23 455666 Q ss_pred EEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 250 WGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 250 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++...++.+|..|+.. +..-.+++..+|+.++.+|+++++++.++ T Consensus 225 ~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~yg~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:96 225 LITKRDFFLEKDRDAS----------------RKSTALYSDKHYVAYLYDESKVVKITKGA 269 (274) T ss_pred eeecCCcccccccchh----------------hcccEEEEeeEEEEEEEcCccEEEEEcCc Confidence 6667777777766543 34567888899999999999999999999 No 110 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.87 E-value=9.4e-24 Score=146.82 Aligned_cols=270 Identities=13% Similarity=0.085 Sum_probs=189.9 Q ss_pred hccccCCCCceechh-hHHHHHHHHHhhchhhhhcceeec----CCCceEEEEEcCCceeeeecccccccccccceeeeE Q lcl|NC_021307. 15 AQTGDSMFQGYLEPE-QAQDYFAEAEKTSIVQRVARKIPM----GSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQ 89 (310) Q Consensus 15 ~~~~~~~~g~~i~~~-~~~~ii~~~~~~s~l~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 89 (310) |..+++.-+..+.|+ +...+.+.+++...+.+++..... ++..++||++...+++.++.||+.++..+++.++.+ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYSALETESVK 80 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCcccccccceee Confidence 444444445556555 566678888888787787765332 356799999987778899999999999999999999 Q ss_pred eeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHH Q lcl|NC_021307. 90 VEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNA 169 (310) Q Consensus 90 l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (310) +..++.+..+.++++...++..++.+.+.++++.++++++|+.++..-.+... ......+........+.+.++ T Consensus 81 ~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~------~~~~~~t~~~~~~~~~~~~da 154 (278) T protein:vir:80 81 HGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTL------EVKGAINIGLIDKIENTFTDA 154 (278) T ss_pred EeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccc------ccccccccchhhhHHHHHHHH Confidence 99999999999999999999999999999999999999999988865432111 000011111111223334455 Q ss_pred HHHhhhhcCC-CCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceee Q lcl|NC_021307. 170 LSLLVNAGKK-WGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQI 248 (310) Q Consensus 170 ~~~l~~~~~~-~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~ 248 (310) ...+...... ...++|||..+..|++.... +++-......+....+.-+++.|++|++++++|.++..+ + ....+ T Consensus 155 ~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~--~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l-~-~~gAi 230 (278) T protein:vir:80 155 PDAIEDESITTTGVLFLNYKDTAKLREEAAG--SWTKASQLGDDLLVKGAFGELLGWEIVRTKKLADGNALA-V-KAGAL 230 (278) T ss_pred HHhhcccCCCcccEEEECHHHHHHHHhhhhh--hccccccccccceeeccceeecceeEEEcCCCCcceEEE-E-eccce Confidence 5555444333 44688999999999865322 222112222233333455689999999999999876533 3 34556 Q ss_pred eEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 249 VWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 249 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) -++..+++.+|..|+.. +....+++..+|+.++.+|++++++++.| T Consensus 231 ~~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~yg~~v~~~~~~v~it~~a 276 (278) T protein:vir:80 231 KTFLKRNLLAESGRDMD----------------HKLTKFNADQHYAVALVDETKAVKVVPVA 276 (278) T ss_pred eeeecCCcccccccchh----------------hccceeeeeeEEEEEEEcCcceEEEeecc Confidence 56666777887776643 33457888899999999999999999999 No 111 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.87 E-value=1.6e-23 Score=145.53 Aligned_cols=264 Identities=17% Similarity=0.114 Sum_probs=196.1 Q ss_pred hccccCCCCceechhh-HHHHHHHHHhhchhhhhcceeec----CCCceEEEEEcCCceeeeecccccccccccceeeeE Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQ-AQDYFAEAEKTSIVQRVARKIPM----GSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQ 89 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~-~~~ii~~~~~~s~l~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 89 (310) |..+.+.-+.+|.|++ ...+.+.+++...+.+++..-+. ++..++||++...+++..+.||+.++..+.+.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 4444445555666655 55577788887777777766432 466799999987778888999999999999999999 Q ss_pred eeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHH Q lcl|NC_021307. 90 VEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNA 169 (310) Q Consensus 90 l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (310) +..++.+..+.++++....+..++.+.+.++++.++++++|+.++.--.+... . ......+++ .+.++ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~--------~---~~~~~~~~d-~i~dA 148 (274) T protein:vir:94 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL--------T---VNADITKLN-GLQSA 148 (274) T ss_pred EEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc--------c---ccccccCHH-HHHHH Confidence 99999999999999999999899999999999999999999999865332211 0 011122334 44577 Q ss_pred HHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeee Q lcl|NC_021307. 170 LSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIV 249 (310) Q Consensus 170 ~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~ 249 (310) ..++.+.......++|||..+..|++ +..-+++-......+....+.-+++.|++|++++++|.++. ++.....+. T Consensus 149 ~~~l~d~~~~~~~ivv~p~~~~~L~k--~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~--~l~~~gA~~ 224 (274) T protein:vir:94 149 IDKFNDEDLEPMVLFVNPLDAGKLRG--DASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTA--ILAKKGAVK 224 (274) T ss_pred HHHhhccCCCceEEEeCHHHHHHHHh--hhhhhccccCcccccceeccccceecCeeEEEcCCCCcceE--EEEeCcceE Confidence 77888777788899999999999975 22222222222222233344457899999999999998764 333456676 Q ss_pred EEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 250 WGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 250 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++..+++.+|..|+.. +..-.+++..+|+.++.++++++++++++ T Consensus 225 ~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:94 225 LILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred eeecCCceeccccchh----------------hcccEEEEEEEEEEEEEcCCceEEEecCc Confidence 7777888888877754 23456788889999999999999999988 No 112 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.87 E-value=1.6e-23 Score=145.53 Aligned_cols=264 Identities=17% Similarity=0.114 Sum_probs=196.1 Q ss_pred hccccCCCCceechhh-HHHHHHHHHhhchhhhhcceeec----CCCceEEEEEcCCceeeeecccccccccccceeeeE Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQ-AQDYFAEAEKTSIVQRVARKIPM----GSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQ 89 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~-~~~ii~~~~~~s~l~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 89 (310) |..+.+.-+.+|.|++ ...+.+.+++...+.+++..-+. ++..++||++...+++..+.||+.++..+.+.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 4444445555666655 55577788887777777766432 466799999987778888999999999999999999 Q ss_pred eeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHH Q lcl|NC_021307. 90 VEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNA 169 (310) Q Consensus 90 l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (310) +..++.+..+.++++....+..++.+.+.++++.++++++|+.++.--.+... . ......+++ .+.++ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~--------~---~~~~~~~~d-~i~dA 148 (274) T protein:vir:97 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL--------T---VNADITKLN-GLQSA 148 (274) T ss_pred EEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc--------c---ccccccCHH-HHHHH Confidence 99999999999999999999899999999999999999999999865332211 0 011122334 44577 Q ss_pred HHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeee Q lcl|NC_021307. 170 LSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIV 249 (310) Q Consensus 170 ~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~ 249 (310) ..++.+.......++|||..+..|++ +..-+++-......+....+.-+++.|++|++++++|.++. ++.....+. T Consensus 149 ~~~l~d~~~~~~~ivv~p~~~~~L~k--~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~--~l~~~gA~~ 224 (274) T protein:vir:97 149 IDKFNDEDLEPMVLFVNPLDAGKLRG--DASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTA--ILAKKGAVK 224 (274) T ss_pred HHHhhccCCCceEEEeCHHHHHHHHh--hhhhhccccCcccccceeccccceecCeeEEEcCCCCcceE--EEEeCcceE Confidence 77888777788899999999999975 22222222222222233344457899999999999998764 333456676 Q ss_pred EEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 250 WGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 250 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++..+++.+|..|+.. +..-.+++..+|+.++.++++++++++++ T Consensus 225 ~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:97 225 LILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred eeecCCceeccccchh----------------hcccEEEEEEEEEEEEEcCCceEEEecCc Confidence 7777888888877754 23456788889999999999999999988 No 113 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.87 E-value=1.5e-23 Score=145.77 Aligned_cols=265 Identities=18% Similarity=0.129 Sum_probs=195.4 Q ss_pred hhccccCCCCceechhh-HHHHHHHHHhhchhhhhcceeec----CCCceEEEEEcCCceeeeecccccccccccceeee Q lcl|NC_021307. 14 IAQTGDSMFQGYLEPEQ-AQDYFAEAEKTSIVQRVARKIPM----GSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQ 88 (310) Q Consensus 14 ~~~~~~~~~g~~i~~~~-~~~ii~~~~~~s~l~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i 88 (310) -++.+.+.-..+|.||+ ..-+.+.+++...+.+++..-+. ++..++||++...+++.++.||+.++..+.+.++. T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~ 80 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETKKR 80 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhccccee Confidence 12222233444666665 45577788888888888876443 46679999998878889999999999999999999 Q ss_pred EeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHH Q lcl|NC_021307. 89 QVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVN 168 (310) Q Consensus 89 ~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (310) ++..++.+..+.++++....+..++.+.+.++++.++++++|+.++.--+++... ......++ +.+.+ T Consensus 81 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~-----------~~~~~~~~-d~i~d 148 (275) T protein:vir:96 81 QATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLK-----------VEADITKL-AGLQT 148 (275) T ss_pred eEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc-----------ccccccCH-HHHHH Confidence 9999999999999999999888899999999999999999999998644332110 11112234 44457 Q ss_pred HHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceee Q lcl|NC_021307. 169 ALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQI 248 (310) Q Consensus 169 ~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~ 248 (310) +..++.+.......++|||..+..|++.... +++-......+....+.-+++.|++|++++++|.++.. +++ ...+ T Consensus 149 A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~--~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~-i~~-~gA~ 224 (275) T protein:vir:96 149 AIDKFNDEDLEPMVLFVNPLDAGKLRASATD--NFTRATLLGDNVIVKGAFGEALGAIIVRSNKIKEGEAI-LAK-RGAV 224 (275) T ss_pred HHHHhccccCCccEEEeCHHHHHHHHhcccc--cccccccccccceeccccceecCeeEEEeCCCCcceEE-EEe-ccce Confidence 7888877777888999999999999875321 22212222222233344578999999999999988753 343 4456 Q ss_pred eEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 249 VWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 249 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .++...++.+|..|+.. +..-.+++..+|+.++.+++++++++..+ T Consensus 225 ~~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 270 (275) T protein:vir:96 225 KLITKRDFFLETERHAS----------------HKSTALFSDKHYVAYLYDESKVVKITKSA 270 (275) T ss_pred eeeecCCcccccccchh----------------hcCcEEEEeEEEEEEEEcCccEEEEEecc Confidence 56667778888777653 33567888899999999999999999998 No 114 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.84 E-value=3.8e-22 Score=138.01 Aligned_cols=264 Identities=17% Similarity=0.121 Sum_probs=194.1 Q ss_pred hccccCCCCceechhh-HHHHHHHHHhhchhhhhcceee----cCCCceEEEEEcCCceeeeecccccccccccceeeeE Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQ-AQDYFAEAEKTSIVQRVARKIP----MGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQ 89 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~-~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 89 (310) |..+.+.-..+|.|++ ...+.+.+.+...+.+++..-+ .++..++||.+...+++..+.||+.++..+.+.++.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 4444445555666665 4557777777777777765433 2467899999987778888999999999999999999 Q ss_pred eeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHH Q lcl|NC_021307. 90 VEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNA 169 (310) Q Consensus 90 l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (310) +..++.+..+.++++....+..++.+.+.++++.++++++|+.++.--.++... + .....+++. +.++ T Consensus 81 ~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~--------~---~~~~~~~d~-i~~A 148 (274) T protein:vir:95 81 AKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT--------V---EADITKLTG-LQTA 148 (274) T ss_pred EEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------c---cccccCHHH-HHHH Confidence 999999999999999999988899999999999999999999988644332211 0 111223443 4567 Q ss_pred HHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeee Q lcl|NC_021307. 170 LSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIV 249 (310) Q Consensus 170 ~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~ 249 (310) ...+.+........+|||..+..|++. ..-+++-......+....+.-+++.|++|++++++|.++.. +++ ...+. T Consensus 149 ~~~lgd~~~~~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~-l~~-~gA~~ 224 (274) T protein:vir:95 149 IDKFNDEDLEPMVLFISPLDAGKLRGD--ATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAGTAI-LAK-KGAVK 224 (274) T ss_pred HHHhccccccccEEEeCHHHHHHHHhh--ccccccccccccccceeccccceecCeEEEEeCCCCCceEE-EEe-cccee Confidence 777877667788999999999999753 21222222222223333445678999999999999987653 444 34555 Q ss_pred EEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 250 WGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 250 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++..+++.+|..|+.. +....++...++++++.+|++++++++.+ T Consensus 225 ~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~~v~~tk~~ 269 (274) T protein:vir:95 225 LITKRDFFLETDRDPS----------------TKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred eeecCCcccccccccc----------------cccCEEEEeEEEEEEEEcCCcEEEEEcCC Confidence 6667788888877654 34567888899999999999999999888 No 115 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.84 E-value=3.8e-22 Score=138.01 Aligned_cols=264 Identities=17% Similarity=0.121 Sum_probs=194.1 Q ss_pred hccccCCCCceechhh-HHHHHHHHHhhchhhhhcceee----cCCCceEEEEEcCCceeeeecccccccccccceeeeE Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQ-AQDYFAEAEKTSIVQRVARKIP----MGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQ 89 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~-~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 89 (310) |..+.+.-..+|.|++ ...+.+.+.+...+.+++..-+ .++..++||.+...+++..+.||+.++..+.+.++.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 4444445555666665 4557777777777777765433 2467899999987778888999999999999999999 Q ss_pred eeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHH Q lcl|NC_021307. 90 VEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNA 169 (310) Q Consensus 90 l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (310) +..++.+..+.++++....+..++.+.+.++++.++++++|+.++.--.++... + .....+++. +.++ T Consensus 81 ~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~--------~---~~~~~~~d~-i~~A 148 (274) T protein:vir:96 81 AKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT--------V---EADITKLTG-LQTA 148 (274) T ss_pred EEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------c---cccccCHHH-HHHH Confidence 999999999999999999988899999999999999999999988644332211 0 111223443 4567 Q ss_pred HHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeee Q lcl|NC_021307. 170 LSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIV 249 (310) Q Consensus 170 ~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~ 249 (310) ...+.+........+|||..+..|++. ..-+++-......+....+.-+++.|++|++++++|.++.. +++ ...+. T Consensus 149 ~~~lgd~~~~~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~-l~~-~gA~~ 224 (274) T protein:vir:96 149 IDKFNDEDLEPMVLFISPLDAGKLRGD--ATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAGTAI-LAK-KGAVK 224 (274) T ss_pred HHHhccccccccEEEeCHHHHHHHHhh--ccccccccccccccceeccccceecCeEEEEeCCCCCceEE-EEe-cccee Confidence 777877667788999999999999753 21222222222223333445678999999999999987653 444 34555 Q ss_pred EEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 250 WGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 250 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++..+++.+|..|+.. +....++...++++++.+|++++++++.+ T Consensus 225 ~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~~v~~tk~~ 269 (274) T protein:vir:96 225 LITKRDFFLETDRDPS----------------TKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred eeecCCcccccccccc----------------cccCEEEEeEEEEEEEEcCCcEEEEEcCC Confidence 6667788888877654 34567888899999999999999999888 No 116 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.84 E-value=3.8e-22 Score=138.00 Aligned_cols=264 Identities=16% Similarity=0.099 Sum_probs=193.1 Q ss_pred hccccCCCCceechhh-HHHHHHHHHhhchhhhhcceee----cCCCceEEEEEcCCceeeeecccccccccccceeeeE Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQ-AQDYFAEAEKTSIVQRVARKIP----MGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQ 89 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~-~~~ii~~~~~~s~l~~~~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 89 (310) |..+.+.-+.+|.|++ ...+.+.+.+...+.+++..-. .++..++||.+...+++..+.||+.++..+.+.++.+ T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccchhhcccceee Confidence 4444444455666665 4556777777777777766532 2467899999987778888999999999999999999 Q ss_pred eeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHH Q lcl|NC_021307. 90 VEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNA 169 (310) Q Consensus 90 l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (310) +..++.+..+.++++....+..++.+.+.++++.++++++|+.++.--.++.. . ......++ +.+.++ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~--------~---~~~~a~~~-d~i~dA 148 (274) T protein:vir:12 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL--------T---VNADITKL-NGLQSA 148 (274) T ss_pred EEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------c---ccccccCH-HHHHHH Confidence 99999999999999999988889999999999999999999999865433211 0 01112233 444577 Q ss_pred HHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeee Q lcl|NC_021307. 170 LSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIV 249 (310) Q Consensus 170 ~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~ 249 (310) ..++.+.......++|||..+..|++.. .-+++-......+....+.-+++.|++|++++.+|.++.. +++ ...+. T Consensus 149 ~~~lgd~~~~~~~ivv~p~~~~~L~k~~--~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~-l~~-~gA~~ 224 (274) T protein:vir:12 149 IDKFNDEDLEPMVLFINPLDAGKLRGDA--STNFTRATELGDDIIVKGAFGEALGAIIVRSNKLEAGTAI-LAK-KGAVK 224 (274) T ss_pred HHHhccccccccEEEeCHHHHHHHHhhh--hhhccccccccccceecccceeecCeeEEEeCCCCcceEE-EEe-cccee Confidence 7778777777889999999999987632 1122211112222333445567999999999999987653 444 34555 Q ss_pred EEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 250 WGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 250 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++..+++.+|..|+.. +..-.++...+|+.++.+++.++++++++ T Consensus 225 ~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:12 225 LILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred eeecCCceeccccchh----------------hcccEEEeeeEEEEEEEcCCceEEEEcCC Confidence 6666788888887764 23457888899999999999999999888 No 117 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.83 E-value=1.8e-22 Score=139.82 Aligned_cols=297 Identities=13% Similarity=0.049 Sum_probs=207.6 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGST-GVKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) |.+|+...++...+...+++.+.-+||..++.-+.|..++-....++...+....+ +..+|. .+.-.++-++||++.| T Consensus 58 mm~G~~p~~eV~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~-~g~~Ra~~IgEGgE~~ 136 (393) T protein:vir:79 58 MMEGETPTNEVNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPS-IGIMRAYDVAEGQEIP 136 (393) T ss_pred HhcCCCchhheehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccc-hheeeecccccccccc Confidence 88899998886666666666666667777778888888888888888888887544 444443 3456678899999999 Q ss_pred cccc---ceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccc--ccccccc Q lcl|NC_021307. 80 ITKG---DMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDE--TTKSVDL 154 (310) Q Consensus 80 ~~~~---~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~--~~~~~~~ 154 (310) +.+. ++++++++.+|.|..+.+|+|+++||..++.+++...+.++++++.|...+++..+.+..-... +...... T Consensus 137 ~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahp 216 (393) T protein:vir:79 137 EDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHT 216 (393) T ss_pred ccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCcccee Confidence 8765 4789999999999999999999999999999999999999999999999999987765422111 1111111 Q ss_pred ------eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhh-------hccCccc---cccccccccccccC Q lcl|NC_021307. 155 ------TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAK-------DANGRPL---FVESTYEAVTTPYR 218 (310) Q Consensus 155 ------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~-------d~~g~~~---~~~~~~~~~~~~~~ 218 (310) ..-.++...+++.++..++.+..+.+.+++|||-.|+.+.+-. ++.|++- +..... ..+.... T Consensus 217 tGr~~~~~qNGTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~a-lgp~~i~ 295 (393) T protein:vir:79 217 TGLDKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSMA-LGPDSIQ 295 (393) T ss_pred ecCCccccccccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeeccccccCccccchhhh-hchhhhc Confidence 1224555666667888888889999999999999999987532 1222111 111110 1111222 Q ss_pred CceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|NC_021307. 219 EGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIN 298 (310) Q Consensus 219 ~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~ 298 (310) +...+.+.|++++.+|-.+..-- | +++..++....+...++.- .. ..+..-.+|...|+...|+|+.|. T Consensus 296 ~~~~~nlnv~~sPfvp~d~k~~r---F-d~~~Vd~NnvgvlLV~D~i-~t------dq~ddk~rdiq~iKl~ERYG~gvL 364 (393) T protein:vir:79 296 GRLPFNFNVNLSPFIPLDKKSRR---F-DVYAVDRNNVGVLLVRDDL-KT------DQWDEKARGLQNIKMIERYGIGIL 364 (393) T ss_pred cccccceeEEEecccccccccce---e-eEEEeecCCceEEEEecCc-ce------eccccccccceeeeeeeeeceeee Confidence 23344578999999997654221 2 5566666666665555422 11 122233578999999999999888 Q ss_pred cc-CceEEEeecC Q lcl|NC_021307. 299 DV-EAFVKLTNAA 310 (310) Q Consensus 299 ~~-~a~~~l~~aa 310 (310) +. +|+++.++.. T Consensus 365 n~gkaiavakNI~ 377 (393) T protein:vir:79 365 NEGKAIAVAKNIS 377 (393) T ss_pred eCCceEEEEecce Confidence 76 6777777766 No 118 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.82 E-value=2.7e-21 Score=133.33 Aligned_cols=291 Identities=10% Similarity=0.010 Sum_probs=200.2 Q ss_pred Cccc-------------hhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCc Q lcl|NC_021307. 1 MAAG-------------TAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDV 67 (310) Q Consensus 1 ~aa~-------------~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~ 67 (310) |-+- -+||.-... ..+-..++.+-+.+....+|+.+.+.+.+++..++.++.++.+.+++...-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~p~l~m~--alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~~lp 78 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTHQFPELKMP--TVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNRENVLG 78 (330) T ss_pred CceecCCccccceeehhccccccchh--hhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceeeeeecCC Confidence 1110 112211111 1112223344566778899999999999999999988888889999999999 Q ss_pred eeeeecccccccccc-cceeeeEeeeeeeEeeehhhHHHhh--cChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccc- Q lcl|NC_021307. 68 SAAWIGEGDMKPITK-GDMSVQQVEPHKIATIFVASAETVR--ANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDK- 143 (310) Q Consensus 68 ~a~~v~Eg~~~~~~~-~~~~~i~l~~~k~~~~~~is~ell~--~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~- 143 (310) .+.|...++..+++. .+|.+++...+.+++.+.|.+++.+ .+..+...+-.+...++++.+.+.+||+|+.++... T Consensus 79 ~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~~~F~ 158 (330) T protein:vir:94 79 DVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTGNSFQ 158 (330) T ss_pred cceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcccc Confidence 999999999988765 5899999999999999999999965 455688889999999999999999999998776544 Q ss_pred cccccccccccee---cccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCc Q lcl|NC_021307. 144 NLDETTKSVDLTP---ATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREG 220 (310) Q Consensus 144 ~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~ 220 (310) +............ .++....+++..++..+......+..|+||+++..+++.+....|++...+.... .....-. T Consensus 159 GL~~~~~~~q~i~tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~--~~G~~v~ 236 (330) T protein:vir:94 159 GMMGLVAASQTISAGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTL--PSGRQIP 236 (330) T ss_pred chhhcCCcccEEecCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccc--cCCCEEe Confidence 3332222222221 1233333444456666665566789999999999999999887776554332211 1111224 Q ss_pred eeeeeeEEEeCCCCCC--------ceeEeeecce-----eeeEEeec----ccEEEEeecceeeecccccccchhhhhcC Q lcl|NC_021307. 221 RILGRPTILSDHVASG--------TTVGYLGDFS-----QIVWGQVG----GLSFDVSDQATLNLGTPQAPNFVSLWQHN 283 (310) Q Consensus 221 ~l~G~pv~~t~~~~~~--------~~~~~~gd~~-----~~~~~~~~----~~~v~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (310) .+.|+|++.++.+|.+ ++.|+...|. +.+.|..+ |+.++-. + ..-.++ T Consensus 237 ~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~---G------------~~~~k~ 301 (330) T protein:vir:94 237 TYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNV---G------------AKENAD 301 (330) T ss_pred eeCCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeC---C------------Cccccc Confidence 5789999999988764 3444443332 23344321 2222110 0 112456 Q ss_pred cEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 284 LVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 284 ~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ...+++++|++.++.+++|+++|++-. T Consensus 302 v~~~~v~~y~~~av~~~~a~~~L~~V~ 328 (330) T protein:vir:94 302 ETITRVKMYCGFANFSQLGLAAIKGLI 328 (330) T ss_pred eeeEEEEEeeeeEEechhheeeecccc Confidence 788999999999999999999999999 No 119 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.79 E-value=2.1e-20 Score=128.52 Aligned_cols=259 Identities=11% Similarity=0.033 Sum_probs=188.0 Q ss_pred ccccCCCCceechhhHHH-HHHHHHhhchhhhhcceeec----CCCceEEEEEcCCceeeeecccccccccccceeeeEe Q lcl|NC_021307. 16 QTGDSMFQGYLEPEQAQD-YFAEAEKTSIVQRVARKIPM----GSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQV 90 (310) Q Consensus 16 ~~~~~~~g~~i~~~~~~~-ii~~~~~~s~l~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l 90 (310) +.. +.-..+|.|++..+ +.+.+.+.+.+.+++..-+. ++..+++|.+...+++.-+.||+.++..+.+.++... T Consensus 1 Ma~-T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt~~~~~a 79 (270) T protein:vir:95 1 MTQ-TKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMSMTTTKV 79 (270) T ss_pred CCc-eehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhcccchhee Confidence 222 23334666766655 66677777778888776333 4677999999988888889999999999999999999 Q ss_pred eeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHHH Q lcl|NC_021307. 91 EPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNAL 170 (310) Q Consensus 91 ~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 170 (310) +.++.+..+.++++....+..+....+.++++..+++++|+.++.-.... ........+.+. +.+.. T Consensus 80 ~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a------------~~~~~~~~t~~~-~~dA~ 146 (270) T protein:vir:95 80 TVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKS------------KQTATVSADATG-ILDAI 146 (270) T ss_pred eeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccc------------ccccccccCHHH-HHHHH Confidence 99999999999999998877788999999999999999999887432211 111111223333 45777 Q ss_pred HHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeE Q lcl|NC_021307. 171 SLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVW 250 (310) Q Consensus 171 ~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~ 250 (310) .++.+......+++|||.++..|++...-. ......+....+.-+++.|++|+++++++......++ ....+-+ T Consensus 147 ~~lgd~~~~~~~i~vhs~~~~~Lrk~~~~~-----~~~~~~~~~~~G~ig~~~G~~Viv~s~~~~~~~~~l~-~~gAi~~ 220 (270) T protein:vir:95 147 EVFNSENDEDYVLYVNPKDYNKLVKSLFKV-----GGNVQDRAISKGDLVEIVGVSDIVKSKRVSENTAFLQ-RYGAMEI 220 (270) T ss_pred HHhccccCCCcEEEEcHHHHHHHHhhhccc-----ccccccchhcccccceecceeEEEeCCCCCceeEEEE-eccceee Confidence 888888788889999999999998643111 1111222233345678999999998877654444333 3455666 Q ss_pred EeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 251 GQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 251 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +..+++.+|..|+.. +....+.+..+|+.++.+++.+++++.+- T Consensus 221 ~~~~~~~vEtdRd~~----------------~~~d~i~~~~~y~v~~~~~skvv~~t~~~ 264 (270) T protein:vir:95 221 VNKKKPEAYTDFDIL----------------KRTHLLSTNYHYSVNLKDETGVVKVTFKP 264 (270) T ss_pred eecCCceeeeccchh----------------hcccEEEeeeEEEEEEEccceEEEEEecC Confidence 777788888887754 23456778899999999999999998665 No 120 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.73 E-value=2.5e-19 Score=122.60 Aligned_cols=228 Identities=13% Similarity=0.142 Sum_probs=172.3 Q ss_pred cceeecCCCceEEEEEcCCceeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHH Q lcl|NC_021307. 48 ARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIAL 127 (310) Q Consensus 48 ~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~ 127 (310) -+-+++ +.++++|.+ .+++.-+.||..++..+.++++.+.+.++.+..++|+++....+..+......++++.++++ T Consensus 1 ~~~~~~-Gdtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENGINL-ANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred CccccC-CceEEeccc--ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 233443 667999976 55778899999999999999999999999999999999999988889999999999999999 Q ss_pred HHHHHHHcccCcccccccccccccccceecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCcccccc Q lcl|NC_021307. 128 AFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVE 207 (310) Q Consensus 128 ~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~ 207 (310) ++|+.++.-..+.. . . .....++ +.+.++.+.+.+......+.+|||..+..||+..+..- . .. T Consensus 78 kvD~di~~~~~~a~--------l--~--~~~~~t~-d~i~~A~~~fgde~~~~~vivv~p~~~~~Lrk~~~~~~--~-~~ 141 (231) T protein:vir:73 78 KVDDDLLKAAKTTS--------Q--T--VSTKANV-DGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKN--I-GS 141 (231) T ss_pred hhhHHHHHhhcccc--------c--c--ccccccH-HHHHHHHHHhccccccceEEEEcchHHHhhhhccchhh--h-hh Confidence 99999885322211 0 0 1112233 44557888888877788899999999999988544321 1 11 Q ss_pred ccccccccccCCceeeeeeEEEeCCCCCCceeE--eeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcE Q lcl|NC_021307. 208 STYEAVTTPYREGRILGRPTILSDHVASGTTVG--YLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLV 285 (310) Q Consensus 208 ~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~--~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (310) ....+....+.-+++.|+||++|+++|.++... ++.-...+.+...+++.+|..|+.. .... T Consensus 142 ~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~----------------~k~~ 205 (231) T protein:vir:73 142 EVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIV----------------TKTT 205 (231) T ss_pred hhccceeeecccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeecccccc----------------cccc Confidence 122334445556789999999999999876432 2222345556677788888877754 3456 Q ss_pred EEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 286 AVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 286 ~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .+++..+|+.++.+++.+++++.|- T Consensus 206 ~i~~~~~y~v~l~~~~~vv~~t~~g 230 (231) T protein:vir:73 206 VITADEHYAAYLYDLTKVVNITFTG 230 (231) T ss_pred EEEEeEEEEEEEEcCccEEEEEeec Confidence 7888999999999999999999999 No 121 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.68 E-value=1.8e-17 Score=112.35 Aligned_cols=283 Identities=10% Similarity=0.008 Sum_probs=182.6 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeecc-----c Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGE-----G 75 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E-----g 75 (310) |.|=. ...+ +-+-+......+||.+.+.|.+++..++.++.++.+.+.+...-+.+.+.+. . T Consensus 1 mpalt-----Laea--------~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~ 67 (310) T protein:vir:97 1 MASVT-----LAES--------AKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSG 67 (310) T ss_pred Ccccc-----hHHH--------hhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccccC Confidence 22111 1111 1223345678899999999999999999998888899988876555544433 3 Q ss_pred ccccccccceeeeEeeeeeeEeeehhhHHHhhc--C-hhHHHHHHHHHHHHHHHHHHHHHHHcccCccccc-cccccccc Q lcl|NC_021307. 76 DMKPITKGDMSVQQVEPHKIATIFVASAETVRA--N-PGNYLGTMRTKVATAIALAFDEAALHGTDSPFDK-NLDETTKS 151 (310) Q Consensus 76 ~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~--s-~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~-~~~~~~~~ 151 (310) +..+++..+|++.+...+-+++.+.|-+.+.+- + ..+...+-.+...+++..+.+..|+||+.+..+. +....... T Consensus 68 ~g~~~~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~ 147 (310) T protein:vir:97 68 AGAGKAAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCAS 147 (310) T ss_pred CCccccccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCc Confidence 445678899999999999999999999876652 3 3455555677888999999999999999887765 32222222 Q ss_pred ccce---ecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHh-hhccCccccccccccccccccCCceeeeeeE Q lcl|NC_021307. 152 VDLT---PATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGA-KDANGRPLFVESTYEAVTTPYREGRILGRPT 227 (310) Q Consensus 152 ~~~~---~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l-~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv 227 (310) .... ..++....+++..+++.+....+.+..|+|||+++.+++.+ +...++.+++..... .+..-.++.|+|+ T Consensus 148 ~q~i~~~~~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~---~G~~v~~~~GiPi 224 (310) T protein:vir:97 148 GQKATTGATGSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELP---SGAEVPAYSGTPI 224 (310) T ss_pred cceeecCCCCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccC---CCCEEeeeCCeEE Confidence 1211 12233333455556666655667889999999998888764 344455555543321 1122246899999 Q ss_pred EEeCCCCCC--------ceeEeeeccee-----eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEec Q lcl|NC_021307. 228 ILSDHVASG--------TTVGYLGDFSQ-----IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYG 294 (310) Q Consensus 228 ~~t~~~~~~--------~~~~~~gd~~~-----~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d 294 (310) +.++.+|.+ ++.|+...|.. -+.|...+ ..+++....-. ..-.++...+|++++++ T Consensus 225 ~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~------~~~glsVr~~G-----~~~~~~v~~~~V~~Y~~ 293 (310) T protein:vir:97 225 FRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTAT------QAAGIQVVDVG-----ESEDSDEHIWRVKWYCG 293 (310) T ss_pred EEeCccCCCccccccCCceeEEEEeeCccccccceeccccC------CccceeEEeCC-----cccCCcceeEEEEEeee Confidence 999998864 44444333321 22232110 01111110000 01245668899999999 Q ss_pred cEEeccCceEEEeecC Q lcl|NC_021307. 295 LLINDVEAFVKLTNAA 310 (310) Q Consensus 295 ~~v~~~~a~~~l~~aa 310 (310) .++.+++|+++|.+-- T Consensus 294 ~av~~~~A~a~L~~V~ 309 (310) T protein:vir:97 294 LALFSEKGLACADGIT 309 (310) T ss_pred EEEecccceeeecccc Confidence 9999999999998888 No 122 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.66 E-value=1.7e-18 Score=117.94 Aligned_cols=279 Identities=14% Similarity=0.081 Sum_probs=187.7 Q ss_pred CccchhhhHH---H-HHhhcc-ccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCcee------ Q lcl|NC_021307. 1 MAAGTAFPVN---H-TQIAQT-GDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSA------ 69 (310) Q Consensus 1 ~aa~~~~~~~---~-~~~~~~-~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a------ 69 (310) =++|..-... . ..+... .++...+.|+++++.+.|+.+.+..++.++....|.++.++.+|+.++.... T Consensus 113 ~~~Gd~~A~~~~e~~r~a~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~ 192 (410) T protein:vir:83 113 SAQGNASAADRLEVYARAADHQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQGVA 192 (410) T ss_pred cCCchHHHHHHHHHHHHhhccCcccccccccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeecccccccccccc Confidence 1556554422 1 122222 2334456788889999999999999999999899999999999888766543 Q ss_pred -eeecccccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHH---HcccCccccccc Q lcl|NC_021307. 70 -AWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAA---LHGTDSPFDKNL 145 (310) Q Consensus 70 -~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~---l~G~g~~~~~~~ 145 (310) +...||...+..+.+|+..+...+++|++..+||+.++.|.+...+...+.|..+++++-++++ |..+-+. T Consensus 193 ~kqa~EGd~L~~gKl~~~t~tA~ikTyGGyt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~----- 267 (410) T protein:vir:83 193 GGASDEKTELDSQKMVIDRLTVNAKTLGGYVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTG----- 267 (410) T ss_pred cccccccccccccceeeeeccceeehhcCcccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----- Confidence 3345999999999999999999999999999999999999999999999999999999888754 3322211 Q ss_pred ccccccccceecccchHHHHHHHHHHHhhhh--cCCCCEEEEehHHHHHHHHhhhccCccccccccc--cccccccCCce Q lcl|NC_021307. 146 DETTKSVDLTPATGTTYDAIGVNALSLLVNA--GKKWGATLLDDVAEPILNGAKDANGRPLFVESTY--EAVTTPYREGR 221 (310) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~--~~~~~~~~~~~ 221 (310) .......+.......+++...++.+. +..-..+.++|..+..+..+- ..+++.+..... .+....+..+. T Consensus 268 -----~~a~~~~Tad~~~~~i~da~~~v~da~~~~~~~~i~vS~DVl~~~~~~f-~~~~~~~~dt~Gfg~~~lg~gi~G~ 341 (410) T protein:vir:83 268 -----AVGYGNATADNVASAIWQAAGAVYTAVKGMGRLVIAIAPDVLGDFGPLF-APVNPTNAHSTGFEAGRFGQGVMGS 341 (410) T ss_pred -----hhhhhhccHHHHHHHHHHHHHHHhhhhccceeeeEEechhhhhhcccee-eccCCCCcccccccccccccchhhh Confidence 01111111112223344555555554 444556788999876664432 122222221111 11112345678 Q ss_pred eeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccC Q lcl|NC_021307. 222 ILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVE 301 (310) Q Consensus 222 l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~ 301 (310) ++++||++.+..++++..+ -|...+.....++-.+.+.++.......+ | + .||.+++..++ T Consensus 342 ~~~ipVvm~~~a~AgTA~f--~~~~Ai~~~eS~~gp~qL~d~~i~nLt~~--------y--------S-gY~a~a~~~~~ 402 (410) T protein:vir:83 342 ISGIPVVMSAALGSGDAYL--FSTAAIECFEQRVGTLQVVEPSVFGLQVA--------Y--------A-GYFSTLVVNED 402 (410) T ss_pred hcccceEEecCCCcCeeeE--eccceeeeeecCCceeEeeCCchhhhhhh--------h--------e-eeeeecccccc Confidence 9999999999999998654 47777777776654555554443222211 1 1 57788889999 Q ss_pred ceEEEeec Q lcl|NC_021307. 302 AFVKLTNA 309 (310) Q Consensus 302 a~~~l~~a 309 (310) ++.-|.+. T Consensus 403 gliPv~g~ 410 (410) T protein:vir:83 403 AIVPLVGS 410 (410) T ss_pred ceeeeccC Confidence 99999888 No 123 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.64 E-value=7e-17 Score=109.16 Aligned_cols=289 Identities=8% Similarity=0.044 Sum_probs=184.6 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeee-cccccc- Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWI-GEGDMK- 78 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v-~Eg~~~- 78 (310) |..-+... ..+...+.+.-+++.|+|++...+++.+++.+++++.+++++|.+.+..|++..-+..-.-. .|+... T Consensus 10 ~~n~~~~~--i~k~~it~~~l~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~kig~G~r~~r~~~e~~~~~ 87 (360) T protein:vir:99 10 VRNQNMNS--LSQKDIGLAELDGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQFGVPRLSGHTRDEEGSRT 87 (360) T ss_pred HhhhHHHH--HHhhhccccccCceeecHHHHHHHHHHHhhccchhhhcceeecccccccccccccceeeccccccCCCCC Confidence 22211111 12233443445688999999999999999999999999999999998888876544332221 233222 Q ss_pred cccccceeeeEee-eeeeEeeehhhHHHhhcCh----hHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccc----- Q lcl|NC_021307. 79 PITKGDMSVQQVE-PHKIATIFVASAETVRANP----GNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDET----- 148 (310) Q Consensus 79 ~~~~~~~~~i~l~-~~k~~~~~~is~ell~~s~----~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~----- 148 (310) ...+.+..++.+. .+++...+.++.+-++++. ..+++.|++.|++++++-++.-.++|+.........+. T Consensus 88 ~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~~~~~~d~fl 167 (360) T protein:vir:99 88 ENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNLQSIGGAAELD 167 (360) T ss_pred cCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhcccccCcccchhh Confidence 2244555555553 4566677788888777653 36789999999999999999999999876432110000 Q ss_pred ----------cccc---------cc-------eecccc---------------hHHHHHHHHHHHhhhhcCCC----CEE Q lcl|NC_021307. 149 ----------TKSV---------DL-------TPATGT---------------TYDAIGVNALSLLVNAGKKW----GAT 183 (310) Q Consensus 149 ----------~~~~---------~~-------~~~~~~---------------~~~~~~~~~~~~l~~~~~~~----~~~ 183 (310) .+.. .. ...+.. ....+..++...+...|+++ -+| T Consensus 168 ~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~kyr~~~~~~~~~ 247 (360) T protein:vir:99 168 NTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLDSRYRESDAYSPVL 247 (360) T ss_pred hhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcchhhhcCcccceEE Confidence 0000 00 000000 12233456777777777653 389 Q ss_pred EEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeec Q lcl|NC_021307. 184 LLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQ 263 (310) Q Consensus 184 ~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~ 263 (310) +|++..+...+..-..-.-++.-....+ ...-.+.|+|++..+.+|++. ++|.++++++++.+.+++++.+.+ T Consensus 248 ~~s~~~~~~yr~~L~~R~t~LGd~~l~g-----~~~~~~~Gipi~~v~~~pd~~--~mlT~p~NLi~g~~~~iri~~~~e 320 (360) T protein:vir:99 248 MTSPNQVQSYTMSLTEREDPLGSAVIFG-----DSDITPFSYDLVGVNGFPDEY--MMFTDPNNLAFGLYEEMELDQSTD 320 (360) T ss_pred EccCchHHHHHHHHhccCcccchhheec-----ccccccceeeeEEcCCCCCCc--eEEeccCceeEEeeeeeEEeeccc Confidence 9999987777654322222221111111 122357899999999999874 688999999999999999987655 Q ss_pred ceeeecccccccchhhhhcC-cEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 264 ATLNLGTPQAPNFVSLWQHN-LVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 264 ~~~~~~~~~~~~~~~~~~~~-~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ... ..++. .+.......+|+.+...+|++++++.- T Consensus 321 ~~~------------~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~ 356 (360) T protein:vir:99 321 TDK------------VHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLE 356 (360) T ss_pred chh------------hhhhceeeeEEEEEEeeEEEEecccEEEEecCC Confidence 321 11122 133445678999999999999999887 No 124 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.53 E-value=2.5e-15 Score=100.61 Aligned_cols=264 Identities=11% Similarity=-0.012 Sum_probs=164.1 Q ss_pred hccccCCCCceechhhHHHHHHHHHhhchhhhhcce----eecCCCceEEEEEcCCceeeeecccccccccccceeeeEe Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARK----IPMGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQV 90 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~----~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l 90 (310) |.. .-++|..+...+++.+++.+++.+++.. +...+.+++||+......+....++..++..+.+.+++++ T Consensus 1 MA~-----~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:79 1 MAF-----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred Ccc-----hhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCccccccceEEE Confidence 111 1146777888899999999998888644 3334678999997655556678889888888888888888 Q ss_pred eeee-eEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHH Q lcl|NC_021307. 91 EPHK-IATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNA 169 (310) Q Consensus 91 ~~~k-~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (310) +..+ .+.-+.|++.-...+..++.+ +.+++..++++++|+.++.=-...... . ............+.+.++ T Consensus 76 tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~vD~~i~~~~~~a~~~------~-~~~~~~~~~~~~~~i~~a 147 (273) T protein:vir:79 76 LIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA------L-TGSAPSDADDAFDLIASA 147 (273) T ss_pred EEeeecccceeeccHHHHhhcccHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc------c-ccccccchhhHHHHHHHH Confidence 8866 466667777555556678887 567788999999999766322111100 0 001111222233445566 Q ss_pred HHHhhhhcC--CCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCcee-Eeeecce Q lcl|NC_021307. 170 LSLLVNAGK--KWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTV-GYLGDFS 246 (310) Q Consensus 170 ~~~l~~~~~--~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~-~~~gd~~ 246 (310) ...+.+.+. .+-.++++|..+..|.+..+.-.+.-... ..+....+.-+++.|++|+.++++|.++.. .+.+-.+ T Consensus 148 ~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~--~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~ 225 (273) T protein:vir:79 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSG--DAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPS 225 (273) T ss_pred HHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcc--cccceeeeEeeEEeceEEEecccccccCceEEEEEecc Confidence 666666554 34578999999998875432111111110 111222344578999999999999975432 2222223 Q ss_pred eeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 247 QIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 247 ~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .+.+.. +...++..++.. +-...+++...+|+.+.||++++.|+..+ T Consensus 226 A~~~a~-~~~~~e~~r~~~----------------~~~~~v~~~~~yg~~v~~p~~vv~~~~~g 272 (273) T protein:vir:79 226 AAAYVS-QIDTVEALRDQD----------------SFSDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) T ss_pred ceeeee-ehhhhhcccCcc----------------cceeeeeeeeeeeeEEecCceEEEEeccC Confidence 332222 112333332221 11356788899999999999999998777 No 125 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.49 E-value=1e-14 Score=97.23 Aligned_cols=264 Identities=10% Similarity=-0.011 Sum_probs=161.2 Q ss_pred hccccCCCCceechhhHHHHHHHHHhhchhhhhcce----eecCCCceEEEEEcCCceeeeecccccccccccceeeeEe Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARK----IPMGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQV 90 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~----~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l 90 (310) |.. .-++|..+...+++.+++.+++.+++.. ....+.++.||+......+....++..++..+.+.+++++ T Consensus 1 MA~-----~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:10 1 MAF-----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred Ccc-----hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEE Confidence 111 1146777889999999999998887643 1234668999997665556677788888777778888888 Q ss_pred eeee-eEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHH Q lcl|NC_021307. 91 EPHK-IATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNA 169 (310) Q Consensus 91 ~~~k-~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (310) +..+ .+..+.|++.-..++..++++ +.+++.++++.++|+.++.=-...... .. ..+........+.+.++ T Consensus 76 tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~------~~-~~~~~~~~~~~~~i~~a 147 (273) T protein:vir:10 76 LIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA------LT-GSAPTDADDAFDLIAKA 147 (273) T ss_pred EEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc------cc-cccccchhHHHHHHHHH Confidence 8755 356666776545555677887 567788999999999877421111100 00 01111122233444566 Q ss_pred HHHhhhhcC--CCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCce-eEeeecce Q lcl|NC_021307. 170 LSLLVNAGK--KWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTT-VGYLGDFS 246 (310) Q Consensus 170 ~~~l~~~~~--~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~-~~~~gd~~ 246 (310) ...+..... .+-.++++|..+..|.+..+.-.+.-... ..+....+.-+++.|++|+.++++|.++. ..+.+..+ T Consensus 148 ~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~--~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~ 225 (273) T protein:vir:10 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSG--DAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPS 225 (273) T ss_pred HHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccc--cccceeeeeeeEEeceEEEEecccccCCccEEEEEecc Confidence 666666554 34578999999999875422111100000 01122233457899999999999997542 22333333 Q ss_pred eeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 247 QIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 247 ~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .+.+..+ ...++..++.. .| ...+++...+|+.+.||++++.|+..+ T Consensus 226 A~~~a~q-~~~~e~~r~~~-------------~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g 272 (273) T protein:vir:10 226 AAAYVSQ-IDTVEALRDQD-------------SF---SDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) T ss_pred ceeeeee-eehhhcccCCC-------------cc---eeeeeeeeeeeeeEeccceEEEEeccC Confidence 3333321 12333333221 11 345788899999999999999998877 No 126 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.49 E-value=1e-14 Score=97.23 Aligned_cols=264 Identities=10% Similarity=-0.011 Sum_probs=161.2 Q ss_pred hccccCCCCceechhhHHHHHHHHHhhchhhhhcce----eecCCCceEEEEEcCCceeeeecccccccccccceeeeEe Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARK----IPMGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQV 90 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~----~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l 90 (310) |.. .-++|..+...+++.+++.+++.+++.. ....+.++.||+......+....++..++..+.+.+++++ T Consensus 1 MA~-----~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:10 1 MAF-----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred Ccc-----hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEE Confidence 111 1146777889999999999998887643 1234668999997665556677788888777778888888 Q ss_pred eeee-eEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHH Q lcl|NC_021307. 91 EPHK-IATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNA 169 (310) Q Consensus 91 ~~~k-~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (310) +..+ .+..+.|++.-..++..++++ +.+++.++++.++|+.++.=-...... .. ..+........+.+.++ T Consensus 76 tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~------~~-~~~~~~~~~~~~~i~~a 147 (273) T protein:vir:10 76 LIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA------LT-GSAPTDADDAFDLIAKA 147 (273) T ss_pred EEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc------cc-cccccchhHHHHHHHHH Confidence 8755 356666776545555677887 567788999999999877421111100 00 01111122233444566 Q ss_pred HHHhhhhcC--CCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCce-eEeeecce Q lcl|NC_021307. 170 LSLLVNAGK--KWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTT-VGYLGDFS 246 (310) Q Consensus 170 ~~~l~~~~~--~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~-~~~~gd~~ 246 (310) ...+..... .+-.++++|..+..|.+..+.-.+.-... ..+....+.-+++.|++|+.++++|.++. ..+.+..+ T Consensus 148 ~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~--~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~ 225 (273) T protein:vir:10 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSG--DAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPS 225 (273) T ss_pred HHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccc--cccceeeeeeeEEeceEEEEecccccCCccEEEEEecc Confidence 666666554 34578999999999875422111100000 01122233457899999999999997542 22333333 Q ss_pred eeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 247 QIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 247 ~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .+.+..+ ...++..++.. .| ...+++...+|+.+.||++++.|+..+ T Consensus 226 A~~~a~q-~~~~e~~r~~~-------------~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g 272 (273) T protein:vir:10 226 AAAYVSQ-IDTVEALRDQD-------------SF---SDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) T ss_pred ceeeeee-eehhhcccCCC-------------cc---eeeeeeeeeeeeeEeccceEEEEeccC Confidence 3333321 12333333221 11 345788899999999999999998877 No 127 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.46 E-value=5.3e-15 Score=98.85 Aligned_cols=286 Identities=15% Similarity=0.013 Sum_probs=169.0 Q ss_pred CccchhhhHHHHHhhccccCCCCc-----eec-hhhHH-HHHHHHHhhchhhhhcceeec-CCCceEEEEEcC---Ccee Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQG-----YLE-PEQAQ-DYFAEAEKTSIVQRVARKIPM-GSTGVKIPHWTG---DVSA 69 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~-----~i~-~~~~~-~ii~~~~~~s~l~~~~~~~~~-~~~~~~ip~~~~---~~~a 69 (310) |.+-..+- ....++. ++- |+++. .+.+.+++.-+.-.+.+.... .+..+.+-.... ..++ T Consensus 1 ~~~~~~i~---------s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~ 71 (318) T protein:vir:10 1 MTAPTGIV---------SVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDV 71 (318) T ss_pred CCCCCcce---------eeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcH Confidence 33221111 1111222 222 55553 455556555555556665543 345555544322 2466 Q ss_pred eeecccccccccccceeeeEe-eeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccc Q lcl|NC_021307. 70 AWIGEGDMKPITKGDMSVQQV-EPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDET 148 (310) Q Consensus 70 ~~v~Eg~~~~~~~~~~~~i~l-~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~ 148 (310) .-+.||+++|...++++...+ ..+|.|..++||+|++..+..+..+...++++.++.++.|+..+.---++.-...... T Consensus 72 e~VaEggEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s 151 (318) T protein:vir:10 72 ADVAEFGEIPVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVP 151 (318) T ss_pred hhccCcccccccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCC Confidence 788999999999999987777 5589999999999999999999999999999999999999987765433321111111 Q ss_pred cccccceec-cc-chHHHHHHHHHH---------HhhhhcCCCCEEEEehHHHHHHHH------hhhccCcccccccccc Q lcl|NC_021307. 149 TKSVDLTPA-TG-TTYDAIGVNALS---------LLVNAGKKWGATLLDDVAEPILNG------AKDANGRPLFVESTYE 211 (310) Q Consensus 149 ~~~~~~~~~-~~-~~~~~~~~~~~~---------~l~~~~~~~~~~~~~~~~~~~l~~------l~d~~g~~~~~~~~~~ 211 (310) ....+.... .+ ....+.+....+ .-+..++.+..++|||.+|..|++ +...++.+++... T Consensus 152 ~~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~--- 228 (318) T protein:vir:10 152 TAWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAP--- 228 (318) T ss_pred cCCCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhcc--- Confidence 111110000 00 001111111111 112456778999999999999954 3233333333211 Q ss_pred ccccccCCceeeeeeEEEeCCCCCCceeEeeecceee-eEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEE Q lcl|NC_021307. 212 AVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQI-VWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVE 290 (310) Q Consensus 212 ~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~-~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~ 290 (310) ...+..+++++|+.|+.++.+|.++..++ +...+ ++++..+++.+..+.- .+.. .. -.+.+..+|+. T Consensus 229 -~~tg~~~g~~lGl~vi~s~~~p~~~alvl--q~g~vG~~~d~~pl~~t~~~~e---gg~~-~g-----~~~~s~~~~~~ 296 (318) T protein:vir:10 229 -DWTGNFPGSVMGLNVIRSRTFPIDRVLIM--ERGTVGFYSDTRPLQFTALYPE---GNGP-NG-----GPTESYRADAS 296 (318) T ss_pred -cccccccceeeceEEeecCccCCCeeEEE--ecCCcceeeccccceeeecccC---CCCC-CC-----Ccchhhheehh Confidence 11233467899999999999999875432 33222 3445555555543311 0011 01 12445677888 Q ss_pred EEeccEEeccCceEEEeecC Q lcl|NC_021307. 291 AEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 291 ~~~d~~v~~~~a~~~l~~aa 310 (310) .+-...|.+|+|+++||+-= T Consensus 297 ~~~~~~V~~PkA~~~itgi~ 316 (318) T protein:vir:10 297 HKRALAVDQPKAALWLTGIV 316 (318) T ss_pred eeeeeeeeCcceeEEEeecc Confidence 88899999999999999988 No 128 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.38 E-value=8.9e-14 Score=92.12 Aligned_cols=300 Identities=10% Similarity=0.017 Sum_probs=168.0 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceee---cCCCceEEEEEcCCceeeeeccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIP---MGSTGVKIPHWTGDVSAAWIGEGDM 77 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~---~~~~~~~ip~~~~~~~a~~v~Eg~~ 77 (310) |+=|+.+.-.. ..+..-..++|+.+..++++.+++.+.+.++++..+ ..+.+++||+.. .+.+.-..++.. T Consensus 1 ~~~~~~~~~~~-----~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g-~~~~~d~~~~~~ 74 (341) T protein:vir:94 1 MALGNTITGPS-----INTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS-ELGVEDKATDVP 74 (341) T ss_pred Ccchhhhcccc-----ccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC-cceeeeecCCCc Confidence 55554443211 122222336888889999999999998888876543 236679999864 566777788888 Q ss_pred ccccccceeeeEeee-eeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccc--ccccccccccc Q lcl|NC_021307. 78 KPITKGDMSVQQVEP-HKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDK--NLDETTKSVDL 154 (310) Q Consensus 78 ~~~~~~~~~~i~l~~-~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~--~~~~~~~~~~~ 154 (310) ++..+.+-++++++. +..+.-+.|+++-..++..++.+.+.++..+++++++|+.++.--...... ........... T Consensus 75 i~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~ 154 (341) T protein:vir:94 75 VGVQPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAI 154 (341) T ss_pred cccccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccc Confidence 887778888888888 445677888887776778899999999999999999999887542211111 00011111111 Q ss_pred eecccchHHHHHHHHHHHhhhhcC--CCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCC Q lcl|NC_021307. 155 TPATGTTYDAIGVNALSLLVNAGK--KWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDH 232 (310) Q Consensus 155 ~~~~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~ 232 (310) +........+.+.++...+...+. ..-.++++|..+..|.+...-... .....+....+.-+++.|++|+.+++ T Consensus 155 t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~----~~~g~~~l~~G~ig~i~G~~V~~Sn~ 230 (341) T protein:vir:94 155 TGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISK----DFINNAPIAQGQIGSLMGVRVIRTSL 230 (341) T ss_pred cCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhh----hccccchhheeeeeeEeceEEEEecc Confidence 111111122334456666665544 344578899999999653211111 11111122233446899999999999 Q ss_pred CCCCceeEee-------------------------eccee--eeEEeeccc-EEEEeecceeeecccccccchhhh--hc Q lcl|NC_021307. 233 VASGTTVGYL-------------------------GDFSQ--IVWGQVGGL-SFDVSDQATLNLGTPQAPNFVSLW--QH 282 (310) Q Consensus 233 ~~~~~~~~~~-------------------------gd~~~--~~~~~~~~~-~v~~~~~~~~~~~~~~~~~~~~~~--~~ 282 (310) +|.+....+. ++++. .+++.++.+ .++..+-..+.............| ++ T Consensus 231 lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (341) T protein:vir:94 231 IGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENRE 310 (341) T ss_pred ccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhh Confidence 9865422110 01111 111111111 111000000000000000000001 11 Q ss_pred CcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 283 NLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 283 ~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) -...+++..-||.++.||++.+.|...+ T Consensus 311 ~~~~i~~~~~~G~~~lrp~~~v~~~~~~ 338 (341) T protein:vir:94 311 QVWLMVGRQAYGARLYRPLHAVNIHTTG 338 (341) T ss_pred hhhhhhhhhhhcccccCcceeEEEecCc Confidence 1233456667899999999998888888 No 129 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.30 E-value=1.4e-13 Score=91.11 Aligned_cols=282 Identities=13% Similarity=0.091 Sum_probs=188.0 Q ss_pred Ccc-c-----hhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeee-ec Q lcl|NC_021307. 1 MAA-G-----TAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAW-IG 73 (310) Q Consensus 1 ~aa-~-----~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~-v~ 73 (310) |-+ | .+-.+.....+.++++..- .+|.-++..|-..+....+++++..+.+.++-- +-+......-+| .. T Consensus 101 ~~nsg~sd~knaW~A~l~E~gvt~td~n~-iLP~~il~aIq~al~~~~~~~~f~~v~n~p~l~--V~~~~dt~~qa~gHk 177 (400) T protein:vir:93 101 KKNSGKSEIKNAWSAKLAENGVTITDTTF-QLPRKLVESINTALLNTNPVFKVFHVTNVGALL--VSRSFDSANEAQVHK 177 (400) T ss_pred HhhcCCcchhhhhhhhhhhcccccCCchh-hcchHHHHHHHHhhhccCCcccceeeecCCcee--eecchhhhcccceec Confidence 111 1 1222334445555444332 678888899999999999999998888874432 222222233466 68 Q ss_pred ccccccccccceeeeEeeeeeeEeeehhhHHHhh--cChhHHHHHHHHHHHHHHHH-HHHHHHHcccCcccccccc---- Q lcl|NC_021307. 74 EGDMKPITKGDMSVQQVEPHKIATIFVASAETVR--ANPGNYLGTMRTKVATAIAL-AFDEAALHGTDSPFDKNLD---- 146 (310) Q Consensus 74 Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~--~s~~~~~~~v~~~l~~a~~~-~~d~~~l~G~g~~~~~~~~---- 146 (310) -|+.+.++..+|..-++.|+-++.+..+.+-..+ .+...+.+||+.+|...+.. .++++++-|+|+.+-.++. T Consensus 178 ~G~~K~eq~~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~ 257 (400) T protein:vir:93 178 DGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEAD 257 (400) T ss_pred cCCcccceeeeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHHHHHhhhheeecccccccCCCcchhh Confidence 9999999999999999999999999998555554 33456899999999999995 5899999998876432221 Q ss_pred --cccccccce-ecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceee Q lcl|NC_021307. 147 --ETTKSVDLT-PATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRIL 223 (310) Q Consensus 147 --~~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~ 223 (310) .+......+ .+..+.+.+++..+..-..+...++..++|+|..|+.|+.++|++|++.|...+...... +-+ T Consensus 258 Ik~I~~dt~kt~~a~~~~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~~~a~f~~~n~d~~IA-----~~f 332 (400) T protein:vir:93 258 VKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIA-----SEV 332 (400) T ss_pred hhhhhhhhhhhhhcCCccHHHHHHHHHhhhhhccCCceeEEeccchHHHHHHhcCCcceeeeeeccccchhh-----hhc Confidence 111111112 234455556665556666666778889999999999999999999999997655443322 223 Q ss_pred ee-eEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCc Q lcl|NC_021307. 224 GR-PTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEA 302 (310) Q Consensus 224 G~-pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a 302 (310) |+ .+++...++.++..+.+ |-. +++. -+++ +..+. .-|.+|+-.|..+...++.+.-+++ T Consensus 333 Gv~~Lv~~Tr~~~~kp~V~V-Dek-~~i~-~~~~--~t~~s--------------f~~~tNs~~ilvetlv~Gsi~~~N~ 393 (400) T protein:vir:93 333 GVDEIIVYTGSKALKPTVLV-DQK-YHID-MQDL--TKVDA--------------FEWKTNSNMILVETLTSGHVETYNA 393 (400) T ss_pred ccceeeeeccCCCCCceeee-ehh-hhcc-ccCc--eeccc--------------eeeeeccceEEeeeeeccceecccc Confidence 43 34455666666544433 432 2232 2332 22211 1256778888899999999999999 Q ss_pred eEEEeec Q lcl|NC_021307. 303 FVKLTNA 309 (310) Q Consensus 303 ~~~l~~a 309 (310) =+.++.+ T Consensus 394 ~ay~~v~ 400 (400) T protein:vir:93 394 GAVITVS 400 (400) T ss_pred eeeEeeC Confidence 8999888 No 130 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.26 E-value=1.1e-12 Score=86.23 Aligned_cols=287 Identities=13% Similarity=0.045 Sum_probs=160.3 Q ss_pred Ccc---chhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeecccc Q lcl|NC_021307. 1 MAA---GTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEGD 76 (310) Q Consensus 1 ~aa---~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~ 76 (310) ||. +.+..... .....+.+..=.+..+++..++.+.....|+++++.+...+. +.+++||+. +..+++....|+ T Consensus 1 ~~~~~~~~~~~~~~-~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G~~~~~~~~~G~ 78 (345) T protein:vir:22 1 MASMTGGQQMGTNQ-GKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPGE 78 (345) T ss_pred Ccccccchhccccc-ccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-cceEEEeeecCC Confidence 332 22222211 000001111112445788899999999999999999987776 567889986 677788888888 Q ss_pred ccccc--ccceeeeEeeeee-eEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHccc----Ccc-----cc-- Q lcl|NC_021307. 77 MKPIT--KGDMSVQQVEPHK-IATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGT----DSP-----FD-- 142 (310) Q Consensus 77 ~~~~~--~~~~~~i~l~~~k-~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~----g~~-----~~-- 142 (310) ++..+ .+..++.+|...+ +.....|.+-=--++..++.+.+.++++.++++..|+.++.-- ... .+ T Consensus 79 ~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~~ 158 (345) T protein:vir:22 79 NLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENIEG 158 (345) T ss_pred CCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Confidence 87554 4677776666544 2233333332223566789999999999999999999887311 000 00 Q ss_pred --cccc-cccccccceecc---cchHHHHHHHHHHHhhhhcCC--CCEEEEehHHHHHHHHhhhccC-cccccccccccc Q lcl|NC_021307. 143 --KNLD-ETTKSVDLTPAT---GTTYDAIGVNALSLLVNAGKK--WGATLLDDVAEPILNGAKDANG-RPLFVESTYEAV 213 (310) Q Consensus 143 --~~~~-~~~~~~~~~~~~---~~~~~~~~~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~l~d~~g-~~~~~~~~~~~~ 213 (310) .+.. ..+......... +....+.+.++...|...+.. .-..+++|..+..|..-+.-+. .+. ..+. T Consensus 159 ~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~-----~~~~ 233 (345) T protein:vir:22 159 LGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYA-----ALID 233 (345) T ss_pred cccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccc-----cccc Confidence 0000 000100100011 112223334555555554443 3468889999998865432221 111 1112 Q ss_pred ccccCCceeeeeeEEEeCCCCCCc------------------------------eeEeeecceeeeEEeecccEEEEeec Q lcl|NC_021307. 214 TTPYREGRILGRPTILSDHVASGT------------------------------TVGYLGDFSQIVWGQVGGLSFDVSDQ 263 (310) Q Consensus 214 ~~~~~~~~l~G~pv~~t~~~~~~~------------------------------~~~~~gd~~~~~~~~~~~~~v~~~~~ 263 (310) ...+.-+.+.|++|+.++++|.+. .+.++...+.+..+...+++++..++ T Consensus 234 ~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~ 313 (345) T protein:vir:22 234 PEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARR 313 (345) T ss_pred cccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeec Confidence 223345679999999999887321 11112222222233333344444443 Q ss_pred ceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 264 ATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 264 ~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .. .+ ...+++..-+|..+.||+|.+.|+.+= T Consensus 314 ~~--------------~~--~d~I~~~~a~G~~vlRPeaa~~i~~~~ 344 (345) T protein:vir:22 314 AN--------------FQ--ADQIIAKYAMGHGGLRPEAAGAVVFKV 344 (345) T ss_pred hh--------------HH--HHHHHHHHhcCCcccccceeEEEEEee Confidence 22 11 124667778999999999998888887 No 131 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.21 E-value=5.4e-12 Score=82.34 Aligned_cols=268 Identities=11% Similarity=0.068 Sum_probs=155.6 Q ss_pred hccccCCCCceechhhHHHHHH-HHHhhchhhh---------hcceee--cCCCceEEEEEcCC-ceeeeeccccccccc Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQAQDYFA-EAEKTSIVQR---------VARKIP--MGSTGVKIPHWTGD-VSAAWIGEGDMKPIT 81 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~~~ii~-~~~~~s~l~~---------~~~~~~--~~~~~~~ip~~~~~-~~a~~v~Eg~~~~~~ 81 (310) |. ++--+.+|.||+..++++ ...+.+.+.+ +..... .++..+++|.+..- .++.-+.|+..++.. T Consensus 1 MA--~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~ 78 (324) T protein:vir:59 1 MA--YTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQ 78 (324) T ss_pred CC--ceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchh Confidence 32 233456788888777544 4445544433 222221 34677899998653 567778899999999 Q ss_pred ccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccc-cccccccccccceecccc Q lcl|NC_021307. 82 KGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFD-KNLDETTKSVDLTPATGT 160 (310) Q Consensus 82 ~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~-~~~~~~~~~~~~~~~~~~ 160 (310) +.+.++.....++.+..+.++++...-+..+....+.+++++...+..++.+|.-...-.. .........+........ T Consensus 79 ~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~~~~~~ 158 (324) T protein:vir:59 79 KINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDISGTADGIY 158 (324) T ss_pred hcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeeeecccccee Confidence 9999998888889999999999888878888999999999999999999887653211000 000001111111111112 Q ss_pred hHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCce-- Q lcl|NC_021307. 161 TYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTT-- 238 (310) Q Consensus 161 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~-- 238 (310) +. +.+.+...++.+......+++||+.++..|+++.--+. +. ...++ ..-+.++|+||++++.||.... T Consensus 159 s~-~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~--~~--~s~~~----~~i~~~~G~~VivdD~~p~~~~~~ 229 (324) T protein:vir:59 159 SA-ETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLIEF--VK--DSQSG----IRFPTYMNKRVIVDDSMPVETLED 229 (324) T ss_pred cH-HHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhhhh--cc--ccccC----ceeeeecccEEEEeCCCCccccCC Confidence 22 33446777788888888999999999999997642211 11 11111 1235789999999999985321 Q ss_pred ------eEeeecceeeeEEe-ecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEe----ccCceE--- Q lcl|NC_021307. 239 ------VGYLGDFSQIVWGQ-VGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIN----DVEAFV--- 304 (310) Q Consensus 239 ------~~~~gd~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~----~~~a~~--- 304 (310) ..+|+. ..+.+.. ..++.+|..|+.. .+...+....++...+. .+.++. T Consensus 230 ~~~~y~s~l~~~-GAi~~~~~~~~v~vE~dRd~~----------------~g~~~l~~r~~~~~~p~G~s~~~~~~~~~s 292 (324) T protein:vir:59 230 GTKVFTSYLFGA-GALGYAEGQPEVPTETARNAL----------------GSQDILINRKHFVLHPRGVKFTENAMAGTT 292 (324) T ss_pred CCceEEEEEEec-CeEEEeecCCCcceecccCcc----------------ccceEEEEeeEEEeEeeeEEecccccCCCC Confidence 122221 2222222 2345556655532 23333444444433332 111111 Q ss_pred ----EEeecC Q lcl|NC_021307. 305 ----KLTNAA 310 (310) Q Consensus 305 ----~l~~aa 310 (310) .|..++ T Consensus 293 Pt~~~L~~~~ 302 (324) T protein:vir:59 293 PTDEELANGA 302 (324) T ss_pred CChhhhcCCc Confidence 111111 No 132 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.21 E-value=6.5e-12 Score=81.91 Aligned_cols=286 Identities=10% Similarity=-0.001 Sum_probs=163.4 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) |-- +....+....+++..-.+..+++..++.+.....++++++..+..+. +.++++|+. +..+++...-|+++. T Consensus 1 ms~----~~~~tr~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~ 75 (335) T protein:vir:63 1 MSF----LNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELE 75 (335) T ss_pred CCC----cccchhhhcccccchhheehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeee-eeeeeecccCCcCcC Confidence 211 11111222222222223555889999999999999999999887775 566889987 677888888888888 Q ss_pred ccccceeeeEeeeee-eEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHH----cccCccccc--------ccc Q lcl|NC_021307. 80 ITKGDMSVQQVEPHK-IATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAAL----HGTDSPFDK--------NLD 146 (310) Q Consensus 80 ~~~~~~~~i~l~~~k-~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l----~G~g~~~~~--------~~~ 146 (310) .+.+..++.+++... +.....|-+----++..++.+.+.+++.+++++..|+.++ .+.....+. ++. T Consensus 76 ~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~ 155 (335) T protein:vir:63 76 RSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVL 155 (335) T ss_pred CCCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCcc Confidence 777778888888865 3333334333233566789999999999999999999765 222221111 111 Q ss_pred cccccccceecccchHHH---HHHHHHHHhhhhcCC-----CCEEEEehHHHHHHHHhhhccCccccccccccccccccC Q lcl|NC_021307. 147 ETTKSVDLTPATGTTYDA---IGVNALSLLVNAGKK-----WGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYR 218 (310) Q Consensus 147 ~~~~~~~~~~~~~~~~~~---~~~~~~~~l~~~~~~-----~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~ 218 (310) ........... ...+. ...++...+...+.. .-+.+++|..|..|...+.--.+- |......+....+. T Consensus 156 ~~~~~tg~~~~--~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~-~~~s~~~~~~~~g~ 232 (335) T protein:vir:63 156 EKLDLTGLTAK--QAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVE-YQATGATNDYVKSR 232 (335) T ss_pred eeeeeccCccc--ccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccc-cccccccccccCce Confidence 11111111111 11222 223444555544433 357899999999997643221110 11111112233455 Q ss_pred CceeeeeeEEEeCCCCCCcee---------Eeeecceeee----------EEeecccEEEEeecceeeecccccccchhh Q lcl|NC_021307. 219 EGRILGRPTILSDHVASGTTV---------GYLGDFSQIV----------WGQVGGLSFDVSDQATLNLGTPQAPNFVSL 279 (310) Q Consensus 219 ~~~l~G~pv~~t~~~~~~~~~---------~~~gd~~~~~----------~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~ 279 (310) ...+.|+||+.++++|.+... .+-+|++... .+...+++.++.++... T Consensus 233 v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~------------- 299 (335) T protein:vir:63 233 VAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEK------------- 299 (335) T ss_pred eEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccch------------- Confidence 678999999999999843211 1223443322 22222222222222211 Q ss_pred hhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 280 WQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 280 ~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) | ...+.+..-+|..+.||++.+.++.+- T Consensus 300 ~---~~~i~~~~a~G~g~lRPe~a~~i~~tg 327 (335) T protein:vir:63 300 F---SWVLDTFQMYNIGARRPDTAGAIELKG 327 (335) T ss_pred h---hHHhHHHHHcCCcccccceEEEEEEcC Confidence 1 112334445899999999998888766 No 133 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.20 E-value=3.7e-12 Score=83.28 Aligned_cols=300 Identities=11% Similarity=-0.023 Sum_probs=161.0 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) |+--..-. ....+..++...=.+..+++..++.......++++++..+..+. +.+++||+. +..+++...-|+++. T Consensus 1 m~~~~~~~--~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~i-G~~~~~~~~~g~~l~ 77 (334) T protein:vir:80 1 MTYPAANT--HTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRV-GASTIAGRKAGEELV 77 (334) T ss_pred CCCCcCCC--ccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeee-cceeeeeecCCCCCC Confidence 33221100 00111111111111233888999999999999999999988776 667999976 677788888899998 Q ss_pred ccccceeeeEeeeee-eEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHc----ccCcccc--------cccc Q lcl|NC_021307. 80 ITKGDMSVQQVEPHK-IATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALH----GTDSPFD--------KNLD 146 (310) Q Consensus 80 ~~~~~~~~i~l~~~k-~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~----G~g~~~~--------~~~~ 146 (310) .+.++-++.+|.... +.....|.+-=--++..++.+.+.++++.+++++.|+.++. +.....+ .++. T Consensus 78 ~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~~ 157 (334) T protein:vir:80 78 VQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGIL 157 (334) T ss_pred CCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCCcc Confidence 888888888888866 44444554443346677899999999999999999997752 2211111 1111 Q ss_pred cccccccceecccchHHHH---HHHHHHHhhhhcCC-----CCEEEEehHHHHHHHHhhhccCccccccccccccccccC Q lcl|NC_021307. 147 ETTKSVDLTPATGTTYDAI---GVNALSLLVNAGKK-----WGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYR 218 (310) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~---~~~~~~~l~~~~~~-----~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~ 218 (310) ........+.....+.+.+ +.++...+...+.. .-..+++|..|..|..-..-..+- |...........+. T Consensus 158 ~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d-~~~s~~~~~~~~g~ 236 (334) T protein:vir:80 158 LPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVE-FGAKEGGNSFVGGR 236 (334) T ss_pred eeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccce-ecccccccccccee Confidence 1111111111111112111 12333444443333 357889999999997532211110 11000111222334 Q ss_pred CceeeeeeEEEeCCCCCCce---------eEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEE Q lcl|NC_021307. 219 EGRILGRPTILSDHVASGTT---------VGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRV 289 (310) Q Consensus 219 ~~~l~G~pv~~t~~~~~~~~---------~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ 289 (310) -.++.|+||+.|+++|.... -.+-|||+......+....+...+...+.......... +. -.+.+ T Consensus 237 i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~----~~--d~i~~ 310 (334) T protein:vir:80 237 IAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKD----FG--HYLDT 310 (334) T ss_pred EEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechhh----HH--HHHHH Confidence 56799999999999995421 12335555543221222222111111111000000000 00 11223 Q ss_pred EEEeccEEeccCceEEEeecC Q lcl|NC_021307. 290 EAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 290 ~~~~d~~v~~~~a~~~l~~aa 310 (310) ..-+|..+.||+|++.++..= T Consensus 311 ~~a~G~g~lRPeaa~vv~~~~ 331 (334) T protein:vir:80 311 FQSYNIGQRRPDAVAVHDITV 331 (334) T ss_pred HHHcCCceeccceEEEEEEee Confidence 346788999999988887777 No 134 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=99.19 E-value=5.8e-12 Score=82.20 Aligned_cols=230 Identities=12% Similarity=0.039 Sum_probs=152.0 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) |+--..-.......+.-.+ +......|||.+.+.++|++.++++... +..+.+.+.++-|++.|..=++.++ T Consensus 1 m~~~~~~~~TL~e~Akr~~-------~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~fR~lN~g~~ 73 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVD-------PNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSATWRLLNYGVQ 73 (328) T ss_pred CCccccccccHHHHHhhhC-------cchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCceeeecCCccC Confidence 3222111122222111111 2235667999999999999999998875 4457888999999999999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccc--------- Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANP--GNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDET--------- 148 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~--~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~--------- 148 (310) +++.++.+++-..+-+++.+.|.+.+.+... .++...-.+...+++.+++...||+|+.+..|....+. T Consensus 74 ~s~~tt~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~ 153 (328) T protein:vir:95 74 PSKSTTVQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSA 153 (328) T ss_pred cccceeEEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCcccc Confidence 9999999999999999999999999998653 23344455668899999999999999765443211000 Q ss_pred -------------------------------------cccc--------------------------------------- Q lcl|NC_021307. 149 -------------------------------------TKSV--------------------------------------- 152 (310) Q Consensus 149 -------------------------------------~~~~--------------------------------------- 152 (310) .++. T Consensus 154 ~~a~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~v 233 (328) T protein:vir:95 154 GNAQNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYV 233 (328) T ss_pred ccccceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccE Confidence 0000 Q ss_pred ----cc------eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCcee Q lcl|NC_021307. 153 ----DL------TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRI 222 (310) Q Consensus 153 ----~~------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l 222 (310) .+ ..+......++..++...+......+.+|+||++....|+++....++..+......+ ...-.+ T Consensus 234 vrI~NId~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g----~~~t~~ 309 (328) T protein:vir:95 234 VRIANIDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEG----EWWTSF 309 (328) T ss_pred EEEecCcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCC----cceeEE Confidence 00 0001111223333444455556677889999999999999865444433333222222 234568 Q ss_pred eeeeEEEeCCCCCCceeEe Q lcl|NC_021307. 223 LGRPTILSDHVASGTTVGY 241 (310) Q Consensus 223 ~G~pv~~t~~~~~~~~~~~ 241 (310) .|+||..++.+-.++..++ T Consensus 310 ~gipir~~dai~~tE~~vv 328 (328) T protein:vir:95 310 RGVPIRETDALLETEARVV 328 (328) T ss_pred CCeEEEEEeeeecCccccC Confidence 9999999999877665554 No 135 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.18 E-value=7.3e-12 Score=81.64 Aligned_cols=288 Identities=10% Similarity=-0.004 Sum_probs=161.8 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) |-- +....+.+..+++..-.+..+++..++.+.....++++++..+..+. +.++++|+. +..+++...-|+++. T Consensus 1 ms~----~~~~t~~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~ 75 (335) T protein:vir:78 1 MSF----LNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELE 75 (335) T ss_pred CCc----cccccccccccccchhhhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-eeeeecccccCcccC Confidence 211 11111222222222223556888999999999999999999887765 567899976 667778888888887 Q ss_pred ccccceeeeEeeeeee-EeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHH----cccCccccc--------ccc Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKI-ATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAAL----HGTDSPFDK--------NLD 146 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~-~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l----~G~g~~~~~--------~~~ 146 (310) .+.+..++.++....+ .....|-+----++..++.+.+.+++++++++..|+.++ .+.....+. ++. T Consensus 76 ~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~ 155 (335) T protein:vir:78 76 RSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVL 155 (335) T ss_pred CCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCcc Confidence 7777788888887553 333333332223566789999999999999999999775 222211111 111 Q ss_pred ccccccccee-cccchHHHHHHHHHHHhhhhcCC-----CCEEEEehHHHHHHHHhhhccCccccccccccccccccCCc Q lcl|NC_021307. 147 ETTKSVDLTP-ATGTTYDAIGVNALSLLVNAGKK-----WGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREG 220 (310) Q Consensus 147 ~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~~~-----~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~ 220 (310) .......... .......+...++...+...+.. .-+.+++|..|..|.....--.+. |......+....+... T Consensus 156 ~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~-~~~s~~~~~~~~g~v~ 234 (335) T protein:vir:78 156 EKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVE-YQATGATNDYVKSRVA 234 (335) T ss_pred eeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccccccccc-ccccccccccccceeE Confidence 1111111111 11111122223334444443332 357899999999997643221111 1111111223345567 Q ss_pred eeeeeeEEEeCCCCCCcee---------Eeeeccee----------eeEEeecccEEEEeecceeeecccccccchhhhh Q lcl|NC_021307. 221 RILGRPTILSDHVASGTTV---------GYLGDFSQ----------IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQ 281 (310) Q Consensus 221 ~l~G~pv~~t~~~~~~~~~---------~~~gd~~~----------~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 281 (310) .+.|+||+.++++|.+... .+-+|++. +..+...++..++.++... | T Consensus 235 ~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~-------------~- 300 (335) T protein:vir:78 235 ILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQ-------------F- 300 (335) T ss_pred EeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccch-------------h- Confidence 8999999999999954211 01123333 2222222223333222211 1 Q ss_pred cCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 282 HNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 282 ~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ...+.+..-+|..+.||++.+.++..- T Consensus 301 --~~~i~~~~a~G~g~lRPe~a~~i~~tg 327 (335) T protein:vir:78 301 --SWVLDTFQMYNIGARRPDTAGAIELKG 327 (335) T ss_pred --hHhhhHHHHcCCcccCcceEEEEEecC Confidence 122334445899999999998888766 No 136 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.18 E-value=4.2e-12 Score=82.97 Aligned_cols=285 Identities=12% Similarity=0.021 Sum_probs=159.0 Q ss_pred Ccc---chhhhHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeeccc Q lcl|NC_021307. 1 MAA---GTAFPVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEG 75 (310) Q Consensus 1 ~aa---~~~~~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg 75 (310) ||. |.++... .+..+.+..-. +..+++..++.+.....+.++++..+..+. +.+++||+. +..+++....| T Consensus 1 ma~~~~~~~~~t~---~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~i-G~~~~~~~~~G 76 (347) T protein:vir:94 1 MANMNGGQQMGKD---QGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVL-GRTKAAYLQPG 76 (347) T ss_pred CCccccccccccc---cccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeec-cceeEeeeecC Confidence 442 2222211 11111111111 344888999999999999999998886654 667888875 56677888888 Q ss_pred ccccc--cccceeeeEeeeeee-EeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHc----ccCcc----cccc Q lcl|NC_021307. 76 DMKPI--TKGDMSVQQVEPHKI-ATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALH----GTDSP----FDKN 144 (310) Q Consensus 76 ~~~~~--~~~~~~~i~l~~~k~-~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~----G~g~~----~~~~ 144 (310) .++.. ..++.++.++...++ .....|-+-=--++..++.+.+.++++.++++..|+.++. +.... .+.. T Consensus 77 ~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~ 156 (347) T protein:vir:94 77 ENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIA 156 (347) T ss_pred cCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 88754 457888888877664 4444454433345667899999999999999999998863 11110 0000 Q ss_pred ccccc---cc-----ccce-ecccchHHHHHHHHHHHhhhhcCC--CCEEEEehHHHHHHHHhhhcc-Cccccccccccc Q lcl|NC_021307. 145 LDETT---KS-----VDLT-PATGTTYDAIGVNALSLLVNAGKK--WGATLLDDVAEPILNGAKDAN-GRPLFVESTYEA 212 (310) Q Consensus 145 ~~~~~---~~-----~~~~-~~~~~~~~~~~~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~l~d~~-g~~~~~~~~~~~ 212 (310) +.+.. .. .... ........+.+.++...|...+.. +-.++++|+.+..|.+..+.. +.+-.. . T Consensus 157 g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~-----~ 231 (347) T protein:vir:94 157 GLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQAL-----I 231 (347) T ss_pred cCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccccccccc-----c Confidence 00000 00 0000 011111122234555555544433 334556899998887643322 222111 1 Q ss_pred cccccCCceeeeeeEEEeCCCCCCc---ee--------------------Eeeecceeee--E--------EeecccEEE Q lcl|NC_021307. 213 VTTPYREGRILGRPTILSDHVASGT---TV--------------------GYLGDFSQIV--W--------GQVGGLSFD 259 (310) Q Consensus 213 ~~~~~~~~~l~G~pv~~t~~~~~~~---~~--------------------~~~gd~~~~~--~--------~~~~~~~v~ 259 (310) ....+.-+.+.|++|+.++++|.+. .. -+-+||++.. + +...++.++ T Consensus 232 ~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e 311 (347) T protein:vir:94 232 DPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALE 311 (347) T ss_pred ccccceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhccccee Confidence 1223455689999999999998421 00 0223333321 1 122233333 Q ss_pred EeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 260 VSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +.++.. .+ ...|.+..-+|..+.||++.+.++.++ T Consensus 312 ~~~~~~--------------~~--~~~i~~~~a~G~g~~rPe~a~~i~~~~ 346 (347) T protein:vir:94 312 RARRAN--------------FQ--ADQIIAKYAMGHGGLRPEACGALVFKK 346 (347) T ss_pred eeechh--------------hh--hhhhhhhhhhcCcccccceeEEEEecC Confidence 332221 12 234667778899999999987665555 No 137 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.16 E-value=7.3e-12 Score=81.63 Aligned_cols=286 Identities=14% Similarity=0.028 Sum_probs=156.7 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeec---CCCceEEEEEcCCceeeeeccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPM---GSTGVKIPHWTGDVSAAWIGEGDM 77 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~---~~~~~~ip~~~~~~~a~~v~Eg~~ 77 (310) ||-=..=.-.......+ +....++|+.+..++++.+++.+++.+++..... .+.+++||+.. .+++....++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~--t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g-~~~a~d~~~g~~ 77 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDL--SNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS-RAAVYDKQPQTP 77 (381) T ss_pred CceecccccccCcccch--hhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC-cceeeeecCCCc Confidence 44322111110011111 1112367778889999999999998888765332 35678999864 567888899999 Q ss_pred ccccccceeeeEeeeee-eEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCc----cccccccc--ccc Q lcl|NC_021307. 78 KPITKGDMSVQQVEPHK-IATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDS----PFDKNLDE--TTK 150 (310) Q Consensus 78 ~~~~~~~~~~i~l~~~k-~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~----~~~~~~~~--~~~ 150 (310) ++..+.+.++++++..+ ......|++.-..++..++.+.+.+++..+++++.|+.++.-... ..+..... ... T Consensus 78 i~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i~ 157 (381) T protein:vir:80 78 VNLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTLG 157 (381) T ss_pred ccccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccc Confidence 98888888888888844 455577887766677789999999999999999999998743211 11100000 000 Q ss_pred ccc----ceecccchHHHHHHHHHHHhhhhcC--CCCEEEEehHHHHHHHHhhhccC-ccccccccccccccccCCceee Q lcl|NC_021307. 151 SVD----LTPATGTTYDAIGVNALSLLVNAGK--KWGATLLDDVAEPILNGAKDANG-RPLFVESTYEAVTTPYREGRIL 223 (310) Q Consensus 151 ~~~----~~~~~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~~~~~l~~l~d~~g-~~~~~~~~~~~~~~~~~~~~l~ 223 (310) ... .+........+.+.++...+...+. .+-.++++|..+..|.+...-.. .+.. ......+..+++. T Consensus 158 ~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~-----~~~l~~G~Ig~i~ 232 (381) T protein:vir:80 158 DGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQ-----VKPVTSGVVGTIL 232 (381) T ss_pred ccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhcc-----chhhhceeeeEEc Confidence 000 0111111122333456666655544 23478999999999875321111 1111 1112233456899 Q ss_pred eeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCc- Q lcl|NC_021307. 224 GRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEA- 302 (310) Q Consensus 224 G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a- 302 (310) |++|+.++++|.+........+. + ..... . .. ...... ..|..+..+++....+|.++...-. T Consensus 233 G~~Vv~Sn~lp~~~~t~~~~~ag----a---p~~~~--~--~~-~~~~~~----g~~s~~a~av~~~k~yd~~~~~~~~~ 296 (381) T protein:vir:80 233 GMEVIVTTQIGINSLTGYVNGQG----A---PTQPT--P--GV-LGSPYL----PDQAGTANVVNTGSASDLAVSLSYFG 296 (381) T ss_pred ceEEEeecccccccccceeeecc----c---ccccc--c--cc-cccccc----cccccceeeeeeeeeeceeeeeeecc Confidence 99999999999754321110000 0 00000 0 00 000000 1133445667777777777754322 Q ss_pred eEEEeecC Q lcl|NC_021307. 303 FVKLTNAA 310 (310) Q Consensus 303 ~~~l~~aa 310 (310) +-...++- T Consensus 297 ~~~~~g~~ 304 (381) T protein:vir:80 297 LPVFSGAG 304 (381) T ss_pred ceeeecce Confidence 22222111 No 138 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.14 E-value=3.2e-11 Score=78.09 Aligned_cols=291 Identities=12% Similarity=0.017 Sum_probs=158.5 Q ss_pred CccchhhhHHHHHhhccccCCC-C------ceechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeee Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMF-Q------GYLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWI 72 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~-g------~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v 72 (310) |+--..- ...+.+.++..+ | .+..+++..++.+.....|+++++.++..+. +.+++||+. +..+++.. T Consensus 1 ~~~~~~~---~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~i-G~~t~~~~ 76 (375) T protein:vir:10 1 MANANQV---ALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYT-GRMTSSFH 76 (375) T ss_pred Ccccccc---ccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEee-eeeEEeee Confidence 3322111 111122222111 1 1334778888999999999999999987765 667889987 66677777 Q ss_pred ccccccc---ccccceeeeEeeeeee-EeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHc----ccCccccc- Q lcl|NC_021307. 73 GEGDMKP---ITKGDMSVQQVEPHKI-ATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALH----GTDSPFDK- 143 (310) Q Consensus 73 ~Eg~~~~---~~~~~~~~i~l~~~k~-~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~----G~g~~~~~- 143 (310) .-|+++. ..++...+.+|...++ +....|.+-=--++..++.+.+.++++.++++..|+.++. +.....+. T Consensus 77 t~G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~ 156 (375) T protein:vir:10 77 TPGTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVS 156 (375) T ss_pred cCCcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 7777663 2345555555655443 3333343332335677999999999999999999998863 21111111 Q ss_pred -------cccccccccccee---cccchHHHHHHHHHHHhhhhcCC--CCEEEEehHHHHHHHHhhhccCccccccccc- Q lcl|NC_021307. 144 -------NLDETTKSVDLTP---ATGTTYDAIGVNALSLLVNAGKK--WGATLLDDVAEPILNGAKDANGRPLFVESTY- 210 (310) Q Consensus 144 -------~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~- 210 (310) +............ .+.....+.+.++...|.+.+.. .-..+++|..|..|.+-+|.+. +...+.. T Consensus 157 ~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~--~~n~d~~~ 234 (375) T protein:vir:10 157 ATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNG--LVNRDVQG 234 (375) T ss_pred cccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccc--eeeecccc Confidence 1111111101111 11222233344555666555443 4467889999999876555431 1111111 Q ss_pred cccccccCCceeeeeeEEEeCCCCCCcee-----------------------------------Eeeecc---e------ Q lcl|NC_021307. 211 EAVTTPYREGRILGRPTILSDHVASGTTV-----------------------------------GYLGDF---S------ 246 (310) Q Consensus 211 ~~~~~~~~~~~l~G~pv~~t~~~~~~~~~-----------------------------------~~~gd~---~------ 246 (310) .+....+.-..+.|++|+.++++|..... -+-+|| + T Consensus 235 ~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~ 314 (375) T protein:vir:10 235 SALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLI 314 (375) T ss_pred cceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEE Confidence 12222233357999999999999843210 011233 1 Q ss_pred ----eeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 247 ----QIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 247 ----~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .+..+...++++++.+..+ . -++-...|.+..-+|..+.||++.+.|+..| T Consensus 315 ~~~~A~g~v~~~~~~~~~~~~~~------~-------~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~ 369 (375) T protein:vir:10 315 FQKEAAGVVEAIGPQVQVTNGDV------S-------VIYQGDVILGRMAMGADYLNPAAAVELYIGA 369 (375) T ss_pred Echhheeeeeeeccccccccchh------h-------heeeeeeeeeeeeeccCccCceeEEEEecCc Confidence 1111222223333221000 0 0122344667778899999999999998776 No 139 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.13 E-value=6.1e-12 Score=82.08 Aligned_cols=288 Identities=11% Similarity=0.025 Sum_probs=155.5 Q ss_pred Cccc---hhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeecccc Q lcl|NC_021307. 1 MAAG---TAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEGD 76 (310) Q Consensus 1 ~aa~---~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~ 76 (310) ||.- ......... ...+.+..=.+..+++..++.+.....++++++.++..+. +.+++||+. +..++.....|+ T Consensus 1 ma~~~~~~~~n~~~~~-~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~i-G~~~~~~~~~G~ 78 (344) T protein:vir:10 1 MANMTGGQQLGTNQGK-DVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPGE 78 (344) T ss_pred CccccccccCCcccCC-ccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEee-ceeEEEeeecCC Confidence 3322 111111000 0001111111334788899999999999999999987776 667889986 667778888888 Q ss_pred ccccc--ccceeeeEeeeee-eEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcc----cCcccc--cccc- Q lcl|NC_021307. 77 MKPIT--KGDMSVQQVEPHK-IATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHG----TDSPFD--KNLD- 146 (310) Q Consensus 77 ~~~~~--~~~~~~i~l~~~k-~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G----~g~~~~--~~~~- 146 (310) ++..+ ++.-++.+|...+ +.....|.+-=--++..++.+.+.++++.++++..|+.++.- .....+ ..+. T Consensus 79 ~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~g 158 (344) T protein:vir:10 79 NLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENITG 158 (344) T ss_pred CCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc Confidence 88654 4667777777755 333334443333356678999999999999999999988531 111111 0100 Q ss_pred -ccc--cccccee---ccc----chHHHHHHHHHHHhhhhcCC--CCEEEEehHHHHHHHHhhhccCccccccccccccc Q lcl|NC_021307. 147 -ETT--KSVDLTP---ATG----TTYDAIGVNALSLLVNAGKK--WGATLLDDVAEPILNGAKDANGRPLFVESTYEAVT 214 (310) Q Consensus 147 -~~~--~~~~~~~---~~~----~~~~~~~~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~ 214 (310) ... ...+... ... ....+.+.++...|...+.. .-..+++|..+..|..-+.-+.. .....+.. T Consensus 159 ~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~----~~~~~~~~ 234 (344) T protein:vir:10 159 LGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAA----NYAALIDP 234 (344) T ss_pred ccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhccccccc----ccccccce Confidence 000 0000000 011 11122233445555544433 33567799999988643322111 11112222 Q ss_pred cccCCceeeeeeEEEeCCCCCCc----eeE---------------eeecceeee----------EEeecccEEEEeecce Q lcl|NC_021307. 215 TPYREGRILGRPTILSDHVASGT----TVG---------------YLGDFSQIV----------WGQVGGLSFDVSDQAT 265 (310) Q Consensus 215 ~~~~~~~l~G~pv~~t~~~~~~~----~~~---------------~~gd~~~~~----------~~~~~~~~v~~~~~~~ 265 (310) ..+.-+.+.|++|+.++++|.+. ..+ +..+|++.. .+...+++++..++.. T Consensus 235 ~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~ 314 (344) T protein:vir:10 235 EKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 314 (344) T ss_pred eeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccchh Confidence 23344678999999999998431 011 111232211 1122223334333221 Q ss_pred eeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 266 LNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .|. ..+++..-+|.++.||++.+.++.+. T Consensus 315 -------------~~~---d~i~g~~~~G~~vlRPe~a~~v~~~~ 343 (344) T protein:vir:10 315 -------------FQA---DQIIAKYAMGHGGLRPEAAGAVVFKT 343 (344) T ss_pred -------------HHH---HHHHHHhhcccceecccceEEEEeec Confidence 111 24567778999999999885555555 No 140 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.13 E-value=1e-11 Score=80.87 Aligned_cols=301 Identities=12% Similarity=-0.003 Sum_probs=159.7 Q ss_pred CccchhhhHHHHHhhccccCCC-C--ceechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeecccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMF-Q--GYLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEGD 76 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~-g--~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~ 76 (310) |.--..+....-.+..-+...+ . .+..+++..++++...+.|+++.+.+..+.. +.+++||+. +..+++....|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~i-g~~~~~~~~~g~ 79 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEec-cceeEeeecCCC Confidence 2221111111111111111111 1 2445788999999999999999998876654 667889987 566777777777 Q ss_pred cccc-cccceeeeEeeeee-eEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHc----ccCcccccccccccc Q lcl|NC_021307. 77 MKPI-TKGDMSVQQVEPHK-IATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALH----GTDSPFDKNLDETTK 150 (310) Q Consensus 77 ~~~~-~~~~~~~i~l~~~k-~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~----G~g~~~~~~~~~~~~ 150 (310) .+.. .+++-++.++...+ .+....|-+-=--++..++.+.+.++.+.++++..|+.++. +..+..+........ T Consensus 80 ~l~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~ 159 (332) T protein:vir:78 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGF 159 (332) T ss_pred CCCCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccccccc Confidence 7643 34666777777755 33333443322225567899999999999999999997763 221111111111011 Q ss_pred cccceec---ccchHHHHHHHHHHHhhhhcCC-CC-EEEEehHHHHHHHHhhhccCccccccc-ccccccccc-CCceee Q lcl|NC_021307. 151 SVDLTPA---TGTTYDAIGVNALSLLVNAGKK-WG-ATLLDDVAEPILNGAKDANGRPLFVES-TYEAVTTPY-REGRIL 223 (310) Q Consensus 151 ~~~~~~~---~~~~~~~~~~~~~~~l~~~~~~-~~-~~~~~~~~~~~l~~l~d~~g~~~~~~~-~~~~~~~~~-~~~~l~ 223 (310) .+..... ......+.+.++...|...+.. .. .++++|..+..|.+.+|.. ..-... ..++....+ .-+.+. T Consensus 160 ~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~--~~n~~~~~~~~~~~~g~~i~~i~ 237 (332) T protein:vir:78 160 HVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTN--ILNREIGNSQGDMNSGKGLYSIA 237 (332) T ss_pred ccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCce--eeeeeccccccceecceeeeEEe Confidence 1111111 1112233345666666665553 33 4666999998887644331 111100 111112222 246799 Q ss_pred eeeEEEeCCCCCCcee------------Eeeeccee--eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEE Q lcl|NC_021307. 224 GRPTILSDHVASGTTV------------GYLGDFSQ--IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRV 289 (310) Q Consensus 224 G~pv~~t~~~~~~~~~------------~~~gd~~~--~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ 289 (310) |++|+.++++|.+... .+-|+|+. .++..+..+......+..++....+. ....| .-.+++ T Consensus 238 G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~--~~~~~---~d~i~~ 312 (332) T protein:vir:78 238 GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF--NVQYQ---GDLIVG 312 (332) T ss_pred eeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhccc--chhhh---Hhhhhh Confidence 9999999999853210 12334444 12222222222211111111100000 00111 235667 Q ss_pred EEEeccEEeccCceEEEeec Q lcl|NC_021307. 290 EAEYGLLINDVEAFVKLTNA 309 (310) Q Consensus 290 ~~~~d~~v~~~~a~~~l~~a 309 (310) ...+|..+.||++++.|+-| T Consensus 313 ~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 313 KLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred hhhhcCceecccceEEEeeC Confidence 77899999999999999888 No 141 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.12 E-value=1e-10 Score=75.41 Aligned_cols=285 Identities=11% Similarity=-0.020 Sum_probs=154.4 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) |.-... ....+..++...=.+.-+++..++.+.....++++++..+..+. +.+++||+. +..+++...-|+++- T Consensus 1 ms~~n~----~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~i-G~~~~~~~~~G~~ld 75 (364) T protein:vir:10 1 MSNPNV----LTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYI-GETELQVLSPGKSPD 75 (364) T ss_pred CCCccc----ccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeee-eeeEEeeeccCcccC Confidence 222211 11111122222112334778889999999999999998887765 567899987 566677777777765 Q ss_pred ccccceeeeEeeeeee-EeeehhhHHHhhcChhH-HHHHHHHHHHHHHHHHHHHHHHccc---C-c---ccccccccccc Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKI-ATIFVASAETVRANPGN-YLGTMRTKVATAIALAFDEAALHGT---D-S---PFDKNLDETTK 150 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~-~~~~~is~ell~~s~~~-~~~~v~~~l~~a~~~~~d~~~l~G~---g-~---~~~~~~~~~~~ 150 (310) .+.+.-++.++....+ .....|-+----++..+ +-+.+.++++.++++..|+.++.-- + + +....+..... T Consensus 76 ~~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~ 155 (364) T protein:vir:10 76 ASPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGH 155 (364) T ss_pred CCCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCC Confidence 5667778877777553 33333322212244566 6789999999999999999885211 0 0 00001111111 Q ss_pred cc----cceecc-cchHHHH---HHHHHHHhhhhcC--CCCEEEEehHHHHHHHHhhhccCccccccc--cccccccccC Q lcl|NC_021307. 151 SV----DLTPAT-GTTYDAI---GVNALSLLVNAGK--KWGATLLDDVAEPILNGAKDANGRPLFVES--TYEAVTTPYR 218 (310) Q Consensus 151 ~~----~~~~~~-~~~~~~~---~~~~~~~l~~~~~--~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~--~~~~~~~~~~ 218 (310) +. ...+.. .+....+ +.++...|.+.+. ..-+.+++|..|..|.+-. +.+...- ...+....+. T Consensus 156 g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~----~lvn~d~~~~~~~~~~~G~ 231 (364) T protein:vir:10 156 GFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDAD----RIVDKSYTIAASDNTVDGF 231 (364) T ss_pred cceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCC----ccccccccccCCCccccce Confidence 10 111111 1111111 1234444544443 4457889999998887622 2221110 0122333445 Q ss_pred CceeeeeeEEEeCCCCCCc---------------------eeEeeecceee----------eEEeecccEEEEeecceee Q lcl|NC_021307. 219 EGRILGRPTILSDHVASGT---------------------TVGYLGDFSQI----------VWGQVGGLSFDVSDQATLN 267 (310) Q Consensus 219 ~~~l~G~pv~~t~~~~~~~---------------------~~~~~gd~~~~----------~~~~~~~~~v~~~~~~~~~ 267 (310) ...+.|+||+.++++|... ..-..+|++.. ..+...++..++.++.. T Consensus 232 v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~-- 309 (364) T protein:vir:10 232 VLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKK-- 309 (364) T ss_pred eEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccc-- Confidence 5679999999999998310 00011344332 22222333333332221 Q ss_pred ecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 268 LGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +-...+.+..-+|..+.||+|++.++.++ T Consensus 310 --------------~~~~~ida~~a~G~g~lRPeaa~~i~~~~ 338 (364) T protein:vir:10 310 --------------EKTWYIDTFLAEGAIPDRWEAVAVVTAAD 338 (364) T ss_pred --------------eeeeeeeeehcccCcccCccceEEEEecC Confidence 11222334556899999999999998887 No 142 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.08 E-value=6e-11 Score=76.62 Aligned_cols=267 Identities=15% Similarity=0.122 Sum_probs=154.0 Q ss_pred hccccCCCCceechhhHHHHHH-HHHhhchhhhh---------cceeecCCCceEEEEEcCC-ceeeeecccc-cccccc Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQAQDYFA-EAEKTSIVQRV---------ARKIPMGSTGVKIPHWTGD-VSAAWIGEGD-MKPITK 82 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~~~ii~-~~~~~s~l~~~---------~~~~~~~~~~~~ip~~~~~-~~a~~v~Eg~-~~~~~~ 82 (310) |..+.+.-..+|.||+..++++ ...+.+.+.+- ......++..+++|.+... .++.-+.||+ .++..+ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~k 80 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETGK 80 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccchhh Confidence 4444556667888888877655 44444444332 1122235778999999743 5666778886 688889 Q ss_pred cceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccC------cccccccccccccccc-e Q lcl|NC_021307. 83 GDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTD------SPFDKNLDETTKSVDL-T 155 (310) Q Consensus 83 ~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g------~~~~~~~~~~~~~~~~-~ 155 (310) .+-++-....++.+..+.++++...-+..+....+.+++++...+..++.++.-.. ................ . T Consensus 81 i~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~~~~ 160 (330) T protein:vir:10 81 ITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHVSDQSK 160 (330) T ss_pred cccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhheecccc Confidence 99999999999999999999999888888999999999999999988887664211 1111100111111111 1 Q ss_pred ecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCC Q lcl|NC_021307. 156 PATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVAS 235 (310) Q Consensus 156 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~ 235 (310) .....+. +.+.+...++.+......+++||+..+..|++..--+ +.....++ ..-++++|++|++++.+|. T Consensus 161 ~~a~~s~-~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~----~~~~s~~~----~~i~~~~G~~VivdD~~p~ 231 (330) T protein:vir:10 161 ASTGIDA-GMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLIQ----YIQPTTAT----INIPTYLGYRVIIDDGIAP 231 (330) T ss_pred cccccCH-HHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhhh----hhcccccC----cccccccceEEEEeCCCCC Confidence 1111222 3344677778887788899999999999998743111 11111111 1235789999999999985 Q ss_pred Ccee---EeeecceeeeEEe---ecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEe-- Q lcl|NC_021307. 236 GTTV---GYLGDFSQIVWGQ---VGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLT-- 307 (310) Q Consensus 236 ~~~~---~~~gd~~~~~~~~---~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~-- 307 (310) .... .+|+ ...+.+++ ...+.+|.+|+.. .++..+....++-+. |..+.--. T Consensus 232 ~~~~yt~yl~~-~GAi~~~~~~~~~~v~~EtdRd~~----------------~g~~~l~~r~~~~~h---p~G~s~~~~~ 291 (330) T protein:vir:10 232 TGDIYTSYLFR-TGSIGLNTGNPSGLTTFETSREAA----------------KGNDMIYTRRALVMH---PYGVKWTGAE 291 (330) T ss_pred CCCceeEEEEe-cCceeeecccCCccccccccCCcc----------------ccceEEEEeeEEEee---eeeeeecccc Confidence 4321 1222 11222222 1123455554432 223333333443332 22222111 Q ss_pred ----ecC Q lcl|NC_021307. 308 ----NAA 310 (310) Q Consensus 308 ----~aa 310 (310) +.. T Consensus 292 ~~~~~~s 298 (330) T protein:vir:10 292 VDAGNIT 298 (330) T ss_pred cccCcCC Confidence 111 No 143 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.06 E-value=2.7e-11 Score=78.52 Aligned_cols=287 Identities=12% Similarity=0.019 Sum_probs=157.4 Q ss_pred CccchhhhHHHHHhhccccCCCC--ceechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeeccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQ--GYLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEGDM 77 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g--~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~ 77 (310) ||.-.- ......+...+...+- .+..+++..++.+..+..|.++++.+..+.. +.++.||+. +..++.....|.+ T Consensus 1 ~a~~~~-~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~i-G~~~~~~~~~g~~ 78 (347) T protein:vir:88 1 MANATG-GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGEN 78 (347) T ss_pred CCCccc-chhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeee-cceeeeeeccccC Confidence 553221 1111112222211111 2345788899999999999999998886654 667888875 4556677777777 Q ss_pred ccc--cccceeeeEeeeeee-EeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCc----cc----ccccc Q lcl|NC_021307. 78 KPI--TKGDMSVQQVEPHKI-ATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDS----PF----DKNLD 146 (310) Q Consensus 78 ~~~--~~~~~~~i~l~~~k~-~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~----~~----~~~~~ 146 (310) +.. .++..+++++...++ .....|.+-=.-++..++.+.+.++++.++++..|+.++.--.. .. ...+. T Consensus 79 l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g~ 158 (347) T protein:vir:88 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) T ss_pred CCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCCc Confidence 654 357788888888664 44445554444455678999999999999999999988632110 00 00011 Q ss_pred cccccccceecc--------cchHHHHHHHHHHHhhhhcC--CCCEEEEehHHHHHHHHhhhcc-Ccccccccccccccc Q lcl|NC_021307. 147 ETTKSVDLTPAT--------GTTYDAIGVNALSLLVNAGK--KWGATLLDDVAEPILNGAKDAN-GRPLFVESTYEAVTT 215 (310) Q Consensus 147 ~~~~~~~~~~~~--------~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~~~~~l~~l~d~~-g~~~~~~~~~~~~~~ 215 (310) ..........+. .....+.+.++...+.+.+. ..-.++++|..+..|.+....+ ..+... +... T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~-----~~~~ 233 (347) T protein:vir:88 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL-----IDPE 233 (347) T ss_pred cccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccc-----cchh Confidence 111111111110 11112223344445554443 3456888999998886533222 222111 1122 Q ss_pred ccCCceeeeeeEEEeCCCCCCc---eeE--------------------eeecceeee--EE--------eecccEEEEee Q lcl|NC_021307. 216 PYREGRILGRPTILSDHVASGT---TVG--------------------YLGDFSQIV--WG--------QVGGLSFDVSD 262 (310) Q Consensus 216 ~~~~~~l~G~pv~~t~~~~~~~---~~~--------------------~~gd~~~~~--~~--------~~~~~~v~~~~ 262 (310) .+.-+.+.|++|+.++++|.+. ... +.+|+++.. +. ...++.++..+ T Consensus 234 ~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r 313 (347) T protein:vir:88 234 TGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) T ss_pred cceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeee Confidence 2344679999999999998421 100 112333311 11 11222333333 Q ss_pred cceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeec--C Q lcl|NC_021307. 263 QATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNA--A 310 (310) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~a--a 310 (310) +.. .+ ...+++..-+|..+.||++.+.|+.. | T Consensus 314 ~~~--------------~~--~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 314 RPE--------------FQ--ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred chh--------------hH--HHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 221 11 23577888999999999987555444 4 No 144 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.03 E-value=8.5e-12 Score=81.29 Aligned_cols=283 Identities=12% Similarity=0.026 Sum_probs=152.0 Q ss_pred Ccc--chhhhHHHHHhhccccCCCC----ceechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeec Q lcl|NC_021307. 1 MAA--GTAFPVNHTQIAQTGDSMFQ----GYLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIG 73 (310) Q Consensus 1 ~aa--~~~~~~~~~~~~~~~~~~~g----~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~ 73 (310) ||- +..+.. ..|.+... .+..+++..+++......+.++++.+..++. +.+++||+. +..++.... T Consensus 1 m~~~~~~~~~t------~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~i-G~~tv~~~t 73 (347) T protein:vir:94 1 MANVPGQKIGT------DQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVM-GRTSGVYLA 73 (347) T ss_pred CCCCCcccccc------ccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecc-cceeeeeec Confidence 332 222211 11111111 1334778888999888889999998887765 667889887 667778888 Q ss_pred cccccccc--ccceeeeEeeeeee-EeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHccc----C-cc-c--c Q lcl|NC_021307. 74 EGDMKPIT--KGDMSVQQVEPHKI-ATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGT----D-SP-F--D 142 (310) Q Consensus 74 Eg~~~~~~--~~~~~~i~l~~~k~-~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~----g-~~-~--~ 142 (310) .|+.++.+ ..+-.+.+++..++ .....|-+-=--++..++.+.+.++++.++++..|+.++.=. . ++ . . T Consensus 74 ~G~~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~ 153 (347) T protein:vir:94 74 PGERLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNEN 153 (347) T ss_pred CCCCcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc Confidence 88887554 34455655665443 222233222122456789999999999999999999886311 1 01 0 0 Q ss_pred cccccccccccceeccc--------chHHHHHHHHHHHhhhhcC--CCCEEEEehHHHHHHHHhhhccCccccccccccc Q lcl|NC_021307. 143 KNLDETTKSVDLTPATG--------TTYDAIGVNALSLLVNAGK--KWGATLLDDVAEPILNGAKDANGRPLFVESTYEA 212 (310) Q Consensus 143 ~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~ 212 (310) ..+.+............ ....+.+.++...|...+. ..-..+++|..+..|..-+.-+... +. ..+ T Consensus 154 ~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~-~~---~~~ 229 (347) T protein:vir:94 154 IAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAAN-YA---ALI 229 (347) T ss_pred cCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhh-cc---ccc Confidence 11111111111111110 1111222334444554443 2447888999998775433222111 11 111 Q ss_pred cccccCCceeeeeeEEEeCCCCCCce---------eE---------------eeecceeee--EEeec--------ccEE Q lcl|NC_021307. 213 VTTPYREGRILGRPTILSDHVASGTT---------VG---------------YLGDFSQIV--WGQVG--------GLSF 258 (310) Q Consensus 213 ~~~~~~~~~l~G~pv~~t~~~~~~~~---------~~---------------~~gd~~~~~--~~~~~--------~~~v 258 (310) ....+.-+++.|++|+.++++|.+.. .+ +-+||++.. ++.+. ++++ T Consensus 230 ~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~ 309 (347) T protein:vir:94 230 DPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLAL 309 (347) T ss_pred cccccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccc Confidence 22223457899999999999984210 00 112222211 11111 1222 Q ss_pred EEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 259 DVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) |..++. .. + .-.+++..-+|.++.||++.+.|+.++ T Consensus 310 e~~r~~-------------~~-~--~d~i~~~~~~G~~~~rP~~a~~~~~~~ 345 (347) T protein:vir:94 310 ERDRDV-------------DA-Q--GDLIVGKYAMGHGGLRPEAAGALVFSP 345 (347) T ss_pred cchhch-------------hh-H--HHHhhhhhhhcCcccccceeEEEEecC Confidence 222221 11 1 236778889999999999999888877 No 145 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=99.02 E-value=2.9e-11 Score=78.33 Aligned_cols=271 Identities=13% Similarity=0.061 Sum_probs=146.5 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHH---HHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeecccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQ---DYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEGD 76 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~---~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~ 76 (310) ||. +. .+....+.+++.++ .+=+.+.+-..++..-+.+||. +.++++|.+.....+.-|+||+ T Consensus 1 mAe-~n------------lt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe 67 (295) T protein:vir:99 1 MAE-KN------------LNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGE 67 (295) T ss_pred CCC-cc------------cccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCc Confidence 221 11 11112233343332 2322223333345555788887 5669999999888999999999 Q ss_pred ccccccccee---eeEeeeeeeEeeehhhHHHhh-cChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccc Q lcl|NC_021307. 77 MKPITKGDMS---VQQVEPHKIATIFVASAETVR-ANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSV 152 (310) Q Consensus 77 ~~~~~~~~~~---~i~l~~~k~~~~~~is~ell~-~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~ 152 (310) .||-++.+.+ ..+++.+|++..+ |.|.+. ....+-...--++|..+++.++|+.|+.-..++... T Consensus 68 ~Iplskvt~~~~~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t--------- 136 (295) T protein:vir:99 68 TIPLSKVTRTKDKDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTK--------- 136 (295) T ss_pred ccchhhheeeeeeeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCcee--------- Confidence 9999999875 5888888888754 999985 444567788899999999999999999755432211 Q ss_pred cceecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeee-EEEeC Q lcl|NC_021307. 153 DLTPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRP-TILSD 231 (310) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~p-v~~t~ 231 (310) .....-........+....+.+.+....+.++||.+...|++-..-+ ++....-|..+. -.++|.- ++.+. T Consensus 137 -~tg~~lq~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~----~~~a~~fG~~~L---~nfLG~q~II~S~ 208 (295) T protein:vir:99 137 -VKGVGLQKALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVG----ADASNVFGMTLL---KNFLGMQNVIVMP 208 (295) T ss_pred -eehhhHHHHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccc----cchhhhhhhhhh---hhhhccceEEEcc Confidence 00000001112222333444445556779999999999987532111 111111111111 1388986 99999 Q ss_pred CCCCCceeEeeecceeeeEEeecc--cE---EEEeecceeeecccccccchhhhhcCcEEEEEEEEecc--EEeccCceE Q lcl|NC_021307. 232 HVASGTTVGYLGDFSQIVWGQVGG--LS---FDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGL--LINDVEAFV 304 (310) Q Consensus 232 ~~~~~~~~~~~gd~~~~~~~~~~~--~~---v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~--~v~~~~a~~ 304 (310) .+|.|+......|==.+++....+ +. ....++.++-...-.. ..+...+......++ -..+.++++ T Consensus 209 kv~~G~~~aT~~~Ni~~ay~~~~~g~l~~~f~~~~D~tglIg~~h~~-------~~~~~t~et~~~~~~~lfpE~~dgiv 281 (295) T protein:vir:99 209 SVPEGKIYSTAVENLVFASLNVKGGDLGGLFADFTDETGLIAAARNR-------QLSNLTYESVFFGANVLFAEIPEGVV 281 (295) T ss_pred cCCCceEEEeeccceEEEEecCCchhhhhhhhhccCcccceEEEecc-------ccceeeehhhhHhHHHhcccccceEE Confidence 999998655433221222222221 11 1111111111000000 000011111111111 234567888 Q ss_pred EEeecC Q lcl|NC_021307. 305 KLTNAA 310 (310) Q Consensus 305 ~l~~aa 310 (310) +.+..+ T Consensus 282 ~~tI~~ 287 (295) T protein:vir:99 282 EATIEA 287 (295) T ss_pred EEEEec Confidence 888877 No 146 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.02 E-value=8.1e-11 Score=75.90 Aligned_cols=285 Identities=11% Similarity=0.002 Sum_probs=155.3 Q ss_pred Ccc---chhhhHHHHHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeeccc Q lcl|NC_021307. 1 MAA---GTAFPVNHTQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEG 75 (310) Q Consensus 1 ~aa---~~~~~~~~~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg 75 (310) ||. |.++.. .....+...... +..+++..++.+..+..|+++++.+..... +.++.||+. +..++.....| T Consensus 1 ~~~~~~~~~~~t---~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~i-G~~t~~~~~~g 76 (347) T protein:vir:33 1 MANIQGGQQIGT---NQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVI-GRTKAAYLKPG 76 (347) T ss_pred CCCCccCccccc---ccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeec-cceeeeeecCC Confidence 442 222221 111111111111 234788899999999999999998876644 667888886 45666777778 Q ss_pred ccccc--cccceeeeEeeeeee-EeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHc-----ccCccc------ Q lcl|NC_021307. 76 DMKPI--TKGDMSVQQVEPHKI-ATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALH-----GTDSPF------ 141 (310) Q Consensus 76 ~~~~~--~~~~~~~i~l~~~k~-~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~-----G~g~~~------ 141 (310) .+++. .++..++.+++..+. .....|.+-=--++..++.+.+.++.+.++++..|+.++. +..... T Consensus 77 ~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~ 156 (347) T protein:vir:33 77 ENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIE 156 (347) T ss_pred CCCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 87754 346667766765443 2223333332334567899999999999999999999872 111110 Q ss_pred ccccccccccccceec-------ccchHHHHHHHHHHHhhhhcC--CCCEEEEehHHHHHHHHhhhcc-Ccccccccccc Q lcl|NC_021307. 142 DKNLDETTKSVDLTPA-------TGTTYDAIGVNALSLLVNAGK--KWGATLLDDVAEPILNGAKDAN-GRPLFVESTYE 211 (310) Q Consensus 142 ~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~~~~~l~~l~d~~-g~~~~~~~~~~ 211 (310) ..+..........++. ......+.+.++...|...+. ..-..+++|..+..|.+-..-. ..+ ... T Consensus 157 ~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~-----~~~ 231 (347) T protein:vir:33 157 GLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANY-----QAL 231 (347) T ss_pred cccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccccccccc-----ccc Confidence 0011101110010010 011122333345555655444 3456888999998886532221 111 111 Q ss_pred ccccccCCceeeeeeEEEeCCCCCCcee-----E---------------eeecceee--eE--------EeecccEEEEe Q lcl|NC_021307. 212 AVTTPYREGRILGRPTILSDHVASGTTV-----G---------------YLGDFSQI--VW--------GQVGGLSFDVS 261 (310) Q Consensus 212 ~~~~~~~~~~l~G~pv~~t~~~~~~~~~-----~---------------~~gd~~~~--~~--------~~~~~~~v~~~ 261 (310) +....+.-+++.|++|+.++++|.+... . +-++|+.. ++ ....++.++.. T Consensus 232 ~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~ 311 (347) T protein:vir:33 232 LDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERA 311 (347) T ss_pred cccccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeec Confidence 2233344567999999999999864210 0 11112111 11 11222334433 Q ss_pred ecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 262 DQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 262 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++.. +-.-.+++...+|.++.||++.+.|+.+= T Consensus 312 r~~~----------------~~~d~i~~~~~~G~~vlrP~~av~i~~~~ 344 (347) T protein:vir:33 312 RRAN----------------YQADQIIAKYAMGHGGLRPEAAGAIVLPK 344 (347) T ss_pred cchh----------------hhhHhhhhhhhcCCceecccceEEEecCC Confidence 3221 11234566778899999999988887666 No 147 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.00 E-value=1.5e-10 Score=74.44 Aligned_cols=288 Identities=11% Similarity=-0.015 Sum_probs=152.6 Q ss_pred CccchhhhHHH-HHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeeccccc Q lcl|NC_021307. 1 MAAGTAFPVNH-TQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEGDM 77 (310) Q Consensus 1 ~aa~~~~~~~~-~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~ 77 (310) ||.=.- .... ......+....-. +..+.+..++.+..+..|.++.+.+..... +.++.||+. +..++.....|.+ T Consensus 1 ma~~~~-~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~i-g~~t~~~~~~g~~ 78 (347) T protein:vir:15 1 MANIQG-GQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVI-GRTKAAYLKPGEN 78 (347) T ss_pred CCcccc-CCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeec-cceeeeeeccCCC Confidence 332111 1100 0011111101001 234677788899999999999998876654 667888886 4566777778887 Q ss_pred ccc--cccceeeeEeeeeee-EeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHccc--------Cccccc--- Q lcl|NC_021307. 78 KPI--TKGDMSVQQVEPHKI-ATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGT--------DSPFDK--- 143 (310) Q Consensus 78 ~~~--~~~~~~~i~l~~~k~-~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~--------g~~~~~--- 143 (310) ++. ..++.++.++...+. +....|.+-=--++..++.+.+.++.+.++++..|+.++.-- .+..+. T Consensus 79 l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~ 158 (347) T protein:vir:15 79 LDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIEGL 158 (347) T ss_pred CCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Confidence 754 446677777766443 223333332223566789999999999999999999887311 010110 Q ss_pred ccccccccccceeccc-------chHHHHHHHHHHHhhhhcC--CCCEEEEehHHHHHHHHhhhccCccccccccccccc Q lcl|NC_021307. 144 NLDETTKSVDLTPATG-------TTYDAIGVNALSLLVNAGK--KWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVT 214 (310) Q Consensus 144 ~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~ 214 (310) +...........+... ....+.+.++...|...+. ..-..+++|..+..|.+-.+... ......+.. T Consensus 159 g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~----~d~~~~~~~ 234 (347) T protein:vir:15 159 GKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNA----ANYQALIDH 234 (347) T ss_pred CccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhccccccc----ccccccccc Confidence 0011111111111110 1112222234444544443 33356669999999865432221 111111222 Q ss_pred cccCCceeeeeeEEEeCCCCCCce------------eEee--------ecc----------eeeeEEeecccEEEEeecc Q lcl|NC_021307. 215 TPYREGRILGRPTILSDHVASGTT------------VGYL--------GDF----------SQIVWGQVGGLSFDVSDQA 264 (310) Q Consensus 215 ~~~~~~~l~G~pv~~t~~~~~~~~------------~~~~--------gd~----------~~~~~~~~~~~~v~~~~~~ 264 (310) ..+.-+.+.|++|+.++++|.+.. ..+- ++| +.+-.+...+++++..++. T Consensus 235 ~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~ 314 (347) T protein:vir:15 235 ERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRA 314 (347) T ss_pred cceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccc Confidence 334456799999999999984321 0000 111 1111222233344444332 Q ss_pred eeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 265 TLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) . +-.-.+++...+|.++.||++.+.|+.+= T Consensus 315 ~----------------~~~d~i~~~~~~G~~vlrP~~av~~~~~~ 344 (347) T protein:vir:15 315 N----------------YQADQIIAKYAMGHGGLRPEAAGAIVLPK 344 (347) T ss_pred h----------------hhhhhhehhhhcCCceeccccEEEEecCC Confidence 1 11244666778899999999988876665 No 148 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.98 E-value=1.3e-10 Score=74.77 Aligned_cols=267 Identities=12% Similarity=0.054 Sum_probs=148.9 Q ss_pred hccccCCCCceechhhHHHHHH-HHHhhchhhh---------hcceeecCCCceEEEEEcCC-ceeeeeccccccccccc Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQAQDYFA-EAEKTSIVQR---------VARKIPMGSTGVKIPHWTGD-VSAAWIGEGDMKPITKG 83 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~~~ii~-~~~~~s~l~~---------~~~~~~~~~~~~~ip~~~~~-~~a~~v~Eg~~~~~~~~ 83 (310) |. ++--+.+|.||+..++++ ...+.+.+++ +.....-++..+++|.+..- .++.-+.|+..++..+. T Consensus 1 MA--~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~ki 78 (351) T protein:vir:15 1 MA--ETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNL 78 (351) T ss_pred CC--ceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchhee Confidence 32 233456788888877654 4444454433 21222235777999999753 57777899999999999 Q ss_pred ceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHccc------Ccccccccccccccccceec Q lcl|NC_021307. 84 DMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGT------DSPFDKNLDETTKSVDLTPA 157 (310) Q Consensus 84 ~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~------g~~~~~~~~~~~~~~~~~~~ 157 (310) +-++-....++.+..+.++++...-+..+....+.+++++...+..++.+|.-. .+.........+.. .... T Consensus 79 tt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~t~~--~~~~ 156 (351) T protein:vir:15 79 TSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKVYDQTKV--SPSE 156 (351) T ss_pred cccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccceeccccc--cccc Confidence 988888888999988999998888788899999999999999999999877532 11111111111111 0111 Q ss_pred ccchHHHHHHHHHHHhhhhc-CCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCC Q lcl|NC_021307. 158 TGTTYDAIGVNALSLLVNAG-KKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASG 236 (310) Q Consensus 158 ~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~ 236 (310) ...+. +.+.+...++.+.. ..-.+|+||+..+..|+++.--+- .... .+ ...-+++.|++|++++.||.. T Consensus 157 ~~is~-~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~~----~~~s-~~---~~~i~t~~G~~VivdD~~p~~ 227 (351) T protein:vir:15 157 PMFGA-KGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIET----IQPQ-NG---ATPFEAYNGLRIVLDDDIEID 227 (351) T ss_pred cccCH-HHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhhh----cccc-cc---CcccceecceEEEEcCCCccc Confidence 11222 33456777776643 346899999999999986542111 1111 11 112467999999999999853 Q ss_pred c--------eeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEe-----ccEEe----- Q lcl|NC_021307. 237 T--------TVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEY-----GLLIN----- 298 (310) Q Consensus 237 ~--------~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~-----d~~v~----- 298 (310) . ...+|+. ..+.++. ++..+++.|+..... ++-.+....++ |+... T Consensus 228 ~~~~~~~~ytsyl~~~-GAi~~~~-~~~~ve~~rd~~~~~--------------g~d~l~~r~~~~~hp~G~s~~~~~~~ 291 (351) T protein:vir:15 228 LTDKTKPVSTSYIFAP-GAVRYST-NMRSTETKYDPLING--------------GQDVIVQKRVGTIHVAGTSIKASFSP 291 (351) T ss_pred cCCCCCceeEEEEEec-ceeeeec-CCcCcceeecccCCC--------------CceEEEEeeeeeeeeeeeeecccccc Confidence 1 1112221 1122232 233455555544221 11111111111 11111 Q ss_pred -----------------------ccCc--eEEEeecC Q lcl|NC_021307. 299 -----------------------DVEA--FVKLTNAA 310 (310) Q Consensus 299 -----------------------~~~a--~~~l~~aa 310 (310) ++++ +++++-+. T Consensus 292 ~~~~sPt~~~L~~~~NW~~v~~~d~k~I~iv~~~~~~ 328 (351) T protein:vir:15 292 SKASFPTIDELAKSSTWEVVDGIDVRSIGVVAYTAQL 328 (351) T ss_pred cCcCCcChHHhcCCcccccccCCCccccceEEEEEec Confidence 1111 01111110 No 149 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.93 E-value=9.5e-11 Score=75.53 Aligned_cols=287 Identities=14% Similarity=0.019 Sum_probs=158.6 Q ss_pred hccc--cCCCCcee-chhhHHHHHHHHHhhchhhhhcceeec-CCCceEEEEEcCCceeeeecccccccccccceeeeEe Q lcl|NC_021307. 15 AQTG--DSMFQGYL-EPEQAQDYFAEAEKTSIVQRVARKIPM-GSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQV 90 (310) Q Consensus 15 ~~~~--~~~~g~~i-~~~~~~~ii~~~~~~s~l~~~~~~~~~-~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l 90 (310) |.+| ++....++ |++|+..+..-+.+......+.++... .+.++.||.. +.+...-..+++.+.-..++-.++++ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsI-g~~tV~dY~~~~~i~~d~ltt~~~~l 79 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSV-GTPVVRSRPEQGDFTFDNLDTGEISI 79 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccc-cccccccccCCCCcccccCCCceEEE Confidence 4444 23334456 667889999888888877776665443 3677888876 45666666677777655566665555 Q ss_pred ee--eeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcc--cCc------ccccccccccccccceecccc Q lcl|NC_021307. 91 EP--HKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHG--TDS------PFDKNLDETTKSVDLTPATGT 160 (310) Q Consensus 91 ~~--~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G--~g~------~~~~~~~~~~~~~~~~~~~~~ 160 (310) .. .|+.++ .++++. .+...++.+...++.+.+++...|+.+..= +|. +.+..+.+.......+..... T Consensus 80 ~IDq~KYfaf-~VdDD~-~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt~~~ 157 (322) T protein:vir:31 80 ILRDEVYAGN-AISKKL-RQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTGTDQT 157 (322) T ss_pred EEehhhhhcc-ccchhH-HHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccCCCch Confidence 55 445444 477755 466789999999999999999999877431 111 111111111111111222333 Q ss_pred hHHHHHHHHHHHhhhhcCC-CCEE-EEehHHHHHHHHh-----hhccCcccccccccccc-ccccCCceeeeeeEEEeCC Q lcl|NC_021307. 161 TYDAIGVNALSLLVNAGKK-WGAT-LLDDVAEPILNGA-----KDANGRPLFVESTYEAV-TTPYREGRILGRPTILSDH 232 (310) Q Consensus 161 ~~~~~~~~~~~~l~~~~~~-~~~~-~~~~~~~~~l~~l-----~d~~g~~~~~~~~~~~~-~~~~~~~~l~G~pv~~t~~ 232 (310) ...+.+.++..+|.+.... ...| |++|..+..|..+ --.++|..... .+|. ......+++.|..|++|+. T Consensus 158 ~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~--~sG~a~g~~~Vg~~~GF~V~~SN~ 235 (322) T protein:vir:31 158 MDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIV--ESGIAPDMQFVRSVYGIDLFVSNL 235 (322) T ss_pred hhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccc--cccchhhHHHHHHHhceeeeeecc Confidence 4455566777777766554 3454 5578887766332 11233322111 1111 1112357899999999999 Q ss_pred CCCCceeEeeeccee-eeEEeecccEEEEeeccee-------eecccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|NC_021307. 233 VASGTTVGYLGDFSQ-IVWGQVGGLSFDVSDQATL-------NLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFV 304 (310) Q Consensus 233 ~~~~~~~~~~gd~~~-~~~~~~~~~~v~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~ 304 (310) ++.++..++-|.-.. ...|...+. ..+++.... ++..-+. . ..-.+.--.+|..+|+|..+.|++.++ T Consensus 236 l~~~~~~i~aG~d~~~t~ag~~n~f-~~~~~~~~~~~~~~~~~l~~~e~--~-r~~~~~~d~~~~~~~~g~g~~r~e~l~ 311 (322) T protein:vir:31 236 LADANETINAGGDARSTTAGKCNMF-MNVSDMGLLPFVVAWKEMPTTKS--F-IDDYNDDLNTATTARWGNGLVRDENLV 311 (322) T ss_pred ccccccccccCcccccccceeeccc-ccccchhhhhhhhHhhhhhhhhc--c-cCccccccceeeeeeecceeecccceE Confidence 976553333321111 111111111 111111110 0000000 0 001233456889999999999999998 Q ss_pred EEeecC Q lcl|NC_021307. 305 KLTNAA 310 (310) Q Consensus 305 ~l~~aa 310 (310) .|.--| T Consensus 312 ~~~a~~ 317 (322) T protein:vir:31 312 CVLANA 317 (322) T ss_pred EEEecc Confidence 886666 No 150 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=98.92 E-value=3.3e-10 Score=72.57 Aligned_cols=230 Identities=10% Similarity=0.009 Sum_probs=145.4 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG-VKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) |+--..-.......+.-..+. ..+...|+|.+.+.++|++.++++...... ....+.++-|++.|..=++.++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~------~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~ 74 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPD------GKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQ 74 (331) T ss_pred CCccccCcccHHHHHHhcCcc------hhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccC Confidence 332111111111111111100 123456999999999999999998754333 4556778889999999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccc---------- Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANP--GNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDE---------- 147 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~--~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~---------- 147 (310) +++.++.+++-..+-+++.+.|.+.+.+... .++...-.+...+++..++...||+|+.+..|....+ T Consensus 75 ~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a 154 (331) T protein:vir:10 75 PEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSA 154 (331) T ss_pred cccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccc Confidence 9999999999999999999999999988643 2334446666888999999999999975533311100 Q ss_pred ------------------------------------ccccc--------------------------------------- Q lcl|NC_021307. 148 ------------------------------------TTKSV--------------------------------------- 152 (310) Q Consensus 148 ------------------------------------~~~~~--------------------------------------- 152 (310) ..++. T Consensus 155 ~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v 234 (331) T protein:vir:10 155 ENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYV 234 (331) T ss_pred ccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccE Confidence 00000 Q ss_pred ----cc-------eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCc--cccccccccccccccCC Q lcl|NC_021307. 153 ----DL-------TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGR--PLFVESTYEAVTTPYRE 219 (310) Q Consensus 153 ----~~-------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~--~~~~~~~~~~~~~~~~~ 219 (310) .+ .++.+.+..+....+...+........+|+||++....|+++.-..++ .+..... ..... T Consensus 235 ~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~-----~g~~~ 309 (331) T protein:vir:10 235 VRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEI-----AGKKV 309 (331) T ss_pred EEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeec-----CCcce Confidence 00 000001112223334444555567778999999999999886433322 2332222 22334 Q ss_pred ceeeeeeEEEeCCCCCCceeEe Q lcl|NC_021307. 220 GRILGRPTILSDHVASGTTVGY 241 (310) Q Consensus 220 ~~l~G~pv~~t~~~~~~~~~~~ 241 (310) -.+.|+||..++.+-.++..++ T Consensus 310 t~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 310 VAFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred eEECCeeEEEeeeeecCccccC Confidence 5699999999999877765554 No 151 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=98.92 E-value=3.3e-10 Score=72.57 Aligned_cols=230 Identities=10% Similarity=0.009 Sum_probs=145.4 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG-VKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) |+--..-.......+.-..+. ..+...|+|.+.+.++|++.++++...... ....+.++-|++.|..=++.++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~------~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~ 74 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPD------GKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQ 74 (331) T ss_pred CCccccCcccHHHHHHhcCcc------hhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccC Confidence 332111111111111111100 123456999999999999999998754333 4556778889999999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccc---------- Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANP--GNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDE---------- 147 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~--~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~---------- 147 (310) +++.++.+++-..+-+++.+.|.+.+.+... .++...-.+...+++..++...||+|+.+..|....+ T Consensus 75 ~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a 154 (331) T protein:vir:98 75 PEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSA 154 (331) T ss_pred cccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccc Confidence 9999999999999999999999999988643 2334446666888999999999999975533311100 Q ss_pred ------------------------------------ccccc--------------------------------------- Q lcl|NC_021307. 148 ------------------------------------TTKSV--------------------------------------- 152 (310) Q Consensus 148 ------------------------------------~~~~~--------------------------------------- 152 (310) ..++. T Consensus 155 ~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v 234 (331) T protein:vir:98 155 ENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYV 234 (331) T ss_pred ccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccE Confidence 00000 Q ss_pred ----cc-------eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCc--cccccccccccccccCC Q lcl|NC_021307. 153 ----DL-------TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGR--PLFVESTYEAVTTPYRE 219 (310) Q Consensus 153 ----~~-------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~--~~~~~~~~~~~~~~~~~ 219 (310) .+ .++.+.+..+....+...+........+|+||++....|+++.-..++ .+..... ..... T Consensus 235 ~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~-----~g~~~ 309 (331) T protein:vir:98 235 VRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEI-----AGKKV 309 (331) T ss_pred EEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeec-----CCcce Confidence 00 000001112223334444555567778999999999999886433322 2332222 22334 Q ss_pred ceeeeeeEEEeCCCCCCceeEe Q lcl|NC_021307. 220 GRILGRPTILSDHVASGTTVGY 241 (310) Q Consensus 220 ~~l~G~pv~~t~~~~~~~~~~~ 241 (310) -.+.|+||..++.+-.++..++ T Consensus 310 t~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:98 310 VAFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred eEECCeeEEEeeeeecCccccC Confidence 5699999999999877765554 No 152 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=98.92 E-value=3.3e-10 Score=72.57 Aligned_cols=230 Identities=10% Similarity=0.009 Sum_probs=145.4 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG-VKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) |+--..-.......+.-..+. ..+...|+|.+.+.++|++.++++...... ....+.++-|++.|..=++.++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~------~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~ 74 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPD------GKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQ 74 (331) T ss_pred CCccccCcccHHHHHHhcCcc------hhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccC Confidence 332111111111111111100 123456999999999999999998754333 4556778889999999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccc---------- Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANP--GNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDE---------- 147 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~--~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~---------- 147 (310) +++.++.+++-..+-+++.+.|.+.+.+... .++...-.+...+++..++...||+|+.+..|....+ T Consensus 75 ~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a 154 (331) T protein:vir:10 75 PEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSA 154 (331) T ss_pred cccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccc Confidence 9999999999999999999999999988643 2334446666888999999999999975533311100 Q ss_pred ------------------------------------ccccc--------------------------------------- Q lcl|NC_021307. 148 ------------------------------------TTKSV--------------------------------------- 152 (310) Q Consensus 148 ------------------------------------~~~~~--------------------------------------- 152 (310) ..++. T Consensus 155 ~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v 234 (331) T protein:vir:10 155 ENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYV 234 (331) T ss_pred ccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccE Confidence 00000 Q ss_pred ----cc-------eecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCc--cccccccccccccccCC Q lcl|NC_021307. 153 ----DL-------TPATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGR--PLFVESTYEAVTTPYRE 219 (310) Q Consensus 153 ----~~-------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~--~~~~~~~~~~~~~~~~~ 219 (310) .+ .++.+.+..+....+...+........+|+||++....|+++.-..++ .+..... ..... T Consensus 235 ~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~-----~g~~~ 309 (331) T protein:vir:10 235 VRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEI-----AGKKV 309 (331) T ss_pred EEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeec-----CCcce Confidence 00 000001112223334444555567778999999999999886433322 2332222 22334 Q ss_pred ceeeeeeEEEeCCCCCCceeEe Q lcl|NC_021307. 220 GRILGRPTILSDHVASGTTVGY 241 (310) Q Consensus 220 ~~l~G~pv~~t~~~~~~~~~~~ 241 (310) -.+.|+||..++.+-.++..++ T Consensus 310 t~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 310 VAFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred eEECCeeEEEeeeeecCccccC Confidence 5699999999999877765554 No 153 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.84 E-value=4.6e-10 Score=71.75 Aligned_cols=241 Identities=11% Similarity=0.018 Sum_probs=129.2 Q ss_pred hcceeecCCCceEEEEEcCCceeeeecccccccc--cccceeeeEeeeee-eEeeehhhHHHhhcChhHHHHHHHHHHHH Q lcl|NC_021307. 47 VARKIPMGSTGVKIPHWTGDVSAAWIGEGDMKPI--TKGDMSVQQVEPHK-IATIFVASAETVRANPGNYLGTMRTKVAT 123 (310) Q Consensus 47 ~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~--~~~~~~~i~l~~~k-~~~~~~is~ell~~s~~~~~~~v~~~l~~ 123 (310) +.+.+. ++.+++||+. +..+++...-|+++.. .++.-++.+|...+ +.....|-+-=--++..++.+...++++. T Consensus 1 ~vr~i~-~g~s~~~~~i-G~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~~G~ 78 (324) T protein:vir:99 1 MTRTIT-SGKSAQFPVM-GRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGE 78 (324) T ss_pred Ceeeee-cCceEEEeee-eeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHHHHHH Confidence 555444 4777999987 6677788887887743 44555665555433 22222333322234667899999999999 Q ss_pred HHHHHHHHHHHcc----c--C---cccccccccccccccc--eecc----cchHHHHHHHHHHHhhhhcCC--CCEEEEe Q lcl|NC_021307. 124 AIALAFDEAALHG----T--D---SPFDKNLDETTKSVDL--TPAT----GTTYDAIGVNALSLLVNAGKK--WGATLLD 186 (310) Q Consensus 124 a~~~~~d~~~l~G----~--g---~~~~~~~~~~~~~~~~--~~~~----~~~~~~~~~~~~~~l~~~~~~--~~~~~~~ 186 (310) ++++..|+.++.- . . ...+....+.+..... .... .....+.+.++...|...+.. .-..+++ T Consensus 79 aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv~ 158 (324) T protein:vir:99 79 ALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFYTD 158 (324) T ss_pred HHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeC Confidence 9999999887521 1 1 1111111111111000 1111 111122233445555544432 4468889 Q ss_pred hHHHHHHHHhhhcc-CccccccccccccccccCCceeeeeeEEEeCCCCCCcee-----------------------Eee Q lcl|NC_021307. 187 DVAEPILNGAKDAN-GRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTV-----------------------GYL 242 (310) Q Consensus 187 ~~~~~~l~~l~d~~-g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~-----------------------~~~ 242 (310) |..+..|..-+.-+ +.+. ..+....+.-+.+.|++|+.++++|.+... -+- T Consensus 159 P~~y~~Ll~~~~~~~~~~~-----~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~ 233 (324) T protein:vir:99 159 PDTYSAILAALMPNAANYA-----ALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMT 233 (324) T ss_pred hHHHHHHhhcccccccccc-----cccceecceEEEEeceEEEecCCccccccccccccccccccccccccccccccccc Confidence 99998775432221 2221 222333344567899999999999853110 022 Q ss_pred ecceee--eE--------EeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 243 GDFSQI--VW--------GQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 243 gd~~~~--~~--------~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +|++.. ++ +...+++.+..++.. +-...+++..-+|..+.||++.+.++..+ T Consensus 234 ~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~----------------~~~d~i~~~~a~G~~~lRPe~a~~v~l~~ 295 (324) T protein:vir:99 234 VGADNVVGLFVHRSAVATLKLKDMALERARRPE----------------YQADQIIAKYAMGHGGLRPEAVGAIIFED 295 (324) T ss_pred cccCceeEEEEehhheEEEeeecceecceechh----------------hHHHhhhhhhhhcCcccccceEEEEEEcc Confidence 333322 11 111222233322211 11244566677899999999998888777 No 154 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.84 E-value=5.6e-10 Score=71.32 Aligned_cols=229 Identities=12% Similarity=0.007 Sum_probs=145.0 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG-VKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) |+--..-.......+.-.+ +......|+|.+.+.++|++.+++......+ .+..+.++-|++.|..=++.++ T Consensus 1 m~~~~~~a~TL~e~AKr~~-------~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR~lN~g~~ 73 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLD-------PNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVL 73 (330) T ss_pred CCcCCCCcccHHHHHhhcC-------cchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhhhcCCccc Confidence 3322111122222222111 1224567999999999999999887542222 2334556778999999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccc--------- Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANP--GNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDET--------- 148 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~--~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~--------- 148 (310) +++.++.+++-..+-+++...|-+.+.+.+. .++...-.+...+++.+++.+.||+|+.+..|....+. T Consensus 74 ~s~~tt~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta 153 (330) T protein:vir:10 74 PNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSA 153 (330) T ss_pred cccceEEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCC Confidence 9999999999999999999999999987543 23445566778899999999999999766444211100 Q ss_pred -------------------------------------ccccc--------c----------------------------- Q lcl|NC_021307. 149 -------------------------------------TKSVD--------L----------------------------- 154 (310) Q Consensus 149 -------------------------------------~~~~~--------~----------------------------- 154 (310) .++.. . T Consensus 154 ~~~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r 233 (330) T protein:vir:10 154 ENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWR 233 (330) T ss_pred CchhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcc Confidence 00000 0 Q ss_pred ----ee-------cccchHHHHH---HHHHHHhhhhcCCCCEEEEehHHHHHHHHh-hhccCccccccccccccccccCC Q lcl|NC_021307. 155 ----TP-------ATGTTYDAIG---VNALSLLVNAGKKWGATLLDDVAEPILNGA-KDANGRPLFVESTYEAVTTPYRE 219 (310) Q Consensus 155 ----~~-------~~~~~~~~~~---~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l-~d~~g~~~~~~~~~~~~~~~~~~ 219 (310) .. .......+++ .++...+........+|+||++....|+++ .+.....+.... .. .... T Consensus 234 ~vvRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~-~~----g~~~ 308 (330) T protein:vir:10 234 YVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWET-VS----GERV 308 (330) T ss_pred cEEEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeee-cC----Ceee Confidence 00 0000112222 233344455566788999999999999986 444443333322 22 2233 Q ss_pred ceeeeeeEEEeCCCCCCceeEe Q lcl|NC_021307. 220 GRILGRPTILSDHVASGTTVGY 241 (310) Q Consensus 220 ~~l~G~pv~~t~~~~~~~~~~~ 241 (310) -.+.|+||..++.+-.++..++ T Consensus 309 t~~~gipir~~Dail~tE~~vv 330 (330) T protein:vir:10 309 MTFDGIPVQRTDALLNTESRVV 330 (330) T ss_pred EEECCeEEEEEeeeecCccccC Confidence 5689999999999877776554 No 155 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.75 E-value=1.8e-09 Score=68.47 Aligned_cols=270 Identities=11% Similarity=0.124 Sum_probs=147.0 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCC-ce-EEEEEcCCceeeeecccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGST-GV-KIPHWTGDVSAAWIGEGDMK 78 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~-~ip~~~~~~~a~~v~Eg~~~ 78 (310) |--....|.+. .+.++.-+....-++.+.+-+.+.+-.-++...+.+||..+ .+ .+|.+.....+.-|+||+.| T Consensus 1 ~~~~~~~~e~n----lt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaEGe~I 76 (296) T protein:vir:98 1 MVTSRTYPEEN----LIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVI 76 (296) T ss_pred CCCccccCcCC----CcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceeeeeccccccCCccc Confidence 33333333211 11111111111223344443333333335555588998754 46 34667888889999999999 Q ss_pred ccccccee---eeEeeeeeeEeeehhhHHHhh-cChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccc Q lcl|NC_021307. 79 PITKGDMS---VQQVEPHKIATIFVASAETVR-ANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDL 154 (310) Q Consensus 79 ~~~~~~~~---~i~l~~~k~~~~~~is~ell~-~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 154 (310) |-++.+.+ ..+++.+|.+..+ |.|.+. ....+....--++|..+++.++|+.|+.-..++... T Consensus 77 plskvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t----------- 143 (296) T protein:vir:98 77 PLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGT----------- 143 (296) T ss_pred chhhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccce----------- Confidence 99999876 4888888988775 999985 444567788899999999999999999765443211 Q ss_pred eecccchHH----HHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCc-eeeeeeEEE Q lcl|NC_021307. 155 TPATGTTYD----AIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREG-RILGRPTIL 229 (310) Q Consensus 155 ~~~~~~~~~----~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~-~l~G~pv~~ 229 (310) ..+.+.... ..+.++...+++.+....+.++||.+...+++- ++ +. .+...+.... .++|.-++. T Consensus 144 ~~~t~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg~--a~---it-----~qt~fG~tyl~nfLG~~II~ 213 (296) T protein:vir:98 144 QDALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAK--AG---IT-----TQTAFGLTYLVDFTGTVIIS 213 (296) T ss_pred eeechhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhcC--Cc---cc-----hhheechhhhhhccccEEEE Confidence 001111111 122334455666555678999999999887642 21 11 1111111111 278888999 Q ss_pred eCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEE-----------EEecc--E Q lcl|NC_021307. 230 SDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVE-----------AEYGL--L 296 (310) Q Consensus 230 t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~-----------~~~d~--~ 296 (310) +..+|.|+......|==.+++.+.++-++.- .+.+..+ ++|.+++.-. ...++ - T Consensus 214 S~kV~~G~~~~T~~~Ni~~ay~~~~~~~l~~----~f~~~~d---------~tglIGv~h~~~~~~~t~eT~~~~~~~lf 280 (296) T protein:vir:98 214 TNDVTKGEIWATVPENIIFAYINPNNSELAK----EFNLYGD---------PTGYIGMNHFQENTTLTIQTLLVSGMLMY 280 (296) T ss_pred cCcCCCceEEEeeecceEEEeecccccchhh----hhccccc---------cccceEEEeccccceeeehhHhHhHHHhc Confidence 9999998865543322222233221111100 0000000 1112221111 11111 2 Q ss_pred EeccCceEEEeecC Q lcl|NC_021307. 297 INDVEAFVKLTNAA 310 (310) Q Consensus 297 v~~~~a~~~l~~aa 310 (310) ..+.+++++.+..+ T Consensus 281 pE~~dgiv~~tI~~ 294 (296) T protein:vir:98 281 PERIDGIVKVTLTP 294 (296) T ss_pred ccccceEEEEEecC Confidence 34567888888888 No 156 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=98.73 E-value=4.1e-09 Score=66.56 Aligned_cols=231 Identities=10% Similarity=-0.008 Sum_probs=143.9 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG-VKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) |+---.-.......+.-.+. ......|||.+.+.++|++.+++....+.+ .+..+.++-|+++|..=++.++ T Consensus 1 m~~~~~~a~TL~E~Akr~~~-------d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~fR~lN~g~~ 73 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDK-------NGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPVWRRYNQGVQ 73 (335) T ss_pred CCcCCCCchhHHHHHhhcCc-------chhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCchhhhcCCccc Confidence 33332222223222222221 124556999999999999999987543222 2334556778999999999999 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccc-------- Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANP--GNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETT-------- 149 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~--~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~-------- 149 (310) +++.++.+++-..+-+++.+.|-+.+.+.+. .++...-.+...+++..++.+.||+|+.+..|....+.. T Consensus 74 ~s~~tt~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st 153 (335) T protein:vir:73 74 PTKTQTVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLST 153 (335) T ss_pred cccceEEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccc Confidence 9999999999999999999999998887544 234555566688999999999999997655442221100 Q ss_pred -----------ccc------------------------------------------------------------------ Q lcl|NC_021307. 150 -----------KSV------------------------------------------------------------------ 152 (310) Q Consensus 150 -----------~~~------------------------------------------------------------------ 152 (310) ++. T Consensus 154 ~~a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~ 233 (335) T protein:vir:73 154 SKAASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDW 233 (335) T ss_pred cccCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCc Confidence 000 Q ss_pred -------ccee----cccchHHHHHHHHHHH-----hhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccc Q lcl|NC_021307. 153 -------DLTP----ATGTTYDAIGVNALSL-----LVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTP 216 (310) Q Consensus 153 -------~~~~----~~~~~~~~~~~~~~~~-----l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~ 216 (310) .+.. .......+++..++.+ +........+|+||++....|+++.....+..+......+ T Consensus 234 r~vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~g---- 309 (335) T protein:vir:73 234 RSISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEYGG---- 309 (335) T ss_pred ccEEEEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceeeeeeccCC---- Confidence 0000 0001112222223333 2333445578999999999999865444443333322222 Q ss_pred cCCceeeeeeEEEeCCCCCCceeEee Q lcl|NC_021307. 217 YREGRILGRPTILSDHVASGTTVGYL 242 (310) Q Consensus 217 ~~~~~l~G~pv~~t~~~~~~~~~~~~ 242 (310) ...-.+.|+||..++.+-.++..+.- T Consensus 310 ~~~t~~~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 310 KKIVSFLGIPIRRVDAILNTESAVTA 335 (335) T ss_pred ceeEEECCeEEEEEeeeecCcccccC Confidence 23346889999999998777655432 No 157 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.72 E-value=1.1e-09 Score=69.71 Aligned_cols=278 Identities=12% Similarity=0.077 Sum_probs=143.3 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCC-ce---EEEEEcCCceeeeecccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGST-GV---KIPHWTGDVSAAWIGEGD 76 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~-~~---~ip~~~~~~~a~~v~Eg~ 76 (310) |++-+.+...... +-...-++.+.+-+.+.+-.-++...+.+||..+ .+ ++|.++....++-|+||+ T Consensus 1 M~~e~nl~~~~dL---------~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe 71 (303) T protein:vir:10 1 MSAENNLINVEAL---------GKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGD 71 (303) T ss_pred CCCCcCCcchhhc---------ccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCc Confidence 4444433322111 1112223444444444444445556677888644 34 456566677889999999 Q ss_pred ccccccccee---eeEeeeeeeEeeehhhHHHhh-cChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccc Q lcl|NC_021307. 77 MKPITKGDMS---VQQVEPHKIATIFVASAETVR-ANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSV 152 (310) Q Consensus 77 ~~~~~~~~~~---~i~l~~~k~~~~~~is~ell~-~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~ 152 (310) .||.++.+.. ..+++.+|++..+ |.|.+. ....+....--++|..+++.++++.|+.-..++.... ... T Consensus 72 ~Iplskvt~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~-----~~t 144 (303) T protein:vir:10 72 VIPLTKVTREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENG-----KRT 144 (303) T ss_pred ccchhhheeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhccccc-----ccc Confidence 9999998864 6888899988865 999984 4445667788899999999999999987544332110 000 Q ss_pred cceecccchHHHHHHHHHHHhh--hhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEe Q lcl|NC_021307. 153 DLTPATGTTYDAIGVNALSLLV--NAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILS 230 (310) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t 230 (310) ..+.........-+......+. ..+....+.++||.+...+++ ++.- +..+..-|..+. -.++|.-++.+ T Consensus 145 ~~t~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~--~A~i---~~~~t~fG~n~L---~nfLG~~II~S 216 (303) T protein:vir:10 145 NKTKLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLA--NGFI---NSTGAQFGVNLL---TPYVGVKIVEF 216 (303) T ss_pred cceeecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhh--cCCc---chhhhhhhhhhh---hhhhcceEEEe Confidence 1111111111111111111111 112345699999999999874 2211 000000111111 12788899999 Q ss_pred CCCCCCceeEeeecceeeeEEeecc-----cEEEEeecceeeecccccccchhhhhcCcEEEEEEEEecc--EEeccCce Q lcl|NC_021307. 231 DHVASGTTVGYLGDFSQIVWGQVGG-----LSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGL--LINDVEAF 303 (310) Q Consensus 231 ~~~~~~~~~~~~gd~~~~~~~~~~~-----~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~--~v~~~~a~ 303 (310) ..+|.|+......|==.+++....| ..+.++.-+.+-...... .+...+-.....++ -..+.+++ T Consensus 217 ~kv~~G~~~~T~~~Ni~~ay~~~~g~l~~~f~~t~D~tglIGv~h~~~--------~~~~t~eT~~~~~~~lfpE~~dgi 288 (303) T protein:vir:10 217 ADVPQGEVWMTVAENLNVAYANPRGELSRAFAFATDATGFVGVLHDIQ--------PQRLTSDTIYASAISMFPENIDAV 288 (303) T ss_pred ccCCCceEEEeeccceEEEEecCchhhhhhhhhccccccceEEEeccc--------cceeeehhHhHhHHHhcccccceE Confidence 9999998665443322222222222 122221111111110000 00011111111111 23456788 Q ss_pred EEEeecC Q lcl|NC_021307. 304 VKLTNAA 310 (310) Q Consensus 304 ~~l~~aa 310 (310) ++.+..+ T Consensus 289 v~~ti~~ 295 (303) T protein:vir:10 289 IKVTIKK 295 (303) T ss_pred EEEEEec Confidence 9998866 No 158 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.72 E-value=2.9e-09 Score=67.39 Aligned_cols=295 Identities=11% Similarity=-0.009 Sum_probs=148.5 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) |.-... ....+..++...=.+.-+++..++.+.....++++++..+..+. +.++++|+. +..+++...-|++.- T Consensus 1 Ms~~n~----~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~i-G~~~a~y~~~G~~ld 75 (402) T protein:vir:97 1 MSTPNT----LTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPN 75 (402) T ss_pred CCCccc----ccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEE-eeeEEeeeccccccC Confidence 222211 11111122222112334778888999999999999998887765 567899987 566677777676665 Q ss_pred ccccceeeeEeeeeee-EeeehhhHHHhhcChhH-HHHHHHHHHHHHHHHHHHHHHHc-----ccC-cccc---cccc-- Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKI-ATIFVASAETVRANPGN-YLGTMRTKVATAIALAFDEAALH-----GTD-SPFD---KNLD-- 146 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~-~~~~~is~ell~~s~~~-~~~~v~~~l~~a~~~~~d~~~l~-----G~g-~~~~---~~~~-- 146 (310) .+.+.-++.++....+ .....|-+----++..+ +-+.+.++++.++++..|+.++. +.. +..+ .... T Consensus 76 g~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~ 155 (402) T protein:vir:97 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) T ss_pred CCCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCccccc Confidence 5667777777777543 33233322111244566 67899999999999999997753 110 0000 0000 Q ss_pred cccccccceec-ccchHH---HHHHHHHHHhhhhc--CCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCc Q lcl|NC_021307. 147 ETTKSVDLTPA-TGTTYD---AIGVNALSLLVNAG--KKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREG 220 (310) Q Consensus 147 ~~~~~~~~~~~-~~~~~~---~~~~~~~~~l~~~~--~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~ 220 (310) +.......+.. ...+.. +-+.++...+.+.+ ...-+.+++|..|..|.+-.+--.+- |.. ...+....+.-. T Consensus 156 g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d-~~~-~~~g~~~~G~v~ 233 (402) T protein:vir:97 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKT-YTI-SQSGATINGFVL 233 (402) T ss_pred ccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchh-hcc-ccCCccccceeE Confidence 01111111111 111221 11223334443332 33457899999999997632211110 110 111223334446 Q ss_pred eeeeeeEEEeCCCCCCce-------------eE--eeecceeee--EEeecccEEEEeecceeeecccccccchhhhhcC Q lcl|NC_021307. 221 RILGRPTILSDHVASGTT-------------VG--YLGDFSQIV--WGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHN 283 (310) Q Consensus 221 ~l~G~pv~~t~~~~~~~~-------------~~--~~gd~~~~~--~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (310) .+.|+||+.++++|.... .. +-+|++... +..+.- +-+.+-..+.......... - T Consensus 234 ~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~A--v~tvk~~~vT~~~~~d~r~------~ 305 (402) T protein:vir:97 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDA--LLVGRTIEVTGDIFYEKKE------K 305 (402) T ss_pred EEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecce--EEEEEeeccccchhhchhH------H Confidence 799999999999985321 01 125555422 222221 1111111111111000000 0 Q ss_pred cEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 284 LVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 284 ~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ...+-+..-++..+.||+|..+++.+= T Consensus 306 ~~~id~~~a~G~g~~RPeaa~vv~~~~ 332 (402) T protein:vir:97 306 TYYIDTFMAEGAIPDRWEAVSVVTTKR 332 (402) T ss_pred HHHHHHHHHhCCcccCccceEEEEEec Confidence 011223345788889999988774443 No 159 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.68 E-value=2.5e-09 Score=67.77 Aligned_cols=292 Identities=11% Similarity=-0.004 Sum_probs=154.1 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) |.--. ........+++..=.+.-+++..++.......++++++..+..+. +.++++|+. +..+++...-|+++- T Consensus 1 Ms~~n----~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G~s~a~y~~pG~~ld 75 (400) T protein:vir:10 1 MSTPN----NLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPA 75 (400) T ss_pred CCCCc----cccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEEeeecCCCCcC Confidence 21110 001111112222222456777888899999999999999988876 556889987 677888888888876 Q ss_pred ccccceeeeEeeeee-eEeeehhhHHHhhcChhH-HHHHHHHHHHHHHHHHHHHHHHc----cc--Cccccc---cc--- Q lcl|NC_021307. 80 ITKGDMSVQQVEPHK-IATIFVASAETVRANPGN-YLGTMRTKVATAIALAFDEAALH----GT--DSPFDK---NL--- 145 (310) Q Consensus 80 ~~~~~~~~i~l~~~k-~~~~~~is~ell~~s~~~-~~~~v~~~l~~a~~~~~d~~~l~----G~--g~~~~~---~~--- 145 (310) .+.+.-++..++... +.....|-+----++..+ +-+.+.+++.+++++..|+.++. +. .+..+. ++ T Consensus 76 g~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~ 155 (400) T protein:vir:10 76 ATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGH 155 (400) T ss_pred CCCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCcccc Confidence 666777888787755 444444443222344566 78999999999999999997752 11 011110 00 Q ss_pred ccccccccceecccchHHHH---HHHHHHHhhhhcC--CCCEEEEehHHHHHHHHhhhccCcccccccc---cccccccc Q lcl|NC_021307. 146 DETTKSVDLTPATGTTYDAI---GVNALSLLVNAGK--KWGATLLDDVAEPILNGAKDANGRPLFVEST---YEAVTTPY 217 (310) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~l~~~~~--~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~---~~~~~~~~ 217 (310) ...............+...+ ..++...+...+. ..-++++.|..|..|.... .+..... ..+....+ T Consensus 156 g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~d-----kLvnrdf~~s~~g~~~~g 230 (400) T protein:vir:10 156 GFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDAD-----RIVDKSYTISQSGATIQG 230 (400) T ss_pred ccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCC-----cccchhccccCCCccccc Confidence 00111111111111121122 2233333333222 2346666777776765321 1222221 11223344 Q ss_pred CCceeeeeeEEEeCCCCCCc-----ee--------E--eeecceeee--EEeecccEEEEeecceeeecccccccchhhh Q lcl|NC_021307. 218 REGRILGRPTILSDHVASGT-----TV--------G--YLGDFSQIV--WGQVGGLSFDVSDQATLNLGTPQAPNFVSLW 280 (310) Q Consensus 218 ~~~~l~G~pv~~t~~~~~~~-----~~--------~--~~gd~~~~~--~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~ 280 (310) ....+.|+||+.++++|... +. . +-||++... +..++-+- +.+-..+........ T Consensus 231 ~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~--tvk~~~lt~~~~~d~------ 302 (400) T protein:vir:10 231 FVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALL--VGRSIDVIGDIFYEK------ 302 (400) T ss_pred eEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheE--EEEeeccccccccch------ Confidence 45679999999999998421 01 1 236666543 22222211 111111111111110 Q ss_pred hcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 281 QHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 281 ~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++-...+-+..-++..+.||+|++.++.+= T Consensus 303 r~~~~~id~~~a~G~g~~RPeaa~vv~~~~ 332 (400) T protein:vir:10 303 KEKTYYIDTFMSEGAIPDRWEAVSVVTTKR 332 (400) T ss_pred hhHHHHHHHHHHhCCcccchhheEEEEecC Confidence 111122334456788999999999998876 No 160 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.56 E-value=4.5e-09 Score=66.34 Aligned_cols=292 Identities=12% Similarity=-0.009 Sum_probs=149.7 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEcCCceeeeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMG-STGVKIPHWTGDVSAAWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~ 79 (310) |.--. ........+++..=.+.-+++..++.......++++++..+..+. +.++++|+. +..+++...-|++.- T Consensus 1 Ms~~n----~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G~s~~~~~~pG~~ld 75 (401) T protein:vir:70 1 MSTPN----NLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPA 75 (401) T ss_pred CCCCc----cccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEeeeecCCCCcC Confidence 21110 000111111121112455777888899999999999999988876 566899987 667788887777776 Q ss_pred ccccceeeeEeeeee-eEeeehhhHHHhhcChhH-HHHHHHHHHHHHHHHHHHHHHHc-----ccC--ccccccccccc- Q lcl|NC_021307. 80 ITKGDMSVQQVEPHK-IATIFVASAETVRANPGN-YLGTMRTKVATAIALAFDEAALH-----GTD--SPFDKNLDETT- 149 (310) Q Consensus 80 ~~~~~~~~i~l~~~k-~~~~~~is~ell~~s~~~-~~~~v~~~l~~a~~~~~d~~~l~-----G~g--~~~~~~~~~~~- 149 (310) .+.+.-++..|.... +.....|-+----++..+ +.+.+.+++.+++++..|+.++. |-. .+....+.... T Consensus 76 ~~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~ 155 (401) T protein:vir:70 76 ATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGH 155 (401) T ss_pred CCCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCC Confidence 667778888787755 334444433222244566 68899999999999999986632 211 00001111000 Q ss_pred ----ccccceecccchHH---HHHHHHHHHhhhhcCC--CCEEEEehHHHHHHHHhhhccCcccccccc---cccccccc Q lcl|NC_021307. 150 ----KSVDLTPATGTTYD---AIGVNALSLLVNAGKK--WGATLLDDVAEPILNGAKDANGRPLFVEST---YEAVTTPY 217 (310) Q Consensus 150 ----~~~~~~~~~~~~~~---~~~~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~---~~~~~~~~ 217 (310) ...........+.. +-+.++...+...+.. .-++++.|..|..|... | .+..... ..+....+ T Consensus 156 G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~-d----~L~nrd~~~s~~g~~~~G 230 (401) T protein:vir:70 156 GFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDA-D----RIVDKTYTISQSGATIQG 230 (401) T ss_pred ceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhc-C----cccchhhccccCCccccc Confidence 00011111111111 1223444444433332 33566666666666432 1 1221111 12223334 Q ss_pred CCceeeeeeEEEeCCCCCCce-------------eE--eeecceeee--EEeecccEEEEeecceeeecccccccchhhh Q lcl|NC_021307. 218 REGRILGRPTILSDHVASGTT-------------VG--YLGDFSQIV--WGQVGGLSFDVSDQATLNLGTPQAPNFVSLW 280 (310) Q Consensus 218 ~~~~l~G~pv~~t~~~~~~~~-------------~~--~~gd~~~~~--~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~ 280 (310) ....+.|+||+.++++|.... .. +-||++... +..++-+- +.+-..+........ T Consensus 231 ~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~--tvk~~~lt~~~~~d~------ 302 (401) T protein:vir:70 231 FTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALL--VGRSIDVTGDIFYEK------ 302 (401) T ss_pred eEEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheE--EEEeeccccchhhhh------ Confidence 446799999999999985320 11 225665533 22222211 111111111111110 Q ss_pred hcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 281 QHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 281 ~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++-...+-+..-++..+.||+|.+.++.|= T Consensus 303 r~~~~~id~~~a~g~g~~RPeaa~vv~~k~ 332 (401) T protein:vir:70 303 KEKTYYIDTFMAEGAIPDRWEAVSVVTTKR 332 (401) T ss_pred hhhHHHHHHHHHhCCcccchhheEEEeecC Confidence 011111224456788999999998886665 No 161 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=98.53 E-value=2.1e-08 Score=62.64 Aligned_cols=281 Identities=10% Similarity=-0.015 Sum_probs=155.4 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCcee-eeeccccccc Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSA-AWIGEGDMKP 79 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a-~~v~Eg~~~~ 79 (310) ||. .+++-++....-..+.+.+.|...-....|++++.......+....|....-...+ .-..||+..+ T Consensus 1 ma~----------~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~ 70 (317) T protein:vir:88 1 MAT----------PTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDAT 70 (317) T ss_pred CCc----------cccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccc Confidence 322 11121222222345667788888888999999988877766666777765433332 3334887766 Q ss_pred ccccceee---eEeeeeeeEeeehhhHHHhhcCh-hHHHHHHHHHHHHHHHHHHHHHHHcccCcc---cc--ccc-cccc Q lcl|NC_021307. 80 ITKGDMSV---QQVEPHKIATIFVASAETVRANP-GNYLGTMRTKVATAIALAFDEAALHGTDSP---FD--KNL-DETT 149 (310) Q Consensus 80 ~~~~~~~~---i~l~~~k~~~~~~is~ell~~s~-~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~---~~--~~~-~~~~ 149 (310) ........ -......=...+.=|.+...... .+...+-..+-...+.+.+|+++++|.... +. ..- .+.. T Consensus 71 ~~~~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~ 150 (317) T protein:vir:88 71 IKAGSFTTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIF 150 (317) T ss_pred cccccCCEEeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHH Confidence 44322211 11122222222223333333222 243444444455567889999999997431 11 100 0000 Q ss_pred cc---------c--------c--ceecc-cchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccc- Q lcl|NC_021307. 150 KS---------V--------D--LTPAT-GTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVES- 208 (310) Q Consensus 150 ~~---------~--------~--~~~~~-~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~- 208 (310) .- . . .+..+ .....+.+.++..++-..+..+..++|++.....|.++...++..+..+. T Consensus 151 ~~i~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~ 230 (317) T protein:vir:88 151 AYYKTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDAS 230 (317) T ss_pred HHhccCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEccc Confidence 00 0 0 00001 11234555677778888888888999999999999887543443443221 Q ss_pred -cccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEE Q lcl|NC_021307. 209 -TYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAV 287 (310) Q Consensus 209 -~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (310) ...+.........+--+.++.+.++|+++ +++.|++++-+...+++..+.+-. .-|.... T Consensus 231 ~~~~g~~v~~~~tdfG~v~ii~~r~lp~~~--~~~~D~~~~~l~~Lr~~~~e~laK-----------------tGd~~k~ 291 (317) T protein:vir:88 231 DNRIAQTVDVYESDFGKYTIRANRWFHENT--LFVFDPKMHSLCYLRPFFQHELAK-----------------TGDSEKR 291 (317) T ss_pred CeEEEEEEEEEEeCCeEEEEEeCCCCCCCe--EEEEcccccceeecccceeeccCC-----------------Cccccee Confidence 11111111111112224888899999876 455688887666555554443211 2345566 Q ss_pred EEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 288 RVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 288 r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .....++..+.++.|.+++++-+ T Consensus 292 ~i~~E~tLe~~N~~a~a~i~~l~ 314 (317) T protein:vir:88 292 QLLVEYTFRVNNEKSGALIRDVV 314 (317) T ss_pred EEEEEEEEEEcCccceeEEEEec Confidence 67889999999999999999998 No 162 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.50 E-value=1.1e-07 Score=58.67 Aligned_cols=269 Identities=14% Similarity=0.044 Sum_probs=152.3 Q ss_pred cccCCCCceechhh---HHHHHHHHHhhchhhhhcce---eecCCCceEEEEEcCCceeeeeccc-ccccccccceeeeE Q lcl|NC_021307. 17 TGDSMFQGYLEPEQ---AQDYFAEAEKTSIVQRVARK---IPMGSTGVKIPHWTGDVSAAWIGEG-DMKPITKGDMSVQQ 89 (310) Q Consensus 17 ~~~~~~g~~i~~~~---~~~ii~~~~~~s~l~~~~~~---~~~~~~~~~ip~~~~~~~a~~v~Eg-~~~~~~~~~~~~i~ 89 (310) ..+.+.|.++..++ -..+++.+.+....+++..+ .+.....+.+...+....+.|.+.+ ..+|..+..++... T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 33334444444322 24578888888887777654 3333445666766677778898764 44788889999999 Q ss_pred eeeeeeEeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccccc------ccc-eec-- Q lcl|NC_021307. 90 VEPHKIATIFVASAETVRAN---PGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKS------VDL-TPA-- 157 (310) Q Consensus 90 l~~~k~~~~~~is~ell~~s---~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~------~~~-~~~-- 157 (310) ...+.++..+.++..-++.+ ..++..--....++++++.+|+.+|+|+..-+-.+.+...+. .+. ... T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~~~~ 160 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVGNVSK 160 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCcccccccc Confidence 99999999999988877754 457888889999999999999999999775332222221110 000 000 Q ss_pred -ccchHHHHH---HHHHHHhhh---hcCCCCEEEEehHHHHHHHHhh--hccCccccccccccccccccCCceeeeeeEE Q lcl|NC_021307. 158 -TGTTYDAIG---VNALSLLVN---AGKKWGATLLDDVAEPILNGAK--DANGRPLFVESTYEAVTTPYREGRILGRPTI 228 (310) Q Consensus 158 -~~~~~~~~~---~~~~~~l~~---~~~~~~~~~~~~~~~~~l~~l~--d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~ 228 (310) ...+.+.+. ..+..++.. ....+..++++|+.+..|...+ +..|..++.- .. -.....+|...|-. T Consensus 161 w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~-l~----~~~~~~~I~~~p~L 235 (301) T protein:vir:80 161 WEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKV-LQ----DNAWFSAIVRVPDL 235 (301) T ss_pred cccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHH-HH----HHcCcceEEEccee Confidence 111233333 334444322 2235678999999999997543 3333322211 00 01122456666665 Q ss_pred EeCCCCCCceeEee-e-cceeeeEEeecccEEEEeecceeeecccccccchhhhhcCc-EEEEEEEEe-ccEEeccCceE Q lcl|NC_021307. 229 LSDHVASGTTVGYL-G-DFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNL-VAVRVEAEY-GLLINDVEAFV 304 (310) Q Consensus 229 ~t~~~~~~~~~~~~-g-d~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~r~~~~~-d~~v~~~~a~~ 304 (310) .+... .++..+++ . +...+.+.....+.... ...+++ .....+.|+ +..+.+|.|++ T Consensus 236 ~~~g~-~g~~~~v~~~~~~d~~~~~v~~~~~~~~------------------~e~~~~~~~~~~~~r~~Gv~i~~P~ai~ 296 (301) T protein:vir:80 236 AGMGT-AGSDSFAVIHDSNETAELIIPMDITRHP------------------EEYSFPRTKVPFEERTAGVVVRFPAAIV 296 (301) T ss_pred ccCCC-CcccEEEEEecCCcEEEEEecCceeeec------------------ceecCceeEeeeeeeeEEEEEEccceEE Confidence 54332 22222221 1 11112222222221110 112221 223345566 56899999999 Q ss_pred EEeec Q lcl|NC_021307. 305 KLTNA 309 (310) Q Consensus 305 ~l~~a 309 (310) .+.+- T Consensus 297 ~~~GI 301 (301) T protein:vir:80 297 RVDGI 301 (301) T ss_pred EEecC Confidence 99999 No 163 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.46 E-value=4.6e-08 Score=60.80 Aligned_cols=284 Identities=9% Similarity=-0.002 Sum_probs=145.1 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHH-hhchhhhhcceeecCCCceEEEEEcCCceeeeec------ Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAE-KTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIG------ 73 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~-~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~------ 73 (310) |+=+..+.-- ...++.-...-..++..++....+ +.+.|++-++...-.++...+-.. ......-++ T Consensus 1 ~~~~~~~~~~-----~~Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 74 (322) T protein:vir:10 1 MKLNAIMSML-----PLIAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETL-ASMDPDAVKRKRSRQ 74 (322) T ss_pred Ccccceeeee-----eeeechhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeec-ccccccccccccccc Confidence 3222211100 000111111122556666555443 455666665533322222111111 111112222 Q ss_pred ---ccc-cccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccc-- Q lcl|NC_021307. 74 ---EGD-MKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDE-- 147 (310) Q Consensus 74 ---Eg~-~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~-- 147 (310) .+. ..|......+.........+....|.+.-..+...+..+...+..+.+++++.|..++++.-.....+..+ T Consensus 75 ~~~d~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~ 154 (322) T protein:vir:10 75 QSADGTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQP 154 (322) T ss_pred cccCcccCCCccccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccccc Confidence 111 23333334444444444444456777776666778899999999999999999998887532111111111 Q ss_pred --ccccccce-ecccchHHHHHHHHHHHhhhhcCCC--C-EEEEehHHHHHHHHhhhcc-CccccccccccccccccCCc Q lcl|NC_021307. 148 --TTKSVDLT-PATGTTYDAIGVNALSLLVNAGKKW--G-ATLLDDVAEPILNGAKDAN-GRPLFVESTYEAVTTPYREG 220 (310) Q Consensus 148 --~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~~~~--~-~~~~~~~~~~~l~~l~d~~-g~~~~~~~~~~~~~~~~~~~ 220 (310) ........ ...+.+.+. +.++...+...+... . .++.+|..+..|.....-. ..+...... ...+..+ T Consensus 155 v~~~ss~~i~~g~~g~t~~k-l~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l----~~~G~ig 229 (322) T protein:vir:10 155 VEFLATQEIGDGTKPISFDY-VTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDL----QSKGIIT 229 (322) T ss_pred cccCCCcccccCccchhHHH-HHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhh----hhcCeee Confidence 00011111 122334444 345666666655543 2 4677888888886533222 222221111 1123356 Q ss_pred eeeeeeEEEeCCCCCCce----------------eEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCc Q lcl|NC_021307. 221 RILGRPTILSDHVASGTT----------------VGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNL 284 (310) Q Consensus 221 ~l~G~pv~~t~~~~~~~~----------------~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (310) +++|..++.++++|.+.. ..+....+.+.++.+.++..++...+.. ... T Consensus 230 ~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~---------------~~a 294 (322) T protein:vir:10 230 NWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSA---------------SFA 294 (322) T ss_pred eeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCc---------------chh Confidence 799999999999984321 1233444555566655666665443331 223 Q ss_pred EEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 285 VAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 285 ~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ..+++..-+|..+.+|+.++.+...= T Consensus 295 ~~I~~~~~~Ga~ri~~~gVv~i~~~e 320 (322) T protein:vir:10 295 WRIYSAFTADCVRVEDEHIFKLRLKN 320 (322) T ss_pred hhhhhhhhhCceEeccCcEEEEEEec Confidence 44667788999999999999998876 No 164 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=98.41 E-value=2.6e-07 Score=56.71 Aligned_cols=273 Identities=10% Similarity=-0.019 Sum_probs=131.6 Q ss_pred CCCc-eechhhHHHHHHHHHhhchhhhhccee---e---cCCCceEEEEEcCCceeeee-----cccccccccccceeee Q lcl|NC_021307. 21 MFQG-YLEPEQAQDYFAEAEKTSIVQRVARKI---P---MGSTGVKIPHWTGDVSAAWI-----GEGDMKPITKGDMSVQ 88 (310) Q Consensus 21 ~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~---~---~~~~~~~ip~~~~~~~a~~v-----~Eg~~~~~~~~~~~~i 88 (310) +.-. ++|..++.++++.+++..++.+++..- . -.+.+++||+... ..+.+. +++..+...+.+-+++ T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDFTEDSF 79 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccc-ccceeeeccccccCCcccccccccceE Confidence 2222 345556788999999999988887531 1 2356788987542 333332 3455566566666777 Q ss_pred Eeee-eeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHH Q lcl|NC_021307. 89 QVEP-HKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGV 167 (310) Q Consensus 89 ~l~~-~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 167 (310) ++.. +..+.-+.++++-......++.+.+.++..++++.++|..++.--. +.+..... ...........+.+. T Consensus 80 ~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~-~a~~~~~~-----~~~~~~~~~~~~~i~ 153 (392) T protein:vir:99 80 PVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIV-GAPYEAAG-----AVHEVAPDEFFKGVN 153 (392) T ss_pred EEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHh-cccccccc-----cccccChhhhHHHHH Confidence 7776 4556667778877777788898889999999999999998874221 11111000 001111122233444 Q ss_pred HHHHHhhhhcCC-CCEEEEehHHHHHHHHhhhccCcccccccc---ccccccccCCceeeeeeEEEeCCCCCCceeEeee Q lcl|NC_021307. 168 NALSLLVNAGKK-WGATLLDDVAEPILNGAKDANGRPLFVEST---YEAVTTPYREGRILGRPTILSDHVASGTTVGYLG 243 (310) Q Consensus 168 ~~~~~l~~~~~~-~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~---~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~g 243 (310) ++...|...... .-+++++|..+..|.+. .. ....... .......+.-+++.|.+|+.++++|.++... + T Consensus 154 ~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~--~~--~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a--~ 227 (392) T protein:vir:99 154 GARRALNELYIPQGRVLVVGTAVTEQILND--DR--FIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYL--Y 227 (392) T ss_pred HHHHHHhhcCCCCCCEEEEcHHHHHHHhcc--cc--eeecccccchhhhhhhcceeeeeeeeEEEeeccccccccee--e Confidence 666666665443 33677888888887642 11 1111111 0111223344689999999999998765432 2 Q ss_pred cceeeeEEeecccEEE-------EeecceeeecccccccchhhhhcCcEEEEEEEEeccEEec---cCceEE---EeecC Q lcl|NC_021307. 244 DFSQIVWGQVGGLSFD-------VSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIND---VEAFVK---LTNAA 310 (310) Q Consensus 244 d~~~~~~~~~~~~~v~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~---~~a~~~---l~~aa 310 (310) ..+.+.+......... .+....+......... ..+..+...+.. ..+..... ..++.. ++..+ T Consensus 228 ~~~a~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~--~t~~s~~~~v~~--~~g~~~v~~~~~~~~~~~~~~~~~~ 303 (392) T protein:vir:99 228 HPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYD--STITSNRSLIDT--YFGLKVVEDPNGVGFVRARKIHLIP 303 (392) T ss_pred eccccccccccccccccccceeEEecccceecceeeccc--ceeeccccccce--eEEEEEEeeccccceeeeeeeeeec Confidence 2222222211111100 0000000000000000 000111111110 01111110 001100 00000 No 165 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.38 E-value=2.3e-07 Score=56.97 Aligned_cols=285 Identities=9% Similarity=0.023 Sum_probs=154.0 Q ss_pred Ccc-------chhhhHHHHHhhccccC--CCCcee-chh--hHH-HHHHHHHhhchhhhhccee---ecCCCceEEEEEc Q lcl|NC_021307. 1 MAA-------GTAFPVNHTQIAQTGDS--MFQGYL-EPE--QAQ-DYFAEAEKTSIVQRVARKI---PMGSTGVKIPHWT 64 (310) Q Consensus 1 ~aa-------~~~~~~~~~~~~~~~~~--~~g~~i-~~~--~~~-~ii~~~~~~s~l~~~~~~~---~~~~~~~~ip~~~ 64 (310) |.- +..+.... ..+..... ...|.+ ..+ .++ .+++...+....+++..+. +....++.+...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~-~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~ 79 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYL-IQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFD 79 (319) T ss_pred CCCcchhHHhhHHHHHHH-hhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeec Confidence 322 11111111 11111111 122333 332 233 4777777777777766653 2233446666777 Q ss_pred CCceeeeecc-cccccccccceeeeEeeeeeeEeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHHHHHcccCcc Q lcl|NC_021307. 65 GDVSAAWIGE-GDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN---PGNYLGTMRTKVATAIALAFDEAALHGTDSP 140 (310) Q Consensus 65 ~~~~a~~v~E-g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s---~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~ 140 (310) ..+.+.|.+. ...+|..+..++......+.++..+.++..-++.+ ..++..--....++++++.+|+.+|+|+..- T Consensus 80 ~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~ 159 (319) T protein:vir:10 80 KVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSAPH 159 (319) T ss_pred cccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccc Confidence 7777889875 45589888999999999999999999988777654 4578888889999999999999999997654 Q ss_pred cccccccccccccceec-----ccchHHHHHH---HHHHHhhh---hcCCCCEEEEehHHHHHHHHhhhccCcccccccc Q lcl|NC_021307. 141 FDKNLDETTKSVDLTPA-----TGTTYDAIGV---NALSLLVN---AGKKWGATLLDDVAEPILNGAKDANGRPLFVEST 209 (310) Q Consensus 141 ~~~~~~~~~~~~~~~~~-----~~~~~~~~~~---~~~~~l~~---~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~ 209 (310) .-.+.+...+......+ ...+.+.... .+...+.. ....+..++++|+.+..|.......|..+..--. T Consensus 160 g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~t~l~~lk 239 (319) T protein:vir:10 160 KIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMPETTMSYLDYFK 239 (319) T ss_pred cceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccCCCCeeHHHHHH Confidence 32233222111111111 1112233333 33333322 3346789999999999997555444433321100 Q ss_pred ccccccccCCceeeeeeEEEeCCCCCCceeEeee--cceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEE Q lcl|NC_021307. 210 YEAVTTPYREGRILGRPTILSDHVASGTTVGYLG--DFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAV 287 (310) Q Consensus 210 ~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~g--d~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (310) -...+.+|.+.|....... .++..+++- +...+-+.....+++..... +.=...+ T Consensus 240 -----~~~~~l~I~~~pel~~ag~-~g~~~~v~y~~~~~~~~~~v~~~~~~~~~e~-----------------~~l~~~~ 296 (319) T protein:vir:10 240 -----SQNSGIEIDSIAELEDIDG-AGTKGVLVYEKNPMNMSIEIPEAFNMLPAQP-----------------KDLHFKV 296 (319) T ss_pred -----HhcCCceEEEeeeecccCC-CcceEEEEEecCCceEEEecCcceeeeeeee-----------------cCceEEE Confidence 0122445777776654332 222222222 22222222222222211100 0011233 Q ss_pred EEEEEec-cEEeccCceEEEeec Q lcl|NC_021307. 288 RVEAEYG-LLINDVEAFVKLTNA 309 (310) Q Consensus 288 r~~~~~d-~~v~~~~a~~~l~~a 309 (310) ....|++ ..+.+|.|++.+.+- T Consensus 297 ~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 297 PCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred eeeeeeEEEEEEccceeEeeecC Confidence 3455554 678899999999999 No 166 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.31 E-value=3.7e-07 Score=55.82 Aligned_cols=271 Identities=9% Similarity=-0.003 Sum_probs=155.4 Q ss_pred hccccC-CCCceechhh--H-HHHHHHHHhhchhhhhcceee---cCCCceEEEEEcCCceeeeecc-ccccccccccee Q lcl|NC_021307. 15 AQTGDS-MFQGYLEPEQ--A-QDYFAEAEKTSIVQRVARKIP---MGSTGVKIPHWTGDVSAAWIGE-GDMKPITKGDMS 86 (310) Q Consensus 15 ~~~~~~-~~g~~i~~~~--~-~~ii~~~~~~s~l~~~~~~~~---~~~~~~~ip~~~~~~~a~~v~E-g~~~~~~~~~~~ 86 (310) +..-.. .+|.++..++ + ..+++...+.-..+++..+.. ....++.+++.+..+.+.|.+. +..+|..+..++ T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~ 80 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLVDALAT 80 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccceeeccce Confidence 222212 2334455432 2 346666666666666655432 2234566677777777889865 555888899999 Q ss_pred eeEeeeeeeEeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccc------cccceec Q lcl|NC_021307. 87 VQQVEPHKIATIFVASAETVRAN---PGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTK------SVDLTPA 157 (310) Q Consensus 87 ~i~l~~~k~~~~~~is~ell~~s---~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~------~~~~~~~ 157 (310) ......+.++..+.++.+-++.+ ..++..--....++++++.+|+.+|.|+..-+-.+.+.... ..++... T Consensus 81 ~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~~W~~~ 160 (296) T protein:vir:10 81 ERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGGSWSQP 160 (296) T ss_pred eEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccccCCccCH Confidence 99999999999999988777654 45788888889999999999999999976543223322111 1111111 Q ss_pred ccchHHHHHHHHHHHhhh---hcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCC Q lcl|NC_021307. 158 TGTTYDAIGVNALSLLVN---AGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVA 234 (310) Q Consensus 158 ~~~~~~~~~~~~~~~l~~---~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~ 234 (310) +...+++..++..+.. ....+..++++|..+..|.......|..+..- . .-...+.++...|...+.+. T Consensus 161 --t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~-i----k~~~~~l~i~~~~~l~~a~~- 232 (296) T protein:vir:10 161 --TTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGEF-F----RQNNSGVTVEFVQYLNDYNG- 232 (296) T ss_pred --HHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHHH-H----HHhcCCceEEEeeeeccCCC- Confidence 1222333344443332 34567789999999998875544444322211 0 01123445666666544332 Q ss_pred CCceeEee--ecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEec-cEEeccCceEEEeecC Q lcl|NC_021307. 235 SGTTVGYL--GDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYG-LLINDVEAFVKLTNAA 310 (310) Q Consensus 235 ~~~~~~~~--gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d-~~v~~~~a~~~l~~aa 310 (310) .++..+++ -+...+-+.....+...-.. .+.-...++...+++ ..+.+|.|++.+.+-- T Consensus 233 ~g~~~~v~~~~~~~~~~~~v~~~~~~~~~e-----------------~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~ 294 (296) T protein:vir:10 233 TGTSAAIAYEKDPNNMAIEIPEATNALPAQ-----------------PKDLHFKIPVTSKATGLIVYRPLTMAVMKGIT 294 (296) T ss_pred CcceEEEEEEcCCceEEEEcCcceeeeccc-----------------ccCceEEEeeEeeEEEEEEECCceeEEEeeee Confidence 23333332 22223323322332221110 011234566678785 7999999999996655 No 167 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=98.24 E-value=1.4e-07 Score=58.23 Aligned_cols=290 Identities=13% Similarity=0.087 Sum_probs=150.1 Q ss_pred Cccchhh----hHHHHHhhc-cccCCCCceechhhHHHHHHHHHhhch-hhhhcceeecC-CCceEEEEEcCCceeeeec Q lcl|NC_021307. 1 MAAGTAF----PVNHTQIAQ-TGDSMFQGYLEPEQAQDYFAEAEKTSI-VQRVARKIPMG-STGVKIPHWTGDVSAAWIG 73 (310) Q Consensus 1 ~aa~~~~----~~~~~~~~~-~~~~~~g~~i~~~~~~~ii~~~~~~s~-l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~ 73 (310) -.+|... +.+...++. .+|+..+.+|-...-..+++.-+.... ..+.|..-.++ -...+..+..+.++-.-|. T Consensus 342 ~~~G~~~~~~~~~~~v~~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~ 421 (652) T protein:vir:79 342 TERGIGVSSYNPMQMVGAAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVR 421 (652) T ss_pred HhhccCCCCCCHHHHHHHHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccccC Confidence 1122111 223444444 355555544433333333333333322 55556654443 1223344555666777889 Q ss_pred ccccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHH---cccCcc--cccccccc Q lcl|NC_021307. 74 EGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAAL---HGTDSP--FDKNLDET 148 (310) Q Consensus 74 Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l---~G~g~~--~~~~~~~~ 148 (310) |+++.......=+..++...++|..+.||++++-..+.++..-|...++++.++.+++.++ .+...- .++..... T Consensus 422 E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~h 501 (652) T protein:vir:79 422 EGAEYKYVTTGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDK 501 (652) T ss_pred CCCccceeeecCccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceeecc Confidence 9999987666557788999999999999999998778899999999999999999987554 333211 11111101 Q ss_pred cccccceecccchHHHHHHHHHHHhh---h----hcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCce Q lcl|NC_021307. 149 TKSVDLTPATGTTYDAIGVNALSLLV---N----AGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGR 221 (310) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~l~---~----~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~ 221 (310) ....+...+...+.+.+ ......+. . .+..+..|++.+.....-.++-.+...+ ..+...+. ... T Consensus 502 A~H~Nl~~~aa~~~~~l-~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~v~--~a~~~~~~-----~Np 573 (652) T protein:vir:79 502 AKHANVLESAAMDVASL-DKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSVK--GADINAGI-----INP 573 (652) T ss_pred cccccccccccCCHHHH-HHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCCCc--cccccccc-----ccc Confidence 11111111122222222 12222221 1 2244666777777665555544332110 00111111 122 Q ss_pred eeee-eEEEeCCCCCCceeEeeecceeeeEEeeccc-EEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|NC_021307. 222 ILGR-PTILSDHVASGTTVGYLGDFSQIVWGQVGGL-SFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIND 299 (310) Q Consensus 222 l~G~-pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~-~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~ 299 (310) +.|. .+++++.+..+.. ..+|++...+. .+++ +++.....+....-.-|..|-+.+|+...+|.++.+ T Consensus 574 ~~~~~~~i~eprL~~~s~-------~~wylaa~~~~dtiev---~yL~G~~~P~ie~~~gf~~dG~~~kvrlD~G~~~iD 643 (652) T protein:vir:79 574 VKDFATVIAEPRLDDNSQ-------TTFYLAASKGSDTIEV---AYLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAPVD 643 (652) T ss_pred cccccccccccccCCCCc-------ccEEEecCCCCCeEEE---EEecCCCCCeeeecCCCCcceEEEEEEEeccCceee Confidence 3332 5566666643221 11222211111 1222 122222222111122388999999999999999999 Q ss_pred cCceEEEee Q lcl|NC_021307. 300 VEAFVKLTN 308 (310) Q Consensus 300 ~~a~~~l~~ 308 (310) --.+.|.+- T Consensus 644 ~RG~~k~t~ 652 (652) T protein:vir:79 644 HRGLVKCTA 652 (652) T ss_pred ccceeeecC Confidence 999999877 No 168 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.14 E-value=8.8e-07 Score=53.79 Aligned_cols=285 Identities=11% Similarity=0.005 Sum_probs=151.9 Q ss_pred CccchhhhHHHHHhh--c---cccCCCCceechh--hH-HHHHHHHHhhchhhhhcceee---cCCCceEEEEEcCCcee Q lcl|NC_021307. 1 MAAGTAFPVNHTQIA--Q---TGDSMFQGYLEPE--QA-QDYFAEAEKTSIVQRVARKIP---MGSTGVKIPHWTGDVSA 69 (310) Q Consensus 1 ~aa~~~~~~~~~~~~--~---~~~~~~g~~i~~~--~~-~~ii~~~~~~s~l~~~~~~~~---~~~~~~~ip~~~~~~~a 69 (310) ||=--..+.+..... . .....+|.++..+ .+ ..+++...+.-..+++..+.. ....++.++.....+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~G~a 80 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGVGIA 80 (314) T ss_pred CccchHHHHHHHHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeeeccccce Confidence 443222222211111 1 1122334455543 23 336666666655555555432 22335667777777788 Q ss_pred eeeccc-ccccccccceeeeEeeeeeeEeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccc Q lcl|NC_021307. 70 AWIGEG-DMKPITKGDMSVQQVEPHKIATIFVASAETVRAN---PGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNL 145 (310) Q Consensus 70 ~~v~Eg-~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s---~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~ 145 (310) .|.+.+ ..+|..+..+++.....+.++..+.++..-++.+ ..++...-....++++.+.+|+.+|.|+....-.+. T Consensus 81 ~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GL 160 (314) T protein:vir:10 81 QIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSAPHGIVSV 160 (314) T ss_pred eeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeE Confidence 998764 5589889999999999999999999987777654 457888888999999999999999999765432233 Q ss_pred ccccccccc-eecccchHHHHHHH---HHHHhhh---hcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccC Q lcl|NC_021307. 146 DETTKSVDL-TPATGTTYDAIGVN---ALSLLVN---AGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYR 218 (310) Q Consensus 146 ~~~~~~~~~-~~~~~~~~~~~~~~---~~~~l~~---~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~ 218 (310) +...+.... ......+.+....+ ++..+.. ....+..++++|..+..|....+..|.-++.-- .-.+. T Consensus 161 lN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~tvl~~l-----~~n~~ 235 (314) T protein:vir:10 161 FDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLSYGELF-----TRNNP 235 (314) T ss_pred eecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCccHHHHH-----HHhCC Confidence 221110000 11122333333333 3333332 224466899999988877543333332221100 00123 Q ss_pred CceeeeeeEEEeCCCCCCceeEee--ecceeeeEEeecccEEEEeecceeeecccccccchhhhhcC--cEEEEEEEEe- Q lcl|NC_021307. 219 EGRILGRPTILSDHVASGTTVGYL--GDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHN--LVAVRVEAEY- 293 (310) Q Consensus 219 ~~~l~G~pv~~t~~~~~~~~~~~~--gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~r~~~~~- 293 (310) +-+|.+.|-..+.... ++..+++ -+...+-+.....++.... +.. ...+....|+ T Consensus 236 ~l~I~~~~el~~ag~~-g~~~~v~y~~~~~~~~~~vp~~~~~l~~-------------------e~~~~~~~~~~~~r~~ 295 (314) T protein:vir:10 236 GLTIRFLQFLDNYDGA-GGKAALAFEKSPLNMSIEIPEVTNVLPA-------------------QPKDLHFRYPVTSKAT 295 (314) T ss_pred CcEEEEcccccccCCC-cceEEEEEecCCcEEEEecCccceeecc-------------------eecCceEEEcceeeeE Confidence 4456666665544322 2222221 2222222222222221110 111 1233345566 Q ss_pred ccEEeccCceEEEeecC Q lcl|NC_021307. 294 GLLINDVEAFVKLTNAA 310 (310) Q Consensus 294 d~~v~~~~a~~~l~~aa 310 (310) |..+.+|.|++.+.+-- T Consensus 296 Gv~i~~P~ai~~~dGI~ 312 (314) T protein:vir:10 296 GLIVYRPLTMAVIKGIT 312 (314) T ss_pred EEEEECcceeEeeeeee Confidence 56889999999777766 No 169 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=98.13 E-value=2.8e-06 Score=50.99 Aligned_cols=272 Identities=13% Similarity=-0.006 Sum_probs=131.6 Q ss_pred ccCCCCcee-chhhHHHHHHHHHhhchhhhhcceee-----cCCCceEEEEEcCCceeeeecccccccccccceeeeEee Q lcl|NC_021307. 18 GDSMFQGYL-EPEQAQDYFAEAEKTSIVQRVARKIP-----MGSTGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQVE 91 (310) Q Consensus 18 ~~~~~g~~i-~~~~~~~ii~~~~~~s~l~~~~~~~~-----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l~ 91 (310) .......++ |..++.++++.+++.+++.+++..-. -.+++++||+.. ..-+.++..+.-.+.+-+++++. T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~----~~~v~dg~~~~~~~~te~~v~l~ 76 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPY----RVKSASGRTLVKQPMVDQTIPFK 76 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCC----ceeecccCCccccccccceEEEE Confidence 112222355 44567899999999999888876522 124678888732 12233455555555565666666 Q ss_pred e-eeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHHH Q lcl|NC_021307. 92 P-HKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNAL 170 (310) Q Consensus 92 ~-~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 170 (310) . +..+..+.++++-..++..++.+.+.+....+++..+|+.++.--. +.+ ............+ +.+.++. T Consensus 77 id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~-~a~-------~~~gt~gt~~~~~-~~i~~a~ 147 (418) T protein:vir:10 77 IAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLK-KAF-------HSSGTPGVRPGAF-IDFANAG 147 (418) T ss_pred EecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHh-hcc-------cccccCCcCcchH-HHHHHHH Confidence 6 4455666777776667788998899999999999999998774211 111 1111111111223 4445677 Q ss_pred HHhhhhcCCC-C-E-EEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecce- Q lcl|NC_021307. 171 SLLVNAGKKW-G-A-TLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFS- 246 (310) Q Consensus 171 ~~l~~~~~~~-~-~-~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~- 246 (310) ..|....... . + .+++|..+..|.+ +.. ..+...........+.-+++.|..|+.++++|..+. |.+. T Consensus 148 ~~Ld~~~VP~~G~R~lVv~P~~~~~L~~--~~~--~~~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~ta----g~~~~ 219 (418) T protein:vir:10 148 AKQTTYAVPQDGMRHAVLDPFTCASLSD--EVT--KLFKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTV----GDHGG 219 (418) T ss_pred HHHHhcCCCCCCceEEEeCHHHHHHHhh--hcc--ccccccccchhhheeeeeeeeceEEEEecCCCcccc----ccccc Confidence 7777766652 2 4 5688988877643 222 222222222223344457899999999999985331 1111 Q ss_pred -eeeEEe-ecccEEEEe-----ecceeeeccccc------ccchhhh-hcCcEEEEEEEEec------cEEecc------ Q lcl|NC_021307. 247 -QIVWGQ-VGGLSFDVS-----DQATLNLGTPQA------PNFVSLW-QHNLVAVRVEAEYG------LLINDV------ 300 (310) Q Consensus 247 -~~~~~~-~~~~~v~~~-----~~~~~~~~~~~~------~~~~~~~-~~~~~~~r~~~~~d------~~v~~~------ 300 (310) ..+.+- ..+-.+.+. ..+.+.-++... ..++... ..+...|++..-+. ..|.-. T Consensus 220 t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~ 299 (418) T protein:vir:10 220 TPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISPSLNDG 299 (418) T ss_pred ceeeecccccceeEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEeccccccc Confidence 111111 112222111 111111111100 0000000 00111122211110 011000 Q ss_pred ------CceEEEeecC Q lcl|NC_021307. 301 ------EAFVKLTNAA 310 (310) Q Consensus 301 ------~a~~~l~~aa 310 (310) ..+-++..++ T Consensus 300 ~~~~~~~~~~~~~~~~ 315 (418) T protein:vir:10 300 TATINNENGDPVSLTA 315 (418) T ss_pred cccccccccccccccC Confidence 0000011111 No 170 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=97.91 E-value=2.1e-06 Score=51.74 Aligned_cols=288 Identities=15% Similarity=0.063 Sum_probs=144.0 Q ss_pred Cc------cchhh----hHHHHHhhc-cccCCCCceechhhHHHHHHHHHhhc-hhhhhcceeecC-CCceEEEEEcCCc Q lcl|NC_021307. 1 MA------AGTAF----PVNHTQIAQ-TGDSMFQGYLEPEQAQDYFAEAEKTS-IVQRVARKIPMG-STGVKIPHWTGDV 67 (310) Q Consensus 1 ~a------a~~~~----~~~~~~~~~-~~~~~~g~~i~~~~~~~ii~~~~~~s-~l~~~~~~~~~~-~~~~~ip~~~~~~ 67 (310) || +|... ..+...++. .+|+..+.+|-...-..+++.-+..- ...+.+..-.++ -...+..+...-+ T Consensus 371 lAr~~L~~rg~~~~~~~~~~~~~~a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~ 450 (693) T protein:vir:95 371 LARASLVDRGIGVASLNAPQMVGLAFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGEFS 450 (693) T ss_pred HHHHHHHhcCCccCCCCHHHHHHHHHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCCCC Confidence 22 22211 122333333 44555544333222222322222222 244444433332 1112333344445 Q ss_pred eeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHc---ccCcccc-c Q lcl|NC_021307. 68 SAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALH---GTDSPFD-K 143 (310) Q Consensus 68 ~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~---G~g~~~~-~ 143 (310) +-.-|.|+++..-....=+.-++...++|..+.||++++-..+.++...|...++++.++.+++.++. +...-.. + T Consensus 451 ~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk 530 (693) T protein:vir:95 451 SLRQVREGAEYKYVTLGERGEQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGK 530 (693) T ss_pred ChhhcCCCCceeeeecCCccceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCc Confidence 55677888887655444455678899999999999999987789999999999999999999885553 2211111 1 Q ss_pred ccccccccccce-ecccchHHHHHHHHHHHh---h---------hhcCCCCEEEEehHHHHHHHHhhhccCccccccccc Q lcl|NC_021307. 144 NLDETTKSVDLT-PATGTTYDAIGVNALSLL---V---------NAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTY 210 (310) Q Consensus 144 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~l---~---------~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~ 210 (310) .+.....+.-.+ .+...+.+.+- .....+ . ..+..+..|++.+.......++-.+...+- .+.. T Consensus 531 ~LFhadH~Nl~tga~sals~~sl~-~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~~--a~~~ 607 (693) T protein:vir:95 531 TLFHADHSNLLTGAASALSIDSLS-KAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVPG--ADVN 607 (693) T ss_pred ceeeccccccccccccccChHHHH-HHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhccccccc--cccc Confidence 111111111111 11122222221 221111 1 123456677777777666666544432111 0111 Q ss_pred cccccccCCceeeee-eEEEeCCCCC--CceeEeeeccee--eeEEeecccEEEEeecceeeecccccccchhhhhcCcE Q lcl|NC_021307. 211 EAVTTPYREGRILGR-PTILSDHVAS--GTTVGYLGDFSQ--IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLV 285 (310) Q Consensus 211 ~~~~~~~~~~~l~G~-pv~~t~~~~~--~~~~~~~gd~~~--~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (310) .+.. ..+.|+ .++.++.+.+ ++...++.|... +.++...| ...+....-.-|..|-+ T Consensus 608 ~~~~-----NP~~~~~~vi~~prL~~~s~~~Wyl~a~~~~dtie~~yL~G-------------~~~P~ie~~~gf~~dG~ 669 (693) T protein:vir:95 608 SGIV-----NPIRAFAQVIGEPRLDDASATAWYMAAKKGSDTIEVAYLDG-------------VDTPYLEQQEGFTVDGV 669 (693) T ss_pred cccc-----cchhccccccccceecCCCCCceEEecCCCCCeEEEEEecC-------------CCCCeEeecCCCCcceE Confidence 1111 123332 4555555532 223333333221 22222222 11111111123889999 Q ss_pred EEEEEEEeccEEeccCceEEEeec Q lcl|NC_021307. 286 AVRVEAEYGLLINDVEAFVKLTNA 309 (310) Q Consensus 286 ~~r~~~~~d~~v~~~~a~~~l~~a 309 (310) .+|+...+|.++.+--.+.|-.+| T Consensus 670 ~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 670 ASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred EEEEEEeccCceeeccccccCCCC Confidence 999999999999999999888888 No 171 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=97.79 E-value=1.9e-05 Score=46.50 Aligned_cols=284 Identities=11% Similarity=0.043 Sum_probs=151.4 Q ss_pred ccchhhh----------HHHHHhhc-ccc----CCCCceechh--hH-HHHHHHHHhhchhhhhccee---ecCCCceEE Q lcl|NC_021307. 2 AAGTAFP----------VNHTQIAQ-TGD----SMFQGYLEPE--QA-QDYFAEAEKTSIVQRVARKI---PMGSTGVKI 60 (310) Q Consensus 2 aa~~~~~----------~~~~~~~~-~~~----~~~g~~i~~~--~~-~~ii~~~~~~s~l~~~~~~~---~~~~~~~~i 60 (310) .+|.-+. ......+. .+. ...+.++..+ .+ ..+++...+....+++.... +....++.+ T Consensus 1 ~~~~~~~~~~~~d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~ 80 (329) T protein:vir:79 1 MRGNIMSKEMKYDEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEY 80 (329) T ss_pred CccchhhhhhccchhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEe Confidence 2221111 11111111 111 1123344432 23 44777777777777766543 333345667 Q ss_pred EEEcCCceeeeecc-cccccccccceeeeEeeeeeeEeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHHHHHcc Q lcl|NC_021307. 61 PHWTGDVSAAWIGE-GDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN---PGNYLGTMRTKVATAIALAFDEAALHG 136 (310) Q Consensus 61 p~~~~~~~a~~v~E-g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s---~~~~~~~v~~~l~~a~~~~~d~~~l~G 136 (310) ......+.+.|.+. ...+|..+..+++.....+.++..+.++..-++.+ ..++...-....++++++.+|+-+|+| T Consensus 81 ~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G 160 (329) T protein:vir:79 81 QTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKG 160 (329) T ss_pred eeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEee Confidence 77777778899875 46788888888888888899999988887777654 457888888999999999999999999 Q ss_pred cCccccccccccccccccee-------cccchHHHHHH---HHHHHhhhh---cCCCCEEEEehHHHHHHHHhhhccCcc Q lcl|NC_021307. 137 TDSPFDKNLDETTKSVDLTP-------ATGTTYDAIGV---NALSLLVNA---GKKWGATLLDDVAEPILNGAKDANGRP 203 (310) Q Consensus 137 ~g~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~---~~~~~l~~~---~~~~~~~~~~~~~~~~l~~l~d~~g~~ 203 (310) +..-.-.+.+...+...... -...+.+.... .++..+... ...+..++++|+.+..|.......|.. T Consensus 161 ~~~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~~~~~t 240 (329) T protein:vir:79 161 SKPHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMPETTMS 240 (329) T ss_pred cccccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccCCCCcc Confidence 76533323222111111010 11123333333 333333322 234678999999998886544444433 Q ss_pred ccccccccccccccCCceeeeeeEEEeCCCCCCceeEee--ecceeeeEEeecccEEEEeecceeeecccccccchhhhh Q lcl|NC_021307. 204 LFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYL--GDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQ 281 (310) Q Consensus 204 ~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~--gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 281 (310) +..--.. .+.+.+|.+.|-..+.. ..++..+++ -+...+-+.....+..... + T Consensus 241 vl~~lk~-----~~~~l~I~~~~el~~ag-~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~-------------------q 295 (329) T protein:vir:79 241 YLDYFKQ-----QNGGITIESISELEDID-GAGTKAALVYEKDPMNMSIEIPEAFNMLTA-------------------Q 295 (329) T ss_pred HHHHHHH-----hCCCcEEEEcccccccC-CCCceEEEEEecCCceEEEecCcceeeeec-------------------e Confidence 3211000 11233566655544332 122222222 1222222221122221110 1 Q ss_pred cCc--EEEEEEEEec-cEEeccCceEEEeecC Q lcl|NC_021307. 282 HNL--VAVRVEAEYG-LLINDVEAFVKLTNAA 310 (310) Q Consensus 282 ~~~--~~~r~~~~~d-~~v~~~~a~~~l~~aa 310 (310) ... .......|++ ..+.+|.||+.+.+-- T Consensus 296 ~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~ 327 (329) T protein:vir:79 296 PKDLHFKVPCTSKCTGLTIYRPLTLVLIKGLV 327 (329) T ss_pred ecCceEEEceeeeEEEEEEECcceeeeeeeee Confidence 111 2333455664 6888999999999888 No 172 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=97.65 E-value=3.3e-05 Score=45.16 Aligned_cols=276 Identities=11% Similarity=0.001 Sum_probs=119.3 Q ss_pred CCCce---echhhHHHHHHHHHhhchhhhhcce-e---e-c--CCCceEEEEEcCCceeeee-cccccccccccceee-- Q lcl|NC_021307. 21 MFQGY---LEPEQAQDYFAEAEKTSIVQRVARK-I---P-M--GSTGVKIPHWTGDVSAAWI-GEGDMKPITKGDMSV-- 87 (310) Q Consensus 21 ~~g~~---i~~~~~~~ii~~~~~~s~l~~~~~~-~---~-~--~~~~~~ip~~~~~~~a~~v-~Eg~~~~~~~~~~~~-- 87 (310) +..-+ +|..++.+.++.+++..++.+++.. . . . .+.+++|++.......... ..+..+...+..-.+ T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e~~v~ 80 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCccccceeE Confidence 22222 4666778999999999998888776 2 1 1 3567888765322222222 233333333444444 Q ss_pred eEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHH Q lcl|NC_021307. 88 QQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGV 167 (310) Q Consensus 88 i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 167 (310) +++..+|.-++--=..| +..+..++++++.+. .++++.++|+.++.--... .....+ ......+..+.+. T Consensus 81 l~id~~k~va~~v~d~E-~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~-~~~~~g-------t~~t~~~a~~~i~ 150 (423) T protein:vir:10 81 GRVGNYITVAVEYQQLE-EAIKLNQLEEILAPV-RQRIVTDLETELAHFMMNN-GALSLG-------SPNTPITKWSDVA 150 (423) T ss_pred EEeeceeeeeeeechHH-HhcChhhHHHHHHHH-HHHHHHHHHHHHHHHHhhc-cccccc-------cCCcccchHHHHH Confidence 44555555444443444 445567787766655 5889999999987531111 111111 1111112234445 Q ss_pred HHHHHhhhhcCC--CCEEEEehHHHHHHHHhhhccCccccccccccccccc--cCCceeeeeeEEEeCCCCCCceeEeee Q lcl|NC_021307. 168 NALSLLVNAGKK--WGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTP--YREGRILGRPTILSDHVASGTTVGYLG 243 (310) Q Consensus 168 ~~~~~l~~~~~~--~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~--~~~~~l~G~pv~~t~~~~~~~~~~~~g 243 (310) ++...|...... .-..+++|..+..|.+-. +.+.......+...- ...+++.|+.++.++++|..+....-+ T Consensus 151 ~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~----~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~ 226 (423) T protein:vir:10 151 QTASFLKDLGVNEGENYAVMDPWSAQRLADAQ----TGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGG 226 (423) T ss_pred HHHHHHHhccCCcCCCEEEeChHHHHHHhccc----cceecccccchhhhhhccceeeecceEEEEeCCCcccccccccc Confidence 677777665543 446788999888776321 111111111111111 123689999999999999643211000 Q ss_pred c----ceeee----EEeecccEEEEe-----ecceeeeccccc---ccchhh---------hhcCcEEEEEEEEe----- Q lcl|NC_021307. 244 D----FSQIV----WGQVGGLSFDVS-----DQATLNLGTPQA---PNFVSL---------WQHNLVAVRVEAEY----- 293 (310) Q Consensus 244 d----~~~~~----~~~~~~~~v~~~-----~~~~~~~~~~~~---~~~~~~---------~~~~~~~~r~~~~~----- 293 (310) - +...+ ........+.+. ..+.+..++... ...++. -+-..-.|++.... T Consensus 227 t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~ 306 (423) T protein:vir:10 227 TLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSG 306 (423) T ss_pred ceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEeeeeeccC Confidence 0 00000 001111111111 011111111100 000000 00011122222111 Q ss_pred -ccEEec-c-----Cce-----EEEeecC Q lcl|NC_021307. 294 -GLLIND-V-----EAF-----VKLTNAA 310 (310) Q Consensus 294 -d~~v~~-~-----~a~-----~~l~~aa 310 (310) +..+.- | .++ +.-..|+ T Consensus 307 g~~tv~i~p~~i~~~~~~~~~~v~a~~a~ 335 (423) T protein:vir:10 307 GDVTVTLSGVPIYDTTNPQYNSVSRQVEA 335 (423) T ss_pred CceeeeccCccccccCCcccccccccccC Confidence 011110 0 000 0000011 No 173 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=97.63 E-value=2.3e-05 Score=46.00 Aligned_cols=299 Identities=13% Similarity=0.084 Sum_probs=156.3 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhh-HHHHHHHHHhhchhhhhcceeecCC---CceEEEEEcCCceee-eecc- Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQ-AQDYFAEAEKTSIVQRVARKIPMGS---TGVKIPHWTGDVSAA-WIGE- 74 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~-~~~ii~~~~~~s~l~~~~~~~~~~~---~~~~ip~~~~~~~a~-~v~E- 74 (310) |...++.+. ...++++++.+-.+-..+ ....+..+++...+.+++...|++. .++++.+...-+.+. --.| T Consensus 1 ~~~~~a~~~---~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eG 77 (401) T protein:vir:95 1 MLNYNAPTD---GQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQG 77 (401) T ss_pred CCccCCCcc---cccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcC Confidence 444433322 223333333333333323 3556666677788999999999874 344544443332221 1122 Q ss_pred ----cccc-----------------------------cccccceeeeEeeeeeeEeeehhhHHHhh-cChhHHHHHH-HH Q lcl|NC_021307. 75 ----GDMK-----------------------------PITKGDMSVQQVEPHKIATIFVASAETVR-ANPGNYLGTM-RT 119 (310) Q Consensus 75 ----g~~~-----------------------------~~~~~~~~~i~l~~~k~~~~~~is~ell~-~s~~~~~~~v-~~ 119 (310) |.++ .....+-..+..+.+++|.++++|+++.+ +++.++.+.+ .+ T Consensus 78 v~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~e 157 (401) T protein:vir:95 78 IDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRE 157 (401) T ss_pred CCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHH Confidence 2211 01112223466678999999999999887 4556777655 34 Q ss_pred HHHHHH---HHHHHHHHHcccCcccccccc--cccccccceecccchHHHHHHHHHHHhhh-------------h----- Q lcl|NC_021307. 120 KVATAI---ALAFDEAALHGTDSPFDKNLD--ETTKSVDLTPATGTTYDAIGVNALSLLVN-------------A----- 176 (310) Q Consensus 120 ~l~~a~---~~~~d~~~l~G~g~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~l~~-------------~----- 176 (310) .|..+. ...+-+.+|++-++---.+.. ..+......+.+..+.+++. .+...|.. . T Consensus 158 ll~g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~-rl~~~L~~nRapk~t~~i~~s~~~dTk 236 (401) T protein:vir:95 158 LMNGATQITEAVLQKDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLM-RLDQILTENRTPTQTTIITGSRMIDTK 236 (401) T ss_pred HhhhhhhhHHHHHHHHHHhhcCeeecCCccceeeeccccccccceechhHHH-HHHHHHHhcccccchhhhhhhhccCcc Confidence 444443 334455677544321111111 11111122222333444433 33333321 0 Q ss_pred -cCCCCEEEEehHHHHHHHHhhhccCccccccccccc---cccccCCceeeeeeEEEeCCC--------CCCc------- Q lcl|NC_021307. 177 -GKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEA---VTTPYREGRILGRPTILSDHV--------ASGT------- 237 (310) Q Consensus 177 -~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~---~~~~~~~~~l~G~pv~~t~~~--------~~~~------- 237 (310) ....-.-+||+.....|+.++|-.|.+-|.+-.--+ ....+.-+.+-++++++++.+ +.+. T Consensus 237 ~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~ 316 (401) T protein:vir:95 237 VIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRT 316 (401) T ss_pred ccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCccccccccccccc Confidence 111223678999999999999988888887655433 333445577889999988763 2211 Q ss_pred ------------eeEeeecceeeeEEeec-cc----EEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecc Q lcl|NC_021307. 238 ------------TVGYLGDFSQIVWGQVG-GL----SFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDV 300 (310) Q Consensus 238 ------------~~~~~gd~~~~~~~~~~-~~----~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~ 300 (310) ..+++|+-....++..+ +. .+.+ +.++. .+.+..+| +-|++.+.|++ .+++.+.++ T Consensus 317 ~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~iv-k~pG~--~~ad~~DP--lgQ~g~vgwK~--~~a~~vL~~ 389 (401) T protein:vir:95 317 SMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMT-KMPGK--ETADRNDP--YGETGFSSIKW--YYGILVKRP 389 (401) T ss_pred ccccCCCcceeeeeeEEccccceecccccCCccccceeEe-ecCCc--CCCCCCCc--ccceehhhhhh--hhhhheecc Confidence 11123332222222211 11 2211 22221 12233444 34777888875 678889999 Q ss_pred CceEEEeecC Q lcl|NC_021307. 301 EAFVKLTNAA 310 (310) Q Consensus 301 ~a~~~l~~aa 310 (310) +-.++|+-+| T Consensus 390 e~m~~ies~a 399 (401) T protein:vir:95 390 ERLALIKTVA 399 (401) T ss_pred ceeEEEEeec Confidence 9999999999 No 174 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=97.48 E-value=5.8e-05 Score=43.81 Aligned_cols=275 Identities=10% Similarity=-0.011 Sum_probs=120.7 Q ss_pred ccccCCCCceechhhHHHHHHHHHhhchhhhhcc-------eeecCCCceEEEEEcCC---c-eeeeecccccccccccc Q lcl|NC_021307. 16 QTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVAR-------KIPMGSTGVKIPHWTGD---V-SAAWIGEGDMKPITKGD 84 (310) Q Consensus 16 ~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~-------~~~~~~~~~~ip~~~~~---~-~a~~v~Eg~~~~~~~~~ 84 (310) ++-+..- +-.|......+|.+.+.....+.+. ..+..+.-+.+|.+..- . +..-+.+...++..+.+ T Consensus 1 m~lsD~~--vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~kit 78 (325) T protein:vir:95 1 MALSDLA--VYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKVLK 78 (325) T ss_pred Cchhhhh--hhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCceeccceec Confidence 1111110 1234455556666666555444322 13334555667776431 1 22223344445545544 Q ss_pred -eeeeEeeeeeeEeeehhhHHHhh---cChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccccccccee--cc Q lcl|NC_021307. 85 -MSVQQVEPHKIATIFVASAETVR---ANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTP--AT 158 (310) Q Consensus 85 -~~~i~l~~~k~~~~~~is~ell~---~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~--~~ 158 (310) ..++.....+-.+......+... +....+.+.|.++++++..+.+-+.++.+........ ......+.... .. T Consensus 79 t~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~-~~~v~dis~~~~~~~ 157 (325) T protein:vir:95 79 HLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQV-SDVVYDATANTDAAD 157 (325) T ss_pred cccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-ccceeeeecccCccc Confidence 44455555554443333333322 2233445556666666655555455544332211100 00001111111 11 Q ss_pred cchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCce Q lcl|NC_021307. 159 GTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTT 238 (310) Q Consensus 159 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~ 238 (310) .......+.+...++-+....-..|+||..++..|.++.-.+...++...... .-.+++|+||++++.+|.... T Consensus 158 ~~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~~------~i~t~~G~~VIVdD~~p~~~~ 231 (325) T protein:vir:95 158 KLPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTVN------VVRDPFGKLLVMTDSPNLFAA 231 (325) T ss_pred ccccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhccccccccccCCcc------cccccCCcEEEEeCCCCCCCc Confidence 11122344567777888888899999999999999876655444443322211 113678999999999986531 Q ss_pred eEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeec---C Q lcl|NC_021307. 239 VGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNA---A 310 (310) Q Consensus 239 ~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~a---a 310 (310) - --+-+.-|.++ .+.+.+....+....- ... ..-++-...+|++.. -+.||..+.--+-. . T Consensus 232 g-~~~~ytty~lg-~GAi~~~~~~~~~~~~--~~~----~~~~~~~~~~~~~~t---f~lhp~G~sw~~s~~g~s 295 (325) T protein:vir:95 232 G-TPNVYHILGLV-PGGVLIGQNNDFDANE--ETK----NGDENIIRTYQAEWS---YNIGVKGFAWDKANGGKS 295 (325) T ss_pred c-CceeEEEEEEe-cCeEEecCCCCccccc--ccc----Ccccceeeeeeeeee---EEeecceeeeecccccCC Confidence 1 00111112222 1333333222211111 000 011122223332211 24455554442111 1 No 175 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=97.31 E-value=9.8e-05 Score=42.56 Aligned_cols=273 Identities=10% Similarity=0.001 Sum_probs=120.6 Q ss_pred hccccCCCCceechhhHHHHHHHHHhhchhhhhcceee-----c--CCCceEEEEEcCCceee-ee-cccccccccccce Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIP-----M--GSTGVKIPHWTGDVSAA-WI-GEGDMKPITKGDM 85 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~-----~--~~~~~~ip~~~~~~~a~-~v-~Eg~~~~~~~~~~ 85 (310) |... ---.+|..++.+.++.+++..++.+++..-. . .+.+++|++.. ...+. .. ..+..+..++..- T Consensus 1 MaN~---llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~-~~~~~~~~~~~~~~~~~~~l~e 76 (423) T protein:vir:17 1 MPNN---LDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPH-QFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Cccc---hhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCC-cceeecccCcccCCcccCcccc Confidence 1111 0012566778899999999999888876522 1 35678888632 22221 11 2333333344433 Q ss_pred ee--eEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHH Q lcl|NC_021307. 86 SV--QQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYD 163 (310) Q Consensus 86 ~~--i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (310) .+ +++..+|.-++ .++++-......++++++... .++++..+|+.++.---...+ ...+ +.....+.. T Consensus 77 ~~v~l~id~~k~va~-~v~d~E~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~a~-~~~g-------t~~t~~~a~ 146 (423) T protein:vir:17 77 GKATGRVGNYITVAV-EYQQLEEAIKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGA-LSLG-------SPNTPITKW 146 (423) T ss_pred ceeEEEeeceeeeee-eecHHHHhcChhHHHHHHHHH-HHHHHHHHHHHHHHHHhhccc-cccc-------cCCcccccH Confidence 44 44444554444 455544455667787766655 588999999988743211111 1011 111111223 Q ss_pred HHHHHHHHHhhhhcCC--CCEEEEehHHHHHHHHhhhccCccccccccccc-cccc-cCCceeeeeeEEEeCCCCCCcee Q lcl|NC_021307. 164 AIGVNALSLLVNAGKK--WGATLLDDVAEPILNGAKDANGRPLFVESTYEA-VTTP-YREGRILGRPTILSDHVASGTTV 239 (310) Q Consensus 164 ~~~~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~-~~~~-~~~~~l~G~pv~~t~~~~~~~~~ 239 (310) +.+.++...|...... .-..+++|..+..|.+-. +.++......+ .... ...+++.|+.++.++++|..+.. T Consensus 147 ~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~----~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~g 222 (423) T protein:vir:17 147 SDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQ----TGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQG 222 (423) T ss_pred HHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccc----cceecccccchHHHhhccceeeecceEEEEeCCCcccccc Confidence 4445677777665543 446788999888876311 11111111111 1111 12368999999999999965321 Q ss_pred EeeecceeeeEEeecccEEEEeecce-----eeecccccccchhhhhcCcEEEEEE---EEe------ccEEeccCceEE Q lcl|NC_021307. 240 GYLGDFSQIVWGQVGGLSFDVSDQAT-----LNLGTPQAPNFVSLWQHNLVAVRVE---AEY------GLLINDVEAFVK 305 (310) Q Consensus 240 ~~~gd~~~~~~~~~~~~~v~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~r~~---~~~------d~~v~~~~a~~~ 305 (310) . ++- + ........+.-....+.. +...+...... +-..|.+.|-.. .+. ++.-.+..-|.+ T Consensus 223 t-~~~-t-~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~--l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v 297 (423) T protein:vir:17 223 A-FGG-T-LTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGF--LKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATV 297 (423) T ss_pred c-eec-e-eeecccccccccccccccceeeeeeeeeeeccCc--eeecceEEecceeeecccccccccccccccceEEEE Confidence 1 110 0 000000000000000000 00000000000 111222222111 111 111222334443 Q ss_pred Eeec---C Q lcl|NC_021307. 306 LTNA---A 310 (310) Q Consensus 306 l~~a---a 310 (310) ...+ | T Consensus 298 ~~~~~~~a 305 (423) T protein:vir:17 298 TADANSDS 305 (423) T ss_pred Eecccccc Confidence 3211 1 No 176 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=97.31 E-value=9.9e-05 Score=42.54 Aligned_cols=259 Identities=10% Similarity=-0.001 Sum_probs=120.0 Q ss_pred hccccCCCCceechhhHHHHHHHHHhhchhhhhcceee-----c--CCCceEEEEEcCCceeee-eccccccccccccee Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIP-----M--GSTGVKIPHWTGDVSAAW-IGEGDMKPITKGDMS 86 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~-----~--~~~~~~ip~~~~~~~a~~-v~Eg~~~~~~~~~~~ 86 (310) |... ---.+|..++.+.++.+++..++.+++..-. . .+.+++||+......... .+.+..+..++..-. T Consensus 1 MAN~---llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e~ 77 (423) T protein:vir:35 1 MANN---LESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFSA 77 (423) T ss_pred Cccc---hhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCccccccccc Confidence 1110 0113566778999999999999999877622 1 156778887532211112 122333333444444 Q ss_pred eeEeeeee-eEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHH Q lcl|NC_021307. 87 VQQVEPHK-IATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAI 165 (310) Q Consensus 87 ~i~l~~~k-~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (310) ++.+...+ .+..+.++++-..++..++++++...+ .+++.++|..++..--.+.+.. .++ ........+. T Consensus 78 ~v~l~id~~k~~a~~v~d~e~~l~i~~~~~~l~~a~-~ala~~vd~~l~~~l~~~a~~~-vgt-------~~t~~~~~~~ 148 (423) T protein:vir:35 78 KATGKVGKYITVAVEWTQIEEALKLNQLDQILSPIH-ERMVTDLETELAHFMMNNGALS-LGS-------PNTAIKKWAD 148 (423) T ss_pred eeeEEeccceeccceeCHHHHHhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccccc-ccc-------ccCCcchHHH Confidence 45455522 334555666655656778888777665 7788899998875322221111 111 1111122344 Q ss_pred HHHHHHHhhhhcCC--CCEEEEehHHHHHHHHhhhccCccccccccccccccc-cCCceeeeeeEEEeCCCCCCceeEee Q lcl|NC_021307. 166 GVNALSLLVNAGKK--WGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTP-YREGRILGRPTILSDHVASGTTVGYL 242 (310) Q Consensus 166 ~~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~-~~~~~l~G~pv~~t~~~~~~~~~~~~ 242 (310) +.++...|...... +-..+++|..+..|.+- ..+.............. ...+++.|+.++.|+++|..+. T Consensus 149 i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~---~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~---- 221 (423) T protein:vir:35 149 VAQTASFIKDIGIKTGENYAIMDPWSAQRLADA---QSGLHAADQLVRTAWENAQISGNFGGIRALMSNGLASRKQ---- 221 (423) T ss_pred HHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhcc---ccceeccccchhHHHhhccceeeecceEEEEcCCCccccc---- Confidence 55777777666554 33458899988887531 11111111111111111 1236899999999999996431 Q ss_pred ecceeeeE------------EeecccEEEE-----eecceeeecccccccchhhhhcCcEEEEEEEEeccEEecc----- Q lcl|NC_021307. 243 GDFSQIVW------------GQVGGLSFDV-----SDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDV----- 300 (310) Q Consensus 243 gd~~~~~~------------~~~~~~~v~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~----- 300 (310) |.+..... ...+...+.+ ...+.+. ..|.+.|- |....++ T Consensus 222 gt~~~~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~-------------~GD~~t~a-----Gv~~v~~~t~~~ 283 (423) T protein:vir:35 222 GDFDGAITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGFLK-------------AGDQLKFT-----STHWLNQQSKQT 283 (423) T ss_pred cccccceeeccccccccccccccccceeeeeeeeeccCCcEE-------------ecceEEee-----eeeeccccccce Confidence 11111110 0001111100 0011111 12222111 1111111 Q ss_pred ---------CceEEE---------------eecC Q lcl|NC_021307. 301 ---------EAFVKL---------------TNAA 310 (310) Q Consensus 301 ---------~a~~~l---------------~~aa 310 (310) .-|+++ .++- T Consensus 284 ~~~~~t~~~~~~~V~~~~~~~a~g~~~v~i~p~~ 317 (423) T protein:vir:35 284 LYNGSTAMSFTATVLEETNSTASGDVTVKLSGVP 317 (423) T ss_pred eecccCCceeEEEEeccccccccCceeEEccccc Confidence 111111 1110 No 177 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=96.97 E-value=0.00023 Score=40.50 Aligned_cols=263 Identities=9% Similarity=-0.057 Sum_probs=113.7 Q ss_pred hccccCCCCceechhhHHHHHHHHHhhchhhhhcceee-----c--CCCceEEEEEcCCceeeeeccc--ccccccccce Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIP-----M--GSTGVKIPHWTGDVSAAWIGEG--DMKPITKGDM 85 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~-----~--~~~~~~ip~~~~~~~a~~v~Eg--~~~~~~~~~~ 85 (310) |. ..-..++|.-++.++++.+++..++.+++..-. . .+.+++||+-... .+.-...+ ...+..+..- T Consensus 1 MA---Nsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~-~~~d~~~~~~t~~~~~~l~e 76 (423) T protein:vir:10 1 MA---NNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQF-KSERTMDGDITGKSKNSLIS 76 (423) T ss_pred Cc---cccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCce-eeecccCcccCccccccccc Confidence 11 112225666778999999999999999877622 1 2567888764321 11111111 1111112222 Q ss_pred ee--eEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHH Q lcl|NC_021307. 86 SV--QQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYD 163 (310) Q Consensus 86 ~~--i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (310) ++ +++..+|... +.++++-+..+..++++++... .++++..+|+.+.......... ..+. +....+.. T Consensus 77 ~~v~l~id~~k~~a-~~v~d~E~~l~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~~-~vgt-------~~t~~~a~ 146 (423) T protein:vir:10 77 AKATGEVGNYITVA-VEYRQIEEALKLNQLDQILVPI-NERMVTDLETELALFMMKHGAL-SLGS-------PNTPIKKW 146 (423) T ss_pred ceEEEEecceeeee-eeeChHHHhcChhHHHHHHHHH-HHHHHHHHHHHHHHHhhhcccc-cccc-------cccccccH Confidence 33 4444444444 4454444456678887766555 6899999999886432221111 1111 11111222 Q ss_pred HHHHHHHHHhhhhcCC--CCEEEEehHHHHHHHH-h---hhccCccccccccccccccccCCceeeeeeEEEeCCCCCCc Q lcl|NC_021307. 164 AIGVNALSLLVNAGKK--WGATLLDDVAEPILNG-A---KDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGT 237 (310) Q Consensus 164 ~~~~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~-l---~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~ 237 (310) +.+.++...|...... .-..+++|..+..|.+ + ...++ .....-......+++.|..++.++++|..+ T Consensus 147 ~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~------~~~~alr~~~i~G~~~GFdi~~Sn~vp~~T 220 (423) T protein:vir:10 147 SDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQ------LVRTAWENAQISGNFGGIRALMSNGLASRT 220 (423) T ss_pred HHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccc------cchHHHHhcccceeecceEEEEecCCcccc Confidence 3445666666655443 4567889998888753 1 11111 000111111233689999999999998421 Q ss_pred e--eEeeecceeeeEEeec--------c---cEEEEeecceeeecccccccchhhhhcCcEEEEE---EEEeccEE---- Q lcl|NC_021307. 238 T--VGYLGDFSQIVWGQVG--------G---LSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRV---EAEYGLLI---- 297 (310) Q Consensus 238 ~--~~~~gd~~~~~~~~~~--------~---~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~---~~~~d~~v---- 297 (310) . .....--+..+..... + .....+..+.+..++ .+.|-. .-++...+ T Consensus 221 ~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD-------------~~t~aGv~~v~~~tk~~l~~~ 287 (423) T protein:vir:10 221 QGAFGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGD-------------QLQFDDTHWLNQQSKQTLYNG 287 (423) T ss_pred cccccceeeeeeeeEEEecccccccccccceeeccceeceeEEecc-------------eEeecceeeecccccceeecc Confidence 1 0000000111100000 0 001111111111111 111100 00111110 Q ss_pred --eccCceEEEeec---C Q lcl|NC_021307. 298 --NDVEAFVKLTNA---A 310 (310) Q Consensus 298 --~~~~a~~~l~~a---a 310 (310) .+..-|.+...+ | T Consensus 288 ~~~~~~~~~V~~~~~~~a 305 (423) T protein:vir:10 288 ASALSFTATVMEDANAHS 305 (423) T ss_pred cCCcceEEEEEecccccc Confidence 011112221110 1 No 178 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=96.93 E-value=0.0002 Score=40.88 Aligned_cols=272 Identities=11% Similarity=-0.010 Sum_probs=136.5 Q ss_pred CCCCceechhh--H-HHHHHHHHhhchhhhhcce---eecCCCceEEEEEcCCceee--eec-ccccccccccceeeeEe Q lcl|NC_021307. 20 SMFQGYLEPEQ--A-QDYFAEAEKTSIVQRVARK---IPMGSTGVKIPHWTGDVSAA--WIG-EGDMKPITKGDMSVQQV 90 (310) Q Consensus 20 ~~~g~~i~~~~--~-~~ii~~~~~~s~l~~~~~~---~~~~~~~~~ip~~~~~~~a~--~v~-Eg~~~~~~~~~~~~i~l 90 (310) -++..++..|. + ..+.+...+.-..+++..+ .+....++.+...+..+.+. |.+ ....+|..+..+++... T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 22223344322 2 2244433344444444443 33334456666666556666 874 66788999999999999 Q ss_pred eeeeeEeeehhhHHHhhcCh---hHHHHHHHHHHHHHHHHHHHHHHHcccCccccc-ccccccccccc---------eec Q lcl|NC_021307. 91 EPHKIATIFVASAETVRANP---GNYLGTMRTKVATAIALAFDEAALHGTDSPFDK-NLDETTKSVDL---------TPA 157 (310) Q Consensus 91 ~~~k~~~~~~is~ell~~s~---~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~-~~~~~~~~~~~---------~~~ 157 (310) ..+..+..+..|.+-++.+. .++...-.+...+++...+|+..+.|+....+. +.+... .+.. ..- T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p-~v~~~~~~~~~a~~~w 159 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNK-SVEVYAIKGAAQNTKV 159 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCC-CcceeeecCCccCCcc Confidence 99999998888877776443 367777777778889999999999997532221 221111 1110 001 Q ss_pred ccchHHHHHHHHHHHhhh---h---cCCCCEEEEehHHHHHHHHhhhc-cCccccccccccccccccCCceeeeeeEEEe Q lcl|NC_021307. 158 TGTTYDAIGVNALSLLVN---A---GKKWGATLLDDVAEPILNGAKDA-NGRPLFVESTYEAVTTPYREGRILGRPTILS 230 (310) Q Consensus 158 ~~~~~~~~~~~~~~~l~~---~---~~~~~~~~~~~~~~~~l~~l~d~-~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t 230 (310) ...+.++++.++...+.. . ...+..++|.++.+..|....-+ .+.-++.--........+.+-.|.++|-... T Consensus 160 ~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~~~~g~~l~I~~v~~~~~ 239 (304) T protein:vir:52 160 QAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLSAAAGRQVAIKALPSNYG 239 (304) T ss_pred ccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhcccccCCcceEEEeccccc Confidence 122455555554443322 1 13466899999999888653222 2222210000000001112223444432222 Q ss_pred CCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEE--EEEEEecc-EEeccCceEEEe Q lcl|NC_021307. 231 DHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAV--RVEAEYGL-LINDVEAFVKLT 307 (310) Q Consensus 231 ~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--r~~~~~d~-~v~~~~a~~~l~ 307 (310) ..-..++..+++-+.+.=++...-.+.+..+ ....+|...+ =.+.|+++ .+++|.|++.+- T Consensus 240 ~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l----------------~~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D 303 (304) T protein:vir:52 240 TRVTDGKTRAMVYVNSKEHVIFDVPMSPTVL----------------DAQPKGLLAFESGLRMAFGGVTFMEPDSALYVD 303 (304) T ss_pred ccCCCCceEEEEEecChhheEEecCcccccc----------------chhhcCCceEEecceeeeeeEEEEccceeeeec Confidence 2222333333332222211111111111111 1133444333 24556654 788999999887 Q ss_pred e Q lcl|NC_021307. 308 N 308 (310) Q Consensus 308 ~ 308 (310) - T Consensus 304 ~ 304 (304) T protein:vir:52 304 Y 304 (304) T ss_pred C Confidence 7 No 179 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=96.60 E-value=0.00039 Score=39.30 Aligned_cols=275 Identities=14% Similarity=0.127 Sum_probs=146.9 Q ss_pred hccccCCCCceech-hhHHHHHHHHHhhchhhhhcc-eeecC-CCceEEEEEcCCceeeeecccccccccccceeeeEee Q lcl|NC_021307. 15 AQTGDSMFQGYLEP-EQAQDYFAEAEKTSIVQRVAR-KIPMG-STGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQVE 91 (310) Q Consensus 15 ~~~~~~~~g~~i~~-~~~~~ii~~~~~~s~l~~~~~-~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l~ 91 (310) ++.+ +....+|.. ++++.|...+.+..-=-...+ +...+ +..+.||.. +.+...-..|..+..-...+.++|++. T Consensus 1 ~~~T-SNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~ti-Gs~~~~~~~E~~~~~~~~i~TGEIt~~ 78 (313) T protein:vir:95 1 MQLT-SNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTI-GSVTLQEAEEDTPLIYNPIETGEITFQ 78 (313) T ss_pred Cccc-ccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEeccc-CceeeeccccCCCeeecccccceEEEE Confidence 4433 344456664 556666666655532122222 33333 556777754 445556667777777777888999999 Q ss_pred eeeeEee-ehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHHccc-----CcccccccccccccccceecccchHH Q lcl|NC_021307. 92 PHKIATI-FVASAETVRANP--GNYLGTMRTKVATAIALAFDEAALHGT-----DSPFDKNLDETTKSVDLTPATGTTYD 163 (310) Q Consensus 92 ~~k~~~~-~~is~ell~~s~--~~~~~~v~~~l~~a~~~~~d~~~l~G~-----g~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (310) ..++.+- .-||++|-+|+- -++.+.+..+-+++|....+..+|.-- +.+.|..+.+...-...+...++-.. T Consensus 79 i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T~~~~~~ 158 (313) T protein:vir:95 79 ITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAETNGVFAL 158 (313) T ss_pred EEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccCCceehh Confidence 9886654 369999999874 245555666667777777777666421 12223332222222222222222222 Q ss_pred HHHHHHHHHhh--hhcCCCCEEEEehHHHHHHHHhh------hccCccccccccccccccccCCceeeeeeEEEeCCCCC Q lcl|NC_021307. 164 AIGVNALSLLV--NAGKKWGATLLDDVAEPILNGAK------DANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVAS 235 (310) Q Consensus 164 ~~~~~~~~~l~--~~~~~~~~~~~~~~~~~~l~~l~------d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~ 235 (310) .....+...+. ..-...-.+++.|.....|..+. ..+|+.+...+.. .....-..++|..+.+++-+.. T Consensus 159 ~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A---~~~~Fi~~~YG~Di~~SN~L~~ 235 (313) T protein:vir:95 159 KHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMA---RGQRFIMNLYGWDILTSNRLHV 235 (313) T ss_pred hHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCC---chhHHHHHHhhhhhhhhhhhhh Confidence 22233333333 23455678999999998887654 2346665544432 2233345688888888775531 Q ss_pred -----C--ceeEeeecce--------eeeEEeecccEEEEeecceeeecccccccchhh-hhcCcEEEEEEEEeccEEec Q lcl|NC_021307. 236 -----G--TTVGYLGDFS--------QIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSL-WQHNLVAVRVEAEYGLLIND 299 (310) Q Consensus 236 -----~--~~~~~~gd~~--------~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~r~~~~~d~~v~~ 299 (310) + +...+.|+.- .=+++-|..|.-..+ ..+. -..+..+ ..+|+|+.+.+ T Consensus 236 AN~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~s~~--------------~~~~~~~~~~~~--~~~R~G~Gi~R 299 (313) T protein:vir:95 236 ANYNDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPKSEG--------------ERNKDRARDEHV--VRCRYGFGIQR 299 (313) T ss_pred ccccccccccCceeeeeeeeeecccccceeeeecccccccc--------------ccccccccccce--eeeeeccccee Confidence 1 1122233221 112233333321111 0010 0122233 45799999999 Q ss_pred cCceEEEeecC Q lcl|NC_021307. 300 VEAFVKLTNAA 310 (310) Q Consensus 300 ~~a~~~l~~aa 310 (310) .+-...+--.| T Consensus 300 ~~~L~~~~~~A 310 (313) T protein:vir:95 300 LDTLGLLATSA 310 (313) T ss_pred ecceeEEEecc Confidence 98887776666 No 180 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=96.54 E-value=0.00028 Score=40.03 Aligned_cols=185 Identities=12% Similarity=-0.008 Sum_probs=85.3 Q ss_pred EeeehhhHHHhh-----cChhHHHHHHHHHHHHHHHHHHHHHHHc----ccCcccccc--cccccccccc-eecccchHH Q lcl|NC_021307. 96 ATIFVASAETVR-----ANPGNYLGTMRTKVATAIALAFDEAALH----GTDSPFDKN--LDETTKSVDL-TPATGTTYD 163 (310) Q Consensus 96 ~~~~~is~ell~-----~s~~~~~~~v~~~l~~a~~~~~d~~~l~----G~g~~~~~~--~~~~~~~~~~-~~~~~~~~~ 163 (310) --..-+|+-+++ ++..++.+...+++..++++..|+.++. +..+..+.. +.+....... .+..+.... T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 112224444443 5667899999999999999999998864 222211111 1111111111 111122223 Q ss_pred HHHHHHHHHhhhhcCC-CC-EEEEehHHHHHHHHhhhcc-Ccccccccccccccccc-CCceeeeeeEEEeCCCCCCcee Q lcl|NC_021307. 164 AIGVNALSLLVNAGKK-WG-ATLLDDVAEPILNGAKDAN-GRPLFVESTYEAVTTPY-REGRILGRPTILSDHVASGTTV 239 (310) Q Consensus 164 ~~~~~~~~~l~~~~~~-~~-~~~~~~~~~~~l~~l~d~~-g~~~~~~~~~~~~~~~~-~~~~l~G~pv~~t~~~~~~~~~ 239 (310) +.+.++...|.+.+.. .. .++++|..+..|.+..|.. .+.-+.. .++....+ .-+.+.|++|+.|+++|..... T Consensus 81 dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~--s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt 158 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGN--TQGDMNTGKGLYVNAGIRIYKSNVLASLYGT 158 (221) T ss_pred HHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeeccc--ccccccccceeeeecCcEEEEeccCCccccc Confidence 3344566666655544 33 4666898888876532221 1111110 11111111 2456999999999999963211 Q ss_pred EeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 240 GYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 240 ~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) -+..+...+.. .+.+.. .++-+.. -.=+.+.|++|+.-++.-. T Consensus 159 ~~~~~ag~~~~---~~~~~~-------------------~yr~~fs------~~~glv~~~~Avgtvkl~~ 201 (221) T protein:vir:17 159 NLVTDPGDATT---SGENNG-------------------SYRPAIT------DRAGLVFHKEAADTVEVLL 201 (221) T ss_pred ccccCCccccc---cccccc-------------------ccccccc------ceEEEEEcchheeeeeeec Confidence 01111111100 000000 0000000 0114556677776665555 No 181 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=96.50 E-value=0.00022 Score=40.60 Aligned_cols=264 Identities=11% Similarity=0.038 Sum_probs=127.9 Q ss_pred hccccCCCCceechhhHHHHHHHHHhhch-hhhhcceeecCCCceEEEEEcCCcee-eeecccccccccccceeeeEeee Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQAQDYFAEAEKTSI-VQRVARKIPMGSTGVKIPHWTGDVSA-AWIGEGDMKPITKGDMSVQQVEP 92 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~~~ii~~~~~~s~-l~~~~~~~~~~~~~~~ip~~~~~~~a-~~v~Eg~~~~~~~~~~~~i~l~~ 92 (310) |.++... =..|-..+...+.+......+ ..++|+..+......++.....-+.. .|.+| .+-.++.-...++.. T Consensus 1 m~it~~~-l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~~Ge---~~~~~l~~~~~~i~~ 76 (302) T protein:vir:10 1 MLINKQS-LNAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRWIGA---KVVKNLKAYKYVVEN 76 (302) T ss_pred CcccHHH-HHHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCccccccc---eeeccccccceeEEe Confidence 2222211 011111112223333332222 55567766644444455555444443 56544 334445556678999 Q ss_pred eeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCccc------ccccccccccc------cc-----e Q lcl|NC_021307. 93 HKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPF------DKNLDETTKSV------DL-----T 155 (310) Q Consensus 93 ~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~------~~~~~~~~~~~------~~-----~ 155 (310) ++++..+.||++.+.+...++..-+.+.|.++.++.+|+.++.=-.++. +.......... +. . T Consensus 77 ~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~~~~~ 156 (302) T protein:vir:10 77 EDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGTAPLS 156 (302) T ss_pred ecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccccchhhh Confidence 9999999999999999899999999999999999999987665322110 11111111100 00 0 Q ss_pred ecccchHHHHHHHHHH---Hhhh-----hcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeee-ee Q lcl|NC_021307. 156 PATGTTYDAIGVNALS---LLVN-----AGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILG-RP 226 (310) Q Consensus 156 ~~~~~~~~~~~~~~~~---~l~~-----~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G-~p 226 (310) ........+.+..... .... ....+..++..+.....-+++... ++. ..+... .+.| .- T Consensus 157 ~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~-~~~------~~g~~N-----p~~g~~~ 224 (302) T protein:vir:10 157 NASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTN-PKL------ADNTPN-----PYVGTAE 224 (302) T ss_pred hcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhc-ccc------CCCCcc-----eeccceE Confidence 0000011111112222 2221 223456677777666555544322 111 011111 1122 35 Q ss_pred EEEeCCCCCCceeEeeecceee---eEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEe----- Q lcl|NC_021307. 227 TILSDHVASGTTVGYLGDFSQI---VWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIN----- 298 (310) Q Consensus 227 v~~t~~~~~~~~~~~~gd~~~~---~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~----- 298 (310) +++++.+.+++...++.|.+.+ ++..+.+..++... -|..+.+.+|....++..-+ T Consensus 225 ~vv~p~L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~----------------~~~~dgv~~k~~~d~Gvd~R~~~G~ 288 (302) T protein:vir:10 225 LVVDGRIESDTAWFLLDTTKPVKPFIFQPRKQPEFVSQV----------------NLDSDDVFNLRKLKFGAEARAAAGY 288 (302) T ss_pred EEEeeccCCCCceEEEecCCccceEEEcCccccEEEecc----------------CCCCCceEEEEEEEEeeeeeeecch Confidence 6777777777777777666553 22233334443321 25566777776555553111 Q ss_pred -ccCceEEEeecC Q lcl|NC_021307. 299 -DVEAFVKLTNAA 310 (310) Q Consensus 299 -~~~a~~~l~~aa 310 (310) .++.--+-++.| T Consensus 289 ~~wq~a~~s~g~~ 301 (302) T protein:vir:10 289 GFWQLAYGSTGTG 301 (302) T ss_pred hhhhhhhccCccC Confidence 111112333333 No 182 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=96.48 E-value=0.00059 Score=38.30 Aligned_cols=264 Identities=12% Similarity=0.072 Sum_probs=104.9 Q ss_pred hccccCCCCcee--chhhHHHHHHHHHhhchhhhhcce--e-----ecCCCceEEEEEc-CCcee-eeeccccccccccc Q lcl|NC_021307. 15 AQTGDSMFQGYL--EPEQAQDYFAEAEKTSIVQRVARK--I-----PMGSTGVKIPHWT-GDVSA-AWIGEGDMKPITKG 83 (310) Q Consensus 15 ~~~~~~~~g~~i--~~~~~~~ii~~~~~~s~l~~~~~~--~-----~~~~~~~~ip~~~-~~~~a-~~v~Eg~~~~~~~~ 83 (310) |.++ -...++ .+......+|.+.+...+++.+.. + |+.+.-...+... ++... .-+...+.+...++ T Consensus 1 ~~~t--~~sdl~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~ki 78 (315) T protein:vir:96 1 MATT--VNSDLVIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKI 78 (315) T ss_pred Ccee--eecceeeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceec Confidence 3222 333333 355566678887776665554322 1 1112211222111 11100 11111222332332 Q ss_pred c-eeeeEeeeeeeEeeehhhHHHhh---cChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceeccc Q lcl|NC_021307. 84 D-MSVQQVEPHKIATIFVASAETVR---ANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATG 159 (310) Q Consensus 84 ~-~~~i~l~~~k~~~~~~is~ell~---~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~ 159 (310) + ...+.....--.+-+..+.+.+. +.+......|.+.+..+..+..-...+.+........ +.......... T Consensus 79 t~~~dvaVk~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai~~~----t~~~~~~~~a~ 154 (315) T protein:vir:96 79 AADEMVSVKVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAIGSN----AGMNVSGELAT 154 (315) T ss_pred ccccceeEEEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhccc----ccccccccccc Confidence 2 22232222111122223333333 3333444445555555554444444443332111111 11111111112 Q ss_pred chHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCcee Q lcl|NC_021307. 160 TTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTV 239 (310) Q Consensus 160 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~ 239 (310) .+. ..+.+...++.+....-..|+||..++..|.+ +.=. ..++... .+..+...+. .+|+||++++.||..... T Consensus 155 ~~~-~~l~dA~~klGD~~~~l~~~vMHS~v~~~L~~-q~L~-~~~~~~~--~~~~~~~~~~-~lGkrViVdD~~P~~~~~ 228 (315) T protein:vir:96 155 EGK-KVLTKGLRTMGDKASSIAIWVMDSTSYFDIVD-EAID-NKLYEEA--GVVVYGGTPG-TLGKPVLVTDQCPATKIF 228 (315) T ss_pred cCH-HHHHHHHHHhcccccCeeEEEEchHHHHHHHH-hhhh-hhccccc--ceeEecCcCc-ccccEEEEECCCCcceee Confidence 222 33456777788888889999999999999986 2211 1232211 1112222233 459999999999975422 Q ss_pred EeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEecc-EEeccCceEEEee--cC Q lcl|NC_021307. 240 GYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGL-LINDVEAFVKLTN--AA 310 (310) Q Consensus 240 ~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~-~v~~~~a~~~l~~--aa 310 (310) . |+. ..+.++....+.... .+ ..++-.+....|.+| -..+|..|.--+. ++ T Consensus 229 g-l~~-GAi~~~~~~~~~~~~----------~~--------~~g~e~l~~~~r~e~tf~l~p~G~sw~~~~~~s 282 (315) T protein:vir:96 229 G-LVA-GAVMITESQAPGMRS----------YQ--------IDDQENLAIGFRAEGTANVEVLGYKWKTKTNVN 282 (315) T ss_pred e-eec-ceeeecCCCcccccc----------cc--------CCCcceeEEEEeeeeEeeeeeeeEEeecCCCcC Confidence 1 111 111122111110000 00 011122222333333 2445555444211 11 No 183 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=96.40 E-value=0.00048 Score=38.79 Aligned_cols=280 Identities=10% Similarity=0.013 Sum_probs=134.8 Q ss_pred Cccchhh-hHHHHHhhccc-------cCCCCceec---hhhH-HHHHHHHHhhchhhhhcceeecCC---CceEEEEEcC Q lcl|NC_021307. 1 MAAGTAF-PVNHTQIAQTG-------DSMFQGYLE---PEQA-QDYFAEAEKTSIVQRVARKIPMGS---TGVKIPHWTG 65 (310) Q Consensus 1 ~aa~~~~-~~~~~~~~~~~-------~~~~g~~i~---~~~~-~~ii~~~~~~s~l~~~~~~~~~~~---~~~~ip~~~~ 65 (310) ..++... ..+....+... ++....-|| .+++ ..+++...+.-..+++....+.+. .++.+++.+. T Consensus 21 ~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~ 100 (339) T protein:vir:94 21 DGYSPKSISSEVSAYAMDAVNLTPTLQTTANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEP 100 (339) T ss_pred ccchhhhcchhhHhhhccccccccccccccccchhhhhhhhhchhheeecccccchhhhcccccCCCCcccEEEEeeeec Confidence 1222221 11111111110 111111233 2333 446666777777777777666543 4578888888 Q ss_pred Cceeeeecccccccccc--cceeeeEeeeeeeEeeehhhHHHhhc--ChhHHHHHHHHHHHHHHHHHHHHHHHcccCccc Q lcl|NC_021307. 66 DVSAAWIGEGDMKPITK--GDMSVQQVEPHKIATIFVASAETVRA--NPGNYLGTMRTKVATAIALAFDEAALHGTDSPF 141 (310) Q Consensus 66 ~~~a~~v~Eg~~~~~~~--~~~~~i~l~~~k~~~~~~is~ell~~--s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~ 141 (310) .+.+.+.+.++..|..+ .++.+.++.....+-.+. ..|+..- ...++.+.-.....+++.+.+|+-.+.|+.... T Consensus 101 ~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g~~y~-~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~~ 179 (339) T protein:vir:94 101 VGQVATYSDWSANGMSKANVNFESRQNYRYQTWTEYG-DLEMATYGEAGIDYVARQEISASLVMAKFANSSYLLGVAGIA 179 (339) T ss_pred ccceEEcccccCCCcccccceeeEEeEEEEEEEEeec-HHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEeeeecccc Confidence 88999999988888766 445555555555554443 3333222 235788888888999999999999999976443 Q ss_pred cccccc-------ccccccceecccchHHHHHHH---HHHHhhhhcC------CCCEEEEehHHHHHHHHhhhccCcccc Q lcl|NC_021307. 142 DKNLDE-------TTKSVDLTPATGTTYDAIGVN---ALSLLVNAGK------KWGATLLDDVAEPILNGAKDANGRPLF 205 (310) Q Consensus 142 ~~~~~~-------~~~~~~~~~~~~~~~~~~~~~---~~~~l~~~~~------~~~~~~~~~~~~~~l~~l~d~~g~~~~ 205 (310) -.+.+. .+...++.. .+.+.++.| +...+..... .+..++|.++.+..|..- +..|.-++ T Consensus 180 ~~GLlN~P~l~~~v~~s~~Wa~---kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~-n~~~~Tvl 255 (339) T protein:vir:94 180 NYGLMNDPSLPAPVAATVNWAT---AAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRT-NNFGLSAG 255 (339) T ss_pred eEEEEeCCCccccccCCCCccc---CCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccC-CcCCccHH Confidence 222221 111222222 233333333 3344433321 244799999999888642 33332221 Q ss_pred ccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeec---ccEEEEeecceeeecccccccchhhhhc Q lcl|NC_021307. 206 VESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVG---GLSFDVSDQATLNLGTPQAPNFVSLWQH 282 (310) Q Consensus 206 ~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~---~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 282 (310) .- ... .+.+-++...|=.-+ +++ +-...++.... -..+.+.. +...-++- .+. T Consensus 256 ~~--lk~---n~pnl~i~~~~el~~----a~g------~~~~~~~~~~~~~~~~~~~~p~--------~~~~lpvq-~~~ 311 (339) T protein:vir:94 256 AK--IAQ---TYPNIQFVAVPEFDT----ASG------RLVQLWVPEVNGQPTGEVAFAE--------KLRSHSIE-RYS 311 (339) T ss_pred HH--HHH---hcCCcEEEEcccccc----CCC------ceEEEEEEeccCCcceEEEcch--------hhhccccE-EcC Confidence 10 000 011122333222211 111 11111111111 11111111 00000000 011 Q ss_pred CcEEEEEEEEe-ccEEeccCceEEEeec Q lcl|NC_021307. 283 NLVAVRVEAEY-GLLINDVEAFVKLTNA 309 (310) Q Consensus 283 ~~~~~r~~~~~-d~~v~~~~a~~~l~~a 309 (310) -........|. |..+++|.||+.+++- T Consensus 312 ~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 312 TTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred ceEEecceeeeeeEEEEccceeeeeecC Confidence 23344456664 5578899999999999 No 184 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=96.40 E-value=0.00066 Score=38.02 Aligned_cols=288 Identities=13% Similarity=0.013 Sum_probs=162.0 Q ss_pred CccchhhhHHH----HHhhcc-ccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCce-EEEEEcCCceeeeec- Q lcl|NC_021307. 1 MAAGTAFPVNH----TQIAQT-GDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGV-KIPHWTGDVSAAWIG- 73 (310) Q Consensus 1 ~aa~~~~~~~~----~~~~~~-~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v~- 73 (310) |-.-.+.--.. .+.... .+....-.+.|.+...+...+++.|-+++..+++++..-.. .+-...+++-++-+. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdT 80 (338) T protein:vir:11 1 MRNETRKQFDAYLAQLAKLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSGTIASRTDT 80 (338) T ss_pred CCHHHHHHHHHHHHHHHHHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCccccccccC Confidence 43332221111 111111 12222235778999999999999999999999999875443 344444455454432 Q ss_pred -ccccccccc-cceeeeEeeeeeeEeeehhhHHHhhc--ChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccc----c Q lcl|NC_021307. 74 -EGDMKPITK-GDMSVQQVEPHKIATIFVASAETVRA--NPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKN----L 145 (310) Q Consensus 74 -Eg~~~~~~~-~~~~~i~l~~~k~~~~~~is~ell~~--s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~----~ 145 (310) .+......+ ..++.-....++.---..|+.+.|+. ..++|+..+.+.+.++++.-.-.--++|........ + T Consensus 81 ~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~~nP 160 (338) T protein:vir:11 81 TGDGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRAANP 160 (338) T ss_pred CCCCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCc Confidence 222222233 35677778888888888888888873 346899999999999988877777778865432211 1 Q ss_pred cc------------------------cccccccee---cccchHHHHHHHHHH-HhhhhcCC--CCEEEEehHHHH-HHH Q lcl|NC_021307. 146 DE------------------------TTKSVDLTP---ATGTTYDAIGVNALS-LLVNAGKK--WGATLLDDVAEP-ILN 194 (310) Q Consensus 146 ~~------------------------~~~~~~~~~---~~~~~~~~~~~~~~~-~l~~~~~~--~~~~~~~~~~~~-~l~ 194 (310) .. .+..+.... ..-...|.++.++.. ++.+.+.+ .-+.+|.+.... +-. T Consensus 161 llqDVNkGWlQ~~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~ 240 (338) T protein:vir:11 161 LLQDVNIGWFQQYRNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGRELVHDKYF 240 (338) T ss_pred CccccchhHHHHHHhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHh Confidence 00 000111111 112345677777664 55666655 357888877554 222 Q ss_pred HhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeec-ccEEEEeecceeeeccccc Q lcl|NC_021307. 195 GAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVG-GLSFDVSDQATLNLGTPQA 273 (310) Q Consensus 195 ~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~-~~~v~~~~~~~~~~~~~~~ 273 (310) .+.+....|-- ...........++-|+|.+..+++|.+.. ++..++++-+.... ...-.+.+.+ T Consensus 241 ~l~n~~~~ptE----~~Aa~~~~s~k~iGGlpa~~~PffP~~~~--lVT~L~NLsIY~Q~gs~RR~~~d~p--------- 305 (338) T protein:vir:11 241 PMVNKDQPATE----KIATDLILSQKRMGGLPPVEVPYVPEKGL--MVTTLKNLSLYWQIGGRRRYLKEVP--------- 305 (338) T ss_pred HHHhcCCChHH----HHHHHHHHHhhhhCCceeEEccccCCCce--EEeeccccEEEEecCcEEEEEEecc--------- Confidence 22222111110 00001011145799999999999999864 44567776544332 2332222222 Q ss_pred ccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 274 PNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 274 ~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|.+.-.-..--|+.|.+.++++.+.+.. T Consensus 306 -------~r~rie~y~s~Ne~YvVEd~~~~a~ieni~ 335 (338) T protein:vir:11 306 -------EKNRIENYESSNDAYVVEDYGLGCLVENIE 335 (338) T ss_pred -------ccccccchhhhccceeeeccccEEEeecce Confidence 234444444445577788888888887776 No 185 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=96.29 E-value=0.00077 Score=37.64 Aligned_cols=273 Identities=12% Similarity=0.086 Sum_probs=122.2 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHH-HHHhhchhhhhcce---------eecCCCceEEEEEcCC-cee Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFA-EAEKTSIVQRVARK---------IPMGSTGVKIPHWTGD-VSA 69 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~-~~~~~s~l~~~~~~---------~~~~~~~~~ip~~~~~-~~a 69 (310) |+-.... +.-..++.|++....++ ...+.+.|.+-+-+ ...++..+.+|.+..- .+. T Consensus 1 M~~~~~~------------T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~ 68 (367) T protein:vir:80 1 MPDFNNQ------------VRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLE 68 (367) T ss_pred Ccchhhh------------hhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCc Confidence 2221110 11123566666665443 33344444332222 2234667889988432 222 Q ss_pred eeecccc---cccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHc---ccC---cc Q lcl|NC_021307. 70 AWIGEGD---MKPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALH---GTD---SP 140 (310) Q Consensus 70 ~~v~Eg~---~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~---G~g---~~ 140 (310) .-+.+.. .++..+.+-++.....+..+.....++-.-.-+..+.++.|.+++++.-.+...+.+|. |-- .. T Consensus 69 ~n~~~d~~~~~~t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a 148 (367) T protein:vir:80 69 PNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLA 148 (367) T ss_pred cccCCCCCcccccccccccchheeeeehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccc Confidence 2222222 34444555554444444444444444443333455788889999997777766665443 211 10 Q ss_pred cccccc-------------ccccccccee-----cccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCc Q lcl|NC_021307. 141 FDKNLD-------------ETTKSVDLTP-----ATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGR 202 (310) Q Consensus 141 ~~~~~~-------------~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~ 202 (310) ...... .......+.. ........ ..++..++.+....-+.++||+..+..|++++-- T Consensus 149 ~~~~~~~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~-~~~A~~~lGD~~~~l~~i~mHS~V~~~L~~~~li--- 224 (367) T protein:vir:80 149 GNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREA-FVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEI--- 224 (367) T ss_pred cchhhhhhhhccccccccccCceeeeeeccCCCccceecHHH-HHHHHHHhccccccccEEEEchHHHHHHHhcccc--- Confidence 000000 0000111111 11122233 3466788888888899999999999999876411 Q ss_pred cccccccccccccccCCceeeeeeEEEeCCCCCCce-------eEeeecceeeeEEeecc-cEEEEeecceeeecccccc Q lcl|NC_021307. 203 PLFVESTYEAVTTPYREGRILGRPTILSDHVASGTT-------VGYLGDFSQIVWGQVGG-LSFDVSDQATLNLGTPQAP 274 (310) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~-------~~~~gd~~~~~~~~~~~-~~v~~~~~~~~~~~~~~~~ 274 (310) -|.... .+ ...-+++.|++|++++.||.... ..+||. ..+-++.... ..+++.|++.... T Consensus 225 -~~i~~s-d~---~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~-GAi~~~~~~~~~~~E~~Rd~~~~~------ 292 (367) T protein:vir:80 225 -EFIPDS-KG---QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRRELRGN------ 292 (367) T ss_pred -ccccCC-CC---ccccceecceeEEEeCCCcccccCCCceEEEEEEec-ceeeecccCCccceecccchhhhc------ Confidence 111111 11 12346789999999999995321 112221 1111222111 2245554443110 Q ss_pred cchhhhhcCcEEEEEEEEeccEEeccCceEEE----------------------------eecC Q lcl|NC_021307. 275 NFVSLWQHNLVAVRVEAEYGLLINDVEAFVKL----------------------------TNAA 310 (310) Q Consensus 275 ~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l----------------------------~~aa 310 (310) ..++-.+....| .+.||..|.-. ..++ T Consensus 293 ------~gG~d~L~~Rr~---~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt~~eLa~~~ 347 (367) T protein:vir:80 293 ------GSGLEYILERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPD 347 (367) T ss_pred ------CCceEEEEeeee---EEeecceeeecccccccccccccccccccccCCCChHHhcCCc Confidence 011111211112 23344333221 1111 No 186 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=96.19 E-value=0.00074 Score=37.74 Aligned_cols=288 Identities=10% Similarity=0.028 Sum_probs=160.0 Q ss_pred CccchhhhHHHHHh-h--ccccC----CCC--ceechhhHHHHHHHHHhhchhhhhcceeecCCCce-EEEEEcCCceee Q lcl|NC_021307. 1 MAAGTAFPVNHTQI-A--QTGDS----MFQ--GYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGV-KIPHWTGDVSAA 70 (310) Q Consensus 1 ~aa~~~~~~~~~~~-~--~~~~~----~~g--~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~~~a~ 70 (310) |-.-.+.--..... . ..|.. ..+ =.+.|.+...+...+.+.|-+++..+++++..-.. .+-...+++-++ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iag 80 (342) T protein:vir:10 1 MKDLTLEKYNAYLARQAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLDSAHTVAS 80 (342) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecccCccccc Confidence 43332221111111 1 11211 111 24678888999999999999999999999875433 344444555554 Q ss_pred eec---ccccccccccceeeeEeeeeeeEeeehhhHHHhhc--ChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccc Q lcl|NC_021307. 71 WIG---EGDMKPITKGDMSVQQVEPHKIATIFVASAETVRA--NPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNL 145 (310) Q Consensus 71 ~v~---Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~--s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~ 145 (310) -+. -+...|..-..++.-....++.---..|+.+.|+. ..++|+..+.+.+.+.++.-.=.--++|......... T Consensus 81 rtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~ 160 (342) T protein:vir:10 81 TTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAATSDR 160 (342) T ss_pred ccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCCh Confidence 432 12233334456677778888888888888888872 3468999999999998887777777777654432211 Q ss_pred c-------------------------c---ccccccceec-ccchHHHHHHHHHH-HhhhhcCC--CCEEEEehHHHH-H Q lcl|NC_021307. 146 D-------------------------E---TTKSVDLTPA-TGTTYDAIGVNALS-LLVNAGKK--WGATLLDDVAEP-I 192 (310) Q Consensus 146 ~-------------------------~---~~~~~~~~~~-~~~~~~~~~~~~~~-~l~~~~~~--~~~~~~~~~~~~-~ 192 (310) . . ....+..... .-...|.++.++.. ++.+.+.+ .-+.+|.+.... + T Consensus 161 ~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~~~~~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLladk 240 (342) T protein:vir:10 161 NSNPLLQDVAKGWLQKMREDAKERVMNGESTDNQVLVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITGRKLLADK 240 (342) T ss_pred hhCcCccccchHHHHHHHhhhhhhhcccceeccceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHH Confidence 0 0 0011111111 22345667777664 45666655 467888887654 2 Q ss_pred HHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEe-ecccEEEEeecceeeeccc Q lcl|NC_021307. 193 LNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQ-VGGLSFDVSDQATLNLGTP 271 (310) Q Consensus 193 l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~-~~~~~v~~~~~~~~~~~~~ 271 (310) -..+......+-- ...........++-|+|.+..+++|.+... +..++++-+-. .+...-.+.+++ T Consensus 241 ~~~l~n~~~~ptE----~~Aa~~i~s~k~iGGl~a~~~PfFP~~~il--VT~L~NLsIY~Q~gs~RR~~~d~p------- 307 (342) T protein:vir:10 241 YFPIVNQQNAPTE----ELAADIVISQKRIGGLKAVRVPFFPANAIL--ITKLENLAIYVQEGTTRKHIENVP------- 307 (342) T ss_pred HHHHHhcCCChHH----HHHHHHHHhhhhhcCceeEEccccCCCceE--EeeccccEEEEecCcEEEEEEecc------- Confidence 2222222111110 000111112457999999999999998744 45666654332 233333333222 Q ss_pred ccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 272 QAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 272 ~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|.+.-.-..--|+.|.+.++++.+.+.- T Consensus 308 ---------~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~ 337 (342) T protein:vir:10 308 ---------KKDRIETYESENIDYVVEDYGCAALIENIT 337 (342) T ss_pred ---------ccccccchhhhccceeeeccccEEEeecce Confidence 233343333445567777777777776555 No 187 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=96.07 E-value=0.001 Score=36.94 Aligned_cols=272 Identities=13% Similarity=0.119 Sum_probs=114.9 Q ss_pred hccccCCCCceechh--hHHHH-HHHHHhhchhhhhccee---------ecCCCceEEEEEcC-C--ceeeeecc--ccc Q lcl|NC_021307. 15 AQTGDSMFQGYLEPE--QAQDY-FAEAEKTSIVQRVARKI---------PMGSTGVKIPHWTG-D--VSAAWIGE--GDM 77 (310) Q Consensus 15 ~~~~~~~~g~~i~~~--~~~~i-i~~~~~~s~l~~~~~~~---------~~~~~~~~ip~~~~-~--~~a~~v~E--g~~ 77 (310) |. .+--...++|+ +...+ .+...+.+.|.+-+-+. ..++..+++|.+.. . .+..+... .+. T Consensus 1 Ma--~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~ 78 (349) T protein:vir:78 1 MA--ITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CC--ceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 22 22233456665 45554 33444445544422221 23466788998853 2 23222222 234 Q ss_pred ccccccceeeeEeeeeeeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHc---ccCc---c-cccccccccc Q lcl|NC_021307. 78 KPITKGDMSVQQVEPHKIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALH---GTDS---P-FDKNLDETTK 150 (310) Q Consensus 78 ~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~---G~g~---~-~~~~~~~~~~ 150 (310) .+..+.+-++.....+..+.....++-.-.-+..+.++.|.+++++...+...+.++. |--. . .......... T Consensus 79 ~t~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~~~~ 158 (349) T protein:vir:78 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQNDM 158 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhhcccc Confidence 4444544444333333333333333222222344778889999998887776665543 2111 0 0000000000 Q ss_pred cccceecccchHHHHHHHHHHHhhhh-----cCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeee Q lcl|NC_021307. 151 SVDLTPATGTTYDAIGVNALSLLVNA-----GKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGR 225 (310) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~l~~~-----~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~ 225 (310) ........+.+...++ +....+.+. ...-..++||+..+..|++++-=. +.++... ...-.+++|+ T Consensus 159 t~d~s~~a~~~~~~~~-dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~----~i~~s~~----~~~i~ty~G~ 229 (349) T protein:vir:78 159 VVDVSATLGFDAGAFI-DATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID----FIRDAEN----NTMFATYQGY 229 (349) T ss_pred eeeeccccCCChhhhh-hhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhh----hccCccc----CcccceecCe Confidence 1111111222333332 334344333 455678999999999998653211 1111111 1123578999 Q ss_pred eEEEeCCCCCCc-------eeEeeecceeeeEEeecc-cEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEE Q lcl|NC_021307. 226 PTILSDHVASGT-------TVGYLGDFSQIVWGQVGG-LSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLI 297 (310) Q Consensus 226 pv~~t~~~~~~~-------~~~~~gd~~~~~~~~~~~-~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v 297 (310) +|++++.||... ...+||. ..+.++..+. ..+++.+++.... ..++-.+....++-+.+ T Consensus 230 ~VivDD~~Pv~~~g~~~~yttylfg~-GAi~~~~~~~~~~~et~rd~~~g~------------~~G~d~l~~R~~~~~hp 296 (349) T protein:vir:78 230 RVIVDDSMTVVGQGAQRKFISIIFGQ-GAIGYGEGNPVMPLEYEREASRAN------------GGGVETLWTRKTWLLHP 296 (349) T ss_pred EEEEeCCCccccCCCCceEEEEEeec-ceEEEccCCCccceeeecccccCC------------cceeEEEEEeeEEEeee Confidence 999999999532 1122331 2222332221 2355544442100 01122222222221111 Q ss_pred e---ccCce-E--------------EEeecC Q lcl|NC_021307. 298 N---DVEAF-V--------------KLTNAA 310 (310) Q Consensus 298 ~---~~~a~-~--------------~l~~aa 310 (310) . -.+++ . -|..++ T Consensus 297 ~G~s~~~a~v~~~~~~~~~~sPt~aeLa~~~ 327 (349) T protein:vir:78 297 FGYRFTSAVITGNGTETIARSASWQDLANAT 327 (349) T ss_pred eeeeeccccccCCccccccCCCChHHhcCCc Confidence 1 01111 1 111111 No 188 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=96.07 E-value=0.00088 Score=37.33 Aligned_cols=288 Identities=10% Similarity=0.003 Sum_probs=148.4 Q ss_pred CccchhhhHHH----HHhhcccc---CCCC--ceechhhHHHHHHHHHhhchhhhhcceeecCCCceEE-EEEcCCceee Q lcl|NC_021307. 1 MAAGTAFPVNH----TQIAQTGD---SMFQ--GYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKI-PHWTGDVSAA 70 (310) Q Consensus 1 ~aa~~~~~~~~----~~~~~~~~---~~~g--~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~i-p~~~~~~~a~ 70 (310) |-.-.+.--.. .+.....+ ...+ -.+.|.+...+...+.+.|-+++..+++++..-..++ ....++..+. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~~sg~~t~ 80 (343) T protein:vir:98 1 MNKTAQELFYSLIGDAAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRSNRKRHYG 80 (343) T ss_pred CChHHHHHHHHHHHHHHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEeecCccccC Confidence 44332221111 11111111 0111 2477888899999999999999999999986433232 2222332222 Q ss_pred eecccccccccccceeeeEeeeeeeEeeehhhHHHhhc--ChhH-HHHHHHHHHHHHHHHHHHHHHHcccCccccc-cc- Q lcl|NC_021307. 71 WIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRA--NPGN-YLGTMRTKVATAIALAFDEAALHGTDSPFDK-NL- 145 (310) Q Consensus 71 ~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~--s~~~-~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~-~~- 145 (310) -....+...+ ..+.+......++.---..|+-+.|+. ..++ |+..+.+.+.+.++.-.=.--++|....... .+ T Consensus 81 r~~t~~~~~~-~~~~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T~nPl 159 (343) T protein:vir:98 81 AHDRRTPIQQ-RWTRQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDTSDPN 159 (343) T ss_pred ccccCCCccc-cccCCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeeccCCCCcc Confidence 2111111100 111122245566666666778887763 1255 8888888888888776666667776433211 11 Q ss_pred --------------------cc-ccccccc---eec-ccchHHHHHHHHHHHhhhhcCC--CCEEEEehHHHHH-HHHhh Q lcl|NC_021307. 146 --------------------DE-TTKSVDL---TPA-TGTTYDAIGVNALSLLVNAGKK--WGATLLDDVAEPI-LNGAK 197 (310) Q Consensus 146 --------------------~~-~~~~~~~---~~~-~~~~~~~~~~~~~~~l~~~~~~--~~~~~~~~~~~~~-l~~l~ 197 (310) +. .....+. ... .-...|.++.++...+.+.+.+ .-+.+|.+..... ...+. T Consensus 160 lqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~ 239 (343) T protein:vir:98 160 LADVNKGWIQFVRENKATQILTQGATSGEIRLFGEGADYVNLDELAYDLKQGLDARHRDAGDLVFLVGADLVAKEASLVY 239 (343) T ss_pred hhhcchHHHHHHHhcchhhhhccceeccceeEecCCCCcccHHHHHHHHHhcCchHHhcCCCEEEEEchhhhhhhhhhhh Confidence 00 0000011 111 1234567777777777776555 4577888776433 22233 Q ss_pred hccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEe-ecccEEEEeecceeeecccccccc Q lcl|NC_021307. 198 DANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQ-VGGLSFDVSDQATLNLGTPQAPNF 276 (310) Q Consensus 198 d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~ 276 (310) +..+++.... ..........++-|+|.+..+++|.+... +..++++-+-. .+...-.+.+++ T Consensus 240 n~~~~~ptEk---~Aa~~~~~~k~iGGl~a~~~PfFP~~~ll--VT~L~NLsIY~Q~gs~RR~~~d~p------------ 302 (343) T protein:vir:98 240 KGNGLIATEK---AALNTHDLMKSFGGMPAMIVPNMPPRAAI--VTSLSNLSIYTQEGSMRRGMKDDD------------ 302 (343) T ss_pred hhcCCChHHH---HHHHHHHHHHhhCCCeeEEccccCCCceE--EeeccccEEEEecCcEEEEEEecc------------ Confidence 3323211110 00000112357899999999999998744 45677654332 233333333222 Q ss_pred hhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 277 VSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 277 ~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|.+.-.-..--|+.|.+.++++.+.... T Consensus 303 ----~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~ 332 (343) T protein:vir:98 303 ----DKKAVRDSYYRNEAYAVEDCGKFMAVDFTK 332 (343) T ss_pred ----ccccccchhhhcceeeeeccccEEEeeeee Confidence 233343333445567777888877776665 No 189 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=95.92 E-value=0.0012 Score=36.66 Aligned_cols=284 Identities=10% Similarity=0.025 Sum_probs=135.2 Q ss_pred Cc-cchhhhHHHHHhhccccCCCCc------eechhhHHHHH-----HHHHhhchhhhhcceeecCC---CceEEEEEcC Q lcl|NC_021307. 1 MA-AGTAFPVNHTQIAQTGDSMFQG------YLEPEQAQDYF-----AEAEKTSIVQRVARKIPMGS---TGVKIPHWTG 65 (310) Q Consensus 1 ~a-a~~~~~~~~~~~~~~~~~~~g~------~i~~~~~~~ii-----~~~~~~s~l~~~~~~~~~~~---~~~~ip~~~~ 65 (310) +. +...+..+...-+......+++ .-+|++...++ +.+.+......+......+. ....+++.+. T Consensus 17 ~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~pv~t~g~W~~~~~~~~~~e~ 96 (336) T protein:vir:36 17 LPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP 96 (336) T ss_pred ecchhhhhhhHHHHhhhhhhhccCccccCCCcchHHHHHHhhccceEeeecchhhhhhhccccccCCccceeEEEeeeec Confidence 11 1111111111111111111111 12345555544 33333333444444433332 2456677777 Q ss_pred CceeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHHHHHcccCcccc Q lcl|NC_021307. 66 DVSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN---PGNYLGTMRTKVATAIALAFDEAALHGTDSPFD 142 (310) Q Consensus 66 ~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s---~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~ 142 (310) .+.+.+.+.++..|..+...+..+.+.+.++..+.++.+-+..+ ..++.+.-....++++.+.+|+-.+.|+....- T Consensus 97 ~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~ 176 (336) T protein:vir:36 97 TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLEN 176 (336) T ss_pred eeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccce Confidence 77788889999999998888888888999999999984444322 356777788888888889999988888765432 Q ss_pred ccccccccc---cccee--cccchHHHHHH---HHHHHhhhhc------CCCCEEEEehHHHHHHHHhhhccCccccccc Q lcl|NC_021307. 143 KNLDETTKS---VDLTP--ATGTTYDAIGV---NALSLLVNAG------KKWGATLLDDVAEPILNGAKDANGRPLFVES 208 (310) Q Consensus 143 ~~~~~~~~~---~~~~~--~~~~~~~~~~~---~~~~~l~~~~------~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~ 208 (310) .+.+...+. ..... ...++.+.+.. .++..+.... -.+..++|.+..+..|.. .+..|.-+..- T Consensus 177 yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~- 254 (336) T protein:vir:36 177 YGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK- 254 (336) T ss_pred EEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccC-CCccCccHHHH- Confidence 222221110 11000 11122233333 3444443322 236689999998888753 33333322110 Q ss_pred cccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecc---cEEEEeecceeeecccccccchhhhhcCcE Q lcl|NC_021307. 209 TYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGG---LSFDVSDQATLNLGTPQAPNFVSLWQHNLV 285 (310) Q Consensus 209 ~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~---~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (310) ... .+.+-++...|=.-+ ++ |+-..+++-...+ ..+.+... +..-++ -.+.-.. T Consensus 255 -lk~---n~Pnl~i~t~pEl~~----a~------g~~~~l~~~~~~~~~t~~~~~p~~--------~~~l~v-q~~~~~~ 311 (336) T protein:vir:36 255 -LKD---IFPKLEFVTIPEYDT----AS------GRLVQLWAPRVEGKDTATCGFTEK--------MRAHSI-ERYSSYF 311 (336) T ss_pred -HHH---hcCccEEEEcccccc----CC------CceEEEEEEecCCCcceeeecchh--------hhccce-eecCcee Confidence 000 011223333332211 11 1111222221111 11111100 000000 0011223 Q ss_pred EEEEEEEec-cEEeccCceEEEeec Q lcl|NC_021307. 286 AVRVEAEYG-LLINDVEAFVKLTNA 309 (310) Q Consensus 286 ~~r~~~~~d-~~v~~~~a~~~l~~a 309 (310) ......|.+ ..+++|.||+++++- T Consensus 312 ~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 312 RQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred EeccccceeeeeeeccchheeeecC Confidence 344555654 477899999999999 No 190 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=95.87 E-value=0.0013 Score=36.34 Aligned_cols=288 Identities=12% Similarity=0.047 Sum_probs=159.0 Q ss_pred CccchhhhHHH-----HHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCce-EEEEEcCCceeeeec- Q lcl|NC_021307. 1 MAAGTAFPVNH-----TQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGV-KIPHWTGDVSAAWIG- 73 (310) Q Consensus 1 ~aa~~~~~~~~-----~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v~- 73 (310) |-.-.+.--.. ...-...+....-.+.|.+...+...+++.|-+++..+++++..-.. .+-...+++-++-+. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t 80 (337) T protein:vir:79 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecC Confidence 44322221111 11111111112224778888889999999999999999999875433 333334455444432 Q ss_pred -ccccccccccceeeeEeeeeeeEeeehhhHHHhhc--ChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccc----c Q lcl|NC_021307. 74 -EGDMKPITKGDMSVQQVEPHKIATIFVASAETVRA--NPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNL----D 146 (310) Q Consensus 74 -Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~--s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~----~ 146 (310) .+...|..-..++.-....++.---..|+.+.|+. ..++|+..+.+.+.+.++.-.-.--++|......... . T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPl 160 (337) T protein:vir:79 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcC Confidence 23333444456777888888888888899998873 3468999999999999888777777788654332111 0 Q ss_pred ---------------------c----ccccccceec-ccchHHHHHHHHHH-HhhhhcCC--CCEEEEehHHHH-HHHHh Q lcl|NC_021307. 147 ---------------------E----TTKSVDLTPA-TGTTYDAIGVNALS-LLVNAGKK--WGATLLDDVAEP-ILNGA 196 (310) Q Consensus 147 ---------------------~----~~~~~~~~~~-~~~~~~~~~~~~~~-~l~~~~~~--~~~~~~~~~~~~-~l~~l 196 (310) . ....+..... .-...|.++.++.. ++.+.+.+ .-+.+|.+.... +-..+ T Consensus 161 lqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l 240 (337) T protein:vir:79 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGRELLHDKYFPI 240 (337) T ss_pred ccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHH Confidence 0 0001111111 12345666677664 45666655 457888877654 22222 Q ss_pred hhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEee-cccEEEEeecceeeeccccccc Q lcl|NC_021307. 197 KDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQV-GGLSFDVSDQATLNLGTPQAPN 275 (310) Q Consensus 197 ~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~ 275 (310) -+....+-- ...........++-|+|.+..+++|.+.. ++..++++-+... +...-.+.+.+ T Consensus 241 ~n~~~~ptE----~~Aa~~i~s~k~iGGlpa~~~PffP~~~~--lVT~L~NLsIY~Q~gs~RR~~~d~p----------- 303 (337) T protein:vir:79 241 VNATQAPTE----RLAADLIVSQKRIGNLPAVRVPFFPKRAL--MVTKLSNLSIYYQEGARRRTLKEVP----------- 303 (337) T ss_pred hccCCCcHH----HHHHHHHHHhhhhCCceeEEccccCCCce--EEeechhcEEEEecCcEEEEEEEcc----------- Confidence 222221110 00001111235799999999999999874 4456777654332 22332222222 Q ss_pred chhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 276 FVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 276 ~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|.+.-.-..--|+.|.+.++++.+.+-- T Consensus 304 -----~r~rie~y~s~Ne~YvVEd~~~~a~ienI~ 333 (337) T protein:vir:79 304 -----ERDRIENYESSNDAYVVEDFGCGCVAENIE 333 (337) T ss_pred -----ccccccchhhccceeeeeccccEEEEecee Confidence 233333333344466777777766654322 No 191 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=95.86 E-value=0.0013 Score=36.32 Aligned_cols=288 Identities=12% Similarity=0.040 Sum_probs=159.1 Q ss_pred CccchhhhHH-----HHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCce-EEEEEcCCceeeeec- Q lcl|NC_021307. 1 MAAGTAFPVN-----HTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGV-KIPHWTGDVSAAWIG- 73 (310) Q Consensus 1 ~aa~~~~~~~-----~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v~- 73 (310) |-.-.+.--. ....-...+....-.+.|.+...+...+++.|-+++..+++++..-.. .+-...+++-++-+. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t 80 (337) T protein:vir:10 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecC Confidence 4432222111 111111111122224778888899999999999999999999875433 333334455444432 Q ss_pred -ccccccccccceeeeEeeeeeeEeeehhhHHHhhc--ChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccc----c Q lcl|NC_021307. 74 -EGDMKPITKGDMSVQQVEPHKIATIFVASAETVRA--NPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNL----D 146 (310) Q Consensus 74 -Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~--s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~----~ 146 (310) .+...|..-..++.-....++.---..|+.+.|+. ..++|+..+.+.+.++++.-.-.--++|......... . T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPl 160 (337) T protein:vir:10 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcC Confidence 23333444456777888888888888899998873 3468999999999999888777777788654332111 0 Q ss_pred ---------------------c----ccccccceec-ccchHHHHHHHHHH-HhhhhcCC--CCEEEEehHHHH-HHHHh Q lcl|NC_021307. 147 ---------------------E----TTKSVDLTPA-TGTTYDAIGVNALS-LLVNAGKK--WGATLLDDVAEP-ILNGA 196 (310) Q Consensus 147 ---------------------~----~~~~~~~~~~-~~~~~~~~~~~~~~-~l~~~~~~--~~~~~~~~~~~~-~l~~l 196 (310) . ....+..... .-...|.++.++.. ++.+.+.+ .-+.+|.+.... +-..+ T Consensus 161 lqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l 240 (337) T protein:vir:10 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) T ss_pred ccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHH Confidence 0 0001111111 12345666677664 45666655 457888877654 22222 Q ss_pred hhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEee-cccEEEEeecceeeeccccccc Q lcl|NC_021307. 197 KDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQV-GGLSFDVSDQATLNLGTPQAPN 275 (310) Q Consensus 197 ~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~ 275 (310) -+....+-- ...........++-|+|.+..+++|.+.. ++..++++-+... +...-.+.+.+ T Consensus 241 ~n~~~~ptE----~~Aa~~i~s~k~iGGlpa~~~PffP~~~~--lVT~L~NLsIY~Q~gs~RR~~~d~p----------- 303 (337) T protein:vir:10 241 VNATQAPTE----RLAADLIVSQKRIGNLPAVRVPFFPKRAL--MVTKLSNLSIYYQEGARRRTLKEVP----------- 303 (337) T ss_pred hccCCCcHH----HHHHHHHHHhhhhCCceeEEccccCCCce--EEeechhcEEEEecCcEEEEEEEcc----------- Confidence 222221110 00001111235799999999999999874 4456777654333 22332222222 Q ss_pred chhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 276 FVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 276 ~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|.+.-.-..--|+.|.+.++++.+.+-- T Consensus 304 -----~r~rie~y~s~Ne~YvVEd~~~~a~ienI~ 333 (337) T protein:vir:10 304 -----ERDRIENYESSNDAYVVEDFGCGCVAENIE 333 (337) T ss_pred -----ccccccchhhccceeeeeccccEEEEecee Confidence 233343333344566777777766654322 No 192 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=95.81 E-value=0.0013 Score=36.32 Aligned_cols=283 Identities=10% Similarity=-0.001 Sum_probs=138.9 Q ss_pred Cccch-hhhHHHHHhhccccCCCCc------eechhhHHHHH-----HHHHhhchhhhhcceeecCC---CceEEEEEcC Q lcl|NC_021307. 1 MAAGT-AFPVNHTQIAQTGDSMFQG------YLEPEQAQDYF-----AEAEKTSIVQRVARKIPMGS---TGVKIPHWTG 65 (310) Q Consensus 1 ~aa~~-~~~~~~~~~~~~~~~~~g~------~i~~~~~~~ii-----~~~~~~s~l~~~~~~~~~~~---~~~~ip~~~~ 65 (310) +.++. .+..+....++.....+++ .-+|++...++ +.+........+..+...+. ....+++.+. T Consensus 17 ~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~ 96 (336) T protein:vir:78 17 LPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP 96 (336) T ss_pred cchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHHHhcccceeeehhhhhhhhhhcccccCCCccccEEEEeeeec Confidence 21111 1111111111111111111 12345554444 44444444444444433332 3466777777 Q ss_pred CceeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHHHHHcccCcccc Q lcl|NC_021307. 66 DVSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN---PGNYLGTMRTKVATAIALAFDEAALHGTDSPFD 142 (310) Q Consensus 66 ~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s---~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~ 142 (310) .+.+.+.+.++..|..+...+..+.+.+.++..+.++.+-+..+ ..++.+.-....++++.+.+|+-.+.|+....- T Consensus 97 ~G~a~~ygd~~D~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~ 176 (336) T protein:vir:78 97 TTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLEN 176 (336) T ss_pred ceeeEEeecccCCCeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeccccce Confidence 78888889999999999999999999999999999996666543 357778888888888889999888888765433 Q ss_pred ccccccc---ccccceec--ccchHHHHHHHHH---HHhhhhcC------CCCEEEEehHHHHHHHHhhhccCccccccc Q lcl|NC_021307. 143 KNLDETT---KSVDLTPA--TGTTYDAIGVNAL---SLLVNAGK------KWGATLLDDVAEPILNGAKDANGRPLFVES 208 (310) Q Consensus 143 ~~~~~~~---~~~~~~~~--~~~~~~~~~~~~~---~~l~~~~~------~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~ 208 (310) .+.+... ........ ...+.+.++.|+. ..+..... .+..++|.+..+..|.. .+..|.-+.. T Consensus 177 ~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~-- 253 (336) T protein:vir:78 177 YGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA-- 253 (336) T ss_pred EEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHH-- Confidence 2322211 11111111 1123343443433 33332221 24478999998888864 2333322211 Q ss_pred cccccccccCCceeeeeeEEEeCCCC-CCceeEeeecceeeeEEeecc---cEEEEeecceeeecccccccchhhhhcCc Q lcl|NC_021307. 209 TYEAVTTPYREGRILGRPTILSDHVA-SGTTVGYLGDFSQIVWGQVGG---LSFDVSDQATLNLGTPQAPNFVSLWQHNL 284 (310) Q Consensus 209 ~~~~~~~~~~~~~l~G~pv~~t~~~~-~~~~~~~~gd~~~~~~~~~~~---~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (310) ......-++.+...+.+. ++ |+-..++.-+..+ ..+.+.. .+..-++ -.+.-. T Consensus 254 --------~lk~n~Pnl~i~t~pel~~Ag------g~~~~~~~~~~~~~~t~~~~~p~--------~f~~lpv-q~~~~~ 310 (336) T protein:vir:78 254 --------KLKEIFPKLEFVTIPEYDTAS------GRLVQLWAPRVEGKDTATCGFTE--------KMRAHSI-ERYSSY 310 (336) T ss_pred --------HHHHhcCccEEEEcccccccC------cceEEEEEeeccCCcceeeecch--------hhhccce-eecCce Confidence 000001112222222221 11 1211222222211 1111110 0000010 011123 Q ss_pred EEEEEEEEecc-EEeccCceEEEeec Q lcl|NC_021307. 285 VAVRVEAEYGL-LINDVEAFVKLTNA 309 (310) Q Consensus 285 ~~~r~~~~~d~-~v~~~~a~~~l~~a 309 (310) .......|.++ .+++|-||+++++- T Consensus 311 ~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 311 FRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred eEeccccceeeeeeeccchheeeccC Confidence 33445556544 77799999999999 No 193 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=95.75 E-value=0.0015 Score=36.04 Aligned_cols=268 Identities=13% Similarity=0.127 Sum_probs=115.1 Q ss_pred hccccCCCCceechh--hHHHH-HHHHHhhchhhhhccee---------ecCCCceEEEEEcC-Cc--eeeeeccc--cc Q lcl|NC_021307. 15 AQTGDSMFQGYLEPE--QAQDY-FAEAEKTSIVQRVARKI---------PMGSTGVKIPHWTG-DV--SAAWIGEG--DM 77 (310) Q Consensus 15 ~~~~~~~~g~~i~~~--~~~~i-i~~~~~~s~l~~~~~~~---------~~~~~~~~ip~~~~-~~--~a~~v~Eg--~~ 77 (310) |. .+--...++|+ +.... .+...+.+.|.+-+-+. ..++..+++|.+.. .. +..+-+.. +. T Consensus 1 Ma--~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~ 78 (349) T protein:vir:94 1 MA--ITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CC--ceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 22 22223456665 45543 33444555555432222 23456688898753 22 22222222 23 Q ss_pred ccccccceee-eEeeeeee--EeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHc---ccCc----cccccccc Q lcl|NC_021307. 78 KPITKGDMSV-QQVEPHKI--ATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALH---GTDS----PFDKNLDE 147 (310) Q Consensus 78 ~~~~~~~~~~-i~l~~~k~--~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~---G~g~----~~~~~~~~ 147 (310) ++..+.+-++ +-....+- -..-.++.++ +..+.++.|.+++++...+...+.++. |--. +....... T Consensus 79 ~t~~kit~~~~~a~~~~r~kaw~~~Dla~~l---sG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~~ 155 (349) T protein:vir:94 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVEL---TSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQ 155 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHh---hCchHHHHHHHHHHHHHhhHHHHHHHHHHHhhhccccccccccccc Confidence 4444544443 33333222 2333345554 334778889999999888877775553 2111 11000000 Q ss_pred ccccccceecccchHHHHHHHHHHHhhhh-----cCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCcee Q lcl|NC_021307. 148 TTKSVDLTPATGTTYDAIGVNALSLLVNA-----GKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRI 222 (310) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-----~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l 222 (310) ...........+.+...++ +...++.+. ...-..++||...+..|++++-=. +.++. .+ ...-.++ T Consensus 156 ~~~~~d~~~~a~~~~~~~~-~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~----~i~~s-~~---~~~i~ty 226 (349) T protein:vir:94 156 NDMVVDVSATSGFDAGAFI-DATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID----FIRDA-EN---NTMFATY 226 (349) T ss_pred CceeEEecccCCCChhhHH-HHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhh----hccCc-cc---Cccccee Confidence 0001111111222333333 334344332 345678999999999998654211 11111 11 1123579 Q ss_pred eeeeEEEeCCCCCCc-------eeEeeecceeeeEEeec-ccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEe- Q lcl|NC_021307. 223 LGRPTILSDHVASGT-------TVGYLGDFSQIVWGQVG-GLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEY- 293 (310) Q Consensus 223 ~G~pv~~t~~~~~~~-------~~~~~gd~~~~~~~~~~-~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~- 293 (310) +|++|++++.||... ...+||. ..+.++..+ ...+++.+++.... ..++-.+....++ T Consensus 227 ~G~~VivDD~~Pv~~~g~~~~yttylfg~-GAi~~~~~~~~~~~E~~rd~~~g~------------~~G~d~L~~R~~~~ 293 (349) T protein:vir:94 227 QGYRVIVDDSMTVVGQDTSRKFISIIFGQ-GAIGYGEGNPEMPLEYEREASRAN------------GGGVETLWTRKTWL 293 (349) T ss_pred cCcEEEEeCCCccccCCCCceEEEEEeec-ceEEeecCCCCcceeeecccccCC------------cceeEEEEEeeEEE Confidence 999999999998421 1112331 222233322 12355555442100 0111222222222 Q ss_pred ----ccEEeccCceE--------------EEeecC Q lcl|NC_021307. 294 ----GLLINDVEAFV--------------KLTNAA 310 (310) Q Consensus 294 ----d~~v~~~~a~~--------------~l~~aa 310 (310) |+.... ..+. -|..++ T Consensus 294 ~hp~G~s~~~-a~v~~~~~~~~~~sPt~aeLa~~~ 327 (349) T protein:vir:94 294 LHPFGYSFTS-AVITGNGTETIARSASWQDLANAA 327 (349) T ss_pred eeeeeeeecc-cccCCCccccccCCCChHHhcCCc Confidence 111111 1111 111112 No 194 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=95.46 E-value=0.0013 Score=36.37 Aligned_cols=294 Identities=10% Similarity=-0.057 Sum_probs=132.6 Q ss_pred Cc-cchhhh-------HHHHHhhccccCC-----------CCceechhhH----HHHHHHHHhhchhhhhcceeecCC-- Q lcl|NC_021307. 1 MA-AGTAFP-------VNHTQIAQTGDSM-----------FQGYLEPEQA----QDYFAEAEKTSIVQRVARKIPMGS-- 55 (310) Q Consensus 1 ~a-a~~~~~-------~~~~~~~~~~~~~-----------~g~~i~~~~~----~~ii~~~~~~s~l~~~~~~~~~~~-- 55 (310) |. -|-.++ .+...+|...... .+-.-+|++. ..+++.+-.-..+.++..+...+. T Consensus 34 l~~~gi~~~~~~~~~~~~~~~amd~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~ 113 (379) T protein:vir:10 34 LESYGIHLNGRKNKLFELMQFAMDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWLPGHVRILTAVREADEFLGLSTVGQWD 113 (379) T ss_pred HHhcCccccchhhhhhhhhhhhhccccccccccccCccccccccchHHHHHhhcchHHHHHhhhhhhhhhcccccCCCce Confidence 00 000000 0001111111000 0000123333 345665555544555555444332 Q ss_pred -CceEEEEEcCCceeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021307. 56 -TGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN---PGNYLGTMRTKVATAIALAFDE 131 (310) Q Consensus 56 -~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s---~~~~~~~v~~~l~~a~~~~~d~ 131 (310) ....+++.+..+.+.+.+.++..|..+...+..+-..+.++..+.++..-+..+ ..++...-.....+++.+.+|+ T Consensus 114 ~~~~~~~v~e~~G~A~~ygd~~d~pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~ 193 (379) T protein:vir:10 114 DEQIVQRVLEGLGTAQPYTDGGNMALMSWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNR 193 (379) T ss_pred eeeEEEeeeeeeeeeEEeccccCCCeeeeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhce Confidence 245667777778888888888888888777777777777777777776555433 3578888899999999999999 Q ss_pred HHHcccCcc-ccc-cccc-------cccccc--c-eecccchHHHHHHHHHHHhh---hhc-------CCCCEEEEehHH Q lcl|NC_021307. 132 AALHGTDSP-FDK-NLDE-------TTKSVD--L-TPATGTTYDAIGVNALSLLV---NAG-------KKWGATLLDDVA 189 (310) Q Consensus 132 ~~l~G~g~~-~~~-~~~~-------~~~~~~--~-~~~~~~~~~~~~~~~~~~l~---~~~-------~~~~~~~~~~~~ 189 (310) -.|.|.+.+ ... +.+. .+.... . ..=...+.+.++.|+...+. ... ..+..+++.+.. T Consensus 194 i~f~G~~d~~~~~yGllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~ 273 (379) T protein:vir:10 194 VAFYGYNDGSGRTFGFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAY 273 (379) T ss_pred EEEEeecCCCcceEEEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHH Confidence 999995422 221 1111 111000 0 00112234444444443332 211 123378899998 Q ss_pred HHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeec Q lcl|NC_021307. 190 EPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLG 269 (310) Q Consensus 190 ~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~ 269 (310) +..|..- +..|.-++.- ... .+.+-++...|=....+ ..++...++ +-+..+......+.-..... T Consensus 274 ~~~L~~~-n~~g~Tvl~~--lk~---n~Pnl~i~t~pEL~~ag-gg~~~~~~~-------~~~~~~~~t~~~~~~~~~~p 339 (379) T protein:vir:10 274 ENYITTP-TELGYSVAQY--MRE---SYPNVTFVSAPELNDAN-GGSSAIYYY-------ADAVENNGTDDGRTWLQVVP 339 (379) T ss_pred HHhhccc-cccCccHHHH--HHH---hcCCcEEEEcccccccC-CCccEEEEE-------eeccCCCccCCcceEEEecc Confidence 8888642 3333222110 000 01122343433332211 111111111 11112111100000000000 Q ss_pred ccccccchhhhhcCcEEEEEEEEe-ccEEeccCceEEEeec Q lcl|NC_021307. 270 TPQAPNFVSLWQHNLVAVRVEAEY-GLLINDVEAFVKLTNA 309 (310) Q Consensus 270 ~~~~~~~~~~~~~~~~~~r~~~~~-d~~v~~~~a~~~l~~a 309 (310) .....-++ -...-........|. |..+++|.||+.+.++ T Consensus 340 ~k~~~l~v-e~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 340 TKMFTLGV-EKKIKGYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred hhhhhccc-eecCceeEeccccceeeeeeecchhhheecCC Confidence 00000000 001112223344454 5578899999999999 No 195 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=95.40 E-value=0.0019 Score=35.54 Aligned_cols=284 Identities=11% Similarity=0.019 Sum_probs=136.5 Q ss_pred Cccc-hhhhHHHHHhhc------cccCCCCceechhhHHHHH-----HHHHhhchhhhhcceeecCC---CceEEEEEcC Q lcl|NC_021307. 1 MAAG-TAFPVNHTQIAQ------TGDSMFQGYLEPEQAQDYF-----AEAEKTSIVQRVARKIPMGS---TGVKIPHWTG 65 (310) Q Consensus 1 ~aa~-~~~~~~~~~~~~------~~~~~~g~~i~~~~~~~ii-----~~~~~~s~l~~~~~~~~~~~---~~~~ip~~~~ 65 (310) +.+. ..+..+...-+. -+-...+..-+|++...++ +.+.+.-....+......+. ....+++.+. T Consensus 17 ~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~i~~~l~~~i~p~~~~~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~ 96 (336) T protein:vir:10 17 LPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP 96 (336) T ss_pred ecchhhhhhhhHHHhhhhhhhccCccccCCCchhHHHHHhhcccceeeehhhhhhhhhhccccccCCccceeEEEeeeec Confidence 1111 111111100000 0111111123445555444 44444444455555443332 2456677777 Q ss_pred CceeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHHHHHcccCcccc Q lcl|NC_021307. 66 DVSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN---PGNYLGTMRTKVATAIALAFDEAALHGTDSPFD 142 (310) Q Consensus 66 ~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s---~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~ 142 (310) .+.+.+.+.++..|..+...+..+.+.+.++..+.++.+-+..+ ..++.+.-....++++.+.+|+-.+.|+....- T Consensus 97 ~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~ 176 (336) T protein:vir:10 97 TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLEN 176 (336) T ss_pred eeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccce Confidence 77788889999999998888888888999999999995444432 357778888888889999999988888765432 Q ss_pred ccccccccc---cccee--cccchHHHHHH---HHHHHhhhhc------CCCCEEEEehHHHHHHHHhhhccCccccccc Q lcl|NC_021307. 143 KNLDETTKS---VDLTP--ATGTTYDAIGV---NALSLLVNAG------KKWGATLLDDVAEPILNGAKDANGRPLFVES 208 (310) Q Consensus 143 ~~~~~~~~~---~~~~~--~~~~~~~~~~~---~~~~~l~~~~------~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~ 208 (310) .+.+...+. ..... ...++.+.++. .++..+.... -.+..++|.+..+..|.. .+..|.-+..- T Consensus 177 yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~- 254 (336) T protein:vir:10 177 YGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK- 254 (336) T ss_pred EEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccC-CCccCccHHHH- Confidence 222211111 11111 11122233333 3344443322 237789999998888753 23333222110 Q ss_pred cccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecc---cEEEEeecceeeecccccccchhhhhcCcE Q lcl|NC_021307. 209 TYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGG---LSFDVSDQATLNLGTPQAPNFVSLWQHNLV 285 (310) Q Consensus 209 ~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~---~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (310) ... .+.+-++...|=.- .++ |+-..+++-...+ ..+.+... +..-++ -.+.-.. T Consensus 255 -lk~---n~Pnl~i~t~pEl~----~a~------G~~~~l~~~~~~~~~t~~~~~p~~--------~~~l~v-q~~~~~~ 311 (336) T protein:vir:10 255 -LKD---IFPKLEFVTIPEYD----TAS------GRLVQLWAPRVEGKDTATCGFTEK--------MRAHSI-ERYSSYF 311 (336) T ss_pred -HHH---hcCccEEEEccccc----cCC------CceEEEEEEecCCCcceeeecchh--------hhccce-eecCcee Confidence 000 01122333333221 111 1111222221111 11111100 000000 0011223 Q ss_pred EEEEEEEec-cEEeccCceEEEeec Q lcl|NC_021307. 286 AVRVEAEYG-LLINDVEAFVKLTNA 309 (310) Q Consensus 286 ~~r~~~~~d-~~v~~~~a~~~l~~a 309 (310) ......|.+ ..+++|.||+++++- T Consensus 312 ~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 312 RQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred EeccccceeeeeeeccchheeeecC Confidence 344555654 477899999999999 No 196 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=95.14 E-value=0.0027 Score=34.69 Aligned_cols=288 Identities=12% Similarity=0.050 Sum_probs=152.3 Q ss_pred CccchhhhHHH----HHhhcccc---CCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCce-EEEEEcCCceeeee Q lcl|NC_021307. 1 MAAGTAFPVNH----TQIAQTGD---SMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGV-KIPHWTGDVSAAWI 72 (310) Q Consensus 1 ~aa~~~~~~~~----~~~~~~~~---~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v 72 (310) |-.-.+.--.. .+.....+ ....-.+.|.+...+...+++.|-+++..+++++..-.. .+-...+++-++-+ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrt 80 (355) T protein:vir:98 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccCccccccc Confidence 33332211111 11111111 111224678888899999999999999999999875433 33443445444443 Q ss_pred c--cc-ccccccccceeeeEeeeeeeEeeehhhHHHhhc--ChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccc--- Q lcl|NC_021307. 73 G--EG-DMKPITKGDMSVQQVEPHKIATIFVASAETVRA--NPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKN--- 144 (310) Q Consensus 73 ~--Eg-~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~--s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~--- 144 (310) . .+ ...|..-..++.-....++.---..|+.+.|+. ..++|+..+.+.+.+.++.-.-.--|+|........ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~ 160 (355) T protein:vir:98 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) T ss_pred cCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhh Confidence 2 11 223444455677778888888888888888873 236899999999999888877777778865433211 Q ss_pred -ccc----------------------cc--------cccccee-cccchHHHHHHHHHH-HhhhhcCC--CCEEEEehHH Q lcl|NC_021307. 145 -LDE----------------------TT--------KSVDLTP-ATGTTYDAIGVNALS-LLVNAGKK--WGATLLDDVA 189 (310) Q Consensus 145 -~~~----------------------~~--------~~~~~~~-~~~~~~~~~~~~~~~-~l~~~~~~--~~~~~~~~~~ 189 (310) +.. .+ ..+.... ..-...|.++.++.. ++.+.+.+ .-+.+|.+.. T Consensus 161 nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (355) T protein:vir:98 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) T ss_pred CcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 100 00 0000011 112345666777664 45665554 4578888775 Q ss_pred HH-HHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEee-cccEEEEeecceee Q lcl|NC_021307. 190 EP-ILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQV-GGLSFDVSDQATLN 267 (310) Q Consensus 190 ~~-~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~-~~~~v~~~~~~~~~ 267 (310) .. +...+.+....+--. .+ ........++-|+|.+..+++|.+.. ++..++++-+... +...-.+.+.+. T Consensus 241 la~k~~~l~n~~~~ptE~---~A-a~~i~s~k~iGGlpa~~~PffP~~~~--lVT~L~NLsIY~Q~gs~RR~~~d~p~-- 312 (355) T protein:vir:98 241 LADKYFPLVNKQQENSES---LA-ADIIISQKRIGNLPAVRVPYFPANAV--LVTTLENLSIYFMDESHRRSIDENPK-- 312 (355) T ss_pred hHHHhhhHhhccCCcHHH---HH-HHHHHHhhhhCCceeEEccccCCCce--EEeeccccEEEEecCcEEEEEEeccc-- Confidence 43 322333222211100 00 01111235799999999999999874 4456777654332 223322222221 Q ss_pred ecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 268 LGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +|.+.-.-..--|+.|.+.++++.+.+-- T Consensus 313 --------------r~rie~y~s~Ne~YvVEd~~~~a~ienI~ 341 (355) T protein:vir:98 313 --------------KDRVENYESMNIDYVVEVYAAGCLLENIT 341 (355) T ss_pred --------------cccccchhhhcceeeeeccccEEEeecee Confidence 22222222233344455555544432111 No 197 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=95.13 E-value=0.0027 Score=34.66 Aligned_cols=286 Identities=7% Similarity=-0.001 Sum_probs=148.7 Q ss_pred Ccc----chh--hhHHH--HHhhccc-cCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceE-EEEEcCCceee Q lcl|NC_021307. 1 MAA----GTA--FPVNH--TQIAQTG-DSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVK-IPHWTGDVSAA 70 (310) Q Consensus 1 ~aa----~~~--~~~~~--~~~~~~~-~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~-ip~~~~~~~a~ 70 (310) |.- -.+ +..-. .+..... +....-.+.|.+...+...+.+.|-+++..+++++..-..+ +-...+++-++ T Consensus 1 m~~~m~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iag 80 (341) T protein:vir:27 1 MSQILTQSAREYMDNFAQQLAKSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGVSGLYTG 80 (341) T ss_pred CcccccHHHHHHHHHHHHHHHHHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeecccccceee Confidence 221 111 11110 1111111 11222247788889999999999999999999998754433 33333444444 Q ss_pred eecccccccccccceeeeEeeeeeeEeeehhhHHHhhc-----ChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccc Q lcl|NC_021307. 71 WIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRA-----NPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNL 145 (310) Q Consensus 71 ~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~-----s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~ 145 (310) -+.- +..| .++.++......++.---+.|+-+.|+. +.++|+..+.+.+.++++.-.-.--++|......... T Consensus 81 rtdt-~R~~-r~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~~Td~ 158 (341) T protein:vir:27 81 RKAG-GRFT-KQVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEADTDP 158 (341) T ss_pred ccCC-Ccee-cccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeeccCCCh Confidence 4332 2222 2246777777777777777788888862 2478999999999999888777777888764332211 Q ss_pred ----cc------------------ccccccceeccc---chHHHHHHHHHH-HhhhhcCC--CCEEEEehHHHH-HHHHh Q lcl|NC_021307. 146 ----DE------------------TTKSVDLTPATG---TTYDAIGVNALS-LLVNAGKK--WGATLLDDVAEP-ILNGA 196 (310) Q Consensus 146 ----~~------------------~~~~~~~~~~~~---~~~~~~~~~~~~-~l~~~~~~--~~~~~~~~~~~~-~l~~l 196 (310) .. ...........+ ...|.++.++.. ++.+.+.+ .-+.+|.+.... +-..+ T Consensus 159 ~anPllqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l 238 (341) T protein:vir:27 159 SANPLGQDVNEGWIAFVKNRKASQVVDVDVYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVGSGLIGAAQAKL 238 (341) T ss_pred hhcccccccchhHHHHHHhhcccceeccceeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEchhhhhhhhhhh Confidence 00 000001111112 235566667664 45666555 357888876654 22233 Q ss_pred hhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecc-cEEEEeecceeeeccccccc Q lcl|NC_021307. 197 KDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGG-LSFDVSDQATLNLGTPQAPN 275 (310) Q Consensus 197 ~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~-~~v~~~~~~~~~~~~~~~~~ 275 (310) .+....+- .-........++-|+|.+..+++|.+.. ++..++++-+....| ..-.+.+.+. .+ T Consensus 239 ~n~~~~pt------E~~Aa~~i~k~iGGlpa~~~PffP~~~~--lVT~L~NLsIY~Q~gs~RR~~~d~p~--------r~ 302 (341) T protein:vir:27 239 YDKADKPS------EQIAAQKLDKTIAGRPAYVPPFLPDNAM--VVTIPENLQVLTQHGTAQRKAKHESD--------RK 302 (341) T ss_pred hccCCCCH------HHHHHHHHHHhhCCCeEEEccccCCCce--EEeeccceEEEEecCcEEEEEEeccc--------cc Confidence 22211111 0011122246899999999999999874 445777765443332 3322222222 22 Q ss_pred chhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 276 FVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 276 ~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .+-.+++ ++ +...+|.... -.|..++..+ T Consensus 303 rie~yes---~Y-vVEdyg~~~~--~~~~~vkl~~ 331 (341) T protein:vir:27 303 RSKTHTG---AW-KVTQWVCWKR--SPLTTQKKST 331 (341) T ss_pred cccchhh---hh-eeehhhhhhh--ccccccccCc Confidence 2222222 22 2222332222 2233444433 No 198 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=95.03 E-value=0.0029 Score=34.47 Aligned_cols=288 Identities=11% Similarity=0.017 Sum_probs=157.8 Q ss_pred CccchhhhHHHH----HhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCce-EEEEEcCCceeeeec- Q lcl|NC_021307. 1 MAAGTAFPVNHT----QIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGV-KIPHWTGDVSAAWIG- 73 (310) Q Consensus 1 ~aa~~~~~~~~~----~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v~- 73 (310) |-.-.+.--... +........... .+.|.+...+...+.+.|-+++..+++++..-.. .+-...+++-++-+. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt 80 (339) T protein:vir:79 1 MRNDTRRLFAAYKAAIAKLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVSGPVASTTDT 80 (339) T ss_pred CChHHHHHHHHHHHHHHHHhCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccCcceeecccC Confidence 433322221111 111111112222 4678888999999999999999999999875433 344444555454431 Q ss_pred -ccccccccccceeeeEeeeeeeEeeehhhHHHhhc--ChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccc----- Q lcl|NC_021307. 74 -EGDMKPITKGDMSVQQVEPHKIATIFVASAETVRA--NPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNL----- 145 (310) Q Consensus 74 -Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~--s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~----- 145 (310) -++..|..-..++.-....++.---..|+.+.|+. ..++|+..+.+.+.+.++.-.-.--++|........+ T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPl 160 (339) T protein:vir:79 81 TQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDRVANPM 160 (339) T ss_pred CCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeecCCChhhCcC Confidence 12222333346677777888877788888888872 3468999999999998887776777777654432211 Q ss_pred --------------------cc-ccc-ccccee-c---ccchHHHHHHHHHH-HhhhhcCC--CCEEEEehHHHH-HHHH Q lcl|NC_021307. 146 --------------------DE-TTK-SVDLTP-A---TGTTYDAIGVNALS-LLVNAGKK--WGATLLDDVAEP-ILNG 195 (310) Q Consensus 146 --------------------~~-~~~-~~~~~~-~---~~~~~~~~~~~~~~-~l~~~~~~--~~~~~~~~~~~~-~l~~ 195 (310) .. .+. ..+... . .-...|.++.++.. ++.+.+.+ .-+.+|.+.... +-.. T Consensus 161 lqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~dLla~k~~~ 240 (339) T protein:vir:79 161 LQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGRNLLSDKYFP 240 (339) T ss_pred ccccchhHHHHHHhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhHhhh Confidence 00 000 001101 1 12345677777774 56676665 457788777654 2222 Q ss_pred hhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEe-ecccEEEEeecceeeecccccc Q lcl|NC_021307. 196 AKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQ-VGGLSFDVSDQATLNLGTPQAP 274 (310) Q Consensus 196 l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~ 274 (310) +.+....|-- ...........++-|+|.+..+++|.+... +..++++-+-. .+...-.+.+++ T Consensus 241 l~n~~~~ptE----~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ll--VT~L~NLsIY~Q~gs~RR~~~d~p---------- 304 (339) T protein:vir:79 241 LVNRDRDPVQ----QIAADLIISQKRIGNLPAIRVPYFPANGLL--VTRLDNLSIYYQEGGRRRTILDNA---------- 304 (339) T ss_pred HhhcCCChHH----HHHHHHHHHhhhhCCceeEEccccCCCceE--EeechhcEEEEecCcEEEEEEecc---------- Confidence 3222221110 000111112357999999999999998744 45666654332 233333333222 Q ss_pred cchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 275 NFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 275 ~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|.+.-.-..--|+.|.+.++++.+.+.. T Consensus 305 ------~r~rie~y~s~Ne~YvVEd~~~~a~iEni~ 334 (339) T protein:vir:79 305 ------KRDRIENYESSNDAYVIEDLACAAMAENIA 334 (339) T ss_pred ------ccccccchhhccceeeeeccccEEEeeeee Confidence 233333333344466677777776655444 No 199 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=94.98 E-value=0.0023 Score=35.03 Aligned_cols=283 Identities=10% Similarity=-0.003 Sum_probs=135.6 Q ss_pred Cccch-hhhHHHHHhhccccCCCCc------eechhhHHHHHH--HHHhhchhhhhcceeecC---C---CceEEEEEcC Q lcl|NC_021307. 1 MAAGT-AFPVNHTQIAQTGDSMFQG------YLEPEQAQDYFA--EAEKTSIVQRVARKIPMG---S---TGVKIPHWTG 65 (310) Q Consensus 1 ~aa~~-~~~~~~~~~~~~~~~~~g~------~i~~~~~~~ii~--~~~~~s~l~~~~~~~~~~---~---~~~~ip~~~~ 65 (310) +.++. .+..+....++.....+++ .-+|++...+++ ..+-..+-++....+|+. . ....++.... T Consensus 17 ~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~ 96 (336) T protein:vir:10 17 LPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP 96 (336) T ss_pred cchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHHhhcCcceeeeeechhchhhhcccccCCCcceeeEEEEeeee Confidence 21111 1111111111111111111 124555555552 222333333333333332 1 2355666666 Q ss_pred CceeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHHHHHcccCcccc Q lcl|NC_021307. 66 DVSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN---PGNYLGTMRTKVATAIALAFDEAALHGTDSPFD 142 (310) Q Consensus 66 ~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s---~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~ 142 (310) .+.+.+.+....+|..+...+...-+.+.++..+.++.+-+... ..++.+.-....++++.+.+|+-.+.|+....- T Consensus 97 ~G~a~~ygd~~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~ 176 (336) T protein:vir:10 97 TTKVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLEN 176 (336) T ss_pred eeeEEEccccCCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeecccce Confidence 77778888888999999888888888999999999996666543 357778888888888888999888888775433 Q ss_pred cccccccc---cccceec--ccchHHHHHHHHH---HHhhhhcC------CCCEEEEehHHHHHHHHhhhccCccccccc Q lcl|NC_021307. 143 KNLDETTK---SVDLTPA--TGTTYDAIGVNAL---SLLVNAGK------KWGATLLDDVAEPILNGAKDANGRPLFVES 208 (310) Q Consensus 143 ~~~~~~~~---~~~~~~~--~~~~~~~~~~~~~---~~l~~~~~------~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~ 208 (310) .+.+...+ ....... ...+.+.+..|+. ..+..... .+..+++.+..+..|.. .+..|.-+.. T Consensus 177 ~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~-- 253 (336) T protein:vir:10 177 YGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA-- 253 (336) T ss_pred EEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHH-- Confidence 22222111 1111111 1123343443333 33332221 24478999998888864 3333322211 Q ss_pred cccccccccCCceeeeeeEEEeCCCC-CCceeEeeecceeeeEEeecc---cEEEEeecceeeecccccccchhhhhcCc Q lcl|NC_021307. 209 TYEAVTTPYREGRILGRPTILSDHVA-SGTTVGYLGDFSQIVWGQVGG---LSFDVSDQATLNLGTPQAPNFVSLWQHNL 284 (310) Q Consensus 209 ~~~~~~~~~~~~~l~G~pv~~t~~~~-~~~~~~~~gd~~~~~~~~~~~---~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (310) ......-++.+...+.+. ++ |+-..++.-+..+ .++.+.. .+..-++ -.+.-. T Consensus 254 --------~lk~n~Pnl~i~t~pel~~Ag------g~~~~~~~~~~~~~~t~~~~~P~--------~f~~lpv-q~~~~~ 310 (336) T protein:vir:10 254 --------KLKEIFPKLEFVTIPEYDTAS------GRLVQLWAPRVEGKDTATCGFTE--------KMRAHSI-ERYSSY 310 (336) T ss_pred --------HHHHhCCccEEEEcccccccC------CceEEEEEecccCCcceeeecCh--------hhhccce-eecCce Confidence 000001112233222221 11 1211222222111 1111110 0000010 011122 Q ss_pred EEEEEEEEecc-EEeccCceEEEeec Q lcl|NC_021307. 285 VAVRVEAEYGL-LINDVEAFVKLTNA 309 (310) Q Consensus 285 ~~~r~~~~~d~-~v~~~~a~~~l~~a 309 (310) .....+.|.++ .+.+|-||+++.+- T Consensus 311 ~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 311 FRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred eEeccccceeeeeeeccchheeeccC Confidence 33445556544 67799999999999 No 200 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=94.76 E-value=0.0036 Score=34.01 Aligned_cols=288 Identities=12% Similarity=0.070 Sum_probs=155.6 Q ss_pred CccchhhhHHH----HHhhc-cc--cCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCce-EEEEEcCCceeeee Q lcl|NC_021307. 1 MAAGTAFPVNH----TQIAQ-TG--DSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGV-KIPHWTGDVSAAWI 72 (310) Q Consensus 1 ~aa~~~~~~~~----~~~~~-~~--~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v 72 (310) |-.-.+.--.. .+... .. +....-.+.|.+...+...+++.|-+++..+++++..-.. .+-...+++-++-+ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrt 80 (355) T protein:vir:18 1 MRQETRFKFNAYLTQLAKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGVTGTIASTT 80 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeeccCcceeecc Confidence 43322221111 11111 11 1112234678889999999999999999999999875443 33444455555443 Q ss_pred c--cc-ccccccccceeeeEeeeeeeEeeehhhHHHhhc--ChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccc--- Q lcl|NC_021307. 73 G--EG-DMKPITKGDMSVQQVEPHKIATIFVASAETVRA--NPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKN--- 144 (310) Q Consensus 73 ~--Eg-~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~--s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~--- 144 (310) . .+ ...|.....++.-....++.---..|+.+.|+. ..++|+..+.+.+.++++.-.-.--|+|........ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~ 160 (355) T protein:vir:18 81 DTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTTRADTSDRVK 160 (355) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhh Confidence 2 12 233444455777788888888888888888873 236899999999999888777777778865433211 Q ss_pred -ccc----------------------cc--------cccccee-cccchHHHHHHHHHH-HhhhhcCC--CCEEEEehHH Q lcl|NC_021307. 145 -LDE----------------------TT--------KSVDLTP-ATGTTYDAIGVNALS-LLVNAGKK--WGATLLDDVA 189 (310) Q Consensus 145 -~~~----------------------~~--------~~~~~~~-~~~~~~~~~~~~~~~-~l~~~~~~--~~~~~~~~~~ 189 (310) +.. .+ ..+.... ..-...|.++.++.. ++.+.+.+ .-+.+|.+.. T Consensus 161 nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~dLVvivG~dL 240 (355) T protein:vir:18 161 NPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDPKLVAIVGRKL 240 (355) T ss_pred CcCccccchhHHHHHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 100 00 0000011 112345677777775 45665554 4578888775 Q ss_pred HH-HHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEee-cccEEEEeecceee Q lcl|NC_021307. 190 EP-ILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQV-GGLSFDVSDQATLN 267 (310) Q Consensus 190 ~~-~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~-~~~~v~~~~~~~~~ 267 (310) .. +...+.+..+.+--. . + ........++-|+|.+..+++|.+.. ++..++++-+... +...-.+.+.+ T Consensus 241 la~k~~~l~n~~~~ptE~--~-A-a~~i~s~k~iGGlpa~~~PffP~~~~--lVT~L~NLsIY~Q~gs~RR~~~d~p--- 311 (355) T protein:vir:18 241 LADKYFPLVNKQQENTES--L-A-ADIIISQKRIGNLPAVRVPYFPANAV--FVTTLENLSIYFMDESHRRSIDENP--- 311 (355) T ss_pred hHHHHhHHhhccCChHHH--H-H-HHHHHHHHhhCCceeEEccccCCCce--EEeeccccEEEEecCcEEEEEEecc--- Confidence 43 322333332221110 0 0 01011135799999999999999864 4456777654332 22332222222 Q ss_pred ecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 268 LGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|.+.-.-..--|+.|.+.++++.+.+.- T Consensus 312 -------------~r~rie~y~s~Ne~YvVEd~~~~a~ieni~ 341 (355) T protein:vir:18 312 -------------KKDRVENYESMNIDYVVEAYAAGCLLENIT 341 (355) T ss_pred -------------ccccccchhhhcceeeeeccccEEEEeeee Confidence 223333322333455555555555443211 No 201 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=94.72 E-value=0.0037 Score=33.94 Aligned_cols=288 Identities=12% Similarity=0.047 Sum_probs=155.2 Q ss_pred CccchhhhHHH----HHhhcccc--C-CCCceechhhHHHHHHHHHhhchhhhhcceeecCCCce-EEEEEcCCceeeee Q lcl|NC_021307. 1 MAAGTAFPVNH----TQIAQTGD--S-MFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGV-KIPHWTGDVSAAWI 72 (310) Q Consensus 1 ~aa~~~~~~~~----~~~~~~~~--~-~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v 72 (310) |-.-.+.--.. .+.....+ . ...=.+.|.+...+...+.+.|-+++..+++++..-.. ++-...+++-++-+ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:60 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 33332221111 11111111 1 11224678888999999999999999999999875433 33443445544443 Q ss_pred c--cccc-ccccccceeeeEeeeeeeEeeehhhHHHhhc--ChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccc- Q lcl|NC_021307. 73 G--EGDM-KPITKGDMSVQQVEPHKIATIFVASAETVRA--NPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLD- 146 (310) Q Consensus 73 ~--Eg~~-~~~~~~~~~~i~l~~~k~~~~~~is~ell~~--s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~- 146 (310) . -+.. .|..-..++.-....++.---..|+.+.|+. ..++|+..+.+.+.+.++.-.=.--++|.......... T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:60 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETSDRSS 160 (357) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 2 1222 2333345677777888877788888888872 23678899999998888877767777776544322110 Q ss_pred ---c------------------ccc-------c-----cccee-cccchHHHHHHHHHH-HhhhhcCC--CCEEEEehHH Q lcl|NC_021307. 147 ---E------------------TTK-------S-----VDLTP-ATGTTYDAIGVNALS-LLVNAGKK--WGATLLDDVA 189 (310) Q Consensus 147 ---~------------------~~~-------~-----~~~~~-~~~~~~~~~~~~~~~-~l~~~~~~--~~~~~~~~~~ 189 (310) . ... . +.... ..-...|.++.++.. ++.+.+.+ .-+.+|.+.. T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (357) T protein:vir:60 161 NQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQL 240 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 0 000 0 00111 112345667777664 46666655 4578888776 Q ss_pred HH-HHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEe-ecccEEEEeecceee Q lcl|NC_021307. 190 EP-ILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQ-VGGLSFDVSDQATLN 267 (310) Q Consensus 190 ~~-~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~-~~~~~v~~~~~~~~~ 267 (310) .. +...+.+..+.+-- ...........++-|+|.+..+++|.+... +..++++-+-. .+...-.+.+.+ T Consensus 241 la~k~~~l~n~~~~pTE----~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ll--VT~L~NLsIY~Q~gs~RR~~~d~p--- 311 (357) T protein:vir:60 241 LADKYFPIVNREQDNSE----MLAADVIISQKRIGNLPAVRVPYFPADAML--ITKLENLSIYYMDDSHRRVIEENP--- 311 (357) T ss_pred hhHHhhhHhhcCCChHH----HHHHHHHHHhhhhcCcceEEccccCCCceE--EeeccccEEEEecCcEEEEEEecc--- Confidence 53 22233332222110 000111112457999999999999998744 45666654332 233333333222 Q ss_pred ecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 268 LGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|.+.-.-..--|+.|.+.++++.+.... T Consensus 312 -------------~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~ 341 (357) T protein:vir:60 312 -------------KLDRVENYESMNIDYVVEDYAAGCLVEKIK 341 (357) T ss_pred -------------ccccccchhhhcceeeeeccccEEEeeeee Confidence 233333333334456666666666655433 No 202 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=94.57 E-value=0.0026 Score=34.80 Aligned_cols=290 Identities=12% Similarity=0.043 Sum_probs=126.9 Q ss_pred Cccc----------------------------------------hhhhHHHHHhhccccCCCCceechhhHH----HHHH Q lcl|NC_021307. 1 MAAG----------------------------------------TAFPVNHTQIAQTGDSMFQGYLEPEQAQ----DYFA 36 (310) Q Consensus 1 ~aa~----------------------------------------~~~~~~~~~~~~~~~~~~g~~i~~~~~~----~ii~ 36 (310) |+++ -.++.. -....+.++.-+|-++.+ .+++ T Consensus 21 ~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~a~da~----~~~~~t~~~~gip~~~~~~~~p~~~~ 96 (388) T protein:vir:99 21 MANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSA----YVAPTTQASIPTPIQFLQQWLPGFVK 96 (388) T ss_pred hhcCCcceeeechhhHhhhhcceeccCccchhhhhhhhhhhhhhcccCcc----cccccccCcccHHHHHhhhhccceee Confidence 1111 011100 000111111113433333 2444 Q ss_pred HHHhhchhhhhcceeecCC---CceEEEEEcCCceeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhcC---h Q lcl|NC_021307. 37 EAEKTSIVQRVARKIPMGS---TGVKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN---P 110 (310) Q Consensus 37 ~~~~~s~l~~~~~~~~~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s---~ 110 (310) .+.......++..+...+. ....+++.+..+.+.+.+.++..|..+...+..+-..+.+.....++.+-+..+ . T Consensus 97 ~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g 176 (388) T protein:vir:99 97 VLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMR 176 (388) T ss_pred eeechhhhhhhccccccCCccceeEEEeeeecceeEEEeecccCCCceeccceeeeeeEEEEEeeeeecHHHHHHHHhhC Confidence 4444444444544433332 246677777778888889888888887777766777777777777776655533 3 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHcccCccccc---ccccc---cccccce------ecccchHHHHHHHHH---HHhhh Q lcl|NC_021307. 111 GNYLGTMRTKVATAIALAFDEAALHGTDSPFDK---NLDET---TKSVDLT------PATGTTYDAIGVNAL---SLLVN 175 (310) Q Consensus 111 ~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~---~~~~~---~~~~~~~------~~~~~~~~~~~~~~~---~~l~~ 175 (310) .++...-.....+++.+.+|+-.|.|....... +.+.. ...+..+ +-...+.+.++.|+. ..+.. T Consensus 177 ~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~ 256 (388) T protein:vir:99 177 INSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRV 256 (388) T ss_pred CCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCcccccccccCCcCcccccCCHHHHHHHHHHHHHHHHH Confidence 578888888888899999999999995433211 11110 1111111 011124444444444 33322 Q ss_pred hcC-------CCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeE-eeecc-e Q lcl|NC_021307. 176 AGK-------KWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVG-YLGDF-S 246 (310) Q Consensus 176 ~~~-------~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~-~~gd~-~ 246 (310) ... .+..+++.+..+..|.. .+..|.-++.- ... .+.+-++...|=....+...+...+ ++.+- . T Consensus 257 qs~g~~~~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~--lk~---n~Pnl~i~t~pEl~~a~~tgg~~~~~~~~~~~~ 330 (388) T protein:vir:99 257 QSEDNIDPEDVDITLVLPMNKVDMLSV-VTDLGISVRDW--LKQ---TYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVD 330 (388) T ss_pred hcCCeeeecccceEEEechHHHHhccc-cCcCCccHHHH--HHH---hcCCcEEEEecccccccccCCceeEEEEecccc Confidence 221 12268888888888853 23333222110 000 0111222222222111111111111 11110 0 Q ss_pred eeeEEee-cccEEEEeecceeeecccccccchhhhhcC--cEEEEEEEE-eccEEeccCceEEEeec Q lcl|NC_021307. 247 QIVWGQV-GGLSFDVSDQATLNLGTPQAPNFVSLWQHN--LVAVRVEAE-YGLLINDVEAFVKLTNA 309 (310) Q Consensus 247 ~~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~r~~~~-~d~~v~~~~a~~~l~~a 309 (310) ....+.. ....... . ....+..-+ .+.. ........| .|..+++|.||+++++- T Consensus 331 ~~~~~~~~~~~t~~~---~---~p~~~~~l~---vq~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 331 TAVDGSTDGGDTWAQ---L---VQSKFVTLG---VEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred cccccCccCcceeEE---e---ccccccccc---ceecCceeEeccccceeeeEEeccchhheeccC Confidence 0000000 0000000 0 000000000 0111 122222333 46678899999999999 No 203 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=94.54 E-value=0.0041 Score=33.64 Aligned_cols=288 Identities=11% Similarity=0.025 Sum_probs=157.7 Q ss_pred CccchhhhHHH----HHhhccccCCCCc-eechhhHHHHHHHHHhhchhhhhcceeecCCCce-EEEEEcCCceeeeec- Q lcl|NC_021307. 1 MAAGTAFPVNH----TQIAQTGDSMFQG-YLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGV-KIPHWTGDVSAAWIG- 73 (310) Q Consensus 1 ~aa~~~~~~~~----~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v~- 73 (310) |-.-.+.--.. .+........... .+.|.+...+...+.+.|-+++..+++++..-.. ++-...+++-++-.. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt 80 (337) T protein:vir:78 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcceeeeecC Confidence 44322221111 1111111111222 3678888999999999999999999999875433 333334444444432 Q ss_pred -ccccccccccceeeeEeeeeeeEeeehhhHHHhhc--ChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccc----- Q lcl|NC_021307. 74 -EGDMKPITKGDMSVQQVEPHKIATIFVASAETVRA--NPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNL----- 145 (310) Q Consensus 74 -Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~--s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~----- 145 (310) -+...|..-..++.-....++.---..|+.+.|+. ..++|+..+.+.+.+.++.-.-.--++|........+ T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPl 160 (337) T protein:vir:78 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcC Confidence 22233334456777778888877788888888872 3468999999999998887776777777654432211 Q ss_pred --------------------cc-c---cccccceec-ccchHHHHHHHHHH-HhhhhcCC--CCEEEEehHHHHH-HHHh Q lcl|NC_021307. 146 --------------------DE-T---TKSVDLTPA-TGTTYDAIGVNALS-LLVNAGKK--WGATLLDDVAEPI-LNGA 196 (310) Q Consensus 146 --------------------~~-~---~~~~~~~~~-~~~~~~~~~~~~~~-~l~~~~~~--~~~~~~~~~~~~~-l~~l 196 (310) +. . ...+..... .-...|.++.++.. ++.+.+.+ .-+.+|.+..... -..+ T Consensus 161 lqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l 240 (337) T protein:vir:78 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) T ss_pred ccccchHHHHHHHhcchhhhhccccccCCceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHH Confidence 00 0 001111111 22345677777775 46676655 4678888776542 2222 Q ss_pred hhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEe-ecccEEEEeecceeeeccccccc Q lcl|NC_021307. 197 KDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQ-VGGLSFDVSDQATLNLGTPQAPN 275 (310) Q Consensus 197 ~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~ 275 (310) .+....|--. ..........++-|+|.+..+++|.+... +..++++-+-. .+...-.+.+++ T Consensus 241 ~n~~~~ptE~----~Aa~~i~s~k~iGGl~a~~~PfFP~~~il--VT~L~NLsIY~Q~gs~RR~~~d~p----------- 303 (337) T protein:vir:78 241 VNATQAPTER----LAADLIVSQKRIGNLPAVRVPFFPKRALM--VTKLSNLSIYYQEGARRRTLKEVP----------- 303 (337) T ss_pred HhcCCCcHHH----HHHHHHHHhhhhcCcceEEccccCCCceE--EeechhcEEEEecCcEEEEEEecc----------- Confidence 2222211100 00011112357999999999999998744 45666654332 233333333222 Q ss_pred chhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 276 FVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 276 ~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|.+.-.-..--|+.|.+.++++.+.+-- T Consensus 304 -----~r~rie~y~s~Ne~YvVEd~~~~a~iEnI~ 333 (337) T protein:vir:78 304 -----ERDRIENYESSNDAYVVEDFGCGCVAENIE 333 (337) T ss_pred -----ccccccchhhccceeeeeccccEEEEecee Confidence 233333333344466777777766654322 No 204 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=94.53 E-value=0.0042 Score=33.63 Aligned_cols=286 Identities=10% Similarity=0.011 Sum_probs=149.8 Q ss_pred CccchhhhHHHHHhh-ccccC-----CC-CceechhhHHHHHHHHHhhchhhhhcceeecCCCce-EEEEEcCCceeeee Q lcl|NC_021307. 1 MAAGTAFPVNHTQIA-QTGDS-----MF-QGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGV-KIPHWTGDVSAAWI 72 (310) Q Consensus 1 ~aa~~~~~~~~~~~~-~~~~~-----~~-g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v 72 (310) |++ +.+..-....+ .-|.. .+ -=.+.|.+...+...+++.|-+++..+++++..-.. ++-...+++-++-. T Consensus 1 mtr-~~~~~y~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrt 79 (336) T protein:vir:37 1 MNK-QAYYALAAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATEKGVTGRK 79 (336) T ss_pred CcH-HHHHHHHHHHHHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccCccccccc Confidence 555 34432221111 11111 11 124778888999999999999999999999874432 33444444444433 Q ss_pred cccccccccccceeeeEeeeeeeEeeehhhHHHhhcC--hhH-HHHHHHHHHHHHHHHHHHHHHHcccCccccc-cccc- Q lcl|NC_021307. 73 GEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN--PGN-YLGTMRTKVATAIALAFDEAALHGTDSPFDK-NLDE- 147 (310) Q Consensus 73 ~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s--~~~-~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~-~~~~- 147 (310) .-+.. -....++.-....++.---..|+.+.|+.= .++ +...+...+.++++.-.-.--++|....... .+.. T Consensus 80 dt~r~--r~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~TdnPllq 157 (336) T protein:vir:37 80 QTGRN--LATLDHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVATNTTKTDLS 157 (336) T ss_pred CCCCC--ccccCCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHHhcchhhhcccceeeccCCCCcccc Confidence 32211 122346667777777777788888888742 133 2233333344444444444445564322211 0100 Q ss_pred -----------------cc-----ccccc---ee-cccchHHHHHHHHHHHhhhhcCC--CCEEEEehHHHH-HHHHhhh Q lcl|NC_021307. 148 -----------------TT-----KSVDL---TP-ATGTTYDAIGVNALSLLVNAGKK--WGATLLDDVAEP-ILNGAKD 198 (310) Q Consensus 148 -----------------~~-----~~~~~---~~-~~~~~~~~~~~~~~~~l~~~~~~--~~~~~~~~~~~~-~l~~l~d 198 (310) .. ...+. .. ..-...|.++.++...+.+.+.+ .-+.+|.+.... +...+.. T Consensus 158 DVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~~ 237 (336) T protein:vir:37 158 DVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSKETKLIQQ 237 (336) T ss_pred ccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhccchHHhcCCCeEEEEchhhhhhhhhhhhh Confidence 00 00010 11 11234567777777777776655 557788776542 2222333 Q ss_pred ccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeec-ccEEEEeecceeeecccccccch Q lcl|NC_021307. 199 ANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVG-GLSFDVSDQATLNLGTPQAPNFV 277 (310) Q Consensus 199 ~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~-~~~v~~~~~~~~~~~~~~~~~~~ 277 (310) ..+.... .... ........++-|+|.+..+++|.+.. ++..++++-+.... ...-.+.+.+ T Consensus 238 ~~~~~Pt-E~~A--a~~~~~~k~iGGlpa~~~PffP~~~~--lVT~L~NLsIY~Q~gs~RR~~~d~p------------- 299 (336) T protein:vir:37 238 KHGLTPT-EKAA--LGSHNLMGSFGGMNAITPPNFPARAA--AVTTLKNLSVYTEAESVRRSLRNDE------------- 299 (336) T ss_pred hcCCCHH-HHHH--HHHHHHHHhhCCceEEEccccCCCce--EEeeccccEEEEecCcEEEEEEEcc------------- Confidence 2221110 0000 00112346799999999999999874 44567776544332 2332222222 Q ss_pred hhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 278 SLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 278 ~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|.+.-.-..--|+.|.+.++++.+.... T Consensus 300 ---~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~ 329 (336) T protein:vir:37 300 ---DKKGLVTSYYRQEGYVVEDLGLMTAIDHTK 329 (336) T ss_pred ---ccccccchhhhcceeeeeccccEEEeeeee Confidence 234444444445677888888888888777 No 205 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=94.41 E-value=0.0045 Score=33.45 Aligned_cols=286 Identities=10% Similarity=-0.000 Sum_probs=151.9 Q ss_pred CccchhhhHHHHHhh-ccccC----C-C-CceechhhHHHHHHHHHhhchhhhhcceeecCCCce-EEEEEcCCceeeee Q lcl|NC_021307. 1 MAAGTAFPVNHTQIA-QTGDS----M-F-QGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGV-KIPHWTGDVSAAWI 72 (310) Q Consensus 1 ~aa~~~~~~~~~~~~-~~~~~----~-~-g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v 72 (310) |++ +.+..-....+ .-|.. . + -=.+.|.+...+...+++.|-+++..+++++..-.. ++-...+++-++-. T Consensus 1 mtr-~~~~~y~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrt 79 (336) T protein:vir:37 1 MNK-QAYYALAAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGATEKGVTGRK 79 (336) T ss_pred CcH-HHHHHHHHHHHHHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeeccCccccccc Confidence 555 34432222111 11211 1 1 124778899999999999999999999999874432 33333444444333 Q ss_pred cccccccccccceeeeEeeeeeeEeeehhhHHHhhcC--hhHHH-HHHHHHHHHHHHHHHHHHHHcccCcc----cccc- Q lcl|NC_021307. 73 GEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN--PGNYL-GTMRTKVATAIALAFDEAALHGTDSP----FDKN- 144 (310) Q Consensus 73 ~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s--~~~~~-~~v~~~l~~a~~~~~d~~~l~G~g~~----~~~~- 144 (310) .- +..| .+..++.-....++.---..|+.+.|+.= .+++. ..+...+.++++.-.-.--++|.... .|.+ T Consensus 80 dt-~R~~-~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~df~~~~~~~~~~r~iALD~i~IGfnG~s~A~~TdnPllq 157 (336) T protein:vir:37 80 QT-GRNL-ANLDHTQNGFELAETDSGIIVPWALFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVADNTTKADLS 157 (336) T ss_pred CC-Cccc-cccCcCCcccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHHhhchhhhcccceeeccCCCCCccc Confidence 22 2222 23467777788888888888999988742 23322 33333344444444444455664332 2210 Q ss_pred -----------------ccc-c-cccccc-e--e-cccchHHHHHHHHHHHhhhhcCC--CCEEEEehHHHH-HHHHhhh Q lcl|NC_021307. 145 -----------------LDE-T-TKSVDL-T--P-ATGTTYDAIGVNALSLLVNAGKK--WGATLLDDVAEP-ILNGAKD 198 (310) Q Consensus 145 -----------------~~~-~-~~~~~~-~--~-~~~~~~~~~~~~~~~~l~~~~~~--~~~~~~~~~~~~-~l~~l~d 198 (310) ... . ....+. . . ..-...|.++.++...+.+.+.+ .-+.+|.+.... +...+.. T Consensus 158 DVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~~ 237 (336) T protein:vir:37 158 DVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSKETKLIQQ 237 (336) T ss_pred ccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhcCchHHhcCCCeEEEEchhhhhhhhhhhhh Confidence 000 0 000011 1 1 11234567777777777776655 557788776542 2222333 Q ss_pred ccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeec-ccEEEEeecceeeecccccccch Q lcl|NC_021307. 199 ANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVG-GLSFDVSDQATLNLGTPQAPNFV 277 (310) Q Consensus 199 ~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~-~~~v~~~~~~~~~~~~~~~~~~~ 277 (310) ..+.... ..... .......++-|+|.+..+++|.+.. ++..++++-+.... ...-.+.+.+ T Consensus 238 ~~~~~Pt-E~~Aa--~~~~~~k~iGGlpa~~~PffP~~~~--lVT~L~NLsIY~Q~gs~RR~~~d~p------------- 299 (336) T protein:vir:37 238 KHGLTPT-EKAAL--GSHNLMGSFGGMNAITPPNFPARAA--AVTTLKNLSVYTEAESVRRSLRNDE------------- 299 (336) T ss_pred hcCCCHH-HHHHH--HHHHHHHhhCCceeEEccccCCCce--EEeechhcEEEEecCcEEEEEEEcc------------- Confidence 3221111 00000 0112245799999999999999874 44567776543332 2322222222 Q ss_pred hhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 278 SLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 278 ~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|.+.-.-..--|+.|.+.++++.+.... T Consensus 300 ---~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~ 329 (336) T protein:vir:37 300 ---DKKGLVTSYYRQEGYVVEDLGLMTAIDHTK 329 (336) T ss_pred ---ccccccchhhhcceeeeeccccEEEeeeee Confidence 234444444445577888888888887777 No 206 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=94.34 E-value=0.0047 Score=33.35 Aligned_cols=288 Identities=12% Similarity=0.057 Sum_probs=156.0 Q ss_pred CccchhhhHHH----HHhhcccc--C-CCCceechhhHHHHHHHHHhhchhhhhcceeecCCCce-EEEEEcCCceeeee Q lcl|NC_021307. 1 MAAGTAFPVNH----TQIAQTGD--S-MFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGV-KIPHWTGDVSAAWI 72 (310) Q Consensus 1 ~aa~~~~~~~~----~~~~~~~~--~-~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v 72 (310) |-.-.+.--.. .+.....+ . ...=.+.|.+...+...+.+.|-+++..+++++..-.. ++-...+++-++-+ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:56 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 33332221111 11111111 1 11224678888999999999999999999999875433 33333445444443 Q ss_pred c--ccccc-cccccceeeeEeeeeeeEeeehhhHHHhhc--ChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccc- Q lcl|NC_021307. 73 G--EGDMK-PITKGDMSVQQVEPHKIATIFVASAETVRA--NPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLD- 146 (310) Q Consensus 73 ~--Eg~~~-~~~~~~~~~i~l~~~k~~~~~~is~ell~~--s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~- 146 (310) . -+... |..-..++.-....++.---..|+.+.|+. ..++|+..+.+.+.+.++.-.=.--++|.......... T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:56 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETSDRSS 160 (357) T ss_pred cCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 2 12222 222245677777888877788888888872 23678898999998888877766777776544322110 Q ss_pred ---c------------------ccc-------c-----cccee-cccchHHHHHHHHHH-HhhhhcCC--CCEEEEehHH Q lcl|NC_021307. 147 ---E------------------TTK-------S-----VDLTP-ATGTTYDAIGVNALS-LLVNAGKK--WGATLLDDVA 189 (310) Q Consensus 147 ---~------------------~~~-------~-----~~~~~-~~~~~~~~~~~~~~~-~l~~~~~~--~~~~~~~~~~ 189 (310) . ... . +.... ..-...|.++.++.. ++.+.+.+ .-+.+|.+.. T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (357) T protein:vir:56 161 NPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQL 240 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 0 000 0 00111 112345667777664 46666655 4578888776 Q ss_pred HH-HHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEe-ecccEEEEeecceee Q lcl|NC_021307. 190 EP-ILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQ-VGGLSFDVSDQATLN 267 (310) Q Consensus 190 ~~-~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~-~~~~~v~~~~~~~~~ 267 (310) .. +...+.+..+.+--. ..........++-|+|.+..+++|.+... +..++++-+-. .+...-.+.+.+ T Consensus 241 la~k~~~l~n~~~~pTE~----~Aa~~i~s~k~iGGl~a~~~PfFP~~~ll--VT~L~NLsIY~Q~gs~RR~~~d~p--- 311 (357) T protein:vir:56 241 LADKYFPIVNKEQDNSEM----LAADVIISQKRIGNLPAVRVPYFPADAML--ITKLENLSIYYMDDSHRRVIEENP--- 311 (357) T ss_pred hhhhhhhHhhccCChHHH----HHHHHHHHhhhhCCceeEEccccCCCceE--EeeccccEEEEecCcEEEEEEecc--- Confidence 54 333333332221110 00111112357999999999999998744 45666654332 233333333222 Q ss_pred ecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 268 LGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|.+.-.-..--|+.|.+.++++.+.... T Consensus 312 -------------~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~ 341 (357) T protein:vir:56 312 -------------KLDRVENYESMNIDYVVEDYAAGCLVEKIK 341 (357) T ss_pred -------------ccccccchhhhcceeeeeccccEEEeeeee Confidence 233333333334456666667666665544 No 207 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=94.18 E-value=0.0051 Score=33.13 Aligned_cols=284 Identities=10% Similarity=0.011 Sum_probs=158.5 Q ss_pred Cc----cchhhhHH----HHHhhccc---cCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceE-EEEEcCCce Q lcl|NC_021307. 1 MA----AGTAFPVN----HTQIAQTG---DSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVK-IPHWTGDVS 68 (310) Q Consensus 1 ~a----a~~~~~~~----~~~~~~~~---~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~-ip~~~~~~~ 68 (310) |. .-.+.--. ..+..... +....-.+.|.+...+...+.+.|-+++..+++++..-... +-...+++- T Consensus 1 m~~~M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~i 80 (358) T protein:vir:78 1 MSQTLTVQAEQRLNKYCDALAKAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVGVGQLY 80 (358) T ss_pred CcccccHHHHHHHHHHHHHHHHHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeecCCccc Confidence 22 22111111 11111111 11122247788888899999999999999999998754333 333344554 Q ss_pred eeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhc-----ChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccc Q lcl|NC_021307. 69 AAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRA-----NPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDK 143 (310) Q Consensus 69 a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~-----s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~ 143 (310) ++-+.. ..|.....++.-....++.---+.|+.+.|+. +..+|+..+.+.+.+.++.-.-.--++|....... T Consensus 81 agrt~t--r~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T 158 (358) T protein:vir:78 81 TGRKKG--GRFKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSAADDT 158 (358) T ss_pred ceecCC--CccccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecccceeeccCC Confidence 554443 23444556677777777777778888888873 22378999999999988877777777776544322 Q ss_pred cc-------------------------cc---ccccccceec---ccchHHHHHHHHH-HHhhhhcCC--CCEEEEehHH Q lcl|NC_021307. 144 NL-------------------------DE---TTKSVDLTPA---TGTTYDAIGVNAL-SLLVNAGKK--WGATLLDDVA 189 (310) Q Consensus 144 ~~-------------------------~~---~~~~~~~~~~---~~~~~~~~~~~~~-~~l~~~~~~--~~~~~~~~~~ 189 (310) .. .. .+..+..... .-...|.++.++. ..+.+.+.+ .-+.+|.+.. T Consensus 159 d~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dL 238 (358) T protein:vir:78 159 DPTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPRLVVLVGTDL 238 (358) T ss_pred ChhhCcCccccchHHHHHHHhhchhhhhccccccCceeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 11 00 0011111111 1245567777765 566776655 4678888877 Q ss_pred HH-HHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEe-ecccEEEEeecceee Q lcl|NC_021307. 190 EP-ILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQ-VGGLSFDVSDQATLN 267 (310) Q Consensus 190 ~~-~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~-~~~~~v~~~~~~~~~ 267 (310) .. +-..+.+..+.+--. ..+ .....++-|+|.+..+++|.+... +..++++-+-. .+...-.+.+++ T Consensus 239 la~k~~~l~n~~~~pTE~--~Aa----~~i~k~iGGlpa~~~PfFP~~~il--VT~L~NLsIY~Q~gs~RR~~~d~p--- 307 (358) T protein:vir:78 239 VAAAQAKLYSEATKPSEQ--IAA----QQLAKSIAGRKAYIPPFFPGKRMV--VTTLDNLHCYTQRGTRKRKADDNQ--- 307 (358) T ss_pred hhHHhhhHhhcCCCcHHH--HHH----HHHHHHhCCCeEEEccccCCCceE--EeeccccEEEEecCcEEEEEEecc--- Confidence 54 323333332221110 111 111257899999999999998744 45666654332 233333333222 Q ss_pred ecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 268 LGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|.+.-.-..--|+.|.+.++++.+.... T Consensus 308 -------------~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~ 337 (358) T protein:vir:78 308 -------------DSKSFDNQYWRMEGYALGEHKAYGGFEEAD 337 (358) T ss_pred -------------ccccccchhhhcceeeeeccccEEEEeeee Confidence 233333333344566777777777666554 No 208 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=94.17 E-value=0.0052 Score=33.11 Aligned_cols=281 Identities=7% Similarity=-0.087 Sum_probs=121.5 Q ss_pred CccchhhhHHHHHhh-----------ccccCCCCceechhhHHHHHHHHHhhchhhh-hcc--eeecCCCceEEEEEcCC Q lcl|NC_021307. 1 MAAGTAFPVNHTQIA-----------QTGDSMFQGYLEPEQAQDYFAEAEKTSIVQR-VAR--KIPMGSTGVKIPHWTGD 66 (310) Q Consensus 1 ~aa~~~~~~~~~~~~-----------~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~-~~~--~~~~~~~~~~ip~~~~~ 66 (310) +.--..+.++...+. +-..-.+...+-..+...+-+.+...+--.. +++ .....+.+++||+.... T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkIp~i~~~ 85 (329) T protein:vir:10 6 ITGVKTMNKEIKNATGKLKLNLQHFANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTVIKGDVT 85 (329) T ss_pred EechhhhhhhhhcccceeEEehhhhcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEEeeeccc Confidence 111111122111000 0000001111112222222222222221111 122 24556888999998653 Q ss_pred ceeee-ecccccccccccceeeeEeeeeeeEeeehhhHHHhhcChh--HHHHHHHHHHHHHHHHHHHHHHHcccCccccc Q lcl|NC_021307. 67 VSAAW-IGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPG--NYLGTMRTKVATAIALAFDEAALHGTDSPFDK 143 (310) Q Consensus 67 ~~a~~-v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~--~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~ 143 (310) .-..+ ...|-....-+.++...+++..|.-.+..=.-+. +++.. .+...+.+.+...++-.+|...+.---..... T Consensus 86 gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~-dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~ 164 (329) T protein:vir:10 86 ELKDYKRNATNEFDHPQIQETTYFLDQEKYWGRFVDALDR-RDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAK 164 (329) T ss_pred ccccccCCCCccccccccceeEEEeecccceeeecchhhH-hhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhccc Confidence 22222 2233222233344555666665555444322222 22222 33455666677777778887655322111110 Q ss_pred ccccccccccceecccchHHHHHHHHHHHhhhhcCCCC-EEEEehHHHHHHHHhhhccCccccccccccccccccCCcee Q lcl|NC_021307. 144 NLDETTKSVDLTPATGTTYDAIGVNALSLLVNAGKKWG-ATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRI 222 (310) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l 222 (310) . . . ...+.....+.+.++...|........ .++++|..+..|.+.. +..............+.-++| T Consensus 165 ~---~----~-~~~t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~~----~f~~~~~~~~~~~~~g~Vg~i 232 (329) T protein:vir:10 165 H---L----T-VGSGADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKFV----IELPQGDNRQQVLGKGVQGEL 232 (329) T ss_pred c---c----c-cccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhh----hhhccccccccceeeeeeeee Confidence 0 0 0 111223344555566667766543333 5677888888775421 112122222223334455789 Q ss_pred eeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCc Q lcl|NC_021307. 223 LGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEA 302 (310) Q Consensus 223 ~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a 302 (310) .|.+|+.+++....+.-++++..+.+.... +--.+++.+. . -.++.-.++...++|..|.++++ T Consensus 233 dG~~Ii~vps~~~k~in~ii~~~~A~~~~~-K~~~~~~~~p--------~-------~~~~a~~v~gr~yyd~~V~~~k~ 296 (329) T protein:vir:10 233 DGFTIVKVPSKMLQGVEAMAVIGEVMASPI-QANEAKLNSN--------V-------PGMFGTLAEQMLYTGAFVPEHLQ 296 (329) T ss_pred cCeEEEEecCCcccceeEEEEcCCceeeee-eeeeeeeeCC--------C-------CccchheeeeeeeeeeEEEcccc Confidence 999999876543323223334333322111 1111222110 0 01233577788899999999985 Q ss_pred eE--EEeecC Q lcl|NC_021307. 303 FV--KLTNAA 310 (310) Q Consensus 303 ~~--~l~~aa 310 (310) .. ....+| T Consensus 297 ~~I~~~~~~a 306 (329) T protein:vir:10 297 KYIFTIGGKE 306 (329) T ss_pred CEEEEecccC Confidence 43 333333 No 209 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=94.13 E-value=0.0053 Score=33.06 Aligned_cols=288 Identities=12% Similarity=0.058 Sum_probs=154.8 Q ss_pred CccchhhhHHH----HHhhcccc--C-CCCceechhhHHHHHHHHHhhchhhhhcceeecCCCce-EEEEEcCCceeeee Q lcl|NC_021307. 1 MAAGTAFPVNH----TQIAQTGD--S-MFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGV-KIPHWTGDVSAAWI 72 (310) Q Consensus 1 ~aa~~~~~~~~----~~~~~~~~--~-~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v 72 (310) |-.-.+.--.. .+.....+ . ...=.+.|.+...+...+.+.|-+++..+++++..-.. ++-...+++-++-+ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:20 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 33332221111 11111111 1 11224678888999999999999999999999875433 33443445544443 Q ss_pred c--ccccc-cccccceeeeEeeeeeeEeeehhhHHHhhc--ChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccc- Q lcl|NC_021307. 73 G--EGDMK-PITKGDMSVQQVEPHKIATIFVASAETVRA--NPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLD- 146 (310) Q Consensus 73 ~--Eg~~~-~~~~~~~~~i~l~~~k~~~~~~is~ell~~--s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~- 146 (310) . -+... |..-..++.-....++.---..|+.+.|+. ..++|+..+.+.+.+.++.-.=.--++|.......... T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:20 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETSDRSS 160 (357) T ss_pred cCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 2 12222 222245677777888877788888888872 23678899999998888877766777776544322110 Q ss_pred ---c------------------ccc-------c-----cccee-cccchHHHHHHHHHH-HhhhhcCC--CCEEEEehHH Q lcl|NC_021307. 147 ---E------------------TTK-------S-----VDLTP-ATGTTYDAIGVNALS-LLVNAGKK--WGATLLDDVA 189 (310) Q Consensus 147 ---~------------------~~~-------~-----~~~~~-~~~~~~~~~~~~~~~-~l~~~~~~--~~~~~~~~~~ 189 (310) . ... . +.... ..-...|.++.++.. ++.+.+.+ .-+.+|.+.. T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (357) T protein:vir:20 161 NPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQL 240 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 0 000 0 00111 112345667777664 46666655 4578888776 Q ss_pred HH-HHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEe-ecccEEEEeecceee Q lcl|NC_021307. 190 EP-ILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQ-VGGLSFDVSDQATLN 267 (310) Q Consensus 190 ~~-~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~-~~~~~v~~~~~~~~~ 267 (310) .. +...+.+..+.+--. ..........++-|+|.+..+++|.+... +..++++-+-. .+...-.+.+.+ T Consensus 241 la~k~~~l~n~~~~ptE~----~Aa~~i~s~k~iGGl~a~~~PfFP~~~il--VT~L~NLsIY~Q~gs~RR~~~d~p--- 311 (357) T protein:vir:20 241 LADKYFPIVNKEQDNSEM----LAADVIISQKRIGNLPAVRVPYFPADAML--ITKLENLSIYYMDDSHRRVIEENP--- 311 (357) T ss_pred hhhhhhhHhhccCChHHH----HHHHHHHHhhhhCCceeEEccccCCCceE--EeeccccEEEEecCcEEEEEEecc--- Confidence 54 233333332221110 00111112357999999999999998744 45666654332 233333333222 Q ss_pred ecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 268 LGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ++|.+.-.-..--|+.|.+.++++.+.... T Consensus 312 -------------~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~ 341 (357) T protein:vir:20 312 -------------KLDRVENYESMNIDYVVEDYAAGCLVEKIK 341 (357) T ss_pred -------------ccccccchhhhcceeeeeccccEEEeeeee Confidence 223333333334455666666666554433 No 210 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=93.92 E-value=0.0059 Score=32.79 Aligned_cols=278 Identities=9% Similarity=-0.068 Sum_probs=121.8 Q ss_pred Cc------cc-hhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhh--hcc--eeecCCCceEEEEEcCCcee Q lcl|NC_021307. 1 MA------AG-TAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQR--VAR--KIPMGSTGVKIPHWTGDVSA 69 (310) Q Consensus 1 ~a------a~-~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~--~~~--~~~~~~~~~~ip~~~~~~~a 69 (310) |- -| -++.-..++. -....+.-.+-.. -..+++.+.....+.. .++ .....+.+++||+.....-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~--~~~~~nt~~l~~k-~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~ 77 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQHFAN--KSVEPGQTLLKNK-HVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELK 77 (319) T ss_pred CCcccccccceeEeehhhhhc--cCCCcchHHHHHH-HHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccc Confidence 10 00 0000000000 0000111111111 2223444444333222 122 34456788999998653222 Q ss_pred ee-ecccccccccccceeeeEeeeeeeEeeehhhHHHhhcChh--HHHHHHHHHHHHHHHHHHHHHHHcccCcccccccc Q lcl|NC_021307. 70 AW-IGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPG--NYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLD 146 (310) Q Consensus 70 ~~-v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~--~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~ 146 (310) .. ...+-....-+.++.+.+++..|.-.+..=.-+ .+++.. .+...+.+.+...+.-.+|...+.-.-...... T Consensus 78 DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D-~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~-- 154 (319) T protein:vir:94 78 DYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALD-RKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH-- 154 (319) T ss_pred cccCCCCcccCCcccceeEEEeecccccccccchhh-HhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc-- Confidence 22 223322333334455555555554444332222 222322 234455666666777777876554322211100 Q ss_pred cccccccceecccchHHHHHHHHHHHhhhhcCC-CCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeee Q lcl|NC_021307. 147 ETTKSVDLTPATGTTYDAIGVNALSLLVNAGKK-WGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGR 225 (310) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~ 225 (310) . . ...+.....+.+.++...|...... +-.++++|..+..|.+-..- .-...........+.-+.|.|+ T Consensus 155 -~----~-~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f----~~~~~~~~~~~~~g~Vg~idG~ 224 (319) T protein:vir:94 155 -L----T-VGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIA----LPQGDTRQQVLGKGVQGELDGF 224 (319) T ss_pred -c----c-cccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhh----hccccccccceeeeeceeecCe Confidence 0 0 1112233445555677777665543 33467788888877543211 1111121222334456789999 Q ss_pred eEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEE Q lcl|NC_021307. 226 PTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVK 305 (310) Q Consensus 226 pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~ 305 (310) +|+.+++....+.-++++..+.+... .+--.+++.+. . -.+..-.++...++|..|.++++... T Consensus 225 ~Vi~vps~~~k~in~i~~h~~A~~~~-~k~~~~~~~~p--------~-------~~~~a~~v~gr~y~d~~V~~~k~~~I 288 (319) T protein:vir:94 225 VIVKVPTKLLQGLQAIAVVGEVLASP-IQADLAKTNSN--------I-------PGMFGTLAEQLLYTGAFVPEHLQKYI 288 (319) T ss_pred EEEEecccccccceEEEEcCCeeeee-eeeeeeeccCC--------C-------ccccceeeeeeeeeeeEEeccccceE Confidence 99976553322333344433332211 11111121110 0 01223567788899999999986555 Q ss_pred Ee--ecC Q lcl|NC_021307. 306 LT--NAA 310 (310) Q Consensus 306 l~--~aa 310 (310) .+ .++ T Consensus 289 y~~~~~~ 295 (319) T protein:vir:94 289 FTIGGTE 295 (319) T ss_pred EEeecCC Confidence 54 333 No 211 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=93.92 E-value=0.0059 Score=32.79 Aligned_cols=278 Identities=9% Similarity=-0.068 Sum_probs=121.8 Q ss_pred Cc------cc-hhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhh--hcc--eeecCCCceEEEEEcCCcee Q lcl|NC_021307. 1 MA------AG-TAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQR--VAR--KIPMGSTGVKIPHWTGDVSA 69 (310) Q Consensus 1 ~a------a~-~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~--~~~--~~~~~~~~~~ip~~~~~~~a 69 (310) |- -| -++.-..++. -....+.-.+-.. -..+++.+.....+.. .++ .....+.+++||+.....-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~--~~~~~nt~~l~~k-~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~ 77 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQHFAN--KSVEPGQTLLKNK-HVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELK 77 (319) T ss_pred CCcccccccceeEeehhhhhc--cCCCcchHHHHHH-HHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccc Confidence 10 00 0000000000 0000111111111 2223444444333222 122 34456788999998653222 Q ss_pred ee-ecccccccccccceeeeEeeeeeeEeeehhhHHHhhcChh--HHHHHHHHHHHHHHHHHHHHHHHcccCcccccccc Q lcl|NC_021307. 70 AW-IGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRANPG--NYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLD 146 (310) Q Consensus 70 ~~-v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~--~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~ 146 (310) .. ...+-....-+.++.+.+++..|.-.+..=.-+ .+++.. .+...+.+.+...+.-.+|...+.-.-...... T Consensus 78 DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D-~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~-- 154 (319) T protein:vir:97 78 DYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALD-RKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH-- 154 (319) T ss_pred cccCCCCcccCCcccceeEEEeecccccccccchhh-HhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc-- Confidence 22 223322333334455555555554444332222 222322 234455666666777777876554322211100 Q ss_pred cccccccceecccchHHHHHHHHHHHhhhhcCC-CCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeee Q lcl|NC_021307. 147 ETTKSVDLTPATGTTYDAIGVNALSLLVNAGKK-WGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGR 225 (310) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~ 225 (310) . . ...+.....+.+.++...|...... +-.++++|..+..|.+-..- .-...........+.-+.|.|+ T Consensus 155 -~----~-~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f----~~~~~~~~~~~~~g~Vg~idG~ 224 (319) T protein:vir:97 155 -L----T-VGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIA----LPQGDTRQQVLGKGVQGELDGF 224 (319) T ss_pred -c----c-cccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhh----hccccccccceeeeeceeecCe Confidence 0 0 1112233445555677777665543 33467788888877543211 1111121222334456789999 Q ss_pred eEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEE Q lcl|NC_021307. 226 PTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVK 305 (310) Q Consensus 226 pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~ 305 (310) +|+.+++....+.-++++..+.+... .+--.+++.+. . -.+..-.++...++|..|.++++... T Consensus 225 ~Vi~vps~~~k~in~i~~h~~A~~~~-~k~~~~~~~~p--------~-------~~~~a~~v~gr~y~d~~V~~~k~~~I 288 (319) T protein:vir:97 225 VIVKVPTKLLQGLQAIAVVGEVLASP-IQADLAKTNSN--------I-------PGMFGTLAEQLLYTGAFVPEHLQKYI 288 (319) T ss_pred EEEEecccccccceEEEEcCCeeeee-eeeeeeeccCC--------C-------ccccceeeeeeeeeeeEEeccccceE Confidence 99976553322333344433332211 11111121110 0 01223567788899999999986555 Q ss_pred Ee--ecC Q lcl|NC_021307. 306 LT--NAA 310 (310) Q Consensus 306 l~--~aa 310 (310) .+ .++ T Consensus 289 y~~~~~~ 295 (319) T protein:vir:97 289 FTIGGTE 295 (319) T ss_pred EEeecCC Confidence 54 333 No 212 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=93.75 E-value=0.0065 Score=32.58 Aligned_cols=288 Identities=12% Similarity=-0.028 Sum_probs=117.4 Q ss_pred Cccchh---hhHHHHHhhccccCCCCceechhhHHH----HHHHHHh-----hc-hhhhhcc----------eeecCCCc Q lcl|NC_021307. 1 MAAGTA---FPVNHTQIAQTGDSMFQGYLEPEQAQD----YFAEAEK-----TS-IVQRVAR----------KIPMGSTG 57 (310) Q Consensus 1 ~aa~~~---~~~~~~~~~~~~~~~~g~~i~~~~~~~----ii~~~~~-----~s-~l~~~~~----------~~~~~~~~ 57 (310) .+++.. ...........+.+........+.... .-+.+.. .+ ....... ......+ T Consensus 175 s~agta~~~li~A~~~q~itg~tga~fa~s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g- 253 (523) T protein:vir:59 175 SLPGVADVNTVRFWQYDDASGDPENTVAYPLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPSTQDLDLVYYIDAR- 253 (523) T ss_pred ccccccccccccccccccccccccccccchhhccccccccccccccccccccccccccccCCCcccccccccccccccc- Confidence 111100 000000000000000000000000000 0000000 00 0000000 0000000 Q ss_pred eEEEEEcCCce-eeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhc-----ChhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021307. 58 VKIPHWTGDVS-AAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRA-----NPGNYLGTMRTKVATAIALAFDE 131 (310) Q Consensus 58 ~~ip~~~~~~~-a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~-----s~~~~~~~v~~~l~~a~~~~~d~ 131 (310) ........... .....++..+++-.-.+++++++.+..+=....|-||.+| +..|.|+.|.+.|+..|...+|+ T Consensus 254 ~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR 333 (523) T protein:vir:59 254 NDFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDL 333 (523) T ss_pred cchhhccccccccccccccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhH Confidence 00000000011 1112456677888888899999999988888999999886 35679999999999999999999 Q ss_pred HHHcccCcccccc-cccc-cccc-cceecc------cchH---HHHHHHHH-------HHhhh--hcCCCCEEEEehHHH Q lcl|NC_021307. 132 AALHGTDSPFDKN-LDET-TKSV-DLTPAT------GTTY---DAIGVNAL-------SLLVN--AGKKWGATLLDDVAE 190 (310) Q Consensus 132 ~~l~G~g~~~~~~-~~~~-~~~~-~~~~~~------~~~~---~~~~~~~~-------~~l~~--~~~~~~~~~~~~~~~ 190 (310) .|+.---+....+ ..+. ..++ ...... +..+ -+....+. ..+.. .....+.++|+++.. T Consensus 334 ~ii~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~ 413 (523) T protein:vir:59 334 EILSTIMAHARRTDNYGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVA 413 (523) T ss_pred HHHHhHhhhheeeeeccccccceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHH Confidence 9986533221110 0000 0000 000000 0000 01111121 12222 223577899999998 Q ss_pred HHHHHhhhccCccccccccccccccccCCceee-eeeEEEeCCCCCCceeEeeecc---e----eeeEEeecccEEEEee Q lcl|NC_021307. 191 PILNGAKDANGRPLFVESTYEAVTTPYREGRIL-GRPTILSDHVASGTTVGYLGDF---S----QIVWGQVGGLSFDVSD 262 (310) Q Consensus 191 ~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~-G~pv~~t~~~~~~~~~~~~gd~---~----~~~~~~~~~~~v~~~~ 262 (310) ..|.. .|..-...............+.|. +++|+++++.+.+ .+++|-. + .+++...-.+... T Consensus 414 ~~l~~----~~~~~~~~~~~~~~~~~~~~g~l~~~~~vy~d~~~~~d--y~~~g~k~~~~~~~~~~~y~Py~~l~~~--- 484 (523) T protein:vir:59 414 ALLES----MPGFTPGNDNRDGGTGIFYVGMVQGRYRLYKNIYQNQP--VIIMGNQDLNTPWQTGAVYAPYVPLLFT--- 484 (523) T ss_pred HHHHh----ccccccCCccccccccceeEEEecCceEEEecCCCCcc--eEEEEecccCCcccccceecccchhhcc--- Confidence 88853 232211111111111111123333 4699999887754 2233211 1 1122221111000 Q ss_pred cceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 263 QATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) . -..+|.+ || =.++| ..|++..|.+|-+..+|-.+- T Consensus 485 ---~-----~~~dp~s-~q-p~~~~--~tRY~l~v~nP~~~~~~~~~~ 520 (523) T protein:vir:59 485 ---P-----TIVDPVN-FS-YRRGL--MTRYALEVVRPEFYGLLYVKL 520 (523) T ss_pred ---c-----ccccCCc-cc-ceeee--eeehhheecchhHhhhhhhhh Confidence 0 0011212 11 12444 469999999998866555444 No 213 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=93.48 E-value=0.0042 Score=33.63 Aligned_cols=283 Identities=15% Similarity=0.129 Sum_probs=124.1 Q ss_pred Ccc-chh--hhHHHHHhhcccc--CCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccc Q lcl|NC_021307. 1 MAA-GTA--FPVNHTQIAQTGD--SMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEG 75 (310) Q Consensus 1 ~aa-~~~--~~~~~~~~~~~~~--~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (310) |-. |.. -......-+-.|. +...-.+|.-++..|-..+..+.+++++.-+...+.--++... .+..+|.-...| T Consensus 94 ~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~-~s~~eAq~HkdG 172 (393) T protein:vir:16 94 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF-DSANEAQVHKDG 172 (393) T ss_pred hccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhhh-hhhhhhhhhccC Confidence 111 111 0111111111111 2222356777777777788888888886665554433333222 233467777899 Q ss_pred ccccccccceeeeEeeeeeeEeeehhhHHHhh---cChhHHHHHHHHHHHHHHH-HHHHHHHHcccCccccccccccc-- Q lcl|NC_021307. 76 DMKPITKGDMSVQQVEPHKIATIFVASAETVR---ANPGNYLGTMRTKVATAIA-LAFDEAALHGTDSPFDKNLDETT-- 149 (310) Q Consensus 76 ~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~---~s~~~~~~~v~~~l~~a~~-~~~d~~~l~G~g~~~~~~~~~~~-- 149 (310) +.+.+...+|.--++.+--++....+ -++.. ++.-.+..+++.+|+.++. +.+|.+++-|+|+.+-......+ T Consensus 173 qTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~adv 251 (393) T protein:vir:16 173 QTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADV 251 (393) T ss_pred CccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHH Confidence 99999999988888888766666555 33443 3344568999999999998 89999999999987644332211 Q ss_pred ----cccccee-cccchHHHHHHHHHHHhhhhcCCCCEEEEehHHH-HHHHHhhhccCccccccccccccccccCCceee Q lcl|NC_021307. 150 ----KSVDLTP-ATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAE-PILNGAKDANGRPLFVESTYEAVTTPYREGRIL 223 (310) Q Consensus 150 ----~~~~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~-~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~ 223 (310) ...+.+. ++.++..+.+.....-+-+.. +.-.++...... +.|..++-+..+. +.++. T Consensus 252 K~I~k~Ttkaksagktpfadaieeavdfvrpta-grrylivktedrkalldelrqatana---------------nvrik 315 (393) T protein:vir:16 252 KKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA-GRRYLIVKTEDRKALLDELRQATANA---------------NVRIK 315 (393) T ss_pred HHHHHHhhhhhhcCCCchhHHHHHHHhhhccCC-CceEEEEeccchHHHHHHHHhhhccC---------------ceeee Confidence 1111122 233344443333332222221 122233333333 3333332211100 00111 Q ss_pred eeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCce Q lcl|NC_021307. 224 GRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAF 303 (310) Q Consensus 224 G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~ 303 (310) .-...+...+..++..++-|... --.++.++...-+.+.+...... .-|.+|+--+.++.--.+.+..-.|= T Consensus 316 nddteiasevgvdeiivytgska-------lkptvlvdqkyhidmqdltkvda-fewktnsnmilvetltsghvetynag 387 (393) T protein:vir:16 316 NDDTEIASEVGVDEIIVYTGSKA-------LKPTVLVDQKYHIDMQDLTKVDA-FEWKTNSNMILVETLTSGHVETYNAG 387 (393) T ss_pred ccchhhhhhcCcceeeeeecccc-------ccceeeeccccccchhhhhhhhh-heeccCCceEEEeecccCcceeeccc Confidence 10000000111111111111100 00111111111111111111100 11334444444444444444433333 Q ss_pred EEEeec Q lcl|NC_021307. 304 VKLTNA 309 (310) Q Consensus 304 ~~l~~a 309 (310) ++++.. T Consensus 388 avitvs 393 (393) T protein:vir:16 388 AVITVS 393 (393) T ss_pred eeEeeC Confidence 444444 No 214 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=93.44 E-value=0.0062 Score=32.69 Aligned_cols=294 Identities=13% Similarity=0.018 Sum_probs=124.2 Q ss_pred CccchhhhHHHHHhhc--cccCCCCceechhhHH----HHHHHHHhhchhhhhcceeecCC---CceEEEEEcCCceeee Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQ--TGDSMFQGYLEPEQAQ----DYFAEAEKTSIVQRVARKIPMGS---TGVKIPHWTGDVSAAW 71 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~--~~~~~~g~~i~~~~~~----~ii~~~~~~s~l~~~~~~~~~~~---~~~~ip~~~~~~~a~~ 71 (310) |+++........-.+. ...+..+.-+|-++.+ .+++.+.+.....++..+...+. ..+.+++.+..+.+.+ T Consensus 51 ~~~~~~~~~~~amDa~~~~~~t~~~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ 130 (382) T protein:vir:96 51 LAKAGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVE 130 (382) T ss_pred hhhhhhhhhhcccccccCCccccCCccHHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecccceEE Confidence 1111110000000101 1111111123433332 35555555555555555544332 3467777777788889 Q ss_pred ecccccccccccceeeeEeeeeeeEeeehh-hHHHhhc--ChhHHHHHHHHHHHHHHHHHHHHHHHcccCccccc---cc Q lcl|NC_021307. 72 IGEGDMKPITKGDMSVQQVEPHKIATIFVA-SAETVRA--NPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDK---NL 145 (310) Q Consensus 72 v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~i-s~ell~~--s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~---~~ 145 (310) .+.++..|..+...+..+-+.+.+.....+ ..|+.+. ...++.+.-.....+++.+.+|+-.|.|+..+.+. +. T Consensus 131 ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGl 210 (382) T protein:vir:96 131 YGDHTNIPLTSWNANFERRTIVRGELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGF 210 (382) T ss_pred eecccCCCccccccceeEEEEEEEEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEE Confidence 898888887776655555555555555666 4555553 24567777788888888899999999997443222 11 Q ss_pred ccccc---cccceec--ccchHHHHHHH---HHHHhhhhcC-------CCCEEEEehHHHHHHHHhhhccCccccccccc Q lcl|NC_021307. 146 DETTK---SVDLTPA--TGTTYDAIGVN---ALSLLVNAGK-------KWGATLLDDVAEPILNGAKDANGRPLFVESTY 210 (310) Q Consensus 146 ~~~~~---~~~~~~~--~~~~~~~~~~~---~~~~l~~~~~-------~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~ 210 (310) +...+ ....... ...+.+.++.| ++..+..... .+..+++.++.+..|.. .+..|.-++.- . T Consensus 211 lNdP~l~a~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~-~n~~g~Tvl~~--l 287 (382) T protein:vir:96 211 LNDPNLPPFQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSV-TTPYGISVSDW--I 287 (382) T ss_pred EeCCCcccccccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhccc-cCccCccHHHH--H Confidence 11110 0000000 11233333333 3344432221 12257888888877743 23333211110 0 Q ss_pred cccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEE--Eeecceeeecc----cccccchhhhhc-C Q lcl|NC_021307. 211 EAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFD--VSDQATLNLGT----PQAPNFVSLWQH-N 283 (310) Q Consensus 211 ~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~--~~~~~~~~~~~----~~~~~~~~~~~~-~ 283 (310) .. .+.+-++...|=........+.. +...+-....+... .+.+...-... .....+ ...+ - T Consensus 288 k~---n~Pnl~i~t~peL~~a~~~g~g~-------~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~--ve~~~~ 355 (382) T protein:vir:96 288 EQ---TYPKMRIVSAPELSGVQMQGKTP-------EDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLG--VEKRAK 355 (382) T ss_pred HH---hcCCcEEEEccccccccCCCccc-------eeEEEEecchhhhhcccccccCcceeccccceeeecc--ceeecc Confidence 00 01112333333221111111000 00111001111000 00000000000 000000 0000 0 Q ss_pred cEEEEEE-EEeccEEeccCceEEEeec Q lcl|NC_021307. 284 LVAVRVE-AEYGLLINDVEAFVKLTNA 309 (310) Q Consensus 284 ~~~~r~~-~~~d~~v~~~~a~~~l~~a 309 (310) ....... ...|..+++|.||+++++- T Consensus 356 ~~~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 356 SYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred eeEeccccceeeeEEEcchhhhhccCC Confidence 0111111 2356788899999999999 No 215 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=93.28 E-value=0.0046 Score=33.38 Aligned_cols=283 Identities=15% Similarity=0.127 Sum_probs=124.9 Q ss_pred Ccc-chh--hhHHHHHhhcccc--CCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccc Q lcl|NC_021307. 1 MAA-GTA--FPVNHTQIAQTGD--SMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEG 75 (310) Q Consensus 1 ~aa-~~~--~~~~~~~~~~~~~--~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (310) |-. |.. -......-+-.|. +...-.+|.-++..|-..+..+.+++++.-+...+.--++... .+..+|.-...| T Consensus 101 ~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~-~s~~~Aq~HkdG 179 (400) T protein:vir:93 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF-DSANEAQVHKDG 179 (400) T ss_pred hccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhhh-hhhhhhhhhccC Confidence 111 111 0111111111111 2222356777777787888888888887665554433333222 233467777899 Q ss_pred ccccccccceeeeEeeeeeeEeeehhhHHHhh---cChhHHHHHHHHHHHHHHH-HHHHHHHHcccCccccccccccc-- Q lcl|NC_021307. 76 DMKPITKGDMSVQQVEPHKIATIFVASAETVR---ANPGNYLGTMRTKVATAIA-LAFDEAALHGTDSPFDKNLDETT-- 149 (310) Q Consensus 76 ~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~---~s~~~~~~~v~~~l~~a~~-~~~d~~~l~G~g~~~~~~~~~~~-- 149 (310) +.+.+...+|.--++.+--++....+ -++.. ++.-.+..+++.+|+.++. +.+|.+++-|+|+.+-......+ T Consensus 180 qTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~adv 258 (400) T protein:vir:93 180 QTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADV 258 (400) T ss_pred CccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHH Confidence 99999999998888888766666555 33333 3444568999999999998 89999999999987644332211 Q ss_pred ----cccccee-cccchHHHHHHHHHHHhhhhcCCCCEEEEehHHH-HHHHHhhhccCccccccccccccccccCCceee Q lcl|NC_021307. 150 ----KSVDLTP-ATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAE-PILNGAKDANGRPLFVESTYEAVTTPYREGRIL 223 (310) Q Consensus 150 ----~~~~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~-~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~ 223 (310) ...+.+. ++.++..+.+.....-+-+.. +.-.++...... +.|..++-+..+.-. ++. T Consensus 259 K~I~~~Ttkaksagktpfadaieeavdfvrpta-grrylivktedrkalldelrqatanahv---------------rik 322 (400) T protein:vir:93 259 KKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA-GRRYLIVKTEDRKALLDELRQATANAHV---------------RIK 322 (400) T ss_pred HHHHHHhhhhhhcCCCchhHHHHHHHhhhccCC-CceEEEEeccchHHHHHHHHhhccccce---------------Eee Confidence 1111122 233444443333332222221 122233333333 333333322211110 010 Q ss_pred eeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCce Q lcl|NC_021307. 224 GRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAF 303 (310) Q Consensus 224 G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~ 303 (310) .-...+...+..++..++-|... --.++.++...-+.+.+...... .-|.+|.--+.++.--.+.+..-.|= T Consensus 323 nddaeiasevgvdeiivytgska-------lkptvlvdqkyhidmqdltkvda-fewktnsnmilvetltsghvetynag 394 (400) T protein:vir:93 323 NDDAEIASEVGVDEIIVYTGSKA-------LKPTVLVDQKYHIDMQDLTKVDA-FEWKTNSNMILVETLTSGHVETYNAG 394 (400) T ss_pred cchhhhhhhcCcceeeeeecccc-------ccceeeeccccccchhhhhhhhh-heeccCCceEEEeecccCcceeeccc Confidence 00000000111111111111100 00111111111111111111100 11334444444444444444433333 Q ss_pred EEEeec Q lcl|NC_021307. 304 VKLTNA 309 (310) Q Consensus 304 ~~l~~a 309 (310) ++++.. T Consensus 395 avitvs 400 (400) T protein:vir:93 395 AVITVS 400 (400) T ss_pred eeEeeC Confidence 444444 No 216 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=92.98 E-value=0.0072 Score=32.34 Aligned_cols=283 Identities=16% Similarity=0.131 Sum_probs=123.8 Q ss_pred Ccc-chh--hhHHHHHhhcccc--CCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEcCCceeeeeccc Q lcl|NC_021307. 1 MAA-GTA--FPVNHTQIAQTGD--SMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWTGDVSAAWIGEG 75 (310) Q Consensus 1 ~aa-~~~--~~~~~~~~~~~~~--~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (310) |-. |.. -......-+-.|. +...-.+|.-+...|-..+.+..++++..-+...+.--++.... +..++.-...| T Consensus 19 ~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~~-s~AeAq~HkdG 97 (318) T protein:vir:86 19 KKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFD-SSAEAQVHKDG 97 (318) T ss_pred hccCCchhhhhhhhhhhhhcCceeeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhhhhhhh-hhhhhhhhccC Confidence 111 111 0111111111111 22223567777777888888888888876655544433333322 33667777899 Q ss_pred ccccccccceeeeEeeeeeeEeeehhhHHHhh---cChhHHHHHHHHHHHHHHH-HHHHHHHHcccCccccccccccc-- Q lcl|NC_021307. 76 DMKPITKGDMSVQQVEPHKIATIFVASAETVR---ANPGNYLGTMRTKVATAIA-LAFDEAALHGTDSPFDKNLDETT-- 149 (310) Q Consensus 76 ~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~---~s~~~~~~~v~~~l~~a~~-~~~d~~~l~G~g~~~~~~~~~~~-- 149 (310) +.+.+...+|.--++++--++....+ -|+.. ++.-.+..+++.+|+.++. +.+|.+++-|+|+.+-......+ T Consensus 98 qTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~adv 176 (318) T protein:vir:86 98 QTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGSNGFKSIDKEADV 176 (318) T ss_pred CccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhheeecCCCCccchhhHHHH Confidence 99999999998888888766666555 34443 3444568999999999998 89999999999987643332211 Q ss_pred ----ccccceecccchH-HHHHHHHHHHhhhhcCCCCEEEEehHHH-HHHHHhhhccCccccccccccccccccCCceee Q lcl|NC_021307. 150 ----KSVDLTPATGTTY-DAIGVNALSLLVNAGKKWGATLLDDVAE-PILNGAKDANGRPLFVESTYEAVTTPYREGRIL 223 (310) Q Consensus 150 ----~~~~~~~~~~~~~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~-~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~ 223 (310) ...+...+.+++. ...+.....-+.+. .+.-.++...... +.|..++-+..+.-. +|. T Consensus 177 K~I~k~Ttkaksagttpfanaieeavdfvrpt-agrrylivkaedrkalldelrqatanahv---------------rik 240 (318) T protein:vir:86 177 KKIKKITTKAKSAGTTPFANAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDELRQATANAHV---------------RIK 240 (318) T ss_pred HHHHHHhhhhhccCCCchhhHHHHHHhhhccC-CCceEEEEeecchHHHHHHHHhhccccee---------------EEe Confidence 1122222333332 22222222112211 1222234433333 333333322211110 000 Q ss_pred eeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCce Q lcl|NC_021307. 224 GRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAF 303 (310) Q Consensus 224 G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~ 303 (310) .-...+...+..++..++-|... --.++.++...-+.+.+...... .-|.+|+--+.++.--.+.+..-+|= T Consensus 241 nddteiasevgvdeiivytgska-------lkptvlvdqkyhidmqdltkvda-fewktnsnmilvetltsghvetynag 312 (318) T protein:vir:86 241 NDDTEIASEVGVDEIIVYTGSKA-------LKPTVLVDQKYHIDMQDLTKVDA-FEWKTNSNMILVETLTSGHVETYNAG 312 (318) T ss_pred ccchhhhhhcCcceeeeeecccc-------ccceeeeccceecchhhhhhhhc-ceeccCCceEEEeecccCcceeecCc Confidence 00000000111111111111100 00011111111111111111000 11334443344444444444433333 Q ss_pred EEEeec Q lcl|NC_021307. 304 VKLTNA 309 (310) Q Consensus 304 ~~l~~a 309 (310) ++++.. T Consensus 313 avitvs 318 (318) T protein:vir:86 313 AVITVS 318 (318) T ss_pred eeEEeC Confidence 444444 No 217 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=86.71 E-value=0.044 Score=28.04 Aligned_cols=276 Identities=9% Similarity=-0.020 Sum_probs=129.0 Q ss_pred hccccCCCCceechhhHHHHHHHHHhhchhhhhcc--eeecCCCceEEEEEcCCcee-eeecccccccccccceeeeEee Q lcl|NC_021307. 15 AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVAR--KIPMGSTGVKIPHWTGDVSA-AWIGEGDMKPITKGDMSVQQVE 91 (310) Q Consensus 15 ~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~--~~~~~~~~~~ip~~~~~~~a-~~v~Eg~~~~~~~~~~~~i~l~ 91 (310) |. -..-..+...+.+.+.+.+....+.. ..-.++.+++||+.....-. +-...|-..++-+.++.+.+++ T Consensus 1 Ma-------in~a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY~R~~g~~~g~v~~~~et~tl~ 73 (290) T protein:vir:78 1 MA-------INYVDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAHTRNKGYNEGSASNTNKSYTID 73 (290) T ss_pred Cc-------hhHHHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCcccccccCCCcccCccccceeeEEee Confidence 10 11123456666666666665444433 23346778999998643322 3333444444455667777777 Q ss_pred eeeeEeeehhhHHHhh-cChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHHH Q lcl|NC_021307. 92 PHKIATIFVASAETVR-ANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNAL 170 (310) Q Consensus 92 ~~k~~~~~~is~ell~-~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 170 (310) ..+.-.+..=....-+ +....+...+.+...+.++-.+|+-.+.---+... ..+.......+.....+.+.++. T Consensus 74 qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~-----~~~~~~~~t~t~~n~~~~i~~~~ 148 (290) T protein:vir:78 74 FDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAK-----TNSNSVAEEITKDNVFTKLKAAI 148 (290) T ss_pred ccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhh-----ccCcccccccCHHHHHHHHHHHH Confidence 7665554432222222 12345667777888888888888765532111110 00011111122233444455666 Q ss_pred HHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCC---------CCCceeEe Q lcl|NC_021307. 171 SLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHV---------ASGTTVGY 241 (310) Q Consensus 171 ~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~---------~~~~~~~~ 241 (310) ..+.+....+-.++|+|..+..|.....-. +-+.......+ ...+.-+.|.|.+|+..+.- -+|-.... T Consensus 149 ~~ldevp~~~rvl~vtp~~~~lL~~~~~f~-r~~~~~~~~~~-~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~~~~ 226 (290) T protein:vir:78 149 RKVKKYGTQNLVMYVSPDVMAALELSDDFV-RAINVQNIGPS-SIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYKPAA 226 (290) T ss_pred HHHHhcCCCCeEEEECHHHHHHHhhChhhh-ccccccccccc-cccceeeeecCcEEEEecccchhhhhhhhcccccccC Confidence 666666555666778888888775322111 11111111111 11234567999999865421 11111110 Q ss_pred eecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 242 LGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 242 ~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) -+.--++++.. .+..+.+..........+.... +-|...+.-..+.|.=|.+.+.=.....++ T Consensus 227 ~ak~in~ii~~-~~a~i~~~K~~~~~~~~P~~~~-----~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~ 289 (290) T protein:vir:78 227 GAKKLNFLLVN-KGSVVGGAKHASIYLHAPGSVG-----QGDGWLYQYRVYHDIFVLDQQKDGVIASTE 289 (290) T ss_pred CccceeEEEEc-CCceeeeeeeeEEEeeCCCCCc-----CcceeeeeeeeeeeeeeeccccCeeEEEee Confidence 01111222322 3344444444444444433321 123244444455566555544433333334 No 218 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=84.23 E-value=0.062 Score=27.20 Aligned_cols=278 Identities=11% Similarity=0.003 Sum_probs=133.9 Q ss_pred hhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhh----cceeecCC-CceEEEEEcC-Cceeeee-ccccccc Q lcl|NC_021307. 7 FPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRV----ARKIPMGS-TGVKIPHWTG-DVSAAWI-GEGDMKP 79 (310) Q Consensus 7 ~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~----~~~~~~~~-~~~~ip~~~~-~~~a~~v-~Eg~~~~ 79 (310) +|........+.+ -.+.+..+.+.+...++|++. +.+.+..+ .++..|..-+ .+++.|. ++-.-.. T Consensus 1 mp~~~lsel~t~t-------l~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l~y~~~s~~~wy~Gyd~l~~ 73 (321) T protein:vir:34 1 MPFPNISDIITTT-------IESRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEELSFSGNSNGGWYSGYDVLPT 73 (321) T ss_pred CCCchHHHHHHHH-------HHhhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEEeeccCcceeEEEeeeeecc Confidence 2221111111111 112233345555666664443 33344433 3455555544 7788997 4555455 Q ss_pred ccccceeeeEeeeeeeEeeehhhHHHhhcCh-----hHHHHHHHHHHHHHHHHHHHHHHHcccCcc--cc-cccc-cccc Q lcl|NC_021307. 80 ITKGDMSVQQVEPHKIATIFVASAETVRANP-----GNYLGTMRTKVATAIALAFDEAALHGTDSP--FD-KNLD-ETTK 150 (310) Q Consensus 80 ~~~~~~~~i~l~~~k~~~~~~is~ell~~s~-----~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~--~~-~~~~-~~~~ 150 (310) ...-.|++.++..+.+.+.+.||-.-+..+. +++.+.=.+...+.+..+++..+.. +|++ .. ..++ .... T Consensus 74 ~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~s-dGTa~g~~~i~GL~~lv~ 152 (321) T protein:vir:34 74 APQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYG-DGTAFGGRAINGLDGAVP 152 (321) T ss_pred chhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhc-cccccccchhhhhhhhcc Confidence 5667799999999999988888854443322 3444444455556667778777764 4443 21 1111 1111 Q ss_pred ------ccccee-----------------cccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCcccccc Q lcl|NC_021307. 151 ------SVDLTP-----------------ATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVE 207 (310) Q Consensus 151 ------~~~~~~-----------------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~ 207 (310) ...... .+..+.......+.-..-.....+..|++....|...+.-.-.--|+-... T Consensus 153 ~~p~tGtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~PDlii~~~~~y~~y~~s~q~~qR~~~~~ 232 (321) T protein:vir:34 153 VDPTVGTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMPDLIMSGNDAWTTYSNSLQVLQRFTSAE 232 (321) T ss_pred cCCCCceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCccEEEechHHHHHHHHhhheeeeecccc Confidence 000000 000111111112222233445578889999988888765332222222221 Q ss_pred ccccccccccCCceeeeeeEEEeC----CCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcC Q lcl|NC_021307. 208 STYEAVTTPYREGRILGRPTILSD----HVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHN 283 (310) Q Consensus 208 ~~~~~~~~~~~~~~l~G~pv~~t~----~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (310) . .......-...|..|+.++ .+|++. .+|=|-+++.+.+..+-.+....... ..|+ -+| T Consensus 233 ~----a~~Gf~~Lky~~~div~D~~~g~~~pan~--~yfiNT~yl~~r~h~~~~~~pi~p~r--------~~~~---Nqd 295 (321) T protein:vir:34 233 E----ANLGFRSLKFLSTDVVLDGGIGGFAGANT--MYFLNTKYLHFRPHKDRNMVPLSPSR--------RAAF---NQD 295 (321) T ss_pred c----ccccceeeeeeeEEEEEeCCCCCCccccc--eeeeecceEEEEEcCCCceeecCccc--------cccc---chh Confidence 1 2222344567888899887 467775 34557777766655544443332211 0011 122 Q ss_pred cEEEEEEEEeccEEeccCceEEEeec Q lcl|NC_021307. 284 LVAVRVEAEYGLLINDVEAFVKLTNA 309 (310) Q Consensus 284 ~~~~r~~~~~d~~v~~~~a~~~l~~a 309 (310) .+.-....+....+-++.+=.+|+.. T Consensus 296 A~~q~I~~~GnL~~sn~~~~~vL~~~ 321 (321) T protein:vir:34 296 AEAQILAWAGNLTCSGAQFQGRLIAE 321 (321) T ss_pred HHhhhhhhhheeeeecccceeEEeeC Confidence 22222222333344455554555444 No 219 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=82.51 E-value=0.076 Score=26.71 Aligned_cols=291 Identities=14% Similarity=0.081 Sum_probs=125.4 Q ss_pred CccchhhhHHHH------------HhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEEc---- Q lcl|NC_021307. 1 MAAGTAFPVNHT------------QIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHWT---- 64 (310) Q Consensus 1 ~aa~~~~~~~~~------------~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~---- 64 (310) +...+.+-.|.. .+..++++... ..-|.++. ++++..+..+..+++.+.||++++.-|.-.. T Consensus 45 ~~~~~~~l~e~~~~~~~~~~~~~~i~~st~t~~v~-~~~P~Li~-lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~ 122 (470) T protein:vir:10 45 LREERNFLSEAPNVNTNSGATAGFSADATAAGPVA-GFDPVLIS-LIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYK 122 (470) T ss_pred Hhhccchhhhhhhcccccccccccccccccccccc-ccCchhhh-hHHHHHhhhhhhhhheeecCCccceeeeEEEEEec Confidence 222222222211 11111111111 12233322 6666777888999999999988765443111 Q ss_pred -C-Ccee-------eeec---------------------------------------------------------c---- Q lcl|NC_021307. 65 -G-DVSA-------AWIG---------------------------------------------------------E---- 74 (310) Q Consensus 65 -~-~~~a-------~~v~---------------------------------------------------------E---- 74 (310) . +.++ .|.+ | T Consensus 123 n~sG~EaffnEA~T~fSG~~~~~~~~~~~~~~~a~~~g~~~~~~~gt~~~~~~~~~~~a~~~~y~~~~GMsTa~aE~lg~ 202 (470) T protein:vir:10 123 TQSGTEALFNEADTAFSGQPDGLDDTSGFTATGANNVGLGTTAQQGSNPGLLNSTAAQTNATDYNVGQGMRTDSAEDLGD 202 (470) T ss_pred CCCccceeeecCCcccCcccccccccccccccccccccccccccccccccccccccccccccccccccccchHHhhhcCC Confidence 0 0000 1110 0 Q ss_pred --cccccccccceeeeEeeeeeeEeeehhhHHHhhcC----hhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccc-ccc Q lcl|NC_021307. 75 --GDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN----PGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKN-LDE 147 (310) Q Consensus 75 --g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s----~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~-~~~ 147 (310) +..+++-.-.+++++++.+...=....|-||.+|- ..|.++.|.+.|+..|...+|+.|+.---+....+ ..+ T Consensus 203 s~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~k~~~ 282 (470) T protein:vir:10 203 GTGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIYNVAEPGAQAN 282 (470) T ss_pred CCCcccceeeeEEEEEEEEeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhhhceecc Confidence 12234444455666666666666677888988764 46889999999999999999999886533221110 000 Q ss_pred ccccccce--ec--ccchHHHHHHHHHHHh---------hhhcCCCCEEEEehHHHHHHHHhhhccCcccccccccc--- Q lcl|NC_021307. 148 TTKSVDLT--PA--TGTTYDAIGVNALSLL---------VNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYE--- 211 (310) Q Consensus 148 ~~~~~~~~--~~--~~~~~~~~~~~~~~~l---------~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~--- 211 (310) ........ .. +....+.. ..+...+ .......+.++|+++....|.. .|..-+.+.... T Consensus 283 ~~~~Gv~Dl~~~~~gr~~~e~~-~~l~~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~----sG~l~~~~~~~~~~~ 357 (470) T protein:vir:10 283 VAAAGTFDLDTDSNGRWSVEKF-KGLIFQIERDANAIAQRTRRGKGNMILCSADVASALTM----AGVLDYTPALNANLN 357 (470) T ss_pred ccccceEEeecccchhHHHHHH-HHHHHHHHHHHHHHHHhhccccceEEEEchhHHhHhhh----ccccccccccccccc Confidence 11000000 00 00111111 1111111 2234556778999999888842 232222221111 Q ss_pred -ccccccCCcee-eeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeeccc------ccccchhhhhcC Q lcl|NC_021307. 212 -AVTTPYREGRI-LGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTP------QAPNFVSLWQHN 283 (310) Q Consensus 212 -~~~~~~~~~~l-~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~------~~~~~~~~~~~~ 283 (310) +.......+.| .|++|+++..+..+.. ..+.++.+|..+..++. +++.+..+ ...+|.++ | = T Consensus 358 ~D~t~~~~~G~l~~~~~vy~d~y~~~~~~----a~~dy~~vG~KG~~~~~----~glfy~PYv~l~~~~~~dp~sf-q-P 427 (470) T protein:vir:10 358 VDDTGNTFAGILQGKYRVYIDPFSASGGA----AATQYYVVGYKGSSPYD----AGLFYCPYVPLQMVRAVGQDTF-Q-P 427 (470) T ss_pred cCCCCceEEEEecCceEEEeeccccccCc----ccccEEEEEEecCccee----cceeeccccccccCCCCCCccc-c-c Confidence 01111112333 3469999886553311 01122333333222211 11111111 11122221 1 1 Q ss_pred cEEEEEEEEeccEEe-----ccCc-----------eEEEeecC Q lcl|NC_021307. 284 LVAVRVEAEYGLLIN-----DVEA-----------FVKLTNAA 310 (310) Q Consensus 284 ~~~~r~~~~~d~~v~-----~~~a-----------~~~l~~aa 310 (310) .++|+ .|++..+- .++. |.++..|= T Consensus 428 ~~g~~--tRY~l~~NP~~~~~~~~~~~i~~~~n~y~r~~~v~~ 468 (470) T protein:vir:10 428 KIGFK--TRYGLVENPFSQGTTQGLGTLTRNSNRYYRRVKVAN 468 (470) T ss_pred eeeee--eeeceeecCcccCCCcccccccCCCCceeeEEEeec Confidence 23333 45555432 1111 22222222 No 220 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=80.82 E-value=0.091 Score=26.28 Aligned_cols=289 Identities=13% Similarity=0.046 Sum_probs=141.9 Q ss_pred Ccc---------chhhhHHHHHhhcccc------CCCCceechhhHHHHHHHHH---hhchhhhhcceeecCCCceEEEE Q lcl|NC_021307. 1 MAA---------GTAFPVNHTQIAQTGD------SMFQGYLEPEQAQDYFAEAE---KTSIVQRVARKIPMGSTGVKIPH 62 (310) Q Consensus 1 ~aa---------~~~~~~~~~~~~~~~~------~~~g~~i~~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~ip~ 62 (310) |.. -+.+..+..++..+|- -.+++.+..+..++-+..+- +.-.+.+-..+.+..+...++-. T Consensus 3 ~~~~~~~~~~~~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y~~ 82 (463) T protein:vir:95 3 IEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQ 82 (463) T ss_pred cccccchHHHHHHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhhee Confidence 111 1344455555555533 22345566555544333332 22235555556666665545444 Q ss_pred EcC---CceeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhh-cChhHHHHHHHHHHHHHHHHHHHHHHHcccC Q lcl|NC_021307. 63 WTG---DVSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVR-ANPGNYLGTMRTKVATAIALAFDEAALHGTD 138 (310) Q Consensus 63 ~~~---~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~-~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g 138 (310) ... ...+.++.|+...+.+++.+.......|-++..-.+|.-+-. ++..+.+..+.+.-...++..+|.+.|.|+. T Consensus 83 ~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~FyGds 162 (463) T protein:vir:95 83 YLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDA 162 (463) T ss_pred eeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 433 244688999999999999999999999999988777765544 4456788888888889999999999999987 Q ss_pred ccccc------ccccccccccc---eec-ccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccc Q lcl|NC_021307. 139 SPFDK------NLDETTKSVDL---TPA-TGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVES 208 (310) Q Consensus 139 ~~~~~------~~~~~~~~~~~---~~~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~ 208 (310) .-.+. ...+..+-... .-+ +.....+++..+...+...+....-++|+..+.+.|..---..-|.+.+++ T Consensus 163 ~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N 242 (463) T protein:vir:95 163 SLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDN 242 (463) T ss_pred ccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEEcCC Confidence 54431 12222222211 111 223334445555556777888889999999999999753333333333332 Q ss_pred cccccccccCCceeeeeeE--EEeC--------CCCCCceeEeeecceeee-EEeecc--c--EEEEeecceeeeccccc Q lcl|NC_021307. 209 TYEAVTTPYREGRILGRPT--ILSD--------HVASGTTVGYLGDFSQIV-WGQVGG--L--SFDVSDQATLNLGTPQA 273 (310) Q Consensus 209 ~~~~~~~~~~~~~l~G~pv--~~t~--------~~~~~~~~~~~gd~~~~~-~~~~~~--~--~v~~~~~~~~~~~~~~~ 273 (310) .... ..|+|| +.+. +.-.+...+ + |.+.-. -..... . +++..+....... T Consensus 243 ~~~~---------~~G~~v~~f~s~~G~I~L~~s~~m~~~~i-l-~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~---- 307 (463) T protein:vir:95 243 SGNV---------NTGYSVNGFYSSRGFIKLHGSTVMENELI-L-DESLQPLPNAPQPAKVTATVETKQKGAFENE---- 307 (463) T ss_pred CCce---------eeeeeccceeeeeeeeeeCCceecCCccc-c-cchhhcCCCCccCceeEEEEeeccCCCCCCc---- Confidence 2211 122222 1110 000000000 0 011000 000000 0 1111111111000 Q ss_pred ccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 274 PNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 274 ~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .......+++...-+..=-.|+.++-.|.++ T Consensus 308 ------~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~ 338 (463) T protein:vir:95 308 ------EDRAGLSYKVVVNSDDAQSAPSEEVTATVSN 338 (463) T ss_pred ------ccccceEEEEEEECCCCCcccchheeeeeee Confidence 0111122222222222111223333333333 No 221 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=80.82 E-value=0.091 Score=26.28 Aligned_cols=289 Identities=13% Similarity=0.046 Sum_probs=141.9 Q ss_pred Ccc---------chhhhHHHHHhhcccc------CCCCceechhhHHHHHHHHH---hhchhhhhcceeecCCCceEEEE Q lcl|NC_021307. 1 MAA---------GTAFPVNHTQIAQTGD------SMFQGYLEPEQAQDYFAEAE---KTSIVQRVARKIPMGSTGVKIPH 62 (310) Q Consensus 1 ~aa---------~~~~~~~~~~~~~~~~------~~~g~~i~~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~ip~ 62 (310) |.. -+.+..+..++..+|- -.+++.+..+..++-+..+- +.-.+.+-..+.+..+...++-. T Consensus 3 ~~~~~~~~~~~~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y~~ 82 (463) T protein:vir:99 3 IEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQ 82 (463) T ss_pred cccccchHHHHHHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhhee Confidence 111 1344455555555533 22345566555544333332 22235555556666665545444 Q ss_pred EcC---CceeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhh-cChhHHHHHHHHHHHHHHHHHHHHHHHcccC Q lcl|NC_021307. 63 WTG---DVSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVR-ANPGNYLGTMRTKVATAIALAFDEAALHGTD 138 (310) Q Consensus 63 ~~~---~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~-~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g 138 (310) ... ...+.++.|+...+.+++.+.......|-++..-.+|.-+-. ++..+.+..+.+.-...++..+|.+.|.|+. T Consensus 83 ~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~FyGds 162 (463) T protein:vir:99 83 YLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDA 162 (463) T ss_pred eeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 433 244688999999999999999999999999988777765544 4456788888888889999999999999987 Q ss_pred ccccc------ccccccccccc---eec-ccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccccc Q lcl|NC_021307. 139 SPFDK------NLDETTKSVDL---TPA-TGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVES 208 (310) Q Consensus 139 ~~~~~------~~~~~~~~~~~---~~~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~ 208 (310) .-.+. ...+..+-... .-+ +.....+++..+...+...+....-++|+..+.+.|..---..-|.+.+++ T Consensus 163 ~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N 242 (463) T protein:vir:99 163 SLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDN 242 (463) T ss_pred ccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEEcCC Confidence 54431 12222222211 111 223334445555556777888889999999999999753333333333332 Q ss_pred cccccccccCCceeeeeeE--EEeC--------CCCCCceeEeeecceeee-EEeecc--c--EEEEeecceeeeccccc Q lcl|NC_021307. 209 TYEAVTTPYREGRILGRPT--ILSD--------HVASGTTVGYLGDFSQIV-WGQVGG--L--SFDVSDQATLNLGTPQA 273 (310) Q Consensus 209 ~~~~~~~~~~~~~l~G~pv--~~t~--------~~~~~~~~~~~gd~~~~~-~~~~~~--~--~v~~~~~~~~~~~~~~~ 273 (310) .... ..|+|| +.+. +.-.+...+ + |.+.-. -..... . +++..+....... T Consensus 243 ~~~~---------~~G~~v~~f~s~~G~I~L~~s~~m~~~~i-l-~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~---- 307 (463) T protein:vir:99 243 SGNV---------NTGYSVNGFYSSRGFIKLHGSTVMENELI-L-DESLQPLPNAPQPAKVTATVETKQKGAFENE---- 307 (463) T ss_pred CCce---------eeeeeccceeeeeeeeeeCCceecCCccc-c-cchhhcCCCCccCceeEEEEeeccCCCCCCc---- Confidence 2211 122222 1110 000000000 0 011000 000000 0 1111111111000 Q ss_pred ccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 274 PNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 274 ~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) .......+++...-+..=-.|+.++-.|.++ T Consensus 308 ------~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~ 338 (463) T protein:vir:99 308 ------EDRAGLSYKVVVNSDDAQSAPSEEVTATVSN 338 (463) T ss_pred ------ccccceEEEEEEECCCCCcccchheeeeeee Confidence 0111122222222222111223333333333 No 222 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=80.38 E-value=0.095 Score=26.18 Aligned_cols=283 Identities=11% Similarity=0.023 Sum_probs=119.6 Q ss_pred Cccc-----hhh----hHHHHHh--------------hccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCc Q lcl|NC_021307. 1 MAAG-----TAF----PVNHTQI--------------AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG 57 (310) Q Consensus 1 ~aa~-----~~~----~~~~~~~--------------~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 57 (310) ++.. ..+ ....... ..+++.+.. ..-|.+ -.++++.-+..+..+++.+.||.+++ T Consensus 44 ~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~ia~s~~t~~v~-~~~P~L-i~lvRra~p~LIa~DIwGVQPMTgPT 121 (529) T protein:vir:10 44 SKTDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAAGQSSGAIT-NIGPAV-IGMVRRAIPSLIAFDIAGVQPMTGPT 121 (529) T ss_pred hhcccccchhhhhhhhhhccchhhcccccccccccccccccccccc-cccchh-hhhHHHHHHhHHhhhhheeccCCchh Confidence 1110 000 0000001 111111111 112222 23555666777788888888887654 Q ss_pred eEEE----EE-cCC------------------------------------------------------------------ Q lcl|NC_021307. 58 VKIP----HW-TGD------------------------------------------------------------------ 66 (310) Q Consensus 58 ~~ip----~~-~~~------------------------------------------------------------------ 66 (310) .-|- +. +.. T Consensus 122 GLIFAMRsrY~~~~~~~~g~eaf~~~~e~dt~~SG~~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~ 201 (529) T protein:vir:10 122 GQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSGLAAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNV 201 (529) T ss_pred hhhhhheeeecCCcCCCcccccccccccccccccccccccccccccccccccccccceeeccccceeeeccccccccccc Confidence 2220 00 000 Q ss_pred ----ce----------------------------eeee--cc---------cccccccccceeeeEeeeeeeEeeehhhH Q lcl|NC_021307. 67 ----VS----------------------------AAWI--GE---------GDMKPITKGDMSVQQVEPHKIATIFVASA 103 (310) Q Consensus 67 ----~~----------------------------a~~v--~E---------g~~~~~~~~~~~~i~l~~~k~~~~~~is~ 103 (310) .. +... +| +..+++-.-.+++++++.+..+=....|- T Consensus 202 tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~gmsTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTi 281 (529) T protein:vir:10 202 TGGNVTVGTNETGAALDALVSAKIAAGELAEIAEGMATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSI 281 (529) T ss_pred ccccccccccccCCccccccccccccccccccccccchhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccH Confidence 00 0000 01 12244555556777777777777777899 Q ss_pred HHhhcC----hhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccc-------cceecc-----cchHH---- Q lcl|NC_021307. 104 ETVRAN----PGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSV-------DLTPAT-----GTTYD---- 163 (310) Q Consensus 104 ell~~s----~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~-------~~~~~~-----~~~~~---- 163 (310) ||.+|- ..|.|+.|.+.|+..|...+|+.|+.=-......+..+.+... ...... ....+ T Consensus 282 ELAQDLKAvHGLDAEtELsNILStEImlEINReii~~i~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~ 361 (529) T protein:vir:10 282 ELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKA 361 (529) T ss_pred HHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHhhhhceeeeeeeeccccccccceeccccccccccchhHHHHHH Confidence 998864 4689999999999999999999999722222111111111000 000000 00111 Q ss_pred --HHHHHHHHHhhh--hcCCCCEEEEehHHHHHHHHh--hhccCcc----ccccccccccccccCCcee-eeeeEEEeCC Q lcl|NC_021307. 164 --AIGVNALSLLVN--AGKKWGATLLDDVAEPILNGA--KDANGRP----LFVESTYEAVTTPYREGRI-LGRPTILSDH 232 (310) Q Consensus 164 --~~~~~~~~~l~~--~~~~~~~~~~~~~~~~~l~~l--~d~~g~~----~~~~~~~~~~~~~~~~~~l-~G~pv~~t~~ 232 (310) ..+..+...+.. .....+.++|+++....|... .+..+.. -+..+...+. ..+.| .+++|+++++ T Consensus 362 L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~----~~G~l~~~~~vy~D~y 437 (529) T protein:vir:10 362 LLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALVDAGITPAAQGMASGLNADTTKGV----FAGVLGGRYKVYIDQY 437 (529) T ss_pred HHHHHHHHHHHHHHhhccccceEEEEchHHHHHHhhhccccccccccccccceeecCCce----EEEEecCceEEEecCC Confidence 111122222322 223577899999999988632 1111100 0111111111 12333 3469999988 Q ss_pred CCCCceeEeeecc------eeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccCceEE- Q lcl|NC_021307. 233 VASGTTVGYLGDF------SQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVK- 305 (310) Q Consensus 233 ~~~~~~~~~~gd~------~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~- 305 (310) .+.+ .+++|-. ..+++...-..... ...+|.++ | =.++|+ .|++..+ +| |+. T Consensus 438 ~~~d--y~~vG~KG~~~~~~glfy~PYv~l~~~------------~~~dp~sf-q-P~~g~~--tRY~l~~-NP--~~~~ 496 (529) T protein:vir:10 438 ARQD--YFTMGYRGANNLDAGIYYCPYVALTPL------------RGSDPKNF-Q-PVMGFK--TRYAIGV-NP--FAES 496 (529) T ss_pred CCcc--eEEEEEeCCcccccceeeccccccccc------------cccCCCcc-c-ceeeee--eeeceee-cC--cccc Confidence 7654 2222211 01111111111100 01112221 1 123333 4665543 22 221 Q ss_pred --------EeecC Q lcl|NC_021307. 306 --------LTNAA 310 (310) Q Consensus 306 --------l~~aa 310 (310) +.+.. T Consensus 497 ~~~~~~~r~~~g~ 509 (529) T protein:vir:10 497 RTQAPTSRISNGM 509 (529) T ss_pred ccccccccccCCc Confidence 11111 No 223 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=77.48 E-value=0.12 Score=25.55 Aligned_cols=285 Identities=13% Similarity=0.059 Sum_probs=120.2 Q ss_pred Cccch-------hhhHHHH-----------HhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEE- Q lcl|NC_021307. 1 MAAGT-------AFPVNHT-----------QIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIP- 61 (310) Q Consensus 1 ~aa~~-------~~~~~~~-----------~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip- 61 (310) +-+.+ ++-.+.. .+..+++++.. ..-|.+ -.++++.-+..+..+++.+.||.+++.-|- T Consensus 49 ~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~i~es~~t~~v~-~~~P~L-i~lvRra~p~LIa~DIwGVQPMTgPTGLIFA 126 (521) T protein:vir:10 49 EYKDEKIAQAFGSFLTEAEIGGDHGYNATNIAAGQTSGAVT-QIGPAV-MGMVRRAIPNLIAFDICGVQPMNSPTGQVFA 126 (521) T ss_pred ccchhHHHHHHhhhhhhhcccCccccccccccccccccccc-cCCchh-hhHHHHHHhhhhhhhceeeccCCchhhhhee Confidence 11111 1111100 01111111111 112222 235666667778899999999987653321 Q ss_pred ---EEcCC---------------ceeeeec-------------------------------------------------- Q lcl|NC_021307. 62 ---HWTGD---------------VSAAWIG-------------------------------------------------- 73 (310) Q Consensus 62 ---~~~~~---------------~~a~~v~-------------------------------------------------- 73 (310) +.... +++.|-+ T Consensus 127 MRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~ 206 (521) T protein:vir:10 127 LRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAA 206 (521) T ss_pred eeeeccCCccccccccccchhccccccccccccccccccccccccccccccccccccccccceecccccccCCCcccccc Confidence 10000 0011100 Q ss_pred ---------------------------c---------cccccccccceeeeEeeeeeeEeeehhhHHHhhcC----hhHH Q lcl|NC_021307. 74 ---------------------------E---------GDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN----PGNY 113 (310) Q Consensus 74 ---------------------------E---------g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s----~~~~ 113 (310) | +..+++-.-.+++++++.+..+=....|-||.+|- ..|. T Consensus 207 ~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDA 286 (521) T protein:vir:10 207 KLDAEIKKQMEAGALVEIAEGMATSIAELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDA 286 (521) T ss_pred cccccccccccccceeecccccchhhHhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCCh Confidence 1 11234445555677777766666777899998864 4689 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcccCcccccccccccc------c-ccceecc-----cchHHHHH------HHHHHHhhh Q lcl|NC_021307. 114 LGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTK------S-VDLTPAT-----GTTYDAIG------VNALSLLVN 175 (310) Q Consensus 114 ~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~------~-~~~~~~~-----~~~~~~~~------~~~~~~l~~ 175 (310) |+.|.+.|+..|...+|+.|+.=-....-.+..+.+. + ....... ....+... ......+.. T Consensus 287 EtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~ 366 (521) T protein:vir:10 287 DAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIAR 366 (521) T ss_pred HHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999995322222111111110 0 0000000 00111111 111112222 Q ss_pred --hcCCCCEEEEehHHHHHHHHhh--h-c--cC-ccccccccccccccccCCcee-eeeeEEEeCCCCCCceeEeeecc- Q lcl|NC_021307. 176 --AGKKWGATLLDDVAEPILNGAK--D-A--NG-RPLFVESTYEAVTTPYREGRI-LGRPTILSDHVASGTTVGYLGDF- 245 (310) Q Consensus 176 --~~~~~~~~~~~~~~~~~l~~l~--d-~--~g-~~~~~~~~~~~~~~~~~~~~l-~G~pv~~t~~~~~~~~~~~~gd~- 245 (310) .....+.++|+++....|...- + . .| ..-+..+.+... ..+.| .+++|+++++.+.+ .+++|-. T Consensus 367 ~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~----~~G~l~~~~~vy~D~y~~~d--y~~vG~KG 440 (521) T protein:vir:10 367 QTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATGFNTDTTKSV----FAGVLGGKYRVYIDQYAKQD--YFTVGYKG 440 (521) T ss_pred hcccccceEEEEchHHHHHHhhcccccccccccccccccccCCCce----EEEEecCceEEEecCCCCcc--eEEEEEeC Confidence 2245677999999988887421 0 0 00 111222221111 12333 34699999887654 2222211 Q ss_pred -----eeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecc-------------------- Q lcl|NC_021307. 246 -----SQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDV-------------------- 300 (310) Q Consensus 246 -----~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~-------------------- 300 (310) ..+++...-..... ...+|.++ | =.++|+ .|++..+- | T Consensus 441 ~~~~~~glfyaPYv~l~~~------------~~~dp~sf-q-P~~g~~--tRY~l~~N-P~~~~~~~~~~~~i~~~~~~~ 503 (521) T protein:vir:10 441 PNEMDAGIYYAPYVALTPL------------RGSDPKNF-Q-PVMGFK--TRYGIGIN-PFAESAAQAPASRIQSGMPSI 503 (521) T ss_pred Ccccccceeeccccccccc------------cccCCccc-c-ceeeee--eeeceeec-CcccccCCccceeecccchhh Confidence 11122211111000 01112121 1 123333 35554331 1 Q ss_pred -------CceEEEeecC Q lcl|NC_021307. 301 -------EAFVKLTNAA 310 (310) Q Consensus 301 -------~a~~~l~~aa 310 (310) ..|.+++.+= T Consensus 504 ~a~~~~~sy~r~v~v~~ 520 (521) T protein:vir:10 504 LNSLGKNAYFRRVYVKG 520 (521) T ss_pred hccccccceeeeeeecC Confidence 0122222211 No 224 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=74.24 E-value=0.16 Score=24.94 Aligned_cols=286 Identities=13% Similarity=0.098 Sum_probs=136.6 Q ss_pred CccchhhhHHHHHhhcccc------CCCCceechhhHHHHHHHHH---hhchhhhhcceeecCCCceEEEEEcC---Cce Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGD------SMFQGYLEPEQAQDYFAEAE---KTSIVQRVARKIPMGSTGVKIPHWTG---DVS 68 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~------~~~g~~i~~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~ip~~~~---~~~ 68 (310) --+-+.++.+..++..+|. ..+++.+..+..++-+..|- +.-.+.+-..+.|..+...++-.... ... T Consensus 12 ~~~~~~~~e~~~KS~~tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~~a~sTv~~y~~~~~~G~~g~ 91 (462) T protein:vir:96 12 NKYADKFQEEVMKSYQTGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYREISRRPAQSTVQKYDVYLRHGNVGH 91 (462) T ss_pred hhhhchhhHHHHHHHhcCCCcCCccccccchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCcccc Confidence 1222344444555555533 22345566555544444332 22235555566666665545444433 244 Q ss_pred eeeecccccccccccceeeeEeeeeeeEeeehhhHHHhh-cChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccc----- Q lcl|NC_021307. 69 AAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVR-ANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFD----- 142 (310) Q Consensus 69 a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~-~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~----- 142 (310) +.++.|+...+.+++.+...+...|-++..-.+|...-. .+..+.+....+.-...++..+|.+.|.|+..-.+ T Consensus 92 ~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~dai~~~a~tiE~a~Fygds~l~~~~~~~ 171 (462) T protein:vir:96 92 SRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTEDAIAVVAKTIEWASFYGDASLTADPTGQ 171 (462) T ss_pred ccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCcccc Confidence 688999999999999999999999999877766665443 34567778888888889999999999999876444 Q ss_pred -cccccccccc---cce-ecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCcccccccccccccccc Q lcl|NC_021307. 143 -KNLDETTKSV---DLT-PATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPY 217 (310) Q Consensus 143 -~~~~~~~~~~---~~~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~ 217 (310) ....+.++-. ... .-+.....+++......+...+....-++|+..+.+.|..-.-..-|.+.+++.... ..+. T Consensus 172 gleFDGl~~lI~~~NViDarG~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~l~~qrv~~~~n~g~~-~~G~ 250 (462) T protein:vir:96 172 GLEFDGLAKLIDKDNVIDAKGESLTETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSVLGRQMQLMQDNSGNV-NAGY 250 (462) T ss_pred ccchhhhhhhcCCCceeecCCCCccHHHHhhhhhhcccccCChhheecchHHHHHHHHhhcCceEEEEcCCCCce-eeee Confidence 2222222211 111 122333344554455566778888889999999999997433333333333322211 1000 Q ss_pred --------------CCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhh--h Q lcl|NC_021307. 218 --------------REGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLW--Q 281 (310) Q Consensus 218 --------------~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~--~ 281 (310) .+.++.+-|-+...... ++.. -.....|...-..+ .. ..| + T Consensus 251 ~v~~f~s~~G~I~L~~s~~m~~~~i~~~~~~---------~~p~----ap~~~~vsaTv~t~------~~----g~f~~~ 307 (462) T protein:vir:96 251 NVQGFYSSRGFIKLHGSTVMENELILDESLQ---------PLPN----APQPATVKATVETG------KK----GLFTDE 307 (462) T ss_pred eccceeeeeeeeeeCCceecCcccccccccc---------cCCC----CCCCCceeEEEEeC------CC----CCCCCc Confidence 00111111221111110 0000 00000010000000 00 000 0 Q ss_pred cC--cEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 282 HN--LVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 282 ~~--~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +| ...+++.+.=+..=--|+..+-++.++ T Consensus 308 ~d~~~y~Y~V~avs~dgeS~PS~~VtaTva~ 338 (462) T protein:vir:96 308 HDRAELTYKVVVNSDDAQSAPSEAVTATVNN 338 (462) T ss_pred cCceeEEEEEEEECCCCccccceeeEeeeec Confidence 00 111111111110000112222222222 No 225 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=71.89 E-value=0.19 Score=24.54 Aligned_cols=276 Identities=11% Similarity=-0.021 Sum_probs=123.6 Q ss_pred CCCceechhhHHHHHHHHHhhchhhhhcc------eeecCCCceEEEEEcCCceeeee-c-ccccccccccceeeeEeee Q lcl|NC_021307. 21 MFQGYLEPEQAQDYFAEAEKTSIVQRVAR------KIPMGSTGVKIPHWTGDVSAAWI-G-EGDMKPITKGDMSVQQVEP 92 (310) Q Consensus 21 ~~g~~i~~~~~~~ii~~~~~~s~l~~~~~------~~~~~~~~~~ip~~~~~~~a~~v-~-Eg~~~~~~~~~~~~i~l~~ 92 (310) ++.-.....+...+.+.+.+.+....++. ....++.+++||+.....-.... . .|..-...+.++.+.+++. T Consensus 1 MA~~n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~~t~~ldq 80 (299) T protein:vir:79 1 MAALNYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAWEPKVLTN 80 (299) T ss_pred CccchhHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcceeEEEeec Confidence 21112346677888888888877655532 22344678999988653322222 2 2222223455667777777 Q ss_pred eeeEeeehhhHHHhh-cChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHHHH Q lcl|NC_021307. 93 HKIATIFVASAETVR-ANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNALS 171 (310) Q Consensus 93 ~k~~~~~~is~ell~-~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 171 (310) .|.-.+..=.-+.-+ .....+...+.+...+.++-.+|+-.+..--++... .......+..+.....+.+.++.. T Consensus 81 dr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~----~g~~~~~~~~T~~n~y~~i~~~~~ 156 (299) T protein:vir:79 81 QRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTA----LGNTADTTVLTTTNVLEVFDKLME 156 (299) T ss_pred cccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhh----cCCcccccccCHHHHHHHHHHHHH Confidence 765555433222222 111234444555555666667777655432111100 000111112223344555567777 Q ss_pred HhhhhcCC--CCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeC--CCCC------CceeEe Q lcl|NC_021307. 172 LLVNAGKK--WGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSD--HVAS------GTTVGY 241 (310) Q Consensus 172 ~l~~~~~~--~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~--~~~~------~~~~~~ 241 (310) .|.+.... +-..+++|..+..|.+-..- .+..-. ...+....+.-+.|.|.||+..+ .+.. |..... T Consensus 157 ~lde~~vP~~~rvl~vtp~~~~~L~~~~~f-~k~~~~--~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~~~~ 233 (299) T protein:vir:79 157 KMTEARVPENGRILYVTPVVNTLIKNAKEI-QRTVNI--KDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWKVGA 233 (299) T ss_pred HHHhcCCCCCCeEEEeCHHHHHHHhhchhh-hccccc--ccccceeeeeeeeecceEEEEechhhcCccceeccCccccC Confidence 77766553 34566788888777542210 111111 11112233445779999998633 2331 211111 Q ss_pred eecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecc--CceEEEeecC Q lcl|NC_021307. 242 LGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDV--EAFVKLTNAA 310 (310) Q Consensus 242 ~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~--~a~~~l~~aa 310 (310) -|---++++. ..+..+.+.....+....+... +++--.+.-..+.|.=|.+. +++..-..+| T Consensus 234 ~ak~in~ii~-~~~a~~~~~K~~~~~~~~P~~~------~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a 297 (299) T protein:vir:79 234 GAKQIFMSLV-HPSAIITPVSYQFSKLDEPTAV------TEGKYFYFEESFEDVFILNKKADAIQFVVEGA 297 (299) T ss_pred cccccceEEE-cCCeeeeeEeeeeEEeecCCCC------CccceeeeeeeeeeeeeeccccCeEEEEeeec Confidence 0100112333 2333444444444444333221 22221122222333333322 3443334444 No 226 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=68.24 E-value=0.24 Score=23.98 Aligned_cols=284 Identities=14% Similarity=0.131 Sum_probs=122.3 Q ss_pred Cccc--------hhhh-------------HHHHHhhcccc--CCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCc Q lcl|NC_021307. 1 MAAG--------TAFP-------------VNHTQIAQTGD--SMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG 57 (310) Q Consensus 1 ~aa~--------~~~~-------------~~~~~~~~~~~--~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 57 (310) |... +-|+ ....+-+-.|. +...-.+|...+..|-..+...+|+++..-+.+++.-- T Consensus 1 mtnfiesqnavteffdvlkknsgkseiknawnaklaengvtitdttfqlprklvesintallntnpvfkvfhvtnvgall 80 (318) T protein:vir:94 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 80 (318) T ss_pred CccchhhhhhHHHHHHHHhcccChhhhhhhhhhhhhhCCceeecchhhhHHHHHHhhhhhhccCCcceeeeeehhhhhee Confidence 1111 1111 11111111111 11112355566666666777788888877766654432 Q ss_pred eEEEEEcCCceeeeecccccccccccceeeeEeeeeeeEeeehhhHHH--hhcChhHHHHHHHHHHHHHHHHH-HHHHHH Q lcl|NC_021307. 58 VKIPHWTGDVSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAET--VRANPGNYLGTMRTKVATAIALA-FDEAAL 134 (310) Q Consensus 58 ~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~el--l~~s~~~~~~~v~~~l~~a~~~~-~d~~~l 134 (310) +. ...+++.++....+|+.+++...++.--++.|--++....+.... +.+|.-.+...+..++..++..+ +|-+++ T Consensus 81 vs-rsfdssneaqvhkdgqtkteqaatltidtlepvmvyklqslaervkrlqmsyselynlivaeltqaivnkivdlalv 159 (318) T protein:vir:94 81 VS-RSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALV 159 (318) T ss_pred ee-ccccccchhhhhcccccccccceeeeecccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHhhhhheeee Confidence 22 223556677888999999998888877778887777666665543 45666778888888888888665 577888 Q ss_pred cccCccccccccccc------cccccee-cccchHHHHHHHHHHHhhhhcCCCCEEEEehHHH-HHHHHhhhccCccccc Q lcl|NC_021307. 135 HGTDSPFDKNLDETT------KSVDLTP-ATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAE-PILNGAKDANGRPLFV 206 (310) Q Consensus 135 ~G~g~~~~~~~~~~~------~~~~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~-~~l~~l~d~~g~~~~~ 206 (310) .|+|+.+-..+..-. ...+.+. ++.++..+.+.....-+.+.. ..-.++...... +.|..++-+..+. T Consensus 160 egdgtngfksidkeadvkkikkittkaksagktpfadaieeavdfvrpta-grrylivktedrkalldelrqatana--- 235 (318) T protein:vir:94 160 EGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA-GRRYLIVKTEDRKALLDELRQATANA--- 235 (318) T ss_pred ecCCcchhhhhchhhhHHHHHHhhhhhhhcCCCchhHHHHHHHhhhccCC-CceEEEEeccchHHHHHHHHhhhccc--- Confidence 999987655443211 1111112 233344443333322222221 122233333333 2233332211100 Q ss_pred cccccccccccCCceeeeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEE Q lcl|NC_021307. 207 ESTYEAVTTPYREGRILGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVA 286 (310) Q Consensus 207 ~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 286 (310) +-+|..-...+...+..++..++-|... --.++.++...-+.+.+....+. .-|.+|+-- T Consensus 236 ------------nvriknddteiasevgvdeiivytgska-------vkptvlvdqkyhidmqdltkvda-fewktnsnm 295 (318) T protein:vir:94 236 ------------NVRIKNDDTEIASEVGVDEIIVYTGSKA-------VKPTVLVDQKYHIDMQDLTKVDA-FEWKTNSNM 295 (318) T ss_pred ------------ceEEeccchhhhhhcCcceeEEeecccc-------ccceeEeccceecchhhhhhhhc-eeeccCCce Confidence 0011110000000111111111111100 00011111111111111111000 013344433 Q ss_pred EEEEEEeccEEeccCceEEEeec Q lcl|NC_021307. 287 VRVEAEYGLLINDVEAFVKLTNA 309 (310) Q Consensus 287 ~r~~~~~d~~v~~~~a~~~l~~a 309 (310) +.++.--.+.+..-+|=++++.. T Consensus 296 ilvetltsghvetynagavitvs 318 (318) T protein:vir:94 296 ILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred EEEEecccCcceeecCceeEEeC Confidence 44444444444433333444444 No 227 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=62.64 E-value=0.33 Score=23.21 Aligned_cols=289 Identities=12% Similarity=0.059 Sum_probs=123.3 Q ss_pred Cccchh--------------hhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEEEE--- Q lcl|NC_021307. 1 MAAGTA--------------FPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIPHW--- 63 (310) Q Consensus 1 ~aa~~~--------------~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~--- 63 (310) ++.... ..+.......+++++.... -|.++ .++++..+..+..+++.+.||.+++.-|.-. T Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~t~~v~~~-~P~Li-~l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsr 120 (468) T protein:vir:10 43 LREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAGF-DPVLI-SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSR 120 (468) T ss_pred HhccccccchhhHhhcCCcccchhhhhhhhccccccccc-Cchhh-hhHHHHHhhhhhhhceeeecCCccceeeeEEEEE Confidence 111111 1222222223333332222 33332 3556666777889999999998776443211 Q ss_pred --c-CCcee-------eee------------------------------------------------cc-----cccccc Q lcl|NC_021307. 64 --T-GDVSA-------AWI------------------------------------------------GE-----GDMKPI 80 (310) Q Consensus 64 --~-~~~~a-------~~v------------------------------------------------~E-----g~~~~~ 80 (310) + .+.++ .|. +| +.++++ T Consensus 121 Y~n~~g~EAf~nEadt~fSg~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~~~~~~g~gMsTa~aE~lG~~~~~f~E 200 (468) T protein:vir:10 121 YENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFRE 200 (468) T ss_pred ecCCCCccceeccccccccccccccccccccccccccccCCCCCcccccccccccccccccccchHHHhhcCCCCcccce Confidence 0 00000 010 01 122344 Q ss_pred cccceeeeEeeeeeeEeeehhhHHHhhcC----hhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccc-cccccccccce Q lcl|NC_021307. 81 TKGDMSVQQVEPHKIATIFVASAETVRAN----PGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKN-LDETTKSVDLT 155 (310) Q Consensus 81 ~~~~~~~i~l~~~k~~~~~~is~ell~~s----~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~-~~~~~~~~~~~ 155 (310) -.-.+++++++.+..+=....|-||.+|- ..|.++.|.+.|+..|...+|+.|+.---+....+ ..+........ T Consensus 201 MaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va~~~k~~g~~~~Gv~d 280 (468) T protein:vir:10 201 MSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFD 280 (468) T ss_pred eeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhhheeccccccccccc Confidence 44555677777777666777899998864 46789999999999999999998886433211110 00000000000 Q ss_pred ecccch----HHHHH-------HHHHHH-hhhhcCCCCEEEEehHHHHHHHHhhhccCcccccccccccc---------c Q lcl|NC_021307. 156 PATGTT----YDAIG-------VNALSL-LVNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEAV---------T 214 (310) Q Consensus 156 ~~~~~~----~~~~~-------~~~~~~-l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~---------~ 214 (310) -..... .+... ..+... ........+.++|+++....|.. .|..-+.+...... . T Consensus 281 ~~~~~~~rw~~e~~k~L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~----sG~l~~~~~~~~~~~~~~~~~D~t 356 (468) T protein:vir:10 281 LDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAM----AGVLDYSSGLNGAGGPSIGEVDDT 356 (468) T ss_pred ccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEechhHHHHHhh----cCcceecccccccccccccccccC Confidence 000010 11111 111111 12234567789999999999874 23332222211110 0 Q ss_pred cccCCceee-eeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeeccc------ccccchhhhhcCcEEE Q lcl|NC_021307. 215 TPYREGRIL-GRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTP------QAPNFVSLWQHNLVAV 287 (310) Q Consensus 215 ~~~~~~~l~-G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~ 287 (310) .....+.|. |++|+++.....+. ++.++.+|..+..++. +++.+..+ ...+|.+ || =.++| T Consensus 357 g~~~~G~l~~r~~vy~D~Ya~~~s------~~dY~~vG~KG~~~~d----~glfyaPYv~l~~~~~~dp~s-fq-P~~g~ 424 (468) T protein:vir:10 357 GNLAVGTINGRIKVFVDPYAANLS------DKHYYVIGYKGTSPYD----AGLFYCPYVPLQMVRSIDPNT-FQ-PKIGF 424 (468) T ss_pred cceEEEEecCceEEEEccccccCC------ccceEEEEEecCccee----ceeeeccccccccccccCCCc-cc-ceeee Confidence 011122333 56899887654321 1122223322222211 11111111 0111212 11 12333 Q ss_pred EEEEEeccEEeccCce-EEEeecC Q lcl|NC_021307. 288 RVEAEYGLLINDVEAF-VKLTNAA 310 (310) Q Consensus 288 r~~~~~d~~v~~~~a~-~~l~~aa 310 (310) + .|++..+- |=+. ..++.-. T Consensus 425 ~--tRY~l~~N-P~~~~~~~~~g~ 445 (468) T protein:vir:10 425 K--TRYGMVSN-PFVTTNGLYNGT 445 (468) T ss_pred e--eeeceeec-ccceeccccCCC Confidence 3 35554431 1000 0111111 No 228 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=61.63 E-value=0.35 Score=23.08 Aligned_cols=298 Identities=9% Similarity=-0.056 Sum_probs=125.8 Q ss_pred CccchhhhHH----------HHHhhcccc---CCCCc-eechhhHHHHHHHHHhhchh-----hhhcceeecCCCceEEE Q lcl|NC_021307. 1 MAAGTAFPVN----------HTQIAQTGD---SMFQG-YLEPEQAQDYFAEAEKTSIV-----QRVARKIPMGSTGVKIP 61 (310) Q Consensus 1 ~aa~~~~~~~----------~~~~~~~~~---~~~g~-~i~~~~~~~ii~~~~~~s~l-----~~~~~~~~~~~~~~~ip 61 (310) +..|.+.... ......+.+ .+.+. +..++ .+. +++...+ ..+.++..+.+..+++- T Consensus 41 i~~g~~~~~~~~t~~w~~d~l~~~~~~~ta~~~a~~T~i~V~~--~~~---f~~~~l~~~~~~~EvirVtsVng~~lTV~ 115 (418) T protein:vir:96 41 TSVVGSTTAKASTHGYFSKTMVFASAVVTAEALADATVLTVEN--SDG---LTKGMIFYNEATGENMRLELVNGLNLTVK 115 (418) T ss_pred hcccCccccceeEEEEEeeEeeeeeEEEEEEEecCceEEEecC--Ccc---cccccEEEEecCCeEEEEEEEeCCEEEEE Confidence 1122221110 000000000 00000 11110 011 2233322 22344455567777777 Q ss_pred EEcCCceeeeeccc-------ccccccccceeeeEeeeeeeEeeehhhHHHhhcChh-----------HHHHHHHHHHHH Q lcl|NC_021307. 62 HWTGDVSAAWIGEG-------DMKPITKGDMSVQQVEPHKIATIFVASAETVRANPG-----------NYLGTMRTKVAT 123 (310) Q Consensus 62 ~~~~~~~a~~v~Eg-------~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s~~-----------~~~~~v~~~l~~ 123 (310) |...+..+.-++.| ..++|..-..+.....+..+.-++.|-++...-|.- ++.....++|.+ T Consensus 116 RG~~~t~aa~iaag~~~~~ig~~~eEGsd~~ta~~~k~~~vsN~tQIf~e~vsVSgTAqA~v~qaGvsn~~~~e~d~l~~ 195 (418) T protein:vir:96 116 RQTGRIAAAIIAANTKLIVIGTAFEEGSQRPTARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDF 195 (418) T ss_pred EccCCeeeeeeecCceEEEeecCcccccccCCcceecceeccchhheehhhhhhhhhhhhhhhhcCcchhHHHHHHHHHH Confidence 76555444333332 334454444444455555555666666655543322 222222344555 Q ss_pred HHHHHHHHHHHcccC---ccc--cc----cccc---c---cccccceecccchHHHHHHHHHHHhh-hhcCCC------C Q lcl|NC_021307. 124 AIALAFDEAALHGTD---SPF--DK----NLDE---T---TKSVDLTPATGTTYDAIGVNALSLLV-NAGKKW------G 181 (310) Q Consensus 124 a~~~~~d~~~l~G~g---~~~--~~----~~~~---~---~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~------~ 181 (310) . ...+|.+++.|.. +.+ +. .... . .+..........+.+.+...+..... ..+... - T Consensus 196 ~-kv~iE~ali~g~~~~~~~ng~p~~~t~R~m~gI~~f~~~Nvi~ag~~~~~t~d~L~~~~~~a~~~g~n~G~~~~~~~y 274 (418) T protein:vir:96 196 H-ATEQETAIFFGQAFMGTYNGQPLHTTQGIVDAIRQYAPDNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMF 274 (418) T ss_pred H-HHHHHHhhhccccccCCCCCcccccccchhHHHHhhccccccccCCCCcCCHHHHHHHHHHHHhhcCCCCCcccceEE Confidence 5 4477888888873 211 11 1110 0 11111111112334444444444333 111222 2 Q ss_pred EEEEehHHHHHHHHhhhccCccccccccccccccccCCceeeeeeEEEeCCCCCCcee---EeeecceeeeEEee--ccc Q lcl|NC_021307. 182 ATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILGRPTILSDHVASGTTV---GYLGDFSQIVWGQV--GGL 256 (310) Q Consensus 182 ~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G~pv~~t~~~~~~~~~---~~~gd~~~~~~~~~--~~~ 256 (310) +++++.+....|.++-. .-++ -+.....|.......-..--++++.++++|.++.. +++-|.+.+-+... +.. T Consensus 275 ~~~V~a~~k~~I~k~~~-~I~~-~~~en~~G~vv~~~~Td~G~v~ii~n~~~pad~I~~g~mlVvD~~~vkL~yL~~R~~ 352 (418) T protein:vir:96 275 CDTVGMRTMQDIGRFFG-EVTV-TQRETSYGMVFTEWKFFKGRLIIKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNA 352 (418) T ss_pred EEEeChHHHHHHhhhhc-eeEe-ccccceeceEEEEEEeeccEEEEEecCCCCccccCcceEEEEecCceEEEEecCCCc Confidence 36789999999987743 2222 22222222222221111222588899988877632 33446665544443 344 Q ss_pred EEEEeecceee---ecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeec--C Q lcl|NC_021307. 257 SFDVSDQATLN---LGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNA--A 310 (310) Q Consensus 257 ~v~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~a--a 310 (310) ..+.+-..+-. ...+.....-..-+.++ ....+...+.++++.++|++- | T Consensus 353 ~~E~l~k~G~~~~~~~~~~~~~~~~D~~~G~----l~~Eltle~~N~~a~a~itgl~~~ 407 (418) T protein:vir:96 353 KVENYGQGGGENKSGATDYSYGHGVDAQGGS----LTSEWALELLNPQGCAVITGLQKA 407 (418) T ss_pred cchhcccCCCcccccccccccccccccccCE----EEEEEEEEeecccccEEeeccccc Confidence 44433222200 00000000000123333 355677788999999999864 3 No 229 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=61.55 E-value=0.35 Score=23.07 Aligned_cols=286 Identities=12% Similarity=0.047 Sum_probs=124.1 Q ss_pred Cc------------cchhhhHHHHH-----------hhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCc Q lcl|NC_021307. 1 MA------------AGTAFPVNHTQ-----------IAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG 57 (310) Q Consensus 1 ~a------------a~~~~~~~~~~-----------~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 57 (310) ++ +..++-.|+.- +..++++.-.+ +-|. .-.++++.-+..+..+++.+.||++++ T Consensus 45 ~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~es~~t~~v~~-~~P~-li~lvrRa~p~LIa~DIwGVQPMTgPT 122 (522) T protein:vir:69 45 FEVSPEYKDEKIAQAFGSFLTEAEIGGDHGYNAQNIAAGQTSGAVTQ-IGPA-VMGMVRRAIPNLIAFDICGVQPMNSPT 122 (522) T ss_pred hhcccccchhHHHHhhhhhhhhhccccccCCCccccccccccccccc-ccch-HHHHHHHHHhhhhhhhceeeccCCchh Confidence 11 11122222110 11111111111 1122 223566667778888999999997765 Q ss_pred eEEE----EEcCC---------------ceeee----------------------------------------------- Q lcl|NC_021307. 58 VKIP----HWTGD---------------VSAAW----------------------------------------------- 71 (310) Q Consensus 58 ~~ip----~~~~~---------------~~a~~----------------------------------------------- 71 (310) .-|- +.... +++.| T Consensus 123 GLIFAMRsrY~~q~~~~~~~eaf~~~neadt~fSG~~~~t~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t 202 (522) T protein:vir:69 123 GQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGQGAAKKFPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSS 202 (522) T ss_pred hhheeeeeeccCCcccCccccccccccccccccccccccccccccccccccccccccccccccccceeeecccCCcCCCC Confidence 3221 00000 00000 Q ss_pred ----------------------------e--cc---------cccccccccceeeeEeeeeeeEeeehhhHHHhhcC--- Q lcl|NC_021307. 72 ----------------------------I--GE---------GDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN--- 109 (310) Q Consensus 72 ----------------------------v--~E---------g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s--- 109 (310) . +| +..+++-.-.+++++++.+..+=....|-||.+|- T Consensus 203 ~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAI 282 (522) T protein:vir:69 203 ADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAV 282 (522) T ss_pred CcccccccchhccccccccceeeccccchhhhhhcccCCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHh Confidence 0 11 11345555666777777777777778899998864 Q ss_pred -hhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccc-----cccceecccch-------HHHH------HHHHH Q lcl|NC_021307. 110 -PGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTK-----SVDLTPATGTT-------YDAI------GVNAL 170 (310) Q Consensus 110 -~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~-----~~~~~~~~~~~-------~~~~------~~~~~ 170 (310) ..|.|+.|.+.|+..|...+|+.|+.=-....-.+..+.+. ..........+ .+.. +..+. T Consensus 283 HGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~a 362 (522) T protein:vir:69 283 HGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEA 362 (522) T ss_pred cCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhheeeccccccccccccceeecccccccccchhHHHHHHHHHHHHHHHH Confidence 46899999999999999999999985332222111111110 00000000001 1111 11122 Q ss_pred HHhhh--hcCCCCEEEEehHHHHHHHHhh-----hccC-ccccccccccccccccCCcee-eeeeEEEeCCCCCCceeEe Q lcl|NC_021307. 171 SLLVN--AGKKWGATLLDDVAEPILNGAK-----DANG-RPLFVESTYEAVTTPYREGRI-LGRPTILSDHVASGTTVGY 241 (310) Q Consensus 171 ~~l~~--~~~~~~~~~~~~~~~~~l~~l~-----d~~g-~~~~~~~~~~~~~~~~~~~~l-~G~pv~~t~~~~~~~~~~~ 241 (310) ..+.. .....+.++|+++....|...- .+.| ..-+..+.+... ..+.| .+++|+++++.+.+ .++ T Consensus 363 n~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~----~~G~l~~~~~vy~D~y~~~d--y~~ 436 (522) T protein:vir:69 363 VEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLASGFNTDTTKSV----FAGVLGGKYRVYIDQYAKQD--YFT 436 (522) T ss_pred HHHHHhcccccccEEEEchhHHHHHhhcccccccccccccccccccCCCce----EEEEecCceEEEecCCCCcc--eEE Confidence 22222 2235778999999998886421 0101 111222221111 12333 34699999887654 222 Q ss_pred eecc------eeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEe------ccCceEEEeec Q lcl|NC_021307. 242 LGDF------SQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIN------DVEAFVKLTNA 309 (310) Q Consensus 242 ~gd~------~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~------~~~a~~~l~~a 309 (310) +|-. ..+++...-..... ...+|.++ | =.++|+ .|++..+- ..+-.++|... T Consensus 437 vG~KG~~~~~~glfyaPYv~l~~~------------~~~dp~sf-q-P~~g~~--tRY~l~vNP~~~~~~~~~~~ri~~g 500 (522) T protein:vir:69 437 VGYKGANEMDAGIYYAPYVALTPL------------RGSDPKNF-Q-PVMGFK--TRYGIGVNPFAESSLQAPGARIQSG 500 (522) T ss_pred EEEeCCcccccceeeccccccccc------------cccCCccc-c-ceeeee--eeeceeecCcccccCCcccceeecc Confidence 3211 11122221111110 01122221 1 123333 35554331 00112233322 Q ss_pred C Q lcl|NC_021307. 310 A 310 (310) Q Consensus 310 a 310 (310) . T Consensus 501 ~ 501 (522) T protein:vir:69 501 M 501 (522) T ss_pred c Confidence 2 No 230 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=56.31 E-value=0.46 Score=22.44 Aligned_cols=285 Identities=12% Similarity=0.051 Sum_probs=118.3 Q ss_pred Cccch-------hhhHHH-----------HHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEE- Q lcl|NC_021307. 1 MAAGT-------AFPVNH-----------TQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIP- 61 (310) Q Consensus 1 ~aa~~-------~~~~~~-----------~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip- 61 (310) +-+.+ ++-.+. ..+..+++++... .-|.+ -.++++.-+..+..+++.+.||.+++.-|. T Consensus 49 ~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~iaes~~t~~v~~-~~P~L-i~lvRra~p~LIa~DIwGVQPMTgPTGLIFA 126 (521) T protein:vir:72 49 EYKDEKIAQAFGSFLTEAEIGGDHGYNATNIAAGQTSGAVTQ-IGPAV-MGMVRRAIPNLIAFDICGVQPMNSPTGQVFA 126 (521) T ss_pred cccchHHHHHHhhhhhhhcccCccccCccccccccccccccc-CCchh-hhHHHHHHhhhhhhhceeeccCCchhhhhee Confidence 11111 111110 0011111211111 12222 235566667778889999999977543221 Q ss_pred ---EEcCC---------------ceeeee--------------------------------------------------- Q lcl|NC_021307. 62 ---HWTGD---------------VSAAWI--------------------------------------------------- 72 (310) Q Consensus 62 ---~~~~~---------------~~a~~v--------------------------------------------------- 72 (310) +.... +.+.|- T Consensus 127 MRsrY~~q~~~~~g~ea~~~e~~~da~fSG~~~~~~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~ 206 (521) T protein:vir:72 127 LRAVYGKDPVAAGAKEAFHPMYGPDAMFSGQGAAKKFPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAA 206 (521) T ss_pred eeeeecCCCCCcccccccchhcccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCCcc Confidence 10000 000000 Q ss_pred --------------------------cc---------cccccccccceeeeEeeeeeeEeeehhhHHHhhcC----hhHH Q lcl|NC_021307. 73 --------------------------GE---------GDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN----PGNY 113 (310) Q Consensus 73 --------------------------~E---------g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s----~~~~ 113 (310) +| +..+++-.-.+++++++.+...=....|-||.+|- ..|. T Consensus 207 ~t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDA 286 (521) T protein:vir:72 207 KLDAEIKKQMEAGALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDA 286 (521) T ss_pred ccccccccccccCceeeeecccchhhhhhhcccCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCCh Confidence 01 01133444444666666666666677899998864 4689 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcccCcccccccccccc------c-ccceecc-----cchHHHHH------HHHHHHhhh Q lcl|NC_021307. 114 LGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTK------S-VDLTPAT-----GTTYDAIG------VNALSLLVN 175 (310) Q Consensus 114 ~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~------~-~~~~~~~-----~~~~~~~~------~~~~~~l~~ 175 (310) |+.|.+.|+..|...+|+.|+.=-....-.+..+.+. + ....... ....+... ......+.. T Consensus 287 EtELaNILSTEImlEINReii~~i~~sa~~g~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~ 366 (521) T protein:vir:72 287 DAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIAR 366 (521) T ss_pred HHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999995322222111111110 0 0000000 00111111 111112222 Q ss_pred --hcCCCCEEEEehHHHHHHHHhh--hc-cC---ccccccccccccccccCCcee-eeeeEEEeCCCCCCceeEeeecc- Q lcl|NC_021307. 176 --AGKKWGATLLDDVAEPILNGAK--DA-NG---RPLFVESTYEAVTTPYREGRI-LGRPTILSDHVASGTTVGYLGDF- 245 (310) Q Consensus 176 --~~~~~~~~~~~~~~~~~l~~l~--d~-~g---~~~~~~~~~~~~~~~~~~~~l-~G~pv~~t~~~~~~~~~~~~gd~- 245 (310) .....+.++|+++....|...- +. .+ ..-+..+.+... ..+.| .+++|+++++.+.+ .+++|-. T Consensus 367 ~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~----~~G~l~~~~~vy~D~y~~~d--y~~vG~KG 440 (521) T protein:vir:72 367 QTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATGFSTDTTKSV----FAGVLGGKYRVYIDQYAKQD--YFTVGYKG 440 (521) T ss_pred hcccccceEEEEchHHHHHHhhcccccccccccccccccccCCCce----EEEEccCceEEEecCCCCcc--eEEEEEeC Confidence 2245677999999988887421 00 00 001111111111 12233 45799999887654 2222211 Q ss_pred -----eeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecc-------C------------ Q lcl|NC_021307. 246 -----SQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDV-------E------------ 301 (310) Q Consensus 246 -----~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~-------~------------ 301 (310) ..+++...-..... ...+|.++ | =.++|+ .|++..+- | + T Consensus 441 ~~~~~~glfyaPYv~l~~~------------~~~dp~sf-q-P~~g~~--tRY~l~~N-P~~~~~~~~~a~~i~~~~~~~ 503 (521) T protein:vir:72 441 PNEMDAGIYYAPYVALTPL------------RGSDPKNF-Q-PVMGFK--TRYGIGIN-PFAESAAQAPASRIQSGMPSI 503 (521) T ss_pred Ccccccceeeccccccccc------------cccCCccc-c-ceeeee--eeeceeec-CcccccCcccceeecCcChhh Confidence 11122211111000 01112121 1 123333 35554331 1 1 Q ss_pred --------ceEEEeecC Q lcl|NC_021307. 302 --------AFVKLTNAA 310 (310) Q Consensus 302 --------a~~~l~~aa 310 (310) -|.+++.+= T Consensus 504 ~a~~~~~sy~r~v~v~~ 520 (521) T protein:vir:72 504 LNSLGKNAYFRRVYVKG 520 (521) T ss_pred hcCccccceeeeeeecC Confidence 122221111 No 231 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=53.98 E-value=0.51 Score=22.17 Aligned_cols=270 Identities=10% Similarity=0.022 Sum_probs=116.3 Q ss_pred CCceechhhHHHHHHHHHhhchhhhhcc------eeecCCCceEEEEEcC--CceeeeecccccccccccceeeeEeeee Q lcl|NC_021307. 22 FQGYLEPEQAQDYFAEAEKTSIVQRVAR------KIPMGSTGVKIPHWTG--DVSAAWIGEGDMKPITKGDMSVQQVEPH 93 (310) Q Consensus 22 ~g~~i~~~~~~~ii~~~~~~s~l~~~~~------~~~~~~~~~~ip~~~~--~~~a~~v~Eg~~~~~~~~~~~~i~l~~~ 93 (310) -.-.+...+...+.+.....+....+.+ +...++.+++||.... +-..+-...|-...+-+.+++..+|+.. T Consensus 1 Main~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~~~g~v~~~~et~tl~~D 80 (285) T protein:vir:79 1 MTVVLDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDNARKTISVGKETVKLTHE 80 (285) T ss_pred CcchhhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCccccccceeeeEEEeecc Confidence 1112334556667777766655555432 3445577899999742 3333434444444445566666677665 Q ss_pred eeEeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccccceecccchHHHHHHHHHHHh Q lcl|NC_021307. 94 KIATIFVASAETVRANPGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVDLTPATGTTYDAIGVNALSLL 173 (310) Q Consensus 94 k~~~~~~is~ell~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 173 (310) +.-.+..=.-+.-+.....+...+.+...+...-.+|+-.|.---+... .....+.+.....+.+.++...+ T Consensus 81 R~~~f~iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~--------~~~~~~~T~~nv~~~i~~~~~~l 152 (285) T protein:vir:79 81 DWFGYDLDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAA--------KKATDSITKDNALDAYDTAEAYM 152 (285) T ss_pred ccceecccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhcc--------cccccccCHHHHHHHHHHHHHHH Confidence 5443332222222211223333333334444455666554432111110 00111122233444445566666 Q ss_pred hhhcC-CCCEEEEehHHHHHHHHhhhccCccccccccccccccccCCceeee-eeEEEeC--CCCCCceeEeeecceeee Q lcl|NC_021307. 174 VNAGK-KWGATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYREGRILG-RPTILSD--HVASGTTVGYLGDFSQIV 249 (310) Q Consensus 174 ~~~~~-~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~~~l~G-~pv~~t~--~~~~~~~~~~~gd~~~~~ 249 (310) .+... .+-.++|+|..+..|.+-+.-. +.+...............+.|.| .|++..+ .++.-+ ++.--+++ T Consensus 153 de~~vp~~rvl~vTp~~~~~Lk~s~~~~-r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~----~~k~Infi 227 (285) T protein:vir:79 153 FDNEVPGGFVMFVSSAYYTALKQSAAVT-RTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLG----ITNHVNFI 227 (285) T ss_pred HHcCCCCceEEEEChHHHHHHHhhhhhh-eecccccceeccceeeeeccccceeEEEEcchhhccCcC----cchhccEE Confidence 66544 3445778888888776433211 11111111111112234567888 8887643 343210 00001122 Q ss_pred EEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecc--CceEEEeecC Q lcl|NC_021307. 250 WGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDV--EAFVKLTNAA 310 (310) Q Consensus 250 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~--~a~~~l~~aa 310 (310) +.. ....+.+.........++.... .-|...+.-..+.|.=|.+. +++-.-..|| T Consensus 228 iv~-~~a~i~~~K~~~~~~f~P~~~~-----~~d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~ 284 (285) T protein:vir:79 228 LTP-LSAIAPIVKYDSVSVIDPSTDR-----SGNRWTIKGLSYYDAIVLDNAKKGIYVAATAG 284 (285) T ss_pred Eec-CceeccceeeeeeEeECCCCCC-----Ccceeeeeeeeeeeeeehhhccceeeeeeccc Confidence 222 2223333333333232222211 12223333334445544433 4455555555 No 232 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=50.36 E-value=0.61 Score=21.75 Aligned_cols=284 Identities=10% Similarity=0.054 Sum_probs=120.3 Q ss_pred Cccchh---------hhH-----------HHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEE Q lcl|NC_021307. 1 MAAGTA---------FPV-----------NHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKI 60 (310) Q Consensus 1 ~aa~~~---------~~~-----------~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~i 60 (310) ..+.-. +.. ...+ ..+++.+.. ..-|.+ -.+++++-+..+..+++.+.||.+++.-| T Consensus 56 ~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~ia-~s~~s~~v~-~~~P~L-i~lvRra~p~LIa~DIwGVQPMTgPTGLI 132 (534) T protein:vir:10 56 VNSMVDVKGRIEEARLAEANIGGDHGYDATKIA-SGETSGSIT-NVGPAV-MGLVRRAIPQLIAFDICGVQPMTSSTGQV 132 (534) T ss_pred hhhhhccccchhhcccccccccccccccccccc-ccccccccc-cccchh-hhHHHHHHHhhhhhhhheeccCCchhhhh Confidence 111111 110 0000 111111111 112222 23566666777888999999998765332 Q ss_pred E--E--Ec-C-----C---------ceeeeec------------------------------------------------ Q lcl|NC_021307. 61 P--H--WT-G-----D---------VSAAWIG------------------------------------------------ 73 (310) Q Consensus 61 p--~--~~-~-----~---------~~a~~v~------------------------------------------------ 73 (310) . | .. . . +.+.|.+ T Consensus 133 FAMRsrY~n~~~~~s~~EAf~ne~~adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~ 212 (534) T protein:vir:10 133 FTLRAIYGGNSQDANAREAFHPTYGPDADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVD 212 (534) T ss_pred eeeeeeecCCCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Confidence 1 1 10 0 0 0011100 Q ss_pred ----------------------------------c---------cccccccccceeeeEeeeeeeEeeehhhHHHhhcC- Q lcl|NC_021307. 74 ----------------------------------E---------GDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN- 109 (310) Q Consensus 74 ----------------------------------E---------g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s- 109 (310) | +..+++-.-.+++++++.+...=....|-||.+|- T Consensus 213 ~~~~~~~~ag~~~~~~~~~~~~y~~~~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLK 292 (534) T protein:vir:10 213 ALPADQTEAGLAYKWLLANGYAVETSSAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLR 292 (534) T ss_pred cccCCccccccccccccccccceecccccchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHH Confidence 1 01234455556777777777777777899998864 Q ss_pred ---hhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccc-ccc-----cc-ccceecc-----cchHHHHHHHHHHHh- Q lcl|NC_021307. 110 ---PGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLD-ETT-----KS-VDLTPAT-----GTTYDAIGVNALSLL- 173 (310) Q Consensus 110 ---~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~-~~~-----~~-~~~~~~~-----~~~~~~~~~~~~~~l- 173 (310) ..|.++.|.+.|+..|...+|+.|+.---+..-.+-. .++ .+ ....... ....+.. ..+...+ T Consensus 293 AIHGLDAEtELsNILSTEImlEINReii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~-~~L~~~i~ 371 (534) T protein:vir:10 293 AVHGLDADSELSSILANEIMHEINREMVLWINATAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESY-KALVVQID 371 (534) T ss_pred HhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhheeecccccccccccceeeeeccccccchhHHHHHH-HHHHHHHH Confidence 4678999999999999999999888643321111000 000 00 0000000 0011111 1222222 Q ss_pred ------hh--hcCCCCEEEEehHHHHHHHHhhhccCccccccccc------cccccccCCceee-eeeEEEeCCCCCCce Q lcl|NC_021307. 174 ------VN--AGKKWGATLLDDVAEPILNGAKDANGRPLFVESTY------EAVTTPYREGRIL-GRPTILSDHVASGTT 238 (310) Q Consensus 174 ------~~--~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~------~~~~~~~~~~~l~-G~pv~~t~~~~~~~~ 238 (310) .. .....+.++|+++....|.. .|...+.+... -+.......+.|. +++|+++++.+.+ T Consensus 372 ~~an~i~~~T~rg~~n~~v~S~~Va~~L~~----~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d-- 445 (534) T protein:vir:10 372 KEANEIARQTGRGQGNFIICSRNVAAALGH----TDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVYIDQYAVED-- 445 (534) T ss_pred HHHHHHHHhhccccccEEEEchhHHHHHhh----ccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcc-- Confidence 11 22357789999999988853 22221111000 0111111123333 4699999987764 Q ss_pred eEeeecc------eeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEe------ccCceEEE Q lcl|NC_021307. 239 VGYLGDF------SQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIN------DVEAFVKL 306 (310) Q Consensus 239 ~~~~gd~------~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~------~~~a~~~l 306 (310) .+++|-. ..+++...-.... ....+|.++ | =.++|+ .|++..+- ..+-+.++ T Consensus 446 y~~vG~KG~~~~~~glfyaPYv~l~~------------~~~~dp~sf-q-P~~g~~--tRY~l~~NP~~~~~~~~~~~~i 509 (534) T protein:vir:10 446 YFTVGYKGASEMDAGLYYCPYVALTP------------LRGTDPKNF-Q-PVLGFK--TRYGVKLHPMADATQNKGFAKI 509 (534) T ss_pred eEEEEEeCCcccccceeecccccccc------------ccccCCccc-c-ceeeee--eeeceeecCcccccCCcccccc Confidence 2222211 0112221111000 001122221 1 123333 35554431 11112233 Q ss_pred eec--------------------C Q lcl|NC_021307. 307 TNA--------------------A 310 (310) Q Consensus 307 ~~a--------------------a 310 (310) ... = T Consensus 510 ~~g~~~~~~~ag~n~~~~~~~Vk~ 533 (534) T protein:vir:10 510 SNGMPQHTNMFGKNAFFRRVLVAG 533 (534) T ss_pred ccCCcchhhhcccccceeeeeeec Confidence 321 1 No 233 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=45.77 E-value=0.76 Score=21.24 Aligned_cols=280 Identities=11% Similarity=0.002 Sum_probs=127.2 Q ss_pred CccchhhhHHH----------HHh-hccc------cCCCCceechhhHHHHHHHHHh---hchhhhhcceeecCCCceEE Q lcl|NC_021307. 1 MAAGTAFPVNH----------TQI-AQTG------DSMFQGYLEPEQAQDYFAEAEK---TSIVQRVARKIPMGSTGVKI 60 (310) Q Consensus 1 ~aa~~~~~~~~----------~~~-~~~~------~~~~g~~i~~~~~~~ii~~~~~---~s~l~~~~~~~~~~~~~~~i 60 (310) -|....-+++. .+. ..+| +-.+|+.+.-+-.++-+..+-. .-.+.+-....|..+...++ T Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~k~a~t~gy~~~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ftf~~~i~k~~a~STV~ey 99 (514) T protein:vir:10 20 RAVAFDTNKEDILNENLPENVKKSAFTAGHSITPDTQTDGAANRIESLNRDLKVTTWGERDFTLYNDIAKQPVDNTVLKY 99 (514) T ss_pred eeeeecCcHHHHHHHhcchhhhhhhhccccccCCccccCccchhhhhhccceeEeeecCcchhhhhhcCCchhhHHHhhh Confidence 11111111111 111 1111 1123344554444443333322 22244445556655554444 Q ss_pred EEEcC---CceeeeecccccccccccceeeeEeeeeeeEeeehhhHHHhh-cChhHHHHHHHHHHHHHHHHHHHHHHHcc Q lcl|NC_021307. 61 PHWTG---DVSAAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVR-ANPGNYLGTMRTKVATAIALAFDEAALHG 136 (310) Q Consensus 61 p~~~~---~~~a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~-~s~~~~~~~v~~~l~~a~~~~~d~~~l~G 136 (310) -.... ...+.++.|+.-.+.+++.+....+..+-++....+|..+-. ++..+.+....+.-...++..+|.+.|.| T Consensus 100 ~~~~~~G~~G~~~f~~E~gi~~~~d~~~~rk~~~~k~l~~~~~vS~~~~l~n~i~d~~~~~~~dai~~ia~tiE~a~FyG 179 (514) T protein:vir:10 100 TQYYSHGRTGHSLFQPEIGIGDVNNPNERQRTINIKYIVDTHVTSIALQRANTIVDSLKVQEYAAISTVIKTDEWAMFYG 179 (514) T ss_pred hhhcccCcccccccccccccCcCCCcceEEEEEeeeeeeeeeeeeehhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhh Confidence 33332 234678999999999999999999998888766555544332 36678888888888889999999999998 Q ss_pred cCcccc------cccccccccccc---e-ecccchHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHHHHhhhccCccccc Q lcl|NC_021307. 137 TDSPFD------KNLDETTKSVDL---T-PATGTTYDAIGVNALSLLVNAGKKWGATLLDDVAEPILNGAKDANGRPLFV 206 (310) Q Consensus 137 ~g~~~~------~~~~~~~~~~~~---~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~ 206 (310) +..-.+ .-..+..+-+.. . .-+.....+++..+...+...+....-++|+..+.+.|...-...-|.+.. T Consensus 180 Ds~L~s~~~~~gleFDGl~~lI~~~NvIDarG~~Ls~~~ln~aA~~i~~gfGt~TD~ylp~~vka~f~~~~~~~qRV~~~ 259 (514) T protein:vir:10 180 DADLTSGQKGEGLQFDGLFKLIAPENHIDLRGGRLSPAALNMAARKIGEGFGTPTDAYMPIGIKADFVNQHLNGQRVMLP 259 (514) T ss_pred cccCCCccccCcchhhhHHHhhcCCCeEecCCCCccHHHHhhhhhhhhcccCChhheeCchHHHHHHhhcccCcceEEee Confidence 764332 222222222211 1 112222334444444455666888888999999998887544443333222 Q ss_pred cccccccccccCCceeeeeeEE--EeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeecccccccchhhhhcCc Q lcl|NC_021307. 207 ESTYEAVTTPYREGRILGRPTI--LSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNL 284 (310) Q Consensus 207 ~~~~~~~~~~~~~~~l~G~pv~--~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (310) .... ....|.|+- .+.. +... +-|+ . ++.... .+....... + ..-.... T Consensus 260 -~n~~--------~~~~G~~v~~f~s~~---G~I~-L~gs--~-im~~~n----------~L~~~~~~~--~-~Ap~~~~ 310 (514) T protein:vir:10 260 -GQTG--------GMTTGLDIDKFLSAH---GSIR-IQGS--T-IMDSDN----------KLDFDRPVS--P-TAPTAPQ 310 (514) T ss_pred -cCcc--------ceeeeeeccceeEec---ccee-ecCC--e-eecccc----------cCccCCccC--C-cCCCCCc Confidence 1111 113333331 1111 0000 0000 0 011000 000000000 0 0112233 Q ss_pred EEEEEEEEeccEEeccCceEEE----------------eecC Q lcl|NC_021307. 285 VAVRVEAEYGLLINDVEAFVKL----------------TNAA 310 (310) Q Consensus 285 ~~~r~~~~~d~~v~~~~a~~~l----------------~~aa 310 (310) +++-.+.. +....+|+-+..- +.++ T Consensus 311 va~svT~~-~~g~~~~ad~t~~~g~~~~~~~~g~~~sYaVv~ 351 (514) T protein:vir:10 311 LSATVTPD-GGGLWHEADKTDSKGEVILNKEVGVEQSYVAVM 351 (514) T ss_pred ceEEEecC-cccccCcccccccccccccccccceeEEEEEEE Confidence 44443322 2222232222211 1111 No 234 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=44.04 E-value=0.82 Score=21.05 Aligned_cols=295 Identities=10% Similarity=-0.027 Sum_probs=118.6 Q ss_pred CccchhhhHHHHH----------hhcccc----CCCCceechhhHHHHHHHHHhhchh-----hhhcceeecCCCceEEE Q lcl|NC_021307. 1 MAAGTAFPVNHTQ----------IAQTGD----SMFQGYLEPEQAQDYFAEAEKTSIV-----QRVARKIPMGSTGVKIP 61 (310) Q Consensus 1 ~aa~~~~~~~~~~----------~~~~~~----~~~g~~i~~~~~~~ii~~~~~~s~l-----~~~~~~~~~~~~~~~ip 61 (310) +..|......+.. ...+.+ ...+.+..+. .+. +.+...+ ..+.++..+.+..+++- T Consensus 41 i~~g~~~ta~ast~~w~~d~~~~~~~~~ta~a~a~~T~l~ve~--~~~---f~~~~l~~~~~~~Evirv~sVng~~lTV~ 115 (418) T protein:vir:10 41 TSVVGSTTAKASTHGYFSKTMVFASAVVTAEAAADATVLTVEN--SDG---LTKGMIFYNEATGENMRLELVNGLNLTVK 115 (418) T ss_pred hhcccccccceeEEEEEEEEEeeeeEEEEEEEecCceEEEEcC--cce---eccccEEEEccCCeEEEEEEEeCCEEEEE Confidence 2222222111000 000000 0000011100 001 2222221 11334444556777777 Q ss_pred EEcCCceeeeecc-------cccccccccceeeeEeeeeeeEeeehhhHHHhhcC-----------h-hHHHHHHHHHHH Q lcl|NC_021307. 62 HWTGDVSAAWIGE-------GDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN-----------P-GNYLGTMRTKVA 122 (310) Q Consensus 62 ~~~~~~~a~~v~E-------g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s-----------~-~~~~~~v~~~l~ 122 (310) |...+..+.-+++ |.+++|..-..+.....+..+.-++.|=++..+-| . .-+++...+++- T Consensus 116 Rg~~~t~aaaia~n~~~~~Ig~~~eEGsd~~ta~~~k~~~vsNvtQIF~~avsvSgTaqAs~~q~Gvsn~~ese~drk~~ 195 (418) T protein:vir:10 116 RQTGRISAAIIAANTKLIVIGTAFEEGSQRPTARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDF 195 (418) T ss_pred EecCCeeEEEEecCceEEEeccccccccccCCcceecceeccchhhhhhhhhhhhhhhhhccccccCchHHHHHHHHHHH Confidence 7665554433332 22334443333333333333333333322222211 1 124455445555 Q ss_pred HHHHHHHHHHHHccc----Cccccc--cccccc---------ccccceecccchHHHHHHHHHHHhh-hhcCCC------ Q lcl|NC_021307. 123 TAIALAFDEAALHGT----DSPFDK--NLDETT---------KSVDLTPATGTTYDAIGVNALSLLV-NAGKKW------ 180 (310) Q Consensus 123 ~a~~~~~d~~~l~G~----g~~~~~--~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~------ 180 (310) ++ ..+|+++++|. ++..+. ...++. +.++....+..+.+.+...+..... ...... T Consensus 196 ~a--v~iEkalI~G~~~~~~~~~g~~R~m~GIl~~vr~~~~gnVv~a~~~t~~s~d~l~~a~~~af~~g~~~G~~~q~~~ 273 (418) T protein:vir:10 196 HA--TEQETAIFFGQAFMGTYNGQPLHTTQGIVDAVRQYAPDNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVM 273 (418) T ss_pred HH--HHHHHHHhcccccCCCcCCcchhhHHHHHHHHhhhcccceeccCCCCccCHHHHHHHHHHHhhccCCCccccccee Confidence 44 38899999995 223221 111111 1111111122345555544444332 111122 Q ss_pred CEEEEehHHHHHHHHhhhccCccccccccccccccccCC-----ceeeeeeEEEeCCCCCCceeEeeecceeeeEEee-- Q lcl|NC_021307. 181 GATLLDDVAEPILNGAKDANGRPLFVESTYEAVTTPYRE-----GRILGRPTILSDHVASGTTVGYLGDFSQIVWGQV-- 253 (310) Q Consensus 181 ~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~~~~~~~~-----~~l~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~-- 253 (310) -+++++.+....+.++-- +=++. +.....|....... -.|.-.|++..-+||+|+. ++-|..++-+... T Consensus 274 f~~~V~~~~k~~I~k~~~-~I~~~-~~e~~~G~vv~~~~~~~G~I~L~~~p~~~~~~lp~g~m--lVvD~~~vkL~~L~~ 349 (418) T protein:vir:10 274 FCDTVGMRTMQDIGRFFG-EVTVT-QRETSYGMVFTEWKFFKGRLILKEHPLFSAIGISPGFA--VVVDVPAVKLAYMDG 349 (418) T ss_pred EEEEeChHHHHHhhhhhh-heeec-ccceeeeEEEEEEEcceEEEEeecccccccccCCCceE--EEEccccceEEEecc Confidence 346778888888876631 11111 11111121111110 1122236666668999874 4457766655544 Q ss_pred cccEEEEeecceee---ecccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 254 GGLSFDVSDQATLN---LGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 254 ~~~~v~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) +.+..+.+-..+-. ...+.....-..-+.++ ....+...+.++.+.+++++-- T Consensus 350 R~~~~E~l~k~G~~~~~~~~~~~~~~~~D~~kG~----iv~E~tLe~~N~~a~avitgl~ 405 (418) T protein:vir:10 350 RNAKVENYGQGGGENKSGATDYSYGHGVDAQGGS----LTSEWALELLNPQGCAVITGLQ 405 (418) T ss_pred ccccchhcccCCCcccccccccccccccccccce----EEEEeeeeeecccceEEeeccc Confidence 55555544332200 00000000000123333 3557778889999999988643 No 235 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=42.38 E-value=0.89 Score=20.87 Aligned_cols=288 Identities=11% Similarity=0.021 Sum_probs=118.5 Q ss_pred Ccc-------chhhhHHHHH-----------hhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEE-- Q lcl|NC_021307. 1 MAA-------GTAFPVNHTQ-----------IAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKI-- 60 (310) Q Consensus 1 ~aa-------~~~~~~~~~~-----------~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~i-- 60 (310) |-+ ...+-.+... +..+++.+.. ..-|.+ -.++++.-+..+..+++.+.||.+++.-| T Consensus 49 ~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~~s~~t~~v~-~~~P~L-i~lvRra~p~LIa~DIwGVQPMTgPTGLIFA 126 (524) T protein:vir:98 49 VYRDEKIVESFGGFLAEAEIAGDHNYDQTNIASGKSSGAIT-NIGPAV-IGMVRRAIPNLIAFDICGVQPMTGPTGQVFA 126 (524) T ss_pred cccchHHHHhhhccccccccccccccccccccccccccccc-cccchh-hhHHHHHHHhhhhhhhheeccCCchhhhhhh Confidence 111 1111111110 1111111111 122222 23556666777788889988887654222 Q ss_pred -----EEEc--CC-----c---------eeeee----------------------------------------------- Q lcl|NC_021307. 61 -----PHWT--GD-----V---------SAAWI----------------------------------------------- 72 (310) Q Consensus 61 -----p~~~--~~-----~---------~a~~v----------------------------------------------- 72 (310) +-.. .+ . ++.|. T Consensus 127 mRsrY~n~~~~~gteA~~nEAf~~~ye~dt~fSG~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tg 206 (524) T protein:vir:98 127 LRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTG 206 (524) T ss_pred hheeecCCCCCcccccccccccccccccccccCCccccccccccccccccccccccccccccccceeccccccCcccccc Confidence 1100 00 0 00000 Q ss_pred ------------------------------cc---------cccccccccceeeeEeeeeeeEeeehhhHHHhhcC---- Q lcl|NC_021307. 73 ------------------------------GE---------GDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN---- 109 (310) Q Consensus 73 ------------------------------~E---------g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s---- 109 (310) +| +..+++-.-.+++++++.+...=....|-||.+|- T Consensus 207 t~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aEaL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVH 286 (524) T protein:vir:98 207 ADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVH 286 (524) T ss_pred cccccccccccccccccceeecccccchhhhhhhccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhc Confidence 01 12234445555677777766666677899988864 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccccccccccc-------ceec-----ccchHHHH------HHHHHH Q lcl|NC_021307. 110 PGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSVD-------LTPA-----TGTTYDAI------GVNALS 171 (310) Q Consensus 110 ~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~~-------~~~~-----~~~~~~~~------~~~~~~ 171 (310) ..|.|+.|.+.|+..|...+|+.|+.=-......+..+.+..+. .... +....+.. +..+.. T Consensus 287 GLDAEtELsNILSTEImlEINReii~~i~~~a~~~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an 366 (524) T protein:vir:98 287 GMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEAN 366 (524) T ss_pred CCChHHHHHHHHHHHHHHHhhHHHHHHHhhhheeceeecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHH Confidence 46899999999999999999999985332222222121111100 0000 00011111 112222 Q ss_pred Hhhh--hcCCCCEEEEehHHHHHHHHh----hhccCccccccccccccccccCCcee-eeeeEEEeCCCCCCceeEeeec Q lcl|NC_021307. 172 LLVN--AGKKWGATLLDDVAEPILNGA----KDANGRPLFVESTYEAVTTPYREGRI-LGRPTILSDHVASGTTVGYLGD 244 (310) Q Consensus 172 ~l~~--~~~~~~~~~~~~~~~~~l~~l----~d~~g~~~~~~~~~~~~~~~~~~~~l-~G~pv~~t~~~~~~~~~~~~gd 244 (310) .+.. .....+.++|+++....|..+ .+..+..--.. ...... ....+.| .+++|+++++.+.+ .+++|- T Consensus 367 ~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~~~s~~~~~~~-~~d~~~-~~~~G~l~~~~~vy~D~y~~~d--y~~vG~ 442 (524) T protein:vir:98 367 EIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTL-NVDTTK-AVFAGVLGGTYKVYIDQYARQD--YFTVGF 442 (524) T ss_pred HHHHhhccccccEEEEchHHHHHHhhhhcccccccchhhccc-ccCCcc-ceEEEEecCceEEEecCCCCcc--eEEEEe Confidence 2322 223477899999998888742 11111110000 000000 0011233 35799999887654 222221 Q ss_pred c------eeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEe-------------------- Q lcl|NC_021307. 245 F------SQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIN-------------------- 298 (310) Q Consensus 245 ~------~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~-------------------- 298 (310) . ..+++...-..... ...+|.++ | =.++|+ .|++..+- T Consensus 443 KG~~~~~~glfyaPYv~l~~~------------~~~dp~sf-q-P~~g~~--tRY~l~~NP~~~~~~~~~~~ri~~g~~~ 506 (524) T protein:vir:98 443 KGDNEMDAGIYYAPYVALTPL------------RGSDPKNF-Q-PVMGFK--TRYGIGINPFANSRSQAPADRITSGMIS 506 (524) T ss_pred eCCcccccceeeccccccccc------------cccCCccc-c-ceeeee--eeeceeecCcccccCCccccccccCcch Confidence 1 01112211111000 01112111 1 122332 34444321 Q ss_pred ----ccCc-eEEEeecC Q lcl|NC_021307. 299 ----DVEA-FVKLTNAA 310 (310) Q Consensus 299 ----~~~a-~~~l~~aa 310 (310) ..++ |.++-.|- T Consensus 507 ~~~ag~n~~~r~~~Vk~ 523 (524) T protein:vir:98 507 KEMCGKNAYFRKVWVKG 523 (524) T ss_pred HhhcCccceeeEeeecc Confidence 0011 11111111 No 236 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=42.30 E-value=0.89 Score=20.86 Aligned_cols=290 Identities=9% Similarity=-0.010 Sum_probs=120.3 Q ss_pred CccchhhhHHHHHh------------------hccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEE- Q lcl|NC_021307. 1 MAAGTAFPVNHTQI------------------AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIP- 61 (310) Q Consensus 1 ~aa~~~~~~~~~~~------------------~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip- 61 (310) |-+..++....... ..++++...+ .-|.+ -.++++.-+..+..+++.+.||.+++.-|. T Consensus 49 ~~~~~~~~e~~~~~l~~~~~~~~~~~~~~~i~est~t~~v~~-~~P~L-i~lvRra~p~LIa~DIwGVQPMTgPTGLIFA 126 (529) T protein:vir:10 49 VYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAAGQSSGAITN-IGPAV-IGMVRRAIPSLIAFDIAGVQPMTGPTGQVFA 126 (529) T ss_pred ccchhhhhhhhhcccchhhccccccccccccccccccccccc-cCchh-hhhHHHHHhhhhhheeeeeecCCchhhhhhh Confidence 11111111111000 1111111111 12222 235556667777888899999865431110 Q ss_pred ---EE---cC--------------------------------------------------------------------C- Q lcl|NC_021307. 62 ---HW---TG--------------------------------------------------------------------D- 66 (310) Q Consensus 62 ---~~---~~--------------------------------------------------------------------~- 66 (310) +. .. . T Consensus 127 MRsrY~~~~~~~~~~eaf~~~y~Pda~~sga~~~ga~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~ 206 (529) T protein:vir:10 127 LRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASV 206 (529) T ss_pred hheeecCCccccccccccccccccccccccccccccccccCccccccccccccccccCcceeeeecccceeccccccccc Confidence 00 00 0 Q ss_pred ---------------------ce------eeee--cc---------cccccccccceeeeEeeeeeeEeeehhhHHHhhc Q lcl|NC_021307. 67 ---------------------VS------AAWI--GE---------GDMKPITKGDMSVQQVEPHKIATIFVASAETVRA 108 (310) Q Consensus 67 ---------------------~~------a~~v--~E---------g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~ 108 (310) .. +-.. +| +.++++-.-.+++++++.+..+=....|-||.+| T Consensus 207 ~~g~~~~~~~~~~~~~~~~a~~~~~~~~~Gm~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQD 286 (529) T protein:vir:10 207 TVGTNETGEALDKLINAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQD 286 (529) T ss_pred ccCccccCcccccccccccccccccccccccchhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHH Confidence 00 0000 01 1124455556677777777777777889999886 Q ss_pred C----hhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccc-cc-----cccc-ccceec-----ccchHHH------HH Q lcl|NC_021307. 109 N----PGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNL-DE-----TTKS-VDLTPA-----TGTTYDA------IG 166 (310) Q Consensus 109 s----~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~-~~-----~~~~-~~~~~~-----~~~~~~~------~~ 166 (310) - ..|.|+.|.+.|+..|...+|+.|+.---+-...+- .. ...+ ...... .....+. .+ T Consensus 287 LKAVHGLDAEtELsNILStEImlEINReii~~l~~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i 366 (529) T protein:vir:10 287 LRAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQI 366 (529) T ss_pred HHHhcCCChHHHHHHHHHHHHHHHhhHHHHHhHhhhhhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHH Confidence 4 467899999999999999999998864332111000 00 0000 000000 0001111 11 Q ss_pred HHHHHHhhh--hcCCCCEEEEehHHHHHHHHh--hhccCccccccccccccccccCCceee-eeeEEEeCCCCCCceeEe Q lcl|NC_021307. 167 VNALSLLVN--AGKKWGATLLDDVAEPILNGA--KDANGRPLFVESTYEAVTTPYREGRIL-GRPTILSDHVASGTTVGY 241 (310) Q Consensus 167 ~~~~~~l~~--~~~~~~~~~~~~~~~~~l~~l--~d~~g~~~~~~~~~~~~~~~~~~~~l~-G~pv~~t~~~~~~~~~~~ 241 (310) ..+...+.. .....+.++|+++....|... .+...-.-+..............+.|. |++|+++++.+.+ .++ T Consensus 367 ~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--y~~ 444 (529) T protein:vir:10 367 DKEANEIARQTGRGAGNFIIASRNVVSALALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQD--YFT 444 (529) T ss_pred HHHHHHHHHhhccccceEEEEchHHHHHHHhhhhhccccccccccccccccCCceEEEEecCceEEEecCCCCcc--eEE Confidence 122222322 223577899999999888632 111100001111111111111223443 4699999887654 222 Q ss_pred eecc------eeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEec-----cC-ceEEEeec Q lcl|NC_021307. 242 LGDF------SQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIND-----VE-AFVKLTNA 309 (310) Q Consensus 242 ~gd~------~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~-----~~-a~~~l~~a 309 (310) +|-. ..+++...-..... ...+|.++ | =.++|+ .|++..+-= .+ ..+++... T Consensus 445 vG~KG~~~~~~glfy~PYv~l~~~------------~~~dp~sf-q-P~~g~~--tRY~l~~NP~~~~~~~~~~~r~~~g 508 (529) T protein:vir:10 445 MGYRGANNLDAGIYYCPYVALTPL------------RGSDPKNF-Q-PVMGFK--TRYAIGVNPFAESRTQAPQGRITSG 508 (529) T ss_pred EEEeCCcccccceeeccccccccc------------cccCCCcc-c-ceeeee--eeeceeecCccccccccccccccCC Confidence 2211 01111111111100 01122221 1 123343 466554321 11 12223222 Q ss_pred C Q lcl|NC_021307. 310 A 310 (310) Q Consensus 310 a 310 (310) . T Consensus 509 ~ 509 (529) T protein:vir:10 509 M 509 (529) T ss_pred c Confidence 2 No 237 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=42.03 E-value=0.9 Score=20.83 Aligned_cols=285 Identities=13% Similarity=0.051 Sum_probs=119.0 Q ss_pred CccchhhhHHHHH-----------hhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEE----EE-c Q lcl|NC_021307. 1 MAAGTAFPVNHTQ-----------IAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIP----HW-T 64 (310) Q Consensus 1 ~aa~~~~~~~~~~-----------~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip----~~-~ 64 (310) +-+.-.+-.++.- ...++++.-.. +-|. .-.++++.-+..+..+++.+.||.+++.-|. +. + T Consensus 54 ~~~~~~~l~e~~~~~~~~~~~t~i~~~~~t~~v~~-~~P~-l~~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n 131 (519) T protein:vir:10 54 SEAFGSFLTEAEIGGDHGYDATNIAAGQTSGAVTQ-IGPA-VMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGK 131 (519) T ss_pred HHHHhhhcchhccCCccccCccccccccccccccc-cchh-HHHHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecC Confidence 0011111111110 01111111111 1121 1224556667778889999999977543321 00 0 Q ss_pred C-----C---------ceeeee---------------------------------------------------------- Q lcl|NC_021307. 65 G-----D---------VSAAWI---------------------------------------------------------- 72 (310) Q Consensus 65 ~-----~---------~~a~~v---------------------------------------------------------- 72 (310) . + +.+.|- T Consensus 132 ~~~~~~g~ea~~~~nEadt~fSG~~~~~~~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~ 211 (519) T protein:vir:10 132 DPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVT 211 (519) T ss_pred CccccccccccccccccccccCccccccccccccccccccccccccccccccccceeccccccccCCCCcCccccccccc Confidence 0 0 000000 Q ss_pred -----------c--------c---------cccccccccceeeeEeeeeeeEeeehhhHHHhhcC----hhHHHHHHHHH Q lcl|NC_021307. 73 -----------G--------E---------GDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN----PGNYLGTMRTK 120 (310) Q Consensus 73 -----------~--------E---------g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s----~~~~~~~v~~~ 120 (310) + | +..+++-.-.+++++++.+..+=....|-||.+|- ..|.|+.|.+. T Consensus 212 ~~~~~~~~~~~~~gmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNI 291 (519) T protein:vir:10 212 ALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGI 291 (519) T ss_pred cccccccccccccccccchhhccccCCCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHH Confidence 0 1 11234444555677777777777777899998864 46899999999 Q ss_pred HHHHHHHHHHHHHHcccCccccccccccccc--c-----cceec---ccch--------HHHHHHHHHHHhhh--hcCCC Q lcl|NC_021307. 121 VATAIALAFDEAALHGTDSPFDKNLDETTKS--V-----DLTPA---TGTT--------YDAIGVNALSLLVN--AGKKW 180 (310) Q Consensus 121 l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~--~-----~~~~~---~~~~--------~~~~~~~~~~~l~~--~~~~~ 180 (310) |+..|...+|+.|+.=-....-.+..+.+.. . ..... .+.. ....+..+...+.. ..... T Consensus 292 LSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~g 371 (519) T protein:vir:10 292 LATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAG 371 (519) T ss_pred HHHHHHHHhhHHHHhhhhhhhhcceeecccCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 9999999999999953222221111111110 0 00000 0000 11111122222322 23445 Q ss_pred CEEEEehHHHHHHHHhhh--c----cCccccccccccccccccCCcee-eeeeEEEeCCCCCCceeEeeecc------ee Q lcl|NC_021307. 181 GATLLDDVAEPILNGAKD--A----NGRPLFVESTYEAVTTPYREGRI-LGRPTILSDHVASGTTVGYLGDF------SQ 247 (310) Q Consensus 181 ~~~~~~~~~~~~l~~l~d--~----~g~~~~~~~~~~~~~~~~~~~~l-~G~pv~~t~~~~~~~~~~~~gd~------~~ 247 (310) +.++|+++....|...-. . ..+..+..+.+... ..+.| .+++|+++++.+.+ .+++|-. .. T Consensus 372 n~ii~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~----~~G~l~~~~~vy~D~y~~~d--y~~vG~KG~~~~~~g 445 (519) T protein:vir:10 372 NFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAV----FAGVLGGKYRVYIDQYARSD--YFTIGYKGSNEMDAG 445 (519) T ss_pred cEEEEchHHHHHHhhccchhccccccccccccccCCCce----EEEEecCceEEEecCCCCcc--eEEEEEecCcccccc Confidence 789999999988864320 0 00111111111111 12333 34699999987764 2222211 01 Q ss_pred eeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecc-------CceEEEee------------ Q lcl|NC_021307. 248 IVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDV-------EAFVKLTN------------ 308 (310) Q Consensus 248 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~-------~a~~~l~~------------ 308 (310) +++...-..... ...+|.++ | =.++|+ .|++..+- | +--+++.+ T Consensus 446 lfyaPYv~l~~~------------~~~dp~sf-q-P~~g~~--tRY~l~~N-P~~~~~~~~~~~~i~~g~~~~a~~~~~n 508 (519) T protein:vir:10 446 IYYAPYVALTPL------------RGSDPKNF-Q-PVMGFK--TRYGIGIN-PFADPAAQAPTKRIQNGMPDIVNSLGLN 508 (519) T ss_pred eeeccccccccc------------cccCCccc-c-ceeeee--eeeceeec-CcccccccCccceeccCchhhhccccCc Confidence 112211110000 01112121 1 123333 35544321 1 00111111 Q ss_pred --------cC Q lcl|NC_021307. 309 --------AA 310 (310) Q Consensus 309 --------aa 310 (310) |= T Consensus 509 ~y~r~v~v~~ 518 (519) T protein:vir:10 509 GYFRRVYVKG 518 (519) T ss_pred eeeeeeeeec Confidence 11 No 238 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=41.34 E-value=0.93 Score=20.75 Aligned_cols=286 Identities=12% Similarity=0.071 Sum_probs=116.6 Q ss_pred Cccc------------hhhhHHHHH-----------hhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCc Q lcl|NC_021307. 1 MAAG------------TAFPVNHTQ-----------IAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG 57 (310) Q Consensus 1 ~aa~------------~~~~~~~~~-----------~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 57 (310) ++.. ..+-.+.+- +..+++.+... +-|. .-.++++.-+..+..+++.+.||.+++ T Consensus 41 ~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~ia~s~~t~~v~~-~~P~-ll~lvRRa~~~LIa~DIwGVQPMTgPT 118 (514) T protein:vir:56 41 INNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGVTTGAVTN-IGPT-VMGMVRRAIPQLIAFDIAGVQPMTGPT 118 (514) T ss_pred HhcCCcccchhhhhhhhcccccccccccccccccccccccccccccc-cchh-HHHHHHHHHHhhhhhhhheeccCCchh Confidence 1111 111111100 00111111111 1122 223566666777888999999998765 Q ss_pred eEEE----EE-c---CCce---------eeeec----------------------------------------------- Q lcl|NC_021307. 58 VKIP----HW-T---GDVS---------AAWIG----------------------------------------------- 73 (310) Q Consensus 58 ~~ip----~~-~---~~~~---------a~~v~----------------------------------------------- 73 (310) .-|- +. . .+.+ +.|-+ T Consensus 119 GLIFAMRsrY~~~~~tg~EAf~~~nEadt~fSG~~~~~~~~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~ 198 (514) T protein:vir:56 119 SQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLA 198 (514) T ss_pred hhheeeeeeecCCCcccccccccccccCcCcccccccccccccccccccccccccccccccccccccccccccccccccc Confidence 3321 11 0 0001 11100 Q ss_pred -------------------------------c---------cccccccccceeeeEeeeeeeEeeehhhHHHhhcC---- Q lcl|NC_021307. 74 -------------------------------E---------GDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN---- 109 (310) Q Consensus 74 -------------------------------E---------g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s---- 109 (310) | +..+++-.-.+++++++.+...=....|-||.+|- T Consensus 199 ~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta~aEal~~lggs~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVH 278 (514) T protein:vir:56 199 VAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVH 278 (514) T ss_pred ccccccccccccccccchhhhhhhhhhhhhhhhcccCCCCcccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhc Confidence 1 11234444555666777766666777899998864 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHH---cccCc-ccccccccccc--ccccee---cccc--hHHHHHHHHHHHhh---- Q lcl|NC_021307. 110 PGNYLGTMRTKVATAIALAFDEAAL---HGTDS-PFDKNLDETTK--SVDLTP---ATGT--TYDAIGVNALSLLV---- 174 (310) Q Consensus 110 ~~~~~~~v~~~l~~a~~~~~d~~~l---~G~g~-~~~~~~~~~~~--~~~~~~---~~~~--~~~~~~~~~~~~l~---- 174 (310) ..|.|+.|.+.|+..|...+|+.|+ +-.-+ +......+... ...... ..+. ..+.. ..+...++ T Consensus 279 GLDAEtELsNILSTEImlEINReii~~l~~~atv~~~~~~~~~~~~G~~d~~~~~d~~~~~~~~e~~-~~l~~~i~~~an 357 (514) T protein:vir:56 279 GLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAY-KALLIQIEKEAN 357 (514) T ss_pred CCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeehhcccccccccccccccccccccccchHHHHHH-HHHHHHHHHHHH Confidence 4689999999999999999999995 22111 11100000000 000000 0000 11111 11221221 Q ss_pred -----hhcCCCCEEEEehHHHHHHHHh--hhccCccccccccc-cccccccCCcee-eeeeEEEeCCCCCCceeEeeecc Q lcl|NC_021307. 175 -----NAGKKWGATLLDDVAEPILNGA--KDANGRPLFVESTY-EAVTTPYREGRI-LGRPTILSDHVASGTTVGYLGDF 245 (310) Q Consensus 175 -----~~~~~~~~~~~~~~~~~~l~~l--~d~~g~~~~~~~~~-~~~~~~~~~~~l-~G~pv~~t~~~~~~~~~~~~gd~ 245 (310) ......+.++|+++....|... .+...-.-+.+... ........-+.| .+++|+++++.+.+ .+++| T Consensus 358 ~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g~~~~~~~~d~~~~~~aG~l~~~~~vy~D~y~~~d--y~~vG-- 433 (514) T protein:vir:56 358 EIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVND--YFTVG-- 433 (514) T ss_pred HHHhhcccccccEEEEchhHHHHHHhhhhhccccccCccccccccccCcceEEEEecCceEEEecCCCCcc--eEEEE-- Confidence 1234677899999999888631 11000000010000 000000011233 45799999987754 22222 Q ss_pred eeeeEEeecccEEEEeecceeeeccc------ccccchhhhhcCcEEEEEEEEeccEEeccCceEE----Eee------- Q lcl|NC_021307. 246 SQIVWGQVGGLSFDVSDQATLNLGTP------QAPNFVSLWQHNLVAVRVEAEYGLLINDVEAFVK----LTN------- 308 (310) Q Consensus 246 ~~~~~~~~~~~~v~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~----l~~------- 308 (310) ..+..+++ +++.+..+ ...+|.++ | =.++|+ .|++..+- | |.- ... T Consensus 434 ------~KG~~~~~----~glfyaPYv~l~~~~~~dp~sf-q-P~~g~~--tRY~l~~N-P--y~~~~~~~~~~~~~~~~ 496 (514) T protein:vir:56 434 ------FKGSTEMD----AGVFYSPYVPLTPLRGSDSKNF-Q-PVIGFK--TRYGVQVN-P--FADPTASATKVGNGAPV 496 (514) T ss_pred ------EecCccee----cceeeccccccccccccCCccc-c-ceeeee--eeeceeeC-C--CCCccccccccCCcchh Confidence 11111111 11111110 01122221 1 123333 46655432 2 210 001 Q ss_pred cC Q lcl|NC_021307. 309 AA 310 (310) Q Consensus 309 aa 310 (310) +| T Consensus 497 ~a 498 (514) T protein:vir:56 497 AA 498 (514) T ss_pred hh Confidence 01 No 239 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=38.47 E-value=1.1 Score=20.43 Aligned_cols=286 Identities=15% Similarity=0.086 Sum_probs=120.8 Q ss_pred Cc------------cchhhhHHHHH-----------hhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCc Q lcl|NC_021307. 1 MA------------AGTAFPVNHTQ-----------IAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG 57 (310) Q Consensus 1 ~a------------a~~~~~~~~~~-----------~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 57 (310) ++ +...+-.++.- +..++++.... .-|.+ -.++++.-+..+..+++.+.||.+++ T Consensus 43 ~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~es~~t~~v~~-~~P~L-i~lvRra~p~LIa~DIwGVQPMTgPT 120 (528) T protein:vir:80 43 FAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAAGQTTGAITN-VGPAV-IGMVRRAIPNLIAFDICGVQPMSTPT 120 (528) T ss_pred hhccccccchHHHHhhhhhccccccccccCCcccccccccccccccc-CCchh-hhHHHHHHhhhhhhhhheeccCCchh Confidence 11 11111111110 11122222111 12222 23566667778889999999997653 Q ss_pred eEEEEEc----CC---------------cee------------------------------------------------- Q lcl|NC_021307. 58 VKIPHWT----GD---------------VSA------------------------------------------------- 69 (310) Q Consensus 58 ~~ip~~~----~~---------------~~a------------------------------------------------- 69 (310) .-|--.. .. +++ T Consensus 121 GLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS~~~t~~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~ 200 (528) T protein:vir:80 121 SQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFHSSLAAKGAAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQN 200 (528) T ss_pred hhheeeeeeecCCccccccccccccccccccccccccccccccccccccccccccccccccccceecccccccccccccc Confidence 2211000 00 000 Q ss_pred -----------------------------e--------eecc---------cccccccccceeeeEeeeeeeEeeehhhH Q lcl|NC_021307. 70 -----------------------------A--------WIGE---------GDMKPITKGDMSVQQVEPHKIATIFVASA 103 (310) Q Consensus 70 -----------------------------~--------~v~E---------g~~~~~~~~~~~~i~l~~~k~~~~~~is~ 103 (310) + -.+| +.++++-.-.+++++++.+..+=....|- T Consensus 201 ~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTi 280 (528) T protein:vir:80 201 VTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSI 280 (528) T ss_pred ccccccCccccCCcccccccccccccccccccccccchhhhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccH Confidence 0 0011 12244555556777777777777777899 Q ss_pred HHhhcC----hhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccc-------cceec---cc--chHHHH-- Q lcl|NC_021307. 104 ETVRAN----PGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSV-------DLTPA---TG--TTYDAI-- 165 (310) Q Consensus 104 ell~~s----~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~-------~~~~~---~~--~~~~~~-- 165 (310) ||.+|- ..|.|+.|.+.|+..|...+|+.|+.=-......+-.+.+..+ ..... .+ ...+.. T Consensus 281 ELAQDLKAIHGLDAEtELaNILStEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~ 360 (528) T protein:vir:80 281 EVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKS 360 (528) T ss_pred HHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeeccccccccccchhHHHHHH Confidence 998864 4689999999999999999999997432222111111111000 00000 00 011111 Q ss_pred ----HHHHHHHhhh--hcCCCCEEEEehHHHHHHHHhh-----hcc-CccccccccccccccccCCceee-eeeEEEeCC Q lcl|NC_021307. 166 ----GVNALSLLVN--AGKKWGATLLDDVAEPILNGAK-----DAN-GRPLFVESTYEAVTTPYREGRIL-GRPTILSDH 232 (310) Q Consensus 166 ----~~~~~~~l~~--~~~~~~~~~~~~~~~~~l~~l~-----d~~-g~~~~~~~~~~~~~~~~~~~~l~-G~pv~~t~~ 232 (310) +..+...+.. .....+.++|+++....|...- ... ....+..+.+. ....+.|. +++|+++++ T Consensus 361 L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~----~~~~G~l~~~~~vy~D~y 436 (528) T protein:vir:80 361 LIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTK----AVFAGVLAGKYKVFIDQY 436 (528) T ss_pred HHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCC----ceEEEEecCceEEEecCC Confidence 1122222322 2334578999999998886421 011 11111111111 11123343 469999988 Q ss_pred CCCCceeEeee---cc---eeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEe-----cc- Q lcl|NC_021307. 233 VASGTTVGYLG---DF---SQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIN-----DV- 300 (310) Q Consensus 233 ~~~~~~~~~~g---d~---~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~-----~~- 300 (310) .+.+ .+++| +- ..+++...-.....+. . +|.+ || =.++|+ .|++..+- .. T Consensus 437 ~~~d--y~~vG~KG~~~~~~glfy~PYv~l~~~~~--------~----dp~s-fq-P~~g~~--tRY~l~~NP~~~~~~~ 498 (528) T protein:vir:80 437 ARQD--YFTVGYKGDNEMDAGIYYAPYVALTPLRA--------T----DPQS-FH-PVLGFK--TRYGIGINPFADSKSQ 498 (528) T ss_pred CCcc--eEEEEEeCCcccccceeecccccceeeEe--------e----CCcc-cc-ceeeee--eeeceeecCcccccCC Confidence 7654 22222 11 1122222211111111 1 1111 11 123332 34444321 00 Q ss_pred ------------------Cc-eEEEeecC Q lcl|NC_021307. 301 ------------------EA-FVKLTNAA 310 (310) Q Consensus 301 ------------------~a-~~~l~~aa 310 (310) ++ |.++..|= T Consensus 499 ~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~ 527 (528) T protein:vir:80 499 APSARITSGMLSKDSVGKNAYFRRVWVKG 527 (528) T ss_pred cccccccccchhhhhcCccceeEEeeecc Confidence 11 11111111 No 240 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=35.93 E-value=1.2 Score=20.15 Aligned_cols=286 Identities=14% Similarity=0.026 Sum_probs=110.1 Q ss_pred CccchhhhHHHHHhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceee-----cCCCceEEE-----EEcCCceee Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIP-----MGSTGVKIP-----HWTGDVSAA 70 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~-----~~~~~~~ip-----~~~~~~~a~ 70 (310) +++-..-..+....+.++-+..++.-... .. .....+.+.......| ...+...+. ..+...++- T Consensus 116 ~~~nq~gtEAlfnEadt~fSg~~~~~~~~--~~---~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~GM~Ta~aE~l 190 (462) T protein:vir:10 116 RPANSDFREALFNEPNAGFSGGAGTGLSN--YD---PTASSSAVNDAEGANPGLLNDSPAGTYEVTGDATGMATATAEAL 190 (462) T ss_pred cccccccchhhhccCCcCccccccccccc--cc---cccccccccccccccceeecCCCccceecccccccccchhcccc Confidence 22211111111222222211111100000 00 0000000000000000 001111110 011111111 Q ss_pred eecc-cccccccccceeeeEeeeeeeEeeehhhHHHhhcC----hhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccc- Q lcl|NC_021307. 71 WIGE-GDMKPITKGDMSVQQVEPHKIATIFVASAETVRAN----PGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKN- 144 (310) Q Consensus 71 ~v~E-g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~s----~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~- 144 (310) .-++ +..+++-.-.+++++++.+..+=....|-||.+|- ..|.++.|.+.|+..|...+|+.|+.---+..-.+ T Consensus 191 g~~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~l~~~a~~~k 270 (462) T protein:vir:10 191 DDSSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEILAEINREVVRTIYVNAVKGA 270 (462) T ss_pred CCccCCcchhhceeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhhheeee Confidence 1112 34577888888899998888888888999998864 46889999999999999999999986433221110 Q ss_pred cccccccc--c--ceecccchHHHHHHHHHHHh---------hhhcCCCCEEEEehHHHHHHHHhhhccCcccccccccc Q lcl|NC_021307. 145 LDETTKSV--D--LTPATGTTYDAIGVNALSLL---------VNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYE 211 (310) Q Consensus 145 ~~~~~~~~--~--~~~~~~~~~~~~~~~~~~~l---------~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~ 211 (310) ........ . ....+....+.. ..+...+ .......+.++|+++....|.. .|..-+.|.... T Consensus 271 ~~~~~~~Gv~dl~~~~~gr~~~e~~-k~l~~qi~~ean~i~~~t~r~~~n~~i~S~~Va~~La~----sG~l~~~p~~~~ 345 (462) T protein:vir:10 271 IANTATDGIFDLDVDSNGRWSVEKF-KGLLFQIERDSNAIGQETRRGKGNILICSADVASALGM----AGVLDYAPGLQG 345 (462) T ss_pred cccccccceeeeccccchHHHHHHH-HHHHHHHHHHHHHHHHHhccccceEEEEchhHHHHhhh----ccchhccccccc Confidence 00000000 0 000011111111 1222222 2233567789999999888842 232222221100 Q ss_pred c-------cccccCCcee-eeeeEEEeCCCCCCceeEeeecceeeeEEeecccEEEEeecceeeeccc------ccccch Q lcl|NC_021307. 212 A-------VTTPYREGRI-LGRPTILSDHVASGTTVGYLGDFSQIVWGQVGGLSFDVSDQATLNLGTP------QAPNFV 277 (310) Q Consensus 212 ~-------~~~~~~~~~l-~G~pv~~t~~~~~~~~~~~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~------~~~~~~ 277 (310) . .......+.| .+++|+++.....+. ++.++.+|..+..+++ +++.+..+ ...+|. T Consensus 346 ~~~~~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns------~~dy~~vG~KG~~~~~----~glfy~PYv~l~~~~~~dp~ 415 (462) T protein:vir:10 346 NSALTGVDDTSSTLVGTLNGRIKVYVDPYSSNVA------DKHFYVAGYKGTSPYD----AGLFYCPYVPLQQVRAINPN 415 (462) T ss_pred cccccccccccceeEEEecCceEEEEecccCCCc------ccceEEEEEeCCcccc----cceeeccccccccccccCCc Confidence 0 0111112333 446888887654321 1112222322222111 12211111 011121 Q ss_pred hhhhcCcEEEEEEEEeccEEe-------cc---------CceEEEeecC Q lcl|NC_021307. 278 SLWQHNLVAVRVEAEYGLLIN-------DV---------EAFVKLTNAA 310 (310) Q Consensus 278 ~~~~~~~~~~r~~~~~d~~v~-------~~---------~a~~~l~~aa 310 (310) ++ | =.++|+ .|++..+- ++ --|.++..|= T Consensus 416 sf-q-P~~g~~--tRY~l~~NP~t~~~~~~~~~~~~~~n~y~r~~~v~~ 460 (462) T protein:vir:10 416 TF-Q-PKIGFK--TRYGMVSNPFSGGLTQGSGALTANANKYYRRVQVAN 460 (462) T ss_pred cc-c-ceeeee--eeeeeeecCCCCCcCCccccccccCcceeeeEEeec Confidence 21 1 123333 35544321 10 1133333333 No 241 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=32.43 E-value=1.4 Score=19.74 Aligned_cols=289 Identities=11% Similarity=0.007 Sum_probs=124.9 Q ss_pred CccchhhhHHHHHhhccccCC------CCceechhhHHHHHHHHH---hhchhhhhcceeecCCCceEEEEEcC---Cce Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSM------FQGYLEPEQAQDYFAEAE---KTSIVQRVARKIPMGSTGVKIPHWTG---DVS 68 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~------~g~~i~~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~ip~~~~---~~~ 68 (310) =+--+....+..+...+|-+. +|+.+..|-.++-|..|- +.-.+.+-..+.+..+...+|-.... ... T Consensus 12 ~~~~~~~~e~~~Ks~~agy~~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~~di~k~~a~stv~~y~~~~~~G~~g~ 91 (468) T protein:vir:63 12 EVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGH 91 (468) T ss_pred ccChhHHHHHHHHHHHcCcccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcccchhhhhhhhheeeeccCcccc Confidence 122223323333334433322 344565555444333332 22224444445555555444444433 244 Q ss_pred eeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhc-ChhHHHHHHHHHHHHHHHHHHHHHHHcccCcc--ccccc Q lcl|NC_021307. 69 AAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRA-NPGNYLGTMRTKVATAIALAFDEAALHGTDSP--FDKNL 145 (310) Q Consensus 69 a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~-s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~--~~~~~ 145 (310) +.++.|+...+.+++.+.......|-++..-.+|..+-.. +..+.+....+.-...++..+|.+.|.|+..- .+... T Consensus 92 ~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~ 171 (468) T protein:vir:63 92 TRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQ 171 (468) T ss_pred ccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCcc Confidence 6889999999999999999999999999866666554433 34577788888888889999999999998654 22211 Q ss_pred -----cccc---ccccceeccc-chHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHH-HHhhhccCcccccccccccccc Q lcl|NC_021307. 146 -----DETT---KSVDLTPATG-TTYDAIGVNALSLLVNAGKKWGATLLDDVAEPIL-NGAKDANGRPLFVESTYEAVTT 215 (310) Q Consensus 146 -----~~~~---~~~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l-~~l~d~~g~~~~~~~~~~~~~~ 215 (310) .+.. ..-......+ ....+++..+...+...+....-++|+..+.+.| .......-+... +.... T Consensus 172 ~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~~L~~q~~v~~--~n~~~--- 246 (468) T protein:vir:63 172 AGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLVR--DNGNN--- 246 (468) T ss_pred ccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCceEEEEc--CCCCc--- Confidence 1111 1111111112 2223333334444555677778899999888777 332222221111 11111 Q ss_pred ccCCceeeeeeEE--EeC--CCCCCceeEeeecceeeeEE------eecccEEEEeecceeeecccccccchhhhhcC-- Q lcl|NC_021307. 216 PYREGRILGRPTI--LSD--HVASGTTVGYLGDFSQIVWG------QVGGLSFDVSDQATLNLGTPQAPNFVSLWQHN-- 283 (310) Q Consensus 216 ~~~~~~l~G~pv~--~t~--~~~~~~~~~~~gd~~~~~~~------~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~-- 283 (310) ...|.||- ++. .+...... +++|...+--. -..+-.+..+... ..... |... T Consensus 247 -----~~~G~~v~g~~sa~G~I~l~gs~-il~~~~~l~~~~~~~~~Apsp~~vsaT~~~--------~~~g~--~~~~~~ 310 (468) T protein:vir:63 247 -----VSVGFNIQGFHSARGFIKLHGST-VMENEQILDERILALPTAPQPAKVTATQEA--------GKKGQ--FRAEDL 310 (468) T ss_pred -----eeeeecccceecceeeeeecCce-eeccccCCCcccccccccccCCccceeeec--------ccCCc--ccCCCc Confidence 12222220 000 00000000 11111110000 0000000000000 00000 0000 Q ss_pred -cEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 284 -LVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 284 -~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ...+|+...=+..=--++..+-++.+| T Consensus 311 a~y~Y~v~~vs~~GES~pS~~vtvTVaa 338 (468) T protein:vir:63 311 AAHEYKVVVSSDDAESIASEVATATVTA 338 (468) T ss_pred ceEEEEEEEECCCCccccccceEEEecC Confidence 011111111111111122233333333 No 242 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=30.90 E-value=1.5 Score=19.56 Aligned_cols=289 Identities=11% Similarity=0.036 Sum_probs=118.7 Q ss_pred CccchhhhHHHHHh------------------hccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCceEEE- Q lcl|NC_021307. 1 MAAGTAFPVNHTQI------------------AQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTGVKIP- 61 (310) Q Consensus 1 ~aa~~~~~~~~~~~------------------~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip- 61 (310) +-+..++....... ..++++...+ .-|.+ -.++++.-+..+..+++.+.||.+++.-|. T Consensus 49 ~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~i~~st~t~~v~~-~~P~L-i~lvRra~p~LIa~DIwGVQPMTgPTGLIFA 126 (529) T protein:vir:10 49 VYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAAGQSSGAITN-IGPAV-IGMVRRAIPSLIAFDIAGVQPMTGPTGQVFA 126 (529) T ss_pred ccchhhhhhhhhccchhhcccccccccccccccccccccccc-cCchh-hhhHHHHHHhhhhhhhheeccCCchhhhhhe Confidence 11111111111000 1111111111 12222 235566667778888999999876542211 Q ss_pred ---EEcCC-----------------------------------------------------------------------c Q lcl|NC_021307. 62 ---HWTGD-----------------------------------------------------------------------V 67 (310) Q Consensus 62 ---~~~~~-----------------------------------------------------------------------~ 67 (310) +.... . T Consensus 127 MRsrY~~~~~~~~~~eaf~~~~~pda~~sga~~~ga~t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~ 206 (529) T protein:vir:10 127 LRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASV 206 (529) T ss_pred eeeeecCCcccccccccccccccccccccccccccccccccccccccccccccccccccceeeecccCceeecccccccc Confidence 00000 0 Q ss_pred ee-e---------------------------e--ecc---------cccccccccceeeeEeeeeeeEeeehhhHHHhhc Q lcl|NC_021307. 68 SA-A---------------------------W--IGE---------GDMKPITKGDMSVQQVEPHKIATIFVASAETVRA 108 (310) Q Consensus 68 ~a-~---------------------------~--v~E---------g~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~ 108 (310) .+ . . .+| +.++++-.-.+++++++.+..+=....|-||.+| T Consensus 207 ~~g~~~t~~~~~~~~~~~~a~~~~~~~~~GmsTa~aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQD 286 (529) T protein:vir:10 207 TVGTNETGEALDKLINAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQD 286 (529) T ss_pred ccCccccCcccccccccccccccccccccchhhhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHH Confidence 00 0 0 001 1123444555667777777777777789999886 Q ss_pred C----hhHHHHHHHHHHHHHHHHHHHHHHHcccCccccccc-ccc-cc----c-ccceec-----ccchHH------HHH Q lcl|NC_021307. 109 N----PGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNL-DET-TK----S-VDLTPA-----TGTTYD------AIG 166 (310) Q Consensus 109 s----~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~-~~~-~~----~-~~~~~~-----~~~~~~------~~~ 166 (310) - ..|.|+.|.+.|+..|...+|+.|+.---+....+- ..+ +. + ...... .....+ ..+ T Consensus 287 LKAVHGLDAEtELsNILStEImlEINReii~~l~~~a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i 366 (529) T protein:vir:10 287 LRAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQI 366 (529) T ss_pred HHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhhhhccccccccccccceeecccCccccccchHHHHHHHHHHHH Confidence 4 467899999999999999999988864332111000 000 00 0 000000 000111 111 Q ss_pred HHHHHHhhh--hcCCCCEEEEehHHHHHHHHhh---hccCccccccccccccccccCCceee-eeeEEEeCCCCCCceeE Q lcl|NC_021307. 167 VNALSLLVN--AGKKWGATLLDDVAEPILNGAK---DANGRPLFVESTYEAVTTPYREGRIL-GRPTILSDHVASGTTVG 240 (310) Q Consensus 167 ~~~~~~l~~--~~~~~~~~~~~~~~~~~l~~l~---d~~g~~~~~~~~~~~~~~~~~~~~l~-G~pv~~t~~~~~~~~~~ 240 (310) ..+...+.. .....+.++|+++....|...- .+.+... ..............+.|. |++|+++++.+.+ .+ T Consensus 367 ~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~-~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--y~ 443 (529) T protein:vir:10 367 DKEANEIARQTGRGAGNFIIASRNVVSALALIDTNISPAAQGM-ASGLNADTTKGVFAGILGGRYKVYIDQYARQD--YF 443 (529) T ss_pred HHHHHHHHHhhccccceEEEEchHHHHHHHhhccccccccccc-ccccccccCCceEEEEecCceEEEecCCCCcc--eE Confidence 122222322 2235778999999998886321 1000000 000000111111223433 4699999887654 22 Q ss_pred eeecceeeeEEeecccEEEEeecceeeeccc------ccccchhhhhcCcEEEEEEEEeccEEec-----cC-ceEEEee Q lcl|NC_021307. 241 YLGDFSQIVWGQVGGLSFDVSDQATLNLGTP------QAPNFVSLWQHNLVAVRVEAEYGLLIND-----VE-AFVKLTN 308 (310) Q Consensus 241 ~~gd~~~~~~~~~~~~~v~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~r~~~~~d~~v~~-----~~-a~~~l~~ 308 (310) ++| ..+..+++ +++.+..+ ...+|.++ | =.++|+ .|++..+-= .+ ..+++.. T Consensus 444 ~vG--------~KG~~~~~----~glfy~PYv~l~~~~~~dp~sf-q-P~~g~~--tRY~l~~NP~~~~~~~~~~~r~~~ 507 (529) T protein:vir:10 444 TMG--------YRGANNLD----AGIYYCPYVALTPLRGFDPKNF-Q-PVMGFK--TRYAIGVNPFAESRTQAPQGRITS 507 (529) T ss_pred EEE--------EeCCcccc----cceeeccccccccccccCCCcc-c-ceeeee--eeeceeecCccccccccccccccC Confidence 222 11111111 11111100 01122221 1 123333 466554321 11 1222222 Q ss_pred cC Q lcl|NC_021307. 309 AA 310 (310) Q Consensus 309 aa 310 (310) .. T Consensus 508 g~ 509 (529) T protein:vir:10 508 GM 509 (529) T ss_pred Cc Confidence 22 No 243 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=26.03 E-value=2 Score=18.94 Aligned_cols=289 Identities=11% Similarity=0.007 Sum_probs=123.3 Q ss_pred CccchhhhHHHHHhhccccCC------CCceechhhHHHHHHHHH---hhchhhhhcceeecCCCceEEEEEcC---Cce Q lcl|NC_021307. 1 MAAGTAFPVNHTQIAQTGDSM------FQGYLEPEQAQDYFAEAE---KTSIVQRVARKIPMGSTGVKIPHWTG---DVS 68 (310) Q Consensus 1 ~aa~~~~~~~~~~~~~~~~~~------~g~~i~~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~ip~~~~---~~~ 68 (310) =|.-+.......++..+|-+. +|+.+..|-.++-|..|- +.-.+.+-..+.+..+...+|-.... ... T Consensus 11 ~~n~~~~~e~~~Ks~~agy~~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~di~k~~a~stv~~y~~~~~~G~~g~ 90 (467) T protein:vir:80 11 EVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGH 90 (467) T ss_pred hcccccCHHHHHHHHHcccccCCccccCcchhhhhhhhhhhheeeccccchhhhhhcccchhhhhhhhheeeeccCcccc Confidence 111111122223333333222 344565555444333322 22224444444555555444444333 244 Q ss_pred eeeecccccccccccceeeeEeeeeeeEeeehhhHHHhhc-ChhHHHHHHHHHHHHHHHHHHHHHHHcccCcc--ccccc Q lcl|NC_021307. 69 AAWIGEGDMKPITKGDMSVQQVEPHKIATIFVASAETVRA-NPGNYLGTMRTKVATAIALAFDEAALHGTDSP--FDKNL 145 (310) Q Consensus 69 a~~v~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~ell~~-s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~--~~~~~ 145 (310) +.++.|+...+.+++.+.......|-++..-.+|..+-.. +..+.+....+.-...++..+|.+.|.|+..- .+... T Consensus 91 ~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~ 170 (467) T protein:vir:80 91 TRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQ 170 (467) T ss_pred ccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCcc Confidence 6889999999999999999999999999866666554433 34577788888888889999999999998654 22211 Q ss_pred -----cccc---ccccceeccc-chHHHHHHHHHHHhhhhcCCCCEEEEehHHHHHH-HHhhhccCcccccccccccccc Q lcl|NC_021307. 146 -----DETT---KSVDLTPATG-TTYDAIGVNALSLLVNAGKKWGATLLDDVAEPIL-NGAKDANGRPLFVESTYEAVTT 215 (310) Q Consensus 146 -----~~~~---~~~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l-~~l~d~~g~~~~~~~~~~~~~~ 215 (310) .+.. ..-......+ ....+++..+...+...+....-++|+..+.+.| .......-+... +.... T Consensus 171 ~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~~L~~q~~v~~--~n~~~--- 245 (467) T protein:vir:80 171 AGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLVR--DNGNN--- 245 (467) T ss_pred ccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCceEEEEc--CCCCc--- Confidence 1111 1111111112 2223333334444555677778899999888777 332222221111 11111 Q ss_pred ccCCceeeeeeEE--EeC--CCCCCceeEeeecceeeeEE------eecccEEEEeecceeeecccccccchhhhhcC-- Q lcl|NC_021307. 216 PYREGRILGRPTI--LSD--HVASGTTVGYLGDFSQIVWG------QVGGLSFDVSDQATLNLGTPQAPNFVSLWQHN-- 283 (310) Q Consensus 216 ~~~~~~l~G~pv~--~t~--~~~~~~~~~~~gd~~~~~~~------~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~-- 283 (310) ...|.||- ++. .+...... +++|...+--. -..+-.+..+... ..... |... T Consensus 246 -----~~~G~~v~g~~sa~G~I~l~gs~-il~~~~~l~~~~~~~~~Apsp~~vsaT~~~--------~~~g~--~~~~~~ 309 (467) T protein:vir:80 246 -----VSVGFNIQGFHSARGFIKLHGST-VMENEQILDERILALPTAPQPAKVTATQEA--------GKKGQ--FRAEDL 309 (467) T ss_pred -----eeeeecccceecceeeeeecCce-eeccccCCCcccccccccccCCccceeeec--------ccCCc--ccCCCc Confidence 12222220 000 00000000 11111110000 0000000000000 00000 0000 Q ss_pred -cEEEEEEEEeccEEeccCceEEEeecC Q lcl|NC_021307. 284 -LVAVRVEAEYGLLINDVEAFVKLTNAA 310 (310) Q Consensus 284 -~~~~r~~~~~d~~v~~~~a~~~l~~aa 310 (310) ...+|+...=+..=--++..+-++.+| T Consensus 310 a~y~Y~v~~vs~~GES~pS~~vtvTVaa 337 (467) T protein:vir:80 310 AAHEYKVVVSSDDAESIASEVATATVTA 337 (467) T ss_pred ceEEEEEEEECCCCccccccceEEEecC Confidence 011111111111111122233333333 No 244 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=25.26 E-value=2.1 Score=18.84 Aligned_cols=286 Identities=14% Similarity=0.080 Sum_probs=119.7 Q ss_pred Cccc------------hhhhHHHH-----------HhhccccCCCCceechhhHHHHHHHHHhhchhhhhcceeecCCCc Q lcl|NC_021307. 1 MAAG------------TAFPVNHT-----------QIAQTGDSMFQGYLEPEQAQDYFAEAEKTSIVQRVARKIPMGSTG 57 (310) Q Consensus 1 ~aa~------------~~~~~~~~-----------~~~~~~~~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 57 (310) ++.. ..+-.++. .+..++++.-.. +-|. .-.++++.-+..+..+++.+.||.+++ T Consensus 43 ~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~es~~t~~v~~-~~P~-Li~lvRRa~p~LIa~DIwGVQPMTgPT 120 (528) T protein:vir:66 43 FAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAAGQTTGAITN-VGPA-VIGMVRRAIPNLIAFDICGVQPMSTPT 120 (528) T ss_pred hhcccchhhHHHHHhhhhhhhhhcccccccccchhcccccccccccc-Cchh-HHHHHHHHHHhhhhhhhheeecCCchh Confidence 1111 01111111 011122222111 1122 223556666777888899999997631 Q ss_pred eE-------E-----------------------------------------EEE------cCCce--------------- Q lcl|NC_021307. 58 VK-------I-----------------------------------------PHW------TGDVS--------------- 68 (310) Q Consensus 58 ~~-------i-----------------------------------------p~~------~~~~~--------------- 68 (310) .. + ... ..+.+ T Consensus 121 GlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~t~~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~ 200 (528) T protein:vir:66 121 SQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLAAKEATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQN 200 (528) T ss_pred hhheeeeeeecCCcccccccccccccccccccccccccccccccCCccceeecccccccccccceeeecccccceeeecc Confidence 00 0 000 00000 Q ss_pred ----------------------------ee--------eecc---------cccccccccceeeeEeeeeeeEeeehhhH Q lcl|NC_021307. 69 ----------------------------AA--------WIGE---------GDMKPITKGDMSVQQVEPHKIATIFVASA 103 (310) Q Consensus 69 ----------------------------a~--------~v~E---------g~~~~~~~~~~~~i~l~~~k~~~~~~is~ 103 (310) .+ -.+| +.++++-.-.+++++++.+..+=....|- T Consensus 201 ~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~Ta~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTi 280 (528) T protein:vir:66 201 VTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSI 280 (528) T ss_pred ccccccccCcccccccccccccccccccceecccccchhhhhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccH Confidence 00 0011 12245555566777777777777778899 Q ss_pred HHhhcC----hhHHHHHHHHHHHHHHHHHHHHHHHcccCcccccccccccccc-------cce---eccc--chHHHH-- Q lcl|NC_021307. 104 ETVRAN----PGNYLGTMRTKVATAIALAFDEAALHGTDSPFDKNLDETTKSV-------DLT---PATG--TTYDAI-- 165 (310) Q Consensus 104 ell~~s----~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~~~~~~~~~~~~~~-------~~~---~~~~--~~~~~~-- 165 (310) ||.+|- ..|.|..|.+.|+..|...+|+.|+.=-......+-.+.+..+ ... -..+ ...+.. T Consensus 281 ELAQDLKAIHGLDAEtELsNILStEImlEINREii~~i~~~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~ 360 (528) T protein:vir:66 281 EVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKS 360 (528) T ss_pred HHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeecccccccccchhHHHHHHH Confidence 998864 4689999999999999999999997432222111111111000 000 0000 001111 Q ss_pred ----HHHHHHHhhh--hcCCCCEEEEehHHHHHHHHhh--h----ccCccccccccccccccccCCceee-eeeEEEeCC Q lcl|NC_021307. 166 ----GVNALSLLVN--AGKKWGATLLDDVAEPILNGAK--D----ANGRPLFVESTYEAVTTPYREGRIL-GRPTILSDH 232 (310) Q Consensus 166 ----~~~~~~~l~~--~~~~~~~~~~~~~~~~~l~~l~--d----~~g~~~~~~~~~~~~~~~~~~~~l~-G~pv~~t~~ 232 (310) +..+...+.. .....+.++|+++....|...- + ......+..+... ....+.|. +++|+++++ T Consensus 361 L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~----~~~~G~l~~~~~vy~D~y 436 (528) T protein:vir:66 361 LIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTK----AVFAGVLAGKYKVFIDQY 436 (528) T ss_pred HHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCC----ceeEEEecCceEEEecCC Confidence 1122222322 2334578999999998886421 0 1111111111111 11123343 479999988 Q ss_pred CCCCceeEeee---cc---eeeeEEeecccEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEe-------- Q lcl|NC_021307. 233 VASGTTVGYLG---DF---SQIVWGQVGGLSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLIN-------- 298 (310) Q Consensus 233 ~~~~~~~~~~g---d~---~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~-------- 298 (310) .+.+ .+++| +- ..+++...-.....+. .+ |.+ || =.++|+ .|++..+- T Consensus 437 ~~~d--y~~vG~KG~~~~~~glfyaPYv~l~~~~~--------~d----p~s-fq-P~~g~~--tRY~l~vNP~~~~~~~ 498 (528) T protein:vir:66 437 ARQD--YFTVGYKGDNEMDAGIYYAPYVALTPLRA--------TD----PQS-FH-PVLGFK--TRYGIGINPFADSKSQ 498 (528) T ss_pred CCcc--eEEEEEeCCcccccceeecccccceeeEe--------eC----Ccc-cc-ceeeee--eeeceeecCcccccCc Confidence 7654 22222 11 1122222211111111 11 111 11 122332 34444321 Q ss_pred ----------------ccCc-eEEEeecC Q lcl|NC_021307. 299 ----------------DVEA-FVKLTNAA 310 (310) Q Consensus 299 ----------------~~~a-~~~l~~aa 310 (310) ..++ |.++-.|= T Consensus 499 ~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~ 527 (528) T protein:vir:66 499 EPSARITSGMLSKDSVGKNAYFRRVWVKG 527 (528) T ss_pred cccccccccchhhhhcCccceeEEeeecc Confidence 0011 11111111 No 245 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=22.56 E-value=2.4 Score=18.47 Aligned_cols=277 Identities=12% Similarity=0.020 Sum_probs=121.9 Q ss_pred hccccCCCCce-echhhHHHHHHHHHhhchhhh-hcceee---------c---CCCceEEEEEcCCceeeeecccccc-- Q lcl|NC_021307. 15 AQTGDSMFQGY-LEPEQAQDYFAEAEKTSIVQR-VARKIP---------M---GSTGVKIPHWTGDVSAAWIGEGDMK-- 78 (310) Q Consensus 15 ~~~~~~~~g~~-i~~~~~~~ii~~~~~~s~l~~-~~~~~~---------~---~~~~~~ip~~~~~~~a~~v~Eg~~~-- 78 (310) |..+....+.. ....++..+.....+.+++.. +...-. . .+.++++.... .-...+|.+++.. T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~-~L~g~gv~Gd~~leG 79 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSV-HLRGKPTYGDARVEG 79 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeee-ecccCCcccCceeec Confidence 43333333222 245667777777777776554 422100 0 01222222211 1122344333333 Q ss_pred cccccceeeeEeeeeeeEeeehhhHHHh-hcChhHHHHHHHHHHHHHHHHHHHHHHHcc-cCccccc------------- Q lcl|NC_021307. 79 PITKGDMSVQQVEPHKIATIFVASAETV-RANPGNYLGTMRTKVATAIALAFDEAALHG-TDSPFDK------------- 143 (310) Q Consensus 79 ~~~~~~~~~i~l~~~k~~~~~~is~ell-~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G-~g~~~~~------------- 143 (310) .+....|.+-++.+..+..-+.....+- +.+..+|...-++.|..-+.+..|+.+|.- .|+-... T Consensus 80 nee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~~~~ 159 (364) T protein:vir:93 80 KEESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFTGYA 159 (364) T ss_pred cccceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCccccc Confidence 3666788777777776666665433322 356789999999999999999999966632 2211000 Q ss_pred -ccc----------c--ccccccceecccchHHHHHHHHHHHhhhhc----------------CCCCEEEEehHHHHHHH Q lcl|NC_021307. 144 -NLD----------E--TTKSVDLTPATGTTYDAIGVNALSLLVNAG----------------KKWGATLLDDVAEPILN 194 (310) Q Consensus 144 -~~~----------~--~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~----------------~~~~~~~~~~~~~~~l~ 194 (310) +.. . .+.... ..+++....+.+..+...++... ...-+++|||..+..|+ T Consensus 160 ~N~v~aPt~~r~~~~~~at~~~~-l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) T protein:vir:93 160 GNPLDAPDVDHLLYGGVATSKAS-LAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQATDMR 238 (364) T ss_pred ccccCCCCCCcEEeccccCchhh-ccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhhhhh Confidence 000 0 000000 11122222233333333322221 12337899999999987 Q ss_pred HhhhccCccc-ccccc-----ccccccccCCceeeeeeEEEeCCCC-------CCce----eEeeecceeeeE--Eeecc Q lcl|NC_021307. 195 GAKDANGRPL-FVEST-----YEAVTTPYREGRILGRPTILSDHVA-------SGTT----VGYLGDFSQIVW--GQVGG 255 (310) Q Consensus 195 ~l~d~~g~~~-~~~~~-----~~~~~~~~~~~~l~G~pv~~t~~~~-------~~~~----~~~~gd~~~~~~--~~~~~ 255 (310) .-.+ |.++ ++... .......+.-+.+.|++++--.++. .+.. .+++| .+.+.+ +-.+| T Consensus 239 ~~t~--~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllG-aQA~~~a~g~~~g 315 (364) T protein:vir:93 239 TAAG--GTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMG-RQAGVIAYGTANG 315 (364) T ss_pred hcCC--HHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccccccCccccchhhheec-ceeeEEEeecCCC Confidence 4332 1111 11110 0112223344667888876554442 1111 11222 233222 22234 Q ss_pred cEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecc----CceEEEeecC Q lcl|NC_021307. 256 LSFDVSDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYGLLINDV----EAFVKLTNAA 310 (310) Q Consensus 256 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~----~a~~~l~~aa 310 (310) +.+...++.. + -.|...+-+...+|++..+= -.+..|--+| T Consensus 316 ~~~~w~Ee~~-----D---------~gn~~~i~~~~i~G~kK~rF~~~DfGvi~idtaa 360 (364) T protein:vir:93 316 LRFDWEETVK-----D---------YGNEPAIAAGFIAGMKKARFNNKDFGVISIDTAA 360 (364) T ss_pred CCceeeeccc-----C---------CCCchhhhhhhHhhhhhcccCCccceEEEecccc Confidence 4443322211 0 12233343444444433322 2334443444 No 246 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=21.91 E-value=2.5 Score=18.38 Aligned_cols=295 Identities=13% Similarity=0.060 Sum_probs=128.7 Q ss_pred ccchh---hhHHHHHhhccccCCCCceechhhHHHHHHHHH---hhchhhhhcceeecCCCceEEEEEcC---Cceeeee Q lcl|NC_021307. 2 AAGTA---FPVNHTQIAQTGDSMFQGYLEPEQAQDYFAEAE---KTSIVQRVARKIPMGSTGVKIPHWTG---DVSAAWI 72 (310) Q Consensus 2 aa~~~---~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~ip~~~~---~~~a~~v 72 (310) ...+- ++.+...+.+. ....|+.+.-+..++-+..+- +.-.+++-....|..+...++-.... ....... T Consensus 1 ~~~~~~~~~~~a~~~al~~-a~~~g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~rhG~~g~s~~ 79 (470) T protein:vir:10 1 MPYEHLKHLDEATLKALNA-AGQVAESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYEHEYNVVTARHDKIGYAAF 79 (470) T ss_pred CChhHhhhhhHHHHHHHHH-hhhcchhhhhhhhccceeEeeecCccchhhhhcCCchhhhHhhhhhhhccccccccceee Confidence 11111 11111111111 222233355444444333332 22234555555666665555543222 2233456 Q ss_pred cccccccccccceeeeEeeeeeeEeeehhhHHH---hhcChhHHHHHHHHHHHHHHHHHHHHHHHcccCc--------cc Q lcl|NC_021307. 73 GEGDMKPITKGDMSVQQVEPHKIATIFVASAET---VRANPGNYLGTMRTKVATAIALAFDEAALHGTDS--------PF 141 (310) Q Consensus 73 ~Eg~~~~~~~~~~~~i~l~~~k~~~~~~is~el---l~~s~~~~~~~v~~~l~~a~~~~~d~~~l~G~g~--------~~ 141 (310) .|++-.+.+++.+...+...|-++....+|.-. ++.+..+++..+.+.---.++..+|.+.|.|+.. .. T Consensus 80 ~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~FyGDs~l~s~~~g~~~ 159 (470) T protein:vir:10 80 REGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYGDNLLGDDVPGSPN 159 (470) T ss_pred cccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhhhhhccccccccCcccC Confidence 899999999999999999999999999999774 3344558888888888888999999999999652 12 Q ss_pred ccccccccccccce-------ecccchHHHHHHHHHHHh--hhhcCCCCEEEEehHHHHHHHHhhhccCccccccccccc Q lcl|NC_021307. 142 DKNLDETTKSVDLT-------PATGTTYDAIGVNALSLL--VNAGKKWGATLLDDVAEPILNGAKDANGRPLFVESTYEA 212 (310) Q Consensus 142 ~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~~~~~l~~l~d~~g~~~~~~~~~~~ 212 (310) +....+..+-+... .-+.....+.+......+ ...+..+.-++|+..+.+.|..-....-|.+.+.+.... T Consensus 160 gleFDGl~~lId~~~~~NViDarG~~Ls~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~~~~~qRv~~~~N~~~~ 239 (470) T protein:vir:10 160 NLQQDGIINIIKRGAPQNVLDAGGRPLSIDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQASFYQISRVMTTADRRAG 239 (470) T ss_pred ceeccchhhhccCCCCccccccCCCCccHHHHHHHHhhhcccccccChhhhccchhHHHHHHHhhcCceEEEEecCCCce Confidence 22223332222211 112222334444444444 467888889999999999998766666665555443221 Q ss_pred cccccCCceeeeeeE--EEeCC--CCCCceeEeeecceeee---EEe------ecccEEEEeec-ceeeecccccccchh Q lcl|NC_021307. 213 VTTPYREGRILGRPT--ILSDH--VASGTTVGYLGDFSQIV---WGQ------VGGLSFDVSDQ-ATLNLGTPQAPNFVS 278 (310) Q Consensus 213 ~~~~~~~~~l~G~pv--~~t~~--~~~~~~~~~~gd~~~~~---~~~------~~~~~v~~~~~-~~~~~~~~~~~~~~~ 278 (310) ..|+|| +++-. +.-+...+ +.++.+.. +.. .-.+.+.++.. .......+.... T Consensus 240 ---------~~G~~v~~f~sa~G~I~L~~s~~-m~~~~k~~p~~l~~~v~~~aAP~~~~tv~~t~~~~a~~~~sk~g--- 306 (470) T protein:vir:10 240 ---------LLGADAQSYIGVRGEHSLYPSQF-LGDFHKFNPARFGAEVGDFAAPSNSWTVSTTDNFVTLPYNSGLG--- 306 (470) T ss_pred ---------eeeeeccceeeeeeeeeeccccc-ccchhhcCcccCCcccCCcccCceeEEeecCCCceeecccCCCC--- Confidence 122221 11100 00000000 01000000 000 00011111100 000000000000 Q ss_pred hhh-cC--cEEEEEEEEeccE------------------------EeccCceEEEe-ecC Q lcl|NC_021307. 279 LWQ-HN--LVAVRVEAEYGLL------------------------INDVEAFVKLT-NAA 310 (310) Q Consensus 279 ~~~-~~--~~~~r~~~~~d~~------------------------v~~~~a~~~l~-~aa 310 (310) -|. ++ +..+++-.+.|=. ..+++-|..-. .++ T Consensus 307 ~~~~~~v~sy~y~v~~~~gds~s~~v~vt~t~~~v~kgv~ltI~~~~~v~yv~IYRk~~~ 366 (470) T protein:vir:10 307 DPANTTVYSYAFKAANFYGESAAKYIDVYIDSTEAGKGVRFQFHGLVNVKWLDVYRKDPG 366 (470) T ss_pred cccCcceeEEEEEEEEecCCCCcceEEEEEeeehhcceeEEEEecCCCCcEEEEEeecCC Confidence 000 00 0111111111100 00011111110 000 Done!