Query lcl|Aclame:protein:vir:94673|NCBI_annot:major capsid protein|genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Match_columns 419 No_of_seqs 143 out of 1167 Neff 10.1 Searched_HMMs 1612 Date Mon Dec 2 04:15:50 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_36 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_36_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:94673 Length: 419 100.0 8.8E-81 5.5E-84 459.5 45.1 419 1-419 1-419 (419) 2 protein:vir:100135 Length: 418 100.0 1.7E-67 1E-70 386.7 42.4 399 1-419 4-417 (418) 3 protein:vir:4339 Length: 395 # 100.0 2.3E-66 1.4E-69 380.5 43.6 393 1-417 1-395 (395) 4 protein:vir:101650 Length: 497 100.0 2.3E-66 1.4E-69 380.4 41.5 409 1-419 1-495 (497) 5 protein:vir:7855 Length: 497 # 100.0 2.3E-66 1.4E-69 380.4 41.5 409 1-419 1-495 (497) 6 protein:vir:10364 Length: 390 100.0 4.3E-66 2.7E-69 378.9 41.1 388 1-415 1-390 (390) 7 protein:vir:485 Length: 407 # 100.0 4.5E-66 2.8E-69 378.9 40.9 388 1-419 1-402 (407) 8 protein:vir:81070 Length: 390 100.0 4.9E-66 3.1E-69 378.6 40.9 388 1-415 1-390 (390) 9 protein:vir:97053 Length: 390 100.0 6E-66 3.7E-69 378.2 40.6 388 1-415 1-390 (390) 10 protein:vir:1886 Length: 385 # 100.0 1.8E-65 1.1E-68 375.5 40.5 384 1-418 1-385 (385) 11 protein:vir:191 Length: 385 # 100.0 1.8E-65 1.1E-68 375.5 40.5 384 1-418 1-385 (385) 12 protein:vir:1328 Length: 392 # 100.0 1.9E-65 1.2E-68 375.4 38.9 388 1-418 1-392 (392) 13 protein:vir:105038 Length: 428 100.0 1.4E-65 8.5E-69 376.2 38.0 402 1-417 1-428 (428) 14 protein:vir:4456 Length: 401 # 100.0 1.3E-64 7.9E-68 370.9 40.3 386 1-417 1-401 (401) 15 protein:vir:6242 Length: 390 # 100.0 9.2E-65 5.7E-68 371.7 38.1 386 1-418 1-390 (390) 16 protein:vir:100247 Length: 425 100.0 7.2E-64 4.5E-67 366.8 37.3 381 1-418 21-425 (425) 17 protein:vir:81227 Length: 413 100.0 2.4E-62 1.5E-65 358.4 42.6 400 1-419 2-412 (413) 18 protein:vir:80376 Length: 435 100.0 8.8E-63 5.5E-66 360.8 38.8 402 1-419 1-435 (435) 19 protein:vir:1433 Length: 435 # 100.0 1.3E-62 8.1E-66 359.9 38.9 401 1-419 1-435 (435) 20 protein:vir:4997 Length: 397 # 100.0 6.4E-62 4E-65 356.1 39.9 376 1-419 1-387 (397) 21 protein:vir:101607 Length: 379 100.0 1.5E-61 9.3E-65 354.0 41.5 376 1-417 1-379 (379) 22 protein:vir:4953 Length: 397 # 100.0 1.2E-61 7.3E-65 354.6 40.6 376 1-419 1-387 (397) 23 protein:vir:4700 Length: 415 # 100.0 2.3E-61 1.4E-64 353.0 41.6 397 1-419 1-406 (415) 24 protein:vir:4600 Length: 415 # 100.0 2.3E-61 1.4E-64 353.0 41.6 397 1-419 1-406 (415) 25 protein:vir:4830 Length: 397 # 100.0 1.3E-61 8.3E-65 354.3 39.7 378 1-419 1-387 (397) 26 protein:vir:4511 Length: 409 # 100.0 6.1E-62 3.8E-65 356.2 37.2 395 1-419 1-408 (409) 27 protein:vir:104256 Length: 458 100.0 3.1E-61 2E-64 352.3 40.4 401 1-417 5-458 (458) 28 protein:vir:81100 Length: 415 100.0 7.1E-61 4.4E-64 350.3 41.6 397 1-419 1-406 (415) 29 protein:vir:98339 Length: 415 100.0 7.1E-61 4.4E-64 350.3 41.6 397 1-419 1-406 (415) 30 protein:vir:79987 Length: 415 100.0 7.1E-61 4.4E-64 350.3 41.6 397 1-419 1-406 (415) 31 protein:vir:9410 Length: 415 # 100.0 1.4E-60 8.7E-64 348.7 40.9 396 1-419 1-406 (415) 32 protein:vir:7409 Length: 408 # 100.0 8.8E-61 5.4E-64 349.8 39.1 382 1-419 1-395 (408) 33 protein:vir:95376 Length: 425 100.0 1.3E-60 7.8E-64 349.0 39.1 393 1-419 6-423 (425) 34 protein:vir:1025 Length: 408 # 100.0 1.3E-60 7.8E-64 349.0 38.6 382 1-419 1-395 (408) 35 protein:vir:1268 Length: 397 # 100.0 7.9E-60 4.9E-63 344.6 41.0 383 1-417 1-397 (397) 36 protein:vir:81160 Length: 371 100.0 4E-60 2.5E-63 346.2 38.8 356 1-417 1-371 (371) 37 protein:vir:3991 Length: 404 # 100.0 6.7E-60 4.2E-63 345.0 39.4 382 1-419 1-395 (404) 38 protein:vir:8102 Length: 543 # 100.0 2E-59 1.2E-62 342.4 41.0 387 1-418 126-543 (543) 39 protein:vir:102119 Length: 404 100.0 1.5E-59 9.3E-63 343.1 39.0 389 1-419 1-402 (404) 40 protein:vir:6212 Length: 434 # 100.0 1.5E-58 9.3E-62 337.6 40.2 402 1-419 1-433 (434) 41 protein:vir:3845 Length: 395 # 100.0 2.6E-58 1.6E-61 336.3 39.8 371 1-419 1-385 (395) 42 protein:vir:9704 Length: 394 # 100.0 5.7E-58 3.5E-61 334.4 40.8 383 1-419 1-392 (394) 43 protein:vir:102873 Length: 392 100.0 7E-58 4.3E-61 333.9 40.6 373 1-419 1-386 (392) 44 protein:vir:107593 Length: 392 100.0 7E-58 4.3E-61 333.9 40.6 373 1-419 1-386 (392) 45 protein:vir:105004 Length: 392 100.0 7E-58 4.3E-61 333.9 40.6 373 1-419 1-386 (392) 46 protein:vir:102082 Length: 392 100.0 7E-58 4.3E-61 333.9 40.6 373 1-419 1-386 (392) 47 protein:vir:1383 Length: 421 # 100.0 4.1E-58 2.5E-61 335.2 38.6 376 1-419 1-385 (421) 48 protein:vir:2685 Length: 387 # 100.0 1.8E-58 1.1E-61 337.2 33.5 374 1-419 1-383 (387) 49 protein:vir:94424 Length: 387 100.0 1.8E-58 1.1E-61 337.2 33.5 374 1-419 1-383 (387) 50 protein:vir:96978 Length: 387 100.0 1.8E-58 1.1E-61 337.2 33.5 374 1-419 1-383 (387) 51 protein:vir:93881 Length: 387 100.0 7E-58 4.3E-61 333.9 34.9 374 1-419 1-383 (387) 52 protein:vir:3870 Length: 400 # 100.0 6.3E-57 3.9E-60 328.7 39.6 383 1-418 1-400 (400) 53 protein:vir:9361 Length: 402 # 100.0 6.5E-58 4E-61 334.1 33.4 374 1-419 16-398 (402) 54 protein:vir:80128 Length: 466 100.0 5.8E-57 3.6E-60 328.9 36.7 400 1-419 16-450 (466) 55 protein:vir:8420 Length: 477 # 100.0 7.7E-57 4.8E-60 328.2 37.2 407 1-419 8-473 (477) 56 protein:vir:100884 Length: 389 100.0 8E-56 5E-59 322.7 39.2 370 1-419 1-384 (389) 57 protein:vir:98635 Length: 377 100.0 2.2E-57 1.3E-60 331.2 30.5 359 1-417 1-377 (377) 58 protein:vir:100172 Length: 394 100.0 1E-55 6.3E-59 322.1 38.7 371 1-419 1-386 (394) 59 protein:vir:4092 Length: 390 # 100.0 1.3E-55 8.3E-59 321.4 38.5 358 4-419 1-370 (390) 60 protein:vir:93616 Length: 645 100.0 5.7E-56 3.5E-59 323.5 36.3 396 1-419 193-641 (645) 61 protein:vir:962 Length: 397 # 100.0 9.1E-56 5.6E-59 322.3 36.2 370 1-417 1-397 (397) 62 protein:vir:1084 Length: 437 # 100.0 6.2E-55 3.8E-58 317.8 38.2 386 1-419 1-431 (437) 63 protein:vir:9574 Length: 300 # 100.0 9.9E-56 6.1E-59 322.1 28.6 279 125-417 1-300 (300) 64 protein:vir:7771 Length: 330 # 100.0 1.5E-55 9E-59 321.2 28.5 295 115-419 1-325 (330) 65 protein:vir:41 Length: 299 # N 100.0 1.6E-55 9.8E-59 321.0 28.5 282 119-418 1-299 (299) 66 protein:vir:1638 Length: 298 # 100.0 2.7E-55 1.7E-58 319.7 28.7 280 128-416 1-298 (298) 67 protein:vir:9759 Length: 303 # 100.0 3.1E-55 1.9E-58 319.4 28.7 284 126-417 1-303 (303) 68 protein:vir:105905 Length: 304 100.0 3.6E-55 2.2E-58 319.1 27.8 285 116-416 1-304 (304) 69 protein:vir:94142 Length: 304 100.0 3.6E-55 2.2E-58 319.1 27.8 285 116-416 1-304 (304) 70 protein:vir:78640 Length: 352 100.0 4E-54 2.5E-57 313.3 30.7 346 1-419 1-348 (352) 71 protein:vir:94771 Length: 298 100.0 1.6E-54 1E-57 315.4 28.4 280 128-416 1-298 (298) 72 protein:vir:5739 Length: 366 # 100.0 2.3E-54 1.4E-57 314.6 29.1 341 56-417 1-366 (366) 73 protein:vir:97148 Length: 324 100.0 1E-53 6.2E-57 311.1 29.8 301 80-419 1-317 (324) 74 protein:vir:78523 Length: 338 100.0 1E-53 6.3E-57 311.1 29.0 308 107-419 1-337 (338) 75 protein:vir:78223 Length: 333 100.0 1.1E-53 6.5E-57 311.0 28.9 307 107-418 1-333 (333) 76 protein:vir:96392 Length: 324 100.0 1.7E-53 1.1E-56 309.8 29.7 301 84-419 1-317 (324) 77 protein:vir:78830 Length: 324 100.0 1.7E-53 1.1E-56 309.8 29.7 301 84-419 1-317 (324) 78 protein:vir:9309 Length: 324 # 100.0 2.6E-53 1.6E-56 308.9 30.1 299 91-419 1-317 (324) 79 protein:vir:96762 Length: 632 100.0 1.8E-52 1.1E-55 304.3 33.9 385 1-416 185-632 (632) 80 protein:vir:9643 Length: 377 # 100.0 1.4E-52 9E-56 304.8 32.8 347 1-417 1-377 (377) 81 protein:vir:4226 Length: 326 # 100.0 1.9E-53 1.2E-56 309.6 27.6 305 97-419 1-325 (326) 82 protein:vir:100632 Length: 381 100.0 1.1E-52 7.1E-56 305.4 31.5 347 1-419 1-375 (381) 83 protein:vir:104085 Length: 320 100.0 2.2E-53 1.3E-56 309.3 27.0 300 101-419 1-319 (320) 84 protein:vir:8187 Length: 311 # 100.0 4.5E-53 2.8E-56 307.6 28.3 281 126-418 1-311 (311) 85 protein:vir:99749 Length: 324 100.0 8.9E-53 5.5E-56 305.9 29.6 301 80-419 1-317 (324) 86 protein:vir:80684 Length: 315 100.0 3.2E-53 2E-56 308.4 26.9 281 124-419 1-308 (315) 87 protein:vir:103955 Length: 324 100.0 1.1E-52 7.1E-56 305.4 29.5 301 80-419 1-317 (324) 88 protein:vir:101291 Length: 381 100.0 4.6E-52 2.8E-55 302.0 32.8 347 1-419 1-372 (381) 89 protein:vir:9509 Length: 381 # 100.0 4.6E-52 2.8E-55 302.0 32.8 347 1-419 1-372 (381) 90 protein:vir:4856 Length: 293 # 100.0 1E-52 6.4E-56 305.6 28.0 272 119-419 1-283 (293) 91 protein:vir:96223 Length: 324 100.0 2.7E-52 1.7E-55 303.3 29.5 301 80-419 1-317 (324) 92 protein:vir:95963 Length: 395 100.0 5.6E-51 3.5E-54 296.1 36.3 358 1-419 1-378 (395) 93 protein:vir:95763 Length: 297 100.0 3.6E-52 2.3E-55 302.6 28.6 281 116-418 1-297 (297) 94 protein:vir:2344 Length: 397 # 100.0 2.5E-52 1.5E-55 303.5 25.9 288 104-419 1-308 (397) 95 protein:vir:78350 Length: 383 100.0 1.5E-51 9.3E-55 299.2 29.6 365 1-419 1-377 (383) 96 protein:vir:2430 Length: 318 # 100.0 4.4E-52 2.7E-55 302.2 26.0 295 101-419 1-315 (318) 97 protein:vir:99920 Length: 311 100.0 1.1E-51 6.6E-55 300.0 27.4 283 125-417 1-311 (311) 98 protein:vir:2504 Length: 305 # 100.0 1.5E-50 9.6E-54 293.7 27.0 279 124-419 1-302 (305) 99 protein:vir:97397 Length: 517 100.0 4.8E-38 3E-41 225.2 30.4 380 1-419 124-516 (517) 100 protein:vir:4197 Length: 314 # 100.0 3.5E-38 2.2E-41 225.9 25.3 293 104-419 1-313 (314) 101 protein:vir:4159 Length: 315 # 100.0 2E-38 1.3E-41 227.2 23.0 295 96-414 1-315 (315) 102 protein:vir:3158 Length: 321 # 100.0 1.1E-33 6.6E-37 201.4 26.1 298 101-419 1-321 (321) 103 protein:vir:4074 Length: 480 # 100.0 3.6E-34 2.2E-37 203.9 21.1 360 1-419 111-479 (480) 104 protein:vir:3033 Length: 272 # 100.0 5E-32 3.1E-35 192.2 24.6 264 124-419 1-271 (272) 105 protein:vir:9820 Length: 272 # 100.0 5E-32 3.1E-35 192.2 24.6 264 124-419 1-271 (272) 106 protein:vir:93742 Length: 274 99.9 2.6E-24 1.6E-27 149.9 22.5 265 124-419 1-272 (274) 107 protein:vir:3613 Length: 272 # 99.9 2.2E-23 1.4E-26 144.8 20.3 263 124-417 1-272 (272) 108 protein:vir:96833 Length: 275 99.8 4E-22 2.5E-25 137.9 20.9 266 123-419 1-273 (275) 109 protein:vir:97433 Length: 274 99.8 8.1E-22 5E-25 136.2 22.6 265 124-419 1-272 (274) 110 protein:vir:94494 Length: 274 99.8 8.1E-22 5E-25 136.2 22.6 265 124-419 1-272 (274) 111 protein:vir:96123 Length: 274 99.8 7.1E-22 4.4E-25 136.5 22.0 265 124-419 1-272 (274) 112 protein:vir:80930 Length: 278 99.8 6.7E-22 4.2E-25 136.7 21.8 269 124-418 1-278 (278) 113 protein:vir:105334 Length: 276 99.8 8.4E-22 5.2E-25 136.1 21.3 265 124-419 1-272 (276) 114 protein:vir:1239 Length: 274 # 99.8 1.5E-20 9.1E-24 129.3 21.5 261 124-419 1-272 (274) 115 protein:vir:94933 Length: 330 99.8 9.2E-21 5.7E-24 130.4 17.7 305 94-418 1-330 (330) 116 protein:vir:96262 Length: 274 99.8 1.1E-19 6.5E-23 124.6 21.6 265 124-419 1-272 (274) 117 protein:vir:95898 Length: 274 99.8 1.1E-19 6.5E-23 124.6 21.6 265 124-419 1-272 (274) 118 protein:vir:95107 Length: 270 99.7 2.4E-19 1.5E-22 122.7 20.5 261 126-419 1-267 (270) 119 protein:vir:79928 Length: 393 99.7 5.2E-19 3.2E-22 120.8 19.8 350 28-419 1-383 (393) 120 protein:vir:739 Length: 231 # 99.7 4.8E-18 3E-21 115.5 17.4 226 159-417 1-231 (231) 121 protein:vir:108211 Length: 318 99.6 1.2E-17 7.3E-21 113.4 16.5 286 118-418 1-318 (318) 122 protein:vir:93858 Length: 400 99.6 6.3E-17 3.9E-20 109.4 19.8 381 1-415 8-400 (400) 123 protein:vir:97255 Length: 310 99.6 5E-16 3.1E-19 104.5 22.5 285 123-417 1-310 (310) 124 protein:vir:99424 Length: 360 99.6 2.8E-15 1.7E-18 100.4 22.3 307 88-419 1-359 (360) 125 protein:vir:8324 Length: 410 # 99.5 8.2E-15 5.1E-18 97.8 19.7 380 1-415 1-410 (410) 126 protein:vir:7990 Length: 273 # 99.5 6.5E-15 4E-18 98.4 19.1 260 128-417 1-273 (273) 127 protein:vir:102605 Length: 273 99.5 7E-15 4.3E-18 98.2 19.2 259 128-417 1-273 (273) 128 protein:vir:105822 Length: 273 99.5 7E-15 4.3E-18 98.2 19.2 259 128-417 1-273 (273) 129 protein:vir:80180 Length: 381 99.4 8.7E-14 5.4E-17 92.2 17.2 294 113-419 1-306 (381) 130 protein:vir:94576 Length: 347 99.3 7.6E-14 4.7E-17 92.5 15.3 291 98-417 1-347 (347) 131 protein:vir:94622 Length: 341 99.3 1E-13 6.4E-17 91.8 15.9 289 116-419 1-341 (341) 132 protein:vir:2201 Length: 345 # 99.3 2.4E-13 1.5E-16 89.7 15.8 290 110-417 1-345 (345) 133 protein:vir:8885 Length: 347 # 99.3 2.5E-13 1.6E-16 89.7 15.5 296 98-418 1-347 (347) 134 protein:vir:1541 Length: 347 # 99.2 9.1E-13 5.6E-16 86.6 15.7 295 110-419 1-347 (347) 135 protein:vir:3364 Length: 347 # 99.2 5.1E-13 3.2E-16 88.0 14.0 297 98-419 1-347 (347) 136 protein:vir:94711 Length: 347 99.2 2.1E-13 1.3E-16 90.1 11.8 291 110-418 1-347 (347) 137 protein:vir:5974 Length: 324 # 99.2 1E-11 6.3E-15 80.9 19.1 270 126-419 1-294 (324) 138 protein:vir:10450 Length: 344 99.1 2E-12 1.3E-15 84.7 13.4 288 114-417 1-344 (344) 139 protein:vir:6324 Length: 335 # 99.1 2.2E-11 1.4E-14 79.0 18.9 288 113-419 1-330 (335) 140 protein:vir:1583 Length: 351 # 99.1 3.6E-11 2.3E-14 77.8 18.6 273 126-419 1-298 (351) 141 protein:vir:99675 Length: 324 99.1 8.9E-12 5.5E-15 81.2 15.0 249 157-419 1-298 (324) 142 protein:vir:3136 Length: 322 # 99.1 2.2E-11 1.4E-14 79.0 16.9 287 123-419 1-320 (322) 143 protein:vir:80213 Length: 334 99.1 1.9E-11 1.2E-14 79.4 16.5 291 110-419 1-334 (334) 144 protein:vir:78935 Length: 335 99.1 5.6E-11 3.5E-14 76.8 19.0 288 113-419 1-330 (335) 145 protein:vir:102944 Length: 330 99.1 7.2E-11 4.5E-14 76.2 19.3 273 124-419 1-300 (330) 146 protein:vir:95318 Length: 328 99.1 1.3E-11 8.1E-15 80.2 15.1 235 100-360 1-328 (328) 147 protein:vir:103285 Length: 296 99.1 4E-11 2.5E-14 77.6 17.7 277 118-415 1-296 (296) 148 protein:vir:103323 Length: 364 99.0 4.4E-10 2.8E-13 71.9 21.2 289 118-419 1-341 (364) 149 protein:vir:78739 Length: 332 99.0 6.4E-11 4E-14 76.5 16.0 291 104-415 1-332 (332) 150 protein:vir:107687 Length: 319 99.0 5.6E-10 3.5E-13 71.3 20.1 299 81-415 1-319 (319) 151 protein:vir:80068 Length: 301 98.9 8.8E-10 5.4E-13 70.2 20.0 274 127-415 1-301 (301) 152 protein:vir:102655 Length: 322 98.9 7.5E-10 4.6E-13 70.6 17.7 296 113-418 1-322 (322) 153 protein:vir:100057 Length: 375 98.9 1.8E-09 1.1E-12 68.6 19.6 295 110-419 1-373 (375) 154 protein:vir:9927 Length: 295 # 98.9 8.3E-10 5.1E-13 70.4 17.8 263 123-419 1-290 (295) 155 protein:vir:103759 Length: 330 98.8 2.3E-10 1.4E-13 73.4 13.5 236 98-360 1-330 (330) 156 protein:vir:104342 Length: 314 98.8 2E-09 1.2E-12 68.3 17.6 293 102-415 1-314 (314) 157 protein:vir:79642 Length: 329 98.7 4.8E-09 3E-12 66.2 18.2 304 80-418 1-329 (329) 158 protein:vir:107826 Length: 331 98.7 2.6E-09 1.6E-12 67.7 16.2 235 100-360 1-331 (331) 159 protein:vir:107388 Length: 331 98.7 2.6E-09 1.6E-12 67.7 16.2 235 100-360 1-331 (331) 160 protein:vir:98525 Length: 331 98.7 2.6E-09 1.6E-12 67.7 16.2 235 100-360 1-331 (331) 161 protein:vir:9875 Length: 296 # 98.7 3.7E-09 2.3E-12 66.8 16.9 270 117-418 1-296 (296) 162 protein:vir:99075 Length: 392 98.7 5.8E-09 3.6E-12 65.7 16.6 273 130-419 1-316 (392) 163 protein:vir:106647 Length: 303 98.6 7E-09 4.3E-12 65.3 16.1 265 121-419 1-298 (303) 164 protein:vir:7324 Length: 335 # 98.6 2.6E-09 1.6E-12 67.6 13.7 237 98-361 1-335 (335) 165 protein:vir:97031 Length: 402 98.6 3.3E-09 2.1E-12 67.1 14.1 288 118-419 1-339 (402) 166 protein:vir:105645 Length: 400 98.6 1.5E-08 9.3E-12 63.5 16.5 293 113-419 1-335 (400) 167 protein:vir:8843 Length: 317 # 98.4 1.3E-07 8E-11 58.4 18.0 286 121-419 1-317 (317) 168 protein:vir:7019 Length: 401 # 98.4 3.8E-08 2.4E-11 61.3 14.8 292 113-419 1-335 (401) 169 protein:vir:108303 Length: 418 98.4 4E-07 2.5E-10 55.6 20.1 265 127-419 1-324 (418) 170 protein:vir:94070 Length: 339 98.2 2.3E-07 1.4E-10 57.0 14.8 314 79-415 1-339 (339) 171 protein:vir:5255 Length: 304 # 98.1 6.6E-07 4.1E-10 54.5 15.9 268 129-414 1-304 (304) 172 protein:vir:95131 Length: 325 98.1 1E-06 6.5E-10 53.4 16.6 276 101-419 1-297 (325) 173 protein:vir:3643 Length: 336 # 98.0 1E-06 6.3E-10 53.4 14.6 307 88-415 1-336 (336) 174 protein:vir:79548 Length: 652 98.0 7.5E-06 4.7E-09 48.7 26.3 390 1-414 188-652 (652) 175 protein:vir:80446 Length: 367 98.0 5.7E-06 3.5E-09 49.3 18.7 276 123-419 1-339 (367) 176 protein:vir:97331 Length: 319 98.0 7.7E-06 4.8E-09 48.6 19.3 284 80-419 1-296 (319) 177 protein:vir:94800 Length: 319 98.0 7.7E-06 4.8E-09 48.6 19.3 284 80-419 1-296 (319) 178 protein:vir:93966 Length: 400 98.0 1.2E-06 7.7E-10 53.0 14.7 375 1-415 8-400 (400) 179 protein:vir:101557 Length: 336 98.0 2.2E-06 1.4E-09 51.6 15.7 307 88-415 1-336 (336) 180 protein:vir:78558 Length: 336 97.9 2.2E-06 1.4E-09 51.6 15.2 304 88-415 1-336 (336) 181 protein:vir:107120 Length: 329 97.9 1.2E-05 7.4E-09 47.6 19.0 295 73-419 1-308 (329) 182 protein:vir:174 Length: 423 # 97.8 1.5E-05 9.1E-09 47.1 18.1 265 128-419 1-303 (423) 183 protein:vir:105374 Length: 423 97.8 2E-05 1.2E-08 46.4 18.2 269 128-419 1-334 (423) 184 protein:vir:106734 Length: 336 97.8 4.5E-06 2.8E-09 49.9 14.4 307 88-415 1-336 (336) 185 protein:vir:1663 Length: 393 # 97.7 3.6E-06 2.3E-09 50.4 13.7 376 1-415 1-393 (393) 186 protein:vir:95512 Length: 693 97.7 2.7E-05 1.7E-08 45.6 25.2 388 1-415 258-693 (693) 187 protein:vir:3525 Length: 423 # 97.7 2.8E-05 1.7E-08 45.5 17.8 264 128-419 1-318 (423) 188 protein:vir:96792 Length: 315 97.7 3E-05 1.9E-08 45.4 18.5 268 126-419 1-284 (315) 189 protein:vir:1781 Length: 221 # 97.6 4.2E-06 2.6E-09 50.1 12.7 186 213-419 1-204 (221) 190 protein:vir:94989 Length: 349 97.6 3.9E-05 2.4E-08 44.7 18.5 273 126-419 1-319 (349) 191 protein:vir:104011 Length: 337 97.5 5E-05 3.1E-08 44.2 18.7 300 102-417 1-337 (337) 192 protein:vir:79171 Length: 337 97.5 5.3E-05 3.3E-08 44.0 18.7 300 102-417 1-337 (337) 193 protein:vir:1829 Length: 355 # 97.4 6.5E-05 4E-08 43.5 18.3 306 102-419 1-350 (355) 194 protein:vir:107732 Length: 379 97.3 4.7E-05 2.9E-08 44.3 15.1 331 63-415 1-379 (379) 195 protein:vir:78387 Length: 349 97.3 9E-05 5.6E-08 42.8 18.9 273 126-419 1-319 (349) 196 protein:vir:1153 Length: 338 # 97.3 0.0001 6.3E-08 42.5 17.8 303 102-419 1-338 (338) 197 protein:vir:79157 Length: 339 97.2 0.00013 8.1E-08 41.9 17.4 301 102-418 1-339 (339) 198 protein:vir:98566 Length: 355 97.2 0.00014 8.7E-08 41.7 18.6 308 102-419 1-350 (355) 199 protein:vir:95603 Length: 463 97.1 4.4E-05 2.7E-08 44.5 12.5 314 71-419 1-337 (463) 200 protein:vir:99311 Length: 463 97.1 4.4E-05 2.7E-08 44.5 12.5 314 71-419 1-337 (463) 201 protein:vir:78186 Length: 337 97.1 0.00019 1.2E-07 41.0 17.7 300 102-417 1-337 (337) 202 protein:vir:105522 Length: 423 97.0 0.00022 1.4E-07 40.6 19.1 267 128-419 1-318 (423) 203 protein:vir:96079 Length: 382 97.0 8.8E-05 5.5E-08 42.8 13.2 333 63-415 1-382 (382) 204 protein:vir:100331 Length: 342 96.9 0.00027 1.7E-07 40.2 18.3 306 102-418 1-342 (342) 205 protein:vir:3746 Length: 336 # 96.8 0.0003 1.9E-07 39.9 18.0 299 101-418 1-336 (336) 206 protein:vir:78777 Length: 358 96.8 0.00034 2.1E-07 39.6 17.9 308 98-419 1-348 (358) 207 protein:vir:3783 Length: 336 # 96.7 0.00041 2.5E-07 39.2 18.0 299 101-418 1-336 (336) 208 protein:vir:5694 Length: 357 # 96.5 0.0006 3.7E-07 38.3 16.7 305 102-419 1-349 (357) 209 protein:vir:270 Length: 341 # 96.4 0.00062 3.8E-07 38.2 16.1 304 98-419 1-334 (341) 210 protein:vir:861 Length: 318 # 96.3 0.00013 7.8E-08 42.0 10.0 306 84-415 1-318 (318) 211 protein:vir:2016 Length: 357 # 96.3 0.00072 4.5E-07 37.8 17.0 304 102-419 1-349 (357) 212 protein:vir:99576 Length: 388 96.0 0.00026 1.6E-07 40.2 10.2 335 63-415 1-388 (388) 213 protein:vir:6061 Length: 357 # 95.9 0.0013 8.3E-07 36.3 16.7 308 102-419 1-352 (357) 214 protein:vir:95875 Length: 401 95.6 0.0019 1.2E-06 35.6 16.2 296 114-418 1-401 (401) 215 protein:vir:94870 Length: 318 94.7 0.0017 1E-06 35.8 10.3 309 84-415 1-318 (318) 216 protein:vir:348 Length: 321 # 94.6 0.0039 2.4E-06 33.8 13.5 285 104-415 1-321 (321) 217 protein:vir:98856 Length: 343 94.5 0.0042 2.6E-06 33.6 18.4 304 102-419 1-341 (343) 218 protein:vir:95451 Length: 313 94.4 0.0045 2.8E-06 33.4 14.7 276 125-418 1-313 (313) 219 protein:vir:96666 Length: 462 93.3 0.0079 4.9E-06 32.1 16.9 305 71-419 1-341 (462) 220 protein:vir:103886 Length: 302 93.1 0.0087 5.4E-06 31.9 17.2 269 87-417 1-302 (302) 221 protein:vir:63741 Length: 468 93.0 0.009 5.6E-06 31.8 12.9 302 71-419 1-329 (468) 222 protein:vir:80835 Length: 464 92.9 0.0096 5.9E-06 31.7 13.6 308 78-419 1-349 (464) 223 protein:vir:80491 Length: 467 91.9 0.014 8.4E-06 30.8 13.4 301 73-419 1-328 (467) 224 protein:vir:100851 Length: 514 91.2 0.01 6.5E-06 31.5 9.4 313 83-419 1-354 (514) 225 protein:vir:2736 Length: 348 # 90.9 0.018 1.1E-05 30.1 20.6 281 128-418 1-348 (348) 226 protein:vir:96490 Length: 348 89.9 0.024 1.5E-05 29.5 20.7 281 128-418 1-348 (348) 227 protein:vir:4902 Length: 348 # 89.6 0.025 1.6E-05 29.3 18.3 284 128-418 1-348 (348) 228 protein:vir:79008 Length: 299 88.4 0.033 2E-05 28.7 21.9 264 128-419 1-299 (299) 229 protein:vir:80986 Length: 528 87.1 0.041 2.5E-05 28.2 19.6 351 56-419 1-505 (528) 230 protein:vir:5670 Length: 514 # 81.2 0.088 5.4E-05 26.4 18.5 349 60-419 1-501 (514) 231 protein:vir:106998 Length: 468 76.4 0.14 8.4E-05 25.3 17.6 351 56-419 1-447 (468) 232 protein:vir:102823 Length: 470 76.0 0.14 8.7E-05 25.3 10.7 292 76-419 1-323 (470) 233 protein:vir:98143 Length: 524 75.8 0.14 8.8E-05 25.2 20.3 354 22-419 1-497 (524) 234 protein:vir:78148 Length: 123 68.8 0.2 0.00012 24.5 7.2 109 307-417 1-123 (123) 235 protein:vir:100603 Length: 529 67.9 0.25 0.00015 23.9 19.9 359 21-419 1-512 (529) 236 protein:vir:78920 Length: 290 65.2 0.29 0.00018 23.5 20.0 259 128-417 1-290 (290) 237 protein:vir:6601 Length: 528 # 64.5 0.3 0.00019 23.5 20.9 357 25-419 1-501 (528) 238 protein:vir:5942 Length: 523 # 64.0 0.31 0.00019 23.4 16.4 333 42-419 1-523 (523) 239 protein:vir:962 Length: 397 # 52.6 0.55 0.00034 22.0 18.6 346 1-408 12-397 (397) 240 protein:vir:104915 Length: 470 52.4 0.55 0.00034 22.0 18.9 348 51-419 1-454 (470) 241 protein:vir:98480 Length: 348 52.0 0.57 0.00035 21.9 18.9 282 125-416 1-348 (348) 242 protein:vir:103463 Length: 521 47.4 0.7 0.00044 21.4 19.4 360 21-419 1-508 (521) 243 protein:vir:6901 Length: 522 # 45.1 0.78 0.00048 21.2 20.8 361 18-419 1-503 (522) 244 protein:vir:99888 Length: 309 44.5 0.8 0.0005 21.1 13.0 270 129-418 1-309 (309) 245 protein:vir:1025 Length: 408 # 42.9 0.87 0.00054 20.9 22.0 379 1-419 4-406 (408) 246 protein:vir:101039 Length: 529 42.0 0.9 0.00056 20.8 20.3 361 21-419 1-518 (529) 247 protein:vir:102335 Length: 312 41.3 0.93 0.00058 20.7 20.9 266 128-419 1-310 (312) 248 protein:vir:100172 Length: 394 40.9 0.95 0.00059 20.7 16.1 349 1-419 5-389 (394) 249 protein:vir:1383 Length: 421 # 40.2 0.98 0.00061 20.6 22.9 371 1-419 4-396 (421) 250 protein:vir:106286 Length: 534 40.1 0.99 0.00061 20.6 21.4 368 23-419 1-515 (534) 251 protein:vir:8846 Length: 705 # 36.8 1.2 0.00071 20.2 10.8 117 1-121 576-705 (705) 252 protein:vir:3870 Length: 400 # 32.0 1.5 0.00091 19.7 21.1 356 1-408 11-400 (400) 253 protein:vir:101811 Length: 529 31.5 1.5 0.00092 19.6 21.0 361 21-419 1-518 (529) 254 protein:vir:7214 Length: 521 # 28.6 1.7 0.0011 19.3 20.7 361 21-419 1-508 (521) 255 protein:vir:7409 Length: 408 # 26.7 1.9 0.0012 19.0 21.9 377 1-417 4-408 (408) 256 protein:vir:104549 Length: 462 24.4 2.2 0.0014 18.7 19.6 335 42-419 1-446 (462) 257 protein:vir:6212 Length: 434 # 24.1 2.2 0.0014 18.7 22.1 375 5-419 1-422 (434) 258 protein:vir:79712 Length: 285 23.5 2.3 0.0014 18.6 20.1 261 128-418 1-285 (285) 259 protein:vir:105464 Length: 346 23.2 2.3 0.0015 18.6 20.7 264 128-419 1-300 (346) No 1 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=8.8e-81 Score=459.47 Aligned_cols=419 Identities=100% Similarity=1.428 Sum_probs=383.5 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) ||..++|+|++++++++.+......++.+++.++.+++.++++++++++..+++.++...+..+................ T Consensus 1 m~~~~~lee~~a~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) T protein:vir:94 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 99999999999999999999988888888999999999999999999999999988888877777666666666666666 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) .+...+...+...++.......++......+..............+..+.++..++|+.+.+.|+..+.....++++|++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~ 160 (419) T protein:vir:94 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) T ss_pred ccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhccee Confidence 67777766667777777777777777777777777777777777777888899999999999999999999999999999 Q ss_pred ecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNSQLMGYIQGRLTY 240 (419) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~~~~~~i~~~l~~ 240 (419) .++.++.+.||+.++.+.+.....+.++||+||+.+|+++++|+++++++++++++++||+|+++|++++++||.++|++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~l~~~i~~~la~ 240 (419) T protein:vir:94 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNSQLMGYIQGRLTY 240 (419) T ss_pred eeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHhHHHHHHHHHHHHHH Confidence 99999999999999998888888889999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHh Q lcl|Aclame:pro 241 GLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQ 320 (419) Q Consensus 241 a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~k 320 (419) ++++++|.+||+|+|+++|+||++.+++............+....++++.++++.+...++.+++|+|||++|..|++++ T Consensus 241 a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k 320 (419) T protein:vir:94 241 GLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQ 320 (419) T ss_pred HHHHHHHHHHHhccCcccccceecccccccccccccccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHh Confidence 99999999999999999999999999888777776677777788899999999999999999999999999999999999 Q ss_pred ccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEec Q lcl|Aclame:pro 321 APGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRAN 400 (419) Q Consensus 321 d~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d 400 (419) +++|+++++++++.++.+++|+|+||+++++||++++++|||+++|+++++.+++++++++.+++|.+|++.||++.|+| T Consensus 321 ~~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d 400 (419) T protein:vir:94 321 APGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRAN 400 (419) T ss_pred hcCCCceeecCCcccCCCccccceeeEEcCCCCCccEEEeeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeec Confidence 99898888899999999999999999999999999999999999999999999999999998889999999999999999 Q ss_pred cEEecccceEEEEecCCCC Q lcl|Aclame:pro 401 LAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 401 ~~~~~~~a~~~~~~~aa~~ 419 (419) +++++|+||++++++++|| T Consensus 401 ~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 401 LAVYQPKAFVRVTFAAATT 419 (419) T ss_pred cEEeccccEEEEEeccCCC Confidence 9999999999999999999 No 2 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=1.7e-67 Score=386.65 Aligned_cols=399 Identities=27% Similarity=0.392 Sum_probs=288.2 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARG--------------LADALQAESDRAAARAALLRTAPPAPKGP 66 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~--------------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 66 (419) |+..+.++++.+...+..+++++..++++.+.++.++ +.++++++++++..+..+++......+.. T Consensus 4 ~~~~~~~~~~~~~~~el~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~e~~ 83 (418) T protein:vir:10 4 MNEPRQFGRKSGGDSHPEQVLETVTKELKRIGDEVKSAGEKALAEAKRAGDLGVETKATVDELLIKQGELQARLLEAEQK 83 (418) T ss_pred chhHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6666666555433222222222222222222222211 12223333333333343333333332222 Q ss_pred HhhcccccccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHH Q lcl|Aclame:pro 67 ADGGTPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPT 146 (419) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~ 146 (419) .... .........+...+...+....+.+........... .............+..+..++.++|+.+...|++ T Consensus 84 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~ 157 (418) T protein:vir:10 84 LARG--GGSAELETPKTLGQLVTESEEMKGMDGSARKSVRVR----VDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIA 157 (418) T ss_pred Hhhc--ccccccchhhhhhHHhhhHHHHHHHHHHHhhhhhhh----hHHHHHHHhhhhccCCCCCCccccchhHHHHHHH Confidence 2111 111122222333333333344444433333221111 1111111112223334556777899999999999 Q ss_pred hhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhh Q lcl|Aclame:pro 147 TPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADD 226 (419) Q Consensus 147 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d 226 (419) .+...++|+++|++++++++.+++|+.+.. +..+.|++|++.+|+++++|++|++.++|++++++||+++++| T Consensus 158 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-------~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~d 230 (418) T protein:vir:10 158 PPQRKMTIRDLLMPGQTSSSSIEYTVETGF-------TNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLFKASRQILDD 230 (418) T ss_pred HHhhhhhHHhhcceeeccCCceeEEEEecC-------CCceeeeccCccccccccceeeEEEeeeeEEEeehhhHHHHHh Confidence 999999999999999999999999987542 3578999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCcc-cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcE Q lcl|Aclame:pro 227 NSQLMGYIQGRLTYGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDG 305 (419) Q Consensus 227 ~~~~~~~i~~~l~~a~~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (419) ++++++||+++|++++++++|.+||+|+|++ +|.||++.++..... ...++...++++.+++..+...++.+++ T Consensus 231 s~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 305 (418) T protein:vir:10 231 APALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPS-----ITLANATPIDKIRLALLQAVLAEFPATG 305 (418) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccc-----ccccccccHHHHHHHHHhhccccCCCCE Confidence 9999999999999999999999999999987 599999876544322 2223345688999999999999999999 Q ss_pred EEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccch Q lcl|Aclame:pro 306 VVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADF 385 (419) Q Consensus 306 ~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~ 385 (419) |+|||.++..|++++|.+|+|+| +++.++.+++|+|+||+++++||++++++|||+++|+++++.+++++++++...+ T Consensus 306 ~v~n~~~~~~L~~lkd~~G~~i~--~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~ 383 (418) T protein:vir:10 306 IVLNPIDWASIELTKDSQGRYIV--GNPVNGTTPRLWNLPVVETQAMTANEFLVGAFSMAAQIFDRMEIEVLLSTENVDD 383 (418) T ss_pred EEEcHHHHHHHHHhhcCCCceec--cccccCCCceecceeeEEcCCCCCCcEEEeeccceEEEEEecceEEEEecccchh Confidence 99999999999999999999866 3455677889999999999999999999999999899999999999999988888 Q ss_pred hhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 386 FTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 386 ~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) |.+|++.||++.|+||++++|+||++++++++++ T Consensus 384 f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~~ 417 (418) T protein:vir:10 384 FEKNMVSIRAEERLALAVYRPESFVTGALVEQAG 417 (418) T ss_pred hhcCceEEEEEEeeccEEecccceEEEEeccCCC Confidence 9999999999999999999999999999999999 No 3 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=2.3e-66 Score=380.48 Aligned_cols=393 Identities=29% Similarity=0.381 Sum_probs=293.1 Q ss_pred CCcc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MPPT-PTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAG 79 (419) Q Consensus 1 M~~~-~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (419) |... +.|++.++++.+..++++...++..+..++.+...++++++.+++..++..++....+........... ..... T Consensus 1 m~~~~k~l~el~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 79 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQIKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKR-DGGEE 79 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc-ccccc Confidence 6543 456666666666666665555555444444444555555666655555555554443333222221111 11112 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcc Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLD 159 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~ 159 (419) ..+............+.+........... . .. .... .....++.++|+.+...|+..+...++|+++|+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~-----~~-~~~~-~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~ 148 (395) T protein:vir:43 80 APKTAGQMVAESLKEQGVTSSLRGSHRVS----M-----PR-SAIT-SIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVA 148 (395) T ss_pred hhhhHHHHHHHHHHHHHHHHHhhhhhhhh----h-----hh-hhhc-ccCCCCccccchhhHHHHHHHHHhhhhHHhhcc Confidence 22222232222222233322222111100 0 01 1111 223344556666778889999999999999999 Q ss_pred eecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHHHHHHHHHHHHH Q lcl|Aclame:pro 160 QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNSQLMGYIQGRLT 239 (419) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~~~~~~i~~~l~ 239 (419) +++++++.++||+.++. ...+.||+|++.+|+++++|++++++++|++++++||+++++|++++++||.++|+ T Consensus 149 ~~~~~~~~~~~~~~~~~-------~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~l~~~v~~~la 221 (395) T protein:vir:43 149 PGTTESNSVEYVRETGF-------VNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQILDDASALQSYIDARAR 221 (395) T ss_pred ceecCCCceEEEEEecC-------CCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhHHHHHHHHHHHHH Confidence 99999999999987643 24688999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhccCccc-ccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHH Q lcl|Aclame:pro 240 YGLRFLRDRQLLNGNGSTE-MQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIEL 318 (419) Q Consensus 240 ~a~~~~~d~~il~G~g~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 318 (419) +++++++|.+||+|+|+++ |.||++..+....... ........++++.+++..+...+..+++|+|||+++..|++ T Consensus 222 ~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~ 298 (395) T protein:vir:43 222 YGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSG---VVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIEL 298 (395) T ss_pred HHHHHHHHHHHHhccCCCCccccccccccccccccc---cccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHH Confidence 9999999999999999975 5899987665443322 23334457899999999999999999999999999999999 Q ss_pred HhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEE Q lcl|Aclame:pro 319 DQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFR 398 (419) Q Consensus 319 ~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r 398 (419) ++|++|+|++. ++.++.+++|+|+||+++++||++++++|||+++|+++++.+++++++++.+.+|++|++.||++.| T Consensus 299 lkd~~G~~i~~--~~~~~~~~~l~G~pVv~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r 376 (395) T protein:vir:43 299 NKDAENRYIIG--SPQNGTTPTLWRLPVVETQAITQDEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEER 376 (395) T ss_pred hhccCCceecc--ccccCCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEe Confidence 99999998663 4556778899999999999999999999999999999999999999999888889999999999999 Q ss_pred eccEEecccceEEEEecCC Q lcl|Aclame:pro 399 ANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 399 ~d~~~~~~~a~~~~~~~aa 417 (419) +|+++++|+||+++++++| T Consensus 377 ~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 377 LAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred eccEEecccceEEEEeccC Confidence 9999999999999999999 No 4 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=2.3e-66 Score=380.45 Aligned_cols=409 Identities=24% Similarity=0.279 Sum_probs=271.0 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTT----EQVQEIVAEARGLADALQAESDRAAA------RAALLRTAPPAPKGPADGG 70 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~----~~~~~~~~e~~~~~~~~~~~~~~~~~------~~~~l~~~~~~~~~~~~~~ 70 (419) ||....|+++..++.++++++.... .+.+++.++....+..++++++.... +...+....+..+...... T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:10 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999988777666655444322 22333222222222222222221111 1111111111111111111 Q ss_pred ccccccccchhhhhhHHHHhHHHHHH-------------HHHh---hhhhhhhHHHHHHHHHH----hhhcccccccccC Q lcl|Aclame:pro 71 TPLTPAEAGTFRSLAQRFADSDGLRE-------------YRAR---DKRGQFQVEMRDIDPNR----LLSRDAPAGTITN 130 (419) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~---~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~ 130 (419) ...... ...+.........+.+.. .... ........+.+...... ........ +.+. T Consensus 81 e~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 157 (497) T protein:vir:10 81 EVRNLK--QIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPF-GSTG 157 (497) T ss_pred Hhhhhh--hHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhc-ccCc Confidence 000000 000000000000000000 0000 00000000000000000 01111112 2334 Q ss_pred CcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeee Q lcl|Aclame:pro 131 PNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTL 210 (419) Q Consensus 131 ~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~ 210 (419) .+++++|+.+...|++.++..+++++++++++++++.++||+.++. .+.++||+|++.+|+++++|++|++.+ T Consensus 158 ~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~-------~~~a~wv~E~~~~~~s~~~f~~i~~~~ 230 (497) T protein:vir:10 158 TFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAA-------HNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) T ss_pred ccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCC-------CCcceeeccCcccccccccceeeEeee Confidence 5666788888888999999999999999999999999999987642 347889999999999999999999999 Q ss_pred EEEEEeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccccc----------- Q lcl|Aclame:pro 211 KTVAHWLPITRQAADDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAP----------- 279 (419) Q Consensus 211 ~k~~~~~~vs~ell~d~~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~----------- 279 (419) +|++++++||+||++|++++++||.++|++++++++|.+||+|+|+++|.||++.++............ T Consensus 231 ~k~a~~~~iS~ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (497) T protein:vir:10 231 GKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKF 310 (497) T ss_pred eeeEeecHhHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhh Confidence 999999999999999999999999999999999999999999999999999998766543322211111 Q ss_pred --------------------------------------chhhhHHHHHHHHHHhhhhh-ccCCcEEEEehHHHHHHHHHh Q lcl|Aclame:pro 280 --------------------------------------ATDEPPLVDIRRAKTVAEIA-GFPPDGVVVHPQDWESIELDQ 320 (419) Q Consensus 280 --------------------------------------~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~k 320 (419) .+..+...++..++..+... +..+++|+|||.+|..|+++| T Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk 390 (497) T protein:vir:10 311 PADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK 390 (497) T ss_pred hcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhh Confidence 11223344455555555444 345668999999999999999 Q ss_pred ccCCceeccCCcc-----ccCCCcccccceeEecCCCCcCcEEEEeccce-EEEEEecceEEEEeecccchhhcCcEEEE Q lcl|Aclame:pro 321 APGSGVFRVIANV-----QGEATPRIWGLNVVSTVAIAQGTALVGGFRQG-ATLWSRQGITVLMTDSHADFFTANTLVIL 394 (419) Q Consensus 321 d~~g~~~~~~~~~-----~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~r 394 (419) |.+|+|+|..... ....+++|+|+||+++++||++++++|||+++ |.+++|.+++|+++++..++|++|++.|| T Consensus 391 d~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r 470 (497) T protein:vir:10 391 DANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVR 470 (497) T ss_pred cCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEE Confidence 9999987753221 12345689999999999999999999999985 55789999999999998889999999999 Q ss_pred EEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 395 AEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 395 ~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ++.|+|+.+++|+||++++++++++ T Consensus 471 ~~~r~~~~v~~p~A~~~l~~~~~~~ 495 (497) T protein:vir:10 471 AEERLGLLVYRPSAFQLIQLKKGAT 495 (497) T ss_pred EEEeecceeeccccEEEEEecCCcc Confidence 9999999999999999999999999 No 5 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=2.3e-66 Score=380.45 Aligned_cols=409 Identities=24% Similarity=0.279 Sum_probs=271.0 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTT----EQVQEIVAEARGLADALQAESDRAAA------RAALLRTAPPAPKGPADGG 70 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~----~~~~~~~~e~~~~~~~~~~~~~~~~~------~~~~l~~~~~~~~~~~~~~ 70 (419) ||....|+++..++.++++++.... .+.+++.++....+..++++++.... +...+....+..+...... T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:78 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999988777666655444322 22333222222222222222221111 1111111111111111111 Q ss_pred ccccccccchhhhhhHHHHhHHHHHH-------------HHHh---hhhhhhhHHHHHHHHHH----hhhcccccccccC Q lcl|Aclame:pro 71 TPLTPAEAGTFRSLAQRFADSDGLRE-------------YRAR---DKRGQFQVEMRDIDPNR----LLSRDAPAGTITN 130 (419) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~---~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~ 130 (419) ...... ...+.........+.+.. .... ........+.+...... ........ +.+. T Consensus 81 e~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 157 (497) T protein:vir:78 81 EVRNLK--QIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPF-GSTG 157 (497) T ss_pred Hhhhhh--hHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhc-ccCc Confidence 000000 000000000000000000 0000 00000000000000000 01111112 2334 Q ss_pred CcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeee Q lcl|Aclame:pro 131 PNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTL 210 (419) Q Consensus 131 ~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~ 210 (419) .+++++|+.+...|++.++..+++++++++++++++.++||+.++. .+.++||+|++.+|+++++|++|++.+ T Consensus 158 ~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~-------~~~a~wv~E~~~~~~s~~~f~~i~~~~ 230 (497) T protein:vir:78 158 TFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAA-------HNNAAAVAEAGTYPFSSEEFARVYEQV 230 (497) T ss_pred ccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCC-------CCcceeeccCcccccccccceeeEeee Confidence 5666788888888999999999999999999999999999987642 347889999999999999999999999 Q ss_pred EEEEEeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccccc----------- Q lcl|Aclame:pro 211 KTVAHWLPITRQAADDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAP----------- 279 (419) Q Consensus 211 ~k~~~~~~vs~ell~d~~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~----------- 279 (419) +|++++++||+||++|++++++||.++|++++++++|.+||+|+|+++|.||++.++............ T Consensus 231 ~k~a~~~~iS~ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (497) T protein:vir:78 231 GKVANALTITDEGLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKF 310 (497) T ss_pred eeeEeecHhHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhh Confidence 999999999999999999999999999999999999999999999999999998766543322211111 Q ss_pred --------------------------------------chhhhHHHHHHHHHHhhhhh-ccCCcEEEEehHHHHHHHHHh Q lcl|Aclame:pro 280 --------------------------------------ATDEPPLVDIRRAKTVAEIA-GFPPDGVVVHPQDWESIELDQ 320 (419) Q Consensus 280 --------------------------------------~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~k 320 (419) .+..+...++..++..+... +..+++|+|||.+|..|+++| T Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk 390 (497) T protein:vir:78 311 PADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK 390 (497) T ss_pred hcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhh Confidence 11223344455555555444 345668999999999999999 Q ss_pred ccCCceeccCCcc-----ccCCCcccccceeEecCCCCcCcEEEEeccce-EEEEEecceEEEEeecccchhhcCcEEEE Q lcl|Aclame:pro 321 APGSGVFRVIANV-----QGEATPRIWGLNVVSTVAIAQGTALVGGFRQG-ATLWSRQGITVLMTDSHADFFTANTLVIL 394 (419) Q Consensus 321 d~~g~~~~~~~~~-----~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~r 394 (419) |.+|+|+|..... ....+++|+|+||+++++||++++++|||+++ |.+++|.+++|+++++..++|++|++.|| T Consensus 391 d~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r 470 (497) T protein:vir:78 391 DANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVR 470 (497) T ss_pred cCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEE Confidence 9999987753221 12345689999999999999999999999985 55789999999999998889999999999 Q ss_pred EEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 395 AEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 395 ~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ++.|+|+.+++|+||++++++++++ T Consensus 471 ~~~r~~~~v~~p~A~~~l~~~~~~~ 495 (497) T protein:vir:78 471 AEERLGLLVYRPSAFQLIQLKKGAT 495 (497) T ss_pred EEEeecceeeccccEEEEEecCCcc Confidence 9999999999999999999999999 No 6 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=4.3e-66 Score=378.94 Aligned_cols=388 Identities=29% Similarity=0.420 Sum_probs=296.6 Q ss_pred CCccHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MPPTPT-LEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAG 79 (419) Q Consensus 1 M~~~~~-L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (419) |+.+.+ |+++++++.++++.+.....+...++++.++.+++++++++.+++++++++......+...... .. T Consensus 1 m~e~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~-------~~ 73 (390) T protein:vir:10 1 MTDITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGG-------DV 73 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------cc Confidence 777755 6666777766666655555444556777788888888888888888877766654433322111 11 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcc Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLD 159 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~ 159 (419) ..+...+...+.+..+.+.............. ..... ...........|++++|+.+ ..|+..+...++|+++|+ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~-~~~~~~~~~~~g~~~~~~~~-~~ii~~~~~~~~l~~~~~ 148 (390) T protein:vir:10 74 QHVSVGDLFVASEQFQASAGRWNDRSARATMN---IKAAL-NTASTDAAGSAGALTTPNRL-PGFITQPDARLTVRDLIG 148 (390) T ss_pred cccchhhhhhhhHHHHHHHHhhhhhhhhhhhH---HHHHH-HhhhcccccccccccchhHH-HHHHHHHHhhchhhhhcc Confidence 12222333333444444444433332222111 11111 11222333445566666555 567777888899999999 Q ss_pred eecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHHHHHHHHHHHHH Q lcl|Aclame:pro 160 QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNSQLMGYIQGRLT 239 (419) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~~~~~~i~~~l~ 239 (419) ++|++++.+++|+.++. .+.+.|++|++++|+++++|+++++.+++++++++||+++++|++++++||.++|+ T Consensus 149 ~~~~~~~~~~~~~~~~~-------~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~l~~~i~~~l~ 221 (390) T protein:vir:10 149 SGRTDSALIEYVQETGF-------VNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQLASYMNNRLI 221 (390) T ss_pred eeeccCCceEEEEEecC-------CcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHHHHhHHHHHHHHHHHHH Confidence 99999999999987643 24689999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhccCcc-cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHH Q lcl|Aclame:pro 240 YGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIEL 318 (419) Q Consensus 240 ~a~~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 318 (419) +++++++|++||+|+|++ +|.||++.++..... ........++++.+++..+...++.+++|+|||++|..|++ T Consensus 222 ~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~ 296 (390) T protein:vir:10 222 RGLKVKEDAEILRGTGANDGLLGLIPQATTYAAP-----TTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIEL 296 (390) T ss_pred HHHHHHHHHHHhhcCCCCcccccccccccccccc-----ccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH Confidence 999999999999999987 499999876543321 22233446788999999999999999999999999999999 Q ss_pred HhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEE Q lcl|Aclame:pro 319 DQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFR 398 (419) Q Consensus 319 ~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r 398 (419) ++|++|+|+|..+ .++.+++|+|+||++++.||++++++|||+++|.++++.+++++++++. .+|.+|++.||++.| T Consensus 297 lkd~~g~~l~~~~--~~~~~~~l~G~pv~~~~~~p~~~~~~gdf~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~~~r 373 (390) T protein:vir:10 297 AKDANNQYLIGNA--RGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVN-DDFQRNMVTVLAEER 373 (390) T ss_pred hhcCCCceeecCC--cCcCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecc-cccccCcEEEEEEEe Confidence 9999999876543 3455679999999999999999999999999999999999999998754 579999999999999 Q ss_pred eccEEecccceEEEEec Q lcl|Aclame:pro 399 ANLAVYQPKAFVRVTFA 415 (419) Q Consensus 399 ~d~~~~~~~a~~~~~~~ 415 (419) +|+++++|+||++++++ T Consensus 374 ~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 374 LALVVYRPEALISGSFA 390 (390) T ss_pred eccEEeccccEEEEEeC Confidence 99999999999999999 No 7 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=4.5e-66 Score=378.86 Aligned_cols=388 Identities=12% Similarity=0.122 Sum_probs=295.3 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |..+++|++..+++++..++++...+ ...++.++....+..+++.++.++++++......+.............. . T Consensus 1 l~~~k~l~~~i~e~~~~~~~~k~~~~---~~~~~~e~~~~~l~~~~e~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~-~ 76 (407) T protein:vir:48 1 MADVKDVEQVAQELQRKFDDFKEKND---KRIDAIEQEKGKLAGEVETLNGKLAELENLKSDLEAELAEVKRPAGGTQ-N 76 (407) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-c Confidence 99999999998888888887765443 2344445556667777777777777666665544433322111111000 0 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) . . ..+..+.+....+++.. ..+...+.. .... ++...|++++|+.+.+.|++.++..++++++|++ T Consensus 77 ~--~-----~~e~~~a~~~~l~~g~~-~~~~~~e~~-----a~~~-~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~ 142 (407) T protein:vir:48 77 K--V-----ASEHKEAFIGFMRKGRE-DGLRELERK-----ALQV-GNDEDGGYAIPEELDRTILTLLKDEVVMRQEATV 142 (407) T ss_pred c--h-----hhHHHHHHHHHHhccch-hhhhHHHHH-----hhhc-ccCCCCcccccHhHHHHHHHHHHhhhhhhhhcee Confidence 0 0 01111222222222211 112222211 1112 2334567889999999999999999999999999 Q ss_pred ecccCcceeeeeeccccceeccccccceeecCcccccccc-cceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST-LSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQGRL 238 (419) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~-~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~~l 238 (419) +|++++.+.+|+..+ +..+.|++|++.+|+++ ++|+++++.++|++++++||+|+++|+. ++++||.++| T Consensus 143 ~~~~~~~~~~~~~~~--------~~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l 214 (407) T protein:vir:48 143 ITLGGSDYKKLVNLG--------GTTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSEL 214 (407) T ss_pred eecCCCceEEEEecC--------CcceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHH Confidence 999999999988653 45789999999999875 7999999999999999999999999985 8999999999 Q ss_pred HHHHHHHHHHHHHhccCcccccceecccccccccccc-------ccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehH Q lcl|Aclame:pro 239 TYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPK-------PTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQ 311 (419) Q Consensus 239 ~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (419) ++++++++|.+|++|+|+++|.||++.+......... ......+...++++.+++..+...|..+++|+||++ T Consensus 215 ~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~ 294 (407) T protein:vir:48 215 ALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGAKFMMNNS 294 (407) T ss_pred HHHHHHHHHhhhhccCCCCccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcCCEEEEcHH Confidence 9999999999999999999999999876544332211 122333445689999999999999999999999999 Q ss_pred HHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCc-----CcEEEEeccceEEEEEecceEEEEeecccchh Q lcl|Aclame:pro 312 DWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQ-----GTALVGGFRQGATLWSRQGITVLMTDSHADFF 386 (419) Q Consensus 312 ~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~-----~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~ 386 (419) ++..|++++|.+|+|+| ++++..+.+++|+|+||+++++||. ..+++|||+++|.+++|.++++..+ .+| T Consensus 295 ~~~~L~~lkD~~Gr~l~-~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d----~~~ 369 (407) T protein:vir:48 295 SLFAIRLLKDNDGNYLW-RPGIELGQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRD----PYT 369 (407) T ss_pred HHHHHHHhhccCCceee-ccCcCCCCCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEee----ccc Confidence 99999999999999764 5667778889999999999999985 2378899999999999999888754 357 Q ss_pred hcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 387 TANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 387 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .+|++.||++.|+|+++++|+||++++++++++ T Consensus 370 ~~~~~~~~~~~r~d~~v~~~~a~~~l~~~aa~~ 402 (407) T protein:vir:48 370 NKPFVGFYTTKRTGGMLVDSQAIKLMKIGAATR 402 (407) T ss_pred cCCcEEEEEEEEeccEEecccceEEEEeeccCC Confidence 899999999999999999999999999999999 No 8 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=4.9e-66 Score=378.63 Aligned_cols=388 Identities=28% Similarity=0.400 Sum_probs=295.9 Q ss_pred CCccHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MPPTPT-LEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAG 79 (419) Q Consensus 1 M~~~~~-L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (419) |..+.+ |++++.++.++++.+....++...+.++.++.+++++++++.+++++++++......+..... ... T Consensus 1 m~~l~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~-------~~~ 73 (390) T protein:vir:81 1 MTDITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAG-------GDV 73 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------ccc Confidence 888855 677777777766655554444445677778888889999999888888776655443332211 111 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcc Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLD 159 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~ 159 (419) ..+...+...+....+.+.............. ..... .....+....+|+++ |+.+...|+..+...++|+++|+ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~-~~~~~~~~~~~g~~~-~~~~~~~ii~~~~~~~~l~~~~~ 148 (390) T protein:vir:81 74 QHVSVGDMFVASEQFQASAGRWNDRSARATMN---IKAAL-NTASTDAAGSAGALT-TPNRLPGFITPPDARLTVRDLIG 148 (390) T ss_pred ccccchhhhhhhHHHHHHHHHHhhhhhhhhhH---HHHHH-HhhccccccCCccee-chhhhHHHHHHHhhhhhhhhhcc Confidence 22222333333333343333333222111111 11111 111222333444444 44556677788899999999999 Q ss_pred eecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHHHHHHHHHHHHH Q lcl|Aclame:pro 160 QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNSQLMGYIQGRLT 239 (419) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~~~~~~i~~~l~ 239 (419) +++++++.+++|+.++. ...+.|++||+.+|+++++|+++++.++|++++++||+|+++|++++++||.++|+ T Consensus 149 ~~~~~~~~~~~~~~~~~-------~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~~~~~i~~~l~ 221 (390) T protein:vir:81 149 SGRTDSALIEYVQETGF-------VNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQLASYMNNRLI 221 (390) T ss_pred eeeccCCceEEEEEecC-------CcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHHHHhHHHHHHHHHHHHH Confidence 99999999999987642 24688999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhccCccc-ccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHH Q lcl|Aclame:pro 240 YGLRFLRDRQLLNGNGSTE-MQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIEL 318 (419) Q Consensus 240 ~a~~~~~d~~il~G~g~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 318 (419) +++++++|++||+|+|+++ |.||++..+.... ....+....++++.+++..+...++.+++|+|||++|..|++ T Consensus 222 ~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~ 296 (390) T protein:vir:81 222 RGLKVKEDAEILRGTGANDGLLGLIPQATTYAA-----PTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIEL 296 (390) T ss_pred HHHHHHHHHHHHhcCCCCCcccceeeccccccc-----ccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH Confidence 9999999999999999975 9999986654332 122334456788999999999999999999999999999999 Q ss_pred HhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEE Q lcl|Aclame:pro 319 DQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFR 398 (419) Q Consensus 319 ~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r 398 (419) ++|++|+|+|.. ...+.+++|+|+||+++++||++++++|||+++|+++++.+++++++++. .+|++|++.||++.| T Consensus 297 lkd~~G~~l~~~--~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~-~~~~~~~v~~r~~~r 373 (390) T protein:vir:81 297 AKDANNQYLIGN--ARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVG-EDFQRNMITVLAEER 373 (390) T ss_pred hhcCCCceeecC--cccccCceecceeeEEcCCCCCCcEEEEehhceEEEEEecceEEEEeccc-chhhcCcEEEEEEEe Confidence 999999987643 34566789999999999999999999999999999999999999988764 479999999999999 Q ss_pred eccEEecccceEEEEec Q lcl|Aclame:pro 399 ANLAVYQPKAFVRVTFA 415 (419) Q Consensus 399 ~d~~~~~~~a~~~~~~~ 415 (419) +|+++++|+||++++++ T Consensus 374 ~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 374 LALVVYRPEALISGSFA 390 (390) T ss_pred eccEEecccceEEEEeC Confidence 99999999999999999 No 9 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=6e-66 Score=378.17 Aligned_cols=388 Identities=29% Similarity=0.405 Sum_probs=301.1 Q ss_pred CCccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MPPTP-TLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAG 79 (419) Q Consensus 1 M~~~~-~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (419) |...+ +|+++++++.++++.+.....+...+.++.++.+++++++++.+.+++++++............... T Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~------- 73 (390) T protein:vir:97 1 MTDITAKLEATLANVTDSLKAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDV------- 73 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------- Confidence 77774 5888888887777776666555556778888888899999998888888777665554433322211 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcc Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLD 159 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~ 159 (419) ..+.........+..+.+................ ... ..... ..+..++.++|+.+...|++.+...++++++++ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~-~~~~~-~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~ 148 (390) T protein:vir:97 74 QHVSVGDMFVASEQFQASTGRWNDRSARATMNIK---AAL-NTAST-DAAGSAGALTTPNRLPGFITPPDARLTVRDLIG 148 (390) T ss_pred ccccchhhhhhhHHHHHHHHHhhhhhhhhhhHHH---HHH-Hhhhc-ccccccccccchhhhHHHHHHHhhhhhhHhhcc Confidence 1122222222333333333333332222111111 111 11112 223445556666777888889999999999999 Q ss_pred eecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHHHHHHHHHHHHH Q lcl|Aclame:pro 160 QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNSQLMGYIQGRLT 239 (419) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~~~~~~i~~~l~ 239 (419) ++|+.++.+++|+.++. .+.+.|++||+++|+++++|+++++.+++++++++||+|+++|++++++||.++|+ T Consensus 149 ~~~~~~~~~~~~~~~~~-------~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~l~~~i~~~la 221 (390) T protein:vir:97 149 SGRTDSALIEYVQETGF-------VNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQLASYMNNRLI 221 (390) T ss_pred eeeccCCceEEEEEecC-------CcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhHHHHHHHHHHHHH Confidence 99999999999987642 24689999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhccCccc-ccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHH Q lcl|Aclame:pro 240 YGLRFLRDRQLLNGNGSTE-MQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIEL 318 (419) Q Consensus 240 ~a~~~~~d~~il~G~g~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 318 (419) +++++++|++||+|+|+++ |.||++.++.... ....+....++++.+++..+...+..+++|+|||++|..|++ T Consensus 222 ~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~-----~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~ 296 (390) T protein:vir:97 222 RGLKVKEDAEILRGTGANDGLLGLIPQATTYAA-----PTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIEL 296 (390) T ss_pred HHHHHHHHHHHhhcCCCCccccceeeccccccc-----cccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH Confidence 9999999999999999875 9999986654332 122334556788999999999999999999999999999999 Q ss_pred HhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEE Q lcl|Aclame:pro 319 DQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFR 398 (419) Q Consensus 319 ~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r 398 (419) +||++|+|+|.. ..++.+++|+|+||+++++||++++++|||+++|.++++.++++++++++ .+|++|++.||++.| T Consensus 297 lkd~~G~~l~~~--~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~f~~~~~~~r~~~r 373 (390) T protein:vir:97 297 AKDANNQYLIGN--ARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVN-DDFQRNMVTVLAEER 373 (390) T ss_pred hhcCCCceeecC--ccCCCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecc-cccccCcEEEEEEEe Confidence 999999987643 34566789999999999999999999999999899999999999998654 469999999999999 Q ss_pred eccEEecccceEEEEec Q lcl|Aclame:pro 399 ANLAVYQPKAFVRVTFA 415 (419) Q Consensus 399 ~d~~~~~~~a~~~~~~~ 415 (419) +|+++++|+||++++++ T Consensus 374 ~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 374 LALVVYRPEALITGSFA 390 (390) T ss_pred eccEEeccccEEEEEeC Confidence 99999999999999999 No 10 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=1.8e-65 Score=375.54 Aligned_cols=384 Identities=26% Similarity=0.369 Sum_probs=285.2 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |+.+.+|+++++++.++++++....+ ...++..++.++++++++++.+++++++...+..+........ .... T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~---~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~ 73 (385) T protein:vir:18 1 MSELALIQKAIEESQQKMTQLFDAQK---AEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAE----NPGE 73 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----ccch Confidence 99999999998888877766543321 2222333344455555555555555444443333222211111 1111 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) .+...... ..+..+.+........ ... . +.........+|+ ++|+.+...|+..+...++|+++|++ T Consensus 74 ~~~~~~~~-~~~~~~~~~~~~~~~~----~~~-~------~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~~~l~~~~~~ 140 (385) T protein:vir:18 74 KKSFSERA-AEELIKSWDGKQGTFG----AKT-F------NKSLGSDADSAGS-LIQPMQIPGIIMPGLRRLTIRDLLAQ 140 (385) T ss_pred hhhhHHHH-HHHHHHHHHHhhccch----hhH-H------HhhhccccccCCc-eecchhhhHHHHHhhhccchhhhcce Confidence 11111111 1111111111111000 000 0 0111112223344 45555677788888999999999999 Q ss_pred ecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNSQLMGYIQGRLTY 240 (419) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~~~~~~i~~~l~~ 240 (419) +|++++.++||+.+.. ...+.|++|++.+|+++++|+++++.++|++++++||+|+++|++++++||.++|++ T Consensus 141 ~~~~~~~~~~~~~~~~-------~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~~l~~~i~~~la~ 213 (385) T protein:vir:18 141 GRTSSNALEYVREEVF-------TNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDAPMLQSYINNRLMY 213 (385) T ss_pred ecccCcceEEEEEecC-------CcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhHHHHHHHHHHHHHH Confidence 9999989999987642 347889999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhccCccc-ccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHH Q lcl|Aclame:pro 241 GLRFLRDRQLLNGNGSTE-MQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELD 319 (419) Q Consensus 241 a~~~~~d~~il~G~g~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 319 (419) ++++++|.+||+|+|+++ |.||++..+..... ...+....++++.+++..+...++.+++|+|||+++..|+++ T Consensus 214 a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~-----~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l 288 (385) T protein:vir:18 214 GLALKEEGQLLNGDGTGDNLEGLNKVATAYDTS-----LNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALL 288 (385) T ss_pred HHHHHHHHHHHhccCCCCccccccccccccccc-----ccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHh Confidence 999999999999999975 68998865443322 122334568899999999999999999999999999999999 Q ss_pred hccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEe Q lcl|Aclame:pro 320 QAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRA 399 (419) Q Consensus 320 kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~ 399 (419) +|++|+++|. ++.++.+++|+|+||+++++||++.+++|||+++|+++++.+++++++++..++|++|++.||++.|+ T Consensus 289 kd~~G~~l~~--~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~ 366 (385) T protein:vir:18 289 KDNEGRYIFG--GPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERL 366 (385) T ss_pred hcCCCceecc--CcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEee Confidence 9999998763 35577889999999999999999999999999999999999999999998888999999999999999 Q ss_pred ccEEecccceEEEEecCCC Q lcl|Aclame:pro 400 NLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 400 d~~~~~~~a~~~~~~~aa~ 418 (419) |+++++|+||+++++++++ T Consensus 367 ~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 367 ALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred ccEEecccceEEEEeccCC Confidence 9999999999999999999 No 11 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=1.8e-65 Score=375.54 Aligned_cols=384 Identities=26% Similarity=0.369 Sum_probs=285.2 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |+.+.+|+++++++.++++++....+ ...++..++.++++++++++.+++++++...+..+........ .... T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~---~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~ 73 (385) T protein:vir:19 1 MSELALIQKAIEESQQKMTQLFDAQK---AEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAE----NPGE 73 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----ccch Confidence 99999999998888877766543321 2222333344455555555555555444443333222211111 1111 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) .+...... ..+..+.+........ ... . +.........+|+ ++|+.+...|+..+...++|+++|++ T Consensus 74 ~~~~~~~~-~~~~~~~~~~~~~~~~----~~~-~------~~~~~~~~~~~g~-~i~~~~~~~ii~~~~~~~~l~~~~~~ 140 (385) T protein:vir:19 74 KKSFSERA-AEELIKSWDGKQGTFG----AKT-F------NKSLGSDADSAGS-LIQPMQIPGIIMPGLRRLTIRDLLAQ 140 (385) T ss_pred hhhhHHHH-HHHHHHHHHHhhccch----hhH-H------HhhhccccccCCc-eecchhhhHHHHHhhhccchhhhcce Confidence 11111111 1111111111111000 000 0 0111112223344 45555677788888999999999999 Q ss_pred ecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNSQLMGYIQGRLTY 240 (419) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~~~~~~i~~~l~~ 240 (419) +|++++.++||+.+.. ...+.|++|++.+|+++++|+++++.++|++++++||+|+++|++++++||.++|++ T Consensus 141 ~~~~~~~~~~~~~~~~-------~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~~l~~~i~~~la~ 213 (385) T protein:vir:19 141 GRTSSNALEYVREEVF-------TNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDAPMLQSYINNRLMY 213 (385) T ss_pred ecccCcceEEEEEecC-------CcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhHHHHHHHHHHHHHH Confidence 9999989999987642 347889999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhccCccc-ccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHH Q lcl|Aclame:pro 241 GLRFLRDRQLLNGNGSTE-MQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELD 319 (419) Q Consensus 241 a~~~~~d~~il~G~g~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 319 (419) ++++++|.+||+|+|+++ |.||++..+..... ...+....++++.+++..+...++.+++|+|||+++..|+++ T Consensus 214 a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~-----~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~l 288 (385) T protein:vir:19 214 GLALKEEGQLLNGDGTGDNLEGLNKVATAYDTS-----LNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALL 288 (385) T ss_pred HHHHHHHHHHHhccCCCCccccccccccccccc-----ccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHh Confidence 999999999999999975 68998865443322 122334568899999999999999999999999999999999 Q ss_pred hccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEe Q lcl|Aclame:pro 320 QAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRA 399 (419) Q Consensus 320 kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~ 399 (419) +|++|+++|. ++.++.+++|+|+||+++++||++.+++|||+++|+++++.+++++++++..++|++|++.||++.|+ T Consensus 289 kd~~G~~l~~--~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~ 366 (385) T protein:vir:19 289 KDNEGRYIFG--GPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERL 366 (385) T ss_pred hcCCCceecc--CcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEee Confidence 9999998763 35577889999999999999999999999999999999999999999998888999999999999999 Q ss_pred ccEEecccceEEEEecCCC Q lcl|Aclame:pro 400 NLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 400 d~~~~~~~a~~~~~~~aa~ 418 (419) |+++++|+||+++++++++ T Consensus 367 ~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 367 ALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred ccEEecccceEEEEeccCC Confidence 9999999999999999999 No 12 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=1.9e-65 Score=375.38 Aligned_cols=388 Identities=16% Similarity=0.074 Sum_probs=291.1 Q ss_pred CCcc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|Aclame:pro 1 MPPT--PTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEA 78 (419) Q Consensus 1 M~~~--~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 78 (419) |+.. ++|+++++++.+++..+....+ .+.+.++.++.+++++.+++.+++++++..+.................... T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~l~~~~~-~~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:13 1 MDATTLSANFEARERATAELRSLTDEFA-GKEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVTSLLSGLQGSGS 79 (392) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhh-cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCccc Confidence 8765 6777777777777666655443 245677888888999999999998887654444333322222111111111 Q ss_pred chhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhc Q lcl|Aclame:pro 79 GTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL 158 (419) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~ 158 (419) ...+.. ... ...+......+ +.+....... ...+..+.++++++|+.+...|...+.....++.++ T Consensus 80 ~~~~~~-----~~~-~~~~~r~g~~~----~~~~~~~~~~----~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~ 145 (392) T protein:vir:13 80 GAQRSA-----DHD-DDAVLRAGNLG----EARSFEFAPE----KRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGA 145 (392) T ss_pred chhhhh-----hHH-HHHHHhccchh----hhHHHHhhhh----hhcccccCCCccccccchHHHHHHHHhhhhhhhhcc Confidence 111000 000 01111111111 1111111111 111223344566777778888888888888999999 Q ss_pred ceecccC-cceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHH Q lcl|Aclame:pro 159 DQQNADY-NVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQG 236 (419) Q Consensus 159 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~ 236 (419) +++++++ ..+.+|+.++ ...++||+|++.+|+++++|++++++++|++++++||+|+++|+. ++++||.+ T Consensus 146 ~~~~~~~~~~~~~~~~~~--------~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~ 217 (392) T protein:vir:13 146 STFTTSDANPMDFTVITG--------RATAGIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVS 217 (392) T ss_pred eeeecCCCceeEEEEEcC--------CcceeeecccccccccccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHH Confidence 9998865 4578887653 357899999999999999999999999999999999999999975 89999999 Q ss_pred HHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHH Q lcl|Aclame:pro 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESI 316 (419) Q Consensus 237 ~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 316 (419) +|++++++++|.+||+|+|+++|.||++..+...... .........++++++++..+...|..+++|+||++++..| T Consensus 218 ~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~~---~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l 294 (392) T protein:vir:13 218 DAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAAF---GEADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQM 294 (392) T ss_pred HHHHHHHHHHHHHHhcccCCccccccccccccccccc---cccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHH Confidence 9999999999999999999999999998765433322 2223344568899999999999999999999999999999 Q ss_pred HHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEE Q lcl|Aclame:pro 317 ELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAE 396 (419) Q Consensus 317 ~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~ 396 (419) ++++|++|+|+| .++++.+.+++|+|+||+++++||++.+++|||++ |+++++.+++++.+.+ .+|.+|++.||++ T Consensus 295 ~~lkd~~G~~l~-~~~~~~g~~~~l~G~Pv~~~~~~~~~~i~~Gdf~~-~~i~~~~~~~i~~~~~--~~~~~~~~~~r~~ 370 (392) T protein:vir:13 295 RKLKDANGQYLW-QSALTVGAPDTFNGKVVETDDGMPADKVLFADLSK-YRVRFAGSLRVDRSVD--AKFSTDQIVYRFL 370 (392) T ss_pred HHhhccCCceee-cCCcCCCCCceecceeeEEcCCCCCCcEEEeeccc-eeEEeecceEEEeecc--ccccCCcEEEEEE Confidence 999999998765 56777788899999999999999999999999996 6778899999988765 4699999999999 Q ss_pred EEeccEEecccceEEEEecCCC Q lcl|Aclame:pro 397 FRANLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 397 ~r~d~~~~~~~a~~~~~~~aa~ 418 (419) .|+|+++++|+||++++++++- T Consensus 371 ~r~d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 371 QRADGLLVDARGAKVLTVTPAA 392 (392) T ss_pred EEeccEEecccceEEEEeeccC Confidence 9999999999999999998888 No 13 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=1.4e-65 Score=376.19 Aligned_cols=402 Identities=14% Similarity=0.098 Sum_probs=289.3 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) ||++.+|+++|+++.++++++.....+.+.+.++..+.++++++++++++.++++++....................... T Consensus 1 M~kl~~L~e~r~~l~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~ 80 (428) T protein:vir:10 1 MPQIEELRRQRAGINEQIQALATIEATNGTLTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKPVKATQHGPAVI 80 (428) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhccccc Confidence 99999999999999999998887666666677888888899999999999988877665544333222111111100000 Q ss_pred hhhhhHHHHhHHHHHHHHHhh-hhhhhhHHHHHHHHH--HhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhh Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARD-KRGQFQVEMRDIDPN--RLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADL 157 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~ 157 (419) .+............+...... .++... ........ ............+..|++++|+.+.+.|++.++..++++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~ 159 (428) T protein:vir:10 81 VKAEPKQYTGAGMTRMVMSIAAAQGNLQ-DAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKL 159 (428) T ss_pred cccccchhhhHHHHHHHHHHHHhhhhHH-HHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchhhhh Confidence 000000000000001111000 000000 00000000 00001111122344677889999999999999999999998 Q ss_pred -cceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHH Q lcl|Aclame:pro 158 -LDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQ 235 (419) Q Consensus 158 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~ 235 (419) ++++|+.++.+.+|+.++ +..++|++||+.+|+++++|++|++.+++++++++||+|+++|+ +++++||. T Consensus 160 ~~~~~~~~~g~~~~p~~~~--------~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~ 231 (428) T protein:vir:10 160 GARSIPLPNGNMSLPRLAG--------GATASYTGENQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVL 231 (428) T ss_pred cceeeecCCcceEEEEEeC--------CcceeeeccCccccccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHH Confidence 788898888899998753 35789999999999999999999999999999999999999987 69999999 Q ss_pred HHHHHHHHHHHHHHHHhccCcc-cccceeccccccccccccccccchhhhHHH---HHHHHHHhhhhhccCCcEEEEehH Q lcl|Aclame:pro 236 GRLTYGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLV---DIRRAKTVAEIAGFPPDGVVVHPQ 311 (419) Q Consensus 236 ~~l~~a~~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ 311 (419) ++|++++++++|++||+|+|++ +|+||++..+................+.++ +...+.......+..++.|+||+. T Consensus 232 ~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~ 311 (428) T protein:vir:10 232 QDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNR 311 (428) T ss_pred HHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHH Confidence 9999999999999999999985 899999876654443333222333223333 333334445566677889999999 Q ss_pred HHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcC--------cEEEEeccceEEEEEecceEEEEeeccc Q lcl|Aclame:pro 312 DWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG--------TALVGGFRQGATLWSRQGITVLMTDSHA 383 (419) Q Consensus 312 ~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~--------~~~~~d~~~~~~~~~~~~~~i~~~~~~~ 383 (419) ++..|++++|++|+|+|.. ..+++|+|+||+++++||.+ .+++|||++ |+++++++++++++++.. T Consensus 312 ~~~~L~~lkd~~G~~i~~~-----~~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~-~~i~~~~~i~i~~~~~~~ 385 (428) T protein:vir:10 312 TYMKLFGLRDGNGNKVYPE-----MAQGMLKGYPIQRTSAIPANLGEGGKESEIYFADFND-VVIGEDGNMKVDFSKEAS 385 (428) T ss_pred HHHHHHHhhccCCceeccC-----CCCCeeeceeeEEeccccccccCCCccceEEEEecce-EEEEEecceEEEeecccc Confidence 9999999999999987632 23458999999999999864 479999996 556789999999988753 Q ss_pred ---------chhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 384 ---------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 384 ---------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) .+|++|+++||++.|+|+++.+|+||++++-..= T Consensus 386 ~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 386 YIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred cccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 5699999999999999999999999999976666 No 14 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=1.3e-64 Score=370.88 Aligned_cols=386 Identities=11% Similarity=0.103 Sum_probs=287.5 Q ss_pred CCcc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MPPT-PTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAG 79 (419) Q Consensus 1 M~~~-~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (419) |..- ++|++.+++++++.++++...++ ..++.++....+..+++.++.++++++..................... T Consensus 1 m~~~lk~l~~~~~el~~~~~~~k~~~~~---~~~~~e~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 76 (401) T protein:vir:44 1 MAVDIKDVEQVAQELQQKFDDFKAKNDK---RVEAIEQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPARGAQ- 76 (401) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc- Confidence 6655 88888888888888777655443 234444455555666666666666655555444333222111100000 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcc Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLD 159 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~ 159 (419) . .. ..+..+.+....+.+. ...+...+.. .... +....|++++|+.+.+.|++.++..++|+++|+ T Consensus 77 -~-~~-----~~e~~~a~~~~lr~~~-~~~~~~~e~~-----a~~~-~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~ 142 (401) T protein:vir:44 77 -N-KV-----AAEHKDAFVGFLRKGR-EDGLRDLERK-----ALQV-GTDEDGGYAVPEELDRSILSLLKDEVVMRQEAT 142 (401) T ss_pred -c-ch-----hHHHHHHHHHHHhhhh-hhhhHHHHHH-----Hhhc-CCCCCCceeccHhHHHHHHHHHHhhhhhhhhce Confidence 0 00 0111111222111111 1111111111 1112 223556789999999999999999999999999 Q ss_pred eecccCcceeeeeeccccceeccccccceeecCcccccccc-cceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHHH Q lcl|Aclame:pro 160 QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST-LSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQGR 237 (419) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~-~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~~ 237 (419) ++|++++.+.+|+..+ +..+.|++|++.+|..+ ++|++|++.++|++++++||+|+++|+. +|++||.++ T Consensus 143 ~~~~~~~~~~~~~~~~--------~~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~ 214 (401) T protein:vir:44 143 VITVGGSDYKKLVNLG--------GTASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSE 214 (401) T ss_pred eeecCCCceEEEEecC--------CccceeeccccccCccccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHH Confidence 9999998888887653 45788999999999765 8999999999999999999999999975 899999999 Q ss_pred HHHHHHHHHHHHHHhccCcccccceecccccccccccc-------ccccchhhhHHHHHHHHHHhhhhhccCCcEEEEeh Q lcl|Aclame:pro 238 LTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPK-------PTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHP 310 (419) Q Consensus 238 l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (419) |++++++++|.+||+|+|+++|+||++........... ..+.......+++++++++.+...|..+++|+||+ T Consensus 215 la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~ 294 (401) T protein:vir:44 215 LATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMMNN 294 (401) T ss_pred HHHHHHHHHHhhhhccCCCCccceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchhhhcCCEEEEcH Confidence 99999999999999999999999999876644332221 12223344568999999999999999999999999 Q ss_pred HHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcC-----cEEEEeccceEEEEEecceEEEEeecccch Q lcl|Aclame:pro 311 QDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG-----TALVGGFRQGATLWSRQGITVLMTDSHADF 385 (419) Q Consensus 311 ~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~-----~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~ 385 (419) +++..|++++|.+|+|+| ++++..+.+++|+|+||+++++||.. .+++|||+++|.+++|.++++..+ .+ T Consensus 295 ~~~~~L~~lkd~~G~~l~-~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~----~~ 369 (401) T protein:vir:44 295 NSLFAIRLLKDTEGNYLW-RPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRD----PY 369 (401) T ss_pred HHHHHHHHhhccCCceee-cCCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEeee----cc Confidence 999999999999999764 56677788889999999999999852 278899999999999999888754 35 Q ss_pred hhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 386 FTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 386 ~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) |.+|++.||++.|+|+++++|+||+++++++| T Consensus 370 ~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 370 TNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred ccCCcEEEEEEEEeccEEecccceEEEEeecC Confidence 78999999999999999999999999999999 No 15 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=9.2e-65 Score=371.66 Aligned_cols=386 Identities=16% Similarity=0.078 Sum_probs=291.4 Q ss_pred CCc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|Aclame:pro 1 MPP--TPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEA 78 (419) Q Consensus 1 M~~--~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 78 (419) |+. +++|+++++++.+++..+.....+ +.+.++.++.++.++++++.+++++++..+.....+.............. T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~L~~~~~~-~~lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (390) T protein:vir:62 1 MDATTLSANFEARERATAELRTLTDEFAG-KEMTDEAREKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSGS 79 (390) T ss_pred CChhHHHHHHHHHHHHHHHHHHHHHHhhc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 654 467777777777766665544332 45778888899999999999999887765555443322221111111111 Q ss_pred chhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhc Q lcl|Aclame:pro 79 GTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL 158 (419) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~ 158 (419) ...+.. .......++... .+ ..+..... .....++.+.++++++|+.....|...++....++.+| T Consensus 80 ~~~~~~--~~~~~~~~r~~~----~~----~~r~~~~~----~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~ 145 (390) T protein:vir:62 80 GAQRSA--DVDDDATLRAGN----LG----EARSFEFA----PEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGA 145 (390) T ss_pred cchhhc--chHHHHHHhhhh----hh----hhHHHHhh----hhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcc Confidence 111000 000111111110 00 11111110 11112334456678888888899999999999999999 Q ss_pred ceecccCc-ceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHH Q lcl|Aclame:pro 159 DQQNADYN-VLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQG 236 (419) Q Consensus 159 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~ 236 (419) +++++++. .+.+|+.++ ...+.||+|++.+|+++++|++++++++|++++++||+|+++|+. ++++||.+ T Consensus 146 ~~~~~~~~~~~~~p~~~~--------~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~ 217 (390) T protein:vir:62 146 TTFTTSDANPLDFTVITG--------RSSASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVS 217 (390) T ss_pred eeeecCCCceeEEEEEcC--------CcceeeecccccccccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHH Confidence 99998654 578888764 346899999999999999999999999999999999999999985 89999999 Q ss_pred HHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHH Q lcl|Aclame:pro 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESI 316 (419) Q Consensus 237 ~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 316 (419) +|+++++.++|.+||+|+| +|.||++........... ...+...++++++++..+...|..+++|+||++++..| T Consensus 218 ~l~~~i~~~~d~~~l~G~G--~p~Gi~~~~~~~~~~~~~---~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L 292 (390) T protein:vir:62 218 DAGPAIGDAMGRHFITGTG--QPRGILTDASPATATFLA---TDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQM 292 (390) T ss_pred HHHHHHHHHHHhhhhccCC--ccccccccccccccceec---ccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHH Confidence 9999999999999999987 699999876544332222 22334568889999999999999999999999999999 Q ss_pred HHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEE Q lcl|Aclame:pro 317 ELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAE 396 (419) Q Consensus 317 ~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~ 396 (419) +++||.+|+|+| ++++..+.+.+|+|+||++++.+|++.+++|||++ |+++++.++++..+.+ .+|.+|++.||++ T Consensus 293 ~~lkd~~g~~l~-~~~~~~g~~~~l~G~Pv~~~~~~p~~~i~~gd~s~-~~i~~~~~~~v~~~~~--~~~~~~~~~~~~~ 368 (390) T protein:vir:62 293 RKLKDANGQYLW-QSGLTVGAPSLFNGKVVETDDGMPADKILFADLSK-YRVRFAGSLRVDRSVD--AKFSTDQIVYRFL 368 (390) T ss_pred HHhhccCCCeee-cCCcCCCccceecccceEEecCCCCccEEEeeccc-eeEEeecceEEEeecc--ccccCCcEEEEEE Confidence 999999999754 56677788889999999999999999999999996 6678899999998875 4699999999999 Q ss_pred EEeccEEecccceEEEEecCCC Q lcl|Aclame:pro 397 FRANLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 397 ~r~d~~~~~~~a~~~~~~~aa~ 418 (419) .|+|+++++|+||++++++++. T Consensus 369 ~r~d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 369 QRADGLLVDARGAKVLTVTPGA 390 (390) T ss_pred EEeCcEeechhheEEEEeecCC Confidence 9999999999999999999999 No 16 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=7.2e-64 Score=366.76 Aligned_cols=381 Identities=14% Similarity=0.112 Sum_probs=274.1 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTT-EQVQEIVAEARGLADALQAESD---------RAAARAALLRTAPPAPKGPADGG 70 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~-~~~~~~~~e~~~~~~~~~~~~~---------~~~~~~~~l~~~~~~~~~~~~~~ 70 (419) || +.|.|.|++..+++.++.... ++.+++.++.++..++++++.. ++..+++.++...+......... T Consensus 21 ~~--~~l~e~ra~~~~e~~~l~~~~~~~~~~~k~~~~~~~~~~~~~~~~~e~~~~~~~~~~ei~~~~~~~~~~~~~~~~~ 98 (425) T protein:vir:10 21 VP--RGIISVRAEGPTEVKALIENLQKAFHDFKAEHTKQLDAVKAGLPTSDALAKVDKVSADLEALQAAVDEANIKIAAA 98 (425) T ss_pred hh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 55 666677666655554443332 2334444444444444443322 22222222222221111111000 Q ss_pred ccccccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhh Q lcl|Aclame:pro 71 TPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDL 150 (419) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~ 150 (419) ... .... ......+..+.+......+.. .+... .++.+.|++++|+.+.+.|++.++. T Consensus 99 ~~~-----~~~~---~~~~~~~~~~af~~~l~~~e~-------------~~al~-~~t~~~gG~lvP~~~~~~ii~~~~~ 156 (425) T protein:vir:10 99 QMG-----ANGV---KPLRDPEYTEAFKAHVKRGDV-------------QAALN-KGEDSEGGYLTPIEWDRTITNKLVL 156 (425) T ss_pred hcc-----cccc---cccccHHHHHHHHHHhhhhhh-------------HHHhh-cCcCCCCceeccHhHHHHHHHHHHh Confidence 000 0000 000011111122221111110 01111 2234567788999999999999999 Q ss_pred hhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccc-cceeeEEeeeEEEEEeehhhHHHHhhH-H Q lcl|Aclame:pro 151 PLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST-LSFDTITTTLKTVAHWLPITRQAADDN-S 228 (419) Q Consensus 151 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~-~~~~~v~~~~~k~~~~~~vs~ell~d~-~ 228 (419) .++|+++|+++|++++.+++|+.++ +..+.|++|++.+|+++ ++|++++++++|++++++||+|+++|+ + T Consensus 157 ~s~l~~l~~~~~~~~~~~~~~~~~~--------~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~ 228 (425) T protein:vir:10 157 ISPMRQLCRVQPVSKAGFSKLFNMG--------GTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAEI 228 (425) T ss_pred hhhhhhhceeeeccCCceEEEEEcC--------CcceeeeccccccccccccccceeeeeheeeEeehHhHHHHHhcchh Confidence 9999999999999999999998653 34789999999999886 799999999999999999999999997 5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccc-------cccccchhhhHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 229 QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQP-------KPTAPATDEPPLVDIRRAKTVAEIAGF 301 (419) Q Consensus 229 ~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (419) ++++||.++|++++++++|.+||+|+|+++|.||++.......... ...+.......++++++++..+...|+ T Consensus 229 ~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~ 308 (425) T protein:vir:10 229 DLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFT 308 (425) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhc Confidence 9999999999999999999999999999999999987654333221 122334455678999999999999999 Q ss_pred CCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCc-----CcEEEEeccceEEEEEecceEE Q lcl|Aclame:pro 302 PPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQ-----GTALVGGFRQGATLWSRQGITV 376 (419) Q Consensus 302 ~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~-----~~~~~~d~~~~~~~~~~~~~~i 376 (419) .+++|+|||+++..|++++|++|+|+| +++...+.+++|+|+||+++++||. ..++||||+++|++++|.++++ T Consensus 309 ~~a~~vmn~~~~~~L~~lkD~~G~~l~-~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v 387 (425) T protein:vir:10 309 GNARFAMNRNTQRQVRKLKDGQGNYLW-QPSYVAGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRV 387 (425) T ss_pred cCCEEEEchHHHHHHHHhhcCCCceee-ccCccCCCCceecceeeEEecCcCCccCCccEEEEEehhccEEEEEecceEE Confidence 999999999999999999999999765 4566777889999999999999985 3478999999999999988877 Q ss_pred EEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|Aclame:pro 377 LMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 377 ~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~ 418 (419) ..+ .+|.+|++.||++.|+|+++++|+||++++++++= T Consensus 388 ~~d----~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 388 LRD----PYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred Eec----ccccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 653 34779999999999999999999999999999999 No 17 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=2.4e-62 Score=358.43 Aligned_cols=400 Identities=22% Similarity=0.272 Sum_probs=268.6 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--ccccccccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADG--GTPLTPAEA 78 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~--~~~~~~~~~ 78 (419) |+.... ...++.+++.+++++..++.....++.++..++++...+.++................... ......... T Consensus 2 ~ke~~~--~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (413) T protein:vir:81 2 VKEAGD--APTNAQVAEIAEVKSMVEQFKADEDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRKGEGYKSIGE 79 (413) T ss_pred hhhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhhhhhhhhhhh Confidence 222221 1122233333444444444444333333333333333333322221111111111111000 000000000 Q ss_pred chhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhc Q lcl|Aclame:pro 79 GTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL 158 (419) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~ 158 (419) ...+..... ........................... ....++.+..++.++|+.+.+.|+..+...++|+++| T Consensus 80 ~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~ 152 (413) T protein:vir:81 80 FFAKRAGDQ------IKQQAGGAQLNYSVGEYVAPRVKAASD-PASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLM 152 (413) T ss_pred hhhhhhhhH------HHHHHHHHHhhhhhhhhhhhHHHhhhh-hhhhcccccccccccchhhHHHHHHHHhhhhhHHhhc Confidence 000111110 000000000000001111111111111 1122334566777889999999999999999999999 Q ss_pred ceecccCcceeeeeeccccceeccccccceeecCcccccccc-cceeeEEeeeEEEEEeehhhHHHHhhHHHHHHHHHHH Q lcl|Aclame:pro 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST-LSFDTITTTLKTVAHWLPITRQAADDNSQLMGYIQGR 237 (419) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~-~~~~~v~~~~~k~~~~~~vs~ell~d~~~~~~~i~~~ 237 (419) +++|++++.++||+.+.... ....++||+||+.+|+++ ++|+++++.++|++++++||+|+|+|+++|++||+++ T Consensus 153 ~~~~~~~~~~~~~~~~~~~~----~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~l~~~i~~~ 228 (413) T protein:vir:81 153 DNLTMTNTTIKYLMEKANRV----VEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDYDFLVSYINAR 228 (413) T ss_pred ceeeccCCceeEEEeccccc----cccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHHHHHHHHHHHH Confidence 99999999999998775331 234689999999999988 6899999999999999999999999998999999999 Q ss_pred HHHHHHHHHHHHHHhccCccc-ccceeccccccccccccccccchhhhHHHHHHHHHHhhhhh-ccCCcEEEEehHHHHH Q lcl|Aclame:pro 238 LTYGLRFLRDRQLLNGNGSTE-MQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIA-GFPPDGVVVHPQDWES 315 (419) Q Consensus 238 l~~a~~~~~d~~il~G~g~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 315 (419) |++++++++|++||+|+|+++ |.||++.++..+.... +....++++..++..+... .+.+++|+|||++|.. T Consensus 229 la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~------~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~ 302 (413) T protein:vir:81 229 LLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVS------NKDELADSIYKAMTNISLATPFQADALVINPLDYQE 302 (413) T ss_pred HHHHHHHHHHHHHhccCCCCCccccccccccccccccc------ccchhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHH Confidence 999999999999999999975 5899987765433222 2234567777777665543 3455679999999999 Q ss_pred HHHHhccCCceeccCCc------cccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcC Q lcl|Aclame:pro 316 IELDQAPGSGVFRVIAN------VQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTAN 389 (419) Q Consensus 316 l~~~kd~~g~~~~~~~~------~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 389 (419) |+++||++|+|+|.+.. .....+++|+|+||+++++||++.+++|||+++|+++++.+++++++++..++|++| T Consensus 303 l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~ 382 (413) T protein:vir:81 303 LRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPVVGAFRSAASVLRKGGVRIDSTNTNVDDFENN 382 (413) T ss_pred HHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCcccEEEEecccEEEEEEecceEEEEeccccchhhcC Confidence 99999999998774322 122345689999999999999999999999999999999999999999988899999 Q ss_pred cEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 390 TLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 390 ~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ++.||++.|+|+++++|+||+++++++++| T Consensus 383 ~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 412 (413) T protein:vir:81 383 LITVRAEERVGLMVTFPEAIVQLDVAEVVT 412 (413) T ss_pred cEEEEEEEeeccEEecccceEEEEecCCCC Confidence 999999999999999999999999999999 No 18 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=8.8e-63 Score=360.78 Aligned_cols=402 Identities=14% Similarity=0.101 Sum_probs=291.6 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc- Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAG- 79 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~- 79 (419) |.. ++|+++|+++.++++++.....+.+.+.++..+.++++++++++++.+++++++..................... T Consensus 1 M~l-~eL~~~r~~~~~~~~~l~~~~~e~~~l~~ee~~~~~~l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~ 79 (435) T protein:vir:80 1 MNV-NELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAVTAS 79 (435) T ss_pred CCH-HHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhccc Confidence 876 899999999999998887666666678888888999999999999999988876544322221111110000000 Q ss_pred h-----hhhhhHHHHhHHHHHHHHHhh-hhhhhhHHHHHHHHHH--hhhcccccccccCCcccccchhhhHHHHHhhhhh Q lcl|Aclame:pro 80 T-----FRSLAQRFADSDGLREYRARD-KRGQFQVEMRDIDPNR--LLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLP 151 (419) Q Consensus 80 ~-----~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~ 151 (419) . .+............+...... .++............. ..............|++++|+.+.+.|++.++.. T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~ 159 (435) T protein:vir:80 80 AAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPK 159 (435) T ss_pred cccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhh Confidence 0 000000000000000000000 0000000000000000 0111111223345577889999999999999999 Q ss_pred hhHHhh-cceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH--- Q lcl|Aclame:pro 152 LLVADL-LDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN--- 227 (419) Q Consensus 152 ~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~--- 227 (419) ++++++ ++++|+.++.++||+.++ +..+.||+|++.+|+++++|++|++.++|++++++||+|+++|+ T Consensus 160 ~~i~~~~~~~v~~~~~~~~~p~~~~--------~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~ 231 (435) T protein:vir:80 160 SVVRKLGARTLPLSNGNITIPRLKG--------GAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVN 231 (435) T ss_pred chhhhccceeeecCCCceEEEEEeC--------CcceeeeccCccccccccceeeEEEeeEEEEEeehhhHHHHHhhccc Confidence 999998 889999888999998764 34789999999999999999999999999999999999999986 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCcc-cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhh--ccCCc Q lcl|Aclame:pro 228 SQLMGYIQGRLTYGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIA--GFPPD 304 (419) Q Consensus 228 ~~~~~~i~~~l~~a~~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 304 (419) +++++||.++|++++++++|.+||+|+|++ +|.||++..+....... ....+....+.++.+++..+... ++.++ T Consensus 232 ~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~--~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 309 (435) T protein:vir:80 232 PNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITA--SDGSTLQKIETDLGKAILALENADANLTQP 309 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeec--ccccchhhHHHHHHHHHHHhhccccccccC Confidence 369999999999999999999999999985 79999987655443322 22233334456677777666544 55678 Q ss_pred EEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcC--------cEEEEeccceEEEEEecceEE Q lcl|Aclame:pro 305 GVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG--------TALVGGFRQGATLWSRQGITV 376 (419) Q Consensus 305 ~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~--------~~~~~d~~~~~~~~~~~~~~i 376 (419) +|+|||.++..|++++|++|+|+|. . ..+++|+|+||++++.||.+ .+++|||++ |+++++.++++ T Consensus 310 ~~vmn~~~~~~L~~lkd~~G~~l~~--~---~~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~-~~i~~~~~~~i 383 (435) T protein:vir:80 310 GWIMAPRTFRFLEGLRDGNGNKVYP--E---LANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFGD-VFIGEEETLEI 383 (435) T ss_pred EEEEcHHHHHHHHhhhccCCceecc--C---CCCCeEeeeeeEEeccccccccCCCCcceEEEEEccc-EEEEeecceEE Confidence 9999999999999999999998763 1 23568999999999999863 589999998 55788999999 Q ss_pred EEeeccc---------chhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 377 LMTDSHA---------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 377 ~~~~~~~---------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +++++.. .+|++|++.||++.|+||++++|+||++++-.+-.. T Consensus 384 ~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 384 DYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred EEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 9988764 569999999999999999999999999999888777 No 19 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=1.3e-62 Score=359.85 Aligned_cols=401 Identities=13% Similarity=0.083 Sum_probs=290.7 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-- Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEA-- 78 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-- 78 (419) |+ +++|+++++++.++++++.....+.+.+.++.++.++.++++++.++.+++++++.................... T Consensus 1 M~-i~eL~e~r~~~~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~~ 79 (435) T protein:vir:14 1 MN-VNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPVDPNPTAVAAP 79 (435) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhhhc Confidence 75 678999999999999888776667777888888899999999999999998877665433322111111000000 Q ss_pred ----chhhhhhHHHHhHHHHHHHHHh--hhhhhhhHHHHHHHH--HHhhhcccccccccCCcccccchhhhHHHHHhhhh Q lcl|Aclame:pro 79 ----GTFRSLAQRFADSDGLREYRAR--DKRGQFQVEMRDIDP--NRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDL 150 (419) Q Consensus 79 ----~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~ 150 (419) ...+........ ..+..+... ..++......+.... ................|++++|+.+...|++.++. T Consensus 80 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~ 158 (435) T protein:vir:14 80 AAAPVHAQPKALEVKG-AKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRP 158 (435) T ss_pred cccccccccchhhhhH-HHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhh Confidence 000000000000 000000000 000000000000000 00111111122233456788999999999999999 Q ss_pred hhhHHhh-cceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-- Q lcl|Aclame:pro 151 PLLVADL-LDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-- 227 (419) Q Consensus 151 ~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-- 227 (419) .++++++ ++.+|+.++.+++|+.++ +..++||+|++.+|+++++|++|++.++|++++++||+|+++|+ T Consensus 159 ~~~i~~~~~~~~~~~~~~~~~p~~~~--------~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~ 230 (435) T protein:vir:14 159 KSVVRKLGARTLPLSNGNITIPRLKG--------GAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGV 230 (435) T ss_pred hchhhhhcceeeecCCCceEEEEEeC--------CcceeeeccCccccccccceeEEEeeeEEEEEeehhhHHHHHhhcc Confidence 9999987 888998888899998764 34789999999999999999999999999999999999999997 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHhccCcc-cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhh--ccCC Q lcl|Aclame:pro 228 -SQLMGYIQGRLTYGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIA--GFPP 303 (419) Q Consensus 228 -~~~~~~i~~~l~~a~~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~ 303 (419) ++|++||.++|++++++++|++|++|+|++ +|.||++........... ...+....+.++.+++..+... ++.+ T Consensus 231 ~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 308 (435) T protein:vir:14 231 NPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITAS--DASTLQKIETDLGKVILALENADANLTQ 308 (435) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccc--cccchhhHHHHHHHHHHHhhhccccccC Confidence 469999999999999999999999999985 799998765443332222 2233334455666666665543 5567 Q ss_pred cEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcC--------cEEEEeccceEEEEEecceE Q lcl|Aclame:pro 304 DGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG--------TALVGGFRQGATLWSRQGIT 375 (419) Q Consensus 304 ~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~--------~~~~~d~~~~~~~~~~~~~~ 375 (419) ++|+|||.++..|++++|.+|+|+|. . ..+++|+|+||++++.||.+ .+++|||++ |++++|.+++ T Consensus 309 ~~~v~n~~~~~~L~~lkd~~G~~l~~--~---~~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~-~~i~~~~~~~ 382 (435) T protein:vir:14 309 PGWIMAPRTFRFLEGLRDGNGNKVYP--E---LANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGD-VFIGEEETLE 382 (435) T ss_pred CEEEEcHHHHHHHHHhhccCCceecc--C---CCCCeeecceeEeeccccccccCCCccceEEEeeccc-EEEEEecccE Confidence 89999999999999999999998763 2 23568999999999999863 589999997 5578899999 Q ss_pred EEEeeccc---------chhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 376 VLMTDSHA---------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 376 i~~~~~~~---------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +.++++.. .+|++|++.||++.|+||++++|+||++++-.+..+ T Consensus 383 ~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 383 IDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred EEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 99998754 569999999999999999999999999999999988 No 20 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=6.4e-62 Score=356.07 Aligned_cols=376 Identities=14% Similarity=0.137 Sum_probs=283.5 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |+++.+|+++++++.++.+++.+..++...-.+...+.+++++++++.++++.+.+.+..................... T Consensus 1 Mk~~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 79 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKPL- 79 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc- Confidence 9999999999999999998887766655433333344566777777777776666655544433332221111111100 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) ... .........+.+..... +.... ........+.+.|++++|+.+...|+..++..++|+++|++ T Consensus 80 ~~~--~~~~~~~~~~~~~~~l~-~~~~~-----------~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~ 145 (397) T protein:vir:49 80 TKN--EEEVKANFVKDFKNLVR-GRYQN-----------LLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNV 145 (397) T ss_pred cch--hhHHHHHHHHHHHHHhh-cchhh-----------HHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcce Confidence 000 00111112222222211 11100 00111123345677889999999999999999999999999 Q ss_pred ecccCccee--eeeeccccceeccccccceeecCcccccccc-cceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLE--YIRDTSGTAGAGSTWNKAAVVPEGTAKPQST-LSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQG 236 (419) Q Consensus 161 ~~~~~~~~~--~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~-~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~ 236 (419) .+++++..+ +++... ..+.+.||+|++.+|+++ ++|++|++++++++++++||+++++|+. ++++||.+ T Consensus 146 ~~~~~~~~~~~~~~~~~-------~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~ 218 (397) T protein:vir:49 146 ENVTTLTGSRVYEKWAD-------ITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLSG 218 (397) T ss_pred eeccCCcceEEEEeecc-------CCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHH Confidence 998776544 444322 234688999999999876 7999999999999999999999999986 89999999 Q ss_pred HHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHH Q lcl|Aclame:pro 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESI 316 (419) Q Consensus 237 ~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 316 (419) +|++++++++|.+||+|+|++.|.+. ...++++.+++..+...|..++.|+|||+++..| T Consensus 219 ~l~~~~~~~~d~ail~G~g~~~~~~~--------------------~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l 278 (397) T protein:vir:49 219 WIAKKVVVTRNKAILEAIGTLPNKPT--------------------LAKWDDIIDLQAKVDPAIKQTSLFLTNTSGFTAL 278 (397) T ss_pred HHHHHHHHHHHHHHHhcccccccccc--------------------ccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHH Confidence 99999999999999999998765431 1237888999999999999999999999999999 Q ss_pred HHHhccCCceeccCCccccCCCcccccceeEecC--CCCc-----CcEEEEeccceEEEEEecceEEEEeecccchhhcC Q lcl|Aclame:pro 317 ELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTV--AIAQ-----GTALVGGFRQGATLWSRQGITVLMTDSHADFFTAN 389 (419) Q Consensus 317 ~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~--~~~~-----~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 389 (419) ++++|++|+|+| .+++.++.+++|+|+||++++ .+|. ..+++|||+++|+++++.+++++++++.+.+|.+| T Consensus 279 ~~lkd~~g~~l~-~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 357 (397) T protein:vir:49 279 KKVKNAMGDYLM-ERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETD 357 (397) T ss_pred HHhhccCCceee-cccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEeccccchhhcC Confidence 999999999865 456677888999999998754 4453 35899999999999999999999999888899999 Q ss_pred cEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 390 TLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 390 ~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ++.||++.|+|+++++|+||++++++++.| T Consensus 358 ~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~ 387 (397) T protein:vir:49 358 TTKVRVIDRFDVVSTDTEAFVPASFKAIAD 387 (397) T ss_pred eeeEEEEEeeccEEecccceEEEEeccccc Confidence 999999999999999999999999999888 No 21 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=1.5e-61 Score=354.03 Aligned_cols=376 Identities=15% Similarity=0.166 Sum_probs=267.3 Q ss_pred CCccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSL-TTEQVQEIVAEARGLAD-ALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEA 78 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~-~~~~~~~~~~e~~~~~~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 78 (419) |. .++++++++++.+++++... ..++.+.+.++.+.... ..+.+.+++..+...+++..+..+......... . T Consensus 1 m~-~~e~~~~~~~~~~~l~~~~~~~~~e~~~~~e~~~~~~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~~----~ 75 (379) T protein:vir:10 1 ME-ALEIKVALEAIKGQVDSKSSAQALEVKGLIEALEAKMTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAKS----E 75 (379) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc----c Confidence 76 66677776666655543322 23333333333332222 122233444444444444433322222111111 1 Q ss_pred chhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhc Q lcl|Aclame:pro 79 GTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL 158 (419) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~ 158 (419) ...+......... ..... ..+......... ......++.++..+|+.+...|+..+...+.++++| T Consensus 76 ~~~~~~~~~~~~~---~~~~~---------~~~~~~~~~~~~--~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~ 141 (379) T protein:vir:10 76 DKSDSLVKSITEN---FNDIK---------EVRNGKSIQVKA--VGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIV 141 (379) T ss_pred ccchhHHHHHHHH---HHhHH---------HHHhhhhhhhhh--hcccccCCCCccccchhhhhHHHHhHHhhhhHHhhc Confidence 1111111111000 00000 000100001111 111122344444678889999999999999999999 Q ss_pred ceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHHHHHHHHHHHH Q lcl|Aclame:pro 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNSQLMGYIQGRL 238 (419) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~~~~~~i~~~l 238 (419) +++++.++.++||+.++.+ ...+.|++||+.+|+++++|++|+++++|++++++||+|+++|++++++||.++| T Consensus 142 ~~~~~~~~~~~~~~~~~~~------~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~~l~~~i~~~l 215 (379) T protein:vir:10 142 GAVSISGGTYTFVRENGAG------EGAIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNLPFLTSFIPNAL 215 (379) T ss_pred eeeeccCCceEEEEeecCC------CcccccccCCccccccccceeeeEeeeeeEEeeehhhHHHHhhHHHHHHHHHHHH Confidence 9999999999999987533 3467799999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHH Q lcl|Aclame:pro 239 TYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIEL 318 (419) Q Consensus 239 ~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 318 (419) ++++++++|.+|+.|.|++.+.+... .+....++++.+++..+...++.+++|+|||.+|..|++ T Consensus 216 a~~~~~~~~~~~~~g~~~~~~~~~~~---------------~~~~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~ 280 (379) T protein:vir:10 216 RRDYAKAENAAFNAVLAANATASTEI---------------ITNKNKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILV 280 (379) T ss_pred HHHHHHHHHHHHhccccccccccccc---------------ccCcccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH Confidence 99999999999999988765443221 112234678899999999999999999999999999999 Q ss_pred HhccCCceeccCC-ccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEE Q lcl|Aclame:pro 319 DQAPGSGVFRVIA-NVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEF 397 (419) Q Consensus 319 ~kd~~g~~~~~~~-~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~ 397 (419) +||++|+|++.++ ...++.+.+|+|+||++++.||++++++|||++++++ .+.++.++++++..++|.+|++.||++. T Consensus 281 lkd~~G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~~~~~-~~~~~~i~~~~~~~~~f~~~~~~~r~~~ 359 (379) T protein:vir:10 281 TQKSVGAGYGLPGVVTQDNGVLRINGIPLFRATWLAANKYYVGDWTRVTKV-TTEGLSLEFSEVEGTNFVKNNITARIEA 359 (379) T ss_pred hhccCCceeccCCccCCCCCcceecceeeEecCCCCCCceEEeecccEEEE-EEeceEEEEeecccccccCCcEEEEEEE Confidence 9999999876533 2235666799999999999999999999999986554 5788999999988888999999999999 Q ss_pred EeccEEecccceEEEEecCC Q lcl|Aclame:pro 398 RANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 398 r~d~~~~~~~a~~~~~~~aa 417 (419) |+|+++++|+||+++++++. T Consensus 360 R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 360 QVALAVEQPAALIFGDFTAV 379 (379) T ss_pred EeccEEecCccEEEEEecCC Confidence 99999999999999999999 No 22 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=1.2e-61 Score=354.61 Aligned_cols=376 Identities=14% Similarity=0.124 Sum_probs=274.6 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |+++.+|++.++++.++++.+.+..++...-.....+.+++++.+++.+.++++.++...................... T Consensus 1 Mk~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~- 79 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKKPL- 79 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc- Confidence 9999999999988888877665544332111111123345555566665555555444443332222111111110000 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) . ...........+.+......+. . .... .....+.+.|++++|+.+...|+..++..++|+++|++ T Consensus 80 ~--~~~~~~~~~~~~~~~~~l~~~~-~--------~~~~---~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~ 145 (397) T protein:vir:49 80 T--KSEEEVKAGFVKDFKNLVRGRY-Q--------NLLD---SKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNV 145 (397) T ss_pred c--cchhHHHHHHHHHHHHHHhcch-h--------HHHH---HhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhce Confidence 0 0011111122222222211110 0 0000 01123345678899999999999999999999999999 Q ss_pred ecccCccee--eeeeccccceeccccccceeecCcccccc-cccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLE--YIRDTSGTAGAGSTWNKAAVVPEGTAKPQ-STLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQG 236 (419) Q Consensus 161 ~~~~~~~~~--~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~ 236 (419) .|+++.... +|+... ....+.||+|++.+|+ ++++|+++++++++++++++||+|+++|+ .++++||.+ T Consensus 146 ~~~~~~~~~~~~~~~~~-------~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~ 218 (397) T protein:vir:49 146 ENVTTLTGSRVYEKWTD-------ITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSG 218 (397) T ss_pred eecccCccceEEEeecc-------CCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHH Confidence 998765544 444322 2346899999999997 57999999999999999999999999997 589999999 Q ss_pred HHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHH Q lcl|Aclame:pro 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESI 316 (419) Q Consensus 237 ~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 316 (419) +|++++++++|.+||+|+|++.+.+.. ..++++.+++..+...+..+++|+|||+++..| T Consensus 219 ~l~~~~~~~~d~ai~~G~g~~~~~~~~--------------------~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l 278 (397) T protein:vir:49 219 WIAKKVVVTRNKAILEAIAALPTKPTL--------------------TKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTAL 278 (397) T ss_pred HHHHHHHHHHHHHHHhhcccccccccc--------------------ccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHH Confidence 999999999999999999987654321 237888999999999999999999999999999 Q ss_pred HHHhccCCceeccCCccccCCCcccccceeEecC--CCCc-----CcEEEEeccceEEEEEecceEEEEeecccchhhcC Q lcl|Aclame:pro 317 ELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTV--AIAQ-----GTALVGGFRQGATLWSRQGITVLMTDSHADFFTAN 389 (419) Q Consensus 317 ~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~--~~~~-----~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 389 (419) +++||++|+|+| ++++.++.+++|+|+||++++ .+|. ..+++|||+++|.++++.+++++++++.+.+|.+| T Consensus 279 ~~lkd~~G~~l~-~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 357 (397) T protein:vir:49 279 KKVKNALGDYLM-ERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGGGAFETD 357 (397) T ss_pred HHhhcCCCceee-ccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEeccccchhhcC Confidence 999999999765 556777888999999998754 3443 34899999999999999999999999888899999 Q ss_pred cEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 390 TLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 390 ~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ++.||++.|+|+++++|+||++++++++.+ T Consensus 358 ~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~ 387 (397) T protein:vir:49 358 TTKVRVIDRFDVVATDTEAFVPASFKAIAD 387 (397) T ss_pred ceeEEEEeeeCcEEecccceEEEEeecccC Confidence 999999999999999999999999999888 No 23 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=2.3e-61 Score=353.02 Aligned_cols=397 Identities=11% Similarity=0.113 Sum_probs=285.0 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-c Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEA-G 79 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~ 79 (419) |++.++|++++.++++++.+...... +.+.++..+.++.+++++++++.+++++++..+..+.............. . T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~--~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:47 1 MKTKEELQSEISDIKRQIDLKVKYAT--RALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHH--HHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 99888888877777666544433222 12233334455666667777777666665555444333222211111111 1 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcc-cccccccCCcccccchhhhHHHHHhhhhhhhHHhhc Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRD-APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL 158 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~ 158 (419) ..+.................... ...+.+.+......... ......+.+++.++|+.+.+.|++.+++.++|+++| T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:47 79 EARTYRNQANINDLGISIQNTKV---TSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred hhhhhHHHHHHHHHHHhhhhhhh---hHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhc Confidence 11111111111111111111111 11111222222111111 122233567788999999999999999999999999 Q ss_pred ceecccCcceeeeeeccccceeccccccceeecCccccccc-ccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHH Q lcl|Aclame:pro 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQG 236 (419) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~ 236 (419) +++++.++..++|+... .....+.|++|++.+|+. .++|++|++.+++++++++||+++++|+. ++++||++ T Consensus 156 ~~~~~~~~~~~~~~~~~------~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~ 229 (415) T protein:vir:47 156 TVKRVTNGSGKYPVVRQ------SEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKL 229 (415) T ss_pred ceeeccCCceeEEEEEe------cCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHH Confidence 99999988877776542 233478899999999975 58999999999999999999999999975 89999999 Q ss_pred HHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHH Q lcl|Aclame:pro 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESI 316 (419) Q Consensus 237 ~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 316 (419) +|++++++++|.+||+|+|++.|.++...... .......+....++++.+++..+...++.+++|+|||++|..| T Consensus 230 ~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L 304 (415) T protein:vir:47 230 WMARTIAATRNKAIIDVITKGSTGSTSSGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKL 304 (415) T ss_pred HHHHHHHHHHHHHHhhccccCCcccccccccc-----ccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHH Confidence 99999999999999999999877766543221 1122233445568999999999999999999999999999999 Q ss_pred HHHhccCCceeccCCccccCCCcccccceeEecCCCCcC-----cEEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 317 ELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG-----TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 317 ~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~-----~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) ++++|++|+|+| .+++.++.+++|+|+||++++++|.+ .+++|||+++|.++++.++++++++ |.++++ T Consensus 305 ~~lkd~~G~~i~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~-----~~~~~~ 378 (415) T protein:vir:47 305 DKMKDKLGNYLI-QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGE 378 (415) T ss_pred HHhhccCCCeee-ccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeec-----cccCce Confidence 999999999865 55677888899999999999999854 3799999999999999999998765 567788 Q ss_pred EEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .+|++.|+|+++.+|+||++++++++.+ T Consensus 379 ~~~~~~r~d~~v~~~~a~~~~~~~~~~~ 406 (415) T protein:vir:47 379 CLMIAVRQDCRILDYKSAIVIEYDDSER 406 (415) T ss_pred EEEEEEEeccEEeccccEEEEEeeccCC Confidence 9999999999999999999999999999 No 24 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=2.3e-61 Score=353.02 Aligned_cols=397 Identities=11% Similarity=0.113 Sum_probs=285.0 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-c Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEA-G 79 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~ 79 (419) |++.++|++++.++++++.+...... +.+.++..+.++.+++++++++.+++++++..+..+.............. . T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~--~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:46 1 MKTKEELQSEISDIKRQIDLKVKYAT--RALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHH--HHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 99888888877777666544433222 12233334455666667777777666665555444333222211111111 1 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcc-cccccccCCcccccchhhhHHHHHhhhhhhhHHhhc Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRD-APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL 158 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~ 158 (419) ..+.................... ...+.+.+......... ......+.+++.++|+.+.+.|++.+++.++|+++| T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:46 79 EARTYRNQANINDLGISIQNTKV---TSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred hhhhhHHHHHHHHHHHhhhhhhh---hHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhc Confidence 11111111111111111111111 11111222222111111 122233567788999999999999999999999999 Q ss_pred ceecccCcceeeeeeccccceeccccccceeecCccccccc-ccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHH Q lcl|Aclame:pro 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQG 236 (419) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~ 236 (419) +++++.++..++|+... .....+.|++|++.+|+. .++|++|++.+++++++++||+++++|+. ++++||++ T Consensus 156 ~~~~~~~~~~~~~~~~~------~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~ 229 (415) T protein:vir:46 156 TVKRVTNGSGKYPVVRQ------SEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKL 229 (415) T ss_pred ceeeccCCceeEEEEEe------cCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHH Confidence 99999988877776542 233478899999999975 58999999999999999999999999975 89999999 Q ss_pred HHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHH Q lcl|Aclame:pro 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESI 316 (419) Q Consensus 237 ~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 316 (419) +|++++++++|.+||+|+|++.|.++...... .......+....++++.+++..+...++.+++|+|||++|..| T Consensus 230 ~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L 304 (415) T protein:vir:46 230 WMARTIAATRNKAIIDVITKGSTGSTSSGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKL 304 (415) T ss_pred HHHHHHHHHHHHHHhhccccCCcccccccccc-----ccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHH Confidence 99999999999999999999877766543221 1122233445568999999999999999999999999999999 Q ss_pred HHHhccCCceeccCCccccCCCcccccceeEecCCCCcC-----cEEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 317 ELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG-----TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 317 ~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~-----~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) ++++|++|+|+| .+++.++.+++|+|+||++++++|.+ .+++|||+++|.++++.++++++++ |.++++ T Consensus 305 ~~lkd~~G~~i~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~-----~~~~~~ 378 (415) T protein:vir:46 305 DKMKDKLGNYLI-QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGE 378 (415) T ss_pred HHhhccCCCeee-ccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeec-----cccCce Confidence 999999999865 55677888899999999999999854 3799999999999999999998765 567788 Q ss_pred EEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .+|++.|+|+++.+|+||++++++++.+ T Consensus 379 ~~~~~~r~d~~v~~~~a~~~~~~~~~~~ 406 (415) T protein:vir:46 379 CLMIAVRQDCRILDYKSAIVIEYDDSER 406 (415) T ss_pred EEEEEEEeccEEeccccEEEEEeeccCC Confidence 9999999999999999999999999999 No 25 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=1.3e-61 Score=354.32 Aligned_cols=378 Identities=13% Similarity=0.114 Sum_probs=284.6 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |+++.+|++.++++.++++++....+......+...++.++++++++.+.++++.++....................... T Consensus 1 Mk~~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhcccccc Confidence 99999999999999998888776665543333333455667777777777776666655443333222111111111100 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) . .......+..+.+...... . ........ ..++++.+++++|+.+.+.|++.++..++|+++|++ T Consensus 81 ~---~~~~~~~~~~~~~~~~~~~-~--------~~~~~~~~---~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~ 145 (397) T protein:vir:48 81 K---SEEEVKAGFVKDFKNLVRG-R--------YQNLLDSK---TDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNV 145 (397) T ss_pred c---hhhHHHHHHHHHHHHHHhh-h--------hhHHHHHh---hccCCccccccccHHHHHHHHHHHHHHHHHHhhhce Confidence 0 0111111222222222111 1 00111111 122345577889999999999999999999999999 Q ss_pred ecccCcceeeeeeccccceeccccccceeecCccccccc-ccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQGRL 238 (419) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~~l 238 (419) +|+++....++..... .....++|++|++.+|++ +++|++|++++++++++++||+|+++|+. ++++||+++| T Consensus 146 ~~~~~~~~~~~~~~~~-----~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l 220 (397) T protein:vir:48 146 ENVTTLTGSRVYEKWA-----DITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWI 220 (397) T ss_pred eeccCCcceEEEEeec-----CCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHH Confidence 9998877666643321 123468999999999987 58999999999999999999999999975 8999999999 Q ss_pred HHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHH Q lcl|Aclame:pro 239 TYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIEL 318 (419) Q Consensus 239 ~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 318 (419) ++++++++|.+|++|+|++++.+.. ..++++.+++..+...+..++.|+|||+++..|++ T Consensus 221 ~~~~~~~~d~~il~G~g~~~~~~~~--------------------~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~ 280 (397) T protein:vir:48 221 AKKVVVTRNKAILEAIATLPTKPTL--------------------TKWDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKK 280 (397) T ss_pred HHHHHHHHHHHHhhccccccccccc--------------------ccHHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHH Confidence 9999999999999999987654321 23778889999999999999999999999999999 Q ss_pred HhccCCceeccCCccccCCCcccccceeEecC--CCC-----cCcEEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 319 DQAPGSGVFRVIANVQGEATPRIWGLNVVSTV--AIA-----QGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 319 ~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~--~~~-----~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) +||++|+|+| ++++.++.+++|+|+||++++ .+| ...+++|||+++|.++++.+++++++++...+|.+|++ T Consensus 281 lkd~~G~~i~-~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 359 (397) T protein:vir:48 281 VKNAFGDYLM-ERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTT 359 (397) T ss_pred hhcCCCceee-ccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccchhhhhcCce Confidence 9999999865 456777888999999998754 333 34589999999999999999999999988888999999 Q ss_pred EEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .||++.|+|+++++|+||++++++++++ T Consensus 360 ~~r~~~r~d~~~~~~~a~~~~~~~~~~~ 387 (397) T protein:vir:48 360 KIRVIDRFDVVATDTESFVPASFKAIAD 387 (397) T ss_pred eEEEEeeeccEEecccceEEEEeccccc Confidence 9999999999999999999999999988 No 26 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=6.1e-62 Score=356.19 Aligned_cols=395 Identities=11% Similarity=0.073 Sum_probs=290.2 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |. +++|+++++++.+++..+..+.+ .+.+++|.++.+++++++++.++++++++++...................... T Consensus 1 M~-l~eL~e~r~~l~~e~~~l~~k~~-~~~~t~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~ 78 (409) T protein:vir:45 1 MK-LHELKQKRNTIATDMRALNEKIG-DNAWTEEQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDP 78 (409) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHhh-cCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccCCC Confidence 98 68999999988887776655432 23467788888999999999999888877766655444433322221111111 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) ....... ....+.+......+.. .+................+....|++++|+.+.+.|++.++..++|+++|++ T Consensus 79 ~~~~~~~---~~~~~a~~~~l~~~~~--~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~ 153 (409) T protein:vir:45 79 ENNSQQD---EKRAQVFDKWMRHGAS--ELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQI 153 (409) T ss_pred CCcchhh---HHHHHHHHHHHHhhhh--hccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhcee Confidence 1111111 1111111111111100 1111111111111122223345578899999999999999999999999999 Q ss_pred ecccCcc-eeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEE-EeehhhHHHHhhH-HHHHHHHHHH Q lcl|Aclame:pro 161 QNADYNV-LEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVA-HWLPITRQAADDN-SQLMGYIQGR 237 (419) Q Consensus 161 ~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~-~~~~vs~ell~d~-~~~~~~i~~~ 237 (419) ++++++. ..+++..+ ....+.|++|++.+|+++++|+++++.++|++ ++++||+|+++|+ ++|++||.++ T Consensus 154 ~~~~~~~~~~~~~~~~-------~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~ 226 (409) T protein:vir:45 154 LTTSDGRTMEWATADG-------TSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARR 226 (409) T ss_pred eecCCCceEEEEeecc-------CccccccccccccccccccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHH Confidence 9997764 44444332 12356799999999999999999999999986 5789999999997 5999999999 Q ss_pred HHHHHHHHHHHHHHhccCcc---cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEE--EEehHH Q lcl|Aclame:pro 238 LTYGLRFLRDRQLLNGNGST---EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGV--VVHPQD 312 (419) Q Consensus 238 l~~a~~~~~d~~il~G~g~~---~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 312 (419) |+++++.++|++||+|+|++ +|+||++...... .........++++.+++..+...|..++.| +||+.+ T Consensus 227 la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~------~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~ 300 (409) T protein:vir:45 227 IAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTT------QTAAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNT 300 (409) T ss_pred HHHHHHHHHHHHhhccCCCCCccccceeeecccccc------ccccccccchHHHHHHHHhhhhhhccCCeEEEEECHHH Confidence 99999999999999999985 7999987544321 122333456789999999999999888876 679999 Q ss_pred HHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCc-----CcEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 313 WESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQ-----GTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 313 ~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~-----~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) +..|+++||++|+|+| ++++..+.+.+|+|+||+++++||. ..+++|||++ |++.++.++.++++.+ .+|. T Consensus 301 ~~~l~~lkd~~G~~i~-~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~-~~i~~~~~~~~~~~~d--~~~~ 376 (409) T protein:vir:45 301 LKLISEMEDGQGRPLW-LPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDR-FIIRRVRYMILKRLVE--RYAE 376 (409) T ss_pred HHHHHHhhcCCCceee-ccCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhh-hheeeccceEEEEeec--cccc Confidence 9999999999999765 5677778889999999999999985 3478899997 5567788998887764 4688 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +|++.||++.|+|+++++|+||+++++++++- T Consensus 377 ~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s~~ 408 (409) T protein:vir:45 377 YDQTGFLAFHRFDCILEDTSAIKALVGKGSVG 408 (409) T ss_pred CCcEEEEEEEEeccEeechhheEEEEeccCCC Confidence 99999999999999999999999999999999 No 27 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=3.1e-61 Score=352.28 Aligned_cols=401 Identities=12% Similarity=0.061 Sum_probs=258.0 Q ss_pred CCccHH---------------HHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPPTPT---------------LEEQRAALLARLD-----DTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAP 60 (419) Q Consensus 1 M~~~~~---------------L~e~~~~l~~~~~-----~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~ 60 (419) |-++++ +.++..++..... ++....+.......|.++..+...++++.+.++.++..+.. T Consensus 5 ~~~~~~e~~~~e~a~~~~~~~~~~k~~e~~~~~ke~~~~~l~~~~e~~~k~~~E~~~~le~~~ee~k~l~ee~~~~~~~~ 84 (458) T protein:vir:10 5 INKLKEELGLGDLAKSLEGLTAAQKAQEAERMRKEQEEKELARMNDLVSKAVGEDRKRLEEALELVKSLDEKSKKSNELF 84 (458) T ss_pred hhhhhhhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222221 0000000000000 00000000011111112222222222222222222211111 Q ss_pred HHHHHH-----------Hhh----cccccccccchhhh---hhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcc Q lcl|Aclame:pro 61 PAPKGP-----------ADG----GTPLTPAEAGTFRS---LAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRD 122 (419) Q Consensus 61 ~~~~~~-----------~~~----~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 122 (419) ...... ... ............+. ...........+.+.....+.......... ..... T Consensus 85 a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~----~~~~a 160 (458) T protein:vir:10 85 AQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQ----RHLKA 160 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhh----hhhhh Confidence 100000 000 00000000000000 000000000111111111111100000000 00011 Q ss_pred cccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCccccccc--- Q lcl|Aclame:pro 123 APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS--- 199 (419) Q Consensus 123 ~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~--- 199 (419) ...+.....++.++|+.+.+.|++.+...++++++|+++|++++...+|+.++ ...+.||+|++.++++ T Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~--------~~~a~~v~e~~~~~~~~~~ 232 (458) T protein:vir:10 161 VNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPD--------AGKATWVAASTYGTDTTTG 232 (458) T ss_pred hhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecC--------Ccceeeccccccccccccc Confidence 11223345677889999999999999999999999999999999888887653 4578999999988865 Q ss_pred ---ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccc Q lcl|Aclame:pro 200 ---TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPK 275 (419) Q Consensus 200 ---~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~ 275 (419) +++|+++++.++|++++++||+++++|+ +++++||.++|++++++++|.+||+|+|+++|.||++.++........ T Consensus 233 ~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~ 312 (458) T protein:vir:10 233 EEVKGALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVT 312 (458) T ss_pred ccccccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceee Confidence 5689999999999999999999999997 589999999999999999999999999999999999987655443332 Q ss_pred -ccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccC---CccccCCCcccccceeEecCC Q lcl|Aclame:pro 276 -PTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVI---ANVQGEATPRIWGLNVVSTVA 351 (419) Q Consensus 276 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~---~~~~~~~~~~l~G~pv~~~~~ 351 (419) ..........++++.++++.+...|+.++.|+|||.+|..|++++|++|+|++.+ .....+.+++|+|+||+++++ T Consensus 313 ~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~ 392 (458) T protein:vir:10 313 EAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEY 392 (458) T ss_pred cccccccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEccc Confidence 2333344557899999999999999999999999999999999999999987643 334456677999999999999 Q ss_pred CCcC----cEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 352 IAQG----TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 352 ~~~~----~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) ||++ .+++|||.++|.++++.++++..++ ++.+|++.||++.|+|+.+++|+||++.+++++ T Consensus 393 ~p~~~~~~~~~~~~f~~~~~~~~~~~~~v~~d~----~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 393 FPAKANSAEFAVIVYKDNFVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred cccccCCcceEEEEecccEEEEEeeceEEEeec----ccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 9874 5899999988999999999987654 467899999999999999999999999999999 No 28 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=7.1e-61 Score=350.34 Aligned_cols=397 Identities=11% Similarity=0.108 Sum_probs=285.0 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc- Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAG- 79 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~- 79 (419) |+++++|++++.++++++.......+ +.+.++....++.++++++.++.+++++++..+.................. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~--~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:81 1 MKTKEELQSEISDIKRQIDLKVKYAT--RALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHH--HHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 99999999988887777655443322 223444445566777777777777766666555443332222111111111 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcc-cccccccCCcccccchhhhHHHHHhhhhhhhHHhhc Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRD-APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL 158 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~ 158 (419) ..+.................... ...+.+.+......... ......+.+|+.++|+.+.+.|++.++..++|+++| T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:81 79 EARTYRNQANINDLGISIQNTKV---TSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred hhhhHHHHHHHHHHhhhhhhhhh---HHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhhe Confidence 11111110000000001111100 11111111111111111 112234566888999999999999999999999999 Q ss_pred ceecccCcceeeeeeccccceeccccccceeecCccccccc-ccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHH Q lcl|Aclame:pro 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQG 236 (419) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~ 236 (419) +++++.++..+++.... .+...++|++|++++|+. .++|+++++.+++++++++||+|+++|+. ++++||.+ T Consensus 156 ~~~~~~~~~~~~~~~~~------~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~ 229 (415) T protein:vir:81 156 TVKRVTNGSGKYPVVRQ------SEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKL 229 (415) T ss_pred eeeeccCCceeEEEEee------cCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHH Confidence 99999877666655332 234578899999999975 58999999999999999999999999975 89999999 Q ss_pred HHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHH Q lcl|Aclame:pro 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESI 316 (419) Q Consensus 237 ~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 316 (419) +|++++++++|.+|++|+|+++|.+....... ............++++.+++..+...++.++.|+||+++|..| T Consensus 230 ~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l 304 (415) T protein:vir:81 230 WMARTIAATRNKAIIDVITKGSTGSTSSGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKL 304 (415) T ss_pred HHHHHHHHHHHHHHhhccccCccccccccccc-----cccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHH Confidence 99999999999999999999877765443221 1122333445668999999999999999999999999999999 Q ss_pred HHHhccCCceeccCCccccCCCcccccceeEecCCCCcCc-----EEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 317 ELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT-----ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 317 ~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~-----~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) +++||++|+|+| .+++.++.+++|+|+||++++.+|.+. +++|||+++|+++++.++++++++ |.++++ T Consensus 305 ~~lkd~~G~~l~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----~~~~~~ 378 (415) T protein:vir:81 305 DKMKDKLGNYLI-QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGE 378 (415) T ss_pred HHhhccCCceee-ccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----cccCce Confidence 999999999865 456777888999999999999998643 899999998989999999998765 456778 Q ss_pred EEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .+|++.|+|+++++|+||++++++++++ T Consensus 379 ~~~~~~r~d~~v~~~~a~~~~~~~~~~~ 406 (415) T protein:vir:81 379 CLMIAVRQDCRILDYKSAIVIEYDDSER 406 (415) T ss_pred EEEEEEEeccEEeccccEEEEEEeccCC Confidence 8999999999999999999999999999 No 29 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=7.1e-61 Score=350.34 Aligned_cols=397 Identities=11% Similarity=0.108 Sum_probs=285.0 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc- Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAG- 79 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~- 79 (419) |+++++|++++.++++++.......+ +.+.++....++.++++++.++.+++++++..+.................. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~--~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:98 1 MKTKEELQSEISDIKRQIDLKVKYAT--RALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHH--HHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 99999999988887777655443322 223444445566777777777777766666555443332222111111111 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcc-cccccccCCcccccchhhhHHHHHhhhhhhhHHhhc Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRD-APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL 158 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~ 158 (419) ..+.................... ...+.+.+......... ......+.+|+.++|+.+.+.|++.++..++|+++| T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:98 79 EARTYRNQANINDLGISIQNTKV---TSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred hhhhHHHHHHHHHHhhhhhhhhh---HHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhhe Confidence 11111110000000001111100 11111111111111111 112234566888999999999999999999999999 Q ss_pred ceecccCcceeeeeeccccceeccccccceeecCccccccc-ccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHH Q lcl|Aclame:pro 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQG 236 (419) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~ 236 (419) +++++.++..+++.... .+...++|++|++++|+. .++|+++++.+++++++++||+|+++|+. ++++||.+ T Consensus 156 ~~~~~~~~~~~~~~~~~------~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~ 229 (415) T protein:vir:98 156 TVKRVTNGSGKYPVVRQ------SEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKL 229 (415) T ss_pred eeeeccCCceeEEEEee------cCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHH Confidence 99999877666655332 234578899999999975 58999999999999999999999999975 89999999 Q ss_pred HHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHH Q lcl|Aclame:pro 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESI 316 (419) Q Consensus 237 ~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 316 (419) +|++++++++|.+|++|+|+++|.+....... ............++++.+++..+...++.++.|+||+++|..| T Consensus 230 ~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l 304 (415) T protein:vir:98 230 WMARTIAATRNKAIIDVITKGSTGSTSSGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKL 304 (415) T ss_pred HHHHHHHHHHHHHHhhccccCccccccccccc-----cccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHH Confidence 99999999999999999999877765443221 1122333445668999999999999999999999999999999 Q ss_pred HHHhccCCceeccCCccccCCCcccccceeEecCCCCcCc-----EEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 317 ELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT-----ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 317 ~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~-----~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) +++||++|+|+| .+++.++.+++|+|+||++++.+|.+. +++|||+++|+++++.++++++++ |.++++ T Consensus 305 ~~lkd~~G~~l~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----~~~~~~ 378 (415) T protein:vir:98 305 DKMKDKLGNYLI-QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGE 378 (415) T ss_pred HHhhccCCceee-ccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----cccCce Confidence 999999999865 456777888999999999999998643 899999998989999999998765 456778 Q ss_pred EEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .+|++.|+|+++++|+||++++++++++ T Consensus 379 ~~~~~~r~d~~v~~~~a~~~~~~~~~~~ 406 (415) T protein:vir:98 379 CLMIAVRQDCRILDYKSAIVIEYDDSER 406 (415) T ss_pred EEEEEEEeccEEeccccEEEEEEeccCC Confidence 8999999999999999999999999999 No 30 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=7.1e-61 Score=350.34 Aligned_cols=397 Identities=11% Similarity=0.108 Sum_probs=285.0 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc- Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAG- 79 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~- 79 (419) |+++++|++++.++++++.......+ +.+.++....++.++++++.++.+++++++..+.................. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~--~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:79 1 MKTKEELQSEISDIKRQIDLKVKYAT--RALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHH--HHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 99999999988887777655443322 223444445566777777777777766666555443332222111111111 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcc-cccccccCCcccccchhhhHHHHHhhhhhhhHHhhc Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRD-APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL 158 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~ 158 (419) ..+.................... ...+.+.+......... ......+.+|+.++|+.+.+.|++.++..++|+++| T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:79 79 EARTYRNQANINDLGISIQNTKV---TSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred hhhhHHHHHHHHHHhhhhhhhhh---HHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhhe Confidence 11111110000000001111100 11111111111111111 112234566888999999999999999999999999 Q ss_pred ceecccCcceeeeeeccccceeccccccceeecCccccccc-ccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHH Q lcl|Aclame:pro 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQG 236 (419) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~ 236 (419) +++++.++..+++.... .+...++|++|++++|+. .++|+++++.+++++++++||+|+++|+. ++++||.+ T Consensus 156 ~~~~~~~~~~~~~~~~~------~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~ 229 (415) T protein:vir:79 156 TVKRVTNGSGKYPVVRQ------SEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKL 229 (415) T ss_pred eeeeccCCceeEEEEee------cCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHH Confidence 99999877666655332 234578899999999975 58999999999999999999999999975 89999999 Q ss_pred HHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHH Q lcl|Aclame:pro 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESI 316 (419) Q Consensus 237 ~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 316 (419) +|++++++++|.+|++|+|+++|.+....... ............++++.+++..+...++.++.|+||+++|..| T Consensus 230 ~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l 304 (415) T protein:vir:79 230 WMARTIAATRNKAIIDVITKGSTGSTSSGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKL 304 (415) T ss_pred HHHHHHHHHHHHHHhhccccCccccccccccc-----cccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHH Confidence 99999999999999999999877765443221 1122333445668999999999999999999999999999999 Q ss_pred HHHhccCCceeccCCccccCCCcccccceeEecCCCCcCc-----EEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 317 ELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT-----ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 317 ~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~-----~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) +++||++|+|+| .+++.++.+++|+|+||++++.+|.+. +++|||+++|+++++.++++++++ |.++++ T Consensus 305 ~~lkd~~G~~l~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----~~~~~~ 378 (415) T protein:vir:79 305 DKMKDKLGNYLI-QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGE 378 (415) T ss_pred HHhhccCCceee-ccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----cccCce Confidence 999999999865 456777888999999999999998643 899999998989999999998765 456778 Q ss_pred EEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .+|++.|+|+++++|+||++++++++++ T Consensus 379 ~~~~~~r~d~~v~~~~a~~~~~~~~~~~ 406 (415) T protein:vir:79 379 CLMIAVRQDCRILDYKSAIVIEYDDSER 406 (415) T ss_pred EEEEEEEeccEEeccccEEEEEEeccCC Confidence 8999999999999999999999999999 No 31 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=1.4e-60 Score=348.71 Aligned_cols=396 Identities=10% Similarity=0.102 Sum_probs=281.7 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-c Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQE-IVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAE-A 78 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~-~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~ 78 (419) |++.++|++++.++.+++.+.. ++.+. +.++..+.++.++.+++.++.++++++...+................ . T Consensus 1 mk~~~el~~~l~el~~~~~~~~---~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (415) T protein:vir:94 1 MKTKEELQSEISDIKRQIDLKV---KYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEV 77 (415) T ss_pred CChHHHHHHHHHHHHHHHHHHH---HHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 8888888887777666554333 33332 34444555667777777777777666555443333221111111111 1 Q ss_pred chhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhh-cccccccccCCcccccchhhhHHHHHhhhhhhhHHhh Q lcl|Aclame:pro 79 GTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLS-RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADL 157 (419) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~ 157 (419) ........................ ...+.+.+....... ........+.+|+.++|+.+...|++.++..++|+++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~ 154 (415) T protein:vir:94 78 NEASTYRNQANINDLGISIQNTKV---TSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKY 154 (415) T ss_pred cchhhHHHHHHHHHHHhhhhhhhh---hHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhh Confidence 111000000000000000000000 011111111111111 1111223356688899999999999999999999999 Q ss_pred cceecccCcceeeeeeccccceeccccccceeecCccccccc-ccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHH Q lcl|Aclame:pro 158 LDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQ 235 (419) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~ 235 (419) |+++++.++..+++.... .+...+.|++|++++|+. .++|++|++.+++++++++||+|+++|+. ++++||. T Consensus 155 ~~~~~~~~~~~~~~~~~~------~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~ 228 (415) T protein:vir:94 155 VTVKRVTNGSGKYPVVRQ------SEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELK 228 (415) T ss_pred cceeeccCCceeEEEEee------cCCccceeccccccccccccccceeeEeeheeeeeechhhHHHHhhchHHHHHHHH Confidence 999999877666654432 234578899999999975 58999999999999999999999999975 8999999 Q ss_pred HHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHH Q lcl|Aclame:pro 236 GRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWES 315 (419) Q Consensus 236 ~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (419) ++|++++++++|.+|++|+|++.|.++...... .......+....++++.+++..+...++.+++|+|||++|.. T Consensus 229 ~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~ 303 (415) T protein:vir:94 229 LWMARTIAATRNKAIIDVITKGSTGSTSSGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAK 303 (415) T ss_pred HHHHHHHHHHHHHHHhhccccCccccccccccc-----cccccccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHH Confidence 999999999999999999999887766543221 112223334456899999999999999999999999999999 Q ss_pred HHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCc-----EEEEeccceEEEEEecceEEEEeecccchhhcCc Q lcl|Aclame:pro 316 IELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT-----ALVGGFRQGATLWSRQGITVLMTDSHADFFTANT 390 (419) Q Consensus 316 l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~-----~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 390 (419) |+++||++|+|+| .+++.++.+++|+|+||++++++|.+. +++|||+++|+++++.++++++++ |.+++ T Consensus 304 l~~lkd~~G~~l~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~-----~~~~~ 377 (415) T protein:vir:94 304 LDKMKDKLGNYLI-QPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFG 377 (415) T ss_pred HHHhhccCCCeee-ccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----cccCc Confidence 9999999999865 556777888999999999999998654 899999999999999999998764 56778 Q ss_pred EEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 391 LVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 391 ~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +.||++.|+|+++++|+||++++++++++ T Consensus 378 ~~~r~~~r~d~~~~~~~a~~~~~~~~~~~ 406 (415) T protein:vir:94 378 ECLMIAVRQDCRILDYKSAIVIEYDDSER 406 (415) T ss_pred eEEEEEEEeccEEeccccEEEEEEeccCC Confidence 89999999999999999999999999999 No 32 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=8.8e-61 Score=349.84 Aligned_cols=382 Identities=13% Similarity=0.114 Sum_probs=278.7 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARG---LADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAE 77 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~---~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 77 (419) |..-++|+|+++++.+..++++...++++.+.++.+. ...+++++++.+.++.+.+++.....+............. T Consensus 1 m~~~m~i~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) T protein:vir:74 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) T ss_pred CChhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 8888888888888888888777777776665544332 3345555666666666555555443332221111111110 Q ss_pred cchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhh Q lcl|Aclame:pro 78 AGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADL 157 (419) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~ 157 (419) . .. ...........+.+....+...... . ....+.. ..+....|++++|+.+...|++.++..++|+++ T Consensus 81 ~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~--~-----~~~~~a~-~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~ 149 (408) T protein:vir:74 81 P-LN--KSENELKDKFVKDFVNMVRNPMAFL--N-----TVSSKTE-TSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQY 149 (408) T ss_pred c-cc--chhhhhHHHHHHHHHHHHhcchhhh--h-----hhhhhhh-cccccCCCceeechhHhhHHHHHHhhhcchhhh Confidence 0 00 0011111122222222222111110 0 1111111 223345578889999999999999999999999 Q ss_pred cceecccCcceeeeeeccccceeccccccceeecCcccccc-cccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHH Q lcl|Aclame:pro 158 LDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ-STLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQ 235 (419) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~ 235 (419) |+++|++++...++..... .....+.|++|++.+|+ ++++|++|+++++|++++++||+|+++|+. ++++||. T Consensus 150 ~~~~~~~~~~~~~~~~~~~-----~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~ 224 (408) T protein:vir:74 150 VRVESVSTSSGSRVYEKWT-----DVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLS 224 (408) T ss_pred cceeeccCCcceEEEEeec-----CCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHH Confidence 9999998765555433211 12345689999999997 469999999999999999999999999975 8999999 Q ss_pred HHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHH-HhhhhhccCCcEEEEehHHHH Q lcl|Aclame:pro 236 GRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAK-TVAEIAGFPPDGVVVHPQDWE 314 (419) Q Consensus 236 ~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 314 (419) ++|++++++++|.+||+|+|+++|.|.. ..++++.+++ ..+...|..+++|+|||.++. T Consensus 225 ~~l~~~~~~~~d~~il~G~G~~~~~~~~--------------------~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~ 284 (408) T protein:vir:74 225 SWIAKKVVVTRNQAIIAAMGTVPKKPTI--------------------ANFDDVITMINTSVDPAIIATSSLLTNQSGLN 284 (408) T ss_pred HHHHHHHHHHHHHHHhhccccccccccc--------------------ccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHH Confidence 9999999999999999999988765421 1256666666 478888889999999999999 Q ss_pred HHHHHhccCCceeccCCccccCCCcccccceeEecC--CCCc-----CcEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 315 SIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTV--AIAQ-----GTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 315 ~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~--~~~~-----~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) .|+++||++|+|+| .+++.++.+++|+|+||++++ .+|. ..+++|||+++|.+++|.+++++++++....|. T Consensus 285 ~l~~lkd~~G~~l~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~ 363 (408) T protein:vir:74 285 KLALVKTAEGKYLL-EPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFE 363 (408) T ss_pred HHHHhhcCCCceEe-ccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccccchhh Confidence 99999999999764 566777888999999999865 4553 348999999999999999999999998878899 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +|++.||++.|+|+++++|+||++++++++++ T Consensus 364 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~ 395 (408) T protein:vir:74 364 TDTTKIRVIDRFDVKATDSEALVAGSFTAIAD 395 (408) T ss_pred cceeeEEEEEeeCcEEecccceEEEEeecccC Confidence 99999999999999999999999999988877 No 33 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=1.3e-60 Score=348.99 Aligned_cols=393 Identities=16% Similarity=0.155 Sum_probs=261.3 Q ss_pred CCccHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARL---DDTSLTTEQVQEIVAE---------ARGLADALQAESDRAAARAALLRTAPPAPKGPAD 68 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~---~~~~~~~~~~~~~~~e---------~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 68 (419) |.-.++|+++++++.+.+ +++.....+.+...++ .+..++.++++.+++++...+++..........+ T Consensus 6 ~~~~~el~~~~~~l~el~~~~~el~~~~~el~~~~e~ak~eee~~~l~~ei~~le~e~~~l~~~~~~le~~~~~~~~~l~ 85 (425) T protein:vir:95 6 LMLTKKIEQRKAALDELVKREQELQAKAAELEQAIEEAQTEEEVSAVEEEVAKLEDERNELNEKKSKLEGEIAQLEDELE 85 (425) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 223334444443333222 2222222222211111 1122233344444444444444444433333322 Q ss_pred hcccccccccchhhhhhHH-HHhHHHHHHHHHhhhhhh--hhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHH Q lcl|Aclame:pro 69 GGTPLTPAEAGTFRSLAQR-FADSDGLREYRARDKRGQ--FQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVP 145 (419) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~ 145 (419) .............+..+.. .................. ............ ....+++++++++|+.+.+.|+ T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~gg~~vP~~~~~~Ii 159 (425) T protein:vir:95 86 QINSKQPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKF------RNLRAVAGGELTIPEVVVNRIM 159 (425) T ss_pred HhhhhccchhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHH------HhhcccccCceeccHHHHHHHH Confidence 2221111111111111100 000011011111100000 011111111111 1122345688899999999999 Q ss_pred HhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccc-cceeeEEeeeEEEEEeehhhHHHH Q lcl|Aclame:pro 146 TTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST-LSFDTITTTLKTVAHWLPITRQAA 224 (419) Q Consensus 146 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~-~~~~~v~~~~~k~~~~~~vs~ell 224 (419) +.++..++++++|+++++++ ...+|+.. +.+.+.|++|++++|+++ ++|++|++++++++++++||+|++ T Consensus 160 ~~l~~~~~i~~~~~~~~~~g-~~~ip~~~--------~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell 230 (425) T protein:vir:95 160 DIMGDYTTLYPLVDKIRVKG-TTRILVDT--------DTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLL 230 (425) T ss_pred HHHHhhhhHHHhhceeecCc-eeEEEEec--------CCccccccccccccccccccccceeeeeheeeeeeehhhHHHH Confidence 99999999999999999865 56888754 345789999999999988 689999999999999999999999 Q ss_pred hhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCcc--cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 225 DDNS-QLMGYIQGRLTYGLRFLRDRQLLNGNGST--EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGF 301 (419) Q Consensus 225 ~d~~-~~~~~i~~~l~~a~~~~~d~~il~G~g~~--~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (419) +|++ ++++||.++|++++++++|.+||+|+|++ +|.||++....... .........++++.+++..+...+. T Consensus 231 ~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (425) T protein:vir:95 231 QDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQ-----VTVEADNNLLKNLVKQIGLIDTGDD 305 (425) T ss_pred hccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccccc-----cccccccchHHHHHHHHHhhhhhcc Confidence 9985 89999999999999999999999999974 89999975332211 1222344567888888888877664 Q ss_pred --CCcEEEEehHHHH----HHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceE Q lcl|Aclame:pro 302 --PPDGVVVHPQDWE----SIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGIT 375 (419) Q Consensus 302 --~~~~~~~~~~~~~----~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 375 (419) .+++|+||+.++. .|++++|.+|+|++..+ .+..++|+|+||++++.||++.++||||++ |+++++.+++ T Consensus 306 ~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~---~~~~~~l~G~pvv~~~~~~~~~i~~Gd~~~-~~~~~~~~~~ 381 (425) T protein:vir:95 306 SVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLP---NLRTPDLLGLRVVFNNFLDDDTVLFGEFEQ-YTLVERENIT 381 (425) T ss_pred ccCceEEEEeChHHHHHHHHHHhhcCCCCceeeccC---CCCCccccceeeEEcCcCCCccEEEEeccc-EEEEeecceE Confidence 4567999999853 46778899999876533 345678999999999999999999999997 6777899999 Q ss_pred EEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 376 VLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 376 i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +.++++. +|.+|++.||++.|+|+++++|+||+++++++.+. T Consensus 382 i~~~~~~--~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~ 423 (425) T protein:vir:95 382 IDSSTHV--KFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPVQ 423 (425) T ss_pred EEeeccc--ccccCceEEEEEEeeCcEeecccceEEEEecCcCC Confidence 9998864 69999999999999999999999999999999888 No 34 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=1.3e-60 Score=348.97 Aligned_cols=382 Identities=12% Similarity=0.106 Sum_probs=275.9 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEAR---GLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAE 77 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~---~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 77 (419) |...++|+|+++++.+..+++....+++....++.. ++.++++++++.+.+++++++.+.................. T Consensus 1 m~~~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) T protein:vir:10 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) T ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 777777777777777777776666666554433322 23344555555555555555554443332221111111000 Q ss_pred cchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhh Q lcl|Aclame:pro 78 AGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADL 157 (419) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~ 157 (419) ..............+.+....+....... ....+.. ..+....|++++|+.+.+.|++.+++.++|+++ T Consensus 81 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~a~-~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~ 149 (408) T protein:vir:10 81 ---PLNKSENELKDKFVKDFVNMVRNPMAFMN-------TVSSKTE-TSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQY 149 (408) T ss_pred ---ccccchhhhHHHHHHHHHHHhhcchhhhh-------hhhhhhh-hcccccCCceeccHhHHHHHHHHHHhhchhhhh Confidence 00111111222233333333322221111 0111111 223345577889999999999999999999999 Q ss_pred cceecccCcceeeeeeccccceeccccccceeecCcccccccc-cceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHH Q lcl|Aclame:pro 158 LDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST-LSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQ 235 (419) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~-~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~ 235 (419) |+++|+++....++..... .....+.|++|++.+|+.+ ++|++|++++++++++++||+|+++|+. ++++||. T Consensus 150 ~~~~~~~~~~~~~~~~~~~-----~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~ 224 (408) T protein:vir:10 150 VRVESVSTSNGSRVYEKWT-----DVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLS 224 (408) T ss_pred cceeeccCCcceEEEeecc-----ccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHH Confidence 9999998776665543211 1224688999999999865 8999999999999999999999999975 8999999 Q ss_pred HHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHH-HhhhhhccCCcEEEEehHHHH Q lcl|Aclame:pro 236 GRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAK-TVAEIAGFPPDGVVVHPQDWE 314 (419) Q Consensus 236 ~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 314 (419) ++|++++++++|.+|++|+|++.+.+- ...++++.+++ ..+...|..++.|+|||++|. T Consensus 225 ~~l~~~~~~~~~~~il~g~g~~~~~~~--------------------~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~ 284 (408) T protein:vir:10 225 SWIAKKVVVTRNQAIIEVMKAAPKKPT--------------------IAKFDDVITMINTAVDPAIIATSSLLTNQSGLN 284 (408) T ss_pred HHHHHHHHHHHHHHHhhcccccccccc--------------------cccHHHHHHHHHHhhhhhhccCCEEEEcHHHHH Confidence 999999999999999999998764321 12366777776 468888888899999999999 Q ss_pred HHHHHhccCCceeccCCccccCCCcccccceeEecC--CCCcC-----cEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 315 SIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTV--AIAQG-----TALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 315 ~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~--~~~~~-----~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) .|++++|++|+|+| ++++.++.+++|+|+||++++ .+|.. .+++|||+++|.++++.++++.++++....|. T Consensus 285 ~l~~lkd~~G~~i~-~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~ 363 (408) T protein:vir:10 285 KLALVKTAEGKYLL-EPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFE 363 (408) T ss_pred HHHHhhccCCceEe-ccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEEcccccchhh Confidence 99999999999865 456777888999999999854 45653 28999999999999999999999998888899 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +|++.||++.|+|+++++|+||++++++++++ T Consensus 364 ~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~ 395 (408) T protein:vir:10 364 TDTTKIRVIDRFDVKATDSEALVAGSFSAIAD 395 (408) T ss_pred cCceEEEEEEeeccEEeccccEEEEEeecccc Confidence 99999999999999999999999999998766 No 35 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=7.9e-60 Score=344.61 Aligned_cols=383 Identities=11% Similarity=0.084 Sum_probs=262.9 Q ss_pred CC--ccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|Aclame:pro 1 MP--PTPTLEEQRAALLARLDDTSLTTEQVQ-EIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAE 77 (419) Q Consensus 1 M~--~~~~L~e~~~~l~~~~~~~~~~~~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 77 (419) || -.+.|++++++++++.++++...++.. +..+...++++.++++++.+.+..+....................... T Consensus 1 ~~~~m~k~l~el~~~~~~~~~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:12 1 MPMQMSKKEIALRQQFTEKKQQADKALQEGNTDEARALLDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEGQ 80 (397) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhccc Confidence 44 335566666666665555544332211 111112222333333333333322222222222111111111110000 Q ss_pred cchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhh Q lcl|Aclame:pro 78 AGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADL 157 (419) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~ 157 (419) ..+.........+..+.+...........+.+.... .. ......+..+..|++++|+.+.+.|++.+...++|+++ T Consensus 81 --~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~-~~-~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~ 156 (397) T protein:vir:12 81 --RSQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLD-SP-EFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQY 156 (397) T ss_pred --ccccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHh-hh-hhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhh Confidence 000111111111222222222222221111111111 11 11222344556678899999999999999999999999 Q ss_pred cceecccCccee--eeeeccccceeccccccceeecCccccccc-ccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHH Q lcl|Aclame:pro 158 LDQQNADYNVLE--YIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGY 233 (419) Q Consensus 158 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~ 233 (419) |+++|+.+.+.. +++.. +...++||+||+++|+. .++|++|+++++|++++++||+|+++|+. ++++| T Consensus 157 ~~~~~~~~~~~~~~~~~~~--------~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~ 228 (397) T protein:vir:12 157 VTVEPVTTRSGTRLLEKNA--------DMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTY 228 (397) T ss_pred cceeeccCCceeEEEEEec--------CCcceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhchHHHHHH Confidence 999998765444 44433 34578999999999975 58999999999999999999999999975 89999 Q ss_pred HHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHH-hhhhhccCCcEEEEehHH Q lcl|Aclame:pro 234 IQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKT-VAEIAGFPPDGVVVHPQD 312 (419) Q Consensus 234 i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 312 (419) |.++|++++++++|.+|++|+|+++|.|+.+ ++++..++. .+...+..+++|+|||++ T Consensus 229 i~~~l~~~~~~~~d~~il~G~g~~~~~g~~~---------------------~~~i~~~~~~~l~~~~~~~a~~~~n~~~ 287 (397) T protein:vir:12 229 VAKWFAKKSVVTRNNLILAAIASLKKVDIDG---------------------LDGIKKALNVTLDPMVAPGSIVLTNQDG 287 (397) T ss_pred HHHHHHHHHHHHHHHHHHhcccccccccccc---------------------HHHHHHHHhhccchhhhCCCEEEEcHHH Confidence 9999999999999999999999999888753 456666664 678889999999999999 Q ss_pred HHHHHHHhccCCceeccCCccccCCCcccccceeEecCCC-Cc-----CcEEEEeccceEEEEEecceEEEEeecccchh Q lcl|Aclame:pro 313 WESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAI-AQ-----GTALVGGFRQGATLWSRQGITVLMTDSHADFF 386 (419) Q Consensus 313 ~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~-~~-----~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~ 386 (419) |.+|++++|++|+|+| ++++.++.+++|+|+||++++.+ |. ..+++|||+++|.++++.+++++++++....| T Consensus 288 ~~~L~~lkd~~G~~l~-~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f 366 (397) T protein:vir:12 288 YDWLDTLKDGTGRYLL-QPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDTGAGAF 366 (397) T ss_pred HHHHHHhhccCCceee-cccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeecceEEEEeccccchh Confidence 9999999999999865 55677788899999999876653 32 23899999999999999999999999888889 Q ss_pred hcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 387 TANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 387 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) .+|++.||++.|+|+++++|+||+++++++- T Consensus 367 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 367 ETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred hcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 9999999999999999999999999999999 No 36 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=4e-60 Score=346.20 Aligned_cols=356 Identities=14% Similarity=0.164 Sum_probs=272.0 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |. ++|+++++++.+..++.+....+ +..+.+++++++++.+++++++++...+................. T Consensus 1 M~--k~l~~l~e~~~~~~~e~~~~~~~------~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~-- 70 (371) T protein:vir:81 1 MP--KELRELLEQINNKKEEARKLLAE------NKIEEAKKLKEEIVALQEKFDVAKELYEEQKQTIEDKEPLKPTVQ-- 70 (371) T ss_pred Cc--HHHHHHHHHHHHHHHHHHHHhhH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchh-- Confidence 88 47888877777777665554322 222346667777777777777766655544333322221111100 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) ...+..+.+... ++..... ... .+++..|++++|+.+...|++.++..++|++++++ T Consensus 71 --------~~~~~~~~~~~~---------l~~~~~~-----a~~-~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~ 127 (371) T protein:vir:81 71 --------VKENEVEAFVNH---------IRTRFRN-----AMS-EGSNQDGGYTVPQDIQTRINELRESKDALQNLITV 127 (371) T ss_pred --------hHHHHHHHHHHH---------HHHHHHH-----hhc-cCCCccCceeecHhHHHHHHHHHHhhhhhhhhcee Confidence 001111111111 1111111 111 22345678899999999999999999999999999 Q ss_pred ecccCcceeeeeeccccceeccccccceeecCcccccc-cccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ-STLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRL 238 (419) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l 238 (419) +|++++...++.... .+.+.++||+||+.+|+ ++++|++++++++|++++++||+|+++|+ .+|++||.++| T Consensus 128 ~~~~~~~~~~~~~~~------~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l 201 (371) T protein:vir:81 128 EPVTTLSGSRVFKKR------SQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWI 201 (371) T ss_pred eeccCCceeEEEEee------cCCcceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHH Confidence 999877666554332 23457899999999997 56999999999999999999999999997 58999999999 Q ss_pred HHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHH-HhhhhhccCCcEEEEehHHHHHHH Q lcl|Aclame:pro 239 TYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAK-TVAEIAGFPPDGVVVHPQDWESIE 317 (419) Q Consensus 239 ~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~ 317 (419) ++++++++|.+|++|+|++.|.|+.+ ++++..++ ..+...+..+++|+|||++|..|+ T Consensus 202 ~~a~~~~~~~~i~~g~g~~~~~~~~~---------------------~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~ 260 (371) T protein:vir:81 202 GDESRVTRNGLIINVLNTKAKTAIAD---------------------LDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLD 260 (371) T ss_pred HHHHHHHHHHHHHhhccccccccccc---------------------HHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHH Confidence 99999999999999999988877643 34444444 457778888899999999999999 Q ss_pred HHhccCCceeccCCccccCCCcccccceeEecCCCCc------------CcEEEEeccceEEEEEecceEEEEeecccch Q lcl|Aclame:pro 318 LDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQ------------GTALVGGFRQGATLWSRQGITVLMTDSHADF 385 (419) Q Consensus 318 ~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~------------~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~ 385 (419) ++||++|+|+| .+++.++.+++|+|+||++++.+|. ..+++|||+++|.++++.+++++++++..+. T Consensus 261 ~lkd~~g~~l~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~ 339 (371) T protein:vir:81 261 TLKDQNGQYLL-QPSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDA 339 (371) T ss_pred HhhccCCCeee-ecccCCCCCceecceeEEEecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccch Confidence 99999999865 5567788889999999999999873 2589999999999999999999999988888 Q ss_pred hhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 386 FTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 386 ~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) |.+|++.||++.|+|+++++|+||+++++++| T Consensus 340 f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 340 FETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred hhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 99999999999999999999999999999999 No 37 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=6.7e-60 Score=344.99 Aligned_cols=382 Identities=12% Similarity=0.107 Sum_probs=276.2 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARG---LADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAE 77 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~---~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 77 (419) |+.-.+|+|+++++.+..++++...++++.+..+.+. ...+++++++.+..+.+.++......+............. T Consensus 1 ~~~~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (404) T protein:vir:39 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (404) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 9999999998888888888887777777665544332 2333444444444454444444333222221111111111 Q ss_pred cchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhh Q lcl|Aclame:pro 78 AGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADL 157 (419) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~ 157 (419) ..... . ........+.+......+.... .. ...+. ...++...|++++|+.+...|++.++..++|+++ T Consensus 81 ~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~--~~-----~e~~a-~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~ 149 (404) T protein:vir:39 81 PLNKS-E--YELKDKFVKEFVNMVRNPMAFL--NT-----VSSKT-ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQY 149 (404) T ss_pred ccccc-h--hhhHHHHHHHHHHHHhcchhhh--hh-----hhhhh-hhcccccCCceeccHHHHHHHHHHHHhhhhHHhh Confidence 11000 0 0111122222222222111110 00 11111 1223445677899999999999999999999999 Q ss_pred cceecccCcceeeeeeccccceeccccccceeecCcccccc-cccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHH Q lcl|Aclame:pro 158 LDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ-STLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQ 235 (419) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~ 235 (419) |+++|+.++...++..... .....+.||+|++.+|+ ++++|++++++++|++++++||+|+++|+ .++++||. T Consensus 150 ~~~~~~~~~~~~~~~~~~~-----~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~ 224 (404) T protein:vir:39 150 VRVESVSTSNGSRVYEKWT-----DVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLS 224 (404) T ss_pred cceeeccCCcceEEEEeec-----CCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHH Confidence 9999988766555543211 12346899999999998 46999999999999999999999999997 58999999 Q ss_pred HHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHH-hhhhhccCCcEEEEehHHHH Q lcl|Aclame:pro 236 GRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKT-VAEIAGFPPDGVVVHPQDWE 314 (419) Q Consensus 236 ~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 314 (419) ++|++++++++|.+||+|+|++.|.+.. ..++++..++. .+...+..+++|+|||++|. T Consensus 225 ~~l~~~~~~~~d~~il~g~g~~~~~~~~--------------------~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~ 284 (404) T protein:vir:39 225 SWIAKKVVVTRNQAIIAAMGTVPKKPTI--------------------AKFDDVITMINTSVDPAIIATSSLLTNQSGLN 284 (404) T ss_pred HHHHHHHHHHHHHHHHhccccccccccc--------------------ccHHHHHHHHHHhhhhhhccCCEEEEcHHHHH Confidence 9999999999999999999988765432 12556666665 56777888889999999999 Q ss_pred HHHHHhccCCceeccCCccccCCCcccccceeEecCC--CCc-----CcEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 315 SIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVA--IAQ-----GTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 315 ~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~--~~~-----~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) .|+++||++|+|+| ++++.++.+++|+|+||++++. +|. ..+++|||+++|.++++.+++++++++...+|. T Consensus 285 ~L~~lkd~~G~~l~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~ 363 (404) T protein:vir:39 285 KLALVKTAEGKYLL-EPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFE 363 (404) T ss_pred HHHHhhccCCceee-ccCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccchhhhh Confidence 99999999999865 5567778889999999998654 443 248999999999999999999999998888899 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +|++.||++.|+|+++++|+||+++++++++. T Consensus 364 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~ 395 (404) T protein:vir:39 364 TDTTKIRVIDRFDVKTTDSEALVAGSFTAIAD 395 (404) T ss_pred hceeeEEEEeeeccEEecccceEEEEeecccc Confidence 99999999999999999999999999988877 No 38 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=2e-59 Score=342.42 Aligned_cols=387 Identities=14% Similarity=0.088 Sum_probs=259.2 Q ss_pred CC----------------ccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MP----------------PTPTLEEQRAALLARLDDTSLTT-EQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAP 63 (419) Q Consensus 1 M~----------------~~~~L~e~~~~l~~~~~~~~~~~-~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 63 (419) ++ .+.++.++++...++++.+.... ++...+.++..+.+++++++.+.+.....+++...+.. T Consensus 126 a~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~e~k~~~e~~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~ 205 (543) T protein:vir:81 126 SSQGGRGDYDRDAILEPDSIEDCRFRDPWNLSEMRTFGRDAEEVKGELRARALSAIEKMQGASDNVRAAATKIIERFDDE 205 (543) T ss_pred hHHHhhHHHHHhhhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 12222222222222222211111 11122222333334444444444444443333333222 Q ss_pred HHHHhhcccccccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhH- Q lcl|Aclame:pro 64 KGPADGGTPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPG- 142 (419) Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~- 142 (419) ........... .. ....+.+.. ..+......+........... ...+.+.+.+++++|+.+.. T Consensus 206 e~~~~~~~~~~-----~~---------~~~~~a~~~-~~~~~~~~~l~~~e~~~~~~~-~~~~~t~~~gg~lip~~~~~~ 269 (543) T protein:vir:81 206 DSTLARQCLAT-----SS---------PAYLRAWSK-MARNPHAAILTEEEKRAINEV-RAMGLTKADGGYLVPFQLDPT 269 (543) T ss_pred HHHHhhhhhhh-----hh---------hhhhhHHHH-HHHhhHHHHhhhhhhhhhhhh-hhcccccccCcccCchhhhhH Confidence 22111100000 00 000000000 000001111111111111111 11223345567778777655 Q ss_pred HHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHH Q lcl|Aclame:pro 143 IVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQ 222 (419) Q Consensus 143 ~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~e 222 (419) +|...+...++++.++++.++ ++.+.+|+.++ +..+.||+||+.+|+++++|+++++.+++++++++||++ T Consensus 270 ii~~~~~~~~~l~~~~~~~~~-~g~~~~~~~~~--------~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e 340 (543) T protein:vir:81 270 VIITSNGSLNDIRRFARQVVA-TGDVWHGVSSA--------AVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVPISIE 340 (543) T ss_pred HHHHHHhhhchhhhhcccccC-CcceEEEEecC--------CcceeecccCccccccccccceeeeeeeeeEeeehhhHH Confidence 456778888999999998766 45677887653 457899999999999999999999999999999999999 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc-cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 223 AADDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGF 301 (419) Q Consensus 223 ll~d~~~~~~~i~~~l~~a~~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (419) +++|++++++||.++|++++++++|.+||+|+|++ +|.||++....... ...+.......++++.+++..+...|. T Consensus 341 ll~d~~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~l~~~~~ 417 (543) T protein:vir:81 341 ALQDEANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAA---EIAPVTAETFALADVYAVYEQLAARHR 417 (543) T ss_pred HHhccHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhcccccc---cccccccccccHHHHHHHHHhhhcccc Confidence 99999999999999999999999999999999985 89999986543322 222334445678999999999999999 Q ss_pred CCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCc----------EEEEeccceEEEEEe Q lcl|Aclame:pro 302 PPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT----------ALVGGFRQGATLWSR 371 (419) Q Consensus 302 ~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~----------~~~~d~~~~~~~~~~ 371 (419) .+++|+|||+++..|++++|++|+|+|. ++..+.+++|+|+||+++++||.+. +++|||+ .|+++++ T Consensus 418 ~~~~~v~n~~~~~~l~~lkd~~G~~l~~--~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~-~~~i~~~ 494 (543) T protein:vir:81 418 RQGAWLANNLIYNKIRQFDTQGGAGLWT--TIGNGEPSQLLGRPVGEAEAMDANWNTSASADNFVLLYGNFQ-NYVIADR 494 (543) T ss_pred CCcEEEEcHHHHHHHHHhhcCCCceecc--CcCCCCCccccceeeEEeccccccccccccCCcceEEEeecc-ceeEEee Confidence 9999999999999999999999998764 3445677899999999999998643 7899998 4777889 Q ss_pred cceEEEEeeccc--chhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|Aclame:pro 372 QGITVLMTDSHA--DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 372 ~~~~i~~~~~~~--~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~ 418 (419) .++++.++++.. ..|.+|++.|+++.|+|+++++|+||++++++++. T Consensus 495 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 495 IGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred cccEEEEeccccccchhhcCceEEEEEEeeccEeecccceEEEEecccC Confidence 999999887643 35778999999999999999999999999999999 No 39 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=1.5e-59 Score=343.08 Aligned_cols=389 Identities=13% Similarity=0.114 Sum_probs=271.1 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |.+ +|++++++++++.+++....++.....+ +++.++++++.+++++++++...+................. . T Consensus 1 M~k--~l~el~~~~~~~~~e~~~~~~~~~~~~e----e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~-~ 73 (404) T protein:vir:10 1 MSK--ELRELLNQLDSKNKELNSLLNKDGVTAE----ELNKTSNEIDILQAKIEAQKRKENIENNFNEDNVKSLNTGK-E 73 (404) T ss_pred CcH--HHHHHHHHHHHHHHHHHHHHhhcCCCHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc-c Confidence 985 4555555555555555444443222122 23344555666666655544443333322221111111110 1 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) ...... .....+.+.....+..... .........++ ...+..+.|+.++|+.+...|+..++..++|++++++ T Consensus 74 ~~~~~~---~~~~~~~~~~~~~~~~~~~---~~~~~~~e~~a-~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~ 146 (404) T protein:vir:10 74 ENVIYN---GALFVRAIADNLLKQKNQR---GLNLSEKEINA-ISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDY 146 (404) T ss_pred hhhHHH---HHHHHHHHHHHHHHHHHhh---hhcchhhHHhh-hccccCCCCceeechhHHHHHHHHHhhhhhHhhhhce Confidence 000000 0011111111111111000 00001111111 1223345677889999999999999999999999999 Q ss_pred ecccCc--ceeeeeeccccceeccccccceeecCccccccc--ccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHH Q lcl|Aclame:pro 161 QNADYN--VLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS--TLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQ 235 (419) Q Consensus 161 ~~~~~~--~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~--~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~ 235 (419) .|+.++ .+.||+..+ ...++|++|++.+|.+ +++|++++++++|++++++||+|+++|+. ++++||. T Consensus 147 ~~~~~~~g~~~~~~~~~--------~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~ 218 (404) T protein:vir:10 147 EPVFTRSGSRTYEKRSK--------QKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDWII 218 (404) T ss_pred eeccCCccceEEEEecC--------CcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHHHH Confidence 998654 456666543 4578999999999885 58999999999999999999999999975 8999999 Q ss_pred HHHHHHHHHHHHHHHHhccCcc-cccceeccccccccccccccccchhhhHHHHHHHHHH-hhhhhccCCcEEEEehHHH Q lcl|Aclame:pro 236 GRLTYGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKT-VAEIAGFPPDGVVVHPQDW 313 (419) Q Consensus 236 ~~l~~a~~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 313 (419) ++|++++++++|.+||+|+|++ .|.||++..++.+... +....++++..++. .+...+..+++|+|||++| T Consensus 219 ~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~-------~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~ 291 (404) T protein:vir:10 219 NWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITL-------PKSPALKDFKKCKNVELLNVFKATSSWIVNQDGF 291 (404) T ss_pred HHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeec-------cccccHHHHHHHHHhhhhccccCCCEEEEcHHHH Confidence 9999999999999999999986 5788887655433322 22234677777765 5777888888999999999 Q ss_pred HHHHHHhccCCceeccCCccccCCCcccccceeEe-cCCCCcC-----cEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 314 ESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVS-TVAIAQG-----TALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 314 ~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~-~~~~~~~-----~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) ..|+++||.+|+|+| .+++.++.+++|+|+||++ ++.++.+ .+++|||+++|.++++.+++++++++....|. T Consensus 292 ~~L~~lkd~~G~~l~-~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~ 370 (404) T protein:vir:10 292 NYLDSLEDKTGRPYL-QPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFE 370 (404) T ss_pred HHHHHhhccCCceee-ccCcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEEeccccchhh Confidence 999999999999865 4567778889999999985 4555543 38999999999999999999999988777899 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +|++.||++.|+|+++++|+||++++++++.+ T Consensus 371 ~~~~~~~~~~r~d~~v~~~~a~~~~~~~~aa~ 402 (404) T protein:vir:10 371 TNTTKARIIMRIDGNVKDSEALLIAEIPVESV 402 (404) T ss_pred cCceEEEEEEeeccEEecccceEEEEeecccC Confidence 99999999999999999999999999999999 No 40 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=1.5e-58 Score=337.61 Aligned_cols=402 Identities=13% Similarity=0.097 Sum_probs=267.0 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQV---QEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAE 77 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~---~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 77 (419) |.-..-|+++.++++++..+++...++. .+..++.++.+++++++++.++++++++++................... T Consensus 1 M~l~el~~~~~~~~~~~~a~l~~~~~~~~~~~ee~~~~~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~~~~~~~~ 80 (434) T protein:vir:62 1 MNLKEILNASLTRTKSRLAELQGKVEKNEVRSEELAAVKAEVEQLTKEIQTISEELAKLEEKEKEEDPAKKKDDDPEKKE 80 (434) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhhhhc Confidence 8744444455555555555444443322 1223344566777777888887777776655443333221111100000 Q ss_pred cchh-hhhhHHHHhHHHHHHHHHh---------hhhhhhhHHHHHHHHHHhhh-----cccccccccCCcccccchhhhH Q lcl|Aclame:pro 78 AGTF-RSLAQRFADSDGLREYRAR---------DKRGQFQVEMRDIDPNRLLS-----RDAPAGTITNPNVPHLPQLVPG 142 (419) Q Consensus 78 ~~~~-~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~p~~~~~ 142 (419) .... +............+..... ........+.+......... .....+..++.|++++|+.+.+ T Consensus 81 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~GG~lvP~~~~~ 160 (434) T protein:vir:62 81 DPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGLVTGNGSVTIPDFLSK 160 (434) T ss_pred chhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhcccccccceecchhhHH Confidence 0000 0000000000011100000 00001111111111111100 0111122345578899999999 Q ss_pred HHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHH Q lcl|Aclame:pro 143 IVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQ 222 (419) Q Consensus 143 ~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~e 222 (419) .|++.++..++++++|+++++++ .+++|+....... ....|.+|++.+|.++++|++|++.++|++++++||+| T Consensus 161 ~Ii~~l~~~~~i~~~~~~~~~~~-~~~~p~~~~~~~a-----~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~e 234 (434) T protein:vir:62 161 EIITYAQEENFLRRLGTGVKTKE-NIKYPVLVKKAEA-----QGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKK 234 (434) T ss_pred HHHHhhhhhhhhhhhcceeccCC-ceEEEEEecCCcc-----cceecccccccccccccceeeEEeeheeeEeehhhHHH Confidence 99999999999999999998765 4788876543211 12234567889999999999999999999999999999 Q ss_pred HHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCcccc-cceeccccccccccccccccchhhhHHHHHHHHHHhhhhhc Q lcl|Aclame:pro 223 AADDNS-QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEM-QGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAG 300 (419) Q Consensus 223 ll~d~~-~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p-~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 300 (419) +++|+. +|++||.++|++++++++|.+||+|+|+++| .|+++..++. ..++....++++++++..+...| T Consensus 235 ll~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~--------~~~~~~~~~d~l~~l~~~l~~~~ 306 (434) T protein:vir:62 235 LLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVE--------FKTDEKNLYDALVKMKNTPVKEV 306 (434) T ss_pred HHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeeccccc--------ccccccchhhHHHHHHhhcchhh Confidence 999985 8999999999999999999999999999865 4666543321 12334457899999999999999 Q ss_pred cCCcEEEEehHHHHHHHHHhccCCceeccCC-ccccCCCcccccceeEecCCCCcCc------EEEEeccceEEEEEec- Q lcl|Aclame:pro 301 FPPDGVVVHPQDWESIELDQAPGSGVFRVIA-NVQGEATPRIWGLNVVSTVAIAQGT------ALVGGFRQGATLWSRQ- 372 (419) Q Consensus 301 ~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~-~~~~~~~~~l~G~pv~~~~~~~~~~------~~~~d~~~~~~~~~~~- 372 (419) ..+++|+||+.++..|+++||++|+|+|.+. ...++.+++|+|+||++++.||.+. ++||||++++ ++++. T Consensus 307 ~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~Gdfs~~~-i~~~~g 385 (434) T protein:vir:62 307 RKKARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFGDFSKFY-IQDVIG 385 (434) T ss_pred hcCCEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCccCCCceEEEEeeccceE-EEEeec Confidence 9999999999999999999999999877542 3456778899999999999998644 7889999754 56665 Q ss_pred ceEEEEeecccchhhcCcEEEEEEEEeccEEec-ccceEEEEe--cCCCC Q lcl|Aclame:pro 373 GITVLMTDSHADFFTANTLVILAEFRANLAVYQ-PKAFVRVTF--AAATT 419 (419) Q Consensus 373 ~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~-~~a~~~~~~--~aa~~ 419 (419) .++++.+.+ .+|.+|++.||++.|+|+++++ |.++.++++ ++++. T Consensus 386 ~~~i~~~~~--~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~ 433 (434) T protein:vir:62 386 SLEVQKLVE--LFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTG 433 (434) T ss_pred eeEEEeehh--hhcccCceEEEEEeeecceeecCcccceEEEEEeccCCC Confidence 466776654 4689999999999999999877 887776644 45555 No 41 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=2.6e-58 Score=336.32 Aligned_cols=371 Identities=12% Similarity=0.084 Sum_probs=259.3 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQE--IVAEAR---GLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTP 75 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~--~~~e~~---~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 75 (419) |.. ++|+++++++.++++++....++... ..++.. +..+.++++++.+.+..+..+............... .. T Consensus 1 M~~-~eL~~~~~~~~~~~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 78 (395) T protein:vir:38 1 MNI-NQLKDAFDMAGQKVQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPV-NK 78 (395) T ss_pred CCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc-cc Confidence 765 66888777777777666544432211 111111 122233333333333332222222111111111000 00 Q ss_pred cccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHH Q lcl|Aclame:pro 76 AEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVA 155 (419) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~ 155 (419) .. ...... ....+.+.... .+..... .......+..|++++|+.+...|++.+...++|+ T Consensus 79 ~~----~~~~~~---~~~~~~~~~~~--------~~~~~~~-----~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~ 138 (395) T protein:vir:38 79 KP----LPVKDG---KPDAQAMKNQF--------VKDFKNL-----VTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLE 138 (395) T ss_pred cc----cchhhh---hHHHHHHHHHH--------HHHHHHH-----HhhccCccCCCceecchhHhhHHHHHHHhhcchh Confidence 00 000000 00111111111 1111100 0111223445788999999999999999999999 Q ss_pred hhcceecccCcceeeeeeccccceeccccccceeecCcccccccc-cceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHH Q lcl|Aclame:pro 156 DLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST-LSFDTITTTLKTVAHWLPITRQAADDNS-QLMGY 233 (419) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~-~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~ 233 (419) ++|+++|+.++...++..... .....+.|++|++.+|+++ ++|++|+++++|++++++||+|+++|+. +|++| T Consensus 139 ~~~~~~~~~~~~~~~~~~~~~-----~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~ 213 (395) T protein:vir:38 139 SLANVENVTTSHGSRVYEKLA-----DITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQW 213 (395) T ss_pred hhcceeeccCCcceEEEEeec-----cCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHhhhHHHHHHH Confidence 999999987766555443221 1234678999999999875 8999999999999999999999999975 89999 Q ss_pred HHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHH-hhhhhccCCcEEEEehHH Q lcl|Aclame:pro 234 IQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKT-VAEIAGFPPDGVVVHPQD 312 (419) Q Consensus 234 i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 312 (419) |.++|++++++++|.+|++|+|++.+.+.. ..++++.+++. .+...+..+++|+|||++ T Consensus 214 i~~~la~~~~~~~~~~il~g~g~~~~~~~~--------------------~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~ 273 (395) T protein:vir:38 214 LVNWAAKKDVVTRNAKILEVMGKAPKKPTI--------------------SQFDNIKDLENNTLDPAIESTSSFITNQSG 273 (395) T ss_pred HHHHHHHHHHHHHHHHHhhccccccccccc--------------------ccHHHHHHHHHHhhhhhhcCCCEEEEcHHH Confidence 999999999999999999999987653211 12455666654 677888889999999999 Q ss_pred HHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCc------CcEEEEeccceEEEEEecceEEEEeecccchh Q lcl|Aclame:pro 313 WESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQ------GTALVGGFRQGATLWSRQGITVLMTDSHADFF 386 (419) Q Consensus 313 ~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~------~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~ 386 (419) |..|++++|++|+|+| ++++.++.+++|+|+||+++++++. ..+++|||+++|+++++.+++++++++...+| T Consensus 274 ~~~L~~lkd~~G~~l~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~ 352 (395) T protein:vir:38 274 YNILSKVKDADGRYLM-QPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSF 352 (395) T ss_pred HHHHHHhhccCCceee-ccCcCCCCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEeccccchh Confidence 9999999999999765 5567788889999999999887542 34899999999999999999999999888889 Q ss_pred hcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 387 TANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 387 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .+|++.||++.|+|+++.+|+||+++++++++| T Consensus 353 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~ 385 (395) T protein:vir:38 353 EHDTTKLRFIDRFDVQLIDDGAFAAASFKTVAN 385 (395) T ss_pred hcCceEEEEEEeeccEEecccceEEEEeecccC Confidence 999999999999999999999999999999888 No 42 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=5.7e-58 Score=334.41 Aligned_cols=383 Identities=13% Similarity=0.089 Sum_probs=270.4 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQE-IVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAG 79 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~-~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (419) |-.- .|+++++++.+..+++....++.+. +.++..+.+++++++++.+.++++++++..+..+............... T Consensus 1 M~~~-~l~el~~~l~e~~~~i~~~~~e~~~~~~~~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~ 79 (394) T protein:vir:97 1 MFEE-KIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKE 79 (394) T ss_pred CcHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Confidence 5432 4666666666666555555555443 4455556677888888888888888777665554443322221111111 Q ss_pred ---hhhhhhHHHHhHHHHH-HHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHH Q lcl|Aclame:pro 80 ---TFRSLAQRFADSDGLR-EYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVA 155 (419) Q Consensus 80 ---~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~ 155 (419) ..+............. ..............................+.+...|++++|+.+...|+..+...++|+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~ 159 (394) T protein:vir:97 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLK 159 (394) T ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhh Confidence 1111111111000000 000000000000111111111111122233445667889999999999999999999999 Q ss_pred hhcceecccCcceeeeeeccccceeccccccceeecCcccccc-cccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHH Q lcl|Aclame:pro 156 DLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ-STLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGY 233 (419) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~ 233 (419) ++|+++|+.++.+.+|+... .+..++|++|++.+|+ ++++|++|++.++|++++++||+||++|+. ++++| T Consensus 160 ~~~~~~~~~~~~~~~~~~~~-------~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~ 232 (394) T protein:vir:97 160 PFTTVYQAKKASGKYPVLQR-------ATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGI 232 (394) T ss_pred hhceeeeccCcceEEEEEec-------CCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhhHHHHHH Confidence 99999999988888887643 2356889999999997 569999999999999999999999999975 89999 Q ss_pred HHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHH Q lcl|Aclame:pro 234 IQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDW 313 (419) Q Consensus 234 i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (419) |.+++++++++++|.+|++|.|++.+.|. ..++++.+++...... ..++.|+|||++| T Consensus 233 i~~~la~~~~~~~~~~i~~g~~~~~~~~~---------------------~~~~~~~~~~~~~~~~-~~~a~~v~n~~~~ 290 (394) T protein:vir:97 233 VSESISQIKVNTTNDAIAKVLKSFTTKTV---------------------KNLDEIKALLNGGFDP-AYNVSLIVSQSFY 290 (394) T ss_pred HHHHHHHHHHHHHHHHHhhcccccccccc---------------------ccHHHHHHHHHhhhhh-hhCCEEEEcHHHH Confidence 99999999999999999998876544332 1256666666554433 3457899999999 Q ss_pred HHHHHHhccCCceeccCCccccCCCcccccceeEecC--CCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 314 ESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTV--AIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 314 ~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~--~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) ..|++++|++|+|+| .+++.++.+++|+|+||++++ .++++.+++|||+++|.+++|.+++++++++. .+.. T Consensus 291 ~~l~~lkd~~G~~i~-~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~-----~~~~ 364 (394) T protein:vir:97 291 QTLDTLKDGNGRYLL-QDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNE-----IYGQ 364 (394) T ss_pred HHHHHhhccCCCeee-ecCcCCCCCceeccceeEEecccccCCccEEEeeccccEEEEEecceEEEEeccc-----ccce Confidence 999999999999865 456778888999999999854 56778899999999999999999999987643 3345 Q ss_pred EEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .||++.|+|+++.+|+||+.++++++++ T Consensus 365 ~~~~~~r~d~~v~~~~a~~~~~~~~~~~ 392 (394) T protein:vir:97 365 YLQAVLRFGVSKVDDKAGYYVTFTPEPL 392 (394) T ss_pred eEEEEEEEccEEecccceEEEEeccccc Confidence 7999999999999999999999999999 No 43 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=7e-58 Score=333.92 Aligned_cols=373 Identities=15% Similarity=0.152 Sum_probs=263.7 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |. ++|+|+++++.++.++++...++. ..+.++.+.++++.+++++++.++..+................... T Consensus 1 M~--k~l~el~~~~~~~~~e~~~~~~~~------~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (392) T protein:vir:10 1 MS--KELRELLAKLEGKKEEVRSLMGED------KVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDG 72 (392) T ss_pred Cc--HHHHHHHHHHHHHHHHHHHHhhHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccc Confidence 87 557777777777777666554322 1233455566666666666554443332222221111111111111 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) .. +....+.............+........... ....+++.|++++|+.+...|++.+.+.++|+++|++ T Consensus 73 ~~---------~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~ 142 (392) T protein:vir:10 73 EM---------EYRDVFMKALRNKPLNAEEREFLEDDLEQRA-MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTV 142 (392) T ss_pred hH---------HHHHHHHHHHhcccccHHHHHHHhhhhhhhh-ccccccCCCceecchhHHHHHHHHHHhhhhhhhhcee Confidence 10 0111111111111111111222222222222 2223445678889999999999999999999999999 Q ss_pred ecccCcceeeeeeccccceeccccccceeecCccccccc-ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRL 238 (419) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l 238 (419) +++.++..+++.... .+...+.||+|++++|++ .++|++|++.++|++++++||+|+++|+ +++++||.+.| T Consensus 143 ~~~~~~~~~~~~~~~------~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l 216 (392) T protein:vir:10 143 EPVRTRSGSRVLEKN------SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWL 216 (392) T ss_pred eeccCCceeEEEEee------cCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHH Confidence 999876655443221 234578999999999986 4899999999999999999999999997 58999999999 Q ss_pred HHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHH-HhhhhhccCCcEEEEehHHHHHHH Q lcl|Aclame:pro 239 TYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAK-TVAEIAGFPPDGVVVHPQDWESIE 317 (419) Q Consensus 239 ~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~ 317 (419) ++++++++|.+|++|+|++.+.|.. .++++.+++ ..+...|..++.|+|||++|..|+ T Consensus 217 ~~~i~~~~d~~~~~g~g~~~~~~~~---------------------~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~ 275 (392) T protein:vir:10 217 GKKSKVTRNVLILGVIEKLTKQAIK---------------------SLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLD 275 (392) T ss_pred HHHHHHHHHHHHhhccccccccCcc---------------------CHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHH Confidence 9999999999999999987654432 256677766 468888888999999999999999 Q ss_pred HHhccCCceeccCCccccCCCcccccceeEe-cCCC-C--------cCcEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 318 LDQAPGSGVFRVIANVQGEATPRIWGLNVVS-TVAI-A--------QGTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 318 ~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~-~~~~-~--------~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) ++||++|+|+| .+++.++.+++|+|+|+++ ++.+ + ...+++|||+++|.+++|.+++++++++.+.+|+ T Consensus 276 ~lkd~~G~~l~-~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~ 354 (392) T protein:vir:10 276 KLKDKDGKYIL-QSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFT 354 (392) T ss_pred HhhccCCCeEe-ecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhh Confidence 99999999865 5567778889999987654 3332 1 1237899999999999999999999998888999 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +|++.||++.|+|+++++|+||++++++++.. T Consensus 355 ~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~ 386 (392) T protein:vir:10 355 RNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAP 386 (392) T ss_pred cCceEEEEEEeeccEEecccceEEEEeccccc Confidence 99999999999999999999999998855443 No 44 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=7e-58 Score=333.92 Aligned_cols=373 Identities=15% Similarity=0.152 Sum_probs=263.7 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |. ++|+|+++++.++.++++...++. ..+.++.+.++++.+++++++.++..+................... T Consensus 1 M~--k~l~el~~~~~~~~~e~~~~~~~~------~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (392) T protein:vir:10 1 MS--KELRELLAKLEGKKEEVRSLMGED------KVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDG 72 (392) T ss_pred Cc--HHHHHHHHHHHHHHHHHHHHhhHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccc Confidence 87 557777777777777666554322 1233455566666666666554443332222221111111111111 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) .. +....+.............+........... ....+++.|++++|+.+...|++.+.+.++|+++|++ T Consensus 73 ~~---------~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~ 142 (392) T protein:vir:10 73 EM---------EYRDVFMKALRNKPLNAEEREFLEDDLEQRA-MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTV 142 (392) T ss_pred hH---------HHHHHHHHHHhcccccHHHHHHHhhhhhhhh-ccccccCCCceecchhHHHHHHHHHHhhhhhhhhcee Confidence 10 0111111111111111111222222222222 2223445678889999999999999999999999999 Q ss_pred ecccCcceeeeeeccccceeccccccceeecCccccccc-ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRL 238 (419) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l 238 (419) +++.++..+++.... .+...+.||+|++++|++ .++|++|++.++|++++++||+|+++|+ +++++||.+.| T Consensus 143 ~~~~~~~~~~~~~~~------~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l 216 (392) T protein:vir:10 143 EPVRTRSGSRVLEKN------SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWL 216 (392) T ss_pred eeccCCceeEEEEee------cCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHH Confidence 999876655443221 234578999999999986 4899999999999999999999999997 58999999999 Q ss_pred HHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHH-HhhhhhccCCcEEEEehHHHHHHH Q lcl|Aclame:pro 239 TYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAK-TVAEIAGFPPDGVVVHPQDWESIE 317 (419) Q Consensus 239 ~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~ 317 (419) ++++++++|.+|++|+|++.+.|.. .++++.+++ ..+...|..++.|+|||++|..|+ T Consensus 217 ~~~i~~~~d~~~~~g~g~~~~~~~~---------------------~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~ 275 (392) T protein:vir:10 217 GKKSKVTRNVLILGVIEKLTKQAIK---------------------SLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLD 275 (392) T ss_pred HHHHHHHHHHHHhhccccccccCcc---------------------CHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHH Confidence 9999999999999999987654432 256677766 468888888999999999999999 Q ss_pred HHhccCCceeccCCccccCCCcccccceeEe-cCCC-C--------cCcEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 318 LDQAPGSGVFRVIANVQGEATPRIWGLNVVS-TVAI-A--------QGTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 318 ~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~-~~~~-~--------~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) ++||++|+|+| .+++.++.+++|+|+|+++ ++.+ + ...+++|||+++|.+++|.+++++++++.+.+|+ T Consensus 276 ~lkd~~G~~l~-~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~ 354 (392) T protein:vir:10 276 KLKDKDGKYIL-QSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFT 354 (392) T ss_pred HhhccCCCeEe-ecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhh Confidence 99999999865 5567778889999987654 3332 1 1237899999999999999999999998888999 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +|++.||++.|+|+++++|+||++++++++.. T Consensus 355 ~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~ 386 (392) T protein:vir:10 355 RNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAP 386 (392) T ss_pred cCceEEEEEEeeccEEecccceEEEEeccccc Confidence 99999999999999999999999998855443 No 45 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=7e-58 Score=333.92 Aligned_cols=373 Identities=15% Similarity=0.152 Sum_probs=263.7 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |. ++|+|+++++.++.++++...++. ..+.++.+.++++.+++++++.++..+................... T Consensus 1 M~--k~l~el~~~~~~~~~e~~~~~~~~------~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (392) T protein:vir:10 1 MS--KELRELLAKLEGKKEEVRSLMGED------KVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDG 72 (392) T ss_pred Cc--HHHHHHHHHHHHHHHHHHHHhhHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccc Confidence 87 557777777777777666554322 1233455566666666666554443332222221111111111111 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) .. +....+.............+........... ....+++.|++++|+.+...|++.+.+.++|+++|++ T Consensus 73 ~~---------~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~ 142 (392) T protein:vir:10 73 EM---------EYRDVFMKALRNKPLNAEEREFLEDDLEQRA-MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTV 142 (392) T ss_pred hH---------HHHHHHHHHHhcccccHHHHHHHhhhhhhhh-ccccccCCCceecchhHHHHHHHHHHhhhhhhhhcee Confidence 10 0111111111111111111222222222222 2223445678889999999999999999999999999 Q ss_pred ecccCcceeeeeeccccceeccccccceeecCccccccc-ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRL 238 (419) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l 238 (419) +++.++..+++.... .+...+.||+|++++|++ .++|++|++.++|++++++||+|+++|+ +++++||.+.| T Consensus 143 ~~~~~~~~~~~~~~~------~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l 216 (392) T protein:vir:10 143 EPVRTRSGSRVLEKN------SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWL 216 (392) T ss_pred eeccCCceeEEEEee------cCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHH Confidence 999876655443221 234578999999999986 4899999999999999999999999997 58999999999 Q ss_pred HHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHH-HhhhhhccCCcEEEEehHHHHHHH Q lcl|Aclame:pro 239 TYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAK-TVAEIAGFPPDGVVVHPQDWESIE 317 (419) Q Consensus 239 ~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~ 317 (419) ++++++++|.+|++|+|++.+.|.. .++++.+++ ..+...|..++.|+|||++|..|+ T Consensus 217 ~~~i~~~~d~~~~~g~g~~~~~~~~---------------------~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~ 275 (392) T protein:vir:10 217 GKKSKVTRNVLILGVIEKLTKQAIK---------------------SLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLD 275 (392) T ss_pred HHHHHHHHHHHHhhccccccccCcc---------------------CHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHH Confidence 9999999999999999987654432 256677766 468888888999999999999999 Q ss_pred HHhccCCceeccCCccccCCCcccccceeEe-cCCC-C--------cCcEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 318 LDQAPGSGVFRVIANVQGEATPRIWGLNVVS-TVAI-A--------QGTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 318 ~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~-~~~~-~--------~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) ++||++|+|+| .+++.++.+++|+|+|+++ ++.+ + ...+++|||+++|.+++|.+++++++++.+.+|+ T Consensus 276 ~lkd~~G~~l~-~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~ 354 (392) T protein:vir:10 276 KLKDKDGKYIL-QSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFT 354 (392) T ss_pred HhhccCCCeEe-ecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhh Confidence 99999999865 5567778889999987654 3332 1 1237899999999999999999999998888999 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +|++.||++.|+|+++++|+||++++++++.. T Consensus 355 ~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~ 386 (392) T protein:vir:10 355 RNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAP 386 (392) T ss_pred cCceEEEEEEeeccEEecccceEEEEeccccc Confidence 99999999999999999999999998855443 No 46 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=7e-58 Score=333.92 Aligned_cols=373 Identities=15% Similarity=0.152 Sum_probs=263.7 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |. ++|+|+++++.++.++++...++. ..+.++.+.++++.+++++++.++..+................... T Consensus 1 M~--k~l~el~~~~~~~~~e~~~~~~~~------~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (392) T protein:vir:10 1 MS--KELRELLAKLEGKKEEVRSLMGED------KVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDG 72 (392) T ss_pred Cc--HHHHHHHHHHHHHHHHHHHHhhHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccc Confidence 87 557777777777777666554322 1233455566666666666554443332222221111111111111 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) .. +....+.............+........... ....+++.|++++|+.+...|++.+.+.++|+++|++ T Consensus 73 ~~---------~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~ 142 (392) T protein:vir:10 73 EM---------EYRDVFMKALRNKPLNAEEREFLEDDLEQRA-MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTV 142 (392) T ss_pred hH---------HHHHHHHHHHhcccccHHHHHHHhhhhhhhh-ccccccCCCceecchhHHHHHHHHHHhhhhhhhhcee Confidence 10 0111111111111111111222222222222 2223445678889999999999999999999999999 Q ss_pred ecccCcceeeeeeccccceeccccccceeecCccccccc-ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRL 238 (419) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l 238 (419) +++.++..+++.... .+...+.||+|++++|++ .++|++|++.++|++++++||+|+++|+ +++++||.+.| T Consensus 143 ~~~~~~~~~~~~~~~------~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l 216 (392) T protein:vir:10 143 EPVRTRSGSRVLEKN------SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWL 216 (392) T ss_pred eeccCCceeEEEEee------cCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHH Confidence 999876655443221 234578999999999986 4899999999999999999999999997 58999999999 Q ss_pred HHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHH-HhhhhhccCCcEEEEehHHHHHHH Q lcl|Aclame:pro 239 TYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAK-TVAEIAGFPPDGVVVHPQDWESIE 317 (419) Q Consensus 239 ~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~ 317 (419) ++++++++|.+|++|+|++.+.|.. .++++.+++ ..+...|..++.|+|||++|..|+ T Consensus 217 ~~~i~~~~d~~~~~g~g~~~~~~~~---------------------~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~ 275 (392) T protein:vir:10 217 GKKSKVTRNVLILGVIEKLTKQAIK---------------------SLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLD 275 (392) T ss_pred HHHHHHHHHHHHhhccccccccCcc---------------------CHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHH Confidence 9999999999999999987654432 256677766 468888888999999999999999 Q ss_pred HHhccCCceeccCCccccCCCcccccceeEe-cCCC-C--------cCcEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 318 LDQAPGSGVFRVIANVQGEATPRIWGLNVVS-TVAI-A--------QGTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 318 ~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~-~~~~-~--------~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) ++||++|+|+| .+++.++.+++|+|+|+++ ++.+ + ...+++|||+++|.+++|.+++++++++.+.+|+ T Consensus 276 ~lkd~~G~~l~-~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~ 354 (392) T protein:vir:10 276 KLKDKDGKYIL-QSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFT 354 (392) T ss_pred HhhccCCCeEe-ecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhh Confidence 99999999865 5567778889999987654 3332 1 1237899999999999999999999998888999 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +|++.||++.|+|+++++|+||++++++++.. T Consensus 355 ~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~ 386 (392) T protein:vir:10 355 RNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAP 386 (392) T ss_pred cCceEEEEEEeeccEEecccceEEEEeccccc Confidence 99999999999999999999999998855443 No 47 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=4.1e-58 Score=335.22 Aligned_cols=376 Identities=11% Similarity=0.037 Sum_probs=270.9 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc- Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEAR-GLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEA- 78 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~-~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~- 78 (419) |.-.+.|++.++++.+..++.....++++.+..+.+ ++.+.+.+++++++++++.++...+................. T Consensus 1 Mn~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (421) T protein:vir:13 1 MNLFERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGG 80 (421) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 877767776666666666655555555555443332 335556666666666666555554443333222111111000 Q ss_pred -chhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhh Q lcl|Aclame:pro 79 -GTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADL 157 (419) Q Consensus 79 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~ 157 (419) ................+.+....+....... .+ .+..++.|++++|+.+...|+..++..++|+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~r---a~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l 147 (421) T protein:vir:13 81 RVIINGDSKEEKRSLQLSAMSKTIRGIQLSEE----------ER---DIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEH 147 (421) T ss_pred ccccccchhHHHHHHHHHHHHHhhhccchhHH----------Hh---hccccCCcceecchhhHHHHHHHHHhhhhhhhh Confidence 0000000111111111122111111100000 01 122345578899999999999999999999999 Q ss_pred cceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHH Q lcl|Aclame:pro 158 LDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQG 236 (419) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~ 236 (419) |+++|++++.+++|+..... ...++|++|++.+|+++++|++|++.+++++++++||+|+++|++ ++++||.+ T Consensus 148 ~~~~~~~~~~~~~~~~~~~~------~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~ 221 (421) T protein:vir:13 148 CHVIPVNRNAGKMPVRAGAS------VDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNE 221 (421) T ss_pred ceeeeccCCceEEEEeecCC------ccceeeccccccccccccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHH Confidence 99999999999999866432 234678999999999999999999999999999999999999985 89999999 Q ss_pred HHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHH Q lcl|Aclame:pro 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESI 316 (419) Q Consensus 237 ~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 316 (419) +|+++++.++|..++ +.|+|+++.++. ..++++.+++..+...++.+++|+|||++|..| T Consensus 222 ~la~~~~~~~~~~i~-----~~~~g~~~~~~~---------------~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l 281 (421) T protein:vir:13 222 EFAEFAVNTENAEIV-----KQAKAVLAEETI---------------NDYAGLVKTINSLVPNARKRAIIVTNSDGRAYL 281 (421) T ss_pred HHHHHHHHHhhhhHh-----hhhhhccccccc---------------cchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHH Confidence 999999999987776 467787653321 237889999999999999999999999999999 Q ss_pred HHHhccCCceeccCCccccCCCcccccceeEecCCCCcC-----cEEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 317 ELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG-----TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 317 ~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~-----~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) ++++|++|+|+|. ++..+.+++|+|+||++++++|.+ .+++|||+++|+++++.+++++++++. +|.+|++ T Consensus 282 ~~lkd~~G~~i~~--~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~--~f~~~~~ 357 (421) T protein:vir:13 282 DGLMDKQGRPLLK--ELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEA--GYTKNET 357 (421) T ss_pred HHhhcCCCceeec--CcCCCCCceecceeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEeeccc--ccccCee Confidence 9999999998764 356677889999999999999854 379999999999999999999998864 5999999 Q ss_pred EEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .||++.|+|+++++|+||+.+++..... T Consensus 358 ~~r~~~r~d~~~~~~~a~~~~~~~~~~a 385 (421) T protein:vir:13 358 IARIIERFDVNSPLDKSSDAEKIRKFGV 385 (421) T ss_pred EEEEEeeecceeecchhhheeeecccce Confidence 9999999999999999976655543221 No 48 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=1.8e-58 Score=337.20 Aligned_cols=374 Identities=13% Similarity=0.051 Sum_probs=259.2 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQE-------IVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPL 73 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~-------~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 73 (419) ||++.+|++++.++.++++++.....+... ...+.+..++.++++++++..++++++........... .. T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~---~~ 77 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKG---EA 77 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc---cc Confidence 999999999999998888777665433221 12223334444555555555554444332221111110 00 Q ss_pred cccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhh Q lcl|Aclame:pro 74 TPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLL 153 (419) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~ 153 (419) ........+. ... ..+.++...... ... ...............++.+.|++++|+.+...|++.+...++ T Consensus 78 ~~~~~~~~~~-~~~--~~~~~r~~~~~~---~~~----~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~ 147 (387) T protein:vir:26 78 YQSLSDNEKM-VKA--KAEFYRHAILPN---EFE----KPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQ 147 (387) T ss_pred CCCCchhHHH-HHH--HHHHHHHHHhhh---hHH----HHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhch Confidence 0000011111 010 111122111111 000 001111111111112234557889999999999999999999 Q ss_pred HHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHH Q lcl|Aclame:pro 154 VADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMG 232 (419) Q Consensus 154 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~ 232 (419) |+++++++++++. .+|+... ...++.|++||+.+++++++|+++++.+++++++++||+||++|+ .++++ T Consensus 148 l~~~~~~~~~~~~--~~p~~~~-------~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~ 218 (387) T protein:vir:26 148 LREKARLTNIKGL--EIPRVSY-------TLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVN 218 (387) T ss_pred hhhhceeeecCCc--eeeeeec-------cCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHH Confidence 9999999998764 4555332 235688999999999999999999999999999999999999997 59999 Q ss_pred HHHHHHHHHHHHHHHH-HHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehH Q lcl|Aclame:pro 233 YIQGRLTYGLRFLRDR-QLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQ 311 (419) Q Consensus 233 ~i~~~l~~a~~~~~d~-~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (419) ||.++|+++++++++. .|.+|+|+++|.|+++..++..+ +....+++++++++.+...|+.++.|+||+. T Consensus 219 ~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~---------~~~~~~d~i~~~~~~l~~~y~~na~~imn~~ 289 (387) T protein:vir:26 219 WVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEV---------EGADMYDAIINALADLHEDYRDNATIYMRYA 289 (387) T ss_pred HHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccc---------cccchHHHHHHHHhccChhhhcCCEEEEech Confidence 9999999999999765 56678899999999876544322 2234588999999999999999999999999 Q ss_pred HHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 312 DWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 312 ~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) ++..+.++++.+|++++. +.+.+|+|+||++++.++ .++||||+++|.++ .++.++..++ ..+|++ T Consensus 290 t~~~~~~~~~~~~~~~~~------~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~~--~~~~~~~~~~----~~~~~~ 355 (387) T protein:vir:26 290 DYVKIISVLSNGTTNFFD------TPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINY--DGTTYDTDKD----VKKGEY 355 (387) T ss_pred HHHHHHHHHhcCCCcccc------cCCccccccceEEecCCC--ceeeechhhhhhhh--hhhhheeccc----ccCCce Confidence 999988777777777653 446789999999999875 47999999876654 4555555443 347899 Q ss_pred EEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .|++..|+|+++++|+||+++++++++. T Consensus 356 ~~~~~~r~Dg~v~~~~A~~~l~~ka~~~ 383 (387) T protein:vir:26 356 LFVLTAWYDQQRTLDSAFRIAKAKENTG 383 (387) T ss_pred EEEEEEEeCcEeechhheEEEEeecCCC Confidence 9999999999999999999999987766 No 49 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=1.8e-58 Score=337.20 Aligned_cols=374 Identities=13% Similarity=0.051 Sum_probs=259.2 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQE-------IVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPL 73 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~-------~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 73 (419) ||++.+|++++.++.++++++.....+... ...+.+..++.++++++++..++++++........... .. T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~---~~ 77 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKG---EA 77 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc---cc Confidence 999999999999998888777665433221 12223334444555555555554444332221111110 00 Q ss_pred cccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhh Q lcl|Aclame:pro 74 TPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLL 153 (419) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~ 153 (419) ........+. ... ..+.++...... ... ...............++.+.|++++|+.+...|++.+...++ T Consensus 78 ~~~~~~~~~~-~~~--~~~~~r~~~~~~---~~~----~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~ 147 (387) T protein:vir:94 78 YQSLSDNEKM-VKA--KAEFYRHAILPN---EFE----KPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQ 147 (387) T ss_pred CCCCchhHHH-HHH--HHHHHHHHHhhh---hHH----HHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhch Confidence 0000011111 010 111122111111 000 001111111111112234557889999999999999999999 Q ss_pred HHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHH Q lcl|Aclame:pro 154 VADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMG 232 (419) Q Consensus 154 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~ 232 (419) |+++++++++++. .+|+... ...++.|++||+.+++++++|+++++.+++++++++||+||++|+ .++++ T Consensus 148 l~~~~~~~~~~~~--~~p~~~~-------~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~ 218 (387) T protein:vir:94 148 LREKARLTNIKGL--EIPRVSY-------TLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVN 218 (387) T ss_pred hhhhceeeecCCc--eeeeeec-------cCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHH Confidence 9999999998764 4555332 235688999999999999999999999999999999999999997 59999 Q ss_pred HHHHHHHHHHHHHHHH-HHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehH Q lcl|Aclame:pro 233 YIQGRLTYGLRFLRDR-QLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQ 311 (419) Q Consensus 233 ~i~~~l~~a~~~~~d~-~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (419) ||.++|+++++++++. .|.+|+|+++|.|+++..++..+ +....+++++++++.+...|+.++.|+||+. T Consensus 219 ~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~---------~~~~~~d~i~~~~~~l~~~y~~na~~imn~~ 289 (387) T protein:vir:94 219 WVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEV---------EGADMYDAIINALADLHEDYRDNATIYMRYA 289 (387) T ss_pred HHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccc---------cccchHHHHHHHHhccChhhhcCCEEEEech Confidence 9999999999999765 56678899999999876544322 2234588999999999999999999999999 Q ss_pred HHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 312 DWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 312 ~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) ++..+.++++.+|++++. +.+.+|+|+||++++.++ .++||||+++|.++ .++.++..++ ..+|++ T Consensus 290 t~~~~~~~~~~~~~~~~~------~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~~--~~~~~~~~~~----~~~~~~ 355 (387) T protein:vir:94 290 DYVKIISVLSNGTTNFFD------TPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINY--DGTTYDTDKD----VKKGEY 355 (387) T ss_pred HHHHHHHHHhcCCCcccc------cCCccccccceEEecCCC--ceeeechhhhhhhh--hhhhheeccc----ccCCce Confidence 999988777777777653 446789999999999875 47999999876654 4555555443 347899 Q ss_pred EEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .|++..|+|+++++|+||+++++++++. T Consensus 356 ~~~~~~r~Dg~v~~~~A~~~l~~ka~~~ 383 (387) T protein:vir:94 356 LFVLTAWYDQQRTLDSAFRIAKAKENTG 383 (387) T ss_pred EEEEEEEeCcEeechhheEEEEeecCCC Confidence 9999999999999999999999987766 No 50 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=1.8e-58 Score=337.20 Aligned_cols=374 Identities=13% Similarity=0.051 Sum_probs=259.2 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQE-------IVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPL 73 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~-------~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 73 (419) ||++.+|++++.++.++++++.....+... ...+.+..++.++++++++..++++++........... .. T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~---~~ 77 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKG---EA 77 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc---cc Confidence 999999999999998888777665433221 12223334444555555555554444332221111110 00 Q ss_pred cccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhh Q lcl|Aclame:pro 74 TPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLL 153 (419) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~ 153 (419) ........+. ... ..+.++...... ... ...............++.+.|++++|+.+...|++.+...++ T Consensus 78 ~~~~~~~~~~-~~~--~~~~~r~~~~~~---~~~----~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~ 147 (387) T protein:vir:96 78 YQSLSDNEKM-VKA--KAEFYRHAILPN---EFE----KPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQ 147 (387) T ss_pred CCCCchhHHH-HHH--HHHHHHHHHhhh---hHH----HHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhch Confidence 0000011111 010 111122111111 000 001111111111112234557889999999999999999999 Q ss_pred HHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHH Q lcl|Aclame:pro 154 VADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMG 232 (419) Q Consensus 154 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~ 232 (419) |+++++++++++. .+|+... ...++.|++||+.+++++++|+++++.+++++++++||+||++|+ .++++ T Consensus 148 l~~~~~~~~~~~~--~~p~~~~-------~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~ 218 (387) T protein:vir:96 148 LREKARLTNIKGL--EIPRVSY-------TLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVN 218 (387) T ss_pred hhhhceeeecCCc--eeeeeec-------cCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHH Confidence 9999999998764 4555332 235688999999999999999999999999999999999999997 59999 Q ss_pred HHHHHHHHHHHHHHHH-HHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehH Q lcl|Aclame:pro 233 YIQGRLTYGLRFLRDR-QLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQ 311 (419) Q Consensus 233 ~i~~~l~~a~~~~~d~-~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (419) ||.++|+++++++++. .|.+|+|+++|.|+++..++..+ +....+++++++++.+...|+.++.|+||+. T Consensus 219 ~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~---------~~~~~~d~i~~~~~~l~~~y~~na~~imn~~ 289 (387) T protein:vir:96 219 WVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEV---------EGADMYDAIINALADLHEDYRDNATIYMRYA 289 (387) T ss_pred HHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccc---------cccchHHHHHHHHhccChhhhcCCEEEEech Confidence 9999999999999765 56678899999999876544322 2234588999999999999999999999999 Q ss_pred HHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 312 DWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 312 ~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) ++..+.++++.+|++++. +.+.+|+|+||++++.++ .++||||+++|.++ .++.++..++ ..+|++ T Consensus 290 t~~~~~~~~~~~~~~~~~------~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~~--~~~~~~~~~~----~~~~~~ 355 (387) T protein:vir:96 290 DYVKIISVLSNGTTNFFD------TPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINY--DGTTYDTDKD----VKKGEY 355 (387) T ss_pred HHHHHHHHHhcCCCcccc------cCCccccccceEEecCCC--ceeeechhhhhhhh--hhhhheeccc----ccCCce Confidence 999988777777777653 446789999999999875 47999999876654 4555555443 347899 Q ss_pred EEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .|++..|+|+++++|+||+++++++++. T Consensus 356 ~~~~~~r~Dg~v~~~~A~~~l~~ka~~~ 383 (387) T protein:vir:96 356 LFVLTAWYDQQRTLDSAFRIAKAKENTG 383 (387) T ss_pred EEEEEEEeCcEeechhheEEEEeecCCC Confidence 9999999999999999999999987766 No 51 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=7e-58 Score=333.94 Aligned_cols=374 Identities=13% Similarity=0.047 Sum_probs=254.6 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQE-------IVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPL 73 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~-------~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 73 (419) ||++.+|++++.++.++.+++....++... ..++.++.++.++++++.++.++++++......... ... T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~----~~~ 76 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVKDIEEKEKAKVKD----TGE 76 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----ccc Confidence 999999999888888877766554433211 112233334444445554444444333222111110 001 Q ss_pred cccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhh Q lcl|Aclame:pro 74 TPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLL 153 (419) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~ 153 (419) .+............+ .+.++........ . ...............++.+.|+++||+.+.+.|++.+...++ T Consensus 77 ~~~~~~~~~~~~~~~--~~~~r~~~~~~~~---~----~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~ 147 (387) T protein:vir:93 77 AYQSLNDHEKMVKAK--AEFYRHAILPNEF---E----KPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQ 147 (387) T ss_pred cCCCcchhhHHHHHH--HHHHHHHhhhhhh---h----hhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhch Confidence 111111111111111 1111211111110 0 000000111111122344567889999999999999999999 Q ss_pred HHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHH Q lcl|Aclame:pro 154 VADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMG 232 (419) Q Consensus 154 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~ 232 (419) |+++|+++++++. .+|+... +..++.|++|++.+++++++|++|++.++|++++++||+||++|+ .++++ T Consensus 148 l~~~~~v~~~~~~--~~p~~~~-------~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~ 218 (387) T protein:vir:93 148 LREKARLTNIKGL--EIPRVSY-------TLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVN 218 (387) T ss_pred hhhheeeeecCCc--eEEEEee-------cCCccccccCcccccccccccceeeeeheeeeeechhhHHHHhhhHHHHHH Confidence 9999999998754 4555332 235688999999999999999999999999999999999999997 59999 Q ss_pred HHHHHHHHHHHHHHHH-HHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehH Q lcl|Aclame:pro 233 YIQGRLTYGLRFLRDR-QLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQ 311 (419) Q Consensus 233 ~i~~~l~~a~~~~~d~-~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (419) ||.++|+++++++++. .|.+|+|+++|.|+++..++.. .+....+++++++++.+...|+.++.|+||+. T Consensus 219 ~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~---------v~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~ 289 (387) T protein:vir:93 219 WVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKE---------VEGADMYDAIINALADLHEDYRDNATIYMRYA 289 (387) T ss_pred HHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccccc---------ccccchHHHHHHHHhccChhhhcCCEEEEech Confidence 9999999999999776 5667899999999987544322 12234588999999999999999999999999 Q ss_pred HHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 312 DWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 312 ~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) ++..+..+++.+|++++. +.+.+|+|+||++++.++ .+++|||+++|.++ .++.+....+ +.++++ T Consensus 290 t~~~~~~~~~d~~~~~~~------~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~~--~~~~~~~~~~----~~~~~~ 355 (387) T protein:vir:93 290 DYVKIISVLSNGTTNFFD------TPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINY--DGTTYDTDKD----VKKGEY 355 (387) T ss_pred HHHHHHHHHhcCCCcccc------cCCccccccceEEecCCC--ceeeeehhhhheeh--hhheeeeccc----ccCCce Confidence 988876665555666553 345689999999999876 47999999876643 4555554433 467999 Q ss_pred EEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .|++..|+|+++++|+||+.+++++++. T Consensus 356 ~~~~~~r~d~~v~~~eA~~~l~~k~~~~ 383 (387) T protein:vir:93 356 LFVLTAWYDQQRTLDSAFRIAKAKENTG 383 (387) T ss_pred eEEEEeeeCceeechhheEEEEeecCCC Confidence 9999999999999999999999977666 No 52 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=6.3e-57 Score=328.71 Aligned_cols=383 Identities=12% Similarity=0.098 Sum_probs=264.1 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEAR-----GLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTP 75 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~-----~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 75 (419) |..-..|+++.+++++..+++....++.+++.++.+ ...++++++++.+..+++.+++.....+........... T Consensus 1 ~~l~e~i~e~~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~ 80 (400) T protein:vir:38 1 MTLDEKLAAVKKQLDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSG 80 (400) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 777777776666666666666666666655443321 233444555566666665555554443333222111111 Q ss_pred c--ccchhhhhhHHHHhHHHHHH---HHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhh Q lcl|Aclame:pro 76 A--EAGTFRSLAQRFADSDGLRE---YRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDL 150 (419) Q Consensus 76 ~--~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~ 150 (419) . .....+.............. .............................+.....|++++|+.+.+.|++.+.. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~ 160 (400) T protein:vir:38 81 KKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQT 160 (400) T ss_pred ccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHh Confidence 1 11111111111110000000 000000000000000000111111122223345667889999999999999999 Q ss_pred hhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccc-cccceeeEEeeeEEEEEeehhhHHHHhhH-H Q lcl|Aclame:pro 151 PLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ-STLSFDTITTTLKTVAHWLPITRQAADDN-S 228 (419) Q Consensus 151 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~ 228 (419) .+.|+++++++|++++.+++|+.+.. .+.+.|++|++.+|+ ++++|++|++.++|++++++||+||++|+ + T Consensus 161 ~~~l~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~ 233 (400) T protein:vir:38 161 VVDLKPFTNVFQASTQKGTYPTVANA-------TTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAI 233 (400) T ss_pred hhhhhhcceeEeccCcceEEEEEecC-------CCccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhHH Confidence 99999999999999988889886532 346889999999987 57999999999999999999999999997 5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEE Q lcl|Aclame:pro 229 QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVV 308 (419) Q Consensus 229 ~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (419) ++++||.++++++++.++|.+|++|+|++.+.|+.+ ++++.+++...... ..+++|+| T Consensus 234 ~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~---------------------~~~~~~~~~~~~~~-~~~a~~v~ 291 (400) T protein:vir:38 234 DLVGLIAQNGQQIKVNTTNGAVATLLKGFTAKTISS---------------------VDDLKHINNVDLDP-AYSRVIIA 291 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc---------------------HHHHHHHHHhhhhh-hhCcEEEE Confidence 899999999999999999999999998766554421 45555555433333 34679999 Q ss_pred ehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcC-----cEEEEeccceEEEEEecceEEEEeeccc Q lcl|Aclame:pro 309 HPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG-----TALVGGFRQGATLWSRQGITVLMTDSHA 383 (419) Q Consensus 309 ~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~-----~~~~~d~~~~~~~~~~~~~~i~~~~~~~ 383 (419) ||++|..|++++|++|+|+| .+++.++.+++|+|+||++++.+|.+ .+++|||+++|+++++.++++.++++.. T Consensus 292 ~~~~~~~l~~lkd~~G~~i~-~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~ 370 (400) T protein:vir:38 292 SQSFYNFLDTVKDGNGRYLL-QDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMVRWVDDQI 370 (400) T ss_pred cHHHHHHHHHhhccCCCeee-ecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecceEEEEecccc Confidence 99999999999999999865 45777888899999999999998753 3799999999999999999999987542 Q ss_pred chhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|Aclame:pro 384 DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 384 ~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~ 418 (419) | ...||++.|+|+++++|+||+.++++++- T Consensus 371 --~---~~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 371 --Y---GQFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred --c---ceeEEEEEEeccEEecccceEEEEeecCC Confidence 3 45799999999999999999999998887 No 53 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=6.5e-58 Score=334.12 Aligned_cols=374 Identities=13% Similarity=0.067 Sum_probs=256.0 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQE-------IVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPL 73 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~-------~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 73 (419) ||++.+|++++.++.++.+++.....+... ...+.+..++.++++++.+.+++++++.......... ... T Consensus 16 mk~l~el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~---~~~ 92 (402) T protein:vir:93 16 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDK---GEA 92 (402) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc---ccc Confidence 999999999988888777766554433211 1122233344455555555555444433222111111 100 Q ss_pred cccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhh Q lcl|Aclame:pro 74 TPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLL 153 (419) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~ 153 (419) ........+ .... ..+.++....... ....... ......... .++.+.|+++||+.+...|++.+...++ T Consensus 93 ~~~~~~~~~-~~~~--~~~~~r~~~~~~~---~~~~~~~---~~~~~~a~~-~~t~~~GG~lIP~~~~~~Ii~~~~~~~~ 162 (402) T protein:vir:93 93 YQSLSDNEK-MVKA--KAEFYRHAILPNE---FEKPSME---AQRLLHALP-TGNDSGGDKLLPKTLSKEIVSEPFAKNQ 162 (402) T ss_pred CCCCchhHH-HHHH--HHHHHHHHHhhhh---HHHHHHh---HHHHHhhhc-cCCCcCCccccchhHHHHHHHhHHhhhh Confidence 000111111 1111 1111221111110 0001000 011111111 2234557889999999999999999999 Q ss_pred HHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHH Q lcl|Aclame:pro 154 VADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMG 232 (419) Q Consensus 154 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~ 232 (419) ++++|+++++++. .+|+... ...++.|++|++.+++++++|+++++.+++++++++||+||++|+ .++++ T Consensus 163 l~~~~~v~~~~~~--~~p~~~~-------~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~ 233 (402) T protein:vir:93 163 LREKARLTNIKGL--EIPRVSY-------TLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVN 233 (402) T ss_pred hhhhceeeecCCc--eeeeeec-------cCCccccccccccccccccccceeeecceeeeeechhhHHHHhhhHHHHHH Confidence 9999999998754 4555332 234688999999999999999999999999999999999999997 59999 Q ss_pred HHHHHHHHHHHHHHHH-HHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehH Q lcl|Aclame:pro 233 YIQGRLTYGLRFLRDR-QLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQ 311 (419) Q Consensus 233 ~i~~~l~~a~~~~~d~-~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (419) ||.++|+++++++++. .|.+|+|+++|.|+++..++..+ +....+++++++++.+...|..++.|+||+. T Consensus 234 ~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~---------~~~~~~d~l~~~~~~l~~~y~~na~~imn~~ 304 (402) T protein:vir:93 234 WVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEV---------EGADMYDAIINALADLHEDYRDNATIYMRYA 304 (402) T ss_pred HHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccc---------cccchHHHHHHHHhccChhhhcCCEEEEech Confidence 9999999999999765 56678999999999875544322 2233588999999999999999999999999 Q ss_pred HHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 312 DWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 312 ~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) ++..++++++.+|++++. +.+.+|+|+||++++.++ .+++|||+++|.+++ ++.++..+. ..+|++ T Consensus 305 t~~~~~~~~~d~~~~~~~------~~~~~llG~PV~~t~~~~--~i~~GDf~~~~~~~~--~~~~~~~~~----~~~~~~ 370 (402) T protein:vir:93 305 DYVKIISVLSNGTTNFFD------TPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINYD--GTTYDTDKD----VKKGEY 370 (402) T ss_pred HHHHHHHHHhcCCCcccc------cCCccccccceEEecCCC--ceeeechhhhhhhhh--hhhhhhhhc----ccCCce Confidence 998888777777776653 446789999999999876 579999998776553 444444332 236999 Q ss_pred EEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .|++..|+|+++++|+||+.+++++++. T Consensus 371 ~~~~~~r~Dg~v~~~~A~~~l~ik~~~~ 398 (402) T protein:vir:93 371 LFVLTAWYDQQRTLDSAFRIAKAKENTG 398 (402) T ss_pred EEEEEEEeCcEEechhheEEEEeecCCC Confidence 9999999999999999999999977655 No 54 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=5.8e-57 Score=328.88 Aligned_cols=400 Identities=14% Similarity=0.094 Sum_probs=248.5 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc- Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQE--IVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAE- 77 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~--~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~- 77 (419) ...+.+|++++.+++++.+++....++.+. .....++..+.++++..+++++++.++.++...+............. T Consensus 16 ~~~l~el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~~le~el~e~~~~~~~~~ 95 (466) T protein:vir:80 16 KAALAELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIKELENELEQLNNKEPKNN 95 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Confidence 333344444444444444444333322211 01111223444444444454444444444333332222111111100 Q ss_pred ---cchhhhhhHHHHh-HHHHHHHHHhh-hhhhh----hHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhh Q lcl|Aclame:pro 78 ---AGTFRSLAQRFAD-SDGLREYRARD-KRGQF----QVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTP 148 (419) Q Consensus 78 ---~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~ 148 (419) ............. ....+...... ...+. ....+........ ........+++++++|+.+.+.|+..+ T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~g~~~~vP~~~~~~i~~~l 173 (466) T protein:vir:80 96 SEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRT--LAQQKRAVSGAELTIPDVMLELLRDNM 173 (466) T ss_pred chhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHH--HhhhhhhhccccccccHHHHHHHHHhh Confidence 0000000000000 00011100000 00000 0011111111110 111122234566889999999999999 Q ss_pred hhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHH Q lcl|Aclame:pro 149 DLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS 228 (419) Q Consensus 149 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~ 228 (419) ...++++++|++.++++. .++++.+ ....+.|++|++.+|+++++|++|++.+++++++++||++|++|++ T Consensus 174 ~~~~~l~~~~~v~~~~g~-~~~~~~~--------~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~ 244 (466) T protein:vir:80 174 HRYSKLISKVRLRPLKGT-ARQNIAG--------AIPEGVWTEAVANLNELSLSFSQIEVDGYKVGGFIPIPNSTLEDSD 244 (466) T ss_pred hhhhhhhhheeeeecCce-eEeeeec--------CCcceeecccccccccccccccceeecceeeeeehhhhHHHHhcch Confidence 999999999999998753 4565543 3346889999999999999999999999999999999999999985 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccc------------------hhhhHHHHH Q lcl|Aclame:pro 229 -QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPA------------------TDEPPLVDI 289 (419) Q Consensus 229 -~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~ 289 (419) ++++||+.+|+++++.++|.+||+|+|+++|.||++..+..+.......... .....+.++ T Consensus 245 ~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (466) T protein:vir:80 245 LNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSEL 324 (466) T ss_pred HHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHH Confidence 8999999999999999999999999999999999986544333222111110 011122333 Q ss_pred HHHHHhhhhhccCCc-EEEEehHHHHHHHHHhc---cCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccce Q lcl|Aclame:pro 290 RRAKTVAEIAGFPPD-GVVVHPQDWESIELDQA---PGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQG 365 (419) Q Consensus 290 ~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~kd---~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~ 365 (419) ...+......+.+++ .|+||+.++..|..++. .+|.+.+ .+.+ ...|+|+||+++++||++++++|+|+. T Consensus 325 ~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~---~~~~--~~~i~G~pvv~s~~~~~~~~~~g~~~~- 398 (466) T protein:vir:80 325 VLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVA---SLNN--TMPIVGGDIVILDFIPDNDIIGGYGSL- 398 (466) T ss_pred HHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccccc---cCCC--cccccccceeecCccCccceeeecccc- Confidence 333444455555554 59999999999998873 3443322 1112 235899999999999999999999886 Q ss_pred EEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 366 ATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 366 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) |.+++|.++++..+.+ .+|.+|++.||+..|+|+++++|+||++++++.... T Consensus 399 y~i~~r~~~~i~~~~~--~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~ 450 (466) T protein:vir:80 399 YLLAERADIKLAQSEH--VRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANP 450 (466) T ss_pred EEEEeecceEEEechh--hhhhcCcEEEEEEEEEccEEeccCceEEEEecCCCc Confidence 6788999999998875 459999999999999999999999999998887644 No 55 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=7.7e-57 Score=328.23 Aligned_cols=407 Identities=14% Similarity=0.092 Sum_probs=258.7 Q ss_pred CCc-cHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc- Q lcl|Aclame:pro 1 MPP-TPTLEEQRAALLARLDDTSLTTEQ------VQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTP- 72 (419) Q Consensus 1 M~~-~~~L~e~~~~l~~~~~~~~~~~~~------~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~- 72 (419) |.. +.+|+++++++.++++.+....+. .++..++.++..++++++++.+++.++++++.............. T Consensus 8 m~~~i~eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~~~~~~~~~~~~~~~~~ 87 (477) T protein:vir:84 8 LRALRAAAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIRELESEIERSGKLEAET 87 (477) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh Confidence 322 344444455444444433222210 111223334445556666665555444444433322211110000 Q ss_pred --cc-ccccchhhhhhHHHHhHHHHHHHHHhhh----hhhh------------hHHHHHHHHHHhhhcccccccccCCcc Q lcl|Aclame:pro 73 --LT-PAEAGTFRSLAQRFADSDGLREYRARDK----RGQF------------QVEMRDIDPNRLLSRDAPAGTITNPNV 133 (419) Q Consensus 73 --~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (419) .. .......+............+....... .... ........ ...........+...+|. T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~gg~ 166 (477) T protein:vir:84 88 KTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIA-KVGEEYRDLDRNGGTGGY 166 (477) T ss_pred hhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHH-HhhhhhccccccCCCcce Confidence 00 0000011111110000011111000000 0000 00000000 001111222233344566 Q ss_pred cccchhhhHHHHHhhhhhhhHHhhcceecccC--cceeeeeeccccceeccccccceeecCcc-----cccccccceeeE Q lcl|Aclame:pro 134 PHLPQLVPGIVPTTPDLPLLVADLLDQQNADY--NVLEYIRDTSGTAGAGSTWNKAAVVPEGT-----AKPQSTLSFDTI 206 (419) Q Consensus 134 ~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~-----~~~~~~~~~~~v 206 (419) +++|+.+.+.|++.++..++++++++++++.+ +.+.||+.++. ...+.|++||+ .+|+++++|+++ T Consensus 167 lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~-------~~~a~~~~Eg~~~~~~~~~~s~~~f~~i 239 (477) T protein:vir:84 167 AVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTG-------TSTAIQAADNAALTAPSAHEVDLTDGFV 239 (477) T ss_pred eeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecC-------cceeeeeccCcccccccccccccceeeE Confidence 66777788999999999999999999988754 46788886542 23567999985 467889999999 Q ss_pred EeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCc-ccccceecccccccccccccc-ccchhh Q lcl|Aclame:pro 207 TTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGS-TEMQGILTTPGIGTYQQPKPT-APATDE 283 (419) Q Consensus 207 ~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~-~~p~Gi~~~~~~~~~~~~~~~-~~~~~~ 283 (419) +++++|++++++||+||++|+ +++++||.++|+++++.++|.+||+|+|+ ++|.||++.+++......... +..... T Consensus 240 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~~~~ 319 (477) T protein:vir:84 240 QANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSALEKHQ 319 (477) T ss_pred EEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccchhhHH Confidence 999999999999999999996 59999999999999999999999999997 589999998876554443222 222234 Q ss_pred hHHHHHHHHHHhhhhhccCC-cEEEEehHHHHHHHHHhccCCceeccCC------------ccccCCCcccccceeEecC Q lcl|Aclame:pro 284 PPLVDIRRAKTVAEIAGFPP-DGVVVHPQDWESIELDQAPGSGVFRVIA------------NVQGEATPRIWGLNVVSTV 350 (419) Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~kd~~g~~~~~~~------------~~~~~~~~~l~G~pv~~~~ 350 (419) ..++++++++..+...+..+ ++|+|||+++..|++++|.+|+|+|.+. .+..+.+++|+|+||++++ T Consensus 320 ~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~ 399 (477) T protein:vir:84 320 IIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDP 399 (477) T ss_pred HHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecC Confidence 56777888888888877654 4799999999999999999999977542 2444566799999999999 Q ss_pred CCCcC--------cEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEE-ecccceEEEEecCCCC Q lcl|Aclame:pro 351 AIAQG--------TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAV-YQPKAFVRVTFAAATT 419 (419) Q Consensus 351 ~~~~~--------~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~-~~~~a~~~~~~~aa~~ 419 (419) .||++ .++||||+++ +++. .++.+.++++. ++.++++.|++..++++.. +||+||++++.++.++ T Consensus 400 ~~p~~~~~~~d~~~i~~gd~~~~-~i~~-~~~~~~~~~~~--~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~~~ 473 (477) T protein:vir:84 400 TLPTTLGTGTDQDVIHVLRASDL-ALFE-SSVRMRALQET--RAENLSVLLQVYGYLAFTAARFPQSVVEIGGTALTA 473 (477) T ss_pred cccccccccCCcceEEEEEeceE-EEEe-eceeEEecccc--ccccceeeeeehhhhhhhhhccccceEEeecccccc Confidence 99964 4799999864 4454 46778777654 4677889999988888755 5699999999988877 No 56 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=8e-56 Score=322.65 Aligned_cols=370 Identities=14% Similarity=0.096 Sum_probs=242.6 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc- Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAG- 79 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~- 79 (419) |..+++ .++++++++++++...++.........+..++++.+++++.++++.++.++...+............... T Consensus 1 meeL~~---~~~~~~~~~~e~~~~l~~~~~~~~~~~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 77 (389) T protein:vir:10 1 MDKLQT---LFNDVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKALEAEKPAEPKTEPKDDGS 77 (389) T ss_pred ChHHHH---HHHHHHHHHHHHHHHHHHHHHhHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 555444 4444444444444333322111111112223333334333333333333333322211111110000000 Q ss_pred -hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhc Q lcl|Aclame:pro 80 -TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL 158 (419) Q Consensus 80 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~ 158 (419) .......... ....+.+....+... .........++..|++++|+.+...|+..+...++|+++| T Consensus 78 ~~~~~~~~~~~-~~~~~~~~~~lr~~~-------------~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~ 143 (389) T protein:vir:10 78 KKGTDLSKKPI-DAKKKAINDFIHSHG-------------KVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLV 143 (389) T ss_pred ccccccchhHH-HHHHHHHHHHhhcch-------------hhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhc Confidence 0000000000 001111111111100 0111112334456788999999999999999999999999 Q ss_pred ceecccCcceeeeeeccccceeccccccceeecCcccccc-cccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHH Q lcl|Aclame:pro 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ-STLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQG 236 (419) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~ 236 (419) +++|++++.+.+|+... ....+.|++|++++|+ ++++|+++++.++++++++++|+|+++|+. ++++||.+ T Consensus 144 ~~~~~~~~~~~~~~~~~-------~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~ 216 (389) T protein:vir:10 144 TKTPVTTPKGTYPILKR-------ATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTALVGQ 216 (389) T ss_pred ceeeccCCeeEEEEEec-------CCCccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHHHHHH Confidence 99999998889888754 2346679999999986 689999999999999999999999999974 89999999 Q ss_pred HHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHh-hhhhccCCcEEEEehHHHHH Q lcl|Aclame:pro 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTV-AEIAGFPPDGVVVHPQDWES 315 (419) Q Consensus 237 ~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 315 (419) +|++++++++|.+|++|.|++.+.|.. ....++++.++++. +...+ +++|+||+.+|.. T Consensus 217 ~la~~~~~~~~~~i~~g~~~~~~~~~~------------------~~~~~d~l~~~~~~~~~~~~--~a~~~~n~~~~~~ 276 (389) T protein:vir:10 217 SIKEKSVNTYNAMIAPVLQSFTAKKTT------------------TDTLVDSLKHILNVDLDPAY--SRALVVTQSLFNT 276 (389) T ss_pred HHHHHHHHHHHHHHhhhhccccccccc------------------ccccHHHHHHHHHhhhhhhh--CcEEEecHHHHHH Confidence 999999999999999998876554321 12235666666653 33333 5789999999999 Q ss_pred HHHHhccCCceeccCCcc---ccCCCcccccceeEecCCC-CcC-----cEEEEeccceEEEEEecceEEEEeecccchh Q lcl|Aclame:pro 316 IELDQAPGSGVFRVIANV---QGEATPRIWGLNVVSTVAI-AQG-----TALVGGFRQGATLWSRQGITVLMTDSHADFF 386 (419) Q Consensus 316 l~~~kd~~g~~~~~~~~~---~~~~~~~l~G~pv~~~~~~-~~~-----~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~ 386 (419) |+++||++|+|+|.++.. .++.+++|+|+||++++.+ +.+ .+++|||+++|.+++++++++.++++. .| T Consensus 277 L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~ 354 (389) T protein:vir:10 277 LDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDSK--IY 354 (389) T ss_pred HHHhhccCCCeeeecCcccccccccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEeeccc--cc Confidence 999999999988754321 2345679999999876543 322 279999999999999999999988754 35 Q ss_pred hcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 387 TANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 387 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .+ .+|++.|+|+++++|+||++++++++++ T Consensus 355 ~~---~~~~~~r~d~~~~~~~a~~~~~~~~~~~ 384 (389) T protein:vir:10 355 GK---YLGAAFRFGVQKADSKAGYFVTNTDVPG 384 (389) T ss_pred cc---eEEEEEEeccEEecccceEEEEeeccCC Confidence 44 6899999999999999999999998888 No 57 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=2.2e-57 Score=331.23 Aligned_cols=359 Identities=11% Similarity=0.004 Sum_probs=253.6 Q ss_pred CCcc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MPPT-PTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAG 79 (419) Q Consensus 1 M~~~-~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (419) |... +++++.+++..+ +.+..++.. ..++..+.++ +..+.+.+++.+....+ .. .... ........ T Consensus 1 M~i~~k~~~~~~~~~~~----l~~~~~~~~-~~ee~~~~~~---~~~~~~~~~~~~~~~~e-~~-~~~~-~~~~~~~l-- 67 (377) T protein:vir:98 1 MAINLKELPKYREAVAE----LSAKISAGA-TSEEQEKLFE---AAFTTMGDEILAKNEEE-ME-RMFD-LRDKNREL-- 67 (377) T ss_pred CCCcHHHHHHHHHHHHH----HHHHHHhhh-hhHHHHHHHH---HHHHhHHHHHHHHHHHH-HH-HHHH-hccCCccc-- Confidence 5443 223333222222 222221111 1111111122 22222222222111110 00 0000 00000000 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcc Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLD 159 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~ 159 (419) ..+. +........ ....+.+++++|+.+.+.|++.+...++++++|+ T Consensus 68 ----------t~ee-----------------~~~~~~~~~------~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~ 114 (377) T protein:vir:98 68 ----------TAEE-----------------IKFFNDIDK------NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVIN 114 (377) T ss_pred ----------CHHH-----------------HHHHHHHHh------ccCCCCCccccCHHHHHHHHHHHHHhhhhhhhee Confidence 0000 011111111 1234567889999999999999999999999999 Q ss_pred eecccCcceeeeeeccccceeccccccceeecCccccc-ccccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHHH Q lcl|Aclame:pro 160 QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKP-QSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQGR 237 (419) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~~ 237 (419) +.++++. .++|+.++ .+.+.|++|+++.+ +++++|+++++.++|++++++||++|++|++ ++++||+++ T Consensus 115 v~~~~~~-~~~~~~~~--------~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~ 185 (377) T protein:vir:98 115 FKNTSLR-LKALTAET--------SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQ 185 (377) T ss_pred eEecCcc-eEEEEecC--------CcceeEeecccccCcccCccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHH Confidence 9998654 68887553 45789999988765 5789999999999999999999999999986 899999999 Q ss_pred HHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHH Q lcl|Aclame:pro 238 LTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIE 317 (419) Q Consensus 238 l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 317 (419) +++++++++|.+|++|+|+++|.||++..+.............+.....+.+.++...++..|+.+++|+||+.++..++ T Consensus 186 la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~ 265 (377) T protein:vir:98 186 LKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKK 265 (377) T ss_pred HHHHHHHHHhhceEeccCCCcceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999876554444444444444444556778888888999999999999999999999 Q ss_pred HHhccCCceeccCCc-------------cccCCCcccccce--eEecCCCCcCcEEEEeccceEEEEEecceEEEEeecc Q lcl|Aclame:pro 318 LDQAPGSGVFRVIAN-------------VQGEATPRIWGLN--VVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSH 382 (419) Q Consensus 318 ~~kd~~g~~~~~~~~-------------~~~~~~~~l~G~p--v~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~ 382 (419) ++||.+|+++|+... ..++.+.+++|+| |+.+++||++++++|||++ |.+++|.+++++.+++. T Consensus 266 klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~-Y~i~~r~~~~i~~~~~~ 344 (377) T protein:vir:98 266 RPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANR-YDAFMATASTIEEYDQT 344 (377) T ss_pred hhhccCCceEEEecccchhhccccccccCCCCccccccCCCceEEecCCCCcccEEEEEecc-eeEEeecceEEEeechh Confidence 999999999874211 1235556899988 6788999999999999998 88899999999988764 Q ss_pred cchhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 383 ADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 383 ~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) +|.+|++.||++.|+|+++++|+||++++++.- T Consensus 345 --~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 345 --FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred --hhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 699999999999999999999999999999999 No 58 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=1e-55 Score=322.08 Aligned_cols=371 Identities=14% Similarity=0.112 Sum_probs=249.9 Q ss_pred CCccHHHHHHHHHHHHHHHHHH-HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTS-LTTEQVQ---EIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPA 76 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~-~~~~~~~---~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 76 (419) |..+++|.+++++..+.+.... ...++.. +..++.+..++.++.+.++++.+++.++................... T Consensus 1 M~~l~~l~~~~~~~~~e~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 80 (394) T protein:vir:10 1 MDKLQTLFNEVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPVDNAQPN 80 (394) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhccc Confidence 8887777666554433333221 1111111 11222333444555555555555554443332222111111110000 Q ss_pred ccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHh Q lcl|Aclame:pro 77 EAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVAD 156 (419) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~ 156 (419) .. ..... ......+++....+..... ........+++.|++++|+.+...|++.++..++|++ T Consensus 81 ~~----~~~~~-~~~~~~~~~~~~l~~~~~~------------~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~ 143 (394) T protein:vir:10 81 GT----DLKKK-PIDAKKKAINDFIHSHGKV------------IDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLST 143 (394) T ss_pred cc----chhhh-HHHHHHHHHHHHHhccchh------------hhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhh Confidence 00 00000 0011112222221111100 0111122345667789999999999999999999999 Q ss_pred hcceecccCcceeeeeeccccceeccccccceeecCcccccc-cccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHH Q lcl|Aclame:pro 157 LLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ-STLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYI 234 (419) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i 234 (419) +|+++|++++++.||.... ....+.|++|++.+|+ ++++|++|++.++|++++++||+|+++|+ +++++|| T Consensus 144 ~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i 216 (394) T protein:vir:10 144 LVTKTPVTTPKGTYPILKR-------ATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLV 216 (394) T ss_pred hceeeeccCCceEEEEEec-------CCCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHH Confidence 9999999998888887653 2346789999999997 56999999999999999999999999997 5899999 Q ss_pred HHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHH Q lcl|Aclame:pro 235 QGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWE 314 (419) Q Consensus 235 ~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (419) .++|++++++++|.+|++|+|++.|.++.+ ...++++.+++.......+ +++|+|||++|. T Consensus 217 ~~~la~~~~~~~~~~il~g~g~~~~~~~~~------------------~~~~d~l~~~~~~~~~~~~-~a~~vmn~~~~~ 277 (394) T protein:vir:10 217 GQSINEKSVNTYNAMIAPVLQSFTAKATTT------------------DTLVDSLKHILNVDLDPAY-SRALVVTQSLFN 277 (394) T ss_pred HHHHHHHHHHHHHHHHhhcccccccccccc------------------cccHHHHHHHHHhhhhhhc-cCEEEecHHHHH Confidence 999999999999999999999877655422 1234566666654333333 578999999999 Q ss_pred HHHHHhccCCceeccCCc---cccCCCcccccceeEecCCC--CcC----cEEEEeccceEEEEEecceEEEEeecccch Q lcl|Aclame:pro 315 SIELDQAPGSGVFRVIAN---VQGEATPRIWGLNVVSTVAI--AQG----TALVGGFRQGATLWSRQGITVLMTDSHADF 385 (419) Q Consensus 315 ~l~~~kd~~g~~~~~~~~---~~~~~~~~l~G~pv~~~~~~--~~~----~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~ 385 (419) +|++++|++|+|+|.+.. ...+.+++|+|+||++++.+ |.+ .+++|||+++|+++++.++++.++++.. T Consensus 278 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~-- 355 (394) T protein:vir:10 278 TLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSKI-- 355 (394) T ss_pred HHHHhhccCCCeeeeccccccccCCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEecccc-- Confidence 999999999998764322 22345679999999987643 321 3799999999999999999999877543 Q ss_pred hhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 386 FTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 386 ~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) |.+ .+|++.|+|+++++|+||+.++++++.+ T Consensus 356 ~~~---~~~~~~r~d~~~~~~~ai~~~~~~~~~~ 386 (394) T protein:vir:10 356 YGR---YLGAAFRFGVKQADSNAGYFVTNTDAAS 386 (394) T ss_pred cce---eEEEEEEeccEEeccccEEEEEeecccC Confidence 544 6899999999999999999999999888 No 59 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=1.3e-55 Score=321.42 Aligned_cols=358 Identities=13% Similarity=0.057 Sum_probs=238.6 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcccccccccchhh Q lcl|Aclame:pro 4 TPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARA-ALLRTAPPAPKGPADGGTPLTPAEAGTFR 82 (419) Q Consensus 4 ~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (419) +++|+++++++....+++....++.....++ .+.+++ ..++..... .+.+........ T Consensus 1 ik~L~e~~~e~~e~~~~~~~~~~~~~~~~e~-~~~~~~---~~~~~~~~~~~~~~~~~~~~~~----------------- 59 (390) T protein:vir:40 1 MNNLDKKDSETLNISTAFLNAIKEGATEAEQ-VTAFTN---MAEQIQNNIIAQARKEVNREMN----------------- 59 (390) T ss_pred CchHHHHHHHHHHHHHHHHHHHhhhhhHHHH-HHHHHH---HHHHHHHHHHHHHHHHHHHHHH----------------- Confidence 6666666665554444333332222111111 111111 111111100 000000000000 Q ss_pred hhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceec Q lcl|Aclame:pro 83 SLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQN 162 (419) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~ 162 (419) .......+..+ ....+.+....... .....+.+++++|+.+.+.|++.+...++|+++|+++| T Consensus 60 ----------~~~~~~~~~~~-~l~~~~r~~~~~~~------~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~ 122 (390) T protein:vir:40 60 ----------DNNVLASRGAN-ALTSDESKYYNEVI------AGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVN 122 (390) T ss_pred ----------HHHHHHhcCch-hccHHHHHHHHHHH------hccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeee Confidence 00000000000 00001111111111 11234567889999999999999999999999999999 Q ss_pred ccCcceeeeeeccccceeccccccceeecCcccccc-cccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHHHHHH Q lcl|Aclame:pro 163 ADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ-STLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQGRLTY 240 (419) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~~l~~ 240 (419) ++++...+|+.++ .+.+.|++|++.+++ ++++|+++++++++++++++||+||++|++ ++++||+++|++ T Consensus 123 ~~~~~~~i~~~~~--------~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ 194 (390) T protein:vir:40 123 TTATTEWIISVGD--------VATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGE 194 (390) T ss_pred cCCceeEEEEEcC--------CcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHH Confidence 9998888887653 357899999988775 689999999999999999999999999986 899999999999 Q ss_pred HHHHHHHHHHHhccCcccccceeccccccccccccccccc--h---hhhHHHHHHHHHHhhhhhccCCcEEEEehHHHH- Q lcl|Aclame:pro 241 GLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPA--T---DEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWE- 314 (419) Q Consensus 241 a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~--~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 314 (419) +++.++|++||+|+|+++|.||++..+..+.......... + ....+..+..++.........+++|+||+.++. T Consensus 195 ~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~ 274 (390) T protein:vir:40 195 AMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWS 274 (390) T ss_pred HHHHHHHhhhhcccCCCccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHH Confidence 9999999999999999999999986654433222221111 1 111222223332222223445778999998853 Q ss_pred ---HHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 315 ---SIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 315 ---~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) .+++++|++|+|++. ..++|+||+++++||++++++|||++ |++++|.+++++++++. +|.+|++ T Consensus 275 ~l~~~~~~~d~~G~~v~~---------~~~~g~pvv~~~~~p~~~i~~Gd~s~-~~i~~~~~~~v~~~~~~--~f~~~~~ 342 (390) T protein:vir:40 275 KIYAATSYMTPQGVWVTG---------ILPVPLEIVQSVAVPVGKAVAGRAKD-YFMGIGSEQVIRTSTEY--RLLDDET 342 (390) T ss_pred HHHHHhhccCCCCccccc---------cCCCceeEEEcCCCCCCcEEEEeece-EEEEeecceEEEecchh--hhhcCcE Confidence 445789999987541 23579999999999999999999997 67788999999988754 6999999 Q ss_pred EEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .||++.|+|+++++|+||+++++++.-. T Consensus 343 ~~r~~~r~dg~v~~~~A~~~l~~~~~~~ 370 (390) T protein:vir:40 343 LYYAKQYANGRPKDNSSFLVFDITGLEG 370 (390) T ss_pred EEEEEEEeCCEEecccceEEEEeeccCC Confidence 9999999999999999999999888733 No 60 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=5.7e-56 Score=323.47 Aligned_cols=396 Identities=14% Similarity=0.071 Sum_probs=263.2 Q ss_pred CCc---cHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 1 MPP---TPTLEEQRAALLARLDDTSLTT-EQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPA 76 (419) Q Consensus 1 M~~---~~~L~e~~~~l~~~~~~~~~~~-~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 76 (419) |+. ++.|+++++++.++++++.... ++.+.+.++..+.++.+++++++++.++++++...........+....... T Consensus 193 ~~~~e~i~~l~~~ra~~~~~~~~l~~~a~~~g~~l~aee~~~~d~l~aei~~l~~~i~r~e~~e~~~a~~a~pv~~~~~~ 272 (645) T protein:vir:93 193 MNIGEQIKSFENKRAALAASLEEVMTKAAEEGRTLDVEEEEHYDNTAAEIRQVDAHLKRLRELEAGKAATAQPVKQAGNG 272 (645) T ss_pred cchhhhhhhhhHHHHHHHHHhhhhhhhHhhhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 443 4667777777777766654433 344566777788899999999999999988877644433322221111000 Q ss_pred ---ccchhhhh--hHHHHhHHH----HHHHHH-hhhhhhhhHHHHH-----HHHHHhhhc---ccccccccCCcccccch Q lcl|Aclame:pro 77 ---EAGTFRSL--AQRFADSDG----LREYRA-RDKRGQFQVEMRD-----IDPNRLLSR---DAPAGTITNPNVPHLPQ 138 (419) Q Consensus 77 ---~~~~~~~~--~~~~~~~~~----~~~~~~-~~~~~~~~~~~~~-----~~~~~~~~~---~~~~~~~~~~~~~~~p~ 138 (419) ........ ......... .+.+.. +..........+. ......... ..........|++++|+ T Consensus 273 ~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~ 352 (645) T protein:vir:93 273 NVAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQ 352 (645) T ss_pred ccccccccccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccCch Confidence 00000000 000000000 011100 0000000000000 000000011 11112223457889999 Q ss_pred hhhHHHHHhhhhhhhHHhhcceeccc----CcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEE Q lcl|Aclame:pro 139 LVPGIVPTTPDLPLLVADLLDQQNAD----YNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVA 214 (419) Q Consensus 139 ~~~~~i~~~~~~~~~l~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~ 214 (419) .+...|++.++..+++++++....++ ...+++|+.++ +..++||+||+.+|+++++|++++++++|++ T Consensus 353 ~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~--------~~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla 424 (645) T protein:vir:93 353 EYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVS--------GGAAGWVGEGKTKPLTKFDFESITFSHAKVS 424 (645) T ss_pred hhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeec--------CcceEEeccCccccccccceeEEEEeeEEEE Confidence 99999999999999999886654332 12466776653 4578999999999999999999999999999 Q ss_pred EeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcc----cccceeccccccccccccccccchhhhHHHHH Q lcl|Aclame:pro 215 HWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGST----EMQGILTTPGIGTYQQPKPTAPATDEPPLVDI 289 (419) Q Consensus 215 ~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~----~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~ 289 (419) +++++|+||++|+ +++++||+++|++++++++|.+||+|+|++ .|.|+++.. .. ... ....+.++ T Consensus 425 ~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~--~~-----~~~---~~~~~~d~ 494 (645) T protein:vir:93 425 AIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDV--KG-----TAS---SGNPDADA 494 (645) T ss_pred EeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccc--cc-----ccc---ccchHHHH Confidence 9999999999986 699999999999999999999999998764 477776421 11 111 11234566 Q ss_pred HHHHHhhhhhcc--CCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEE Q lcl|Aclame:pro 290 RRAKTVAEIAGF--PPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGAT 367 (419) Q Consensus 290 ~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~ 367 (419) ..++..+..++. .+++|+|||.++..|+++||++|+++|+.. +..+++|+|+||+++++||++ ++++||++.+ T Consensus 495 ~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~~~~---~~~~~tL~G~PV~~s~~vp~~-~~~gd~s~~~- 569 (645) T protein:vir:93 495 EAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNALGQKEYPDM---TLLGGSFQGLPVIVSQYVGDQ-LVLVNAPDIY- 569 (645) T ss_pred HHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccCCceeecCC---CCCCceeeceeeEEeccCCcc-eeEeccccEE- Confidence 667666655433 456899999999999999999999876432 344579999999999999865 6789999754 Q ss_pred EEEecceEEEEeeccc--------------------chhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 368 LWSRQGITVLMTDSHA--------------------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 368 ~~~~~~~~i~~~~~~~--------------------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ++++.++.+.++++.. ++|++|+++||++.|+||+++||+||++++-..==+ T Consensus 570 ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~ 641 (645) T protein:vir:93 570 LADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGS 641 (645) T ss_pred EEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCc Confidence 5567788887765433 359999999999999999999999999987332222 No 61 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=9.1e-56 Score=322.34 Aligned_cols=370 Identities=12% Similarity=0.143 Sum_probs=246.4 Q ss_pred CC--------ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MP--------PTPTLEEQRAALLARLDDTSLTTEQVQEIVA---------EARGLADALQAESDRAAARAALLRTAPPAP 63 (419) Q Consensus 1 M~--------~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~---------e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 63 (419) |. .+++|+++..+|+++.+++....++.....+ +.++..++++.+++.+++++++++...... T Consensus 1 m~~k~~~l~~~~~el~~~l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~~l 80 (397) T protein:vir:96 1 MALKQLILNKQIKERSSEIDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQDL 80 (397) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33 3334444444444444444433333332222 222334444444444444444444433332 Q ss_pred HHHHhhcccccccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHh--hhcccccccccCCcccccchhhh Q lcl|Aclame:pro 64 KGPADGGTPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRL--LSRDAPAGTITNPNVPHLPQLVP 141 (419) Q Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~p~~~~ 141 (419) ............... ...... ....+.... ......+....... .......+.....+++.+|+.+. T Consensus 81 ~~~~~~~~~~~~~~~---~~~~~~-----~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~ 149 (397) T protein:vir:96 81 EDELAKAADPTDQKP---KDGEKR-----KMKKFKVTE---EELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELL 149 (397) T ss_pred HHHHHhhhhhhhhhh---HHHHHH-----HHHHHhhhh---HHHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHH Confidence 222211111110000 000000 000000000 00000000000000 11111223445667788999999 Q ss_pred HHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccc-cccceeeEEeeeEEEEEeehhh Q lcl|Aclame:pro 142 GIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ-STLSFDTITTTLKTVAHWLPIT 220 (419) Q Consensus 142 ~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~~~vs 220 (419) ..|.++ .....++.+|++++++++++.+|.... ++..++|++|++.+|+ ++++|++|++.++++++++++| T Consensus 150 ~~i~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s 221 (397) T protein:vir:96 150 QPQLEP-KDIVDLSKYVRSVPVNSASGKFPVISK-------SGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPIS 221 (397) T ss_pred HHHHHh-hhhhhHHHhhhhccccccceeEEEEec-------cCCccccccccccccccccccccceeecHhHhhcchhhH Confidence 888874 667789999999999988888887543 2356789999999997 5799999999999999999999 Q ss_pred HHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhh Q lcl|Aclame:pro 221 RQAADDNS-QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIA 299 (419) Q Consensus 221 ~ell~d~~-~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 299 (419) +++++|+. ++++||.++++++++.++|.+|++|+|.+.|.|+. .++++.++++..... T Consensus 222 ~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~---------------------~~d~~~~~~~~~~~~ 280 (397) T protein:vir:96 222 QEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTATAKSVV---------------------GVDGLKDLINKEIKK 280 (397) T ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc---------------------chHHHHHHHHHhhhh Confidence 99999975 89999999999999999999999999988776643 256666777665544 Q ss_pred ccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcC------cEEEEeccceEEEEEecc Q lcl|Aclame:pro 300 GFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG------TALVGGFRQGATLWSRQG 373 (419) Q Consensus 300 ~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~------~~~~~d~~~~~~~~~~~~ 373 (419) ++ +++|+|||++|..|++++|++|+|+| .+++.++.+++|+|+||++++.+..+ .+++|||+++|+++++.+ T Consensus 281 ~~-~a~~v~n~~~~~~l~~lkd~~G~~~~-~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 358 (397) T protein:vir:96 281 VY-DVKLFISASMYSELDKLKDKNGRYLL-QDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQ 358 (397) T ss_pred hc-CcEEEEcHHHHHHHHHhhccCCCeEe-ccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEeecc Confidence 43 67899999999999999999999865 55777888899999999986654322 389999999999999999 Q ss_pred eEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 374 ITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 374 ~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) +++.++++.. | .+.+|++.|+|+++++|+||+++++++| T Consensus 359 ~~~~~~~~~~--~---~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 359 VSVSWVDNNI--Y---GQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred eEEEEecccc--c---ceeEEEEEEEccEEecccceEEEEeecC Confidence 9999876532 3 5679999999999999999999999999 No 62 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=6.2e-55 Score=317.77 Aligned_cols=386 Identities=12% Similarity=0.047 Sum_probs=244.1 Q ss_pred CCccHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc- Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDD----TSLTTEQVQEIVAEA---RGLADALQAESDRAAARAALLRTAPPAPKGPADGGTP- 72 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~----~~~~~~~~~~~~~e~---~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~- 72 (419) |+ +++|+++.+++++++.+ ++...++.....++. +...+++.++++++.+++++++............... T Consensus 1 Mk-i~elk~el~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~~~~ 79 (437) T protein:vir:10 1 MK-IEKLKKDLATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDDSDL 79 (437) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 98 66777665555444433 332222222222222 2222333334444333333332222111111000000 Q ss_pred --------ccccccchhhhhhHHHHh-----------HHHHHHHHHhhhhhhhhH-----HHHHHHHHHhh-hccccccc Q lcl|Aclame:pro 73 --------LTPAEAGTFRSLAQRFAD-----------SDGLREYRARDKRGQFQV-----EMRDIDPNRLL-SRDAPAGT 127 (419) Q Consensus 73 --------~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~-~~~~~~~~ 127 (419) .................. .................. ........... ........ T Consensus 80 ~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~ 159 (437) T protein:vir:10 80 VAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGI 159 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhc Confidence 000000000000000000 000000000000000000 00000000000 11112223 Q ss_pred ccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCccccccc-ccceeeE Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFDTI 206 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~v 206 (419) ....+++++|+.+...|... .....++.++++++++++.+.+|+... ....++|++|++.+|+. +++|++| T Consensus 160 ~~~~~g~lvp~~~~~~i~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~e~~~~~e~~~~~~~~v 231 (437) T protein:vir:10 160 ALKDGKVIIPETILTPEKEV-HQFPRLGSLVRTESVTTTTGKLPIFNN-------STDLLTAHTEYGQTTKNATPVITPI 231 (437) T ss_pred ccccccccchHHHHHHHHHh-hhhhhhhhcceeEeeccCceeeEEeec-------cccccccccccccccccccccceee Confidence 45567788999998888665 567789999999999988888887653 23568899999999974 5899999 Q ss_pred EeeeEEEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhH Q lcl|Aclame:pro 207 TTTLKTVAHWLPITRQAADDNS-QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPP 285 (419) Q Consensus 207 ~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 285 (419) ++.+++++++++||+|+++|+. +|++||.++|+++++.++|.+|++|+|++.|.+.. ... T Consensus 232 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~-------------------~~~ 292 (437) T protein:vir:10 232 LWDLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKKTTS-------------------TYL 292 (437) T ss_pred eeehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc-------------------ccc Confidence 9999999999999999999975 89999999999999999999999999987654321 112 Q ss_pred HHHHHHHHH-hhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCC--CcC-----cE Q lcl|Aclame:pro 286 LVDIRRAKT-VAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAI--AQG-----TA 357 (419) Q Consensus 286 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~--~~~-----~~ 357 (419) ++++.+++. .+...|..+++|+|||+++..|++++|++|+|+| .+++.++.+++|+|+||++++++ |.+ .+ T Consensus 293 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~-~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~ 371 (437) T protein:vir:10 293 LGDLKKVLNVTLKPQDSAAASIVMSQSAYNLFDMATDAMGRPLL-QPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNI 371 (437) T ss_pred hhhHHHHHHhhhhhhhhcCCEEEEcHHHHHHHHHhhccCCCeee-ccCccCCCCcccccceeEEecccccCCcCCCceEE Confidence 345555553 6788888889999999999999999999999865 56777788899999999997764 432 28 Q ss_pred EEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEe--cCCCC Q lcl|Aclame:pro 358 LVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTF--AAATT 419 (419) Q Consensus 358 ~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~--~aa~~ 419 (419) ++|||+++|.+++|.++++.+++. |..+.+.+++..|+|+++++|+||+.++. ++.++ T Consensus 372 ~~gd~~~~~~~~~r~~~~~~~~~~----~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~ 431 (437) T protein:vir:10 372 VVAPLKKAVINFKLTEITGQFQDT----YDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVTV 431 (437) T ss_pred EEeeccccEEEEeeeceEEEEecc----cccccceeeEEEEEccEEecccceEEEEeecccccc Confidence 999999999999999999987654 44556688999999999999999999873 34443 No 63 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=9.9e-56 Score=322.13 Aligned_cols=279 Identities=14% Similarity=0.079 Sum_probs=233.6 Q ss_pred cccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCccccccccccee Q lcl|Aclame:pro 125 AGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFD 204 (419) Q Consensus 125 ~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 204 (419) ...++..++.++|+.+...|++.++..+.++++|+++|+.++.+++|+.++ +..++||+|++.+|+++++|+ T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~--------~~~a~wv~Eg~~~~~s~~~f~ 72 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDF--------DSDIDIVAENGKKTHGGVSLD 72 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEec--------CcceEEeeCCcccccccccce Confidence 223344455677888899999999999999999999999988899998653 357899999999999999999 Q ss_pred eEEeeeEEEEEeehhhHHHHh---h-HHHHHHHHHHHHHHHHHHHHHHHHHhccCcc--c---ccceecccccccccccc Q lcl|Aclame:pro 205 TITTTLKTVAHWLPITRQAAD---D-NSQLMGYIQGRLTYGLRFLRDRQLLNGNGST--E---MQGILTTPGIGTYQQPK 275 (419) Q Consensus 205 ~v~~~~~k~~~~~~vs~ell~---d-~~~~~~~i~~~l~~a~~~~~d~~il~G~g~~--~---p~Gi~~~~~~~~~~~~~ 275 (419) +++++++|++++++||+||++ | ..+++++|.+++++++++++|.++|+|++.+ . +.|.....+. .. T Consensus 73 ~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~-----~~ 147 (300) T protein:vir:95 73 PVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKK-----VT 147 (300) T ss_pred eeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccc-----cc Confidence 999999999999999999984 3 3689999999999999999999999996543 2 2333322221 11 Q ss_pred ccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcC Q lcl|Aclame:pro 276 PTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG 355 (419) Q Consensus 276 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~ 355 (419) .....+....++++.+++..+...++++++|+|||+++.+|+++||++|+++| +....++.+++|+|+||++++.+|.+ T Consensus 148 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~-~~~~~~~~~~~l~G~Pv~~s~~v~~~ 226 (300) T protein:vir:95 148 QTVPFKDTNPDESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLY-PELAWGGVPDAINGLAVDKNRTVSYS 226 (300) T ss_pred eeecccccchHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeec-cCccccCCCceecceeeEEecCCCCC Confidence 22233345668899999999999999999999999999999999999999876 55666778899999999999999864 Q ss_pred c------EEEEeccceEEEEEecceEEEEeeccc------chhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 356 T------ALVGGFRQGATLWSRQGITVLMTDSHA------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 356 ~------~~~~d~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) . +++|||++++.+..|++++++++++.. ++|++|++.||++.|+|+++++|+||++++-++- T Consensus 227 ~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 227 QTDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred CCCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 3 788999998878889999999987543 3699999999999999999999999999999999 No 64 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=1.5e-55 Score=321.22 Aligned_cols=295 Identities=14% Similarity=0.093 Sum_probs=239.8 Q ss_pred HHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcc Q lcl|Aclame:pro 115 PNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGT 194 (419) Q Consensus 115 ~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~ 194 (419) ......+......+...+++++| .+.+.|++.+++.++++++++++++.++.++||+.++ ...+.|++|++ T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~--------~~~a~~v~Eg~ 71 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTP-EQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTG--------AVSASWTGEAE 71 (330) T ss_pred CcccccchhhccccCCCcceech-hHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcC--------CcceeEecCCC Confidence 00001111222223344555555 5566788889999999999999999998899998764 34789999999 Q ss_pred cccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcc-cccceecccccccc- Q lcl|Aclame:pro 195 AKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGST-EMQGILTTPGIGTY- 271 (419) Q Consensus 195 ~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~- 271 (419) .+|+++++|++++++++|++++++||+|+++|+ .+++++|.++|++++++++|++||+|+|++ +|.|+++....... T Consensus 72 ~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~ 151 (330) T protein:vir:77 72 RKPITKGSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSL 151 (330) T ss_pred ccccccceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCcccccccccccccee Confidence 999999999999999999999999999999987 599999999999999999999999999986 56788875432222 Q ss_pred -ccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCc----cccCCCccccccee Q lcl|Aclame:pro 272 -QQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIAN----VQGEATPRIWGLNV 346 (419) Q Consensus 272 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~----~~~~~~~~l~G~pv 346 (419) ............+.++++.+++..+...+..+++|+||++++..|+++||.+|+++|.... +....+++|+|+|| T Consensus 152 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV 231 (330) T protein:vir:77 152 ADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPT 231 (330) T ss_pred ecccccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceee Confidence 2223334455667789999999999999999999999999999999999999998775432 22235579999999 Q ss_pred EecCCCCcCc------EEEEeccceEEEEEecceEEEEeeccc----------------chhhcCcEEEEEEEEeccEEe Q lcl|Aclame:pro 347 VSTVAIAQGT------ALVGGFRQGATLWSRQGITVLMTDSHA----------------DFFTANTLVILAEFRANLAVY 404 (419) Q Consensus 347 ~~~~~~~~~~------~~~~d~~~~~~~~~~~~~~i~~~~~~~----------------~~~~~~~~~~r~~~r~d~~~~ 404 (419) +++++||++. +++|||+++ +++++.+++++++++.. +.|++|++.||++.|+|++++ T Consensus 232 ~~~~~~p~~~~~~~~~~~~gd~s~~-~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 310 (330) T protein:vir:77 232 YVADNVVNGTVGNRVVGVMGDFSQV-IWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVN 310 (330) T ss_pred EEeccccCCCCCCccEEEEEecceE-EEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEe Confidence 9999999764 899999985 57789999999877542 459999999999999999999 Q ss_pred cccceEEEEecCCCC Q lcl|Aclame:pro 405 QPKAFVRVTFAAATT 419 (419) Q Consensus 405 ~~~a~~~~~~~aa~~ 419 (419) +|+||++++.+++.+ T Consensus 311 ~~~a~~~i~~~~~~~ 325 (330) T protein:vir:77 311 DKDAFVKLTDQVAGT 325 (330) T ss_pred cccceEEEEeccCCc Confidence 999999999999888 No 65 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=1.6e-55 Score=321.03 Aligned_cols=282 Identities=15% Similarity=0.140 Sum_probs=242.3 Q ss_pred hhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccc Q lcl|Aclame:pro 119 LSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ 198 (419) Q Consensus 119 ~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~ 198 (419) .........+++.++.++|+.+.+.|++.+.+.++++++|+++|++++...+|+.+ ...++||+|++++|+ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~---------~~~a~~v~E~~~~~~ 71 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMS---------GVGAFWVDEAERIQT 71 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEc---------CCceeeeecCccccc Confidence 23333344455566778999999999999999999999999999999888888643 346889999999999 Q ss_pred cccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccc Q lcl|Aclame:pro 199 STLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPT 277 (419) Q Consensus 199 ~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~ 277 (419) ++++|+++++.++|++++++||+|+++|+ .+++++|.++|++++++++|.+||+|+|+++|.|+++...... . T Consensus 72 ~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~------~ 145 (299) T protein:vir:41 72 SKPTFTKAKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDAS------N 145 (299) T ss_pred cccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccc------e Confidence 99999999999999999999999999987 5899999999999999999999999999999999987533221 1 Q ss_pred ccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCc- Q lcl|Aclame:pro 278 APATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT- 356 (419) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~- 356 (419) ........++++.+++..+...++.+++|+|||+++.+|++++|.+|++++. +.+.. ..++|+|+||+++++||.+. T Consensus 146 ~~~~~~~~~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~-~~~~~-~~~~l~G~PV~~~~~~~~~~~ 223 (299) T protein:vir:41 146 LVEETANKYDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFN-TATSN-GVDDVLGLPIAYTPKYTFGDK 223 (299) T ss_pred eeccccccHHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeec-CCcCC-CCceecceeeEEecccCCCCC Confidence 1223345689999999999999999999999999999999999999998764 33333 34689999999999999876 Q ss_pred ---EEEEeccceEEEEEecceEEEEeeccc------------chhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|Aclame:pro 357 ---ALVGGFRQGATLWSRQGITVLMTDSHA------------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 357 ---~~~~d~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~ 418 (419) +++|||++ ++++++++++++++++.. .+|++|++.||++.|+|+++++|+||++++.+++- T Consensus 224 ~~~~~~gdfs~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 224 DISELVGDWNQ-AYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred ceEEEEEeccc-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 99999987 557889999999987543 35899999999999999999999999999999999 No 66 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=2.7e-55 Score=319.72 Aligned_cols=280 Identities=15% Similarity=0.088 Sum_probs=234.6 Q ss_pred ccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEE Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTIT 207 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 207 (419) ++..++.++|+.+...|++.+++.+.++++|+++|+.++..++|+.++ ...++||+|++++|+++++|++++ T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~--------~~~a~~v~E~~~~~~~~~~f~~v~ 72 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM--------DSEIDVVAESGKKTHGGVTLAPQT 72 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEec--------CcceEEecCCccccccccceeEEE Confidence 445566778878888888999999999999999999988899998764 357899999999999999999999 Q ss_pred eeeEEEEEeehhhHHHHhh---H-HHHHHHHHHHHHHHHHHHHHHHHHhccC--cccccceeccccccccccccccccch Q lcl|Aclame:pro 208 TTLKTVAHWLPITRQAADD---N-SQLMGYIQGRLTYGLRFLRDRQLLNGNG--STEMQGILTTPGIGTYQQPKPTAPAT 281 (419) Q Consensus 208 ~~~~k~~~~~~vs~ell~d---~-~~~~~~i~~~l~~a~~~~~d~~il~G~g--~~~p~Gi~~~~~~~~~~~~~~~~~~~ 281 (419) +.++|++++++||+|++++ + .+|++||+++|++++++++|.++++|++ ++.+.++................... T Consensus 73 l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) T protein:vir:16 73 MVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) T ss_pred EeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccc Confidence 9999999999999999953 3 4799999999999999999999999964 34444443322222222222223333 Q ss_pred hhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcC------ Q lcl|Aclame:pro 282 DEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG------ 355 (419) Q Consensus 282 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~------ 355 (419) ....++++.+++..+...++++++|+||++++..|+++||.+|+|+| ++.+..+.+++|+|+||++++.||.+ T Consensus 153 ~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~-~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~~ 231 (298) T protein:vir:16 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALF-PELKWGATPDTINGLPVDVNKTVSDMSLTQRD 231 (298) T ss_pred cccHHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeee-cCcccCCCCceecceeeEEecccccccCCCcc Confidence 44567899999999999999999999999999999999999999865 55667778889999999999999863 Q ss_pred cEEEEeccceEEEEEecceEEEEeeccc------chhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 356 TALVGGFRQGATLWSRQGITVLMTDSHA------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAA 416 (419) Q Consensus 356 ~~~~~d~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~a 416 (419) .+++|||++++.++.+++++++++++.. ++|++|++.||+++|+|+++++|+||++++-++ T Consensus 232 ~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 232 RAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred EEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 5888999998888889999999987542 369999999999999999999999999999988 No 67 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=3.1e-55 Score=319.41 Aligned_cols=284 Identities=14% Similarity=0.052 Sum_probs=236.3 Q ss_pred ccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceee Q lcl|Aclame:pro 126 GTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDT 205 (419) Q Consensus 126 ~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~ 205 (419) -++.+.++.++|+.+...|++.++..+.++++|+++|+.++..++|+.++ +..++||+|++++|+++++|++ T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~--------~~~a~wv~E~~~~~~s~~~f~~ 72 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTL--------DSDIDVVAENGKKTHGGLSLEP 72 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEec--------CcceEEeecCccccccccceee Confidence 22345667889999999999999999999999999999998899998764 3578999999999999999999 Q ss_pred EEeeeEEEEEeehhhHHHHh---h-HHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccc-cccccccccccccc Q lcl|Aclame:pro 206 ITTTLKTVAHWLPITRQAAD---D-NSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTP-GIGTYQQPKPTAPA 280 (419) Q Consensus 206 v~~~~~k~~~~~~vs~ell~---d-~~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~-~~~~~~~~~~~~~~ 280 (419) ++++++|+++++++|+||++ | ..+++++|.+++++++++++|.++|+|+++....+..... +............+ T Consensus 73 v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 152 (303) T protein:vir:97 73 VTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFT 152 (303) T ss_pred EEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccc Confidence 99999999999999999984 3 3589999999999999999999999997653332221111 11111111222223 Q ss_pred hhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcC----- Q lcl|Aclame:pro 281 TDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG----- 355 (419) Q Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~----- 355 (419) +....++++.+++..+...++.++.|+|||+++.+|+++||++|++++.+....++.+++|+|+||+++++||.+ T Consensus 153 ~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~ 232 (303) T protein:vir:97 153 ESEDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLKSSVNTTVGAGADEAE 232 (303) T ss_pred cccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecceeeEEecccCCccccCC Confidence 344568999999999998899999999999999999999999999887665555667789999999999999853 Q ss_pred ---cEEEEeccceEEEEEecceEEEEeeccc------chhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 356 ---TALVGGFRQGATLWSRQGITVLMTDSHA------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 356 ---~~~~~d~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) .+++|||+..+.++.|.+++++++++.. ++|++|++.||++.|+|+++++|+||++++-+.. T Consensus 233 ~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 233 SKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred CccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 3899999988888899999999887542 3699999999999999999999999999999999 No 68 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=3.6e-55 Score=319.09 Aligned_cols=285 Identities=13% Similarity=0.096 Sum_probs=239.7 Q ss_pred HHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCccc Q lcl|Aclame:pro 116 NRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTA 195 (419) Q Consensus 116 ~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~ 195 (419) .+........-..++.++.++|+.+.+.|++.+...++++++|+++|++++.+++|+.++ ...+.|++|++. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~--------~~~a~~v~E~~~ 72 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAK--------GVGAYWVSETER 72 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeC--------CcceEEeecCcc Confidence 111111222223455667889999999999999999999999999999998899998763 347899999999 Q ss_pred ccccccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccc Q lcl|Aclame:pro 196 KPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQP 274 (419) Q Consensus 196 ~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~ 274 (419) +|+++++|++++++++|++++++||+|+++|+. ++++||+++|++++++++|.+||+|+|+++|.|+......... .. T Consensus 73 ~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~-~~ 151 (304) T protein:vir:10 73 IQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGA-EE 151 (304) T ss_pred cccccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccc-cc Confidence 999999999999999999999999999999875 8999999999999999999999999999988887654333222 22 Q ss_pred cccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCc Q lcl|Aclame:pro 275 KPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQ 354 (419) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~ 354 (419) ...........++++.+++..+...+..+++|+|||+++..|++++|++|+|+|. ..+++|+|+||+++++||. T Consensus 152 ~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~------~~~~~l~G~PV~~~~~~~~ 225 (304) T protein:vir:10 152 KGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFD------ANGNEIMGLPLSYTGADVY 225 (304) T ss_pred cccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeec------CCCccccceeeEEeccccc Confidence 2233334556799999999999999999999999999999999999999998763 2346899999999999985 Q ss_pred ----CcEEEEeccceEEEEEecceEEEEeeccc--------------chhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 355 ----GTALVGGFRQGATLWSRQGITVLMTDSHA--------------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAA 416 (419) Q Consensus 355 ----~~~~~~d~~~~~~~~~~~~~~i~~~~~~~--------------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~a 416 (419) +.+++|||++ ++++++++++++++++.. ++|++|++.||++.|+|+++++|+||++++.+- T Consensus 226 ~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 226 DKKKSLALMGDWDY-ARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CCCCcEEEEEehhh-EEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 4599999997 556789999999887643 469999999999999999999999999999999 No 69 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=3.6e-55 Score=319.09 Aligned_cols=285 Identities=13% Similarity=0.096 Sum_probs=239.7 Q ss_pred HHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCccc Q lcl|Aclame:pro 116 NRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTA 195 (419) Q Consensus 116 ~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~ 195 (419) .+........-..++.++.++|+.+.+.|++.+...++++++|+++|++++.+++|+.++ ...+.|++|++. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~--------~~~a~~v~E~~~ 72 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAK--------GVGAYWVSETER 72 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeC--------CcceEEeecCcc Confidence 111111222223455667889999999999999999999999999999998899998763 347899999999 Q ss_pred ccccccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccc Q lcl|Aclame:pro 196 KPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQP 274 (419) Q Consensus 196 ~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~ 274 (419) +|+++++|++++++++|++++++||+|+++|+. ++++||+++|++++++++|.+||+|+|+++|.|+......... .. T Consensus 73 ~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~-~~ 151 (304) T protein:vir:94 73 IQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGA-EE 151 (304) T ss_pred cccccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccc-cc Confidence 999999999999999999999999999999875 8999999999999999999999999999988887654333222 22 Q ss_pred cccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCc Q lcl|Aclame:pro 275 KPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQ 354 (419) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~ 354 (419) ...........++++.+++..+...+..+++|+|||+++..|++++|++|+|+|. ..+++|+|+||+++++||. T Consensus 152 ~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~------~~~~~l~G~PV~~~~~~~~ 225 (304) T protein:vir:94 152 KGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFD------ANGNEIMGLPLSYTGADVY 225 (304) T ss_pred cccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeec------CCCccccceeeEEeccccc Confidence 2233334556799999999999999999999999999999999999999998763 2346899999999999985 Q ss_pred ----CcEEEEeccceEEEEEecceEEEEeeccc--------------chhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 355 ----GTALVGGFRQGATLWSRQGITVLMTDSHA--------------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAA 416 (419) Q Consensus 355 ----~~~~~~d~~~~~~~~~~~~~~i~~~~~~~--------------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~a 416 (419) +.+++|||++ ++++++++++++++++.. ++|++|++.||++.|+|+++++|+||++++.+- T Consensus 226 ~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 226 DKKKSLALMGDWDY-ARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CCCCcEEEEEehhh-EEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 4599999997 556789999999887643 469999999999999999999999999999999 No 70 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=4e-54 Score=313.32 Aligned_cols=346 Identities=14% Similarity=0.075 Sum_probs=235.2 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |..+.+|++..+.++++.+.+... ++.+ +............. ....... T Consensus 1 ~eei~~l~~~~~~l~~~~~~l~~~---------------------~d~~-------e~e~~~~~~~~~~~---~~~~~~~ 49 (352) T protein:vir:78 1 MEDIKQLETEKAGLQQRFNIVERQ---------------------VQDI-------EEKEKAKVKDKGEA---YQSLNDN 49 (352) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHH---------------------HHHH-------HHHHHHHhhhcccc---ccccchh Confidence 554444444444444333322211 1111 11111000000000 0000011 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) .+.. ... .+..+...... ..... ........+... .+..+.|++++|+.+.+.|++.+...++||++|++ T Consensus 50 ~~~~-~~~--~~~~r~~~~~~---~~~~~---~~~~~~~~~al~-~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v 119 (352) T protein:vir:78 50 EKLV-KAK--AEFYRHAILPN---EFEKP---SMEAQRLLHALP-TGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARL 119 (352) T ss_pred hhHH-HHH--HHHHHHHhhhh---HHHHH---HhhHHHHHHHhc-cCCCCCCceeccHhHHHHHHHHHHhhcchhhheee Confidence 1100 000 11111111110 00000 000001111111 22346678899999999999999999999999999 Q ss_pred ecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLT 239 (419) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~ 239 (419) .++++. .+|+... ....+.||+|++.+|+++++|++|++.++|++++++||+||++|+ .++++||.++|+ T Consensus 120 ~~~~~~--~~p~~~~-------~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la 190 (352) T protein:vir:78 120 TNIKGL--EIPRVSY-------TLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQ 190 (352) T ss_pred EecCCc--eEEEEec-------CCCcccccccccccccccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHH Confidence 988653 4555432 234689999999999999999999999999999999999999997 599999999999 Q ss_pred HHHHHHHHH-HHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHH Q lcl|Aclame:pro 240 YGLRFLRDR-QLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIEL 318 (419) Q Consensus 240 ~a~~~~~d~-~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 318 (419) ++++++++. .|.+|+|+++|.|+++..++..+ +....+++++++++.+...|+.++.|+||+.++..|++ T Consensus 191 ~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~~---------t~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~ 261 (352) T protein:vir:78 191 SGLAAKERKDALAVSPKSGLEHMSFYNGSVKEV---------EGANMYDAIINALADLHEDYRDNATIYMRYADYVKIIS 261 (352) T ss_pred HHHHHHHHHhhhhcCCCCcccccceeccccccc---------cccchHHHHHHHHhccChhhhcCCEEEEehHHHHHHHH Confidence 999998655 56688999999999876554332 22234889999999999999999999999999999999 Q ss_pred HhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEE Q lcl|Aclame:pro 319 DQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFR 398 (419) Q Consensus 319 ~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r 398 (419) +++.+|++++. +.+.+|+|+||++++.++ .+++|||++++... .++.++...+ ..++++.|++..| T Consensus 262 ~~~~~~~~~~~------~~~~~llG~PV~~~~~~~--~~~~Gdf~~~~~~~--~~~~~~~~~~----~~~g~~~f~~~~r 327 (352) T protein:vir:78 262 VLSNGTTNFFD------TPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINY--DGTTYDTDKD----VKKGEYLFVLTAW 327 (352) T ss_pred HHhccCCcccc------cCCccccccceEEecCCC--ceeEeehhhhhhhh--hhheeeeecc----ccCCeeEEEEEee Confidence 99888888763 345689999999999865 47999999866543 4555554433 3478999999999 Q ss_pred eccEEecccceEEEEecCCCC Q lcl|Aclame:pro 399 ANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 399 ~d~~~~~~~a~~~~~~~aa~~ 419 (419) +|+++++|+||+.++++|+++ T Consensus 328 ~Dg~~~~~eA~~~l~~~a~~~ 348 (352) T protein:vir:78 328 YDQQRTLDSAFRIAKAKESTG 348 (352) T ss_pred eCceeechhheEEEEeecccC Confidence 999999999999999999999 No 71 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=1.6e-54 Score=315.45 Aligned_cols=280 Identities=15% Similarity=0.087 Sum_probs=233.1 Q ss_pred ccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEE Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTIT 207 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 207 (419) ++..++.++|+.+...|++.++..++++++|+++++.++.+++|+.++ ...++||+|++++|+++++|++++ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~--------~~~a~~v~Eg~~~~~~~~~f~~v~ 72 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM--------DSEIDVVAESGKKTHGGVTLAPQT 72 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEec--------CcceEEeeCCccccccccceeEEE Confidence 555677888999999999999999999999999999988899998764 347899999999999999999999 Q ss_pred eeeEEEEEeehhhHHHHhh----HHHHHHHHHHHHHHHHHHHHHHHHHhccC--cccccceeccccccccccccccccch Q lcl|Aclame:pro 208 TTLKTVAHWLPITRQAADD----NSQLMGYIQGRLTYGLRFLRDRQLLNGNG--STEMQGILTTPGIGTYQQPKPTAPAT 281 (419) Q Consensus 208 ~~~~k~~~~~~vs~ell~d----~~~~~~~i~~~l~~a~~~~~d~~il~G~g--~~~p~Gi~~~~~~~~~~~~~~~~~~~ 281 (419) +.++|++++++||+|++++ ..+++++|+++|++++++++|.++|+|.+ ++.+.......+.............. T Consensus 73 l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) T protein:vir:94 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) T ss_pred EeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccc Confidence 9999999999999999853 24799999999999999999999999953 22221111111111111222222333 Q ss_pred hhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcC------ Q lcl|Aclame:pro 282 DEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG------ 355 (419) Q Consensus 282 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~------ 355 (419) ....++++.+++..+...+.++++|+|||+++.+|++++|.+|+|+| ++...++.+++|+|+||++++.||.+ T Consensus 153 ~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~-~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~~~ 231 (298) T protein:vir:94 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALF-PELKWGATPDTINGLPVDVNKTVSDMSLTQRD 231 (298) T ss_pred cccHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeee-cCcccCCCCceecceeeEEecccccccCCCcc Confidence 44568899999999999999999999999999999999999999865 56667788899999999999999853 Q ss_pred cEEEEeccceEEEEEecceEEEEeeccc------chhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 356 TALVGGFRQGATLWSRQGITVLMTDSHA------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAA 416 (419) Q Consensus 356 ~~~~~d~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~a 416 (419) .+++|||++++.++.++++++++.++.. ++|++|++.||++.|+|+++++|+||++++-++ T Consensus 232 ~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 232 RAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred EEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 5888999998888889999999877542 369999999999999999999999999999888 No 72 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=2.3e-54 Score=314.64 Aligned_cols=341 Identities=13% Similarity=0.070 Sum_probs=237.0 Q ss_pred HHHHHHHHHHHHhhc--ccccccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHH-HHHHhhhcccccccccCCc Q lcl|Aclame:pro 56 LRTAPPAPKGPADGG--TPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDI-DPNRLLSRDAPAGTITNPN 132 (419) Q Consensus 56 l~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 132 (419) |.+... .+.... ...........+..+..+.+. .+.... .++......+.. ............+.++..| T Consensus 1 ~a~~~a---~~~~~~~~~~~~~~~~~~~~~kg~~~~~~--~~a~a~--~~g~~~~a~~~a~~~~~~~~~~~a~~~~~~~G 73 (366) T protein:vir:57 1 MAAAVA---VPVKAHSVAPGIIIKEELQQYKGAGMTRM--VMSIAA--GKGNLADAAKFAATELGDTGLSMAISTAAGSG 73 (366) T ss_pred Cccccc---ccccccccccccccccccccccchhHHHH--HHHHHh--cccchhHHHHHHHHhhcchhhhhhccccccCC Confidence 110000 000000 000000000000001111110 000000 011110000000 0000000111122334567 Q ss_pred ccccchhhhHHHHHhhhhhhhHHhh-cceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeE Q lcl|Aclame:pro 133 VPHLPQLVPGIVPTTPDLPLLVADL-LDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLK 211 (419) Q Consensus 133 ~~~~p~~~~~~i~~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~ 211 (419) ++++|+.+.+.|++.++..++++.+ ++++|+.++.+++|+.++ +..++||+|++.+|+++++|+++++.++ T Consensus 74 g~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~--------~~~a~wv~E~~~~~~s~~~f~~i~~~~~ 145 (366) T protein:vir:57 74 GALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSG--------GATAGYVGEGKDVVATGATFDDVKLSAK 145 (366) T ss_pred ccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeC--------CcceeeeccCccccccccceeEEEEeeE Confidence 8889999999999999999999998 899999888899999764 3578999999999999999999999999 Q ss_pred EEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcc-cccceeccccccccccccccccch--hhhHHH Q lcl|Aclame:pro 212 TVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPAT--DEPPLV 287 (419) Q Consensus 212 k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~--~~~~~~ 287 (419) |++++++||+||++|+ ++++++|+++|++++++++|++||+|+|++ +|+||++..+..........+..+ ..+.+. T Consensus 146 k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~~~~~ 225 (366) T protein:vir:57 146 TMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLTTIDEYL 225 (366) T ss_pred EEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccchhhHHHHH Confidence 9999999999999987 599999999999999999999999999975 899999877655443332222221 222233 Q ss_pred HHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcC--------cEEE Q lcl|Aclame:pro 288 DIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG--------TALV 359 (419) Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~--------~~~~ 359 (419) +++.........+..++.|+||+.++..|++++|++|+++|. +..+++|+|+||+++++||.+ .+++ T Consensus 226 ~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~~-----~~~~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~ 300 (366) T protein:vir:57 226 DSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRDGNGNKVYP-----EMSQGILKGYPIQRTSAIPANLGDDGNESEIYF 300 (366) T ss_pred HHHHHhhhccccccccCEEEecHHHHHHHHhhhccCCceecc-----CCCCCeecceeeEEccccccccccCCCccEEEE Confidence 444555555667778899999999999999999999998763 233568999999999999863 4899 Q ss_pred EeccceEEEEEecceEEEEeeccc---------chhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 360 GGFRQGATLWSRQGITVLMTDSHA---------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 360 ~d~~~~~~~~~~~~~~i~~~~~~~---------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) |||++ +++.++.+++++++++.. ..|++|++++|++.|+||+++||+||++++-..= T Consensus 301 gdfs~-~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 301 CDFND-VVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred Eecce-EEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 99997 557889999999887642 4689999999999999999999999999976555 No 73 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=1e-53 Score=311.14 Aligned_cols=301 Identities=13% Similarity=0.071 Sum_probs=239.8 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcc Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLD 159 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~ 159 (419) .+ +........+.|.....+. . ..+ ......+..++.++|+.+.+.|++.+++.++++++|+ T Consensus 1 ~~----~~~~~~~~~~~f~~~~~~~------------~-~~~-a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~ 62 (324) T protein:vir:97 1 ME----QTQKLKLNLQHFASNNVKP------------Q-VFN-PDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGK 62 (324) T ss_pred Cc----cchhHHHHHHHHHHhhhhh------------h-hhc-cccccccCCCcceechhHHHHHHHHHHhhcchhhhcc Confidence 00 0000111111221111111 0 011 1112234556778999999999999999999999999 Q ss_pred eecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHH Q lcl|Aclame:pro 160 QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRL 238 (419) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l 238 (419) ++|++++.+++|+.++ ...+.|++|++.+|+++++|+++++.++|++++++||+|+++|+ .+++++|.++| T Consensus 63 ~~~~~~~~~~ip~~~~--------~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l 134 (324) T protein:vir:97 63 YEPMEGTEKKFTFWAD--------KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMI 134 (324) T ss_pred eeeccCCceEEEEEec--------CcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHH Confidence 9999998899998763 35789999999999999999999999999999999999999997 59999999999 Q ss_pred HHHHHHHHHHHHHhccCcc-cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHH Q lcl|Aclame:pro 239 TYGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIE 317 (419) Q Consensus 239 ~~a~~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 317 (419) ++++++++|++||+|+|++ .|.|+++...... ....+...++++.+++..+...++.+++|+|||.++..|+ T Consensus 135 ~~aia~~~d~a~l~G~g~~~~~~gi~~~~~~~~-------~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~ 207 (324) T protein:vir:97 135 AEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTN-------KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLR 207 (324) T ss_pred HHHHHHHHHHHhhccCCCCccCccccccccccc-------eeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHH Confidence 9999999999999999986 5778776432211 1223345689999999999999999999999999999999 Q ss_pred HHhccCCceeccCCccccCCCcccccceeEecCCCC--cCcEEEEeccceEEEEEecceEEEEeeccc------------ Q lcl|Aclame:pro 318 LDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIA--QGTALVGGFRQGATLWSRQGITVLMTDSHA------------ 383 (419) Q Consensus 318 ~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~--~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~------------ 383 (419) +++|++|++++. .+.+++|+|+||++++.++ .+.+++|||+++ +++++++++++++++.. T Consensus 208 ~lkd~~g~~~~~-----~~~~~tl~G~PV~~~~~~~~~~~~~~~gd~~~~-~i~~~~~~~i~~~~~~~~~~~~~~~~~~~ 281 (324) T protein:vir:97 208 KIVDPETKERIY-----DRNSDTLDGLPVVNLKSSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPV 281 (324) T ss_pred HhhcCCCceeec-----CCCCccccceeeEeecCCCCCcceEEEEecccE-EEEEecCcEEEEeecccccccccccccch Confidence 999999998764 2446789999999988754 567999999975 56778999999987643 Q ss_pred chhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 384 DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 384 ~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ++|++|++.||++.|+|+++++|+||++++.+.+.+ T Consensus 282 ~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 317 (324) T protein:vir:97 282 NLFEQDMVALRATMHVALHIADDKAFAKLVPADKKT 317 (324) T ss_pred hhhhcCcEEEEEEEEeccEEecccceEEEEeccCCC Confidence 469999999999999999999999999999998877 No 74 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=1e-53 Score=311.11 Aligned_cols=308 Identities=18% Similarity=0.171 Sum_probs=247.9 Q ss_pred hHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceecccccc Q lcl|Aclame:pro 107 QVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNK 186 (419) Q Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 186 (419) -+.+.+....... .... +..+..++.++|+.+.+.|++.++..++|+++|+++|+.++.+++|+.+......+.++.. T Consensus 1 ~~~~~e~~~~~~~-~~~~-~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~ 78 (338) T protein:vir:78 1 MATLNELAPNTAG-SNHQ-GRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGT 78 (338) T ss_pred CcchHHhhhhhcc-cccc-cceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeecccc Confidence 1122222222222 1122 2233445568999999999999999999999999999999999999998887777777888 Q ss_pred ceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcc---cccce Q lcl|Aclame:pro 187 AAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGST---EMQGI 262 (419) Q Consensus 187 a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~---~p~Gi 262 (419) +.|++|++.+|+++++|++++++++|++++++||+|+++|+ .++++||+++|++++++++|.+||+|+|++ +|.|+ T Consensus 79 ~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi 158 (338) T protein:vir:78 79 SNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGI 158 (338) T ss_pred cccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccc Confidence 99999999999999999999999999999999999999987 599999999999999999999999999975 46777 Q ss_pred eccccccccccccccccchhhhHHHHHHHHHHhhhh-hccCCcEEEEehHHHHHHH---HHhccCCceeccCCccccCCC Q lcl|Aclame:pro 263 LTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEI-AGFPPDGVVVHPQDWESIE---LDQAPGSGVFRVIANVQGEAT 338 (419) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~l~---~~kd~~g~~~~~~~~~~~~~~ 338 (419) .+........... .........++++.+++..+.. .+...++|+|||+++..|+ +++|.+|+++| ++...++.+ T Consensus 159 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~-~~~~~~~~~ 236 (338) T protein:vir:78 159 DTNNVIVNTTNVD-YLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDP-TRINLAASA 236 (338) T ss_pred ccccccccccccc-cccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceee-cccccCCCC Confidence 6654433322222 2223344568888888777654 3456678999999988875 46789999765 556677788 Q ss_pred cccccceeEecCCCCc---------CcEEEEeccceEEEEEecceEEEEeeccc------------chhhcCcEEEEEEE Q lcl|Aclame:pro 339 PRIWGLNVVSTVAIAQ---------GTALVGGFRQGATLWSRQGITVLMTDSHA------------DFFTANTLVILAEF 397 (419) Q Consensus 339 ~~l~G~pv~~~~~~~~---------~~~~~~d~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~ 397 (419) ++|+|+||+++++||. ..+++|||++ ++++++++++++++++.. ++|++|++.||++. T Consensus 237 ~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~ 315 (338) T protein:vir:78 237 GDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQ-LKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEV 315 (338) T ss_pred ceeeeeeEEEccccCccccccCCcccEEEEEecce-EEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEE Confidence 9999999999999985 2478999987 667789999999987642 56999999999999 Q ss_pred EeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 398 RANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 398 r~d~~~~~~~a~~~~~~~aa~~ 419 (419) |+|++++||+||++++-++++. T Consensus 316 r~d~~v~~~~a~~~l~~~~~~~ 337 (338) T protein:vir:78 316 TFGWLLGDKQAFVKFVDDEDPD 337 (338) T ss_pred EeccEeecccceEEEecccCCC Confidence 9999999999999999999999 No 75 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=1.1e-53 Score=311.02 Aligned_cols=307 Identities=19% Similarity=0.184 Sum_probs=247.6 Q ss_pred hHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceecccccc Q lcl|Aclame:pro 107 QVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNK 186 (419) Q Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 186 (419) -..+++....... ....+..++.++.++|+.+.+.|++.++..++++++|++++++++..++|+.++.....+..++. T Consensus 1 ~a~l~el~~~~~~--~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg~ 78 (333) T protein:vir:78 1 MATLNELLPNSAG--SNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGVGT 78 (333) T ss_pred CchhHHhhhhccc--ccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEeecCcc Confidence 1112222211111 12223334445558899999999999999999999999999999999999999887777777788 Q ss_pred ceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCccc---ccce Q lcl|Aclame:pro 187 AAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTE---MQGI 262 (419) Q Consensus 187 a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~---p~Gi 262 (419) +.|++|++.+|+++++|+++++.++|++++++||+|+++|+ .++++||+++|++++++++|.+||+|+|+++ |.|+ T Consensus 79 ~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~ 158 (333) T protein:vir:78 79 SNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGI 158 (333) T ss_pred cccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcccccc Confidence 88999999999999999999999999999999999999986 5899999999999999999999999999864 5566 Q ss_pred eccccccccccccccccchhhhHHHHHHHHHHhhhhh-ccCCcEEEEehHHHHHHHH---HhccCCceeccCCccccCCC Q lcl|Aclame:pro 263 LTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIA-GFPPDGVVVHPQDWESIEL---DQAPGSGVFRVIANVQGEAT 338 (419) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~---~kd~~g~~~~~~~~~~~~~~ 338 (419) .+..+....... ..........++++.+++..+... +..+++|+|||.++..|++ ++|.+|++++ ++....+.+ T Consensus 159 ~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~-~~~~~~~~~ 236 (333) T protein:vir:78 159 DTDNVIANTTNV-DYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDP-SRINLAAQT 236 (333) T ss_pred cccccccccccc-cccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceee-cCccccCCC Confidence 655544433222 222334455788999998887655 4456689999999988765 6788898765 566677788 Q ss_pred cccccceeEecCCCCcC---------cEEEEeccceEEEEEecceEEEEeeccc---------chhhcCcEEEEEEEEec Q lcl|Aclame:pro 339 PRIWGLNVVSTVAIAQG---------TALVGGFRQGATLWSRQGITVLMTDSHA---------DFFTANTLVILAEFRAN 400 (419) Q Consensus 339 ~~l~G~pv~~~~~~~~~---------~~~~~d~~~~~~~~~~~~~~i~~~~~~~---------~~~~~~~~~~r~~~r~d 400 (419) ++|+|+||+++++||.+ .+++|||++ ++++++++++++++++.. ++|++|++.||++.|+| T Consensus 237 ~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d 315 (333) T protein:vir:78 237 GDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQ-LKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFG 315 (333) T ss_pred ceeeceeeEEccccCCCccccCCCccEEEEEeccc-EEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEc Confidence 99999999999999865 489999998 556678999999987642 46999999999999999 Q ss_pred cEEecccceEEEEecCCC Q lcl|Aclame:pro 401 LAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 401 ~~~~~~~a~~~~~~~aa~ 418 (419) +++++|+||++++.+++| T Consensus 316 ~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 316 WLLGDKQAFVKFVDDEQP 333 (333) T ss_pred cEEecccceEEEeccCCC Confidence 999999999999999999 No 76 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=1.7e-53 Score=309.84 Aligned_cols=301 Identities=13% Similarity=0.085 Sum_probs=238.3 Q ss_pred hhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc Q lcl|Aclame:pro 84 LAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA 163 (419) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~ 163 (419) ..+.......++.|.....+ .. . ........+..++.++|+.+.+.|++.+.+.++++++++++|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~------------~~-~-~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~ 66 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVK------------PQ-V-FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM 66 (324) T ss_pred CCcchhhhHHHHHHHHHhhh------------hh-h-hccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeec Confidence 00000001111111111110 00 0 1111222345566789999999999999999999999999999 Q ss_pred cCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHH Q lcl|Aclame:pro 164 DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGL 242 (419) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~ 242 (419) .++.++||+.++ .+.++||+|++.+|+++++|+++++.++|++++++||+|+++|+ .++++||.++|++++ T Consensus 67 ~~~~~~~p~~~~--------~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai 138 (324) T protein:vir:96 67 EGTEKKFTFWAD--------KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAF 138 (324) T ss_pred cCCceEEEEEec--------CcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHH Confidence 998899998763 35789999999999999999999999999999999999999987 589999999999999 Q ss_pred HHHHHHHHHhccCcc-cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhc Q lcl|Aclame:pro 243 RFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQA 321 (419) Q Consensus 243 ~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd 321 (419) ++++|.++|+|+|++ .|.|+.+..+... ....+...++++.+++..+...+..+++|+|||+++..|++++| T Consensus 139 ~~~~d~a~l~G~g~~~~~~gi~~~~~~~~-------~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d 211 (324) T protein:vir:96 139 YKKFDEAGILNQGNNPFGKSIAQSIEKTN-------KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVD 211 (324) T ss_pred HHHHHHHHhccCCCCCcCccccccccccc-------eeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhc Confidence 999999999999976 5777765433221 12233456999999999999999999999999999999999999 Q ss_pred cCCceeccCCccccCCCcccccceeEecCCC--CcCcEEEEeccceEEEEEecceEEEEeeccc------------chhh Q lcl|Aclame:pro 322 PGSGVFRVIANVQGEATPRIWGLNVVSTVAI--AQGTALVGGFRQGATLWSRQGITVLMTDSHA------------DFFT 387 (419) Q Consensus 322 ~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~--~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~------------~~~~ 387 (419) .+|++++. ++.+++|+|+||++++.+ +.+.+++|||++ ++++++++++++++++.. .+|+ T Consensus 212 ~~G~~~~~-----~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~ 285 (324) T protein:vir:96 212 PETKERIY-----DRNSDSLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFE 285 (324) T ss_pred cCCCeeec-----CCCCCcccceeeEeeCCCCCCcceEEEEecce-EEEEEecCcEEEEeecccccccccccccchhhhh Confidence 99998754 345678999999997764 566799999997 456779999999987643 4699 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +|++.||++.|+|+++++|+||++++.+...| T Consensus 286 ~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~ 317 (324) T protein:vir:96 286 QDMVALRATMHVALHIADDKAFAKLVPADKRT 317 (324) T ss_pred cCcEEEEEEEEEccEEecccceEEEecccccC Confidence 99999999999999999999999999877666 No 77 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=1.7e-53 Score=309.84 Aligned_cols=301 Identities=13% Similarity=0.085 Sum_probs=238.3 Q ss_pred hhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc Q lcl|Aclame:pro 84 LAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA 163 (419) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~ 163 (419) ..+.......++.|.....+ .. . ........+..++.++|+.+.+.|++.+.+.++++++++++|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~------------~~-~-~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~ 66 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVK------------PQ-V-FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM 66 (324) T ss_pred CCcchhhhHHHHHHHHHhhh------------hh-h-hccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeec Confidence 00000001111111111110 00 0 1111222345566789999999999999999999999999999 Q ss_pred cCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHH Q lcl|Aclame:pro 164 DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGL 242 (419) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~ 242 (419) .++.++||+.++ .+.++||+|++.+|+++++|+++++.++|++++++||+|+++|+ .++++||.++|++++ T Consensus 67 ~~~~~~~p~~~~--------~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai 138 (324) T protein:vir:78 67 EGTEKKFTFWAD--------KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAF 138 (324) T ss_pred cCCceEEEEEec--------CcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHH Confidence 998899998763 35789999999999999999999999999999999999999987 589999999999999 Q ss_pred HHHHHHHHHhccCcc-cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhc Q lcl|Aclame:pro 243 RFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQA 321 (419) Q Consensus 243 ~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd 321 (419) ++++|.++|+|+|++ .|.|+.+..+... ....+...++++.+++..+...+..+++|+|||+++..|++++| T Consensus 139 ~~~~d~a~l~G~g~~~~~~gi~~~~~~~~-------~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d 211 (324) T protein:vir:78 139 YKKFDEAGILNQGNNPFGKSIAQSIEKTN-------KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVD 211 (324) T ss_pred HHHHHHHHhccCCCCCcCccccccccccc-------eeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhc Confidence 999999999999976 5777765433221 12233456999999999999999999999999999999999999 Q ss_pred cCCceeccCCccccCCCcccccceeEecCCC--CcCcEEEEeccceEEEEEecceEEEEeeccc------------chhh Q lcl|Aclame:pro 322 PGSGVFRVIANVQGEATPRIWGLNVVSTVAI--AQGTALVGGFRQGATLWSRQGITVLMTDSHA------------DFFT 387 (419) Q Consensus 322 ~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~--~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~------------~~~~ 387 (419) .+|++++. ++.+++|+|+||++++.+ +.+.+++|||++ ++++++++++++++++.. .+|+ T Consensus 212 ~~G~~~~~-----~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~ 285 (324) T protein:vir:78 212 PETKERIY-----DRNSDSLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFE 285 (324) T ss_pred cCCCeeec-----CCCCCcccceeeEeeCCCCCCcceEEEEecce-EEEEEecCcEEEEeecccccccccccccchhhhh Confidence 99998754 345678999999997764 566799999997 456779999999987643 4699 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +|++.||++.|+|+++++|+||++++.+...| T Consensus 286 ~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~ 317 (324) T protein:vir:78 286 QDMVALRATMHVALHIADDKAFAKLVPADKRT 317 (324) T ss_pred cCcEEEEEEEEEccEEecccceEEEecccccC Confidence 99999999999999999999999999877666 No 78 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=2.6e-53 Score=308.91 Aligned_cols=299 Identities=13% Similarity=0.093 Sum_probs=236.9 Q ss_pred HHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcc--cccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcce Q lcl|Aclame:pro 91 SDGLREYRARDKRGQFQVEMRDIDPNRLLSRD--APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVL 168 (419) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~ 168 (419) .+ .+...+.+.+.+......... ......+..++.++|+.+.+.|++.+.+.++++++|+++|+.++.+ T Consensus 1 ~~---------~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~ 71 (324) T protein:vir:93 1 ME---------QTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK 71 (324) T ss_pred Cc---------hhHHHHHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCce Confidence 00 000001111111111111111 1111223334457888899999999999999999999999999989 Q ss_pred eeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 169 EYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRD 247 (419) Q Consensus 169 ~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d 247 (419) +||+.++ ...++|++|++.+|+++++|+++++.++|++++++||+|+++|+ .+++++|++++++++++++| T Consensus 72 ~ip~~~~--------~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d 143 (324) T protein:vir:93 72 KFTFWAD--------KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFD 143 (324) T ss_pred EEEEEec--------CcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Confidence 9998763 35789999999999999999999999999999999999999997 59999999999999999999 Q ss_pred HHHHhccCcc-cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCce Q lcl|Aclame:pro 248 RQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGV 326 (419) Q Consensus 248 ~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~ 326 (419) +++|+|+|++ .|.|+++...... ....+...++++.+++..+...++.++.|+|||+++..|++++|++|++ T Consensus 144 ~a~l~G~g~~~~~~~~~~~~~~~~-------~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~ 216 (324) T protein:vir:93 144 EAGILNQGNNPFGKSIAQSIEKTN-------KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKE 216 (324) T ss_pred HHHhcCCCCCCcCccccccccccc-------eeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCe Confidence 9999999975 5777766433221 1222345689999999999999999999999999999999999999998 Q ss_pred eccCCccccCCCcccccceeEecCC--CCcCcEEEEeccceEEEEEecceEEEEeeccc------------chhhcCcEE Q lcl|Aclame:pro 327 FRVIANVQGEATPRIWGLNVVSTVA--IAQGTALVGGFRQGATLWSRQGITVLMTDSHA------------DFFTANTLV 392 (419) Q Consensus 327 ~~~~~~~~~~~~~~l~G~pv~~~~~--~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~ 392 (419) ++. ++.+++|+|+||++++. ++.+.+++|||+++ +++.+++++++++++.. .+|++|++. T Consensus 217 ~~~-----~~~~~~l~G~PVv~~~~~~~~~~~i~~gdfs~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~ 290 (324) T protein:vir:93 217 RIY-----DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA 290 (324) T ss_pred eec-----CCCCCcccceeeEeecCCCCCcceEEEEecceE-EEEEecCcEEEEeecccccccccccccchhhhhcCcEE Confidence 764 24567899999998776 45677999999974 56779999999988753 469999999 Q ss_pred EEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 393 ILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 393 ~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ||++.|+|+++++|+||++++.+.+.| T Consensus 291 ~r~~~r~d~~v~~~~a~~~l~~a~~~~ 317 (324) T protein:vir:93 291 LRATMHVALHIADDKAFAKLVPADKRT 317 (324) T ss_pred EEEEEEeccEEecccceEEEecccccC Confidence 999999999999999999999888877 No 79 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=1.8e-52 Score=304.32 Aligned_cols=385 Identities=15% Similarity=0.136 Sum_probs=233.8 Q ss_pred CCccHHHHHH------------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHH Q lcl|Aclame:pro 1 MPPTPTLEEQ------------------------------RAALLARLDDTSLTTEQVQEIVAEARGLADALQ---AESD 47 (419) Q Consensus 1 M~~~~~L~e~------------------------------~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~---~~~~ 47 (419) ||......+. +.+-+++..++....+... .+...++.- ..++ T Consensus 185 ~~~~~~~~~~~~~~~~~~r~~~~~a~~~~~~~~~a~~~~~~~~E~~r~~eI~~l~~~~~-----~~~~~~~ai~~g~sld 259 (632) T protein:vir:96 185 MPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFS-----QRSLAQEAIQKGHTVD 259 (632) T ss_pred ccchhhhhhccccccccccchhhcccccchhhhhhhhhhhhhhhHHHHHHHHHHHHHhh-----hhhhHHHHHhccccHH Confidence 3221111000 0000001111111000000 000000000 0001 Q ss_pred HHHHHH-HHHHHHHHHHHHHHhhcccccccccchhhhhhHH---HHhHHHHHHHHHhhh----h----hhhhH------- Q lcl|Aclame:pro 48 RAAARA-ALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQR---FADSDGLREYRARDK----R----GQFQV------- 108 (419) Q Consensus 48 ~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~----~----~~~~~------- 108 (419) +..+++ +.+...........................+... .......+....... . ..... T Consensus 260 ~~ra~~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G 339 (632) T protein:vir:96 260 QFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASG 339 (632) T ss_pred HHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhh Confidence 111000 0000000000000000000000000000000000 000000000000000 0 00000 Q ss_pred -HHHH--HHHHHhhhcccccccccCCcccccc-hhhhHHHHHhhhhhhhHHhh-cceecccCcceeeeeeccccceeccc Q lcl|Aclame:pro 109 -EMRD--IDPNRLLSRDAPAGTITNPNVPHLP-QLVPGIVPTTPDLPLLVADL-LDQQNADYNVLEYIRDTSGTAGAGST 183 (419) Q Consensus 109 -~~~~--~~~~~~~~~~~~~~~~~~~~~~~~p-~~~~~~i~~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (419) ..+. ........+....++.+ .|++++| +.+...|++.++..+.++++ ++++|+.++.++||+.++ T Consensus 340 ~~arg~~~~~~~l~~ra~~~~t~~-~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g~~~ip~~~~-------- 410 (632) T protein:vir:96 340 KEARGFYMPHEVLVQRQLEKKTAG-KGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTS-------- 410 (632) T ss_pred hhhhhhhhhHHHHHHhhhhccccc-ccccccccccchHHHHHHHhhcchhhhhcceEeecCCcceEEEEEeC-------- Confidence 0000 00011112222333333 3455555 55678888888888899888 788898888999998774 Q ss_pred cccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCc-ccccc Q lcl|Aclame:pro 184 WNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGS-TEMQG 261 (419) Q Consensus 184 ~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~-~~p~G 261 (419) +..++||+|++.+|+++++|++++++++|++++++||++|++|+ ++++++|+++|+.+++.++|.+||+|+|+ ++|.| T Consensus 411 ~~~a~wv~E~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~G 490 (632) T protein:vir:96 411 GANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVG 490 (632) T ss_pred CceeEeecCCccccccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccce Confidence 35789999999999999999999999999999999999999886 69999999999999999999999999996 68999 Q ss_pred eeccccccccccccccccchhhhHHHHHHHHHHhhhhhcc--CCcEEEEehHHHHHHHH--HhccCCceeccCCccccCC Q lcl|Aclame:pro 262 ILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGF--PPDGVVVHPQDWESIEL--DQAPGSGVFRVIANVQGEA 337 (419) Q Consensus 262 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~--~kd~~g~~~~~~~~~~~~~ 337 (419) |++..++...... .....++++.++...+...+. .+++|+||+.++..|++ ++|.+|+|+|. T Consensus 491 i~~~~~~~~~~~~------~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~-------- 556 (632) T protein:vir:96 491 LLNMTGVPALTYP------AGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ-------- 556 (632) T ss_pred eeecccccceecc------cccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeec-------- Confidence 9987765443221 122346777777777776653 45689999999888876 67999988653 Q ss_pred CcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 338 TPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAA 416 (419) Q Consensus 338 ~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~a 416 (419) +++|+|+||+++++||.+++++|||+++ +++++.++.+.++++. +|.+|++.|+++.|+|+++++|++|++++.+| T Consensus 557 ~~~l~G~pv~~s~~ip~~~~~~gd~s~~-~i~~~~~~~i~~~~~~--~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 557 NNEVNGYRAEASNQIPADTWIFGDWSQI-VIAMWGVLDLKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred CCeecccceEeccccccCcEEEeecceE-EEEEecceEEEEcccc--ccccCceEEEEEeecCceeechhhhhheeecC Confidence 3589999999999999999999999974 5667899999998875 58999999999999999999999999999999 No 80 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=1.4e-52 Score=304.78 Aligned_cols=347 Identities=12% Similarity=0.022 Sum_probs=227.1 Q ss_pred CCcc----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 1 MPPT----PTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPA 76 (419) Q Consensus 1 M~~~----~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 76 (419) |... .+++++++++....++ ... .++.. +.+++..+++.+++.+...... . .... ....... T Consensus 1 M~i~~~~~~~~~e~~~~l~~~~~~-------~~~-~e~~~---~~~~~~~~~~~~~~~~~~~~e~-~-~~~~-~~~~~~~ 66 (377) T protein:vir:96 1 MAINLKELPKYREAVAELSAKISA-------GAT-PEEQE---KLFEAAFTTMGDEILAKNEEEM-E-RMFD-LRDKNRE 66 (377) T ss_pred CCccHHHHHHHHHHHHHHHHHHhh-------ccc-HHHHH---HHHHHHHHHHHHHHHHHHHHHH-H-HHHH-hccCCcc Confidence 5443 2333333333322221 111 11111 1112222222222221111100 0 0000 0000000 Q ss_pred ccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHh Q lcl|Aclame:pro 77 EAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVAD 156 (419) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~ 156 (419) . ..+ .+....... .....++|++++|+.+.+.|++.+...+++++ T Consensus 67 l------------t~e-----------------e~~~~~~~~------~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~ 111 (377) T protein:vir:96 67 L------------TAE-----------------EIKFFNDID------KNVGGKDKFKLLPEETMVQVFDDLVAEHPLLK 111 (377) T ss_pred c------------CHH-----------------HHHHHHHHH------hcCCCCCCceecCHHHHHHHHHHHHhhhhhhh Confidence 0 000 011111111 12234667889999999999999999999999 Q ss_pred hcceecccCcceeeeeeccccceeccccccceeecCccccc-ccccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHH Q lcl|Aclame:pro 157 LLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKP-QSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYI 234 (419) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i 234 (419) +|++.++++ ..++|+.++ .+.+.|++|+++.+ .++++|+++++.+||++++++||++|++|++ ++++|| T Consensus 112 ~~~v~~~~~-~~~i~~~~~--------~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i 182 (377) T protein:vir:96 112 VINFKNTSL-RLKALTAET--------SGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFI 182 (377) T ss_pred hceeEecCC-ceEEEEecC--------CcceeEeecccccccccCccceeEeeeeeeEEeechhhHHHhhcchhhHHHHH Confidence 999999865 467877543 45789999998865 5689999999999999999999999999986 899999 Q ss_pred HHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccc-----------hhhhHHHHHHHHHHhhhhhcc-- Q lcl|Aclame:pro 235 QGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPA-----------TDEPPLVDIRRAKTVAEIAGF-- 301 (419) Q Consensus 235 ~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~-- 301 (419) ++++++++++++|.+|++|+|+++|.||++................ ......+.+.+.+..+...+. T Consensus 183 ~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 262 (377) T protein:vir:96 183 TEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVN 262 (377) T ss_pred HHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccc Confidence 9999999999999999999999999999987654443322211110 001112223333333322222 Q ss_pred ---------CCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccce--eEecCCCCcCcEEEEeccceEEEEE Q lcl|Aclame:pro 302 ---------PPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLN--VVSTVAIAQGTALVGGFRQGATLWS 370 (419) Q Consensus 302 ---------~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~p--v~~~~~~~~~~~~~~d~~~~~~~~~ 370 (419) .++.|+|||.++..+. +++.+.. .+|.+.+++|+| |++++.||++++++|||++ |.+++ T Consensus 263 ~~~~~~~~~~~a~~~mn~~t~~~~~------~~~~~~~---~~G~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~-Y~i~~ 332 (377) T protein:vir:96 263 DKKHPLKIAGQVKLLLNPEDRWTLE------AKFTSRN---QFGEYVTVLPHGITILESLAVETGKAIAFVANR-YDAFM 332 (377) T ss_pred cccccccccCceEEEEchhhHHhcc------ccccccC---CCCCceeccCCCceEEecCCCCcccEEEEEcCc-EEEEE Confidence 3457999999987652 3333322 245566788877 6778999999999999998 88899 Q ss_pred ecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 371 RQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 371 ~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) |.+++++.+++. +|.+|++.||+..|+||++++|+||++++++-- T Consensus 333 r~~~~i~~~~~~--~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 333 ATASTIEEYDQT--FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ecccEEEeehhh--hhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 999999988764 699999999999999999999999999999988 No 81 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=1.9e-53 Score=309.59 Aligned_cols=305 Identities=15% Similarity=0.073 Sum_probs=232.7 Q ss_pred HHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccc Q lcl|Aclame:pro 97 YRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSG 176 (419) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 176 (419) +.....+.... ......++...+ +..++.++|+.+.+.|++.+.+.++++++++++|++++.+++|+.++ T Consensus 1 ~~~~~~r~~~~-------~~~~e~~a~~~~--~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~- 70 (326) T protein:vir:42 1 MAVNPDRTTPF-------LGVNDPKVAQTG--DSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTG- 70 (326) T ss_pred CCCCccchhhh-------cCcchhhheecc--ccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeC- Confidence 00000000000 000011111111 12223357777888899999999999999999999999999998764 Q ss_pred cceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 177 TAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNG 255 (419) Q Consensus 177 ~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g 255 (419) +..++||+||+.+|+++++|+++++.++|+++++++|+|+++|+ .++++||.++|++++++++|+++|+|+| T Consensus 71 -------~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~g 143 (326) T protein:vir:42 71 -------DVSASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTD 143 (326) T ss_pred -------CcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccC Confidence 35789999999999999999999999999999999999999987 5899999999999999999999999999 Q ss_pred cccccceeccccccccccccccccchh-hhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccc Q lcl|Aclame:pro 256 STEMQGILTTPGIGTYQQPKPTAPATD-EPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQ 334 (419) Q Consensus 256 ~~~p~Gi~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~ 334 (419) +++|.|+++.................. ......+..+...+...+..++.|+|||+++..|+++||++|+++|...... T Consensus 144 s~~p~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~ 223 (326) T protein:vir:42 144 SPFPTFLAQTTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYT 223 (326) T ss_pred CCccccccccccccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeecccccc Confidence 999999987655433332222222211 1122234455666777888889999999999999999999999876543222 Q ss_pred ----cCCCcccccceeEecCCCCcCcE--EEEeccceEEEEEecceEEEEeeccc------------chhhcCcEEEEEE Q lcl|Aclame:pro 335 ----GEATPRIWGLNVVSTVAIAQGTA--LVGGFRQGATLWSRQGITVLMTDSHA------------DFFTANTLVILAE 396 (419) Q Consensus 335 ----~~~~~~l~G~pv~~~~~~~~~~~--~~~d~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~ 396 (419) ....++|+|+||++++++|+++. ++|||++++ ++++++++++++++.. ..|++|++.||++ T Consensus 224 ~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~-~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~ 302 (326) T protein:vir:42 224 EENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQLV-WGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVE 302 (326) T ss_pred CccccccCceeeeeeEEEcCCCCCCceEEEEeecceEE-EEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEE Confidence 12345799999999999999874 678999865 6678999999877543 3489999999999 Q ss_pred EEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 397 FRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 397 ~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .|+|+++.+|+||++++..++.. T Consensus 303 ~~~d~~v~~~~a~~~l~~~~~~~ 325 (326) T protein:vir:42 303 AEYAFHCNDKDAFVKLTNVDATE 325 (326) T ss_pred EEeccEEecccceEEEeeccccC Confidence 99999999999999998888777 No 82 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=1.1e-52 Score=305.35 Aligned_cols=347 Identities=12% Similarity=0.072 Sum_probs=224.1 Q ss_pred CCc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|Aclame:pro 1 MPP--TPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEA 78 (419) Q Consensus 1 M~~--~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 78 (419) |+. ..+++++++++....++.... ++..+.++.+ .+++....... ...+.... ... T Consensus 1 m~~kl~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~---~~~~~~~~~~~-~~~e~~~~-~~~--------- 58 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEPQ--------ERQNELYGDM---INQLFEETKLQ-AKAEAERV-SSL--------- 58 (381) T ss_pred CchhHHHHHHHHHHHHHHHHHhhhHH--------HHHHHHHHHH---HHhhhhhHHHH-HHHHHHHH-HHh--------- Confidence 443 344444443333332211110 0001111111 01111100000 00000000 000 Q ss_pred chhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhc Q lcl|Aclame:pro 79 GTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL 158 (419) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~ 158 (419) .........+.+....... ..+...|++++|+.+.+.|++.+...+++|++| T Consensus 59 ---------------------~~~~~~l~~~e~~~~~~~~-------~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a 110 (381) T protein:vir:10 59 ---------------------PKSAQTLSANQRNFFMDIN-------KSVGYKEEKLLPEETIDRIFEDLTTNHPLLADL 110 (381) T ss_pred ---------------------cccccccCHHHHHHHHHHh-------hcCCCCCceecCHHHHHHHHHHHHhhcceeeee Confidence 0000000011111111110 122356778999999999999999999999999 Q ss_pred ceecccCcceeeeeeccccceeccccccceeecCccccc-ccccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHH Q lcl|Aclame:pro 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKP-QSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQG 236 (419) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~ 236 (419) ++.++++ ...+|+.+. .+.+.|++|+++.+ +.+++|+++++.++|++++++||++|++|++ +|++||+. T Consensus 111 ~v~~~~~-~~~i~~~~~--------~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~ 181 (381) T protein:vir:10 111 GIKNAGL-RLKFLKSET--------SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRV 181 (381) T ss_pred eeEecCc-ceEEEeecC--------CcceEEeecccccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHHHHH Confidence 9999865 457777553 35788999987765 5689999999999999999999999999986 89999999 Q ss_pred HHHHHHHHHHHHHHHhccCcccccceeccccccccccccccc---------cchhhhHHHHHHHHHHhh-------hhhc Q lcl|Aclame:pro 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA---------PATDEPPLVDIRRAKTVA-------EIAG 300 (419) Q Consensus 237 ~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~-------~~~~ 300 (419) ++++++++++|.+|++|+|+++|.||++.............. .......+..+...+..+ ...+ T Consensus 182 ~la~~~a~~~~~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 261 (381) T protein:vir:10 182 QIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAV 261 (381) T ss_pred HHHHHHHHHhhceeEecccCCCceeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccc Confidence 999999999999999999999999999753321111111000 001111122222222111 1234 Q ss_pred cCCcEEEEehHHHHHHHHHh---ccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEE Q lcl|Aclame:pro 301 FPPDGVVVHPQDWESIELDQ---APGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVL 377 (419) Q Consensus 301 ~~~~~~~~~~~~~~~l~~~k---d~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~ 377 (419) ..+..|+|||.++..|+.++ +++|+|.+.. ..|+||+++++||+++++||||++ |++++|.+++++ T Consensus 262 ~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~~l----------p~g~~vv~~~~~p~~~i~fGDfs~-Y~i~~r~~~~i~ 330 (381) T protein:vir:10 262 KGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL----------PFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQ 330 (381) T ss_pred cCceEEEEchhhHHhhccccccCCCCCceeecC----------CCCceeEEcCCCCcCcEEEEEccc-EEEEEecccEEE Confidence 56678999999999988654 6677765432 148899999999999999999997 888999999999 Q ss_pred EeecccchhhcCcEEEEEEEEeccEEecccceEEEEec-----CCCC Q lcl|Aclame:pro 378 MTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFA-----AATT 419 (419) Q Consensus 378 ~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~-----aa~~ 419 (419) .+++. +|.+|++.||+..|+||++++|+||++++++ ++++ T Consensus 331 ~~~~~--~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~~~~ 375 (381) T protein:vir:10 331 KFKET--LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALE 375 (381) T ss_pred eechh--hhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCCccccc Confidence 88764 6999999999999999999999999998887 3333 No 83 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=2.2e-53 Score=309.31 Aligned_cols=300 Identities=12% Similarity=0.057 Sum_probs=234.3 Q ss_pred hhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeecccccee Q lcl|Aclame:pro 101 DKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGA 180 (419) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (419) ..++. ...... +.... ..+..++.++|+.+.+.|++.+.+.++++++|+++++.++.++||+..+ T Consensus 1 ~~~~~-~~~~~~--------~~~~~-t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~----- 65 (320) T protein:vir:10 1 MAAGT-AFQVDH--------AQIAQ-TGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIG----- 65 (320) T ss_pred CCCCc-cCCHHH--------HHhhc-cccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeC----- Confidence 11110 000000 00011 1122233357777888888999999999999999999998899998753 Q ss_pred ccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccc Q lcl|Aclame:pro 181 GSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEM 259 (419) Q Consensus 181 ~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p 259 (419) ...++|++|++.+|+++++|+++++.++|++++++||+|+++|+ .+++++|.++|++++++++|++||+|+|++.| T Consensus 66 ---~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~ 142 (320) T protein:vir:10 66 ---DVSAQWIGEGDMKPITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFP 142 (320) T ss_pred ---CcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCC Confidence 45789999999999999999999999999999999999999986 58999999999999999999999999999988 Q ss_pred cceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccc----c Q lcl|Aclame:pro 260 QGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQ----G 335 (419) Q Consensus 260 ~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~----~ 335 (419) .++........................+++.+++..+...+..++.|+|||+++.+|+++||++|++++...... . T Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~ 222 (320) T protein:vir:10 143 TYLAQTTKSVSLADPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSP 222 (320) T ss_pred cccccccccccceecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCcccc Confidence 887654332222222222222223334467788888889999999999999999999999999999877532221 2 Q ss_pred CCCcccccceeEecCCCCcCc--EEEEeccceEEEEEecceEEEEeeccc------------chhhcCcEEEEEEEEecc Q lcl|Aclame:pro 336 EATPRIWGLNVVSTVAIAQGT--ALVGGFRQGATLWSRQGITVLMTDSHA------------DFFTANTLVILAEFRANL 401 (419) Q Consensus 336 ~~~~~l~G~pv~~~~~~~~~~--~~~~d~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~ 401 (419) ...++++|+||++++++|.++ +++|||++++ ++.+++++++++++.. ++|++|++.||++.|+|+ T Consensus 223 ~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~ 301 (320) T protein:vir:10 223 FRAGRIVSRPTILSDHVADGTTVGYMGDFRNVI-WGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAF 301 (320) T ss_pred ccCceeeeeeeEecCCCCCCceEEEEeecceEE-EEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeecc Confidence 234689999999999999986 5689999754 6779999999887653 459999999999999999 Q ss_pred EEecccceEEEEecCCCC Q lcl|Aclame:pro 402 AVYQPKAFVRVTFAAATT 419 (419) Q Consensus 402 ~~~~~~a~~~~~~~aa~~ 419 (419) ++++|+||++++..++|- T Consensus 302 ~v~~~~a~~~l~~~~ap~ 319 (320) T protein:vir:10 302 HNNDKDAFVKLTNVVTPD 319 (320) T ss_pred EEecccceEEEEeccCCC Confidence 999999999999777766 No 84 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=4.5e-53 Score=307.55 Aligned_cols=281 Identities=16% Similarity=0.127 Sum_probs=231.2 Q ss_pred ccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceee Q lcl|Aclame:pro 126 GTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDT 205 (419) Q Consensus 126 ~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~ 205 (419) -.+.+.|++++|+.+.+.|++.++..++++++|+++|+.++..++|+.++ +..++|++||+.+|+++++|++ T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~--------~~~a~wv~Eg~~~~~~~~~f~~ 72 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTA--------PPRGEVVGEGAQKSESTATFAP 72 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeC--------CceeEEeecCcccccccceeeE Confidence 34456678899999999999999999999999999999988999998764 4578999999999999999999 Q ss_pred EEeeeEEEEEeehhhHHHHh---hH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcc---cccceeccccccccccccccc Q lcl|Aclame:pro 206 ITTTLKTVAHWLPITRQAAD---DN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGST---EMQGILTTPGIGTYQQPKPTA 278 (419) Q Consensus 206 v~~~~~k~~~~~~vs~ell~---d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~---~p~Gi~~~~~~~~~~~~~~~~ 278 (419) +++.++|++++++||+|+++ |+ .+|+++|.+++++++++++|.++++|++.+ .|.|+.+.... ........ T Consensus 73 v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~--~~~~~~~~ 150 (311) T protein:vir:81 73 VTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILD--TTNIVELT 150 (311) T ss_pred EEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccc--cceeeeec Confidence 99999999999999999995 33 479999999999999999999999997543 45677653211 11111122 Q ss_pred cchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcC--- Q lcl|Aclame:pro 279 PATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG--- 355 (419) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~--- 355 (419) .......+.++..++..+...+.++++|+|||.++..|+++||.+|+++| ++...++.+++|+|+||++++.||.+ T Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~-~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~ 229 (311) T protein:vir:81 151 TGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLY-PELGFGTDVASFAGLNAAVSDTVRGGPEA 229 (311) T ss_pred ccccchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeee-cCccccCCCceecceeEEecccccccccc Confidence 22223345566677777777778888899999999999999999999866 45566778899999999999999743 Q ss_pred ---------------cEEEEeccceEEEEEecceEEEEeeccc-----chhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 356 ---------------TALVGGFRQGATLWSRQGITVLMTDSHA-----DFFTANTLVILAEFRANLAVYQPKAFVRVTFA 415 (419) Q Consensus 356 ---------------~~~~~d~~~~~~~~~~~~~~i~~~~~~~-----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 415 (419) .+++|||++ +++..+.+++++++++.. ++|++|++.||++.|+|+++++|+||++++.+ T Consensus 230 ~~~~~~~~~~~~~~~~~~~gDfs~-~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a 308 (311) T protein:vir:81 230 VTASTGVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) T ss_pred cccccchhcccCCccEEEEEeccc-EEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEee Confidence 368999998 445568899999887642 46999999999999999999999999999887 Q ss_pred CCC Q lcl|Aclame:pro 416 AAT 418 (419) Q Consensus 416 aa~ 418 (419) ... T Consensus 309 ~~~ 311 (311) T protein:vir:81 309 DES 311 (311) T ss_pred ccC Confidence 766 No 85 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=8.9e-53 Score=305.94 Aligned_cols=301 Identities=13% Similarity=0.068 Sum_probs=237.6 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcc Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLD 159 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~ 159 (419) ..|. .......+.|.....++ .. ... ........++.++|+.+.+.|++.+.+.++|+++|+ T Consensus 1 ~~k~----~~~~~~~~~~~~~~~~~------------~~-~~a-~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~ 62 (324) T protein:vir:99 1 MEQT----QKLKLNLQHFASNNVKP------------QV-FNP-DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGK 62 (324) T ss_pred CCCc----hHhhHHHHHHHHHhhhh------------hh-ccc-cceeccCCCcceechhHHHHHHHHHHhhchhhhhcc Confidence 0000 00001111111111110 00 011 111223344457888899999999999999999999 Q ss_pred eecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHH Q lcl|Aclame:pro 160 QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRL 238 (419) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l 238 (419) ++|+.++.+.||+.++ ...+.|++|++.+|+++++|+++++.++|++++++||+|+++|+ .++++||.++| T Consensus 63 ~~~~~~~~~~~p~~~~--------~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l 134 (324) T protein:vir:99 63 YEPMEGTEKKFTFWAD--------KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMI 134 (324) T ss_pred eeeccCCceEEEEEec--------CcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHH Confidence 9999999999998753 35789999999999999999999999999999999999999997 58999999999 Q ss_pred HHHHHHHHHHHHHhccCcc-cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHH Q lcl|Aclame:pro 239 TYGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIE 317 (419) Q Consensus 239 ~~a~~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 317 (419) ++++++++|+++|+|+|++ .|.|+.+..... .....+...++++.+++..+...++.++.|+|||+++..|+ T Consensus 135 ~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~-------~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~ 207 (324) T protein:vir:99 135 AEAFYKKFDEAGILNQGNNPFGKSIAQSIEKT-------NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLR 207 (324) T ss_pred HHHHHHHHHHHhhhcCCCCccCcccccccccc-------ceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHH Confidence 9999999999999999986 577776532221 12233446689999999999999999999999999999999 Q ss_pred HHhccCCceeccCCccccCCCcccccceeEecCCCC--cCcEEEEeccceEEEEEecceEEEEeeccc------------ Q lcl|Aclame:pro 318 LDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIA--QGTALVGGFRQGATLWSRQGITVLMTDSHA------------ 383 (419) Q Consensus 318 ~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~--~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~------------ 383 (419) +++|.+|++++. .+.+++|+|+||++++.++ .+.+++|||+++ +++++.+++++++++.. T Consensus 208 ~l~d~~g~~~~~-----~~~~~~l~G~PVv~~~~~~~~~~~~i~gd~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 281 (324) T protein:vir:99 208 KIVDPETKERIY-----DRNSDTLDGLPVVNLKSSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPV 281 (324) T ss_pred HhhcCCCceeec-----CCCCccccceeEEeecCCCCCcceEEEEecccE-EEEEecCcEEEEeecccccccccccccch Confidence 999999998764 2456789999999998866 456999999975 56778999999987643 Q ss_pred chhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 384 DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 384 ~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ++|++|++.||++.|+|+++.+|+||++++.+...+ T Consensus 282 ~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~ 317 (324) T protein:vir:99 282 NLFEQDMVALRATMHVALHIADDKAFAKLVPADKKT 317 (324) T ss_pred hhhhcCcEEEEEEEEEccEEecccceEEEEeccCCC Confidence 459999999999999999999999999999999888 No 86 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=3.2e-53 Score=308.41 Aligned_cols=281 Identities=17% Similarity=0.147 Sum_probs=224.3 Q ss_pred ccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccce Q lcl|Aclame:pro 124 PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSF 203 (419) Q Consensus 124 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 203 (419) ..-+.++.|++++|+.+...|++.+++.++++++++++|+.++.++||+.++ ...++||+|++.+|+++++| T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~--------~~~a~wv~Eg~~~~~s~~~f 72 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG--------VPRAKIVGEGEVKPSASVDV 72 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC--------CcceEEeeCCccccccccce Confidence 2223345677889999999999999999999999999999998999998764 45789999999999999999 Q ss_pred eeEEeeeEEEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHHhccCcc---cccceecccccccccccc Q lcl|Aclame:pro 204 DTITTTLKTVAHWLPITRQAADDNS-----QLMGYIQGRLTYGLRFLRDRQLLNGNGST---EMQGILTTPGIGTYQQPK 275 (419) Q Consensus 204 ~~v~~~~~k~~~~~~vs~ell~d~~-----~~~~~i~~~l~~a~~~~~d~~il~G~g~~---~p~Gi~~~~~~~~~~~~~ 275 (419) +++++.++|++++++||+||++++. .|+++|.+++++++++++|.++|+|+|.+ .+.|+.+.. ... T Consensus 73 ~~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~------~~~ 146 (315) T protein:vir:80 73 SAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSL------NKT 146 (315) T ss_pred eeeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCcccccccccc------ccc Confidence 9999999999999999999997653 28899999999999999999999998643 233433221 111 Q ss_pred ccccchhhhHHHHHHHHHHhhhhh-ccCCcEEEEehHHHHHHHHHhccCCceec---cCCccccCCCcccccceeEecCC Q lcl|Aclame:pro 276 PTAPATDEPPLVDIRRAKTVAEIA-GFPPDGVVVHPQDWESIELDQAPGSGVFR---VIANVQGEATPRIWGLNVVSTVA 351 (419) Q Consensus 276 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~kd~~g~~~~---~~~~~~~~~~~~l~G~pv~~~~~ 351 (419) ..........+.++.+++..+... +..+++|+|||+++..|+++++.+|++.+ +.+....+.+++|+|+||+++++ T Consensus 147 ~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~ 226 (315) T protein:vir:80 147 KNIVDATDSATADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASST 226 (315) T ss_pred cceeeccccchHHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceeeEecCc Confidence 112222233467788888777544 44567899999999999999887765322 12344456678999999999999 Q ss_pred CCcC---------cEEEEeccceEEEEEecceEEEEeeccc------chhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 352 IAQG---------TALVGGFRQGATLWSRQGITVLMTDSHA------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAA 416 (419) Q Consensus 352 ~~~~---------~~~~~d~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~a 416 (419) ||.+ .+++|||+++++ ..+++++++++++.. ++|++|++.||++.|+|+++++|+||++++.++ T Consensus 227 ~~~~~~~~~~~~~~~~~GDfs~~~~-g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~ 305 (315) T protein:vir:80 227 VSGAPEMSPASGVKAIVGDFSRVHW-GFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA 305 (315) T ss_pred CCcccccccccccEEEEeecccEEE-EEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeecc Confidence 9864 378899998554 558899999877532 469999999999999999999999999999999 Q ss_pred CCC Q lcl|Aclame:pro 417 ATT 419 (419) Q Consensus 417 a~~ 419 (419) +|. T Consensus 306 a~~ 308 (315) T protein:vir:80 306 APK 308 (315) T ss_pred CCC Confidence 888 No 87 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=1.1e-52 Score=305.35 Aligned_cols=301 Identities=13% Similarity=0.070 Sum_probs=237.9 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcc Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLD 159 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~ 159 (419) ..|. ......++.|.....++ .. .. .........++.++|+.+.+.|++.+.+.++++++|+ T Consensus 1 ~~~~----~~~~~~~~~f~~~~~~~------------~~-~~-a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~ 62 (324) T protein:vir:10 1 MEQT----QKLKLNLQHFASNNVKP------------QV-FN-PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGK 62 (324) T ss_pred CCCc----hHHHHHHHHHHHHhhcc------------ce-ec-ccceeccCCCcceechhHHHHHHHHHHhhchhhhhcc Confidence 0000 00001111111111111 00 01 1111223344557888889999999999999999999 Q ss_pred eecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHH Q lcl|Aclame:pro 160 QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRL 238 (419) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l 238 (419) ++|+.++.+.||+.++ ...+.|++|++.+|+++++|+++++.++|++++++||+|+++|+ .++++||.+++ T Consensus 63 ~~~~~~~~~~~p~~~~--------~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l 134 (324) T protein:vir:10 63 YEPMEGTEKKFTFWAD--------KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMI 134 (324) T ss_pred eeeccCCceEEEEEeC--------CcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHH Confidence 9999999999998753 35789999999999999999999999999999999999999997 58999999999 Q ss_pred HHHHHHHHHHHHHhccCcc-cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHH Q lcl|Aclame:pro 239 TYGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIE 317 (419) Q Consensus 239 ~~a~~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 317 (419) ++++++++|.++|+|+|++ .|.|+++..... .....+...++++.+++..+...++.+++|+|||+++..|+ T Consensus 135 ~~ai~~~~d~a~l~G~g~~~~~~~i~~~~~~~-------~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~ 207 (324) T protein:vir:10 135 AEAFYKKFDEAGILNQGNNPFGKSIAQSIEKT-------NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLR 207 (324) T ss_pred HHHHHHHHHHHhhhcCCCCccCcccccccccc-------ceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHH Confidence 9999999999999999986 578877643221 12223345689999999999999999999999999999999 Q ss_pred HHhccCCceeccCCccccCCCcccccceeEecCCCC--cCcEEEEeccceEEEEEecceEEEEeeccc------------ Q lcl|Aclame:pro 318 LDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIA--QGTALVGGFRQGATLWSRQGITVLMTDSHA------------ 383 (419) Q Consensus 318 ~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~--~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~------------ 383 (419) +++|++|++++. ++.+++|+|+||++++.++ .+.+++|||++++ ++++++++++++++.. T Consensus 208 ~l~d~~g~~~~~-----~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 281 (324) T protein:vir:10 208 KIVDPETKERIY-----DRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLI-YGIPQLIEYKIDETAQLSTVKNEDGTPV 281 (324) T ss_pred HhhccCCceeec-----CCCCccccceeEEeecCCCCCcceEEEEecccEE-EEEecCcEEEEeecccccccccccccch Confidence 999999998764 2446789999999987765 5679999999754 6778999999987643 Q ss_pred chhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 384 DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 384 ~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ++|++|++.||++.|+|+++.+|+||++++.+++.+ T Consensus 282 ~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~ 317 (324) T protein:vir:10 282 NLFEQDMVALRATMHVALHIADDKAFAKLVPADKKT 317 (324) T ss_pred hhhhcCcEEEEEEEEEccEEecccceEEEEeccCCC Confidence 459999999999999999999999999999999988 No 88 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=4.6e-52 Score=302.05 Aligned_cols=347 Identities=12% Similarity=0.074 Sum_probs=226.0 Q ss_pred CCc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|Aclame:pro 1 MPP--TPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEA 78 (419) Q Consensus 1 M~~--~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 78 (419) |+. ..++++.+.++.+.++..... ++..+..++ .++++.+.... +...+. +...... .. T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~~~--------~~~~~~~~~---~~~~~~~~~~~-~~~~e~-~~~~~~~-~~----- 61 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEPQ--------ERQNELYGD---MINQLFEETKL-QAKAEA-ERVSSLP-KS----- 61 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhhhh--------HHHHHHHHH---HHHhhhhhHHH-HHHHHH-HHHHHhc-cC----- Confidence 443 233333333332222211100 000011110 11111111100 000000 0000000 00 Q ss_pred chhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhc Q lcl|Aclame:pro 79 GTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL 158 (419) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~ 158 (419) +. ....+.+...... ...+.+.|++++|+.+.+.|++.+...++++++| T Consensus 62 ------~~------------------~lt~~e~~~~~~~-------~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~ 110 (381) T protein:vir:10 62 ------AQ------------------SLSANQRSFFMDI-------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADL 110 (381) T ss_pred ------cc------------------cccHHHHHHHHHH-------hcccCCCCceecCHHHHHHHHHHHHhhccceehe Confidence 00 0000001111110 1123456789999999999999999999999999 Q ss_pred ceecccCcceeeeeeccccceeccccccceeecCccccc-ccccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHH Q lcl|Aclame:pro 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKP-QSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQG 236 (419) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~ 236 (419) ++.+++++ ..+|+.++ .+.+.|++|++..+ +++++|+++++.+||++++++||++|++|++ +|++||++ T Consensus 111 ~v~~~~~~-~~i~~~~~--------~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~ 181 (381) T protein:vir:10 111 GIKNAGLR-LKFLKSET--------SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRV 181 (381) T ss_pred eeEecCcc-eEEEEecC--------CcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHH Confidence 99998764 67887653 45789999998876 4579999999999999999999999999976 89999999 Q ss_pred HHHHHHHHHHHHHHHhccCcccccceecccccccccccc---------ccccchhhhHHHHHHHHHHhhh-------hhc Q lcl|Aclame:pro 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPK---------PTAPATDEPPLVDIRRAKTVAE-------IAG 300 (419) Q Consensus 237 ~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~-------~~~ 300 (419) ++++++++++|.+|++|+|+++|.||++..+........ ..+.......++.+...+..+. ..| T Consensus 182 ~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 261 (381) T protein:vir:10 182 QIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAV 261 (381) T ss_pred HHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccc Confidence 999999999999999999999999999764432111111 0111112223344444444433 235 Q ss_pred cCCcEEEEehHHHHHHHHHh---ccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEE Q lcl|Aclame:pro 301 FPPDGVVVHPQDWESIELDQ---APGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVL 377 (419) Q Consensus 301 ~~~~~~~~~~~~~~~l~~~k---d~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~ 377 (419) ..+..|+|||.++..|+.++ +++|+|.+.. ..|++|++++.||++++++|||++ |.+++|.+++++ T Consensus 262 ~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l----------~~g~~vv~s~~~p~~~iifgDfs~-Y~i~~r~~~~i~ 330 (381) T protein:vir:10 262 KGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL----------PFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQ 330 (381) T ss_pred cCceEEEEccccHHhhccccccCCCCCceeecC----------CCCceEEecCCCCcCcEEEEeccc-EEEEEecccEEE Confidence 56678999999999988765 4556654321 137789999999999999999997 888999999999 Q ss_pred EeecccchhhcCcEEEEEEEEeccEEecccceEEEEecC--CCC Q lcl|Aclame:pro 378 MTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAA--ATT 419 (419) Q Consensus 378 ~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~a--a~~ 419 (419) .+++. +|.+|++.||+..|+||++++++||++++++. +++ T Consensus 331 ~~~~~--~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~ 372 (381) T protein:vir:10 331 KFKET--LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) T ss_pred eechh--HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCc Confidence 88764 69999999999999999999999999976554 444 No 89 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=4.6e-52 Score=302.05 Aligned_cols=347 Identities=12% Similarity=0.074 Sum_probs=226.0 Q ss_pred CCc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|Aclame:pro 1 MPP--TPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEA 78 (419) Q Consensus 1 M~~--~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 78 (419) |+. ..++++.+.++.+.++..... ++..+..++ .++++.+.... +...+. +...... .. T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~~~--------~~~~~~~~~---~~~~~~~~~~~-~~~~e~-~~~~~~~-~~----- 61 (381) T protein:vir:95 1 MTINLSETFANAKNEFINAVNNGEPQ--------ERQNELYGD---MINQLFEETKL-QAKAEA-ERVSSLP-KS----- 61 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhhhh--------HHHHHHHHH---HHHhhhhhHHH-HHHHHH-HHHHHhc-cC----- Confidence 443 233333333332222211100 000011110 11111111100 000000 0000000 00 Q ss_pred chhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhc Q lcl|Aclame:pro 79 GTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL 158 (419) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~ 158 (419) +. ....+.+...... ...+.+.|++++|+.+.+.|++.+...++++++| T Consensus 62 ------~~------------------~lt~~e~~~~~~~-------~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~ 110 (381) T protein:vir:95 62 ------AQ------------------SLSANQRSFFMDI-------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADL 110 (381) T ss_pred ------cc------------------cccHHHHHHHHHH-------hcccCCCCceecCHHHHHHHHHHHHhhccceehe Confidence 00 0000001111110 1123456789999999999999999999999999 Q ss_pred ceecccCcceeeeeeccccceeccccccceeecCccccc-ccccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHH Q lcl|Aclame:pro 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKP-QSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQG 236 (419) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~ 236 (419) ++.+++++ ..+|+.++ .+.+.|++|++..+ +++++|+++++.+||++++++||++|++|++ +|++||++ T Consensus 111 ~v~~~~~~-~~i~~~~~--------~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~ 181 (381) T protein:vir:95 111 GIKNAGLR-LKFLKSET--------SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRV 181 (381) T ss_pred eeEecCcc-eEEEEecC--------CcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHH Confidence 99998764 67887653 45789999998876 4579999999999999999999999999976 89999999 Q ss_pred HHHHHHHHHHHHHHHhccCcccccceecccccccccccc---------ccccchhhhHHHHHHHHHHhhh-------hhc Q lcl|Aclame:pro 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPK---------PTAPATDEPPLVDIRRAKTVAE-------IAG 300 (419) Q Consensus 237 ~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~-------~~~ 300 (419) ++++++++++|.+|++|+|+++|.||++..+........ ..+.......++.+...+..+. ..| T Consensus 182 ~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 261 (381) T protein:vir:95 182 QIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAV 261 (381) T ss_pred HHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccc Confidence 999999999999999999999999999764432111111 0111112223344444444433 235 Q ss_pred cCCcEEEEehHHHHHHHHHh---ccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEE Q lcl|Aclame:pro 301 FPPDGVVVHPQDWESIELDQ---APGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVL 377 (419) Q Consensus 301 ~~~~~~~~~~~~~~~l~~~k---d~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~ 377 (419) ..+..|+|||.++..|+.++ +++|+|.+.. ..|++|++++.||++++++|||++ |.+++|.+++++ T Consensus 262 ~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l----------~~g~~vv~s~~~p~~~iifgDfs~-Y~i~~r~~~~i~ 330 (381) T protein:vir:95 262 KGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL----------PFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQ 330 (381) T ss_pred cCceEEEEccccHHhhccccccCCCCCceeecC----------CCCceEEecCCCCcCcEEEEeccc-EEEEEecccEEE Confidence 56678999999999988765 4556654321 137789999999999999999997 888999999999 Q ss_pred EeecccchhhcCcEEEEEEEEeccEEecccceEEEEecC--CCC Q lcl|Aclame:pro 378 MTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAA--ATT 419 (419) Q Consensus 378 ~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~a--a~~ 419 (419) .+++. +|.+|++.||+..|+||++++++||++++++. +++ T Consensus 331 ~~~~~--~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~ 372 (381) T protein:vir:95 331 KFKET--LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) T ss_pred eechh--HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCc Confidence 88764 69999999999999999999999999976554 444 No 90 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=1e-52 Score=305.60 Aligned_cols=272 Identities=15% Similarity=0.134 Sum_probs=230.8 Q ss_pred hhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcc--eeeeeeccccceeccccccceeecCcccc Q lcl|Aclame:pro 119 LSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNV--LEYIRDTSGTAGAGSTWNKAAVVPEGTAK 196 (419) Q Consensus 119 ~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~ 196 (419) .-+. ....+++.|++++|+.+.+.|++.++..++++++|+++|+.+.. +.+|+... ....+.|++|++++ T Consensus 1 ~l~~-~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~-------~~~~a~~v~Eg~~~ 72 (293) T protein:vir:48 1 MLDS-KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTD-------ITGLANIDDEAGKI 72 (293) T ss_pred Ccee-ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecC-------CCcceeeecCCccc Confidence 1111 22334556788999999999999999999999999999987654 44554332 23568999999999 Q ss_pred ccc-ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccc Q lcl|Aclame:pro 197 PQS-TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQP 274 (419) Q Consensus 197 ~~~-~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~ 274 (419) |+. +++|++++++++|++++++||+|+++|+ .++++||.+++++++++++|.+|++|.|++.+. T Consensus 73 ~~~~~~~~~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~-------------- 138 (293) T protein:vir:48 73 ADIDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTK-------------- 138 (293) T ss_pred ccccccceeEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccccccc-------------- Confidence 985 6999999999999999999999999997 589999999999999999999999988754321 Q ss_pred cccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecC--CC Q lcl|Aclame:pro 275 KPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTV--AI 352 (419) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~--~~ 352 (419) .....++++.+++.++...+..++.|+||++++..|+++||.+|+|+| ++++.++.+++|+|+||++++ .+ T Consensus 139 ------~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~-~~~~~~~~~~~l~G~Pv~~~~~~~~ 211 (293) T protein:vir:48 139 ------PTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLM-ERDVKSPTGYSIAGFAVKEISDRWL 211 (293) T ss_pred ------ccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceEe-ecCcCCCCCceecceeeEEeccccc Confidence 122358899999999999999999999999999999999999999765 566778888999999998754 34 Q ss_pred Cc-----CcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 353 AQ-----GTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 353 ~~-----~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) |. ..+++|||+++|+++++.+++++++++.+.+|.+|++.||++.|+|+++++|+||++++++++++ T Consensus 212 ~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 283 (293) T protein:vir:48 212 PNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIAD 283 (293) T ss_pred CCccCCceEEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeecccc Confidence 43 24799999999999999999999999888889999999999999999999999999999999888 No 91 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=2.7e-52 Score=303.26 Aligned_cols=301 Identities=13% Similarity=0.078 Sum_probs=235.5 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcc Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLD 159 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~ 159 (419) ..|. ......++.|.....+ .... .. .....+..++.++|+.+.+.|++.+.+.++++++++ T Consensus 1 ~~~~----~~~~~~~~~f~~~~~~------------~~~~-~a-~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~ 62 (324) T protein:vir:96 1 MEQT----QKLKLNLQHFASNNVK------------PQVF-NP-DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGK 62 (324) T ss_pred CCcc----hhhhHHHHHHHHhhhh------------hhhc-cc-ccccccCCCcceechhHHHHHHHHHHhhchhhhhcc Confidence 0000 0000111111111110 0000 11 111222344557888899999999999999999999 Q ss_pred eecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHH Q lcl|Aclame:pro 160 QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRL 238 (419) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l 238 (419) ++|++++.++||+.++ .+.+.||+|++.+|+++++|+++++.++|++++++||+|+++|+ .+++++|.+++ T Consensus 63 ~~~~~~~~~~~p~~~~--------~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l 134 (324) T protein:vir:96 63 YEPMEGTEKKFTFWAD--------KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMI 134 (324) T ss_pred eeeccCCceEEEEEec--------CcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHH Confidence 9999999899998763 34689999999999999999999999999999999999999987 58999999999 Q ss_pred HHHHHHHHHHHHHhccCcc-cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHH Q lcl|Aclame:pro 239 TYGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIE 317 (419) Q Consensus 239 ~~a~~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 317 (419) ++++++++|+++|+|+|++ .|.|+.+..... .........++++.+++..+...++.+++|+||++++..|+ T Consensus 135 ~~aia~~~d~~~l~G~g~~~~~~~~~~~~~~~-------~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~ 207 (324) T protein:vir:96 135 AEAFYKKFDEAGILNQGNNPFGKSIAQSIKKT-------NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLR 207 (324) T ss_pred HHHHHHHHHHHhhhcCCCCCcCcccccccccc-------ceecccccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHH Confidence 9999999999999999976 566765532211 12223345689999999999999999999999999999999 Q ss_pred HHhccCCceeccCCccccCCCcccccceeEecCCC--CcCcEEEEeccceEEEEEecceEEEEeeccc------------ Q lcl|Aclame:pro 318 LDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAI--AQGTALVGGFRQGATLWSRQGITVLMTDSHA------------ 383 (419) Q Consensus 318 ~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~--~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~------------ 383 (419) +++|++|++++. ++.+++|+|+||+++... +.+.+++|||++ ++++++++++++++++.. T Consensus 208 ~lkd~~G~~~~~-----~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~s~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 281 (324) T protein:vir:96 208 KIVDPETKERIY-----DRNSDSLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPV 281 (324) T ss_pred HhhCCCCCeeec-----CCCCCcccceeeEeecCCCCCcceEEEEecce-EEEEEecCcEEEEeecccccccccccccch Confidence 999999998763 345678999999997665 456799999997 456778999999987643 Q ss_pred chhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 384 DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 384 ~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ++|++|++.||+++|+|+++++|+||++++.+...| T Consensus 282 ~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~ 317 (324) T protein:vir:96 282 NLFEQDMVALRATMHVALHIADDKAFAKLVPADKRT 317 (324) T ss_pred hhhhcCcEEEEEEEEeccEEecccceEEEecccccC Confidence 469999999999999999999999999999888877 No 92 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=5.6e-51 Score=296.07 Aligned_cols=358 Identities=11% Similarity=0.041 Sum_probs=236.1 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |...+.+++..+.+.+..+++....++.....++. +.+.+.++++...+.+..... . +.... T Consensus 1 mt~~~~~~e~~~~~~e~~~~~~~~~~~~~~~e~~~----~~~~~~~~~~~~~~~~~~~~e-~-~~~~~------------ 62 (395) T protein:vir:95 1 MADMKQNNVKLKNYHEHKKQFANLVQNGASDEEQS----KAFGAMFDALSNDLQEEITAE-I-NNRVV------------ 62 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHH----HHHHHHHHHHHHHHHHHHHHH-H-HHHHH------------ Confidence 88888887777777666666554443332221111 111222222222111100000 0 00000 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) ... ....+..+ ....+.+.+... .. ..+...|++++|+.+.+.|++.+...++++++|++ T Consensus 63 -----~~~-------~~~~r~~~-~l~~ee~~~~~~-~~------~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v 122 (395) T protein:vir:95 63 -----DNG-------ILAKRSQD-PLTSEERKFFND-IN------YDVGYTDEKILPETVVERVFDDLQKDHPLLSKINF 122 (395) T ss_pred -----HHH-------HHhhcCcc-ccchHHHHHHHH-Hh------hccCCCCceeccHHHHHHHHHHHHhhhhhhhhcee Confidence 000 00000000 001111111111 10 12345678899999999999999999999999999 Q ss_pred ecccCcceeeeeeccccceeccccccceeecCcccc-cccccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAK-PQSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQGRL 238 (419) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~-~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~~l 238 (419) .++++ ...+|+.++ .+.+.|+.|+++. ++++++|+++++.+|+++++++||++|++|++ ++++||+++| T Consensus 123 ~~~~~-~~~i~~~~~--------~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~l 193 (395) T protein:vir:95 123 QNAGI-KTRVIKADP--------AGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQI 193 (395) T ss_pred EecCC-ceEEEEecC--------CcceEEeecccccCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHHHHH Confidence 99876 457777543 3578898887665 56789999999999999999999999999985 8999999999 Q ss_pred HHHHHHHHHHHHHhccCcc--cccceeccccccccccccccccchhhhHHHHH-------HHHHHhh-------hhhccC Q lcl|Aclame:pro 239 TYGLRFLRDRQLLNGNGST--EMQGILTTPGIGTYQQPKPTAPATDEPPLVDI-------RRAKTVA-------EIAGFP 302 (419) Q Consensus 239 ~~a~~~~~d~~il~G~g~~--~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~-------~~~~~~ 302 (419) ++++++++|++||+|+|++ +|+||++............. .+....++++ ..++..+ ...+.. T Consensus 194 a~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~--~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 271 (395) T protein:vir:95 194 QEAISVALESAIINGGGAAKTQPVGLMKDVNTNSGAVTDKA--SSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDG 271 (395) T ss_pred HHHHHHHHhhheeeccCCCCcCceeeeeccccccccccccc--ccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcC Confidence 9999999999999999986 69999986544332222111 1111122222 2222221 123445 Q ss_pred CcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccc--cceeEecCCCCcCcEEEEeccceEEEEEecceEEEEee Q lcl|Aclame:pro 303 PDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIW--GLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTD 380 (419) Q Consensus 303 ~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~--G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~ 380 (419) +..|+||+.++. +..|+|.|.+ .++.+.+++ |+||+++++||+++++||||++ |++++|.++++++++ T Consensus 272 ~~~~~mn~~t~~------~~~g~~~~~~---~~G~~~~~lg~g~~v~~~~~~p~~~i~fgdfs~-y~i~~r~~~~i~~~~ 341 (395) T protein:vir:95 272 KVALVVNPRDSW------DVQARYTYLT---ANGGFVTVLPYNVTIITSEFVPEGKLVAFVTDR-YNAVRGGGLTVKKFD 341 (395) T ss_pred ceEEEEcchhhh------hcCCcceecc---CCCcceeccCCcceEEEcCCCCCCcEEEEeccc-EEEEEecceEEEecc Confidence 668999999875 3456665533 345667776 5568999999999999999998 788899999999887 Q ss_pred cccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 381 SHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 381 ~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) + .+|.+|++.||+..|+|+++++++||++++++.++- T Consensus 342 ~--~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~~~ 378 (395) T protein:vir:95 342 Q--TLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVASA 378 (395) T ss_pred c--hhhhCCcEEEEEEEEECCEEeccccEEEEEeeccCC Confidence 5 469999999999999999999999999999873322 No 93 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=3.6e-52 Score=302.60 Aligned_cols=281 Identities=14% Similarity=0.093 Sum_probs=233.3 Q ss_pred HHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcc-eeeeeeccccceeccccccceeecCcc Q lcl|Aclame:pro 116 NRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNV-LEYIRDTSGTAGAGSTWNKAAVVPEGT 194 (419) Q Consensus 116 ~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~Eg~ 194 (419) .............++.++.++|+.+.+.|++.+.+.++++++|++++++++. ..+|+.. +...+.|++|++ T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~--------~~~~a~~v~Eg~ 72 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQT--------DGISAYWVNETE 72 (297) T ss_pred CCccccccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEc--------CCceeEEeecCc Confidence 0111111111223445566889999999999999999999999999997654 5566544 345789999999 Q ss_pred cccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccc Q lcl|Aclame:pro 195 AKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQ 273 (419) Q Consensus 195 ~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~ 273 (419) .+|+++++|++++++++|++++++||+|+++|+ .++++||++++++++++++|.++|+|+|++.|.|+++..+... T Consensus 73 ~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~--- 149 (297) T protein:vir:95 73 KIKTDKPEVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDAN--- 149 (297) T ss_pred cccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccc--- Confidence 999999999999999999999999999999987 5899999999999999999999999999999999987543221 Q ss_pred ccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCC-- Q lcl|Aclame:pro 274 PKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVA-- 351 (419) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~-- 351 (419) ....+...++++.+++.++...+..+++|+|||+++.+|++++|.+|++++. +.+++|+|+||+.+.. T Consensus 150 ----~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~------~~~~~l~G~Pv~~~~~~~ 219 (297) T protein:vir:95 150 ----KVIGGPINYDNILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYD------KAANTIDGITTVDLKSAR 219 (297) T ss_pred ----eecccccCHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeec------CCCCcccceeeEeecCCC Confidence 1122334688999999999999999999999999999999999999998652 3456899999997654 Q ss_pred CCcCcEEEEeccceEEEEEecceEEEEeeccc------------chhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|Aclame:pro 352 IAQGTALVGGFRQGATLWSRQGITVLMTDSHA------------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 352 ~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~ 418 (419) ++.+.+++|||+++ +++.+.+++++++++.. +.|++|++.||++.|+|+++++|+||++++.++.+ T Consensus 220 ~~~~~~~~gd~s~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 220 FEKGDLLAGDFDNL-IYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred CCCceEEEEecccE-EEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 56789999999974 56779999999987653 45999999999999999999999999999999998 No 94 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=2.5e-52 Score=303.49 Aligned_cols=288 Identities=14% Similarity=0.078 Sum_probs=227.0 Q ss_pred hhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccc Q lcl|Aclame:pro 104 GQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGST 183 (419) Q Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (419) +.+..+.+.. .....+..+++++|+. ...|++.+++.++++++++++++.++.++||+.+. T Consensus 1 ~g~~~e~~~~----------~~~~t~~~~g~l~~~~-~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~-------- 61 (397) T protein:vir:23 1 MGFSADHSQI----------AQTKDTMFTGYLDPVQ-AKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTG-------- 61 (397) T ss_pred CCcCHHHHHH----------hhccCCCCccccchhH-HHHHHHHHHhccchhhhcceeeccCCceEEEEEcC-------- Confidence 1111111110 1112233455566654 56667777888999999999999998899998764 Q ss_pred cccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccce Q lcl|Aclame:pro 184 WNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGI 262 (419) Q Consensus 184 ~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi 262 (419) ...+.||+|++.+|+++++|+++++.+||++++++||+|+++|+ .+++++|+++|++++++++|++||+|+|++++.+. T Consensus 62 ~~~a~wv~Eg~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~ 141 (397) T protein:vir:23 62 DVSAQWIGEGDMKPITKGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQG 141 (397) T ss_pred CcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCccccc Confidence 34789999999999999999999999999999999999999987 59999999999999999999999999998755443 Q ss_pred eccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccC-----C Q lcl|Aclame:pro 263 LTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGE-----A 337 (419) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~-----~ 337 (419) +...... .........++++.+++..+...+..+++|+||++++..|+++||++|+++|.+.. ..+ . T Consensus 142 ~~~~~~~-------~~~~~~~~~~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~-~~~~~~~~~ 213 (397) T protein:vir:23 142 YLDQSNK-------TQSISPNAYQGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVEST-YESLTTPFR 213 (397) T ss_pred ccccccc-------eeeecccchhHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccc-ccccccccc Confidence 3221111 11122234467778888889999999999999999999999999999998765433 322 3 Q ss_pred CcccccceeEecCCCCcCcE--EEEeccceEEEEEecceEEEEeeccc------------chhhcCcEEEEEEEEeccEE Q lcl|Aclame:pro 338 TPRIWGLNVVSTVAIAQGTA--LVGGFRQGATLWSRQGITVLMTDSHA------------DFFTANTLVILAEFRANLAV 403 (419) Q Consensus 338 ~~~l~G~pv~~~~~~~~~~~--~~~d~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~ 403 (419) +++|+|+||+++++||+++. ++|||++++ +.+++++.++++++.. .+|++|++.||++.|+|+++ T Consensus 214 ~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~-i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v 292 (397) T protein:vir:23 214 EGRILGRPTILSDHVAEGDVVGYAGDFSQII-WGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLI 292 (397) T ss_pred CceeeeeeEEEeCCCCCCceEEEEeecceEE-EEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccce Confidence 45899999999999998864 789999755 5678999999876543 45999999999999999999 Q ss_pred ecccceEEEEecCCCC Q lcl|Aclame:pro 404 YQPKAFVRVTFAAATT 419 (419) Q Consensus 404 ~~~~a~~~~~~~aa~~ 419 (419) ++|+||++++.++..+ T Consensus 293 ~~~~a~~~~~~~~~~~ 308 (397) T protein:vir:23 293 NDVNAFVKLTFDPVLT 308 (397) T ss_pred ecccceEEEeeccccc Confidence 9999999999977766 No 95 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=1.5e-51 Score=299.21 Aligned_cols=365 Identities=10% Similarity=0.026 Sum_probs=228.6 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) |+ ..|++..+++.++.+++.+..++.....++ ...+.+.++.+++.+.+.. ........+..... T Consensus 1 M~--~kl~~~~~~~~e~~~~l~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~------- 65 (383) T protein:vir:78 1 MT--IKLKNNLANYEEKRTAFVNAVKNEDTQEIQ----NKAYVEMVDAMAADIMEQA--KKEARQEADAYISA------- 65 (383) T ss_pred Cc--hhHHHHHHHHHHHHHHHHHHHhccChHHHH----HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHh------- Confidence 77 346666666666555554433221111111 1111111222222111000 00000000000000 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) ... ...+..+.+..... . . ....+.|++++|+.+.+.|++.+...++++++|++ T Consensus 66 ------------------~~g-~~~lt~~e~~~~~~-~-----~-~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v 119 (383) T protein:vir:78 66 ------------------SRT-DKNITNEEIKFFND-I-----N-KEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGM 119 (383) T ss_pred ------------------cCC-hhhhhHHHHHHHHH-H-----h-ccCCCCCccccCHHHHHHHHHHHHhhccceeeeee Confidence 000 00000111111111 1 0 12345678899999999999999999999999999 Q ss_pred ecccCcceeeeeeccccceeccccccceeecCccccc-ccccceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKP-QSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQGRL 238 (419) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~~l 238 (419) .+++++ .++|+.++ .+.+.|++|+++.+ .++++|+++++.++|++++++||++|++|++ ++++||++++ T Consensus 120 ~~~~~~-~~i~~~~~--------~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l 190 (383) T protein:vir:78 120 RTTGLR-TKFLKSET--------SGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQI 190 (383) T ss_pred EecCCc-eEEEEEcC--------CcceEEeecccccccccCcceeeEeecceeeEeeccchHHHhhccHHHHHHHHHHHH Confidence 998765 68887653 35788999988764 5689999999999999999999999999985 8999999999 Q ss_pred HHHHHHHHHHHHHhccCcccccceeccccccccccccc--cccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHH Q lcl|Aclame:pro 239 TYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKP--TAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESI 316 (419) Q Consensus 239 ~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 316 (419) ++++++++|.+||+|+|+++|.||++..+......... ....+....++++......+. .++.+..|+||..++..+ T Consensus 191 ~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~~~~~~~~ 269 (383) T protein:vir:78 191 EEAFAVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELT-DVYKYHSVKENGHPLNVA 269 (383) T ss_pred HHHHHHHHhhheEeccCCCCceeeeeccCCcccccccccccccccchhhhhhhHHHHHHHH-HHHhccchhcccchhhhc Confidence 99999999999999999999999997543222111111 111222233444444443333 333444455555555554 Q ss_pred HHHh--ccCCceeccCCcc----ccCCCcccccce--eEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhc Q lcl|Aclame:pro 317 ELDQ--APGSGVFRVIANV----QGEATPRIWGLN--VVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTA 388 (419) Q Consensus 317 ~~~k--d~~g~~~~~~~~~----~~~~~~~l~G~p--v~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~ 388 (419) ++++ .+.+.++.+++.. .+|.+.+++|+| |+++++||++++++|||++ |.+++|.+++++.+++ .+|.+ T Consensus 270 ~~~~~~~n~~~~~~~~~~~~~~~~~G~~~t~l~~~~~iv~s~~~p~~~iifgdfs~-Y~i~~r~~~~i~~~~~--~~f~~ 346 (383) T protein:vir:78 270 GKVTLLVNPTDAWDVKKQYTSLNANGVYVTALPFNLNIIESLFVPEKKAISYVAER-YDALIGGPLDIGTYDQ--TLAIE 346 (383) T ss_pred CceEEEEcCcchhhhccchhccCCCCceeeecCCCceEEecCCCCcccEEEeeccc-eEEEecccceEEecch--hhhhc Confidence 4443 1111222122111 234445677666 7789999999999999998 8889999999998775 46999 Q ss_pred CcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 389 NTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 389 ~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) |++.||+..|+|+++++|+||++++++-+.. T Consensus 347 d~~~f~~~~r~dG~~~~~~A~~vl~~~~~~~ 377 (383) T protein:vir:78 347 DLNLYAAKQFAYGKAKDDKAAAVWTLNINPA 377 (383) T ss_pred CceEEEEEEEEcCEEecCCeEEEEEEEecCC Confidence 9999999999999999999999987765444 No 96 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=4.4e-52 Score=302.16 Aligned_cols=295 Identities=13% Similarity=0.072 Sum_probs=235.9 Q ss_pred hhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeecccccee Q lcl|Aclame:pro 101 DKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGA 180 (419) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (419) .+++. ...... +.... ..++.++.++|+.+.+.|++.++..++++++|+++|+.++.++||+.++ T Consensus 1 ~~~~~-~~~~e~--------~~~~~-~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~----- 65 (318) T protein:vir:24 1 MAAGT-AFAVDH--------AQIAQ-TGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVG----- 65 (318) T ss_pred CCCCC-CCCHHH--------HHhhc-ccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeC----- Confidence 12211 111101 00011 1123344467888888999999999999999999999999999998764 Q ss_pred ccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccc Q lcl|Aclame:pro 181 GSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEM 259 (419) Q Consensus 181 ~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p 259 (419) ...++|++|++.+|+++++|+++++.++|+++++++|+|+++|+ .+++++|.++|++++++++|.+||+|+|++.| T Consensus 66 ---~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~ 142 (318) T protein:vir:24 66 ---DVSAQWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFP 142 (318) T ss_pred ---CcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCC Confidence 35789999999999999999999999999999999999999987 58999999999999999999999999999999 Q ss_pred cceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCC-- Q lcl|Aclame:pro 260 QGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEA-- 337 (419) Q Consensus 260 ~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~-- 337 (419) .|+++.......... ........+++.+++..+...+..+++|+|||+++..|+++||++|+|+|.. ++.++. T Consensus 143 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~-~~~~~~~~ 217 (318) T protein:vir:24 143 TYIGQTTKAISIADT----TGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIE-STYGEAAS 217 (318) T ss_pred ccccccccccccccc----ccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecC-ccccCccc Confidence 999875433222211 2222334566778888899999999999999999999999999999987644 333332 Q ss_pred ---CcccccceeEecCCCCcCc--EEEEeccceEEEEEecceEEEEeeccc------------chhhcCcEEEEEEEEec Q lcl|Aclame:pro 338 ---TPRIWGLNVVSTVAIAQGT--ALVGGFRQGATLWSRQGITVLMTDSHA------------DFFTANTLVILAEFRAN 400 (419) Q Consensus 338 ---~~~l~G~pv~~~~~~~~~~--~~~~d~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d 400 (419) ..+++|+||++++.+|.++ +++|||++ ++++++++++++++++.. +.|++|++.||++.|+| T Consensus 218 ~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~-~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d 296 (318) T protein:vir:24 218 PFRSGRIVARPTILSDHVVEGTTVGFMGDFSQ-LIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYA 296 (318) T ss_pred cccCceEEEEeeEEeCCCCCCccEEEEeecce-EEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEc Confidence 3578999999999999775 57899997 456779999999877643 45999999999999999 Q ss_pred cEEecccceEEEEecCCCC Q lcl|Aclame:pro 401 LAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 401 ~~~~~~~a~~~~~~~aa~~ 419 (419) +++++|+||++++..++-+ T Consensus 297 ~~v~~~~a~~~i~~~~a~~ 315 (318) T protein:vir:24 297 FHCNDAEAFVALTNVVSGG 315 (318) T ss_pred cEEecccceEEEEeeccCC Confidence 9999999999999888777 No 97 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=1.1e-51 Score=300.04 Aligned_cols=283 Identities=18% Similarity=0.121 Sum_probs=226.0 Q ss_pred cccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCccccccccccee Q lcl|Aclame:pro 125 AGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFD 204 (419) Q Consensus 125 ~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 204 (419) ..+.++.++.++|+.+.+.|++.+++.+.++++|+++|++++..+||+.++ ...++||+|++++|+++++|+ T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~--------~~~a~wv~Eg~~~~~~~~~f~ 72 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNG--------RPKAEFVGEGQQKSSTTGEFD 72 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeC--------CceeEEeecCcccccccceee Confidence 233456677788999999999999999999999999999988899998764 357899999999999999999 Q ss_pred eEEeeeEEEEEeehhhHHHHh---hH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccc-ccc Q lcl|Aclame:pro 205 TITTTLKTVAHWLPITRQAAD---DN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKP-TAP 279 (419) Q Consensus 205 ~v~~~~~k~~~~~~vs~ell~---d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~-~~~ 279 (419) ++++.++|++++++||+||++ |+ .+|++||+++|++++++++|+++|+|+|++++.++....+......... ... T Consensus 73 ~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~ 152 (311) T protein:vir:99 73 FVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTA 152 (311) T ss_pred EEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccc Confidence 999999999999999999984 43 5899999999999999999999999999766555443222222111111 112 Q ss_pred chhhhHHHHHHHHHHhhhhh--ccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcC-- Q lcl|Aclame:pro 280 ATDEPPLVDIRRAKTVAEIA--GFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG-- 355 (419) Q Consensus 280 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~-- 355 (419) ......+.++..++..+... ....++|+||+.++..|+++||.+|+|+| ++...++.+++|+|+||++++.+|.+ T Consensus 153 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~-~~~~~~~~~~~l~G~Pv~~s~~i~~~~~ 231 (311) T protein:vir:99 153 DTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKF-PELGLGIGVSSFEGIDASVSDTVNGGDE 231 (311) T ss_pred cccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeee-cCcccCCCCceecceeeEeecccccccc Confidence 22233455566666555443 23456799999999999999999999765 66777778889999999999988632 Q ss_pred --------------cEEEEeccceEEEEEecceEEEEeeccc-----chhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 356 --------------TALVGGFRQGATLWSRQGITVLMTDSHA-----DFFTANTLVILAEFRANLAVYQPKAFVRVTFAA 416 (419) Q Consensus 356 --------------~~~~~d~~~~~~~~~~~~~~i~~~~~~~-----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~a 416 (419) .+++|||++++.+..+++++++++++.. ++|++|++.||++.|+|++++|| +|++++.++ T Consensus 232 ~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~~~ 310 (311) T protein:vir:99 232 ADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIENAV 310 (311) T ss_pred cccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecCh-hHeeeeccc Confidence 3678999998888889999999887643 46999999999999999999997 577777777 Q ss_pred C Q lcl|Aclame:pro 417 A 417 (419) Q Consensus 417 a 417 (419) | T Consensus 311 A 311 (311) T protein:vir:99 311 A 311 (311) T ss_pred C Confidence 7 No 98 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=1.5e-50 Score=293.67 Aligned_cols=279 Identities=11% Similarity=0.054 Sum_probs=222.6 Q ss_pred ccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCccc-----ccc Q lcl|Aclame:pro 124 PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTA-----KPQ 198 (419) Q Consensus 124 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-----~~~ 198 (419) .+..+++.++.++|+.+.+.|++.+++.++|+++++++++.++.+++|+.++ ...+.||+|++. +|. T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~--------~~~a~wv~E~~~~~~~~~~~ 72 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLAT--------LPEADWVGESATDPKGVKPT 72 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeC--------CcceEEeecccccccccccc Confidence 3334455677889999999999999999999999999999999999998764 357899999986 455 Q ss_pred cccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCccccccee---ccccccccccc Q lcl|Aclame:pro 199 STLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGIL---TTPGIGTYQQP 274 (419) Q Consensus 199 ~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~---~~~~~~~~~~~ 274 (419) ++++|++++++++|++++++||+|+++|+ .++++||+++|++++++++|.+||+|+|++.+.+.. +.......... T Consensus 73 s~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~ 152 (305) T protein:vir:25 73 SKVTWANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVE 152 (305) T ss_pred cccceeeEEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCcccccccccccccccccc Confidence 68999999999999999999999999997 589999999999999999999999999875433322 22111111111 Q ss_pred cccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCc Q lcl|Aclame:pro 275 KPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQ 354 (419) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~ 354 (419) .........+.++.+..+...+...++..+.|+|||.++..|+++||++|+++|. +++|+|+||++++.+|. T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~--------~~~l~G~Pv~~~~~~~~ 224 (305) T protein:vir:25 153 VVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFR--------DDSFAGFRTFFNRNGAW 224 (305) T ss_pred ccccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeec--------CCcccccceEEcCccCC Confidence 2222222333455555666666666677778999999999999999999998763 24799999999999874 Q ss_pred ----CcEEEEeccceEEEEEecceEEEEeeccc--------chhhcCcEEEEEEEEeccEEecccceEEEEecCCC--C Q lcl|Aclame:pro 355 ----GTALVGGFRQGATLWSRQGITVLMTDSHA--------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAAT--T 419 (419) Q Consensus 355 ----~~~~~~d~~~~~~~~~~~~~~i~~~~~~~--------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~--~ 419 (419) +.+++|||++ ++++++++++++++++.. .+|++|++.+|++.|+|+.+++|+||++++..++. + T Consensus 225 ~~~~~~~~~gd~s~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~ 302 (305) T protein:vir:25 225 DADAAIEVIADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVA 302 (305) T ss_pred CCCccEEEEEecce-EEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccC Confidence 4689999997 566788999999887542 46999999999999999999999999999887543 3 No 99 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=4.8e-38 Score=225.17 Aligned_cols=380 Identities=12% Similarity=0.076 Sum_probs=205.1 Q ss_pred CCccH---HHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 1 MPPTP---TLEEQRAALLARLDDTSLTTEQVQEI---VAEARGLADALQAESDR-AAARAALLRTAPPAPKGPADGGTPL 73 (419) Q Consensus 1 M~~~~---~L~e~~~~l~~~~~~~~~~~~~~~~~---~~e~~~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~ 73 (419) |-+-+ .+++..+....+..+.....++.... ..+.....++++.+.+. ..+....+................. T Consensus 124 a~~~a~I~~vke~~~~e~~~~~~~~a~~ee~~e~~~k~~el~a~l~~~~~~~~~~~~e~~~~l~a~~~~~~~~~~~~~~~ 203 (517) T protein:vir:97 124 SNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVE 203 (517) T ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHhhhhcccc Confidence 22222 23332222222221111111111111 11111111111111111 1111112221111111111100000 Q ss_pred cccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhh Q lcl|Aclame:pro 74 TPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLL 153 (419) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~ 153 (419) .......................... ...........-.....+++..|..+...|......... T Consensus 204 ----~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~ 268 (517) T protein:vir:97 204 ----ALKVTPEATEFLKTREAEVAYMSASL-----------TKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGS 268 (517) T ss_pred ----cccccchhhHHHHHHHHHHHHHHhcc-----------cccccceeeeecccccccccccchHHHHHHHHhhhhhcc Confidence 00000000000000000000000000 000000000000111224566777777777666666666 Q ss_pred HHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHH----- Q lcl|Aclame:pro 154 VADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS----- 228 (419) Q Consensus 154 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~----- 228 (419) ++.++++.++.. ..++..+ ....+.|+.||+.+|+++++|+.+++.++++++++++|+++++|+. T Consensus 269 i~~~~~~~~i~~--~~~~~~~--------~~~~a~~~~eG~~kp~s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~ 338 (517) T protein:vir:97 269 LLPFIRHENLPT--LVVGGDN--------ALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAG 338 (517) T ss_pred ceeeeeeccccc--eeeeccc--------ccceeeeeecCCcccccccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHH Confidence 666665544322 2233221 2346789999999999999999999999999999999999998763 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCcc-cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEE Q lcl|Aclame:pro 229 QLMGYIQGRLTYGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVV 307 (419) Q Consensus 229 ~~~~~i~~~l~~a~~~~~d~~il~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (419) .|++||.++|+++++++++.+||+|+|++ ++.|+++..+... ......++...+++..+...... ..++.|+ T Consensus 339 ~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~------~~~~~~~~~~~d~i~~l~~a~~~-a~~a~~v 411 (517) T protein:vir:97 339 AILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAW------ATNVTGTTNIQELLEKLSVATPK-AADSTLV 411 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccCCCcccccccccccccc------cccccccchHHHHHHHHHHHhhh-ccCCEEE Confidence 39999999999999999999999999986 5667775432111 11111222333444443322222 2467899 Q ss_pred EehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 308 VHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 308 ~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) |||.+|..|+++||++|+|+| ++...++.+.+++|..-+. +.++.+...++..+ .|.++.+.++....+. .+. T Consensus 412 mn~~t~~~I~klKD~~G~Yl~-~~~~~~~~~~~l~G~~~~~-~~~~~~~~~~~~~~-~y~i~~~~g~~~~~~f----d~~ 484 (517) T protein:vir:97 412 IHRNDLAAIRFLKDKNGNYVF-PVGVSNQTIATHFGFNRLV-QSVAVDEKTAVSLS-GYVTNGSRGMEFEQGT----ILV 484 (517) T ss_pred ECHHHHHHHHHhhcCCCCeec-cCcCCcccccccCCccccc-cccccCceeEeecc-ccEEEeecceeeeeee----ecc Confidence 999999999999999999865 5566677778888853222 23444555555544 5666667666543221 145 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +|+..|+.+.|+++.++.|++|+.+.+.+++. T Consensus 485 ~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~~ 516 (517) T protein:vir:97 485 ENNKEYLFEMPISGSLEYKGTTAYGTYTPPVA 516 (517) T ss_pred cCceeEeeeeeeccccccccceEEEEEcCCCC Confidence 78899999999999999999999999999999 No 100 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=3.5e-38 Score=225.94 Aligned_cols=293 Identities=11% Similarity=-0.006 Sum_probs=221.7 Q ss_pred hhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceec-ccCcceeeeeeccccceecc Q lcl|Aclame:pro 104 GQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQN-ADYNVLEYIRDTSGTAGAGS 182 (419) Q Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 182 (419) .. +++.... +. .... ....+|++++|+.+. .+++.+++.+++++++++++ +++....+++..... . T Consensus 1 ~~---~~~~~~~--~~-k~it--~~d~~gG~L~P~~~~-~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~----~ 67 (314) T protein:vir:41 1 MD---FLNKPFQ--IT-PKID--VPDLGKGILAVQRFG-EFVREVRENSAIIKDARVLNALKSYEVDISRISLGV----E 67 (314) T ss_pred Cc---hhhhHHH--hh-cccc--cccCCCceeChHHHH-HHHHHHHhccchhhheeeecccCccceeecccccCc----c Confidence 00 0111111 11 1111 123346789998886 57788999999999999986 466677777643211 0 Q ss_pred ccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHHHHHhccCc--- Q lcl|Aclame:pro 183 TWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS---QLMGYIQGRLTYGLRFLRDRQLLNGNGS--- 256 (419) Q Consensus 183 ~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~---~~~~~i~~~l~~a~~~~~d~~il~G~g~--- 256 (419) ......|.+|.+..++++++|+++.+.+||+...+.||+|+|+|+. +|+++|...|++++++.++.++++|+|+ T Consensus 68 ~~~~~~~~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s 147 (314) T protein:vir:41 68 LEPGRNTSGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTT 147 (314) T ss_pred cccccccccCCccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcC Confidence 1234567788888899999999999999999999999999999984 8999999999999999999999999985 Q ss_pred -----ccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccC---CcEEEEehHHHHHHHHHhccCCceec Q lcl|Aclame:pro 257 -----TEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFP---PDGVVVHPQDWESIELDQAPGSGVFR 328 (419) Q Consensus 257 -----~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~l~~~kd~~g~~~~ 328 (419) ++|.||++..+...... ...+.....+.+.+++..++..|++ +.+|+||+.++.+++++++..++++ T Consensus 148 ~~~~~~~p~G~l~~a~~~~~~~----~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l- 222 (314) T protein:vir:41 148 GRELYRINDGWMKLAGNQYTDA----EPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGL- 222 (314) T ss_pred cccchhcchhhhhhcccceeec----CccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcc- Confidence 26889987654332211 1223344566778889999999875 4479999999999999999888875 Q ss_pred cCCccccCCCcccccceeEecCCCC-----cCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEE Q lcl|Aclame:pro 329 VIANVQGEATPRIWGLNVVSTVAIA-----QGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAV 403 (419) Q Consensus 329 ~~~~~~~~~~~~l~G~pv~~~~~~~-----~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~ 403 (419) ++.....+.+.+|+|+||+..+.|| ++.++++||++. ++..+..++++..+ +..++++.|.+..|+|+.+ T Consensus 223 ~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~i~fgd~~nl-v~~~~~~ir~~~~~----~a~~~~~~~~~~~r~d~~~ 297 (314) T protein:vir:41 223 GDSALIGATGLQYDGIPIQYVPALDALGDDKARALLTVPTNL-VYGFWRNIRIEPKR----DAAMRRTEYIASLRADCNY 297 (314) T ss_pred cchhhhCCCCceecceeeEecccccccCCCCceEEEechhhe-EEEeeceeEEeecc----cCcCCeEEEEEEEEeceEE Confidence 4555667788899999999999874 578999999975 44556666665443 3468899999999999999 Q ss_pred ecccceEEEEecCCCC Q lcl|Aclame:pro 404 YQPKAFVRVTFAAATT 419 (419) Q Consensus 404 ~~~~a~~~~~~~aa~~ 419 (419) .+++|.++..+-.+.+ T Consensus 298 ~~~~aa~~~~~~~~~~ 313 (314) T protein:vir:41 298 EDENAAVAAVIDMSSG 313 (314) T ss_pred EEcCcEEEEEeeccCC Confidence 9999999888888888 No 101 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=2e-38 Score=227.24 Aligned_cols=295 Identities=13% Similarity=0.007 Sum_probs=212.3 Q ss_pred HHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceec-ccCcceeeeeec Q lcl|Aclame:pro 96 EYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQN-ADYNVLEYIRDT 174 (419) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~ 174 (419) .+.-...+. ........ .. .....+|++++|+.... +++.+.+.++++++|++++ +++....+++.. T Consensus 1 ~~~~~~~~~--------~~~~~~~k-~~--t~~d~~Gg~l~P~~~~~-~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g 68 (315) T protein:vir:41 1 MLTIEDIRG--------GKPFEIVP-KI--DVPDLGRGVLSVDRFGE-FVKAVRDSAVIIPEARIDNALKSYEKDISRLS 68 (315) T ss_pred Ccccchhhc--------CChhhhhh-hc--CCcCCCCceechHHHHH-HHHHHHhhhhhhhhceeeeccccccccccccc Confidence 000000000 00000111 11 12234678888988766 6678888999999999875 444444443321 Q ss_pred cccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 175 SGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS---QLMGYIQGRLTYGLRFLRDRQLL 251 (419) Q Consensus 175 ~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~---~~~~~i~~~l~~a~~~~~d~~il 251 (419) .. ........|.+|....++++++|+++.+.++++.+.+.||+++|+|+. +|+++|...+++++++.++.+++ T Consensus 69 ~~----~~~~~g~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~ 144 (315) T protein:vir:41 69 LV----LDVGPGRDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYL 144 (315) T ss_pred cC----cccccccccccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhh Confidence 11 011123458888888999999999999999999999999999999974 89999999999999999999999 Q ss_pred hccCc------ccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccC---CcEEEEehHHHHHHHHHhcc Q lcl|Aclame:pro 252 NGNGS------TEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFP---PDGVVVHPQDWESIELDQAP 322 (419) Q Consensus 252 ~G~g~------~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~l~~~kd~ 322 (419) +|+|+ ++|.|+++..+....... ..........+.+.+++..++..|++ +.+|+||+.++..|++++++ T Consensus 145 nGdg~s~~p~~~~~~G~l~~a~~~~~~~~--~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~ 222 (315) T protein:vir:41 145 HGDTSSSDPLLRMSDGWLKLASEKLTESD--VDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKG 222 (315) T ss_pred ccCCcCcCccccccccceecccccccccc--cccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhcc Confidence 99985 356899886543322211 11222233456777888899998874 55799999999999999999 Q ss_pred CCceeccCCccccCCCcccccceeEecCCCC-----cCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEE Q lcl|Aclame:pro 323 GSGVFRVIANVQGEATPRIWGLNVVSTVAIA-----QGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEF 397 (419) Q Consensus 323 ~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~-----~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~ 397 (419) .|+++ +++....+.+.+|+|+||+..+.|| ++.++++||+++ .+..+.+++++..+. ..++.+.|.+.. T Consensus 223 ~g~~l-w~~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~nl-~~~~~~~i~i~~~~~----a~~~~~~~~~~~ 296 (315) T protein:vir:41 223 RETGL-GDQALTGANSILYDGRPVQYVPALEALNDGKSRALFVVPTQL-VYGFWRNIKVVPDYD----AEMRLTKYVASL 296 (315) T ss_pred CCCcc-ccchhhcCCCceecccceEecccccccCCCCccEEEecccce-EEEeccccEEEeeec----CCCCceEEEEEE Confidence 99875 5677778889999999999999885 567999999874 556677788876654 346778899999 Q ss_pred EeccEEecccc--eEEEEe Q lcl|Aclame:pro 398 RANLAVYQPKA--FVRVTF 414 (419) Q Consensus 398 r~d~~~~~~~a--~~~~~~ 414 (419) |+|+.+.++++ ++++++ T Consensus 297 r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 297 RTDNHYEDEEGAVSATITV 315 (315) T ss_pred EeceeEEeccceeEeeeeC Confidence 99998877765 555666 No 102 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=1.1e-33 Score=201.37 Aligned_cols=298 Identities=11% Similarity=0.013 Sum_probs=204.9 Q ss_pred hhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeecccccee Q lcl|Aclame:pro 101 DKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGA 180 (419) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (419) ..+..+...+.. ........ .....++..+|+.+...|++.+.+.+.++++++++++.+....+++... T Consensus 1 ~~~k~~~~~l~~-----~~~~~~~~-~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~----- 69 (321) T protein:vir:31 1 MASRTINNDLSR-----ITEKNALT-VDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNI----- 69 (321) T ss_pred CchHHHHHHHHH-----HHHhcccc-ccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeecc----- Confidence 000001111111 11111111 1123344566666677777788888999999999999888887776432 Q ss_pred ccccccceeec-Cc-ccccccccceeeEEeeeEEEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 181 GSTWNKAAVVP-EG-TAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLNGNG 255 (419) Q Consensus 181 ~~~~~~a~~v~-Eg-~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~G~g 255 (419) ++...|++ |+ ...+.++++|+++++.++++.+.+.||+++|+|+ ++|+++|.+.++++++..++.++++|+| T Consensus 70 ---~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~ 146 (321) T protein:vir:31 70 ---GERHRRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDE 146 (321) T ss_pred ---CCcccccccccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccc Confidence 22344555 33 3455678999999999999999999999999986 4899999999999999999999999999 Q ss_pred cccc------cceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccC--CcEEEEehHHHHHHHH-HhccCCce Q lcl|Aclame:pro 256 STEM------QGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFP--PDGVVVHPQDWESIEL-DQAPGSGV 326 (419) Q Consensus 256 ~~~p------~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~-~kd~~g~~ 326 (419) +++| .|+++...... .....++....++.+.+++..++..|++ +.+|+||++++..++. +++. +.+ T Consensus 147 ~~~~~~~~~n~G~l~~a~~~~----~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~-~~~ 221 (321) T protein:vir:31 147 DAEDSFENQNDGFITVAEGDV----ETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDR-DTP 221 (321) T ss_pred cCCCcccccchhhhhhhcccc----ccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcC-CCc Confidence 8766 46654322111 1111222334467788888889988874 3479999999988776 4554 444 Q ss_pred eccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccch-hhcCcEEEEEEEEeccEEec Q lcl|Aclame:pro 327 FRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADF-FTANTLVILAEFRANLAVYQ 405 (419) Q Consensus 327 ~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~~~r~~~r~d~~~~~ 405 (419) + +.....++.+.+|+|+||+.+++||++.+++++|++.++++ +.++++......... ...+.+.+....++|+.+.+ T Consensus 222 ~-~~~~l~~~~~~tl~G~pvv~~~~mP~~~il~t~~~nl~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~ 299 (321) T protein:vir:31 222 L-GDNVIMGEADVNPFSFPIIGSGLWPDDKAMFTDPQNLIYAL-YRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIEN 299 (321) T ss_pred c-ccchhhccccccccceeEEEcCCCCCCcEEEeccccEEEEE-eeccEEEEeecCccccccceeeEeeeeeecceeEec Confidence 4 44556667778999999999999999999999999866554 557777776654311 22344445556679999999 Q ss_pred ccceEEEEe-cC-------CCC Q lcl|Aclame:pro 406 PKAFVRVTF-AA-------ATT 419 (419) Q Consensus 406 ~~a~~~~~~-~a-------a~~ 419 (419) ++|++.++- .- +|| T Consensus 300 ~~a~a~~~~i~~~~~~~~~~~~ 321 (321) T protein:vir:31 300 TEAVVLAEGLGDPLEHLEEETS 321 (321) T ss_pred cccEEEEecCCcchhcccCCCC Confidence 999999983 32 222 No 103 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=100.00 E-value=3.6e-34 Score=203.93 Aligned_cols=360 Identities=10% Similarity=0.017 Sum_probs=188.3 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) +-+-+.+...+.......+..... +..+...++ .+.+....++.++..+++...+............. .+ T Consensus 111 a~~~a~v~~vks~~~~~e~~~~~~-e~~e~~~e~-----~e~~~~~~el~akl~el~k~~ee~k~~~~~~~~~~---~~- 180 (480) T protein:vir:40 111 SNKGAKVTKVREENKGEQEQMGAN-ETQEIMKQA-----IEAGVKVRELEAKVEELNKEREELKKEREASIPSE---KP- 180 (480) T ss_pred cchhhhhhhhhhhhhhhhhhhhhH-HHHHHHHhh-----hhhhhhhhhHHHHHHHHHhHHHHHhhhhhhhcccc---ch- Confidence 333333333332221111111000 000000000 00111111222222222221111111110000000 00 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcce Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ 160 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~ 160 (419) .+. .....+..............++.... .... + ....++.++|+. ...+........++...+.. T Consensus 181 ~~~------~~~e~r~~~~~~~~~~e~~~~~~~~~-----~~~~-~-~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 246 (480) T protein:vir:40 181 EDA------ERKFMRELGSKMAEMPEQGFLREFAN-----GADL-N-VVNSLGSITSKY-ARKSGIYDGAMKARFQGLTL 246 (480) T ss_pred hhh------hhHHHHHHHHHhccchhhhhhhhhhh-----hccc-c-ccccccccccch-hhheeechhhhhhhhhccee Confidence 000 00011111111100000001111110 0011 1 112233333333 33222222222222221111 Q ss_pred ecccCcceeeeeeccccceeccccccceeecCcccccccc--cceeeEEee---eEEEEEeehhhHHHHhhHHHHHHHHH Q lcl|Aclame:pro 161 QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST--LSFDTITTT---LKTVAHWLPITRQAADDNSQLMGYIQ 235 (419) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~--~~~~~v~~~---~~k~~~~~~vs~ell~d~~~~~~~i~ 235 (419) .. .+.....|++|+...+... .++....+. +++++....+|.++++|+++|++||. T Consensus 247 ~~-------------------~g~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~k~t~~lLDDa~~l~~~i~ 307 (480) T protein:vir:40 247 AE-------------------DGVDDTFISGTFKAGTDKNKSQTATKRSLRPQMAEAYLQMDKATVRGVNDSGALSEYVM 307 (480) T ss_pred ee-------------------ccccceeeeeeeecccccccccccccchhhHHHHHHHHHhHHHHHHHhhhhHHHHHHHH Confidence 11 1122345666655443322 233444444 46777778899999999889999999 Q ss_pred HHHHHHHHHHHHHHHHhcc--CcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCc-EEEEehHH Q lcl|Aclame:pro 236 GRLTYGLRFLRDRQLLNGN--GSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD-GVVVHPQD 312 (419) Q Consensus 236 ~~l~~a~~~~~d~~il~G~--g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 312 (419) ++|++.++++++.+||+|+ |++.+.|+.+... ......++.+.+ ..+++.+...|+.++ .|+||+.+ T Consensus 308 ~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~~~-------~~~~~~~~~d~i---d~L~~al~~~y~~~a~~~vmn~~t 377 (480) T protein:vir:40 308 SEMVNRVIQKVEYNMILGSVDGSNGFYGLKTATD-------GWTKQIEYTDLF---EGITDAVAECSISDAITIVMSPQT 377 (480) T ss_pred HHHHHHHHHHHHHHhhccCCCCccccccceeecc-------cccccchhHHHH---HHHHHhhhHHhhCCCCEEEECHHH Confidence 9999999999999999995 4556777654211 111122233333 345566778887777 69999999 Q ss_pred HHHHHHHhccCCceeccCCccccCCCcccccceeEec-CCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 313 WESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVST-VAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 313 ~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~-~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) |+.|+++||++|+| +|++.++.+.+.+|+|+||+++ ..+|.+...++.++.++.++++. ++ .. ....+..++. T Consensus 378 ~~~I~klKD~~G~Y-i~q~~~~~~~~~~llG~pvv~~~~~~~~~~~~~~~~~~~~~~~d~~-~~--~~--~~~~~~~~~~ 451 (480) T protein:vir:40 378 FAELRKAKGTDGHS-RFNELATKEQIAQSFGAVNLETRVWMPKDEVAVYNHDEYVLIGDLN-VE--NY--NDFDLRYNVE 451 (480) T ss_pred HHHHHHhhcCCCCe-eccCcccccCcceecccceeeeeccccCCcceeeeCCccEEEEecc-cc--ee--cccccccchh Confidence 99999999999997 5577888899999999998765 56788888889988888888874 22 22 1223568889 Q ss_pred EEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .|+.+.|+++.+.+|++++.+++...-- T Consensus 452 ~~~~e~~v~g~~~~~~~~~~~~~~~~~~ 479 (480) T protein:vir:40 452 QWLSETLVGGSIRGKNRSAYLKKKGSLG 479 (480) T ss_pred hhhhhhhhceeeEccccEEEEEeccCcC Confidence 9999999999999999999999988766 No 104 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=5e-32 Score=192.19 Aligned_cols=264 Identities=17% Similarity=0.173 Sum_probs=206.0 Q ss_pred ccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceec----ccCcceeeeeeccccceeccccccceeecCccccccc Q lcl|Aclame:pro 124 PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQN----ADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS 199 (419) Q Consensus 124 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~ 199 (419) -+..++..+..++|+.+...|.+.......+.+++.+.. ..|..+++|+.+ ..+.+.|++||+.+|.+ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~--------~~~~a~~v~eg~~i~~~ 72 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWD--------YIGDAEDVAEGEAIPMT 72 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEec--------CCCCcccccCCCccccc Confidence 222335666789999999999998888888877766532 345568888754 23578899999999999 Q ss_pred ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccc Q lcl|Aclame:pro 200 TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA 278 (419) Q Consensus 200 ~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 278 (419) +++++.+.+.+++++..+.+|+++..++ +++.+++.+++++++++.+|..++..-... .. T Consensus 73 ~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a-------------------~~ 133 (272) T protein:vir:30 73 QLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKS-------------------TQ 133 (272) T ss_pred ccccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------------cc Confidence 9999999999999999999999998775 699999999999999999999998632110 00 Q ss_pred cchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccC--CceeccCCccccCCCcccccceeEecCCCCcCc Q lcl|Aclame:pro 279 PATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPG--SGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT 356 (419) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~--g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 356 (419) ..+....++.+.+++..+...+..+..|+|||.++..|++..... +......+...++..++++|+||++++.||.++ T Consensus 134 ~~~~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t 213 (272) T protein:vir:30 134 TVEATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKGT 213 (272) T ss_pred ccccccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcce Confidence 011223477888888888888888889999999999998764221 111122233445667799999999999999999 Q ss_pred EEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 357 ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 357 ~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +++++.. ++.++.+.+++++.+++ ..++...++...|+++++.+|++++++++++|-- T Consensus 214 ~~~~~~~-a~~~~~~~~~~ve~~r~----~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~ 271 (272) T protein:vir:30 214 AYMVRKG-ALRIMLKRNTMVETDRD----ITKAINQIVANKHYGVYLYKAEKAVKITLKDAAK 271 (272) T ss_pred EEEEcCC-eEEEEecCCceeeeccc----cccceeEEEEEEEEEEEEEcCCceEEEEeccccc Confidence 9999877 45556678888876654 3456788999999999999999999999998777 No 105 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=5e-32 Score=192.19 Aligned_cols=264 Identities=17% Similarity=0.173 Sum_probs=206.0 Q ss_pred ccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceec----ccCcceeeeeeccccceeccccccceeecCccccccc Q lcl|Aclame:pro 124 PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQN----ADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS 199 (419) Q Consensus 124 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~ 199 (419) -+..++..+..++|+.+...|.+.......+.+++.+.. ..|..+++|+.+ ..+.+.|++||+.+|.+ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~--------~~~~a~~v~eg~~i~~~ 72 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWD--------YIGDAEDVAEGEAIPMT 72 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEec--------CCCCcccccCCCccccc Confidence 222335666789999999999998888888877766532 345568888754 23578899999999999 Q ss_pred ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccc Q lcl|Aclame:pro 200 TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA 278 (419) Q Consensus 200 ~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 278 (419) +++++.+.+.+++++..+.+|+++..++ +++.+++.+++++++++.+|..++..-... .. T Consensus 73 ~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a-------------------~~ 133 (272) T protein:vir:98 73 QLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKS-------------------TQ 133 (272) T ss_pred ccccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------------cc Confidence 9999999999999999999999998775 699999999999999999999998632110 00 Q ss_pred cchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccC--CceeccCCccccCCCcccccceeEecCCCCcCc Q lcl|Aclame:pro 279 PATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPG--SGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT 356 (419) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~--g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 356 (419) ..+....++.+.+++..+...+..+..|+|||.++..|++..... +......+...++..++++|+||++++.||.++ T Consensus 134 ~~~~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t 213 (272) T protein:vir:98 134 TVEATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKGT 213 (272) T ss_pred ccccccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcce Confidence 011223477888888888888888889999999999998764221 111122233445667799999999999999999 Q ss_pred EEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 357 ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 357 ~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +++++.. ++.++.+.+++++.+++ ..++...++...|+++++.+|++++++++++|-- T Consensus 214 ~~~~~~~-a~~~~~~~~~~ve~~r~----~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~ 271 (272) T protein:vir:98 214 AYMVRKG-ALRIMLKRNTMVETDRD----ITKAINQIVANKHYGVYLYKAEKAVKITLKDAAK 271 (272) T ss_pred EEEEcCC-eEEEEecCCceeeeccc----cccceeEEEEEEEEEEEEEcCCceEEEEeccccc Confidence 9999877 45556678888876654 3456788999999999999999999999998777 No 106 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.89 E-value=2.6e-24 Score=149.86 Aligned_cols=265 Identities=15% Similarity=0.130 Sum_probs=197.6 Q ss_pred ccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc----cCcceeeeeeccccceeccccccceeecCccccccc Q lcl|Aclame:pro 124 PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA----DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS 199 (419) Q Consensus 124 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~ 199 (419) -+...+.-+..++|+.+...+.+.......+.+++..... .|..+++|+.+ ..+.+.++.||+.++.. T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~--------~~g~~~~~~eg~~i~~~ 72 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFV--------YSGDAQVVAEGEKIPTD 72 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeec--------cCCCcccccCCCccccc Confidence 2234456677899999999999888888777777765432 23456666643 23467889999999999 Q ss_pred ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccc Q lcl|Aclame:pro 200 TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA 278 (419) Q Consensus 200 ~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 278 (419) +.+++...+..++.+..+.++++...++ .++.+.+.++++.++++.+|..++..-.++... T Consensus 73 ~it~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~------------------ 134 (274) T protein:vir:93 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT------------------ 134 (274) T ss_pred ccccceeEEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc------------------ Confidence 9999999999999998999999987665 589999999999999999999998642211100 Q ss_pred cchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccC--CceeccCCccccCCCcccccceeEecCCCCcCc Q lcl|Aclame:pro 279 PATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPG--SGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT 356 (419) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~--g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 356 (419) ..+....++.+.+++..+...+.....++|||.++..|++..... .......+....+..++++|+||++++.+|.++ T Consensus 135 ~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t 214 (274) T protein:vir:93 135 VNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT 214 (274) T ss_pred ccccccCHHHHHHHHHHhhhccCCccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEEEcCCCCcce Confidence 011223477888888888887778888999999999998642100 000001122334566789999999999999999 Q ss_pred EEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 357 ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 357 ~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .++++... +.++.+.++.++..+.. .+....+++..++++++.+|+++++++.++++. T Consensus 215 ~~l~~~ga-i~~~~~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~ 272 (274) T protein:vir:93 215 AILAKKGA-VKLILKRDFFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) T ss_pred EEEEeCCe-EEEEecCCcccccccch----hhcccEEEEEEEEEEEEEcCCceEEEeeCcccc Confidence 99999775 44455677777765532 234568999999999999999999999999999 No 107 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.86 E-value=2.2e-23 Score=144.82 Aligned_cols=263 Identities=20% Similarity=0.168 Sum_probs=191.7 Q ss_pred ccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc----cCcceeeeeeccccceeccccccceeecCccccccc Q lcl|Aclame:pro 124 PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA----DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS 199 (419) Q Consensus 124 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~ 199 (419) -+...+.-...++|+.+...+.+.......+.+++...+. .|..+++|+.+ ..+.+.+++||.+++.. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~--------~~gda~~~~eg~~i~~~ 72 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFT--------YIGDAADVAEGGEISLD 72 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeec--------cCccccccCCCCccChh Confidence 2222455567788999999999888888888888766543 24467777643 23467789999999999 Q ss_pred ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccc Q lcl|Aclame:pro 200 TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA 278 (419) Q Consensus 200 ~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 278 (419) +.+.+...+..++.+..+.++++...++ .++.+.+.++++.++++.+|..++..-.. ... T Consensus 73 ~lt~~~~~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~-------------------~~~ 133 (272) T protein:vir:36 73 KIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKT-------------------TSQ 133 (272) T ss_pred hcCCcceeEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------------ccc Confidence 9999999999999998999999986665 68999999999999999999998853210 000 Q ss_pred cchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCce-eccCCccccCCCcccccceeEecCCCCcCcE Q lcl|Aclame:pro 279 PATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGV-FRVIANVQGEATPRIWGLNVVSTVAIAQGTA 357 (419) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~-~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~ 357 (419) ..+....++.+.++...+...+.....++|||.++..|++.......+ ....+....+.-++++|++|++|+.||.++. T Consensus 134 ~~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~ 213 (272) T protein:vir:36 134 TVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA 213 (272) T ss_pred cccccccHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCCCCCCce Confidence 112234577888898888888888888999999999998653321111 1111122234557899999999999999876 Q ss_pred EEEe--c-cceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 358 LVGG--F-RQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 358 ~~~d--~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) +... + ..++-++...+++++..+.. .+....+++..+++.++.+|+++++++++-+ T Consensus 214 ~~~~~~~~~gA~~~~~~~~~~vE~~R~~----~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 214 LMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred eEEEEEecccceeeeecCCcccccccch----hhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 4322 2 22333455667777755432 2334578999999999999999999999999 No 108 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.83 E-value=4e-22 Score=137.88 Aligned_cols=266 Identities=15% Similarity=0.138 Sum_probs=195.5 Q ss_pred cccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc----cCcceeeeeeccccceeccccccceeecCcccccc Q lcl|Aclame:pro 123 APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA----DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ 198 (419) Q Consensus 123 ~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~ 198 (419) ....+.+.-...++|+.+...+.+.......+.+++.+.+. .|..+++|+.. ..+.+.++.||+.++. T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~--------~ig~a~~~~~g~~i~~ 72 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFV--------YSGDAKVVPEGEEIPI 72 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeec--------cCCccccccCCCCcch Confidence 22223345566788999999999999988888888776543 24456676532 2346778999999999 Q ss_pred cccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccc Q lcl|Aclame:pro 199 STLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPT 277 (419) Q Consensus 199 ~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~ 277 (419) .+.+.+......++.+..+.++++....+ .++...+.++++.++++.+|..++.--+++. . T Consensus 73 ~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~------------~------ 134 (275) T protein:vir:96 73 DLIETKKRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGAT------------L------ 134 (275) T ss_pred hhcccceeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc------------c------ Confidence 99999999999999999999999987665 5888889999999999999999885321110 0 Q ss_pred ccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCC--ceeccCCccccCCCcccccceeEecCCCCcC Q lcl|Aclame:pro 278 APATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGS--GVFRVIANVQGEATPRIWGLNVVSTVAIAQG 355 (419) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g--~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~ 355 (419) ........++.+.++...+.........++|||..+..|++.....- ....-.+....+.-++++|++|++++.+|.+ T Consensus 135 ~~~~~~~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~ 214 (275) T protein:vir:96 135 KVEADITKLAGLQTAIDKFNDEDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNKIKEG 214 (275) T ss_pred cccccccCHHHHHHHHHHhccccCCccEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCCCCcc Confidence 00112234788888888887777778889999999999987632110 0000112233456678999999999999999 Q ss_pred cEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 356 TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 356 ~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +.+++.... +.++.+.++.++..+.. .+....+++..+++.++.+|+++++++.+++.= T Consensus 215 t~~i~~~gA-~~~~~~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~ 273 (275) T protein:vir:96 215 EAILAKRGA-VKLITKRDFFLETERHA----SHKSTALFSDKHYVAYLYDESKVVKITKSASGL 273 (275) T ss_pred eEEEEeccc-eeeeecCCcccccccch----hhcCcEEEEeEEEEEEEEcCccEEEEEeccccc Confidence 998887663 44455667777765533 234557889999999999999999998877766 No 109 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.83 E-value=8.1e-22 Score=136.21 Aligned_cols=265 Identities=15% Similarity=0.124 Sum_probs=194.0 Q ss_pred ccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc----cCcceeeeeeccccceeccccccceeecCccccccc Q lcl|Aclame:pro 124 PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA----DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS 199 (419) Q Consensus 124 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~ 199 (419) -+...+.-...++|+.+...+.+.......+.+++...+. .+..+++|+.. ..+.+..+.||+.++.. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~--------~~g~a~~~~~g~~i~~~ 72 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFV--------YSGDAQVVAEGEKIPTD 72 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeec--------CCCccccccCCCccccc Confidence 2223455567899999999999888777777777766432 34566676532 23466788999999999 Q ss_pred ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccc Q lcl|Aclame:pro 200 TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA 278 (419) Q Consensus 200 ~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 278 (419) +.+.+...+..++.+..+.++++....+ .++.+.+.++++.++++.+|..++.--.++. . . . T Consensus 73 ~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~---------~-~-----~-- 135 (274) T protein:vir:97 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK---------L-T-----V-- 135 (274) T ss_pred ccccceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC---------c-c-----c-- Confidence 9999999999999998899999976654 5888999999999999999999885311110 0 0 0 Q ss_pred cchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHh-c-cCCceeccCCccccCCCcccccceeEecCCCCcCc Q lcl|Aclame:pro 279 PATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQ-A-PGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT 356 (419) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~k-d-~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 356 (419) ......++.+.++...+.........++|||..+..|++.. + -....-...+....+.-++++|++|++++.+|.++ T Consensus 136 -~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t 214 (274) T protein:vir:97 136 -NADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT 214 (274) T ss_pred -cccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCCCcce Confidence 11123477888888888887777778899999999998631 1 00000001112334556789999999999999999 Q ss_pred EEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 357 ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 357 ~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .++++... +.++.+.++.++..+.. .+....+++..++++++.+|.++++++.+.+++ T Consensus 215 ~~l~~~gA-~~~~~~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~ 272 (274) T protein:vir:97 215 AILAKKGA-VKLILKRDFFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) T ss_pred EEEEeCcc-eEeeecCCceeccccch----hhcccEEEEEEEEEEEEEcCCceEEEecCcccc Confidence 99998774 44455677777765542 233457888899999999999999999999999 No 110 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.83 E-value=8.1e-22 Score=136.21 Aligned_cols=265 Identities=15% Similarity=0.124 Sum_probs=194.0 Q ss_pred ccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc----cCcceeeeeeccccceeccccccceeecCccccccc Q lcl|Aclame:pro 124 PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA----DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS 199 (419) Q Consensus 124 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~ 199 (419) -+...+.-...++|+.+...+.+.......+.+++...+. .+..+++|+.. ..+.+..+.||+.++.. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~--------~~g~a~~~~~g~~i~~~ 72 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFV--------YSGDAQVVAEGEKIPTD 72 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeec--------CCCccccccCCCccccc Confidence 2223455567899999999999888777777777766432 34566676532 23466788999999999 Q ss_pred ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccc Q lcl|Aclame:pro 200 TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA 278 (419) Q Consensus 200 ~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 278 (419) +.+.+...+..++.+..+.++++....+ .++.+.+.++++.++++.+|..++.--.++. . . . T Consensus 73 ~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~---------~-~-----~-- 135 (274) T protein:vir:94 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK---------L-T-----V-- 135 (274) T ss_pred ccccceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC---------c-c-----c-- Confidence 9999999999999998899999976654 5888999999999999999999885311110 0 0 0 Q ss_pred cchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHh-c-cCCceeccCCccccCCCcccccceeEecCCCCcCc Q lcl|Aclame:pro 279 PATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQ-A-PGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT 356 (419) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~k-d-~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 356 (419) ......++.+.++...+.........++|||..+..|++.. + -....-...+....+.-++++|++|++++.+|.++ T Consensus 136 -~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t 214 (274) T protein:vir:94 136 -NADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGT 214 (274) T ss_pred -cccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCCCcce Confidence 11123477888888888887777778899999999998631 1 00000001112334556789999999999999999 Q ss_pred EEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 357 ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 357 ~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .++++... +.++.+.++.++..+.. .+....+++..++++++.+|.++++++.+.+++ T Consensus 215 ~~l~~~gA-~~~~~~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~ 272 (274) T protein:vir:94 215 AILAKKGA-VKLILKRDFFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) T ss_pred EEEEeCcc-eEeeecCCceeccccch----hhcccEEEEEEEEEEEEEcCCceEEEecCcccc Confidence 99998774 44455677777765542 233457888899999999999999999999999 No 111 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.83 E-value=7.1e-22 Score=136.52 Aligned_cols=265 Identities=14% Similarity=0.122 Sum_probs=193.9 Q ss_pred ccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc----cCcceeeeeeccccceeccccccceeecCccccccc Q lcl|Aclame:pro 124 PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA----DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS 199 (419) Q Consensus 124 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~ 199 (419) -+...+.-...++|+.+...+.+.......+.+++...+. .|..+++|+.. ..+.+..+.||+.++.. T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~--------~~g~~~~~~~g~~i~~~ 72 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFT--------YSGDAQVIAEGEKIPVD 72 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeec--------cCCCccccCCCCcCchh Confidence 2222345567889999999999988888777777665432 24466776643 23466678999999999 Q ss_pred ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccc Q lcl|Aclame:pro 200 TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA 278 (419) Q Consensus 200 ~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 278 (419) +.+++...+..++.+..+.++++....+ .++.+.+.++++.++++.+|..++.--..+. .. T Consensus 73 ~it~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~---------------~~--- 134 (274) T protein:vir:96 73 QIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT---------------LT--- 134 (274) T ss_pred hcccceeEEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---------------CC--- Confidence 9999999999999988899999986655 5889999999999999999999886321100 00 Q ss_pred cchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccC--CceeccCCccccCCCcccccceeEecCCCCcCc Q lcl|Aclame:pro 279 PATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPG--SGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT 356 (419) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~--g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 356 (419) .......++.+.++...+.........++|||..+..|++..... ...-...+....+.-++++|++|++++.+|.++ T Consensus 135 ~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~~t 214 (274) T protein:vir:96 135 VEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGE 214 (274) T ss_pred cCcccccHHHHHHHHHHhcccCCCceEEEeCHHHHHHHHhcccccccccccccccceeecccceecCeeEEEcCCCCcce Confidence 011223478888888888877777888999999999998863211 000001122234556789999999999999999 Q ss_pred EEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 357 ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 357 ~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .++++... +-++.+.++.++..+.. .+....+++..+++.++++|+++++++.+++-- T Consensus 215 ~~l~~~gA-~~~~~~~~~~vE~~Rd~----~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~ 272 (274) T protein:vir:96 215 ALLAKKGA-VKLITKRDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDE 272 (274) T ss_pred EEEEeCcc-eeeeecCCcccccccch----hhcccEEEEeeEEEEEEEcCccEEEEEcCcccc Confidence 99888664 44455667777654432 234567899999999999999999999998888 No 112 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.83 E-value=6.7e-22 Score=136.66 Aligned_cols=269 Identities=13% Similarity=0.057 Sum_probs=192.2 Q ss_pred ccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc----cCcceeeeeeccccceeccccccceeecCccccccc Q lcl|Aclame:pro 124 PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA----DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS 199 (419) Q Consensus 124 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~ 199 (419) -+..++.-+..++|+.+...+.+.......+.+++..... .+..+++|+.. ..+.+.++.||+.++.. T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~--------~~g~a~~~~~g~~i~~~ 72 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYK--------YIGDAQDVAEGAAIDYS 72 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeec--------cCCcceeecCCCcCccc Confidence 1112355577899999999999988888777777654432 24456666543 23467789999999999 Q ss_pred ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhcc-Ccccccceecccccccccccccc Q lcl|Aclame:pro 200 TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGN-GSTEMQGILTTPGIGTYQQPKPT 277 (419) Q Consensus 200 ~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~-g~~~p~Gi~~~~~~~~~~~~~~~ 277 (419) +.+++...+..++.+..+.++++....+ .++.+.+.++++.++++.+|..++..- |... ...... T Consensus 73 ~lt~~~~~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~-------------~~~~~~ 139 (278) T protein:vir:80 73 ALETESVKHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTL-------------EVKGAI 139 (278) T ss_pred ccccceeeEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------------cccccc Confidence 9999999999999988899999987665 589999999999999999999888642 2110 001111 Q ss_pred ccchhhhHHHHHHHHHHhhhhhccCCc-EEEEehHHHHHHHHHhccCC--ceeccCCccccCCCcccccceeEecCCCCc Q lcl|Aclame:pro 278 APATDEPPLVDIRRAKTVAEIAGFPPD-GVVVHPQDWESIELDQAPGS--GVFRVIANVQGEATPRIWGLNVVSTVAIAQ 354 (419) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~kd~~g--~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~ 354 (419) ...+....++.+.++...+...+.... .++|||..+..|++...... ....-.+....+.-++++|++|++++.+|. T Consensus 140 t~~~~~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~ 219 (278) T protein:vir:80 140 NIGLIDKIENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKKLAD 219 (278) T ss_pred ccchhhhHHHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCCCCc Confidence 112233446667777777766655443 47799999999987532211 111112223345677899999999999999 Q ss_pred CcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|Aclame:pro 355 GTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 355 ~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~ 418 (419) ++.++.... ++-++...++.++..+.. .+....+++..+++.++++|+++++++..+.. T Consensus 220 ~t~~l~~~g-Ai~~~~~~~~~vE~~Rd~----~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 220 GNALAVKAG-ALKTFLKRNLLAESGRDM----DHKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred ceEEEEecc-ceeeeecCCcccccccch----hhccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 999998866 344456677777755432 23455788999999999999999999999999 No 113 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.83 E-value=8.4e-22 Score=136.12 Aligned_cols=265 Identities=16% Similarity=0.157 Sum_probs=196.2 Q ss_pred ccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc----cCcceeeeeeccccceeccccccceeecCccccccc Q lcl|Aclame:pro 124 PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA----DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS 199 (419) Q Consensus 124 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~ 199 (419) -+...+.-...++|+.+...+.+.......+.+++...+. .|..+++|+.. ..+.+.+++||.+++.. T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~--------~igda~~~~eg~~i~~~ 72 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFV--------YSGDATVVPEGQKIPVD 72 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeec--------CCCccccccCCCccCcc Confidence 1112345567789999999999999888888888776542 45566666532 23467789999999999 Q ss_pred ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccc Q lcl|Aclame:pro 200 TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA 278 (419) Q Consensus 200 ~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 278 (419) +.+.+......++.+..+.++++....+ .+....+.++++.++++.+|..++.=-. + . .. . T Consensus 73 ~lt~~~~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~-----~----~---~~---~--- 134 (276) T protein:vir:10 73 KIETNRREAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALR-----G----T---KL---T--- 134 (276) T ss_pred ccccceeeEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHh-----c----c---cc---c--- Confidence 9999999999999999999999987765 5889999999999999999998885100 0 0 00 0 Q ss_pred cchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCcee--ccCCccccCCCcccccceeEecCCCCcCc Q lcl|Aclame:pro 279 PATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVF--RVIANVQGEATPRIWGLNVVSTVAIAQGT 356 (419) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~--~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 356 (419) .......++.+.++...+........+++|||.++..|++......-.. .-.+....+.-++++|++|++++.+|.++ T Consensus 135 ~~~~~~t~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t 214 (276) T protein:vir:10 135 VSADIGTLAGLEAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGAVIVRSKKLDEGE 214 (276) T ss_pred ccccccCHHHHHHHHHHhccccCcccEEEEcHHHHHHHHHhccccccccccccccceeccccceecceeEEEcCCCCcce Confidence 0112234777888888888777778889999999999987643221100 00112234556789999999999999999 Q ss_pred EEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 357 ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 357 ~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .+++... ++-++...++.++..+.. .+....+++..++..++.+|..+++++.++-++ T Consensus 215 ~~l~~~g-Ai~~~~~~~~~vE~dRd~----~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~ 272 (276) T protein:vir:10 215 AILAKRG-AVKLITKRDFFLETDRDP----STKTTALYSDKHYVAYLYDESKAVKVTKGAGTT 272 (276) T ss_pred EEEEecc-ceeeeecCCceeecccch----hhcccEEEEeeEEEEEEEcCcceEEEecCCcCC Confidence 9888866 444556777777765543 234557888999999999999999999988777 No 114 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.79 E-value=1.5e-20 Score=129.32 Aligned_cols=261 Identities=15% Similarity=0.146 Sum_probs=192.5 Q ss_pred ccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc----cCcceeeeeeccccceeccccccceeecCccccccc Q lcl|Aclame:pro 124 PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA----DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS 199 (419) Q Consensus 124 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~ 199 (419) -+...+.-...++|+.+...+.+.......+.+++..... .|..+++|+-. ..+.+..+.||..++.. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~--------~ig~a~~~~~g~~i~~~ 72 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFV--------YSGDAQVVAEGEKIPTD 72 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeec--------CCCccccccCCCccchh Confidence 2222345567889999999998888777777777665432 34566666532 23467788999999999 Q ss_pred ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccc Q lcl|Aclame:pro 200 TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA 278 (419) Q Consensus 200 ~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 278 (419) +.+.+......++.+..+.++++....+ .++.+.+.++++.++++.+|..++.--.++. . . T Consensus 73 ~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~------------~------~ 134 (274) T protein:vir:12 73 ILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK------------L------T 134 (274) T ss_pred hcccceeeEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhccc------------c------c Confidence 9999999999999998999999865544 5788889999999999999999886322110 0 0 Q ss_pred cchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHh------ccCCceeccCCccccCCCcccccceeEecCCC Q lcl|Aclame:pro 279 PATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQ------APGSGVFRVIANVQGEATPRIWGLNVVSTVAI 352 (419) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~k------d~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~ 352 (419) .......++.+.++...+..........+|||..+..|++.. ++.++ .+....+.-++++|++|++++.+ T Consensus 135 ~~~~a~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g----~~~~~~G~ig~~~G~~Vi~s~~~ 210 (274) T protein:vir:12 135 VNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELG----DDIIVKGAFGEALGAIIVRSNKL 210 (274) T ss_pred ccccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhhhhhcccccccc----ccceecccceeecCeeEEEeCCC Confidence 011223577888888888877777778899999999988742 11111 12223455678999999999999 Q ss_pred CcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 353 AQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 353 ~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) |.++.+++.... +.++.+.++.++..+... +....+++..+++.++.+|+.+++++.+.+++ T Consensus 211 p~~t~~l~~~gA-~~~~~~~~~~vE~~Rd~~----~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~ 272 (274) T protein:vir:12 211 EAGTAILAKKGA-VKLILKRDFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) T ss_pred CcceEEEEeccc-eeeeecCCceeccccchh----hcccEEEeeeEEEEEEEcCCceEEEEcCCccc Confidence 999988887664 444556777777665432 34457889999999999999999999999999 No 115 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.78 E-value=9.2e-21 Score=130.42 Aligned_cols=305 Identities=10% Similarity=0.053 Sum_probs=198.8 Q ss_pred HHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeee Q lcl|Aclame:pro 94 LREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRD 173 (419) Q Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 173 (419) +-..-.-..+++.+.....+-.. +..+ .+....+...|......|++...+.+.+++.++..++.++.+.|.|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~p~l-----~m~a-lTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~ 74 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTHQFPEL-----KMPT-VTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNRE 74 (330) T ss_pred CceecCCccccceeehhcccccc-----chhh-hhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceeeee Confidence 00000001111111110000000 0010 01111233446666777888888888899999988888888999887 Q ss_pred ccccceeccccccceeecCcccccccc-cceeeEEeeeEEEEEeehhhHHHHh--hHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 174 TSGTAGAGSTWNKAAVVPEGTAKPQST-LSFDTITTTLKTVAHWLPITRQAAD--DNS-QLMGYIQGRLTYGLRFLRDRQ 249 (419) Q Consensus 174 ~~~~~~~~~~~~~a~~v~Eg~~~~~~~-~~~~~v~~~~~k~~~~~~vs~ell~--d~~-~~~~~i~~~l~~a~~~~~d~~ 249 (419) +. -+.++|...++..+++. .+|.+++...+.+++.+.|+.++.+ +++ +...+......++++.+.+.+ T Consensus 75 ~~--------lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~ 146 (330) T protein:vir:94 75 NV--------LGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQAS 146 (330) T ss_pred ec--------CCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHH Confidence 64 25788999888887765 5899999999999999999999964 344 788899999999999999999 Q ss_pred HHhccCc-ccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceec Q lcl|Aclame:pro 250 LLNGNGS-TEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFR 328 (419) Q Consensus 250 il~G~g~-~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~ 328 (419) +||||++ +++.|+++.........+. +.++....+++-.++..+......+..|+||+++..+|+.+....|++-. T Consensus 147 linGDs~~~~F~GL~~~~~~~q~i~tg---~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v 223 (330) T protein:vir:94 147 MITGDGTGNSFQGMMGLVAASQTISAG---ANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAI 223 (330) T ss_pred hhccCCCCccccchhhcCCcccEEecC---CCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCC Confidence 9999976 4677887643222111111 12233445666666666666566788999999999999999887776543 Q ss_pred cC--CccccCCCcccccceeEecCCCCcC----------cEEEEecc-----ceEEEEEe---cceEEEEeecccchhhc Q lcl|Aclame:pro 329 VI--ANVQGEATPRIWGLNVVSTVAIAQG----------TALVGGFR-----QGATLWSR---QGITVLMTDSHADFFTA 388 (419) Q Consensus 329 ~~--~~~~~~~~~~l~G~pv~~~~~~~~~----------~~~~~d~~-----~~~~~~~~---~~~~i~~~~~~~~~~~~ 388 (419) .+ .+..+....++.|+||+.++.+|.+ .+++.-|. +++.+... .++.+..- +..-.+ T Consensus 224 ~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~---G~~~~k 300 (330) T protein:vir:94 224 GEVMTLPSGRQIPTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNV---GAKENA 300 (330) T ss_pred CCcccccCCCEEeeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeC---CCcccc Confidence 32 2233333356789999999999863 35565554 34444321 23444321 111245 Q ss_pred CcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|Aclame:pro 389 NTLVILAEFRANLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 389 ~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~ 418 (419) +.+.+++.+|++.++.+|+|+++++--..= T Consensus 301 ~v~~~~v~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 301 DETITRVKMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred ceeeEEEEEeeeeEEechhheeeeccccCC Confidence 677899999999999999999999644433 No 116 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.77 E-value=1.1e-19 Score=124.62 Aligned_cols=265 Identities=15% Similarity=0.122 Sum_probs=189.8 Q ss_pred ccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc----cCcceeeeeeccccceeccccccceeecCccccccc Q lcl|Aclame:pro 124 PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA----DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS 199 (419) Q Consensus 124 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~ 199 (419) -+...+.-...++|+.+...+.+.......+.+++..-+. .|..+++|+.. ..+.+..+.||..++.. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~--------~ig~a~~~~~g~~i~~~ 72 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFI--------YSGDAKVVAEGEKIPTD 72 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeec--------CCCccccccCCCccchh Confidence 2222455567889999999999888888777777655432 34566776532 23466788999999999 Q ss_pred ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccc Q lcl|Aclame:pro 200 TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA 278 (419) Q Consensus 200 ~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 278 (419) +.+.+...+..++.+..+.++++....+ .++.+.+.++++.++++.+|..++.--.++. ... T Consensus 73 ~lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~------------~~~----- 135 (274) T protein:vir:96 73 ILETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK------------LTV----- 135 (274) T ss_pred hcccceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc------------ccc----- Confidence 9999999999999988899999876554 5889999999999999999999885321110 000 Q ss_pred cchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHh--ccCCceeccCCccccCCCcccccceeEecCCCCcCc Q lcl|Aclame:pro 279 PATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQ--APGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT 356 (419) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~k--d~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 356 (419) ......++.+.++...+..........+|||..+..|++.. +.....-.-.+....+.-++++|++|++++.+|.++ T Consensus 136 -~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t 214 (274) T protein:vir:96 136 -EADITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAGT 214 (274) T ss_pred -cccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCCCce Confidence 01123477788888888877777778899999999998741 100000000122234556789999999999999999 Q ss_pred EEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 357 ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 357 ~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .+++.... +.++.+.++.++..+.. .+....+++..++++++++|+++++++..+-+- T Consensus 215 ~~l~~~gA-~~~~~~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~ 272 (274) T protein:vir:96 215 AILAKKGA-VKLITKRDFFLETDRDP----STKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) T ss_pred EEEEeccc-eeeeecCCccccccccc----ccccCEEEEeEEEEEEEEcCCcEEEEEcCCccc Confidence 88888664 44455677777765532 345567899999999999999999999544444 No 117 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.77 E-value=1.1e-19 Score=124.62 Aligned_cols=265 Identities=15% Similarity=0.122 Sum_probs=189.8 Q ss_pred ccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc----cCcceeeeeeccccceeccccccceeecCccccccc Q lcl|Aclame:pro 124 PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA----DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS 199 (419) Q Consensus 124 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~ 199 (419) -+...+.-...++|+.+...+.+.......+.+++..-+. .|..+++|+.. ..+.+..+.||..++.. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~--------~ig~a~~~~~g~~i~~~ 72 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFI--------YSGDAKVVAEGEKIPTD 72 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeec--------CCCccccccCCCccchh Confidence 2222455567889999999999888888777777655432 34566776532 23466788999999999 Q ss_pred ccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccc Q lcl|Aclame:pro 200 TLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA 278 (419) Q Consensus 200 ~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 278 (419) +.+.+...+..++.+..+.++++....+ .++.+.+.++++.++++.+|..++.--.++. ... T Consensus 73 ~lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~------------~~~----- 135 (274) T protein:vir:95 73 ILETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK------------LTV----- 135 (274) T ss_pred hcccceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc------------ccc----- Confidence 9999999999999988899999876554 5889999999999999999999885321110 000 Q ss_pred cchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHh--ccCCceeccCCccccCCCcccccceeEecCCCCcCc Q lcl|Aclame:pro 279 PATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQ--APGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT 356 (419) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~k--d~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 356 (419) ......++.+.++...+..........+|||..+..|++.. +.....-.-.+....+.-++++|++|++++.+|.++ T Consensus 136 -~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t 214 (274) T protein:vir:95 136 -EADITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAGT 214 (274) T ss_pred -cccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCCCce Confidence 01123477788888888877777778899999999998741 100000000122234556789999999999999999 Q ss_pred EEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 357 ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 357 ~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .+++.... +.++.+.++.++..+.. .+....+++..++++++++|+++++++..+-+- T Consensus 215 ~~l~~~gA-~~~~~~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~ 272 (274) T protein:vir:95 215 AILAKKGA-VKLITKRDFFLETDRDP----STKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) T ss_pred EEEEeccc-eeeeecCCccccccccc----ccccCEEEEeEEEEEEEEcCCcEEEEEcCCccc Confidence 88888664 44455677777765532 345567899999999999999999999544444 No 118 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.75 E-value=2.4e-19 Score=122.71 Aligned_cols=261 Identities=13% Similarity=0.108 Sum_probs=189.9 Q ss_pred ccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc----cCcceeeeeeccccceeccccccceeecCccccccccc Q lcl|Aclame:pro 126 GTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA----DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTL 201 (419) Q Consensus 126 ~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 201 (419) -+.+.-...++|+.+.+.+.+.......+.+++...+. .|..+++|.-. ..+.+..+.||++++..+. T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~--------~igdae~~~eg~~i~~~~l 72 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYA--------YIGAAEDLQEGVAMDTTQM 72 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeec--------CCCccccccCCCccchhhc Confidence 22234456789999999999998888888888776443 34566666532 3456778899999999999 Q ss_pred ceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccc Q lcl|Aclame:pro 202 SFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPA 280 (419) Q Consensus 202 ~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 280 (419) +++.-....++.+..+.++++....+ .+....+.++++..+++++|..++.- .+|. .... T Consensus 73 t~~~~~a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~-----l~~a--------------~~~~ 133 (270) T protein:vir:95 73 SMTTTKVTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAE-----LNKS--------------KQTA 133 (270) T ss_pred ccchheeeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHH-----hccc--------------cccc Confidence 99999999999999999999977655 47888899999999999999988741 0010 0001 Q ss_pred hhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCC-cCcEEE Q lcl|Aclame:pro 281 TDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIA-QGTALV 359 (419) Q Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~-~~~~~~ 359 (419) +....++.+.+++..+.+......+++|||.++..|++...-.+.. ...+....+.-++++|++|++++.++ .++.++ T Consensus 134 ~~~~t~~~~~dA~~~lgd~~~~~~~i~vhs~~~~~Lrk~~~~~~~~-~~~~~~~~G~ig~~~G~~Viv~s~~~~~~~~~l 212 (270) T protein:vir:95 134 TVSADATGILDAIEVFNSENDEDYVLYVNPKDYNKLVKSLFKVGGN-VQDRAISKGDLVEIVGVSDIVKSKRVSENTAFL 212 (270) T ss_pred ccccCHHHHHHHHHHhccccCCCcEEEEcHHHHHHHHhhhcccccc-cccchhcccccceecceeEEEeCCCCCceeEEE Confidence 1224567888899989888888889999999999998643111110 11222334567789999998876554 566666 Q ss_pred EeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 360 GGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 360 ~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +... ++-++...++.++..+.. .+....++...++..++.+|..+++++++++.| T Consensus 213 ~~~g-Ai~~~~~~~~~vEtdRd~----~~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~ 267 (270) T protein:vir:95 213 QRYG-AMEIVNKKKPEAYTDFDI----LKRTHLLSTNYHYSVNLKDETGVVKVTFKPSGS 267 (270) T ss_pred Eecc-ceeeeecCCceeeeccch----hhcccEEEeeeEEEEEEEccceEEEEEecCCCC Confidence 6644 444556667777765532 234557888899999999999999999999998 No 119 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.73 E-value=5.2e-19 Score=120.81 Aligned_cols=350 Identities=13% Similarity=0.093 Sum_probs=208.1 Q ss_pred HHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccchhhhhhHHHHhHHHHHHHHHhhhhhh Q lcl|Aclame:pro 28 VQEIVAEARGL--ADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQ 105 (419) Q Consensus 28 ~~~~~~e~~~~--~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 105 (419) .++.+.+.++. .+.-.+|...+..+.++-+...+. ..+.. .-. +... + +.+-..+...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~k~lr~~me~~et~~e~---~~~~~-------~~~-----~~e~--e-l~E~f~Kmm~G~ 62 (393) T protein:vir:79 1 MENWLKQLKESGFTETQVQEQKSLRTRMERGETLAEA---DANKL-------ALN-----EEET--Q-ILESFAKMMEGE 62 (393) T ss_pred CchHHHHHHhccCchhHHHHHHHHHHHhhhhhhhhhh---hhhhh-------hcc-----hhHH--H-HHHHHHHHhcCC Confidence 33333333221 222223333333333321111111 11000 000 0000 0 011111111111 Q ss_pred hhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc-cCcceeeeeeccccceecccc Q lcl|Aclame:pro 106 FQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA-DYNVLEYIRDTSGTAGAGSTW 184 (419) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 184 (419) .-. . ........++..+..++|..+.+.|.+....-...-.++..+.. .|.+..+|... . T Consensus 63 ~p~--~--------eV~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g---------~ 123 (393) T protein:vir:79 63 TPT--N--------EVNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSIG---------I 123 (393) T ss_pred Cch--h--------heehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccchh---------e Confidence 100 0 01111223455677889999999998877666666677777776 45566555322 2 Q ss_pred ccceeecCcccccccc---cceeeEEeeeEEEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCcccc- Q lcl|Aclame:pro 185 NKAAVVPEGTAKPQST---LSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEM- 259 (419) Q Consensus 185 ~~a~~v~Eg~~~~~~~---~~~~~v~~~~~k~~~~~~vs~ell~d~~-~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p- 259 (419) -.+.-|+||++.|+.+ .+++.|++..+|++..+.+|+|+++||. ++.++....+.++|++..+..++++.-+.+- T Consensus 124 ~Ra~~IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ght 203 (393) T protein:vir:79 124 MRAYDVAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHT 203 (393) T ss_pred eeeccccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccce Confidence 3567789999999865 4689999999999999999999999997 8999999999999999999999999765432 Q ss_pred --cceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccc--- Q lcl|Aclame:pro 260 --QGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQ--- 334 (419) Q Consensus 260 --~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~--- 334 (419) -++.+.+...... -.-...-.++...+|+.+++.++.++.++++.++|||-.|..+.+-.-=.+-+....++.. T Consensus 204 vfDa~st~t~ahptG-r~~~~~qNGTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~ 282 (393) T protein:vir:79 204 VFDNYSTNKLAHTTG-LDKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKG 282 (393) T ss_pred eeeccccCccceeec-CCccccccccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeeccccccCccc Confidence 2333222111111 0111234566789999999999999999999999999999998764221222111111111 Q ss_pred ---c--CCCccc-----ccceeEecCCCCcC------cEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEE Q lcl|Aclame:pro 335 ---G--EATPRI-----WGLNVVSTVAIAQG------TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFR 398 (419) Q Consensus 335 ---~--~~~~~l-----~G~pv~~~~~~~~~------~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r 398 (419) + ..|..| +.+.|++|+.+|-+ +++..|-+..-++..+++++.+..++. ..|...+....| T Consensus 283 ~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk----~rdiq~iKl~ER 358 (393) T protein:vir:79 283 APSSMALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEK----ARGLQNIKMIER 358 (393) T ss_pred cchhhhhchhhhccccccceeEEEecccccccccceeeEEEeecCCceEEEEecCcceeccccc----cccceeeeeeee Confidence 0 011112 23789999999843 356666665555566777777765543 457778999999 Q ss_pred eccEEecc-cceEEEEe---cCCCC Q lcl|Aclame:pro 399 ANLAVYQP-KAFVRVTF---AAATT 419 (419) Q Consensus 399 ~d~~~~~~-~a~~~~~~---~aa~~ 419 (419) +++.+++. +|+++.+- +-+.- T Consensus 359 YG~gvLn~gkaiavakNI~~~k~y~ 383 (393) T protein:vir:79 359 YGIGILNEGKAIAVAKNISMDKSYA 383 (393) T ss_pred eceeeeeCCceEEEEecceeecccc Confidence 99999886 67776642 22222 No 120 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.66 E-value=4.8e-18 Score=115.52 Aligned_cols=226 Identities=20% Similarity=0.194 Sum_probs=168.7 Q ss_pred ceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHH Q lcl|Aclame:pro 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGR 237 (419) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~ 237 (419) ..-.-.|..+++|. + .+.+.-++||.+++....+++..+...++.+..+.|+++....+ .+......++ T Consensus 1 ~~~~~~Gdtit~P~--------~--iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q 70 (231) T protein:vir:73 1 ENGINLANLCEYPN--------D--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQ 70 (231) T ss_pred CccccCCceEEecc--------c--ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHH Confidence 11111344667763 1 34788999999999999999999999999999999999986654 5788899999 Q ss_pred HHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHH Q lcl|Aclame:pro 238 LTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIE 317 (419) Q Consensus 238 l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 317 (419) ++.+|++++|..++.--.+. + ......+.++.+.+++..+...+..+.+.+|||..+..|+ T Consensus 71 ~~~~iA~kvD~di~~~~~~a------------~-------l~~~~~~t~d~i~~A~~~fgde~~~~~vivv~p~~~~~Lr 131 (231) T protein:vir:73 71 LGLSLANKVDDDLLKAAKTT------------S-------QTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIR 131 (231) T ss_pred HHHHHHHhhhHHHHHhhccc------------c-------ccccccccHHHHHHHHHHhccccccceEEEEcchHHHhhh Confidence 99999999999988421100 0 0011235688899999999888878888899999999999 Q ss_pred HHhccCCc-eeccCCccccCCCcccccceeEecCCCCcCcEEEEe---ccceEEEEEecceEEEEeecccchhhcCcEEE Q lcl|Aclame:pro 318 LDQAPGSG-VFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGG---FRQGATLWSRQGITVLMTDSHADFFTANTLVI 393 (419) Q Consensus 318 ~~kd~~g~-~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d---~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 393 (419) +..+.... .....+....|.-++++|+||++|+.+|.+..+..- -..++.++...++.++..+.. ......+ T Consensus 132 k~~~~~~~~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~----~~k~~~i 207 (231) T protein:vir:73 132 KDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDI----VTKTTVI 207 (231) T ss_pred hccchhhhhhhhccceeeecccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeeccccc----cccccEE Confidence 85433211 011223344566789999999999999998776433 234556677788888866532 3445678 Q ss_pred EEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 394 LAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 394 r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) ++..++..++.+|..+++++++-. T Consensus 208 ~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 208 TADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred EEeEEEEEEEEcCccEEEEEeecC Confidence 999999999999999999999999 No 121 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.64 E-value=1.2e-17 Score=113.39 Aligned_cols=286 Identities=16% Similarity=0.088 Sum_probs=174.5 Q ss_pred hhhcccccccccCCcccc------cchhhhHHHHHhhhhhhhHHhhccee-cccCcceeeeeeccccceeccccccceee Q lcl|Aclame:pro 118 LLSRDAPAGTITNPNVPH------LPQLVPGIVPTTPDLPLLVADLLDQQ-NADYNVLEYIRDTSGTAGAGSTWNKAAVV 190 (419) Q Consensus 118 ~~~~~~~~~~~~~~~~~~------~p~~~~~~i~~~~~~~~~l~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~v 190 (419) .+. ....-.+..++... -|+.++..|.++.......-.+++.. ...+..+.|...... ....++.-| T Consensus 1 ~~~-~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~-----~~~~d~e~V 74 (318) T protein:vir:10 1 MTA-PTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPS-----FLEDDVADV 74 (318) T ss_pred CCC-CCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccc-----cccCcHhhc Confidence 000 00000111112222 27777788877776655555555554 334556665443221 123467788 Q ss_pred cCcccccccccceeeEEe-eeEEEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccc Q lcl|Aclame:pro 191 PEGTAKPQSTLSFDTITT-TLKTVAHWLPITRQAADDNS-QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGI 268 (419) Q Consensus 191 ~Eg~~~~~~~~~~~~v~~-~~~k~~~~~~vs~ell~d~~-~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~ 268 (419) +||+++|...+.++..++ ..+|.+..+.||+|++..+. +..+....+++++|++..|+.++.-= ..+.+ T Consensus 75 aEggEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal---------~sa~t 145 (318) T protein:vir:10 75 AEFGEIPVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALL---------QSPIV 145 (318) T ss_pred cCcccccccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHH---------hcccc Confidence 999999999999977777 55899999999999998764 88888999999999999999877521 00001 Q ss_pred cccccccccccc-----hhhhHHHHHHHHHHhh---------hhhccCCcEEEEehHHHHHHHHH------hccCCceec Q lcl|Aclame:pro 269 GTYQQPKPTAPA-----TDEPPLVDIRRAKTVA---------EIAGFPPDGVVVHPQDWESIELD------QAPGSGVFR 328 (419) Q Consensus 269 ~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~l~~~------kd~~g~~~~ 328 (419) .......+.... ...+..+.+..+...+ ..-++.++.++|||.+|..|++- -..++.+.+ T Consensus 146 ~~~~~s~~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~ 225 (318) T protein:vir:10 146 PTLAVPTAWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVS 225 (318) T ss_pred ccccCCcCCCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhh Confidence 111111111100 0001111111111111 12245667899999999999543 223344444 Q ss_pred cCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecc--cchhhc-CcEEEEEEEEeccEEec Q lcl|Aclame:pro 329 VIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSH--ADFFTA-NTLVILAEFRANLAVYQ 405 (419) Q Consensus 329 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~--~~~~~~-~~~~~r~~~r~d~~~~~ 405 (419) ......+..++.++|+.|+.+..+|.+++++++-.....+.+..+++..-.+.. ..+... ..+..|+.++--..|.+ T Consensus 226 ~~~~~tg~~~g~~lGl~vi~s~~~p~~~alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~ 305 (318) T protein:vir:10 226 TAPDWTGNFPGSVMGLNVIRSRTFPIDRVLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQ 305 (318) T ss_pred hcccccccccceeeceEEeecCccCCCeeEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehheeeeeeeeC Confidence 444445566778999999999999999999999766556666555555433311 112222 33566788888899999 Q ss_pred ccceEEEEecCCC Q lcl|Aclame:pro 406 PKAFVRVTFAAAT 418 (419) Q Consensus 406 ~~a~~~~~~~aa~ 418 (419) |+|+++++-=-+| T Consensus 306 PkA~~~itgi~~~ 318 (318) T protein:vir:10 306 PKAALWLTGIVTP 318 (318) T ss_pred cceeEEEeeccCC Confidence 9999999866666 No 122 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.63 E-value=6.3e-17 Score=109.40 Aligned_cols=381 Identities=14% Similarity=0.100 Sum_probs=201.2 Q ss_pred CCccHHHHHH---HHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|Aclame:pro 1 MPPTPTLEEQ---RAALLARLDDTSLTTEQV--QEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTP 75 (419) Q Consensus 1 M~~~~~L~e~---~~~l~~~~~~~~~~~~~~--~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 75 (419) |++ +.|.|+ .++++...-.+++.++-. .+..++ -...++++.-+.++..++.+.+.......+. + T Consensus 8 ~~k-~~~~ek~~~~~~~~e~~~~lks~~~g~~~~~~~~~-~~k~~el~kT~Sel~~ei~k~e~eln~~~E~--------~ 77 (400) T protein:vir:93 8 MNK-PDLIEKQNRLAELKENNVSLKSQISGFEVKNAIED-LPKVQELEKTLSENSIEIIKIENELNAQEEK--------P 77 (400) T ss_pred ccc-chHHHHHHHHhhhhhhhhhhhhhhhccchhhhhhh-chhHHHHHHHHHHhHHHHHHHhhhhhhhhhh--------c Confidence 433 223333 233333322333333211 111111 1223444544444444443322222111100 0 Q ss_pred cccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHH Q lcl|Aclame:pro 76 AEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVA 155 (419) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~ 155 (419) +..... -+...-.++...|..-.....+..+.+..+.. .-.+.|.+..+....+|.-+...|...+....++. T Consensus 78 Kgk~~m---tefLkT~~A~~~fa~~l~~nsg~sd~knaW~A----~l~E~gvt~td~n~iLP~~il~aIq~al~~~~~~~ 150 (400) T protein:vir:93 78 KGKDKM---TNFIESQNAVTEFFDVLKKNSGKSEIKNAWSA----KLAENGVTITDTTFQLPRKLVESINTALLNTNPVF 150 (400) T ss_pred ccchhH---HHhhhhHHHHHHHHHHHHhhcCCcchhhhhhh----hhhhcccccCCchhhcchHHHHHHHHhhhccCCcc Confidence 010000 00011111111221111222222222222221 11222333344445778888888888888888888 Q ss_pred hhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH---HHHHH Q lcl|Aclame:pro 156 DLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMG 232 (419) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~---~~~~~ 232 (419) +++++..+++-.+.. +... ...+.-+--|+++.++..+|..-++.|.-+..+..+.+-..++. ..|.+ T Consensus 151 ~f~~v~n~p~l~V~~----~~dt-----~~qa~gHk~G~~K~eq~~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~n 221 (400) T protein:vir:93 151 KVFHVTNVGALLVSR----SFDS-----ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYN 221 (400) T ss_pred cceeeecCCceeeec----chhh-----hcccceeccCCcccceeeeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHH Confidence 877776663322111 1110 11222255688999999999999999998888888866555543 46999 Q ss_pred HHHHHHHHHHHH-HHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHH-HHhhhhhccCCcEEEEeh Q lcl|Aclame:pro 233 YIQGRLTYGLRF-LRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRA-KTVAEIAGFPPDGVVVHP 310 (419) Q Consensus 233 ~i~~~l~~a~~~-~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 310 (419) ||+++|.+.+.. ..+++++-|+|++...++.....+-.....+..+..+....+.++..- +..+.+...+...++++| T Consensus 222 YVm~EL~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~dt~kt~~a~~~~~qdl~E~~~d~~~~~aad~~~Iv~s~ 301 (400) T protein:vir:93 222 LIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAE 301 (400) T ss_pred HHHHHHHHHHHHHHhhhheeecccccccCCCcchhhhhhhhhhhhhhhhcCCccHHHHHHHHHhhhhhccCCceeEEecc Confidence 999999999996 579999999999876665432222222111111111122223333322 334444455566789999 Q ss_pred HHHHHHHHHhccCCceeccCCccccCCCcccccc-eeEecCCCCc-CcEEEEeccceEEEEEecceEEEEeecccchhhc Q lcl|Aclame:pro 311 QDWESIELDQAPGSGVFRVIANVQGEATPRIWGL-NVVSTVAIAQ-GTALVGGFRQGATLWSRQGITVLMTDSHADFFTA 388 (419) Q Consensus 311 ~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~-pv~~~~~~~~-~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~ 388 (419) ..|+.|+.+++++|++.+..++.. ..-.+-+|+ .+++...++. ...++.|-. +++ +-.++ .. .+.-.|.+ T Consensus 302 d~~A~L~~lk~a~~~a~f~~~n~d-~~IA~~fGv~~Lv~~Tr~~~~kp~V~VDek--~~i-~~~~~--~t--~~sf~~~t 373 (400) T protein:vir:93 302 DRKALLDELRQATANANVRIKNDD-TEIASEVGVDEIIVYTGSKALKPTVLVDQK--YHI-DMQDL--TK--VDAFEWKT 373 (400) T ss_pred chHHHHHHhcCCcceeeeeecccc-chhhhhcccceeeeeccCCCCCceeeeehh--hhc-cccCc--ee--ccceeeee Confidence 999999999999999877555433 333344565 3443444433 233333533 333 22232 22 22223667 Q ss_pred CcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 389 NTLVILAEFRANLAVYQPKAFVRVTFA 415 (419) Q Consensus 389 ~~~~~r~~~r~d~~~~~~~a~~~~~~~ 415 (419) |+-.+..+.+++|-+.-|++-++++++ T Consensus 374 Ns~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 374 NSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred ccceEEeeeeeccceecccceeeEeeC Confidence 777889999999999999999999998 No 123 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.61 E-value=5e-16 Score=104.47 Aligned_cols=285 Identities=13% Similarity=0.101 Sum_probs=178.7 Q ss_pred cccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccc Q lcl|Aclame:pro 123 APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLS 202 (419) Q Consensus 123 ~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 202 (419) .. +.+..-.....+......|++...+.+.+.+.++..++.++.+.|.|.......... .-.|-.=....+++..+ T Consensus 1 mp-altLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~---~v~~~~~~~g~~~~~~t 76 (310) T protein:vir:97 1 MA-SVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMA---GVGTTFSGAGAGKAAAT 76 (310) T ss_pred Cc-ccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccc---cccccccCCCccccccc Confidence 00 000000112234445566777777778888999999999988999887653322111 01111111234567788 Q ss_pred eeeEEeeeEEEEEeehhhHHHHh--hH-H-HHHHHHHHHHHHHHHHHHHHHHHhccCcccc-cceecccccccccccccc Q lcl|Aclame:pro 203 FDTITTTLKTVAHWLPITRQAAD--DN-S-QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEM-QGILTTPGIGTYQQPKPT 277 (419) Q Consensus 203 ~~~v~~~~~k~~~~~~vs~ell~--d~-~-~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p-~Gi~~~~~~~~~~~~~~~ 277 (419) |.+++...+-+++.+.|.+.+.+ .+ + +...+-.+...++++.+.+..+||||.++++ .|+++........... T Consensus 77 ~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~-- 154 (310) T protein:vir:97 77 FTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTG-- 154 (310) T ss_pred cceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecC-- Confidence 99999999999999999987654 22 3 5666667888899999999999999987654 4887653221111111 Q ss_pred ccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHH-hccCCceecc-CCccccCCCcccccceeEecCCCCcC Q lcl|Aclame:pro 278 APATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELD-QAPGSGVFRV-IANVQGEATPRIWGLNVVSTVAIAQG 355 (419) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-kd~~g~~~~~-~~~~~~~~~~~l~G~pv~~~~~~~~~ 355 (419) ...+....+++-.++..+....+.+..++|||++..+|+.+ +..+++.++. ..+..+....++.|+|++.++.+|.+ T Consensus 155 -~~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi~~~d~ip~~ 233 (310) T protein:vir:97 155 -ATGSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPIFRNDYIPTN 233 (310) T ss_pred -CCCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEEEEeCccCCC Confidence 11122345666666666665566788999999998888765 4444333332 24444444567889999999999853 Q ss_pred ----------cEEEEeccc-----eEEEEE---ecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 356 ----------TALVGGFRQ-----GATLWS---RQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 356 ----------~~~~~d~~~-----~~~~~~---~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) .+++.-|.. ++.+.. ..++.+..- +..=.++...+++.+|++.++.+|+|+++++--.- T Consensus 234 ~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~---G~~~~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 234 QTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDV---GESEDSDEHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred ccccccCCceeEEEEeeCccccccceeccccCCccceeEEeC---CcccCCcceeEEEEEeeeEEEecccceeeeccccC Confidence 355554443 343321 122333321 11123566789999999999999999999964443 No 124 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.56 E-value=2.8e-15 Score=100.38 Aligned_cols=307 Identities=10% Similarity=0.020 Sum_probs=176.8 Q ss_pred HHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcc Q lcl|Aclame:pro 88 FADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNV 167 (419) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~ 167 (419) ..+. +...+ +.......+.......... ++ .+++......+.+..++.+.+++.++++++.+.. T Consensus 1 ~~~~----~~~~~---------~~n~~~~~i~k~~it~~~l--~~-g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~ 64 (360) T protein:vir:99 1 MSSN----STIDS---------VRNQNMNSLSQKDIGLAEL--DG-FQLPVDVTEEFLERMQKGVQILGMADTMTLARLE 64 (360) T ss_pred Ccch----hHHHH---------HhhhHHHHHHhhhcccccc--Cc-eeecHHHHHHHHHHHhhccchhhhcceeeccccc Confidence 0000 00000 0111111111111121111 23 3455566677778888899999999999998888 Q ss_pred eeeeeeccccceeccccccceeecCcccccc-cccceeeEEe-eeEEEEEeehhhHHHHhhHH-----HHHHHHHHHHHH Q lcl|Aclame:pro 168 LEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ-STLSFDTITT-TLKTVAHWLPITRQAADDNS-----QLMGYIQGRLTY 240 (419) Q Consensus 168 ~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~-~~~k~~~~~~vs~ell~d~~-----~~~~~i~~~l~~ 240 (419) ..+++..-+....... .|++..+. .+++...+.+ ..+++-....+..+-++++. .+++.|++.|++ T Consensus 65 ~ei~kig~G~r~~r~~-------~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae 137 (360) T protein:vir:99 65 MEVPQFGVPRLSGHTR-------DEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIE 137 (360) T ss_pred ccccccccceeecccc-------ccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHH Confidence 8877654332222111 12222222 2334444555 34466666677777766542 478999999999 Q ss_pred HHHHHHHHHHHhccCcc---------c-----ccceeccccccc--ccccc---------cc---------------ccc Q lcl|Aclame:pro 241 GLRFLRDRQLLNGNGST---------E-----MQGILTTPGIGT--YQQPK---------PT---------------APA 280 (419) Q Consensus 241 a~~~~~d~~il~G~g~~---------~-----p~Gi~~~~~~~~--~~~~~---------~~---------------~~~ 280 (419) +++.-++...++|+... + ..|++....... +..+. .. ... T Consensus 138 ~~~~Dle~l~~~g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~ 217 (360) T protein:vir:99 138 RYGNDLGLMGIRAGASSGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGN 217 (360) T ss_pred HHHHHHHHHHhhccchhcccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccc Confidence 99999999999997542 1 245554421000 00000 00 000 Q ss_pred hhhhHHHHHHHHHHhhhhhccCC----cEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCc Q lcl|Aclame:pro 281 TDEPPLVDIRRAKTVAEIAGFPP----DGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT 356 (419) Q Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 356 (419) .......-+.+++..++..|+++ -.|+||+......+.....-...+ -.....+...-..+|+||+..+.+|++. T Consensus 218 ~~~~~~~lf~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~t~L-Gd~~l~g~~~~~~~Gipi~~v~~~pd~~ 296 (360) T protein:vir:99 218 PQPVDTSLFNETIQTLDSRYRESDAYSPVLMTSPNQVQSYTMSLTEREDPL-GSAVIFGDSDITPFSYDLVGVNGFPDEY 296 (360) T ss_pred cccchHHHHHHHHHhcchhhhcCcccceEEEccCchHHHHHHHHhccCccc-chhheecccccccceeeeEEcCCCCCCc Confidence 01112334567788888888753 279999998777665432111111 0011222333457899999999999999 Q ss_pred EEEEeccceEEEEEecceEEEEeecccchhhcCc-EEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 357 ALVGGFRQGATLWSRQGITVLMTDSHADFFTANT-LVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 357 ~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +++.++.+.++++ +..++++...+...+-.+.. +.+.....+|+.+.+++|.++++--..|+ T Consensus 297 ~mlT~p~NLi~g~-~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~~ 359 (360) T protein:vir:99 297 MMFTDPNNLAFGL-YEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETPT 359 (360) T ss_pred eEEeccCceeEEe-eeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCCCC Confidence 9999999876655 56788876555432222221 33445667999999999999998888888 No 125 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.49 E-value=8.2e-15 Score=97.80 Aligned_cols=380 Identities=14% Similarity=0.081 Sum_probs=189.6 Q ss_pred CCccHHH----HHHHHHHHHHHHH-H-------HHHHHHHHH----HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPPTPTL----EEQRAALLARLDD-T-------SLTTEQVQE----IVAEARGLADALQAE---SDRAAARAALLRTAPP 61 (419) Q Consensus 1 M~~~~~L----~e~~~~l~~~~~~-~-------~~~~~~~~~----~~~e~~~~~~~~~~~---~~~~~~~~~~l~~~~~ 61 (419) |--...- .-++..+...+|- + .+.....++ +..|.+...+.++.+ .+++.....+.+.... T Consensus 1 ~~n~t~a~d~~~RR~~~~L~~~EvSvv~~PAY~nA~vt~vRe~e~~~~~e~~~~~e~~en~~e~~~~~~~~~~E~Rs~~~ 80 (410) T protein:vir:83 1 MGNATTASDEYIRRLENELREKESLVRGIYDRANASNRDVNEEEGQMVAECRGRMEQIKNQMEQAQEVNRIAFETRSKGQ 80 (410) T ss_pred CCCcccchhhHHHHHHHHhhhhheeeeccccccccccccchhhhccccccccCcccchhhhhHHHHHHHHHHHHHHHHHH Confidence 4322111 1111111100000 0 000000110 111111111111111 1111111111111111 Q ss_pred HHHHHHhhcccccccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhh Q lcl|Aclame:pro 62 APKGPADGGTPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVP 141 (419) Q Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~ 141 (419) +.................+.++ ..+.++++.....-.... .+.++... +....+.+.... .++|..+- T Consensus 81 ~i~~~~~~~r~~p~~~~veyRS------aGE~lkal~~~~~Gd~~A--~~~~e~~r---~a~~~~~Tgd~~-~~i~~~~v 148 (410) T protein:vir:83 81 AVDAAISAMRGSPVGTEVEYRS------AGEYMLDMWNSAQGNASA--ADRLEVYA---RAADHQKTGDLQ-GVIPDPIV 148 (410) T ss_pred HHHhhhccCcCCCCCCCccccc------HHHHHHHHhccCCchHHH--HHHHHHHH---HhhccCcccccc-cccchhHh Confidence 1111111111111111112222 233444443221111111 11111111 111122222222 34444455 Q ss_pred HHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhH Q lcl|Aclame:pro 142 GIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITR 221 (419) Q Consensus 142 ~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ 221 (419) .-.++++.+..++..++..+|..+.++.|+..+... .+..+.....--.||...+.++.+|+..+...++++++..+|+ T Consensus 149 ~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~-tV~~q~~~~kqa~EGd~L~~gKl~~~t~tA~ikTyGGyt~LSR 227 (410) T protein:vir:83 149 GPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRP-AVGLQGVAGGASDEKTELDSQKMVIDRLTVNAKTLGGYVNVSR 227 (410) T ss_pred hhHHHHHhhccchhhhhhhCCCCCCeeEEeeecccc-cccccccccccccccccccccceeeeeccceeehhcCcccccc Confidence 555667777788888888899999999997765432 2221111111124899999999999999999999999999999 Q ss_pred HHHhhH-HHHHHHHHHHHHHHHHHHHHH---HHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhh Q lcl|Aclame:pro 222 QAADDN-SQLMGYIQGRLTYGLRFLRDR---QLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAE 297 (419) Q Consensus 222 ell~d~-~~~~~~i~~~l~~a~~~~~d~---~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (419) +.|+.+ ....+...+.|..+++.+-+. ++|.++-++ .......+...+...+.++...+. T Consensus 228 Q~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~----------------~~a~~~~Tad~~~~~i~da~~~v~ 291 (410) T protein:vir:83 228 QAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTG----------------AVGYGNATADNVASAIWQAAGAVY 291 (410) T ss_pred eeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----------------hhhhhhccHHHHHHHHHHHHHHHh Confidence 999865 588888999998888888766 444432110 001111222233333344444444 Q ss_pred hh--ccCCcEEEEehHHHHHHHHH-hccCCc--ee--ccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEE Q lcl|Aclame:pro 298 IA--GFPPDGVVVHPQDWESIELD-QAPGSG--VF--RVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWS 370 (419) Q Consensus 298 ~~--~~~~~~~~~~~~~~~~l~~~-kd~~g~--~~--~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~ 370 (419) .+ +..-..+.++|+++..+..+ ++-++. .- +-......+..+.++|+||+..+..++++.+|.|.. ++..|. T Consensus 292 da~~~~~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA~f~~~~-Ai~~~e 370 (410) T protein:vir:83 292 TAVKGMGRLVIAIAPDVLGDFGPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGDAYLFSTA-AIECFE 370 (410) T ss_pred hhhccceeeeEEechhhhhhccceeeccCCCCcccccccccccccchhhhhcccceEEecCCCcCeeeEeccc-eeeeee Confidence 44 33444678999997666543 221110 00 000111134567899999999999999999999866 677787 Q ss_pred ecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 371 RQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFA 415 (419) Q Consensus 371 ~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 415 (419) ..+..++..+.+-...++ .|- .|+.+.+.++.+++-+.-. T Consensus 371 S~~gp~qL~d~~i~nLt~---~yS--gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 371 QRVGTLQVVEPSVFGLQV---AYA--GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred cCCceeEeeCCchhhhhh---hhe--eeeeeccccccceeeeccC Confidence 776566666554322333 233 6778889999999988776 No 126 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.49 E-value=6.5e-15 Score=98.36 Aligned_cols=260 Identities=13% Similarity=0.048 Sum_probs=161.2 Q ss_pred ccCCcccccchhhhHHHHHhhhhhhhHHhhccee----cccCcceeeeeeccccceeccccccceeecCcccccccccce Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQ----NADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSF 203 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 203 (419) +. ....+|+.+...+.+.+.....+..++... ...+.++.+|+... ...+.+..++..++..+++. T Consensus 1 MA--~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~--------~~~~d~~~~~~~~~~~~~~~ 70 (273) T protein:vir:79 1 MA--FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVA--------PTVKDYKAAGRQTSADAISD 70 (273) T ss_pred Cc--chhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCc--------ccccccccCCCccCcccccc Confidence 11 123579999999998888888877776432 22355777776432 12344667787777777788 Q ss_pred eeEEeeeEEE-EEeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc-Ccccccceeccccccccccccccccc Q lcl|Aclame:pro 204 DTITTTLKTV-AHWLPITRQ-AADDNSQLMGYIQGRLTYGLRFLRDRQLLNGN-GSTEMQGILTTPGIGTYQQPKPTAPA 280 (419) Q Consensus 204 ~~v~~~~~k~-~~~~~vs~e-ll~d~~~~~~~i~~~l~~a~~~~~d~~il~G~-g~~~p~Gi~~~~~~~~~~~~~~~~~~ 280 (419) ..+++...+. +.-+.|++. ..+...++.+ +.++++.++++++|..++.=- +.+. ........ T Consensus 71 ~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~vD~~i~~~~~~a~~--------------~~~~~~~~ 135 (273) T protein:vir:79 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT--------------ALTGSAPS 135 (273) T ss_pred ceEEEEEeeecccceeeccHHHHhhcccHHH-HHHHHHHHHHHHHHHHHHHHHhhccc--------------cccccccc Confidence 8888888664 445567763 3444557877 557788999999999776310 0000 00111122 Q ss_pred hhhhHHHHHHHHHHhhhhhccC--CcEEEEehHHHHHHHHHhccCCceec--cCCccccCCCcccccceeEecCCCCcCc Q lcl|Aclame:pro 281 TDEPPLVDIRRAKTVAEIAGFP--PDGVVVHPQDWESIELDQAPGSGVFR--VIANVQGEATPRIWGLNVVSTVAIAQGT 356 (419) Q Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~kd~~g~~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 356 (419) +....++.+..+...+...+.+ +-.++++|..+..|++..+.-..... -......+..++|+|++|+.++.+|.+. T Consensus 136 ~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~ 215 (273) T protein:vir:79 136 DADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD 215 (273) T ss_pred chhhHHHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccC Confidence 3334577788887777777653 34678999999998765321111111 1122335667789999999999999654 Q ss_pred E--EEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 357 A--LVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 357 ~--~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) . .+.....++... .+...++..+. ...| ...+++...++.++++|+++++++.+.+ T Consensus 216 ~~~~~a~~~~A~~~a-~~~~~~e~~r~-~~~~---~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 216 DEQFVAFHPSAAAYV-SQIDTVEALRD-QDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ceEEEEEeccceeee-eehhhhhcccC-cccc---eeeeeeeeeeeeEEecCceEEEEeccCC Confidence 2 222222233222 22223333222 2223 3468899999999999999999988888 No 127 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.49 E-value=7e-15 Score=98.20 Aligned_cols=259 Identities=13% Similarity=0.042 Sum_probs=158.9 Q ss_pred ccCCcccccchhhhHHHHHhhhhhhhHHhhccee----cccCcceeeeeeccccceeccccccceeecCcccccccccce Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQ----NADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSF 203 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 203 (419) +. ....+|+.+...+.+.....+.+..++..- ...+.++.+|+.... .-+.+..++..++..+.+. T Consensus 1 MA--~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~--------~~~d~~~~~~~~~~~~~~~ 70 (273) T protein:vir:10 1 MA--FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP--------TVKDYKAAGRQTSADAISD 70 (273) T ss_pred Cc--chhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccc--------cccccccCCCccCcccccc Confidence 11 224579999999998888888887776432 123456777764321 1233555666665556666 Q ss_pred eeEEeeeEEE-EEeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc-Ccccccceeccccccccccccccccc Q lcl|Aclame:pro 204 DTITTTLKTV-AHWLPITRQ-AADDNSQLMGYIQGRLTYGLRFLRDRQLLNGN-GSTEMQGILTTPGIGTYQQPKPTAPA 280 (419) Q Consensus 204 ~~v~~~~~k~-~~~~~vs~e-ll~d~~~~~~~i~~~l~~a~~~~~d~~il~G~-g~~~p~Gi~~~~~~~~~~~~~~~~~~ 280 (419) ..+++...+. +..+.|++. ..+...++++ +.++++.+++.++|..++.=- +.+. ........ T Consensus 71 ~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~--------------~~~~~~~~ 135 (273) T protein:vir:10 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT--------------ALTGSAPT 135 (273) T ss_pred ceEEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccc--------------cccccccc Confidence 7777776554 344456653 3444456877 567789999999999887410 0000 00111222 Q ss_pred hhhhHHHHHHHHHHhhhhhccC--CcEEEEehHHHHHHHHHhccCCceec--cCCccccCCCcccccceeEecCCCCcCc Q lcl|Aclame:pro 281 TDEPPLVDIRRAKTVAEIAGFP--PDGVVVHPQDWESIELDQAPGSGVFR--VIANVQGEATPRIWGLNVVSTVAIAQGT 356 (419) Q Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~kd~~g~~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 356 (419) +....++.+..+...+...+.+ +-.++++|..+..|++...--..... -......+..+++.|++|+.++.+|.+. T Consensus 136 ~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~ 215 (273) T protein:vir:10 136 DADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD 215 (273) T ss_pred chhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCC Confidence 3345678888888888777663 34578999999999764321111111 1122335666889999999999999753 Q ss_pred ---EEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 357 ---ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 357 ---~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) .+.+.. .++... .+...++..+ ....| ...+++...+++++++|+++++++.+.+ T Consensus 216 ~~~~~~~~~-~A~~~a-~q~~~~e~~r-~~~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 216 DEQFVAFHP-SAAAYV-SQIDTVEALR-DQDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ccEEEEEec-cceeee-eeeehhhccc-CCCcc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 333332 233222 2222333222 22233 3468888999999999999999988888 No 128 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.49 E-value=7e-15 Score=98.20 Aligned_cols=259 Identities=13% Similarity=0.042 Sum_probs=158.9 Q ss_pred ccCCcccccchhhhHHHHHhhhhhhhHHhhccee----cccCcceeeeeeccccceeccccccceeecCcccccccccce Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQ----NADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSF 203 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 203 (419) +. ....+|+.+...+.+.....+.+..++..- ...+.++.+|+.... .-+.+..++..++..+.+. T Consensus 1 MA--~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~--------~~~d~~~~~~~~~~~~~~~ 70 (273) T protein:vir:10 1 MA--FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP--------TVKDYKAAGRQTSADAISD 70 (273) T ss_pred Cc--chhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccc--------cccccccCCCccCcccccc Confidence 11 224579999999998888888887776432 123456777764321 1233555666665556666 Q ss_pred eeEEeeeEEE-EEeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc-Ccccccceeccccccccccccccccc Q lcl|Aclame:pro 204 DTITTTLKTV-AHWLPITRQ-AADDNSQLMGYIQGRLTYGLRFLRDRQLLNGN-GSTEMQGILTTPGIGTYQQPKPTAPA 280 (419) Q Consensus 204 ~~v~~~~~k~-~~~~~vs~e-ll~d~~~~~~~i~~~l~~a~~~~~d~~il~G~-g~~~p~Gi~~~~~~~~~~~~~~~~~~ 280 (419) ..+++...+. +..+.|++. ..+...++++ +.++++.+++.++|..++.=- +.+. ........ T Consensus 71 ~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~--------------~~~~~~~~ 135 (273) T protein:vir:10 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT--------------ALTGSAPT 135 (273) T ss_pred ceEEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccc--------------cccccccc Confidence 7777776554 344456653 3444456877 567789999999999887410 0000 00111222 Q ss_pred hhhhHHHHHHHHHHhhhhhccC--CcEEEEehHHHHHHHHHhccCCceec--cCCccccCCCcccccceeEecCCCCcCc Q lcl|Aclame:pro 281 TDEPPLVDIRRAKTVAEIAGFP--PDGVVVHPQDWESIELDQAPGSGVFR--VIANVQGEATPRIWGLNVVSTVAIAQGT 356 (419) Q Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~kd~~g~~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 356 (419) +....++.+..+...+...+.+ +-.++++|..+..|++...--..... -......+..+++.|++|+.++.+|.+. T Consensus 136 ~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~ 215 (273) T protein:vir:10 136 DADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD 215 (273) T ss_pred chhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCC Confidence 3345678888888888777663 34578999999999764321111111 1122335666889999999999999753 Q ss_pred ---EEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 357 ---ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 357 ---~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) .+.+.. .++... .+...++..+ ....| ...+++...+++++++|+++++++.+.+ T Consensus 216 ~~~~~~~~~-~A~~~a-~q~~~~e~~r-~~~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 216 DEQFVAFHP-SAAAYV-SQIDTVEALR-DQDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ccEEEEEec-cceeee-eeeehhhccc-CCCcc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 333332 233222 2222333222 22233 3468888999999999999999988888 No 129 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.36 E-value=8.7e-14 Score=92.19 Aligned_cols=294 Identities=14% Similarity=0.010 Sum_probs=163.3 Q ss_pred HHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc---cCcceeeeeeccccceecccccccee Q lcl|Aclame:pro 113 IDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA---DYNVLEYIRDTSGTAGAGSTWNKAAV 189 (419) Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~a~~ 189 (419) +....-.....+.+..++.....+|+.+...+.+.++....+..++..... .+.++++|+.. ..++.. T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g---------~~~a~d 71 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS---------RAAVYD 71 (381) T ss_pred CceecccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC---------cceeee Confidence 100010111122233344456677999999999999888888887665432 34467777632 234556 Q ss_pred ecCcccccccccceeeEEeeeEEEE-EeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccC--cc--ccccee Q lcl|Aclame:pro 190 VPEGTAKPQSTLSFDTITTTLKTVA-HWLPITRQ-AADDNSQLMGYIQGRLTYGLRFLRDRQLLNGNG--ST--EMQGIL 263 (419) Q Consensus 190 v~Eg~~~~~~~~~~~~v~~~~~k~~-~~~~vs~e-ll~d~~~~~~~i~~~l~~a~~~~~d~~il~G~g--~~--~p~Gi~ 263 (419) +.++..++-.+.+...+++...+.- .-+.|++. ..+.+.++.+.+.+++..++++..|+.++.--. .. .+.... T Consensus 72 ~~~g~~i~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t 151 (381) T protein:vir:80 72 KQPQTPVNLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYS 151 (381) T ss_pred ecCCCcccccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 6777777777777777777775543 34567765 344455899999999999999999999875311 11 111111 Q ss_pred ccccccccccccccccchhhhHHHHHHHHHHhhhhhccCC--cEEEEehHHHHHHHHHhccCCceeccCCccccCCCccc Q lcl|Aclame:pro 264 TTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPP--DGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRI 341 (419) Q Consensus 264 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l 341 (419) ....+.................++.+.++...+...+.+. -.++++|..+..|++...-....+.-......+..++| T Consensus 152 ~~~~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig~i 231 (381) T protein:vir:80 152 YDTTLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVGTI 231 (381) T ss_pred ccccccccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeeeEE Confidence 1111111111222223334557888888888888777643 36789999999998643212211222233455667899 Q ss_pred ccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEeccc-ceEEEEecCCCC Q lcl|Aclame:pro 342 WGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPK-AFVRVTFAAATT 419 (419) Q Consensus 342 ~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~-a~~~~~~~aa~~ 419 (419) +|++|+.++.+|.+................. .+.-+. ....|..+..+++....+|.++...- .+-.... +.++ T Consensus 232 ~G~~Vv~Sn~lp~~~~t~~~~~agap~~~~~--~~~~~~-~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g-~~~~ 306 (381) T protein:vir:80 232 LGMEVIVTTQIGINSLTGYVNGQGAPTQPTP--GVLGSP-YLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSG-AGAT 306 (381) T ss_pred cceEEEeecccccccccceeeeccccccccc--cccccc-cccccccceeeeeeeeeeceeeeeeeccceeeec-ceee Confidence 9999999999997543221111111111000 111111 12234455566777777777774432 2221111 1111 No 130 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.33 E-value=7.6e-14 Score=92.50 Aligned_cols=291 Identities=11% Similarity=0.099 Sum_probs=167.5 Q ss_pred HHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceeccc-Ccceeeeeeccc Q lcl|Aclame:pro 98 RARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSG 176 (419) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~ 176 (419) ++.... ......+....+..+..-...+ +.+..++.......+.++++.++.++. ++++.+|+... T Consensus 1 ma~~~~-----------~~~~~t~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG~- 67 (347) T protein:vir:94 1 MANMNG-----------GQQMGKDQGKGMSAGDKLALFL-KVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLGR- 67 (347) T ss_pred CCcccc-----------ccccccccccCCcccchHHHHH-HHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeeccc- Confidence 000000 0000001111111112122344 788889988889999999999988764 66888887532 Q ss_pred cceeccccccceeecCcccccc--cccceeeEEeeeEEEE-EeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 177 TAGAGSTWNKAAVVPEGTAKPQ--STLSFDTITTTLKTVA-HWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQL 250 (419) Q Consensus 177 ~~~~~~~~~~a~~v~Eg~~~~~--~~~~~~~v~~~~~k~~-~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~i 250 (419) .+++.+..|+.... .++..+++++...++- ..+.|. -+++. .++.+.+.++++.++++..|+.| T Consensus 68 --------~~~~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~Vd--diD~~q~~~D~rs~~~~~~g~ALA~~~D~~i 137 (347) T protein:vir:94 68 --------TKAAYLQPGENLDDKRKDMKHTEKTINIDGLLTADVLIY--DIEDAMNHYDVRSEYTAQLGESLAMAADGAV 137 (347) T ss_pred --------eeEeeeecCcCCCCCcCCccccceEEEEcchhhhhhhhh--hHHHHhcCcchHHHHHHHHHHHHHHHHHHHH Confidence 35566677777654 3567777766655541 222222 12222 36888899999999999999988 Q ss_pred Hh----ccCc---------ccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCc--EEEEehHHHHH Q lcl|Aclame:pro 251 LN----GNGS---------TEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD--GVVVHPQDWES 315 (419) Q Consensus 251 l~----G~g~---------~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 315 (419) +. +... +.+.+.... +................+++.+.++...+...+.+.. .++++|..+.. T Consensus 138 ~~~l~~~a~~~~~~~~~~~g~~~~~~v~--i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~ 215 (347) T protein:vir:94 138 LAEMAKLCNLPTANNENIAGLGKAHVLE--VGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSA 215 (347) T ss_pred HHHHHHhhccccccccccccCCcceeEe--eeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHH Confidence 63 2111 111111111 0011111122223455678888888888887777532 34668999999 Q ss_pred HHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCc-------------------------EEEEeccceEE-E- Q lcl|Aclame:pro 316 IELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT-------------------------ALVGGFRQGAT-L- 368 (419) Q Consensus 316 l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~-------------------------~~~~d~~~~~~-~- 368 (419) |.+..+.....+........+..+++.|++|+.++.+|.+. -+-+||++..- + T Consensus 216 LLk~~~~~~~~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~ 295 (347) T protein:vir:94 216 ILAALMPNAANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFN 295 (347) T ss_pred HHHhhcccccccccccccccceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEe Confidence 88754433222222333445667789999999999998531 12233333221 1 Q ss_pred -------EEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 369 -------WSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 369 -------~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) +.-.++++++.+... ++.+ .+.+..-++..+++|++.+.++++.| T Consensus 296 ~~~A~~tv~~~~~~~e~~~~~~--~~~~--~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 296 HRSAVGTVKLKDMALERARRAN--FQAD--QIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred chhhhhhhhhcccceeeeechh--hhhh--hhhhhhhhcCcccccceeEEEEecCC Confidence 123344555554322 2333 46677779999999999999999999 No 131 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.33 E-value=1e-13 Score=91.80 Aligned_cols=289 Identities=11% Similarity=0.017 Sum_probs=161.4 Q ss_pred HHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceec---ccCcceeeeeeccccceeccccccceeecC Q lcl|Aclame:pro 116 NRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQN---ADYNVLEYIRDTSGTAGAGSTWNKAAVVPE 192 (419) Q Consensus 116 ~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 192 (419) .+...........+......+|+.+...+.+.++....+.++++..+ ..+.++.+|+... .++..+.+ T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g~---------~~~~d~~~ 71 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRISE---------LGVEDKAT 71 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccCc---------ceeeeecC Confidence 00000000111223334456799999999999998888888876543 2355788876432 23444556 Q ss_pred cccccccccceeeEEeeeEEE-EEeehhhHHH-HhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc--Ccccccceeccccc Q lcl|Aclame:pro 193 GTAKPQSTLSFDTITTTLKTV-AHWLPITRQA-ADDNSQLMGYIQGRLTYGLRFLRDRQLLNGN--GSTEMQGILTTPGI 268 (419) Q Consensus 193 g~~~~~~~~~~~~v~~~~~k~-~~~~~vs~el-l~d~~~~~~~i~~~l~~a~~~~~d~~il~G~--g~~~p~Gi~~~~~~ 268 (419) +..++-.+.+-..+++...+. +..+.|++.- .+.+.++.+.+.++++.++++++|..++.-- +++.+.+.. . T Consensus 72 ~~~i~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~----~ 147 (341) T protein:vir:94 72 DVPVGVQPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNV----F 147 (341) T ss_pred CCccccccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCcc----c Confidence 666666666667777777443 4556777653 4455689999999999999999999887521 111111110 0 Q ss_pred cccccccccccchhhhHHHHHHHHHHhhhhhccCC--cEEEEehHHHHHHHHHhccCCceeccCCccccCCCccccccee Q lcl|Aclame:pro 269 GTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPP--DGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNV 346 (419) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv 346 (419) . ..............++.+..+...+...+.+. -..+++|..+..|++...-....+.-......+..++++|++| T Consensus 148 ~--~~~~~~t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V 225 (341) T protein:vir:94 148 S--SSNGAITGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRV 225 (341) T ss_pred c--CccccccCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEE Confidence 0 00011111223345677777777777666542 3567899999999764211111111122344566678999999 Q ss_pred EecCCCCcCcEEEE---------------------------eccc-eEEEEEecce-EEEEee------------cccch Q lcl|Aclame:pro 347 VSTVAIAQGTALVG---------------------------GFRQ-GATLWSRQGI-TVLMTD------------SHADF 385 (419) Q Consensus 347 ~~~~~~~~~~~~~~---------------------------d~~~-~~~~~~~~~~-~i~~~~------------~~~~~ 385 (419) +.++.+|.+..... ++.. ..+.+.+... .+..-+ ..... T Consensus 226 ~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~ 305 (341) T protein:vir:94 226 IRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQS 305 (341) T ss_pred EEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhcccccccccccc Confidence 99999986532211 0000 0011111111 111000 00000 Q ss_pred hh--cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 386 FT--ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 386 ~~--~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) |. +=...+++..-++.++++|+|.+-++.++++- T Consensus 306 ~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~~~~~ 341 (341) T protein:vir:94 306 FENREQVWLMVGRQAYGARLYRPLHAVNIHTTGDTV 341 (341) T ss_pred chhhhhhhhhhhhhhhcccccCcceeEEEecCcCCC Confidence 11 11123556667899999999998888777777 No 132 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.29 E-value=2.4e-13 Score=89.74 Aligned_cols=290 Identities=14% Similarity=0.141 Sum_probs=161.1 Q ss_pred HHHHHHHHh---hhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceeccc-Ccceeeeeeccccceeccccc Q lcl|Aclame:pro 110 MRDIDPNRL---LSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWN 185 (419) Q Consensus 110 ~~~~~~~~~---~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 185 (419) +........ ....... +....-...+ +.+..++.......+.++++.++.++. ++++.+|+... . T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~al~l-e~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~iG~---------~ 69 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVV-AAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGR---------T 69 (345) T ss_pred Ccccccchhcccccccccc-cCCchhHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEEeeecc---------e Confidence 000000000 0000000 0111112333 778888889999999999999988876 66888887532 3 Q ss_pred cceeecCccccccc--ccceeeEEeeeEE--EEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHh----cc Q lcl|Aclame:pro 186 KAAVVPEGTAKPQS--TLSFDTITTTLKT--VAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLN----GN 254 (419) Q Consensus 186 ~a~~v~Eg~~~~~~--~~~~~~v~~~~~k--~~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~----G~ 254 (419) ++.....|++.... ++..++.++...+ +..+. |. -+++. .++.+.+.++++.++++..|+.++. +. T Consensus 70 ~~~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~-Vd--diD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a 146 (345) T protein:vir:22 70 QAAYLAPGENLDDKRKDIKHTEKVITIDGLLTADVL-IY--DIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLC 146 (345) T ss_pred EEEeeecCCCCCCCCCCcccceEEEEecchhhhhhh-Hh--hHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 45566667665443 3556664444333 22221 11 12222 3788999999999999999998873 11 Q ss_pred C-----cccccceeccccccccc--cccccccchhhhHHHHHHHHHHhhhhhccCCc--EEEEehHHHHHHHHHhccCCc Q lcl|Aclame:pro 255 G-----STEMQGILTTPGIGTYQ--QPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD--GVVVHPQDWESIELDQAPGSG 325 (419) Q Consensus 255 g-----~~~p~Gi~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~~kd~~g~ 325 (419) . ++.|.|+-+........ ........+...+++.+..+...+...+.+.. ..+++|..+..|+.-+.-+.. T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~ 226 (345) T protein:vir:22 147 NVESKYNENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAA 226 (345) T ss_pred cccccccccccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhcccccccc Confidence 1 12233332211111111 11111223345678888888888877777643 467899999988754432222 Q ss_pred eeccCCccccCCCcccccceeEecCCCCcCc-----------------------EEEEeccceEEEE--------Eecce Q lcl|Aclame:pro 326 VFRVIANVQGEATPRIWGLNVVSTVAIAQGT-----------------------ALVGGFRQGATLW--------SRQGI 374 (419) Q Consensus 326 ~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~-----------------------~~~~d~~~~~~~~--------~~~~~ 374 (419) .+.-......+...+++|++|+.++.+|.+. ...++.+-..+++ .-.++ T Consensus 227 ~~~~~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~ 306 (345) T protein:vir:22 227 NYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDL 306 (345) T ss_pred ccccccccccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecc Confidence 2222222334556789999999999887421 1111111111222 22233 Q ss_pred EEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 375 TVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 375 ~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) +++..+.. .+|. + .+++..-++.++++|+|.+.+++.-- T Consensus 307 ~~e~~r~~-~~~~-d--~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 307 ALERARRA-NFQA-D--QIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred eeeeeech-hHHH-H--HHHHHHhcCCcccccceeEEEEEeeC Confidence 45554432 2333 2 46777789999999999999877766 No 133 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.28 E-value=2.5e-13 Score=89.66 Aligned_cols=296 Identities=11% Similarity=0.087 Sum_probs=165.9 Q ss_pred HHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc-cCcceeeeeeccc Q lcl|Aclame:pro 98 RARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA-DYNVLEYIRDTSG 176 (419) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~ 176 (419) ++... .......+...++.....-...+ +.+..++....+..+.++++.++.++ +++++.+|+... T Consensus 1 ~a~~~-----------~~~~~~~~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~- 67 (347) T protein:vir:88 1 MANAT-----------GGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGR- 67 (347) T ss_pred CCCcc-----------cchhhhccCCCCccccchHHHHH-HHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecc- Confidence 00000 00001011112222222223344 78888888888888999999998875 466888887543 Q ss_pred cceeccccccceeecCcccccc--cccceeeEEeeeEEEE-EeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 177 TAGAGSTWNKAAVVPEGTAKPQ--STLSFDTITTTLKTVA-HWLPITRQ-AADDNSQLMGYIQGRLTYGLRFLRDRQLLN 252 (419) Q Consensus 177 ~~~~~~~~~~a~~v~Eg~~~~~--~~~~~~~v~~~~~k~~-~~~~vs~e-ll~d~~~~~~~i~~~l~~a~~~~~d~~il~ 252 (419) .++..+..|..... .++..+++++...++- ....|.+- -++...++.+.+.++++.++++..|+.++. T Consensus 68 --------~~~~~~~~g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~ 139 (347) T protein:vir:88 68 --------TKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLA 139 (347) T ss_pred --------eeeeeeccccCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHH Confidence 23444555655443 2456677776666542 22233221 122223688889999999999999998873 Q ss_pred ----ccCc-----ccccceecccccccccc-ccccccchhhhHHHHHHHHHHhhhhhccCC--cEEEEehHHHHHHHHHh Q lcl|Aclame:pro 253 ----GNGS-----TEMQGILTTPGIGTYQQ-PKPTAPATDEPPLVDIRRAKTVAEIAGFPP--DGVVVHPQDWESIELDQ 320 (419) Q Consensus 253 ----G~g~-----~~p~Gi~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~k 320 (419) +... ..+.|+........... ............++.+.++...+...+.+. -.++++|..+..|++.. T Consensus 140 ~l~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~ 219 (347) T protein:vir:88 140 EMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSAL 219 (347) T ss_pred HHHHhhccccccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcch Confidence 1111 11223211110000000 011112233445777888887787776642 35688999999887654 Q ss_pred ccCCceeccCCccccCCCcccccceeEecCCCCcCc-------------------------EEEEeccceEEE-EE---- Q lcl|Aclame:pro 321 APGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGT-------------------------ALVGGFRQGATL-WS---- 370 (419) Q Consensus 321 d~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~-------------------------~~~~d~~~~~~~-~~---- 370 (419) ..+...+.-......+..+.+.|++|+.++.+|.+. .+-+|++....+ +. T Consensus 220 ~~~~~~~~~~~~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~ 299 (347) T protein:vir:88 220 MPNAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAV 299 (347) T ss_pred hhhhhhhccccchhcceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhh Confidence 333333333344555666789999999999998421 122334432222 11 Q ss_pred ----ecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|Aclame:pro 371 ----RQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 371 ----~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~ 418 (419) -.++.++..+.. ..|. + .+++..-++.++++|++.+.++++++- T Consensus 300 g~v~~~d~~~e~~r~~-~~~~-d--~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 300 GTVKLKDMALERARRP-EFQA-D--QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred hheecccceeeeeech-hhHH-H--HhhhhhhhcCceeccceEEEEEeCCCC Confidence 122334444332 2332 2 577888899999999999999887777 No 134 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.23 E-value=9.1e-13 Score=86.60 Aligned_cols=295 Identities=13% Similarity=0.131 Sum_probs=156.6 Q ss_pred HHHHH-HHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc-cCcceeeeeeccccceeccccccc Q lcl|Aclame:pro 110 MRDID-PNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA-DYNVLEYIRDTSGTAGAGSTWNKA 187 (419) Q Consensus 110 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~a 187 (419) +.... ......+...++.....-... -+.+..++....+..+.++++.+..++ +++++.||+... .++ T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~-ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~---------~t~ 70 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALF-LKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGR---------TKA 70 (347) T ss_pred CCccccCCccccccccCCCcchHHHHH-HHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccc---------eee Confidence 00000 000001111111111111122 267778888888889999999988775 467888888653 234 Q ss_pred eeecCcccccc--cccceeeEEeeeEEEEE-eehhhHHHHh-hH-HHHHHHHHHHHHHHHHHHHHHHHHhc--cC----- Q lcl|Aclame:pro 188 AVVPEGTAKPQ--STLSFDTITTTLKTVAH-WLPITRQAAD-DN-SQLMGYIQGRLTYGLRFLRDRQLLNG--NG----- 255 (419) Q Consensus 188 ~~v~Eg~~~~~--~~~~~~~v~~~~~k~~~-~~~vs~ell~-d~-~~~~~~i~~~l~~a~~~~~d~~il~G--~g----- 255 (419) .....|..++. .+++..++++...+.-. ...| +.+-+ ++ .++.+.+.++.+.++++..|+.|+.- .+ T Consensus 71 ~~~~~g~~l~~~~~~~~~~e~~ltID~~~~~~~~V-ddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~ 149 (347) T protein:vir:15 71 AYLKPGENLDDKRKDIKHTEKVIHIDGLLTADVLI-YDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPD 149 (347) T ss_pred eeeccCCCCCCCCCCCccceEEEEechhhhhhHHh-hhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 44455655543 23455665554433311 1122 22222 12 37889999999999999999988731 01 Q ss_pred -cccccceeccccccccccccc----cccchhhhHHHHHHHHHHhhhhhccC--CcEEEEehHHHHHHHHHhccCCceec Q lcl|Aclame:pro 256 -STEMQGILTTPGIGTYQQPKP----TAPATDEPPLVDIRRAKTVAEIAGFP--PDGVVVHPQDWESIELDQAPGSGVFR 328 (419) Q Consensus 256 -~~~p~Gi~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~kd~~g~~~~ 328 (419) +..+.+.....++.......+ ........+++.+.++...+...+.+ +-..+++|..+..|++-.+-....+. T Consensus 150 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~ 229 (347) T protein:vir:15 150 ASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQ 229 (347) T ss_pred cccccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhccccccccccc Confidence 111111100111111111111 11112334566677777677766664 22456799999999865443332222 Q ss_pred cCCccccCCCcccccceeEecCCCCcCcE----------------------EEEeccce-EEEEEe--------cceEEE Q lcl|Aclame:pro 329 VIANVQGEATPRIWGLNVVSTVAIAQGTA----------------------LVGGFRQG-ATLWSR--------QGITVL 377 (419) Q Consensus 329 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~~----------------------~~~d~~~~-~~~~~~--------~~~~i~ 377 (419) -......+..++++|++|+.++.+|.+.+ +-++|... .+++.+ .++.++ T Consensus 230 ~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e 309 (347) T protein:vir:15 230 ALIDHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALE 309 (347) T ss_pred ccccccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeee Confidence 22334455567899999999999985321 11122211 122222 223444 Q ss_pred EeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 378 MTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 378 ~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ..+.. .+|. ..+++...++.++++|++.+.++++-..- T Consensus 310 ~~~~~-~~~~---d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 310 RARRA-NYQA---DQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred ecccc-hhhh---hhhehhhhcCCceeccccEEEEecCCCCC Confidence 43322 2222 35677778899999999999998888777 No 135 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.22 E-value=5.1e-13 Score=87.96 Aligned_cols=297 Identities=11% Similarity=0.098 Sum_probs=160.3 Q ss_pred HHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc-cCcceeeeeeccc Q lcl|Aclame:pro 98 RARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA-DYNVLEYIRDTSG 176 (419) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~ 176 (419) ++.. ........+....+.....-...+ +.+..++.......+.++++.+..+. +++++.||+.... T Consensus 1 ~~~~-----------~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~~ 68 (347) T protein:vir:33 1 MANI-----------QGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRT 68 (347) T ss_pred CCCC-----------ccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccccccccceeEeeeccce Confidence 0000 000000011111122222222345 88888998888888999999988775 4678888876432 Q ss_pred cceeccccccceeecCccccccc--ccceeeEEeeeEEEEE-eehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 177 TAGAGSTWNKAAVVPEGTAKPQS--TLSFDTITTTLKTVAH-WLPITRQ-AADDNSQLMGYIQGRLTYGLRFLRDRQLLN 252 (419) Q Consensus 177 ~~~~~~~~~~a~~v~Eg~~~~~~--~~~~~~v~~~~~k~~~-~~~vs~e-ll~d~~~~~~~i~~~l~~a~~~~~d~~il~ 252 (419) ++.....|+.++.. ++...+.++...+.-. ...|.+- =++...++.+.+.++.+.++++..|+.|+. T Consensus 69 ---------t~~~~~~g~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~ 139 (347) T protein:vir:33 69 ---------KAAYLKPGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLA 139 (347) T ss_pred ---------eeeeecCCCCCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHH Confidence 33444455555432 3445554454333211 1111111 011122688889999999999999999872 Q ss_pred -----ccCcccccceecc---cccccccc----ccccccchhhhHHHHHHHHHHhhhhhccC--CcEEEEehHHHHHHHH Q lcl|Aclame:pro 253 -----GNGSTEMQGILTT---PGIGTYQQ----PKPTAPATDEPPLVDIRRAKTVAEIAGFP--PDGVVVHPQDWESIEL 318 (419) Q Consensus 253 -----G~g~~~p~Gi~~~---~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~ 318 (419) +.....+.+.... .+...... ...........+++.+.++...+...+.+ +-..+++|..+..|.+ T Consensus 140 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~ 219 (347) T protein:vir:33 140 ELAGLVNLPDGSNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILA 219 (347) T ss_pred HHHHhhhhhcccccccccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhc Confidence 2221111111100 00000001 01111123345678888888888877774 3356899999999886 Q ss_pred HhccCCceeccCCccccCCCcccccceeEecCCCCcCcE----------------------EEEeccce-EEEEEec--- Q lcl|Aclame:pro 319 DQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTA----------------------LVGGFRQG-ATLWSRQ--- 372 (419) Q Consensus 319 ~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~----------------------~~~d~~~~-~~~~~~~--- 372 (419) ...-....+.-......+..++++|++|+.++.+|.+.+ +-++|... .+++.+. T Consensus 220 ~~~~~~~d~~~~~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g 299 (347) T protein:vir:33 220 ALMPNAANYQALLDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVG 299 (347) T ss_pred cccccccccccccccccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhhe Confidence 543332222222334455667899999999999986421 11122111 1222222 Q ss_pred -----ceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 373 -----GITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 373 -----~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ++.++..+.. .+|. ..+++...++.++++|++.+.++++-..- T Consensus 300 ~v~~~~~~~e~~r~~-~~~~---d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 300 TVKLKDLALERARRA-NYQA---DQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred eeeeeceeeeeccch-hhhh---HhhhhhhhcCCceecccceEEEecCCCCC Confidence 2234433322 2232 24677777899999999999999888777 No 136 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.22 E-value=2.1e-13 Score=90.11 Aligned_cols=291 Identities=13% Similarity=0.132 Sum_probs=156.2 Q ss_pred HHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceeccc-Ccceeeeeeccccceeccccccce Q lcl|Aclame:pro 110 MRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNKAA 188 (419) Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~ 188 (419) +...-......+...++.....-...+ +.+..+++......+.++++.+..++. ++++.||+... .++. T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~i-k~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG~---------~tv~ 70 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFL-KVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMGR---------TSGV 70 (347) T ss_pred CCCCCccccccccccCCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccccccccceEEEecccc---------eeee Confidence 000000000001111111111112233 566677777777788889998888864 66888887643 2344 Q ss_pred eecCccccccc--ccceeeEEeeeEEEEEeehhhHHHHhh---H---HHHHHHHHHHHHHHHHHHHHHHHHh-----ccC Q lcl|Aclame:pro 189 VVPEGTAKPQS--TLSFDTITTTLKTVAHWLPITRQAADD---N---SQLMGYIQGRLTYGLRFLRDRQLLN-----GNG 255 (419) Q Consensus 189 ~v~Eg~~~~~~--~~~~~~v~~~~~k~~~~~~vs~ell~d---~---~~~~~~i~~~l~~a~~~~~d~~il~-----G~g 255 (419) .+..|+.++.. +.+-.++++...++- +++.++.| . .++.+.+.++++.++++..|+.|+. .+. T Consensus 71 ~~t~G~~l~~~~~~~~~~e~~itID~~~----~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~ 146 (347) T protein:vir:94 71 YLAPGERLSDKRKGIKHTEKVITIDGLL----TADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNL 146 (347) T ss_pred eecCCCCcCCCCCCCCcceEEEEecchh----hhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 44455555332 233444344433331 22333322 2 2688889999999999999998863 111 Q ss_pred cc----cccceeccccccccccccc-cccchhhhHHHHHHHHHHhhhhhccCC--cEEEEehHHHHHHHHHhccCCceec Q lcl|Aclame:pro 256 ST----EMQGILTTPGIGTYQQPKP-TAPATDEPPLVDIRRAKTVAEIAGFPP--DGVVVHPQDWESIELDQAPGSGVFR 328 (419) Q Consensus 256 ~~----~p~Gi~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~kd~~g~~~~ 328 (419) ++ .+.|+.....+........ .........++.+.++...+...+.+. -..+++|..+..|..-+.-....+. T Consensus 147 ~~~~~~~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~ 226 (347) T protein:vir:94 147 PAASNENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYA 226 (347) T ss_pred ccccccccCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhcc Confidence 11 1223221111111111111 112233445677777777777766543 2568999999888655443333233 Q ss_pred cCCccccCCCcccccceeEecCCCCcCc-----------E---------------EEEeccceE-EEEEec--------c Q lcl|Aclame:pro 329 VIANVQGEATPRIWGLNVVSTVAIAQGT-----------A---------------LVGGFRQGA-TLWSRQ--------G 373 (419) Q Consensus 329 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~-----------~---------------~~~d~~~~~-~~~~~~--------~ 373 (419) -......+..++++|++|+.|+.+|.+. + +-+||.... ++|.+. + T Consensus 227 ~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~ 306 (347) T protein:vir:94 227 ALIDPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRD 306 (347) T ss_pred ccccccccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhccc Confidence 3344555666889999999999998421 0 122222211 122222 1 Q ss_pred eEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCC Q lcl|Aclame:pro 374 ITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 374 ~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~ 418 (419) ++++..+. ...|. ..+++..-++.++++|++.+.++.++|- T Consensus 307 ~~~e~~r~-~~~~~---d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 307 LALERDRD-VDAQG---DLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred ccccchhc-hhhHH---HHhhhhhhhcCcccccceeEEEEecCCC Confidence 23343322 22333 2578888899999999999999998777 No 137 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.18 E-value=1e-11 Score=80.86 Aligned_cols=270 Identities=11% Similarity=-0.014 Sum_probs=159.1 Q ss_pred ccccCCcccccchhhhHHHHHhhhhhhhHHh---------hcceec--ccCcceeeeeeccccceeccccccceeecCcc Q lcl|Aclame:pro 126 GTITNPNVPHLPQLVPGIVPTTPDLPLLVAD---------LLDQQN--ADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGT 194 (419) Q Consensus 126 ~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~---------~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~ 194 (419) -..+.-...++|+.+...+.....+...+.+ +..... .+|..+++|..... ++.+..+.|+. T Consensus 1 MA~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l-------~Gd~~~v~~~~ 73 (324) T protein:vir:59 1 MAYTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDL-------DGDSQVLNDTD 73 (324) T ss_pred CCceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccC-------CCcccccCCCc Confidence 1233446678899999888776666655422 222222 23556666654321 24677888999 Q ss_pred cccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccc Q lcl|Aclame:pro 195 AKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQ 273 (419) Q Consensus 195 ~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~ 273 (419) +++..+.+.++..-..++.+..+.++++...-+ .+....+.++++....+..+..+|.- ..|++......... T Consensus 74 ~i~~~~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~-----l~g~~~~~~~~~~~- 147 (324) T protein:vir:59 74 DLVPQKINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAE-----LAGVFSNDDMKDNK- 147 (324) T ss_pred ccchhhcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhhhccccccce- Confidence 998888887777777777777788888765433 47788899999999999999887742 11111111111000 Q ss_pred ccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCC Q lcl|Aclame:pro 274 PKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIA 353 (419) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~ 353 (419) ....+........+.+.++...+.+....-.+|+||+.++..|++..-... +.....+..-++++|++|++++.|| T Consensus 148 ~dvsa~~~~~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~----~~~s~~~~~i~~~~G~~VivdD~~p 223 (324) T protein:vir:59 148 LDISGTADGIYSAETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLIEF----VKDSQSGIRFPTYMNKRVIVDDSMP 223 (324) T ss_pred eeeeccccceecHHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhhhh----ccccccCceeeeecccEEEEeCCCC Confidence 011111222345677888888888887788899999999999997642211 1111223345678999999999998 Q ss_pred cCc-------EEEEeccceEEE-EE-ecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEe---cCCCC Q lcl|Aclame:pro 354 QGT-------ALVGGFRQGATL-WS-RQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTF---AAATT 419 (419) Q Consensus 354 ~~~-------~~~~d~~~~~~~-~~-~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~---~aa~~ 419 (419) ... ...+-|..+.+. .. +....++..+. ...++..+....++ ++||..+..-.- ..+|| T Consensus 224 ~~~~~~~~~~y~s~l~~~GAi~~~~~~~~v~vE~dRd----~~~g~~~l~~r~~~---~~~p~G~s~~~~~~~~~sPt 294 (324) T protein:vir:59 224 VETLEDGTKVFTSYLFGAGALGYAEGQPEVPTETARN----ALGSQDILINRKHF---VLHPRGVKFTENAMAGTTPT 294 (324) T ss_pred ccccCCCCceEEEEEEecCeEEEeecCCCcceecccC----ccccceEEEEeeEE---EeEeeeEEecccccCCCCCC Confidence 521 111222222222 22 22344444332 23455566666654 356666555322 23344 No 138 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.13 E-value=2e-12 Score=84.69 Aligned_cols=288 Identities=13% Similarity=0.107 Sum_probs=160.0 Q ss_pred HHHHhhh------cccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceeccc-Ccceeeeeeccccceecccccc Q lcl|Aclame:pro 114 DPNRLLS------RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNK 186 (419) Q Consensus 114 ~~~~~~~------~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 186 (419) .....+. .....++....-...+ +.+..++.......+.++++.++.++. ++++.+|+... .+ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~i-e~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~iG~---------~~ 70 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGR---------TQ 70 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHH-HHHHHHHHHHHHHHhhhcccceeeeecccceEEEEeece---------eE Confidence 0000000 0001111122223344 788888888889999999999988875 66888887532 24 Q ss_pred ceeecCccccccc--ccceeeEEeeeEEEE-EeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHh----ccCc Q lcl|Aclame:pro 187 AAVVPEGTAKPQS--TLSFDTITTTLKTVA-HWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLN----GNGS 256 (419) Q Consensus 187 a~~v~Eg~~~~~~--~~~~~~v~~~~~k~~-~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~----G~g~ 256 (419) +..+.-|++.+.+ ++.-+++++...++- ..+.|. -+++. .++.+.+.++++.++++..|+.++. +... T Consensus 71 ~~~~~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~Vd--DiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~ 148 (344) T protein:vir:10 71 AAYLAPGENLDDIRKDIKHTEKVITIDGLLTADVLIY--DIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNV 148 (344) T ss_pred EEeeecCCCCCCCCCCcccceEEEEEcchhhhhhhhh--hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 4555667666543 355566555544421 111221 12222 2788999999999999999998853 2111 Q ss_pred -----ccccceeccccccc--cccccccccchhhhHHHHHHHHHHhhhhhccCCc--EEEEehHHHHHHHHHhccCCcee Q lcl|Aclame:pro 257 -----TEMQGILTTPGIGT--YQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD--GVVVHPQDWESIELDQAPGSGVF 327 (419) Q Consensus 257 -----~~p~Gi~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~~kd~~g~~~ 327 (419) ..|.|.-....+.. ..........+...+++.+.++...+...+.+.. ..+++|..+..|+.-+.-+...+ T Consensus 149 ~~~~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~ 228 (344) T protein:vir:10 149 ESQYNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANY 228 (344) T ss_pred ccccccccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhccccccccc Confidence 12222211111110 1111112222334567778888888877776533 45679999998865432222222 Q ss_pred ccCCccccCCCcccccceeEecCCCCcCc---------------------EEEEeccceE-EEEE--------ecceEEE Q lcl|Aclame:pro 328 RVIANVQGEATPRIWGLNVVSTVAIAQGT---------------------ALVGGFRQGA-TLWS--------RQGITVL 377 (419) Q Consensus 328 ~~~~~~~~~~~~~l~G~pv~~~~~~~~~~---------------------~~~~d~~~~~-~~~~--------~~~~~i~ 377 (419) --......+..+++.|++|+.++.+|.+. .+.++++... +++. -.+++++ T Consensus 229 ~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e 308 (344) T protein:vir:10 229 AALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALE 308 (344) T ss_pred ccccceeeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceee Confidence 12223334556678999999999998431 1112332211 1111 2223444 Q ss_pred EeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 378 MTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 378 ~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) ..+. ..+|. + .+++..-++.++++|++.+.+++++- T Consensus 309 ~~r~-~~~~~-d--~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 309 RARR-ANFQA-D--QIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred cccc-hhHHH-H--HHHHHhhcccceecccceEEEEeecC Confidence 4443 23343 2 46677789999999999988888877 No 139 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.13 E-value=2.2e-11 Score=78.98 Aligned_cols=288 Identities=14% Similarity=0.053 Sum_probs=160.0 Q ss_pred HHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceeccc-Ccceeeeeeccccceeccccccceeec Q lcl|Aclame:pro 113 IDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNKAAVVP 191 (419) Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~ 191 (419) +..-....+....+. ...-...+ +.+..++.......+.++++.++.++. ++++.+|+... .++++.. T Consensus 1 ms~~~~~tr~~~~~s-~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~---------~~~~~~~ 69 (335) T protein:vir:63 1 MSFLNDLTRPNYAGK-NADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGN---------VEAKGRR 69 (335) T ss_pred CCCcccchhhhcccc-cchhheeh-hhhhhhHHHHHHhhhhhccccceeeeccceeEEEeeeee---------eeeeccc Confidence 100000011111111 11223334 788889999999999999999999875 56888888632 2344444 Q ss_pred CcccccccccceeeEEeeeEEEEEeehhhHHH---HhhH---HHHHHHHHHHHHHHHHHHHHHHHH----hccCcccccc Q lcl|Aclame:pro 192 EGTAKPQSTLSFDTITTTLKTVAHWLPITRQA---ADDN---SQLMGYIQGRLTYGLRFLRDRQLL----NGNGSTEMQG 261 (419) Q Consensus 192 Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~el---l~d~---~~~~~~i~~~l~~a~~~~~d~~il----~G~g~~~p~G 261 (419) -|+......+..++..+....+- +++.+ +++. .++-+.+.+++++++++..|+.++ .+.....|.+ T Consensus 70 pG~~l~~~~~~~~k~~itVD~ll----~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~ 145 (335) T protein:vir:63 70 AGEELERSRVVNDKWNLTVDTLL----YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVD 145 (335) T ss_pred CCcCcCCCCccccceEEEeccee----echhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc Confidence 45555444455566555555442 23333 2332 278899999999999999999765 3444322222 Q ss_pred eecc--ccccccc-cccccccchhhhHHHHHHHHHHhhhhhccCC-----cEEEEehHHHHHHHHHhccCCceec-cC-- Q lcl|Aclame:pro 262 ILTT--PGIGTYQ-QPKPTAPATDEPPLVDIRRAKTVAEIAGFPP-----DGVVVHPQDWESIELDQAPGSGVFR-VI-- 330 (419) Q Consensus 262 i~~~--~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~l~~~kd~~g~~~~-~~-- 330 (419) +-.. +|+.... .+..........++..+..+...+...+.+. -..+++|..|..|..-+.--++.|- .. T Consensus 146 ~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~ 225 (335) T protein:vir:63 146 LEDAFSPGVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGAT 225 (335) T ss_pred cCCCcCCCcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcccccccccccccccc Confidence 2111 1221111 1122222234444556666777777666542 3578999999998864221122111 01 Q ss_pred CccccCCCcccccceeEecCCCCcCc-----------EEEEeccceE-EEEEe--------cceEEEEeecccchhhcCc Q lcl|Aclame:pro 331 ANVQGEATPRIWGLNVVSTVAIAQGT-----------ALVGGFRQGA-TLWSR--------QGITVLMTDSHADFFTANT 390 (419) Q Consensus 331 ~~~~~~~~~~l~G~pv~~~~~~~~~~-----------~~~~d~~~~~-~~~~~--------~~~~i~~~~~~~~~~~~~~ 390 (419) .+...+....+.|+||+.++.+|.+. .+-+|+.... +++.+ .+++.++.++.. .|. T Consensus 226 ~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~-~~~--- 301 (335) T protein:vir:63 226 NDYVKSRVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNE-KFS--- 301 (335) T ss_pred ccccCceeEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccc-hhh--- Confidence 11233455678999999999998542 2334443322 22222 222333333322 122 Q ss_pred EEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 391 LVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 391 ~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ..+.+..-++..+++|++.+.++++.... T Consensus 302 ~~i~~~~a~G~g~lRPe~a~~i~~tg~~~ 330 (335) T protein:vir:63 302 WVLDTFQMYNIGARRPDTAGAIELKGIGA 330 (335) T ss_pred HHhHHHHHcCCcccccceEEEEEEcCCCc Confidence 23445556899999999999999877776 No 140 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.09 E-value=3.6e-11 Score=77.82 Aligned_cols=273 Identities=10% Similarity=-0.039 Sum_probs=155.1 Q ss_pred ccccCCcccccchhhhHHHHHhhhhhhhHHh---------hcceecccCcceeeeeeccccceeccccccceeecCcccc Q lcl|Aclame:pro 126 GTITNPNVPHLPQLVPGIVPTTPDLPLLVAD---------LLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAK 196 (419) Q Consensus 126 ~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~ 196 (419) -..+.-...++|+.+...+.+...+...+.+ +......++..+++|.-.. -++.+..+.|+..+ T Consensus 1 MA~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~-------l~Gd~~~~~~~~~i 73 (351) T protein:vir:15 1 MAETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLND-------LTGDPDNWTDSDDI 73 (351) T ss_pred CCceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEeccccc-------CCCcccccCCCccc Confidence 1233446678899998888766555544422 1111223455666665321 12467788899998 Q ss_pred cccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccc--cccc Q lcl|Aclame:pro 197 PQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIG--TYQQ 273 (419) Q Consensus 197 ~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~--~~~~ 273 (419) +..+.+-..-.-..+..+..+.++++...-+ .+....|.++++....+..+..+|.- ..|++...... .... T Consensus 74 ~~~kitt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~-----l~gv~~~~~~~~~~~~d 148 (351) T protein:vir:15 74 DVNNLTSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSV-----LKGVMGVTKIANSKVYD 148 (351) T ss_pred chheecccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHhhchhhcccceec Confidence 8888777666666777777788888764433 47888899999999999999887751 11111100000 0001 Q ss_pred ccccccchhhhHHHHHHHHHHhhhhhccC-CcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCC Q lcl|Aclame:pro 274 PKPTAPATDEPPLVDIRRAKTVAEIAGFP-PDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAI 352 (419) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~ 352 (419) .+..+........+.+.++...+-..... -.+|+||+.++..|++..--+ ++.....+..-++++|++|++++.| T Consensus 149 ~t~~~~~~~~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~----~~~~s~~~~~i~t~~G~~VivdD~~ 224 (351) T protein:vir:15 149 QTKVSPSEPMFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIE----TIQPQNGATPFEAYNGLRIVLDDDI 224 (351) T ss_pred cccccccccccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhh----hccccccCcccceecceEEEEcCCC Confidence 11112233345678889999888775443 578999999999999754211 1111112234578999999999999 Q ss_pred CcC-------cEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEe-----cCCCC Q lcl|Aclame:pro 353 AQG-------TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTF-----AAATT 419 (419) Q Consensus 353 ~~~-------~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~-----~aa~~ 419 (419) |.. ....+-|..+.+.+......+++.+... -..++..+....+ -++||..+..-+. ..+|| T Consensus 225 p~~~~~~~~~~ytsyl~~~GAi~~~~~~~~ve~~rd~~--~~~g~d~l~~r~~---~~~hp~G~s~~~~~~~~~~~sPt 298 (351) T protein:vir:15 225 EIDLTDKTKPVSTSYIFAPGAVRYSTNMRSTETKYDPL--INGGQDVIVQKRV---GTIHVAGTSIKASFSPSKASFPT 298 (351) T ss_pred ccccCCCCCceeEEEEEecceeeeecCCcCcceeeccc--CCCCceEEEEeee---eeeeeeeeeecccccccCcCCcC Confidence 842 1122233333333223333344444322 1223333333333 3477777765421 12344 No 141 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=99.09 E-value=8.9e-12 Score=81.16 Aligned_cols=249 Identities=12% Similarity=0.108 Sum_probs=133.1 Q ss_pred hcceecccCcceeeeeeccccceeccccccceeecCcccccc--cccceee--EEeeeEEEEEeehhhHHHHhhHHHHHH Q lcl|Aclame:pro 157 LLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ--STLSFDT--ITTTLKTVAHWLPITRQAADDNSQLMG 232 (419) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~--~~~~~~~--v~~~~~k~~~~~~vs~ell~d~~~~~~ 232 (419) +++.+. +++++++|+.... ++.+..-|+++.. .++.-.+ ++++-.++..+.--.-+=++...++.+ T Consensus 1 ~vr~i~-~g~s~~~~~iG~~---------~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~ 70 (324) T protein:vir:99 1 MTRTIT-SGKSAQFPVMGRT---------KARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRS 70 (324) T ss_pred Ceeeee-cCceEEEeeeeee---------EeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchh Confidence 333333 4678889886432 3333333444422 2233344 333333332211111011111236899 Q ss_pred HHHHHHHHHHHHHHHHHHHhc----cCcc---cccceeccccc--cccccccccccchhhhHHHHHHHHHHhhhhhccCC Q lcl|Aclame:pro 233 YIQGRLTYGLRFLRDRQLLNG----NGST---EMQGILTTPGI--GTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPP 303 (419) Q Consensus 233 ~i~~~l~~a~~~~~d~~il~G----~g~~---~p~Gi~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 303 (419) ...++++.++++..|+.++.- .... ...+.....+. ................+++.+.++...+...+.+. T Consensus 71 e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~ 150 (324) T protein:vir:99 71 EYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPA 150 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCC Confidence 999999999999999887521 1101 11111100010 11111112223344567888888888887777653 Q ss_pred c--EEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcE------------------------ Q lcl|Aclame:pro 304 D--GVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTA------------------------ 357 (419) Q Consensus 304 ~--~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~------------------------ 357 (419) . ..+++|..+..|+.-+.-+...+.-.+....+..++++|++|+.|+.+|...+ T Consensus 151 ~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ 230 (324) T protein:vir:99 151 GDRTFYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTG 230 (324) T ss_pred CCCEEEeChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCcccccccccccccccccccccccccccccc Confidence 2 46789999987765433333333334555566778899999999999986311 Q ss_pred -EEEeccceE-EEEEec--------ceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 358 -LVGGFRQGA-TLWSRQ--------GITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 358 -~~~d~~~~~-~~~~~~--------~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +-+|++... +++.++ ++..+..++. ..|. ..+++..-++..+++|++.+.+++.+..| T Consensus 231 ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~-~~~~---d~i~~~~a~G~~~lRPe~a~~v~l~~~~~ 298 (324) T protein:vir:99 231 KMTVGADNVVGLFVHRSAVATLKLKDMALERARRP-EYQA---DQIIAKYAMGHGGLRPEAVGAIIFEDGET 298 (324) T ss_pred ccccccCceeEEEEehhheEEEeeecceecceech-hhHH---HhhhhhhhhcCcccccceEEEEEEccCcc Confidence 222222211 222222 2234444332 2233 24566777899999999999888777655 No 142 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=99.08 E-value=2.2e-11 Score=79.03 Aligned_cols=287 Identities=12% Similarity=0.075 Sum_probs=157.6 Q ss_pred cccccccCCccccc-chhhhHHHHHhhhhhhhHHhhcceecc-cCcceeeeeeccccceeccccccceeecCcccccccc Q lcl|Aclame:pro 123 APAGTITNPNVPHL-PQLVPGIVPTTPDLPLLVADLLDQQNA-DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST 200 (419) Q Consensus 123 ~~~~~~~~~~~~~~-p~~~~~~i~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 200 (419) .+.|..++.+...+ |+.|+..|...+.+......+.+.... .|..+.|++....+..-+ .+++.++-.+ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~~tV~dY---------~~~~~i~~d~ 71 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGTPVVRSR---------PEQGDFTFDN 71 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccccccccccc---------cCCCCccccc Confidence 34444555554444 999999998887777766666664442 466788877654332211 2233333233 Q ss_pred cceeeEE--eeeEEEEEeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHh--ccCcccccceeccccccccccccc Q lcl|Aclame:pro 201 LSFDTIT--TTLKTVAHWLPITRQAADDNSQLMGYIQGRLTYGLRFLRDRQLLN--GNGSTEMQGILTTPGIGTYQQPKP 276 (419) Q Consensus 201 ~~~~~v~--~~~~k~~~~~~vs~ell~d~~~~~~~i~~~l~~a~~~~~d~~il~--G~g~~~p~Gi~~~~~~~~~~~~~~ 276 (419) ++-.+++ ++-.|+-++ .|+++..+.+.+|.+...++++.+++..+|..+.. -+|..+..++-+...+........ T Consensus 72 ltt~~~~l~IDq~KYfaf-~VdDD~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv 150 (322) T protein:vir:31 72 LDTGEISIILRDEVYAGN-AISKKLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFV 150 (322) T ss_pred CCCceEEEEEehhhhhcc-ccchhHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCcccee Confidence 3333334 444444444 48887777778999999999999999999997743 122211111100000000011111 Q ss_pred cccchhhhHHHHHHHHHHhhhhhccCC-cEE-EEehHHHHHHHHH-----hccCCceeccCCccc-cC--CCccccccee Q lcl|Aclame:pro 277 TAPATDEPPLVDIRRAKTVAEIAGFPP-DGV-VVHPQDWESIELD-----QAPGSGVFRVIANVQ-GE--ATPRIWGLNV 346 (419) Q Consensus 277 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~l~~~-----kd~~g~~~~~~~~~~-~~--~~~~l~G~pv 346 (419) ..+......|+.++++..++...+.+. ..| |++|..+..|..+ -..+++..-+..+.. .+ ..++++|+.| T Consensus 151 ~~gt~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg~~~GF~V 230 (322) T protein:vir:31 151 GTGTDQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRSVYGIDL 230 (322) T ss_pred ccCCCchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHHHHhceee Confidence 223344557888999988888887764 345 6789988877442 112233211111111 11 1578999999 Q ss_pred EecCCCCcCc--EEEEeccceEEEEEecceEEEEeec------------ccchhh---cCcEEEEEEEEeccEEecccce Q lcl|Aclame:pro 347 VSTVAIAQGT--ALVGGFRQGATLWSRQGITVLMTDS------------HADFFT---ANTLVILAEFRANLAVYQPKAF 409 (419) Q Consensus 347 ~~~~~~~~~~--~~~~d~~~~~~~~~~~~~~i~~~~~------------~~~~~~---~~~~~~r~~~r~d~~~~~~~a~ 409 (419) ++|+.++.+. ++.|.......-+...+.....+.. ..+.|. +..-.+|+.+|++..+.+|+.+ T Consensus 231 ~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~~~~d~~~~~~~~g~g~~r~e~l 310 (322) T protein:vir:31 231 FVSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYNDDLNTATTARWGNGLVRDENL 310 (322) T ss_pred eeeccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCccccccceeeeeeecceeecccce Confidence 9999997532 2332211100000000000000000 001111 1223578999999999999999 Q ss_pred EEEEecCCCC Q lcl|Aclame:pro 410 VRVTFAAATT 419 (419) Q Consensus 410 ~~~~~~aa~~ 419 (419) +.+.-.+.++ T Consensus 311 ~~~~a~~~~~ 320 (322) T protein:vir:31 311 VCVLANADKV 320 (322) T ss_pred EEEEeccccc Confidence 9998888887 No 143 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.08 E-value=1.9e-11 Score=79.39 Aligned_cols=291 Identities=12% Similarity=0.012 Sum_probs=158.8 Q ss_pred HHHHHHHHhhhcccccccccCCcccccc-hhhhHHHHHhhhhhhhHHhhcceeccc-Ccceeeeeeccccceeccccccc Q lcl|Aclame:pro 110 MRDIDPNRLLSRDAPAGTITNPNVPHLP-QLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNKA 187 (419) Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~p-~~~~~~i~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a 187 (419) +........+. ... ..++....+. +.+..++.......+.++++.++.++. ++++.+|+... .++ T Consensus 1 m~~~~~~~~t~-~~~---~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~iG~---------~~~ 67 (334) T protein:vir:80 1 MTYPAANTHTR-PGW---GGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRVGA---------STI 67 (334) T ss_pred CCCCcCCCccc-ccc---ccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeeecc---------eee Confidence 00000000000 000 0112222333 888999999898999999999999875 66888887643 234 Q ss_pred eeecCcccccccccceeeEEeeeEEE-EEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHh----ccCcccc Q lcl|Aclame:pro 188 AVVPEGTAKPQSTLSFDTITTTLKTV-AHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLN----GNGSTEM 259 (419) Q Consensus 188 ~~v~Eg~~~~~~~~~~~~v~~~~~k~-~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~----G~g~~~p 259 (419) ++..-|+.+....+..+++++....+ .....|. -+++. .++-+.+.++++.++++..|++++. |.....| T Consensus 68 ~~~~~g~~l~~~~~~~~~~~l~ID~~l~~~~~Vd--diD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~ 145 (334) T protein:vir:80 68 AGRKAGEELVVQKNVSDKLNLTVDTVLYARHFFD--KFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAP 145 (334) T ss_pred eeecCCCCCCCCCcccCceEEEEeeeeehhhhHh--hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Confidence 44555666666666667766666653 2222222 12222 3799999999999999999997752 3222222 Q ss_pred cceecc--cccccccc---ccccccchhhhHHHHHHHHHHhhhhhccC-----CcEEEEehHHHHHHHHHhccCCceecc Q lcl|Aclame:pro 260 QGILTT--PGIGTYQQ---PKPTAPATDEPPLVDIRRAKTVAEIAGFP-----PDGVVVHPQDWESIELDQAPGSGVFRV 329 (419) Q Consensus 260 ~Gi~~~--~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~l~~~kd~~g~~~~~ 329 (419) .+.-.. +|+..... ......+....++..+..+...+...+.+ .-..+++|..|..|+.-..--.+.|-. T Consensus 146 ~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~ 225 (334) T protein:vir:80 146 AHLKPAFHDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGA 225 (334) T ss_pred ccccccccCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceecc Confidence 211110 11111111 11112233334455555666666655554 235689999999988642211111100 Q ss_pred C---CccccCCCcccccceeEecCCCCcCc-----------EEEEeccceEE-EEEecc--------eEEEEeecccchh Q lcl|Aclame:pro 330 I---ANVQGEATPRIWGLNVVSTVAIAQGT-----------ALVGGFRQGAT-LWSRQG--------ITVLMTDSHADFF 386 (419) Q Consensus 330 ~---~~~~~~~~~~l~G~pv~~~~~~~~~~-----------~~~~d~~~~~~-~~~~~~--------~~i~~~~~~~~~~ 386 (419) . .....+...+++|+||+.|+.+|... .+-+||+.... ++.+.. ++.++.++.. .| T Consensus 226 s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~-~~ 304 (334) T protein:vir:80 226 KEGGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKK-DF 304 (334) T ss_pred ccccccccceeEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechh-hH Confidence 0 11223445678999999999999642 44555555432 222222 2223322211 12 Q ss_pred hcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 387 TANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 387 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) . + .+.+..-++.++++|+|.++++++-+-. T Consensus 305 ~-d--~i~~~~a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 305 G-H--YLDTFQSYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred H-H--HHHHHHHcCCceeccceEEEEEEeeecC Confidence 1 1 2344456899999999999998876655 No 144 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.08 E-value=5.6e-11 Score=76.78 Aligned_cols=288 Identities=15% Similarity=0.067 Sum_probs=157.7 Q ss_pred HHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceeccc-Ccceeeeeeccccceeccccccceeec Q lcl|Aclame:pro 113 IDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNKAAVVP 191 (419) Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~ 191 (419) +..-....+....+. ...-...+ +.+..++.......+.++++.++.++. ++++.+|+... .++++.. T Consensus 1 ms~~~~~t~~~~~~s-~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~---------~~~~~~~ 69 (335) T protein:vir:78 1 MSFLNDLTRPNYAGK-NADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGN---------VEAKGRR 69 (335) T ss_pred CCccccccccccccc-cchhhhhh-hhhhhHHHHHHHHhhhhccccceeeeccceeEEEeeeee---------eeecccc Confidence 111111111111111 12223444 788889988899999999999998874 56888887532 2334444 Q ss_pred CcccccccccceeeEEeeeEEEEEeehhhHHHH---hhH---HHHHHHHHHHHHHHHHHHHHHHHH----hccCcccccc Q lcl|Aclame:pro 192 EGTAKPQSTLSFDTITTTLKTVAHWLPITRQAA---DDN---SQLMGYIQGRLTYGLRFLRDRQLL----NGNGSTEMQG 261 (419) Q Consensus 192 Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell---~d~---~~~~~~i~~~l~~a~~~~~d~~il----~G~g~~~p~G 261 (419) -|+....+.+..++..+....+- +++.++ ++. -++-+.+.+++++++++..|+.++ .+.....|.. T Consensus 70 pG~~l~~~~~~~~k~~itID~ll----~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~ 145 (335) T protein:vir:78 70 AGEELERSRVVNDKWNLTVDTLL----YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVD 145 (335) T ss_pred cCcccCCCCcccCCeEEEeccee----echhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 45444444445555555554432 333333 332 268899999999999999999765 3333322221 Q ss_pred eecc--cccccccc-ccccccchhhhHHHHHHHHHHhhhhhccC-----CcEEEEehHHHHHHHHHhccCCceec-cC-- Q lcl|Aclame:pro 262 ILTT--PGIGTYQQ-PKPTAPATDEPPLVDIRRAKTVAEIAGFP-----PDGVVVHPQDWESIELDQAPGSGVFR-VI-- 330 (419) Q Consensus 262 i~~~--~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~l~~~kd~~g~~~~-~~-- 330 (419) .-+. +|+..... ........+..+.+.+..+...+...+.+ .-..+++|..|..|+.-..--.+.|- .. T Consensus 146 ~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~ 225 (335) T protein:vir:78 146 LEDAFSPGVLEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGAT 225 (335) T ss_pred cCCCcCCCcceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccc Confidence 1111 12211111 11222223444455555555556555443 23578999999998864221122111 11 Q ss_pred CccccCCCcccccceeEecCCCCcCc-----------EEEEeccc-eEEEEEec--------ceEEEEeecccchhhcCc Q lcl|Aclame:pro 331 ANVQGEATPRIWGLNVVSTVAIAQGT-----------ALVGGFRQ-GATLWSRQ--------GITVLMTDSHADFFTANT 390 (419) Q Consensus 331 ~~~~~~~~~~l~G~pv~~~~~~~~~~-----------~~~~d~~~-~~~~~~~~--------~~~i~~~~~~~~~~~~~~ 390 (419) .+...+....+.|+||+.++.+|.+. .+-+|+.. ..+++.+. ++..+++++.. .|. T Consensus 226 ~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~-~~~--- 301 (335) T protein:vir:78 226 NDYVKSRVAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHD-QFS--- 301 (335) T ss_pred cccccceeEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccc-hhh--- Confidence 11233455678999999999999542 22234433 22233322 22333333222 122 Q ss_pred EEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 391 LVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 391 ~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ..+.+..-++..+++|+|.+.++++.... T Consensus 302 ~~i~~~~a~G~g~lRPe~a~~i~~tg~~~ 330 (335) T protein:vir:78 302 WVLDTFQMYNIGARRPDTAGAIELKGIEA 330 (335) T ss_pred HhhhHHHHcCCcccCcceEEEEEecCCCc Confidence 23445556899999999999999888777 No 145 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.07 E-value=7.2e-11 Score=76.19 Aligned_cols=273 Identities=12% Similarity=0.017 Sum_probs=155.2 Q ss_pred ccccccCCcccccchhhhHHHHHhhhhhhhHHh---------hcceecccCcceeeeeeccccceeccccccceeecCcc Q lcl|Aclame:pro 124 PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVAD---------LLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGT 194 (419) Q Consensus 124 ~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~ 194 (419) .+...+.-...++|+.+...+.....+...+.+ +......++..+++|.-.. -++.+..+.||. T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~-------l~G~~~~~~dg~ 73 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWND-------LTGDSEVLGNGD 73 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEeccccc-------CCCcccccCCCc Confidence 112234556678899998888777665544422 1122223456666665432 124566677885 Q ss_pred -cccccccceeeEEeeeEEEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccc-----c Q lcl|Aclame:pro 195 -AKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTP-----G 267 (419) Q Consensus 195 -~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~-----~ 267 (419) .++..+.+-..-.-..++.+..+.++++....+ .+....+.++++....+..+..+|.- ..|+++.. + T Consensus 74 ~~i~~~ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~-----l~gvf~~~~~~~~~ 148 (330) T protein:vir:10 74 KALETGKITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIAT-----LNGIFATGTAGEKG 148 (330) T ss_pred cccchhhcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHH-----HHhhhhhhhcccch Confidence 577777777777777777777788888864433 57788899999988888888876641 11111110 0 Q ss_pred ccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeE Q lcl|Aclame:pro 268 IGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVV 347 (419) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~ 347 (419) ..........+........+.+.++...+.+....-.+|+||+.++..|++..--+ +......+..-++++|++|+ T Consensus 149 ~~~~~~~~~~~~~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~----~~~~s~~~~~i~~~~G~~Vi 224 (330) T protein:vir:10 149 ALEETHVSDQSKASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLIQ----YIQPTTATINIPTYLGYRVI 224 (330) T ss_pred hhhhhheecccccccccCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhhh----hhcccccCcccccccceEEE Confidence 00011111112223334567788888888887777889999999999999753211 11112223345789999999 Q ss_pred ecCCCCcCc--EEEEeccceEEEEEe----cceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEe-----cC Q lcl|Aclame:pro 348 STVAIAQGT--ALVGGFRQGATLWSR----QGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTF-----AA 416 (419) Q Consensus 348 ~~~~~~~~~--~~~~d~~~~~~~~~~----~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~-----~a 416 (419) +++.||... ...+-|..+.+.+.. ....++..+. ...++..+....+ -++||..+..-+- .. T Consensus 225 vdD~~p~~~~~yt~yl~~~GAi~~~~~~~~~~v~~EtdRd----~~~g~~~l~~r~~---~~~hp~G~s~~~~~~~~~~~ 297 (330) T protein:vir:10 225 IDDGIAPTGDIYTSYLFRTGSIGLNTGNPSGLTTFETSRE----AAKGNDMIYTRRA---LVMHPYGVKWTGAEVDAGNI 297 (330) T ss_pred EeCCCCCCCCceeEEEEecCceeeecccCCccccccccCC----ccccceEEEEeeE---EEeeeeeeeecccccccCcC Confidence 999998532 111122222222211 1122333322 2344445555444 4466776665532 12 Q ss_pred CCC Q lcl|Aclame:pro 417 ATT 419 (419) Q Consensus 417 a~~ 419 (419) +|| T Consensus 298 sPt 300 (330) T protein:vir:10 298 TPS 300 (330) T ss_pred CcC Confidence 355 No 146 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=99.07 E-value=1.3e-11 Score=80.24 Aligned_cols=235 Identities=11% Similarity=0.036 Sum_probs=153.8 Q ss_pred hhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCc-ceeeeeeccccc Q lcl|Aclame:pro 100 RDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYN-VLEYIRDTSGTA 178 (419) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~ 178 (419) ..........+.+. ....-|......|++.+.+.+.|+..++......+ ...+.+.++ T Consensus 1 m~~~~~~~~TL~e~------------------Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~--- 59 (328) T protein:vir:95 1 MAVKGLTALTLADW------------------GKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSG--- 59 (328) T ss_pred CCccccccccHHHH------------------HhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeec--- Confidence 00000000000000 00112233444667777777888888888877533 455555543 Q ss_pred eeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 179 GAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS---QLMGYIQGRLTYGLRFLRDRQLLNGNG 255 (419) Q Consensus 179 ~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~---~~~~~i~~~l~~a~~~~~d~~il~G~g 255 (419) -++++|..=++..+.++.++.+++-..+-+++.+.|.+.+.+... ++...-.....+++.+.+...||+||. T Consensus 60 -----LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGds 134 (328) T protein:vir:95 60 -----LPSATWRLLNYGVQPSKSTTVQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDS 134 (328) T ss_pred -----cCCceeeecCCccCcccceeEEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Confidence 357889998999999999999999999999999999999887543 455556677999999999999999987 Q ss_pred ccccccee------cc-----------cccc------------------------------------------------- Q lcl|Aclame:pro 256 STEMQGIL------TT-----------PGIG------------------------------------------------- 269 (419) Q Consensus 256 ~~~p~Gi~------~~-----------~~~~------------------------------------------------- 269 (419) +.+|.++. +. .|.. T Consensus 135 a~~p~~F~GL~~R~~~~s~~~a~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~ 214 (328) T protein:vir:95 135 SVNPQQFMGLSSRYSSLSAGNAQNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEG 214 (328) T ss_pred cCChhhhcchhhhcCccccccccceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeE Confidence 76655442 00 0000 Q ss_pred -----------------------ccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCce Q lcl|Aclame:pro 270 -----------------------TYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGV 326 (419) Q Consensus 270 -----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~ 326 (419) ......-...+...+.++.++.++..++.....+.+|+||.+....|++.....++. T Consensus 215 y~~~~~w~~Gl~i~d~r~vvrI~NId~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~ 294 (328) T protein:vir:95 215 YRTHYKWDNGLALRDWRYVVRIANIDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSL 294 (328) T ss_pred EEEEEEeeeeeEEcCcccEEEEecCcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcce Confidence 000000112334556777888888888877777788999999999999875555554 Q ss_pred eccCCccccCCCcccccceeEecCCCCcCcEEEE Q lcl|Aclame:pro 327 FRVIANVQGEATPRIWGLNVVSTVAIAQGTALVG 360 (419) Q Consensus 327 ~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~ 360 (419) .+......+...-.+.|+||..++.+-.++-.+. T Consensus 295 ~~~~~~~~g~~~t~~~gipir~~dai~~tE~~vv 328 (328) T protein:vir:95 295 AISVKETEGEWWTSFRGVPIRETDALLETEARVV 328 (328) T ss_pred eeeeeccCCcceeEECCeEEEEEeeeecCccccC Confidence 4444444445566788999999998865543333 No 147 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=99.06 E-value=4e-11 Score=77.59 Aligned_cols=277 Identities=12% Similarity=0.017 Sum_probs=162.3 Q ss_pred hhhcccccccccCCcccccc--hhhhHHHHHhhhhhhhHHhhcceecccCc---ceeeeeeccccceeccccccceeecC Q lcl|Aclame:pro 118 LLSRDAPAGTITNPNVPHLP--QLVPGIVPTTPDLPLLVADLLDQQNADYN---VLEYIRDTSGTAGAGSTWNKAAVVPE 192 (419) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~p--~~~~~~i~~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~a~~v~E 192 (419) .+-.. ....+..+.. +.+.+.+.+.+......+.++++....+. .+.|...+ ..+.+.|++. T Consensus 1 ~~~~~-----a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~--------~~G~a~~~~~ 67 (296) T protein:vir:10 1 MGVDK-----ADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFD--------GVGIAQIVAD 67 (296) T ss_pred Ccccc-----hhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeee--------ccCceeEeCC Confidence 00000 0011111221 34445566656666666666665432222 33333222 2345677776 Q ss_pred c-ccccccccceeeEEeeeEEEEEeehhhHHHHhhHH----HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccc Q lcl|Aclame:pro 193 G-TAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS----QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPG 267 (419) Q Consensus 193 g-~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~----~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~ 267 (419) + ..+|..+..+.......+.++..+.++.+=++.+. ++..--....++++.+.+|+.+++|+..-+..|++|.++ T Consensus 68 ~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~ 147 (296) T protein:vir:10 68 YTDDLPLVDALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPN 147 (296) T ss_pred CccccceeeccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCC Confidence 5 44788888889999999999999999877665542 578888889999999999999999998878899999999 Q ss_pred ccccccccccccchhhhHHHHHHHHHHhhhh---hccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccc Q lcl|Aclame:pro 268 IGTYQQPKPTAPATDEPPLVDIRRAKTVAEI---AGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGL 344 (419) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~ 344 (419) +........... .+..++|+..++..+.. ....+..++++|..+..|.......|..+ +...-....+.++.+. T Consensus 148 v~~~~~~~~W~~--~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~-l~~ik~~~~~l~i~~~ 224 (296) T protein:vir:10 148 INNVVSGGSWSQ--PTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSY-GEFFRQNNSGVTVEFV 224 (296) T ss_pred CccccccCCccC--HHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccH-HHHHHHhcCCceEEEe Confidence 865544433332 33679999999887654 33456789999999988865544333221 1111111122334444 Q ss_pred eeEecCCCCc-CcEEEEeccceEE-EEEecceEEEEeecccchhhcCcEEEEEEEEec-cEEecccceEEE---Eec Q lcl|Aclame:pro 345 NVVSTVAIAQ-GTALVGGFRQGAT-LWSRQGITVLMTDSHADFFTANTLVILAEFRAN-LAVYQPKAFVRV---TFA 415 (419) Q Consensus 345 pv~~~~~~~~-~~~~~~d~~~~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d-~~~~~~~a~~~~---~~~ 415 (419) |...+..... +..++++.+.-++ +.....++..-.. ...-...++...|+. ..+++|.||+++ +++ T Consensus 225 ~~l~~a~~~g~~~~v~~~~~~~~~~~~v~~~~~~~~~e-----~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 225 QYLNDYNGTGTSAAIAYEKDPNNMAIEIPEATNALPAQ-----PKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred eeeccCCCCcceEEEEEEcCCceEEEEcCcceeeeccc-----ccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 5444433211 2234444333222 2222222222111 111224566678775 678889999999 777 No 148 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.00 E-value=4.4e-10 Score=71.86 Aligned_cols=289 Identities=11% Similarity=0.045 Sum_probs=151.3 Q ss_pred hhhccc-cccc-ccCCccccc-chhhhHHHHHhhhhhhhHHhhcceeccc-CcceeeeeeccccceeccccccceeecCc Q lcl|Aclame:pro 118 LLSRDA-PAGT-ITNPNVPHL-PQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNKAAVVPEG 193 (419) Q Consensus 118 ~~~~~~-~~~~-~~~~~~~~~-p~~~~~~i~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg 193 (419) ....+. ..+. ..++....+ -+.+..++.......+.++++.++.++. ++++++|+....+ +++..-| T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG~~~---------~~~~~~G 71 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIGETE---------LQVLSPG 71 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeeeeeE---------EeeeccC Confidence 100000 0000 011122222 3778888988888899999999998875 5689999864322 2222223 Q ss_pred ccccccccceeeEEeeeEEEEE-eehhhH-HHHhhHHH-HHHHHHHHHHHHHHHHHHHHHHh---ccCccc--c---cce Q lcl|Aclame:pro 194 TAKPQSTLSFDTITTTLKTVAH-WLPITR-QAADDNSQ-LMGYIQGRLTYGLRFLRDRQLLN---GNGSTE--M---QGI 262 (419) Q Consensus 194 ~~~~~~~~~~~~v~~~~~k~~~-~~~vs~-ell~d~~~-~~~~i~~~l~~a~~~~~d~~il~---G~g~~~--p---~Gi 262 (419) +...-..+..++.++...++-. ...|.+ +=.++.-+ +-+.+..+++.++++..|+.++. -.+..+ | .++ T Consensus 72 ~~ld~~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~ 151 (364) T protein:vir:10 72 KSPDASPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPR 151 (364) T ss_pred cccCCCCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCc Confidence 3322223444555554444321 111111 01112224 66888899999999999998852 111011 0 111 Q ss_pred ecccccc-ccccccccccchhhhHHHHHHHHHHhhhhhccCCc--EEEEehHHHHHHHHHhccCCceecc--CCccccCC Q lcl|Aclame:pro 263 LTTPGIG-TYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD--GVVVHPQDWESIELDQAPGSGVFRV--IANVQGEA 337 (419) Q Consensus 263 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~~kd~~g~~~~~--~~~~~~~~ 337 (419) ....|.. ...............+++.+..+...+...+.+.. ..+++|..|..|.+-..-=.+.|.. .+....+. T Consensus 152 ~~~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~ 231 (364) T protein:vir:10 152 VAGHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGF 231 (364) T ss_pred ccCCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccce Confidence 1111110 11111122233344556666777777777766443 5689999998887632101111110 11223344 Q ss_pred CcccccceeEecCCCCcCcE-----------------------EEEeccce-EEEEEe--------cceEEEEeecccch Q lcl|Aclame:pro 338 TPRIWGLNVVSTVAIAQGTA-----------------------LVGGFRQG-ATLWSR--------QGITVLMTDSHADF 385 (419) Q Consensus 338 ~~~l~G~pv~~~~~~~~~~~-----------------------~~~d~~~~-~~~~~~--------~~~~i~~~~~~~~~ 385 (419) ...+.|+||+.|+.+|.... ..+|+... .+.|.+ .+++.++.++.. . T Consensus 232 v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~-~ 310 (364) T protein:vir:10 232 VLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKK-E 310 (364) T ss_pred eEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccc-e Confidence 56789999999999984210 11333221 222322 344444443322 1 Q ss_pred hhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 386 FTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 386 ~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) + ...+.+..-++..+++|+|.+.++.+++.+ T Consensus 311 ~---~~~ida~~a~G~g~lRPeaa~~i~~~~~~~ 341 (364) T protein:vir:10 311 K---TWYIDTFLAEGAIPDRWEAVAVVTAADTAE 341 (364) T ss_pred e---eeeeeeehcccCcccCccceEEEEecCCCC Confidence 1 123445666899999999999999988888 No 149 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.99 E-value=6.4e-11 Score=76.47 Aligned_cols=291 Identities=10% Similarity=0.049 Sum_probs=156.3 Q ss_pred hhhhHHHHHHHHHHhhhcccccccccCC-cccccchhhhHHHHHhhhhhhhHHhhcceeccc-Ccceeeeeeccccceec Q lcl|Aclame:pro 104 GQFQVEMRDIDPNRLLSRDAPAGTITNP-NVPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAG 181 (419) Q Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 181 (419) .. .+.++..-- ..+....+..... -...+ +.+..++.......+.++.+.+..++. ++++.|++....+ T Consensus 1 ~~---~~~~~~~~~-~~~~~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig~~~---- 71 (332) T protein:vir:78 1 MT---TLSNFSLPN-QANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLS---- 71 (332) T ss_pred Cc---ccccccCCc-cccCCccccccccchhhhh-hhhhhhHHHHHHHHhhhhhccccccccccceEEEEecccee---- Confidence 00 011110000 0011111111111 12344 788889988888999999998887764 6788888864322 Q ss_pred cccccceeecCcccccc-cccceeeEEeeeEEEE-EeehhhHHHHh-hH-HHHHHHHHHHHHHHHHHHHHHHHHh----c Q lcl|Aclame:pro 182 STWNKAAVVPEGTAKPQ-STLSFDTITTTLKTVA-HWLPITRQAAD-DN-SQLMGYIQGRLTYGLRFLRDRQLLN----G 253 (419) Q Consensus 182 ~~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~-~~~~vs~ell~-d~-~~~~~~i~~~l~~a~~~~~d~~il~----G 253 (419) +.....|+.+.- .+++-.++++...+.- ....|. .+-+ ++ .++.+.+.++.+.++++..|+.++. + T Consensus 72 -----~~~~~~g~~l~~~~~~~~~~~~l~ID~~ky~~~~Vd-diD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~a 145 (332) T protein:vir:78 72 -----AGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQFVY-SLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKA 145 (332) T ss_pred -----EeeecCCCCCCCCCCCCCceEEEEEehhhhhHHHHH-hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 233333444322 2344455555444321 112221 2222 12 2688999999999999999998764 2 Q ss_pred cCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCc--EEEEehHHHHHHHHHhccC--Ccee-c Q lcl|Aclame:pro 254 NGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD--GVVVHPQDWESIELDQAPG--SGVF-R 328 (419) Q Consensus 254 ~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~~kd~~--g~~~-~ 328 (419) .....|.+. .++..... .+.....+...+++.++++...+...+.+.. .++++|..+..|.+.+|.. .+.+ - T Consensus 146 a~~~~~~~~--~~g~~~~~-~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~ 222 (332) T protein:vir:78 146 SAEASPVTG--EPGGFHVN-IGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGN 222 (332) T ss_pred hcccCcccc--cccccccc-cCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccc Confidence 222222111 11111111 1122233456678888999888888887544 3567999999988754321 0000 0 Q ss_pred cCCccccC-CCcccccceeEecCCCCcCc--------------EEEEeccceE-EEEEecce--------EEEEee--cc Q lcl|Aclame:pro 329 VIANVQGE-ATPRIWGLNVVSTVAIAQGT--------------ALVGGFRQGA-TLWSRQGI--------TVLMTD--SH 382 (419) Q Consensus 329 ~~~~~~~~-~~~~l~G~pv~~~~~~~~~~--------------~~~~d~~~~~-~~~~~~~~--------~i~~~~--~~ 382 (419) ..+....+ ..+.++|++|+.++.+|... .+-++|+... +++.+..+ .+++.. .. T Consensus 223 ~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~ 302 (332) T protein:vir:78 223 SQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN 302 (332) T ss_pred cccceecceeeeEEeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccc Confidence 11112222 24678999999999998532 2334444422 22333322 222211 11 Q ss_pred cchhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 383 ADFFTANTLVILAEFRANLAVYQPKAFVRVTFA 415 (419) Q Consensus 383 ~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 415 (419) ...|. ..+++...++.++++|++.+.++-+ T Consensus 303 ~~~~~---d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 303 VQYQG---DLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred hhhhH---hhhhhhhhhcCceecccceEEEeeC Confidence 22232 2567777899999999999999888 No 150 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.96 E-value=5.6e-10 Score=71.32 Aligned_cols=299 Identities=9% Similarity=0.042 Sum_probs=163.6 Q ss_pred hhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccc---hhhhHHHHHhhhhhhhHHhh Q lcl|Aclame:pro 81 FRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLP---QLVPGIVPTTPDLPLLVADL 157 (419) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p---~~~~~~i~~~~~~~~~l~~~ 157 (419) -|......+ +.......... ...........+.... +.+.+.+++........+.+ T Consensus 1 ~~~~~~~~~-------------------~~~~~~~~~~~--~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~ 59 (319) T protein:vir:10 1 MTTKKFDEA-------------------DKSNVEMYLIQ--AGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRV 59 (319) T ss_pred CCCcchhHH-------------------hhHHHHHHHhh--ccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhh Confidence 000000000 00000000000 0000000111112211 34555666767777777777 Q ss_pred cceecccCc---ceeeeeeccccceeccccccceeecCccc-ccccccceeeEEeeeEEEEEeehhhHHHHhhHH----H Q lcl|Aclame:pro 158 LDQQNADYN---VLEYIRDTSGTAGAGSTWNKAAVVPEGTA-KPQSTLSFDTITTTLKTVAHWLPITRQAADDNS----Q 229 (419) Q Consensus 158 ~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~----~ 229 (419) +++.+..+. ++.|...+ ..+.+.|++.++. +|..+..+.......+.++..+.++..=++.+. + T Consensus 60 i~v~~~~~~~~~~~~~~~~~--------~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~ 131 (319) T protein:vir:10 60 FPVTTELSPTDKTFEYMTFD--------KVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRP 131 (319) T ss_pred cccccCCCCceEEEEeeeec--------cccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCC Confidence 776533222 23333322 2346778877544 788888888899999999999988877555442 5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccc--ccchhhhHHHHHHHHHHhhhh---hccCCc Q lcl|Aclame:pro 230 LMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPT--APATDEPPLVDIRRAKTVAEI---AGFPPD 304 (419) Q Consensus 230 ~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~---~~~~~~ 304 (419) +..--....++++.+.+|+.+++|+...+..|++|.+++......... ...+....++|+..++..+.. ....+. T Consensus 132 l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~ 211 (319) T protein:vir:10 132 LSTRKASACQLAHDQLVNRLVFKGSAPHKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRAT 211 (319) T ss_pred hHHHHHHHHHHHHHHhhceEEEeecccccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeece Confidence 888888999999999999999999988889999999998766554332 223456788999888887753 233567 Q ss_pred EEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCc-CcEEEEeccceEE-EEEecceEEEEeecc Q lcl|Aclame:pro 305 GVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQ-GTALVGGFRQGAT-LWSRQGITVLMTDSH 382 (419) Q Consensus 305 ~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~-~~~~~~d~~~~~~-~~~~~~~~i~~~~~~ 382 (419) .++|+|+.+..|.......|...+ .-.-....+.+|.+.|...+..... +..+++..+.-++ +.....++..-.. T Consensus 212 ~L~L~p~~~~~L~~~~~~~~~t~l-~~lk~~~~~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~e-- 288 (319) T protein:vir:10 212 NILIPPSMRKVLAIRMPETTMSYL-DYFKSQNSGIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPAQ-- 288 (319) T ss_pred EEEecHHHHHhhhcccCCCCeeHH-HHHHHhcCCceEEEeeeecccCCCcceEEEEEecCCceEEEecCcceeeeeee-- Confidence 899999999998654443332211 1111111122344444444332211 1223333322222 2212222222111 Q ss_pred cchhhcC-cEEEEEEEEec-cEEecccceEEEEec Q lcl|Aclame:pro 383 ADFFTAN-TLVILAEFRAN-LAVYQPKAFVRVTFA 415 (419) Q Consensus 383 ~~~~~~~-~~~~r~~~r~d-~~~~~~~a~~~~~~~ 415 (419) .++ ...+....|+. ..+++|.||++++-= T Consensus 289 ----~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 289 ----PKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred ----ecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 111 12333455655 566889999999766 No 151 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.93 E-value=8.8e-10 Score=70.23 Aligned_cols=274 Identities=14% Similarity=0.061 Sum_probs=161.1 Q ss_pred cccCCccccc---chhhhHHHHHhhhhhhhHHhhcceecccC---cceeeeeeccccceeccccccceeecCccc-cccc Q lcl|Aclame:pro 127 TITNPNVPHL---PQLVPGIVPTTPDLPLLVADLLDQQNADY---NVLEYIRDTSGTAGAGSTWNKAAVVPEGTA-KPQS 199 (419) Q Consensus 127 ~~~~~~~~~~---p~~~~~~i~~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~-~~~~ 199 (419) ..+...+... -+.+.+.+.+.+......|+++.+....+ ..+.|...+ ..+.+.|++.++. +|.. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~--------~~G~~~~~~~~~~dip~~ 72 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMT--------RSGAAKIIANGADDLPLV 72 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeec--------cceeEEEecCcccccccc Confidence 0111111111 14455677777777777787776643322 233333322 2245677777644 7888 Q ss_pred ccceeeEEeeeEEEEEeehhhHHHHhhHH----HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccc Q lcl|Aclame:pro 200 TLSFDTITTTLKTVAHWLPITRQAADDNS----QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPK 275 (419) Q Consensus 200 ~~~~~~v~~~~~k~~~~~~vs~ell~d~~----~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~ 275 (419) +..+.......+.++..+.++..=++.+. ++..--....++++++.+|+.+++|+..-+..|++|.+++....... T Consensus 73 ~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~ 152 (301) T protein:vir:80 73 DVDMVRKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPT 152 (301) T ss_pred cccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccC Confidence 88888999999999999988887665442 58888889999999999999999999888899999999876554332 Q ss_pred c-------cccchhhhHHHHHHHHHHhhhhh--c-cCCcEEEEehHHHHHHHHHh--ccCCceeccCCccccCCCccccc Q lcl|Aclame:pro 276 P-------TAPATDEPPLVDIRRAKTVAEIA--G-FPPDGVVVHPQDWESIELDQ--APGSGVFRVIANVQGEATPRIWG 343 (419) Q Consensus 276 ~-------~~~~~~~~~~~~~~~~~~~~~~~--~-~~~~~~~~~~~~~~~l~~~k--d~~g~~~~~~~~~~~~~~~~l~G 343 (419) . ....+....++|+..++.++... + ..+..++|+|+.+..|...+ +.+|...+ ...-......+|.+ T Consensus 153 ~~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl-~~l~~~~~~~~I~~ 231 (301) T protein:vir:80 153 TGVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVL-KVLQDNAWFSAIVR 231 (301) T ss_pred cccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHH-HHHHHHcCcceEEE Confidence 2 23345667899999999887542 2 34568999999999997544 33332211 11000111123444 Q ss_pred ceeEecCCCCc-CcEEEEec-cceEEEEEecceEEEEeecccchhhcCc-EEEEEEEEe-ccEEecccceEEEEec Q lcl|Aclame:pro 344 LNVVSTVAIAQ-GTALVGGF-RQGATLWSRQGITVLMTDSHADFFTANT-LVILAEFRA-NLAVYQPKAFVRVTFA 415 (419) Q Consensus 344 ~pv~~~~~~~~-~~~~~~d~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~r~~~r~-d~~~~~~~a~~~~~~~ 415 (419) .|...+..... +..+++.. .+.+.+.....++..-. -.++. .......|+ +..+++|.||++++-= T Consensus 232 ~p~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~------e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 232 VPDLAGMGTAGSDSFAVIHDSNETAELIIPMDITRHPE------EYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred cceeccCCCCcccEEEEEecCCcEEEEEecCceeeecc------eecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 44444332211 11222222 22222222222222111 11222 222345555 4577889999999766 No 152 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.87 E-value=7.5e-10 Score=70.62 Aligned_cols=296 Identities=9% Similarity=0.047 Sum_probs=147.9 Q ss_pred HHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecC Q lcl|Aclame:pro 113 IDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPE 192 (419) Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 192 (419) +.-..+... ...-+ +.-....+.++..+.....-+..+.|+..++..+-.++...+-..+...-........-.-.+. T Consensus 1 ~~~~~~~~~-~~~Ms-~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 78 (322) T protein:vir:10 1 MKLNAIMSM-LPLIA-GDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQSAD 78 (322) T ss_pred Ccccceeee-eeeee-chhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccccccccC Confidence 000000000 00000 1112222322323333334445556666555333322221111111000000000000001111 Q ss_pred cc-cccccccceeeEEeeeEEEEEeehhhHHH-HhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc-Ccccccceecccccc Q lcl|Aclame:pro 193 GT-AKPQSTLSFDTITTTLKTVAHWLPITRQA-ADDNSQLMGYIQGRLTYGLRFLRDRQLLNGN-GSTEMQGILTTPGIG 269 (419) Q Consensus 193 g~-~~~~~~~~~~~v~~~~~k~~~~~~vs~el-l~d~~~~~~~i~~~l~~a~~~~~d~~il~G~-g~~~p~Gi~~~~~~~ 269 (419) +. ..|.....++..............|.+.- ++...+..+...+..+.+++++.|..|+.+- |... .|. ++.. T Consensus 79 ~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~-~~~---~gt~ 154 (322) T protein:vir:10 79 GTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPAS-IKG---TGQP 154 (322) T ss_pred cccCCCccccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccc-ccc---cccc Confidence 11 23433333444444444444455666653 3445688888899999999999999888742 2211 110 0100 Q ss_pred cccc-ccccccchhhhHHHHHHHHHHhhhhhccCCc---EEEEehHHHHHHHHHhccCCceeccCCcc-ccCCCcccccc Q lcl|Aclame:pro 270 TYQQ-PKPTAPATDEPPLVDIRRAKTVAEIAGFPPD---GVVVHPQDWESIELDQAPGSGVFRVIANV-QGEATPRIWGL 344 (419) Q Consensus 270 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~l~~~kd~~g~~~~~~~~~-~~~~~~~l~G~ 344 (419) .... ......++....++.++.+...+...+.+.. .++++|..+..|.....-.+..|.-...+ ..+..++++|+ T Consensus 155 v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~ig~~lGf 234 (322) T protein:vir:10 155 VEFLATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGIITNWMGY 234 (322) T ss_pred cccCCCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCeeeeeeeE Confidence 0000 0001122334567788888877877777643 35789999999887654333333222222 34667789999 Q ss_pred eeEecCCCCcC------------------cEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecc Q lcl|Aclame:pro 345 NVVSTVAIAQG------------------TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQP 406 (419) Q Consensus 345 pv~~~~~~~~~------------------~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~ 406 (419) .++.++.+|.. .++++.- .++.+....++..++++..+. .+...+++..-+|..+++| T Consensus 235 ~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k-~Av~~a~~~dv~~~i~~~~~~---~~a~~I~~~~~~Ga~ri~~ 310 (322) T protein:vir:10 235 TWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTD-MALGYHSCKDIWTKVAEDPSA---SFAWRIYSAFTADCVRVED 310 (322) T ss_pred EEEEeccCCccccccccccccCCCCccceeEEEEec-CceeEEEeeeeeEEeeccCCc---chhhhhhhhhhhCceEecc Confidence 99999999842 1333332 244444455666666543331 2234467778899999999 Q ss_pred cceEEEEecCCC Q lcl|Aclame:pro 407 KAFVRVTFAAAT 418 (419) Q Consensus 407 ~a~~~~~~~aa~ 418 (419) +.++.+....+- T Consensus 311 ~gVv~i~~~e~~ 322 (322) T protein:vir:10 311 EHIFKLRLKNSL 322 (322) T ss_pred CcEEEEEEeccC Confidence 999999998877 No 153 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.86 E-value=1.8e-09 Score=68.55 Aligned_cols=295 Identities=17% Similarity=0.169 Sum_probs=154.7 Q ss_pred HHHHHHHHhhh----cccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceeccc-Ccceeeeeeccccceecccc Q lcl|Aclame:pro 110 MRDIDPNRLLS----RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTW 184 (419) Q Consensus 110 ~~~~~~~~~~~----~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 184 (419) +.......... .....++....-...+ +.+..++.......+.++++.++.++. ++++.+++....+ T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~l-e~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG~~t------- 72 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYL-KLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTGRMT------- 72 (375) T ss_pred CccccccccCccccCCccccccccchHHHHH-HHHhHHHHHHHHHHHhhhccccccccccCceEEEEeeeeeE------- Confidence 00000000000 0000011111112233 677888888888999999999988875 6688898864332 Q ss_pred ccceeecCccccc---ccccceeeEEeeeE--EEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHh---- Q lcl|Aclame:pro 185 NKAAVVPEGTAKP---QSTLSFDTITTTLK--TVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLN---- 252 (419) Q Consensus 185 ~~a~~v~Eg~~~~---~~~~~~~~v~~~~~--k~~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~---- 252 (419) +....-|+++. ..+....+.++... ++..+ .|. -+++. .++.+.+.++++.++++..|+.++. T Consensus 73 --~~~~t~G~~i~~~~~~d~~~te~~l~ID~~~y~~~-~Vd--DiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~k 147 (375) T protein:vir:10 73 --SSFHTPGTPILGNADKAPPVAEKTIVMDDLLISSA-FVY--DLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITR 147 (375) T ss_pred --EeeecCCcCcCCccccCCCCCceEEEecchhhhhh-hHh--hHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222233321 12333333333333 22221 111 12222 2788999999999999999998863 Q ss_pred ccCcccccceec--cccccccc---cccccccchhhhHHHHHHHHHHhhhhhccCC--cEEEEehHHHHHHHHHhccCC- Q lcl|Aclame:pro 253 GNGSTEMQGILT--TPGIGTYQ---QPKPTAPATDEPPLVDIRRAKTVAEIAGFPP--DGVVVHPQDWESIELDQAPGS- 324 (419) Q Consensus 253 G~g~~~p~Gi~~--~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~kd~~g- 324 (419) +.....|.+.-. .+|..... ........+...+++.+.++...+...+.+. -..+++|..|..|.+-+|.+. T Consensus 148 aa~~~~p~~~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~ 227 (375) T protein:vir:10 148 GARSASPVSATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGL 227 (375) T ss_pred hhhhccccccccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccce Confidence 322222222111 11211111 1122223456678888998888888777753 346789999998876654321 Q ss_pred --ceeccCCccccCCCcccccceeEecCCCCcCcE-------------------------------------EEEec--- Q lcl|Aclame:pro 325 --GVFRVIANVQGEATPRIWGLNVVSTVAIAQGTA-------------------------------------LVGGF--- 362 (419) Q Consensus 325 --~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~-------------------------------------~~~d~--- 362 (419) +-+.-.+....+....+.|++|+.++.+|...+ |-+|| T Consensus 228 ~n~d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~ 307 (375) T protein:vir:10 228 VNRDVQGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELG 307 (375) T ss_pred eeecccccceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeecccccccccccccc Confidence 111111112233345789999999999984321 11233 Q ss_pred cc-eEEEEEe--------cceEEEEeec-ccchhhcCcEEEEEEEEeccEEecccceEEEEecC-CCC Q lcl|Aclame:pro 363 RQ-GATLWSR--------QGITVLMTDS-HADFFTANTLVILAEFRANLAVYQPKAFVRVTFAA-ATT 419 (419) Q Consensus 363 ~~-~~~~~~~--------~~~~i~~~~~-~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~a-a~~ 419 (419) ++ ..+++.+ .+++++++.. ... .+-...+.+..-++..+.+|+|.+.++..+ ++. T Consensus 308 ~~~~~~~~~~~A~g~v~~~~~~~~~~~~~~~~--~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~~~~ 373 (375) T protein:vir:10 308 AKSCGLIFQKEAAGVVEAIGPQVQVTNGDVSV--IYQGDVILGRMAMGADYLNPAAAVELYIGATAPS 373 (375) T ss_pred CceEEEEEchhheeeeeeeccccccccchhhh--eeeeeeeeeeeeeccCccCceeEEEEecCcCccc Confidence 11 1122222 2334444320 111 112245667788999999999999998874 333 No 154 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.86 E-value=8.3e-10 Score=70.38 Aligned_cols=263 Identities=11% Similarity=-0.021 Sum_probs=148.2 Q ss_pred cccccccCCcccccchhhhHHHHHhhhhhhhHHhh---cceeccc-CcceeeeeeccccceeccccccceeecCcccccc Q lcl|Aclame:pro 123 APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADL---LDQQNAD-YNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ 198 (419) Q Consensus 123 ~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~---~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~ 198 (419) ......+.... +.+...-+.+.+.-+.-..|+++ .+..|+. +..+++|+.. ..+.+.-|+||+.+|. T Consensus 1 mAe~nlt~~~d-L~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~--------~tgda~dVaEGe~Ipl 71 (295) T protein:vir:99 1 MAEKNLNTMAD-LGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWE--------VTLDQTDPGEGETIPL 71 (295) T ss_pred CCCcccccHhh-ccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeee--------eecccccccCCcccch Confidence 11111111111 11222222333332333334444 4777876 4477777643 3457788999999999 Q ss_pred ccccee---eEEeeeEEEEEeehhhHHHHhhH--HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccc Q lcl|Aclame:pro 199 STLSFD---TITTTLKTVAHWLPITRQAADDN--SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQ 273 (419) Q Consensus 199 ~~~~~~---~v~~~~~k~~~~~~vs~ell~d~--~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~ 273 (419) +..+.. ..+++.+|++..+ |.|.++.+ .+-...-.++|..+++.+++..|+.--.++. .. T Consensus 72 skvt~~~~~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat------------~t- 136 (295) T protein:vir:99 72 SKVTRTKDKDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKP------------TK- 136 (295) T ss_pred hhheeeeeeeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCc------------ee- Confidence 998865 4777778887754 99998644 3677888999999999999999996321110 00 Q ss_pred ccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccce-eEecCCC Q lcl|Aclame:pro 274 PKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLN-VVSTVAI 352 (419) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~p-v~~~~~~ 352 (419) .........+..+..++..+...+..+.+.++||.+.+.|++-..-+...--.++. .---.++|.. |+.|..+ T Consensus 137 ---~tg~~lq~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~---~~L~nfLG~q~II~S~kv 210 (295) T protein:vir:99 137 ---VKGVGLQKALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGM---TLLKNFLGMQNVIVMPSV 210 (295) T ss_pred ---eehhhHHHHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhhhhhh---hhhhhhhccceEEEcccC Confidence 01112233455555556666666667778999999999988653222111000010 0001388997 9999999 Q ss_pred CcCcEEEEeccceEEE-EEecceEEEEeecccchhhcCcEEEEEEEE-------------eccE---EecccceEEEEec Q lcl|Aclame:pro 353 AQGTALVGGFRQGATL-WSRQGITVLMTDSHADFFTANTLVILAEFR-------------ANLA---VYQPKAFVRVTFA 415 (419) Q Consensus 353 ~~~~~~~~d~~~~~~~-~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r-------------~d~~---~~~~~a~~~~~~~ 415 (419) |.|+++..-..+..+. .+-.+..+. ..-++..|++.+.+..+ +.+. +-+++++++.++. T Consensus 211 ~~G~~~aT~~~Ni~~ay~~~~~g~l~----~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~ 286 (295) T protein:vir:99 211 PEGKIYSTAVENLVFASLNVKGGDLG----GLFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIE 286 (295) T ss_pred CCceEEEeeccceEEEEecCCchhhh----hhhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEe Confidence 9999887754442221 111111111 01123345444444322 1222 3345799999996 Q ss_pred CCCC Q lcl|Aclame:pro 416 AATT 419 (419) Q Consensus 416 aa~~ 419 (419) +.-+ T Consensus 287 ~~~~ 290 (295) T protein:vir:99 287 AAAV 290 (295) T ss_pred cCcC Confidence 6655 No 155 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.82 E-value=2.3e-10 Score=73.44 Aligned_cols=236 Identities=12% Similarity=-0.002 Sum_probs=143.5 Q ss_pred HHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeecccc Q lcl|Aclame:pro 98 RARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGT 177 (419) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 177 (419) ++. -....-+...-.....|......|++.+.+.+.|++.++......+. +-+. T Consensus 1 m~~--------------------~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~t--g~~t---- 54 (330) T protein:vir:10 1 MAT--------------------LSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPT--GHRT---- 54 (330) T ss_pred CCc--------------------CCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCc--ccce---- Confidence 000 00000000000001122223345666677777777777766433222 1111 Q ss_pred ceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 178 AGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS---QLMGYIQGRLTYGLRFLRDRQLLNGN 254 (419) Q Consensus 178 ~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~---~~~~~i~~~l~~a~~~~~d~~il~G~ 254 (419) .+..+-++++|..=++..+.+..++.+++-..+-+++.+.|.+.+.+... ++.........+++.+.+...||+|+ T Consensus 55 -~vrt~LP~~~fR~lN~g~~~s~~tt~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD 133 (330) T protein:vir:10 55 -SVRTGLPTPTWRKLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGN 133 (330) T ss_pred -eEEeecCCchhhhcCCccccccceEEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 11223457789888889999999999999999999999999999876532 46666778899999999999999998 Q ss_pred Ccccccceecc-----------------cccc------------------------------------------------ Q lcl|Aclame:pro 255 GSTEMQGILTT-----------------PGIG------------------------------------------------ 269 (419) Q Consensus 255 g~~~p~Gi~~~-----------------~~~~------------------------------------------------ 269 (419) .+.+|.++.-. .|.. T Consensus 134 ~a~~p~~F~GL~kR~~~~ta~~~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~ 213 (330) T protein:vir:10 134 DGIAPAEFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGR 213 (330) T ss_pred CCCChhhccchhhhcCCCCCCchhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCc Confidence 76655544300 0000 Q ss_pred --------------------------ccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccC Q lcl|Aclame:pro 270 --------------------------TYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPG 323 (419) Q Consensus 270 --------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~ 323 (419) ..........+...+.++.++.+...++.......+|+||.+....|++..... T Consensus 214 y~~~~~~~~w~~Gl~i~d~r~vvRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k 293 (330) T protein:vir:10 214 MEGYRTHYKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDK 293 (330) T ss_pred eeEEeeeeeeeeeeEEeCcccEEEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhc Confidence 000000112223335667777777888877777788999999999999863333 Q ss_pred CceeccCCccccCCCcccccceeEecCCCCcCcEEEE Q lcl|Aclame:pro 324 SGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVG 360 (419) Q Consensus 324 g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~ 360 (419) ++..+-.....+.....+.|+||..++++-.++-.+. T Consensus 294 ~n~~l~~~~~~g~~~t~~~gipir~~Dail~tE~~vv 330 (330) T protein:vir:10 294 IANNLTWETVSGERVMTFDGIPVQRTDALLNTESRVV 330 (330) T ss_pred ccceeeeeecCCeeeEEECCeEEEEEeeeecCccccC Confidence 3323333333333345688999999998866543333 No 156 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.79 E-value=2e-09 Score=68.29 Aligned_cols=293 Identities=12% Similarity=0.040 Sum_probs=159.4 Q ss_pred hhhhhhHHHHHHHHHHhhhcccccccccCCcccccc--hhhhHHHHHhhhhhhhHHhhcceecccCc---ceeeeeeccc Q lcl|Aclame:pro 102 KRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLP--QLVPGIVPTTPDLPLLVADLLDQQNADYN---VLEYIRDTSG 176 (419) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--~~~~~~i~~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~~~ 176 (419) .......++.........-.. ......+.++.. +.+...|++.+......+.++++.+..+. ++.+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~---~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~--- 74 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQMGV---EKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEF--- 74 (314) T ss_pred CccchHHHHHHHHHHHHhhcc---cchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeee--- Confidence 111111111111111111000 000111222222 34455566666666666666665432222 2333222 Q ss_pred cceeccccccceeecCccc-ccccccceeeEEeeeEEEEEeehhhHHHHhhHH----HHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 177 TAGAGSTWNKAAVVPEGTA-KPQSTLSFDTITTTLKTVAHWLPITRQAADDNS----QLMGYIQGRLTYGLRFLRDRQLL 251 (419) Q Consensus 177 ~~~~~~~~~~a~~v~Eg~~-~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~----~~~~~i~~~l~~a~~~~~d~~il 251 (419) ...+.+.|++.++. +|..+..+.......+.++..+.++..=++.+. ++..--....++++.+.+|+.++ T Consensus 75 -----e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f 149 (314) T protein:vir:10 75 -----DGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVW 149 (314) T ss_pred -----ccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEE Confidence 23356778887644 888888899999999999999999876555432 58888889999999999999999 Q ss_pred hccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhh---hccCCcEEEEehHHHHHHHHHhccCCceec Q lcl|Aclame:pro 252 NGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEI---AGFPPDGVVVHPQDWESIELDQAPGSGVFR 328 (419) Q Consensus 252 ~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~ 328 (419) +|+...+..|++|.+++......... .+....++|+..++..+.. ....+..++|+|..+..|....+.+|..++ T Consensus 150 ~G~~~~g~~GLlN~p~v~~~~~~~~W--aT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~tvl 227 (314) T protein:vir:10 150 SGSAPHGIVSVFDQPNINNVVATPNW--SVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLSYG 227 (314) T ss_pred eecccccceeEeecCCCccccCCCCc--ccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCccHH Confidence 99988889999999987655443333 3556779999999988864 234566799999988776543333332111 Q ss_pred cCCccccCCCcccccceeEecCCCCcCc-EEEEeccceEEEE-EecceEEEEeecccchhhcCc--EEEEEEEEe-ccEE Q lcl|Aclame:pro 329 VIANVQGEATPRIWGLNVVSTVAIAQGT-ALVGGFRQGATLW-SRQGITVLMTDSHADFFTANT--LVILAEFRA-NLAV 403 (419) Q Consensus 329 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~-~~~~d~~~~~~~~-~~~~~~i~~~~~~~~~~~~~~--~~~r~~~r~-d~~~ 403 (419) .-.-....+-+|.+.|-..+......+ .+++..+.-++-+ ....++..- .+... .......|+ +..+ T Consensus 228 -~~l~~n~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~-------~e~~~~~~~~~~~~r~~Gv~i 299 (314) T protein:vir:10 228 -ELFTRNNPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTNVLP-------AQPKDLHFRYPVTSKATGLIV 299 (314) T ss_pred -HHHHHhCCCcEEEEcccccccCCCcceEEEEEecCCcEEEEecCccceeec-------ceecCceEEEcceeeeEEEEE Confidence 000001112234444444433322222 2333222222221 112222111 11111 223345566 4567 Q ss_pred ecccceEEE---Eec Q lcl|Aclame:pro 404 YQPKAFVRV---TFA 415 (419) Q Consensus 404 ~~~~a~~~~---~~~ 415 (419) ++|.||+++ +++ T Consensus 300 ~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 300 YRPLTMAVIKGITFA 314 (314) T ss_pred ECcceeEeeeeeecC Confidence 889999965 555 No 157 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=98.74 E-value=4.8e-09 Score=66.19 Aligned_cols=304 Identities=9% Similarity=0.059 Sum_probs=161.8 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhccccccccc--C-Ccccccc--hhhhHHHHHhhhhhhhH Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTIT--N-PNVPHLP--QLVPGIVPTTPDLPLLV 154 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-~~~~~~p--~~~~~~i~~~~~~~~~l 154 (419) ..-.+ .++..+. .+.............+... . .+..+.. +.+...|++........ T Consensus 1 ~~~~~-------------~~~~~~~------d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~ 61 (329) T protein:vir:79 1 MRGNI-------------MSKEMKY------DEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSA 61 (329) T ss_pred Cccch-------------hhhhhcc------chhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccch Confidence 00000 0000000 0000000000111111111 1 1112211 34556677777777777 Q ss_pred HhhcceecccCc---ceeeeeeccccceeccccccceeecCc-ccccccccceeeEEeeeEEEEEeehhhHHHHhhHH-- Q lcl|Aclame:pro 155 ADLLDQQNADYN---VLEYIRDTSGTAGAGSTWNKAAVVPEG-TAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-- 228 (419) Q Consensus 155 ~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~a~~v~Eg-~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-- 228 (419) +.++++.+..+. .+.|...+ ..+.+.|++.+ ..+|..+..+..-....+.++..+.++..=++.+. T Consensus 62 ~~~i~i~~~~~~~~~~~t~~~~~--------~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~ 133 (329) T protein:vir:79 62 LRVFPVTSELSDTDKTFEYQTFD--------KVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRT 133 (329) T ss_pred hhhcccccCCCCceeEEEeeeee--------cceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHh Confidence 777776543322 33333322 23456787765 46787788888888888888888888876555432 Q ss_pred --HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccc----ccccchhhhHHHHHHHHHHhhhhh--c Q lcl|Aclame:pro 229 --QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPK----PTAPATDEPPLVDIRRAKTVAEIA--G 300 (419) Q Consensus 229 --~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~--~ 300 (419) ++..--....++++.+.+|+.+++|++..+..|++|.+++....... ..+..+....++|+..++.++... + T Consensus 134 g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g 213 (329) T protein:vir:79 134 GKSLSTRKANAAQNAHDQLVNHLVFKGSKPHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNG 213 (329) T ss_pred CCChHHHHHHHHHHHHHHhhccEEEeecccccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCc Confidence 58888889999999999999999999888889999999987654432 233456677899999998888643 2 Q ss_pred -cCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCC-cCcEEEEeccceEEEE-EecceEEE Q lcl|Aclame:pro 301 -FPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIA-QGTALVGGFRQGATLW-SRQGITVL 377 (419) Q Consensus 301 -~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~-~~~~~~~d~~~~~~~~-~~~~~~i~ 377 (419) ..+..++|+|+.+..|.......|...+- -.-....+-+|.+.|-..+.... .+..++++...-++-+ .-..++.. T Consensus 214 ~~~p~~L~Lpp~~~~~L~~~~~~~~~tvl~-~lk~~~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l 292 (329) T protein:vir:79 214 QHRANMILIPPSMRKVLMVRMPETTMSYLD-YFKQQNGGITIESISELEDIDGAGTKAALVYEKDPMNMSIEIPEAFNML 292 (329) T ss_pred eecccEEEecHHHHHHhhcccCCCCccHHH-HHHHhCCCcEEEEcccccccCCCCceEEEEEecCCceEEEecCcceeee Confidence 24568999999988886444333322111 00000111233344433332211 1223444333333222 11222221 Q ss_pred EeecccchhhcCc--EEEEEEEEec-cEEecccceEEEEecCCC Q lcl|Aclame:pro 378 MTDSHADFFTANT--LVILAEFRAN-LAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 378 ~~~~~~~~~~~~~--~~~r~~~r~d-~~~~~~~a~~~~~~~aa~ 418 (419) - .+... .......|+. ..+++|.||++++-=-.- T Consensus 293 ~-------~q~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 293 T-------AQPKDLHFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred e-------ceecCceEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 1 11111 2233455655 566789999887533222 No 158 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=98.72 E-value=2.6e-09 Score=67.66 Aligned_cols=235 Identities=9% Similarity=0.025 Sum_probs=143.9 Q ss_pred hhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccch-hhhHHHHHhhhhhhhHHhhcceecccCcc-eeeeeecccc Q lcl|Aclame:pro 100 RDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQ-LVPGIVPTTPDLPLLVADLLDQQNADYNV-LEYIRDTSGT 177 (419) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~-~~~~~i~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~ 177 (419) ....+.....+.+. ...+-|. .+...|++.+.+.+.|+..++......+. ..+.+. T Consensus 1 m~~~~~~~~TL~e~------------------Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vr---- 58 (331) T protein:vir:10 1 MPTLSTTNPTLADV------------------AARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVR---- 58 (331) T ss_pred CCccccCcccHHHH------------------HHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEE---- Confidence 00000000000000 0001111 12334666777778888888877644332 112222 Q ss_pred ceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 178 AGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS---QLMGYIQGRLTYGLRFLRDRQLLNGN 254 (419) Q Consensus 178 ~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~---~~~~~i~~~l~~a~~~~~d~~il~G~ 254 (419) .+-++++|..=++..+.++.++.+++-..+-+++.+.|.+.+.+... ++...-...+.+++...+...||+|+ T Consensus 59 ----t~LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD 134 (331) T protein:vir:10 59 ----SGLPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGD 134 (331) T ss_pred ----eccCCchhhccCCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 23457889888999999999999999999999999999999887543 45566677789999999999999998 Q ss_pred Cccccccee------cc-----------cccc------------------------------------------------ Q lcl|Aclame:pro 255 GSTEMQGIL------TT-----------PGIG------------------------------------------------ 269 (419) Q Consensus 255 g~~~p~Gi~------~~-----------~~~~------------------------------------------------ 269 (419) .+.+|.++. +. .|.. T Consensus 135 ~a~~p~~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~ 214 (331) T protein:vir:10 135 SSIDAEKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQ 214 (331) T ss_pred cccChhhhccchhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeee Confidence 665554442 00 0000 Q ss_pred ------------------------cccccccc-ccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCC Q lcl|Aclame:pro 270 ------------------------TYQQPKPT-APATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGS 324 (419) Q Consensus 270 ------------------------~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g 324 (419) ........ .+.+..+.++.++.+...++.......+|+||.+....|++...+.+ T Consensus 215 ~y~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~ 294 (331) T protein:vir:10 215 GYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKV 294 (331) T ss_pred EEEEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhcc Confidence 00000000 11233456677778888887776666789999999999998744333 Q ss_pred c-eeccCCccccCCCcccccceeEecCCCCcCcEEEE Q lcl|Aclame:pro 325 G-VFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVG 360 (419) Q Consensus 325 ~-~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~ 360 (419) . +.+......+.....+.|+||..++.+-.++-.+. T Consensus 295 ~~~~~~~~~~~g~~~t~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 295 AASTLTMEEIAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred ceeeeeeeecCCcceeEECCeeEEEeeeeecCccccC Confidence 3 33333344444555788999999998865543333 No 159 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=98.72 E-value=2.6e-09 Score=67.66 Aligned_cols=235 Identities=9% Similarity=0.025 Sum_probs=143.9 Q ss_pred hhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccch-hhhHHHHHhhhhhhhHHhhcceecccCcc-eeeeeecccc Q lcl|Aclame:pro 100 RDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQ-LVPGIVPTTPDLPLLVADLLDQQNADYNV-LEYIRDTSGT 177 (419) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~-~~~~~i~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~ 177 (419) ....+.....+.+. ...+-|. .+...|++.+.+.+.|+..++......+. ..+.+. T Consensus 1 m~~~~~~~~TL~e~------------------Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vr---- 58 (331) T protein:vir:10 1 MPTLSTTNPTLADV------------------AARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVR---- 58 (331) T ss_pred CCccccCcccHHHH------------------HHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEE---- Confidence 00000000000000 0001111 12334666777778888888877644332 112222 Q ss_pred ceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 178 AGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS---QLMGYIQGRLTYGLRFLRDRQLLNGN 254 (419) Q Consensus 178 ~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~---~~~~~i~~~l~~a~~~~~d~~il~G~ 254 (419) .+-++++|..=++..+.++.++.+++-..+-+++.+.|.+.+.+... ++...-...+.+++...+...||+|+ T Consensus 59 ----t~LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD 134 (331) T protein:vir:10 59 ----SGLPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGD 134 (331) T ss_pred ----eccCCchhhccCCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 23457889888999999999999999999999999999999887543 45566677789999999999999998 Q ss_pred Cccccccee------cc-----------cccc------------------------------------------------ Q lcl|Aclame:pro 255 GSTEMQGIL------TT-----------PGIG------------------------------------------------ 269 (419) Q Consensus 255 g~~~p~Gi~------~~-----------~~~~------------------------------------------------ 269 (419) .+.+|.++. +. .|.. T Consensus 135 ~a~~p~~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~ 214 (331) T protein:vir:10 135 SSIDAEKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQ 214 (331) T ss_pred cccChhhhccchhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeee Confidence 665554442 00 0000 Q ss_pred ------------------------cccccccc-ccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCC Q lcl|Aclame:pro 270 ------------------------TYQQPKPT-APATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGS 324 (419) Q Consensus 270 ------------------------~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g 324 (419) ........ .+.+..+.++.++.+...++.......+|+||.+....|++...+.+ T Consensus 215 ~y~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~ 294 (331) T protein:vir:10 215 GYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKV 294 (331) T ss_pred EEEEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhcc Confidence 00000000 11233456677778888887776666789999999999998744333 Q ss_pred c-eeccCCccccCCCcccccceeEecCCCCcCcEEEE Q lcl|Aclame:pro 325 G-VFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVG 360 (419) Q Consensus 325 ~-~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~ 360 (419) . +.+......+.....+.|+||..++.+-.++-.+. T Consensus 295 ~~~~~~~~~~~g~~~t~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 295 AASTLTMEEIAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred ceeeeeeeecCCcceeEECCeeEEEeeeeecCccccC Confidence 3 33333344444555788999999998865543333 No 160 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=98.72 E-value=2.6e-09 Score=67.66 Aligned_cols=235 Identities=9% Similarity=0.025 Sum_probs=143.9 Q ss_pred hhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccch-hhhHHHHHhhhhhhhHHhhcceecccCcc-eeeeeecccc Q lcl|Aclame:pro 100 RDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQ-LVPGIVPTTPDLPLLVADLLDQQNADYNV-LEYIRDTSGT 177 (419) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~-~~~~~i~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~ 177 (419) ....+.....+.+. ...+-|. .+...|++.+.+.+.|+..++......+. ..+.+. T Consensus 1 m~~~~~~~~TL~e~------------------Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vr---- 58 (331) T protein:vir:98 1 MPTLSTTNPTLADV------------------AARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVR---- 58 (331) T ss_pred CCccccCcccHHHH------------------HHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEE---- Confidence 00000000000000 0001111 12334666777778888888877644332 112222 Q ss_pred ceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 178 AGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS---QLMGYIQGRLTYGLRFLRDRQLLNGN 254 (419) Q Consensus 178 ~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~---~~~~~i~~~l~~a~~~~~d~~il~G~ 254 (419) .+-++++|..=++..+.++.++.+++-..+-+++.+.|.+.+.+... ++...-...+.+++...+...||+|+ T Consensus 59 ----t~LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD 134 (331) T protein:vir:98 59 ----SGLPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGD 134 (331) T ss_pred ----eccCCchhhccCCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 23457889888999999999999999999999999999999887543 45566677789999999999999998 Q ss_pred Cccccccee------cc-----------cccc------------------------------------------------ Q lcl|Aclame:pro 255 GSTEMQGIL------TT-----------PGIG------------------------------------------------ 269 (419) Q Consensus 255 g~~~p~Gi~------~~-----------~~~~------------------------------------------------ 269 (419) .+.+|.++. +. .|.. T Consensus 135 ~a~~p~~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~ 214 (331) T protein:vir:98 135 SSIDAEKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQ 214 (331) T ss_pred cccChhhhccchhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeee Confidence 665554442 00 0000 Q ss_pred ------------------------cccccccc-ccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCC Q lcl|Aclame:pro 270 ------------------------TYQQPKPT-APATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGS 324 (419) Q Consensus 270 ------------------------~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g 324 (419) ........ .+.+..+.++.++.+...++.......+|+||.+....|++...+.+ T Consensus 215 ~y~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~ 294 (331) T protein:vir:98 215 GYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKV 294 (331) T ss_pred EEEEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhcc Confidence 00000000 11233456677778888887776666789999999999998744333 Q ss_pred c-eeccCCccccCCCcccccceeEecCCCCcCcEEEE Q lcl|Aclame:pro 325 G-VFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVG 360 (419) Q Consensus 325 ~-~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~ 360 (419) . +.+......+.....+.|+||..++.+-.++-.+. T Consensus 295 ~~~~~~~~~~~g~~~t~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:98 295 AASTLTMEEIAGKKVVAFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred ceeeeeeeecCCcceeEECCeeEEEeeeeecCccccC Confidence 3 33333344444555788999999998865543333 No 161 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.72 E-value=3.7e-09 Score=66.82 Aligned_cols=270 Identities=11% Similarity=0.007 Sum_probs=143.0 Q ss_pred HhhhcccccccccCCcc--cccchhhhHHHHHhhhhhhhHHhhcceecccCc-ceeeeeeccccceeccccccceeecCc Q lcl|Aclame:pro 117 RLLSRDAPAGTITNPNV--PHLPQLVPGIVPTTPDLPLLVADLLDQQNADYN-VLEYIRDTSGTAGAGSTWNKAAVVPEG 193 (419) Q Consensus 117 ~~~~~~~~~~~~~~~~~--~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~a~~v~Eg 193 (419) -.+++.-.....+.... ...--.+.+.+......-..++...+..|+..+ .++.+ +.|.....+.-|+|| T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~-------k~~~y~gda~dVaEG 73 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTY-------AGYDVTLAEGNVPEG 73 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeec-------cceeeeeccccccCC Confidence 11122211111111111 111122333333333333334444477787644 44222 233455678899999 Q ss_pred ccccccccceee---EEeeeEEEEEeehhhHHHHhhH--HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccc Q lcl|Aclame:pro 194 TAKPQSTLSFDT---ITTTLKTVAHWLPITRQAADDN--SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGI 268 (419) Q Consensus 194 ~~~~~~~~~~~~---v~~~~~k~~~~~~vs~ell~d~--~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~ 268 (419) +.+|.+..+... .+++.+|++..+ |.|.++.+ .+-...-.++|..+++.+++..++.--.++. T Consensus 74 e~Iplskvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT---------- 141 (296) T protein:vir:98 74 EVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT---------- 141 (296) T ss_pred cccchhhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhccc---------- Confidence 999999988754 777788888775 99998644 3677888999999999999999986321110 Q ss_pred cccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCc-ccccceeE Q lcl|Aclame:pro 269 GTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATP-RIWGLNVV 347 (419) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~-~l~G~pv~ 347 (419) ... ......--.....-+.++...++.......+.++||.+.+.+++-..-+-+..+ ++.-. .++|.-|+ T Consensus 142 --~t~-~~t~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg~a~it~qt~f------G~tyl~nfLG~~II 212 (296) T protein:vir:98 142 --GTQ-DALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGITTQTAF------GLTYLVDFTGTVII 212 (296) T ss_pred --cee-eechhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhcCCccchhhee------chhhhhhccccEEE Confidence 000 000000000011112222334454443456789999999887742211111111 11111 27899999 Q ss_pred ecCCCCcCcEEEEeccceEEEE-EecceEEEEeecccchhhcCcEEEEEEEE-------------eccE---EecccceE Q lcl|Aclame:pro 348 STVAIAQGTALVGGFRQGATLW-SRQGITVLMTDSHADFFTANTLVILAEFR-------------ANLA---VYQPKAFV 410 (419) Q Consensus 348 ~~~~~~~~~~~~~d~~~~~~~~-~~~~~~i~~~~~~~~~~~~~~~~~r~~~r-------------~d~~---~~~~~a~~ 410 (419) .|..+|.|+++..-..+..+.+ +-.+..+- .. -.+..|++.+.+..+ +.+. +-++++++ T Consensus 213 ~S~kV~~G~~~~T~~~Ni~~ay~~~~~~~l~--~~--f~~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv 288 (296) T protein:vir:98 213 STNDVTKGEIWATVPENIIFAYINPNNSELA--KE--FNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIV 288 (296) T ss_pred EcCcCCCceEEEeeecceEEEeecccccchh--hh--hccccccccceEEEeccccceeeehhHhHhHHHhcccccceEE Confidence 9999999998887655432221 11111111 11 112334444443322 1222 34467999 Q ss_pred EEEecCCC Q lcl|Aclame:pro 411 RVTFAAAT 418 (419) Q Consensus 411 ~~~~~aa~ 418 (419) +.++++++ T Consensus 289 ~~tI~~~~ 296 (296) T protein:vir:98 289 KVTLTPGV 296 (296) T ss_pred EEEecCCC Confidence 99999999 No 162 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=98.67 E-value=5.8e-09 Score=65.72 Aligned_cols=273 Identities=14% Similarity=0.044 Sum_probs=129.4 Q ss_pred CCcccccchhhhHHHHHhhhhhhhHHhhccee---cc---cCcceeeeeeccccceeccccccceeecCcccccccccce Q lcl|Aclame:pro 130 NPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQ---NA---DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSF 203 (419) Q Consensus 130 ~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~---~~---~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 203 (419) -....+.|+.+...+...++....+..++..- .. .+..+++++........ ....-.+++..+.-.+.+- T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~ 76 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHT----RKLRGAGAERNLTVSDFTE 76 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccccccee----eeccccccCCccccccccc Confidence 12345889999999999888888887776432 21 24456666432211110 0111123344444445555 Q ss_pred eeEEeeeEEE-EEeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccch Q lcl|Aclame:pro 204 DTITTTLKTV-AHWLPITRQ-AADDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPAT 281 (419) Q Consensus 204 ~~v~~~~~k~-~~~~~vs~e-ll~d~~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~ 281 (419) ..+++...+. +.-+.|+++ ...+..++...+.++..++++.++|..++.-- .+.+.+ ......... T Consensus 77 ~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~-~~a~~~-----------~~~~~~~~~ 144 (392) T protein:vir:99 77 DSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYE-----------AAGAVHEVA 144 (392) T ss_pred ceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-hccccc-----------ccccccccC Confidence 5566655332 333456666 44556688888888889999999999887421 111110 111112223 Q ss_pred hhhHHHHHHHHHHhhhhhccCCc-EEEEehHHHHHHHHHhccCCceec---cCCccccCCCcccccceeEecCCCCcCcE Q lcl|Aclame:pro 282 DEPPLVDIRRAKTVAEIAGFPPD-GVVVHPQDWESIELDQAPGSGVFR---VIANVQGEATPRIWGLNVVSTVAIAQGTA 357 (419) Q Consensus 282 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~kd~~g~~~~---~~~~~~~~~~~~l~G~pv~~~~~~~~~~~ 357 (419) ....|+.+..+...|...+.+.. .++++|..+..|++...-....+. .......+..+++.|++|+.++.+|.+.. T Consensus 145 ~~~~~~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~ 224 (392) T protein:vir:99 145 PDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDA 224 (392) T ss_pred hhhhHHHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeecccccccc Confidence 34567788888877776665433 467899988887743110000000 00112345667899999999999998776 Q ss_pred EEEeccceEEEEEecc-----------------eEEEEeecccchhhcCcEEEEE---EEEe----ccEEecccceEEE- Q lcl|Aclame:pro 358 LVGGFRQGATLWSRQG-----------------ITVLMTDSHADFFTANTLVILA---EFRA----NLAVYQPKAFVRV- 412 (419) Q Consensus 358 ~~~d~~~~~~~~~~~~-----------------~~i~~~~~~~~~~~~~~~~~r~---~~r~----d~~~~~~~a~~~~- 412 (419) +.+..+...... +.. +...+.......+..+...+.. ...+ .........+... T Consensus 225 ~a~~~~a~~~at-~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~ 303 (392) T protein:vir:99 225 YLYHPTAFIMAT-RAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIP 303 (392) T ss_pred eeeecccccccc-ccccccccccceeEEecccceecceeecccceeeccccccceeEEEEEEeeccccceeeeeeeeeec Confidence 555432211110 000 0000000000001111111000 0000 0000000000000 Q ss_pred ------EecCCCC Q lcl|Aclame:pro 413 ------TFAAATT 419 (419) Q Consensus 413 ------~~~aa~~ 419 (419) .+..++. T Consensus 304 ~~v~v~~v~~~~~ 316 (392) T protein:vir:99 304 GSIEVAPEAGANA 316 (392) T ss_pred ceeeeeeeecccc Confidence 0000000 No 163 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.63 E-value=7e-09 Score=65.30 Aligned_cols=265 Identities=10% Similarity=0.015 Sum_probs=139.0 Q ss_pred ccccccccc-CCcccccchhhhHHHHHhhhhhhhHHhhcceecccCc-ceeeeeeccccceeccccccceeecCcccccc Q lcl|Aclame:pro 121 RDAPAGTIT-NPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYN-VLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ 198 (419) Q Consensus 121 ~~~~~~~~~-~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~ 198 (419) ...+...+. ..-+...--.+.+.+......-..++...+..|+..+ .++.++.. .+.....++-|+||+.+|. T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~-----~~~y~gda~dVaEGe~Ipl 75 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFK-----VEDSEKPNGDVAEGDVIPL 75 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeee-----ceeeccccccccCCcccch Confidence 001110000 0000111122333333333333334444577777643 45444322 2234467888999999999 Q ss_pred ccccee---eEEeeeEEEEEeehhhHHHHhhH--HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccc Q lcl|Aclame:pro 199 STLSFD---TITTTLKTVAHWLPITRQAADDN--SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQ 273 (419) Q Consensus 199 ~~~~~~---~v~~~~~k~~~~~~vs~ell~d~--~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~ 273 (419) ++.+.. ..+++.+|++..+ |.|.++.+ .+-...-.++|..++..+++..|+.--.++ +... T Consensus 76 skvt~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lkta------------T~t~ 141 (303) T protein:vir:10 76 TKVTREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSA------------IENG 141 (303) T ss_pred hhheeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhc------------cccc Confidence 998754 5788889988855 99998644 367788889999999999999998621110 0000 Q ss_pred ccccccchhhhHHHHHHHHHHhh-------hhhccCCcEEEEehHHHHHHHHHhccCCc-eeccCCccccCCCcccccce Q lcl|Aclame:pro 274 PKPTAPATDEPPLVDIRRAKTVA-------EIAGFPPDGVVVHPQDWESIELDQAPGSG-VFRVIANVQGEATPRIWGLN 345 (419) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~l~~~kd~~g~-~~~~~~~~~~~~~~~l~G~p 345 (419) ....+.....+.+..++... ...+ .+.+.++||.+.+.|+.-..-+.+ ..+-.... -.++|.. T Consensus 142 ---~~t~~t~~s~~glq~Al~~~~~kl~~~~ed~-~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~L-----~nfLG~~ 212 (303) T protein:vir:10 142 ---KRTNKTKLSAENLQGALSKGRANLSVLLDDE-ITPIAFVNPNDTAEYLANGFINSTGAQFGVNLL-----TPYVGVK 212 (303) T ss_pred ---ccccceeecHHHHHHHHHhhhhhcccccccc-ccEEEEEchHHHHHHhhcCCcchhhhhhhhhhh-----hhhhcce Confidence 00001111233444444322 2222 334889999999998753211111 11100001 1388999 Q ss_pred eEecCCCCcCcEEEEeccceEE--EEEecceEEEEeecccchhhcCcEEEEEEEE-------------eccE---Eeccc Q lcl|Aclame:pro 346 VVSTVAIAQGTALVGGFRQGAT--LWSRQGITVLMTDSHADFFTANTLVILAEFR-------------ANLA---VYQPK 407 (419) Q Consensus 346 v~~~~~~~~~~~~~~d~~~~~~--~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r-------------~d~~---~~~~~ 407 (419) |+.|..+|.|+++..-..+..+ .-.++.+.- .-.|..|.+.+.+..+ +.+. +-+++ T Consensus 213 II~S~kv~~G~~~~T~~~Ni~~ay~~~~g~l~~------~f~~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~d 286 (303) T protein:vir:10 213 IVEFADVPQGEVWMTVAENLNVAYANPRGELSR------AFAFATDATGFVGVLHDIQPQRLTSDTIYASAISMFPENID 286 (303) T ss_pred EEEeccCCCceEEEeeccceEEEEecCchhhhh------hhhhccccccceEEEeccccceeeehhHhHhHHHhcccccc Confidence 9999999999988875444222 211211111 1123344444444322 1222 33457 Q ss_pred ceEEEEecCCCC Q lcl|Aclame:pro 408 AFVRVTFAAATT 419 (419) Q Consensus 408 a~~~~~~~aa~~ 419 (419) ++++.++.+.-. T Consensus 287 giv~~ti~~~e~ 298 (303) T protein:vir:10 287 AVIKVTIKKDEA 298 (303) T ss_pred eEEEEEEecccc Confidence 899998854432 No 164 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=98.63 E-value=2.6e-09 Score=67.63 Aligned_cols=237 Identities=11% Similarity=-0.038 Sum_probs=139.6 Q ss_pred HHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeecccc Q lcl|Aclame:pro 98 RARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGT 177 (419) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 177 (419) ++. .......+.+. .....+......|++.+.+.+.|+..++......+. +-+. T Consensus 1 m~~--~~~~a~TL~E~------------------Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~t--g~~~---- 54 (335) T protein:vir:73 1 MAL--IGQTLPSLLDI------------------YNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGS--KHKT---- 54 (335) T ss_pred CCc--CCCCchhHHHH------------------HhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCc--ccce---- Confidence 000 00000000000 000111122234666677777777777765433221 1111 Q ss_pred ceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 178 AGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLNGN 254 (419) Q Consensus 178 ~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~G~ 254 (419) .+..+-++++|..=++..+.+..++.+++-..+-+++.+.|.+.+.+-+ .++.........+++.+.+...||+|| T Consensus 55 -~vrt~LP~~~fR~lN~g~~~s~~tt~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD 133 (335) T protein:vir:73 55 -TIRAGIPEPVWRRYNQGVQPTKTQTVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGN 133 (335) T ss_pred -eEEEecCCchhhhcCCccccccceEEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 1122345778988889999999999999999999999999999887643 246676777899999999999999998 Q ss_pred Ccccccceecc--------------------cccc--------------------------------------------- Q lcl|Aclame:pro 255 GSTEMQGILTT--------------------PGIG--------------------------------------------- 269 (419) Q Consensus 255 g~~~p~Gi~~~--------------------~~~~--------------------------------------------- 269 (419) .+.+|.++.-. .|.. T Consensus 134 sa~~p~~FdGL~kR~~~~st~~a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~ 213 (335) T protein:vir:73 134 TDAEPEAFMGLAPRFNTLSTSKAASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGG 213 (335) T ss_pred cCCChhhccchhhhhcCccccccCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCC Confidence 76666544310 0000 Q ss_pred ---------------------------ccccccc-cccchhhhHHHHHHHHHH--hhhhhccCCcEEEEehHHHHHHHHH Q lcl|Aclame:pro 270 ---------------------------TYQQPKP-TAPATDEPPLVDIRRAKT--VAEIAGFPPDGVVVHPQDWESIELD 319 (419) Q Consensus 270 ---------------------------~~~~~~~-~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~l~~~ 319 (419) ....... ..+....++++.++.++. .++.......+|+||.+....|++. T Consensus 214 ~y~~~~~~~~w~~Gl~i~d~r~vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q 293 (335) T protein:vir:73 214 QFRAYRDEFKWDIGLSVRDWRSISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQ 293 (335) T ss_pred EEeEEEeeeeeeeeeEEeCcccEEEEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHH Confidence 0000000 012233455666666663 3444343446899999999999987 Q ss_pred hccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEe Q lcl|Aclame:pro 320 QAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGG 361 (419) Q Consensus 320 kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d 361 (419) ....++..+-.....+...-.++|+||..++++-.++-.+.- T Consensus 294 ~~~~~n~~l~~~~~~g~~~t~~~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 294 AMNAKNVNLTIEEYGGKKIVSFLGIPIRRVDAILNTESAVTA 335 (335) T ss_pred HhccCceeeeeeccCCceeEEECCeEEEEEeeeecCcccccC Confidence 554455444444444444456889999999988655433222 No 165 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.62 E-value=3.3e-09 Score=67.07 Aligned_cols=288 Identities=11% Similarity=0.037 Sum_probs=144.5 Q ss_pred hhhccc-cccc-ccCCccccc-chhhhHHHHHhhhhhhhHHhhcceeccc-CcceeeeeeccccceeccccccceeecCc Q lcl|Aclame:pro 118 LLSRDA-PAGT-ITNPNVPHL-PQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNKAAVVPEG 193 (419) Q Consensus 118 ~~~~~~-~~~~-~~~~~~~~~-p~~~~~~i~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg 193 (419) ....+. ..+. ..++....+ -+.+..++.......+.++++.++.++. ++++++|+....+ +++..-| T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG~~~---------a~y~~~G 71 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETE---------LQVLAPG 71 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEeeeE---------Eeeeccc Confidence 100000 0000 011122222 3778888888888899999999998875 5689999864322 2222223 Q ss_pred ccccccccceeeEEeeeEEEEEeehhhHHHH---hhH---HH-HHHHHHHHHHHHHHHHHHHHHHh---ccC---c---- Q lcl|Aclame:pro 194 TAKPQSTLSFDTITTTLKTVAHWLPITRQAA---DDN---SQ-LMGYIQGRLTYGLRFLRDRQLLN---GNG---S---- 256 (419) Q Consensus 194 ~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell---~d~---~~-~~~~i~~~l~~a~~~~~d~~il~---G~g---~---- 256 (419) +...-..+..++..+....+= +++.++ ++. -+ +-+.+.+++++++++..|+.+|. ..+ + T Consensus 72 ~~ldg~~~~~~k~~ItID~lL----~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~ 147 (402) T protein:vir:97 72 QSPNATPTQADKNQLVIDTTV----IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAER 147 (402) T ss_pred cccCCCCcccccEEEEeCcee----echhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 332222344445444444332 222222 222 24 56888999999999999997753 111 0 Q ss_pred ccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCc--EEEEehHHHHHHHHHhccCCceec--cCCc Q lcl|Aclame:pro 257 TEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD--GVVVHPQDWESIELDQAPGSGVFR--VIAN 332 (419) Q Consensus 257 ~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~~kd~~g~~~~--~~~~ 332 (419) ..|.+......+ ...............+.+.+..+...+...+.+.. ..+++|..|..|++-..--.+.|. -.+. T Consensus 148 ~~~~~~~~g~s~-~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~ 226 (402) T protein:vir:97 148 NKPRVKGHGFSI-NVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGA 226 (402) T ss_pred ccCccccccccc-ccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCc Confidence 112221111111 11111111223445556667777777766665443 578999999988863211111111 0112 Q ss_pred cccCCCcccccceeEecCCCCcCcE-----------------EEEeccceE-EEEEecce-EEEEeecccchhhc--C-c Q lcl|Aclame:pro 333 VQGEATPRIWGLNVVSTVAIAQGTA-----------------LVGGFRQGA-TLWSRQGI-TVLMTDSHADFFTA--N-T 390 (419) Q Consensus 333 ~~~~~~~~l~G~pv~~~~~~~~~~~-----------------~~~d~~~~~-~~~~~~~~-~i~~~~~~~~~~~~--~-~ 390 (419) ...+....+.|+||+.|+.+|.+.. +-+|+.... ++|.+..+ +++.-+-..+.|.. . . T Consensus 227 ~~~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~ 306 (402) T protein:vir:97 227 TINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKT 306 (402) T ss_pred cccceeEEEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHH Confidence 3344556799999999999986310 113444322 22333222 11111100111110 0 0 Q ss_pred EEEEEEEEeccEEecccceEEEEecC----CCC Q lcl|Aclame:pro 391 LVILAEFRANLAVYQPKAFVRVTFAA----ATT 419 (419) Q Consensus 391 ~~~r~~~r~d~~~~~~~a~~~~~~~a----a~~ 419 (419) ..+.+..-++..+++|+|..++...- +.+ T Consensus 307 ~~id~~~a~G~g~~RPeaa~vv~~~~~~t~~~~ 339 (402) T protein:vir:97 307 YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDA 339 (402) T ss_pred HHHHHHHHhCCcccCccceEEEEEecccccccC Confidence 11334555888999999998886544 222 No 166 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.58 E-value=1.5e-08 Score=63.47 Aligned_cols=293 Identities=10% Similarity=0.040 Sum_probs=146.4 Q ss_pred HHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceeccc-Ccceeeeeeccccceeccccccceeec Q lcl|Aclame:pro 113 IDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNKAAVVP 191 (419) Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~ 191 (419) +.......+.... +....- .+.-+.+..++.......+.++++.++.++. ++++.+|+... .++++.. T Consensus 1 Ms~~n~~t~p~~~-gsg~~~-aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~lG~---------s~a~y~~ 69 (400) T protein:vir:10 1 MSTPNNLTNVAVS-ASGEVD-SLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGE---------TELQVLA 69 (400) T ss_pred CCCCccccccccc-cccchh-hhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeee---------eEEeeec Confidence 0000000001111 111111 2223677788888888889999999999985 56899988632 2344444 Q ss_pred CcccccccccceeeEEeeeEEEE-EeehhhH--HHHhhHHH-HHHHHHHHHHHHHHHHHHHHHHh----cc--Cc----c Q lcl|Aclame:pro 192 EGTAKPQSTLSFDTITTTLKTVA-HWLPITR--QAADDNSQ-LMGYIQGRLTYGLRFLRDRQLLN----GN--GS----T 257 (419) Q Consensus 192 Eg~~~~~~~~~~~~v~~~~~k~~-~~~~vs~--ell~d~~~-~~~~i~~~l~~a~~~~~d~~il~----G~--g~----~ 257 (419) -|+..--..+..++..+....+= ....|.+ |.+. .-+ +-+.+.++++.++++..|+.+|. +. -+ + T Consensus 70 pG~~ldg~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~-~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~ 148 (400) T protein:vir:10 70 PGQSPAATSTQADKNQLVIDATVIARNTVAHLHDVQG-DIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRT 148 (400) T ss_pred CCCCcCCCCcccCcEEEEeCceeeecchhhhHHHHhh-ccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 55554333344555555444432 1122211 1122 124 67889999999999999997762 20 01 1 Q ss_pred cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCc--EEEEehHHHHHHHHHhccC-CceeccC--Cc Q lcl|Aclame:pro 258 EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD--GVVVHPQDWESIELDQAPG-SGVFRVI--AN 332 (419) Q Consensus 258 ~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~~kd~~-g~~~~~~--~~ 332 (419) .|-|+....++ .+..........+..+...+..+...+...+.+.. ++++.|..|..|+.- |.- .+.+-.. ++ T Consensus 149 ~~~g~~~g~s~-~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~-dkLvnrdf~~s~~g~ 226 (400) T protein:vir:10 149 NPRVKGHGFSV-NVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDA-DRIVDKSYTISQSGA 226 (400) T ss_pred cCCccccccce-eecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhC-CcccchhccccCCCc Confidence 22333221111 11111222222334455556666666665554432 455666666555432 100 0111111 22 Q ss_pred cccCCCcccccceeEecCCCCcCc---------------E--EEEeccceEE-EEEecce-EEEEeecccchhhc--C-c Q lcl|Aclame:pro 333 VQGEATPRIWGLNVVSTVAIAQGT---------------A--LVGGFRQGAT-LWSRQGI-TVLMTDSHADFFTA--N-T 390 (419) Q Consensus 333 ~~~~~~~~l~G~pv~~~~~~~~~~---------------~--~~~d~~~~~~-~~~~~~~-~i~~~~~~~~~~~~--~-~ 390 (419) ...+....+.|+||+.++.+|.+. - +-+|+....- +|.+..+ .++.-+-..+.|.. . . T Consensus 227 ~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~ 306 (400) T protein:vir:10 227 TIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKT 306 (400) T ss_pred cccceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHH Confidence 233444578999999999998531 0 2245544322 2222221 11111111111110 1 1 Q ss_pred EEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 391 LVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 391 ~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ..+.+..-++..+++|+|.++++.+-..| T Consensus 307 ~~id~~~a~G~g~~RPeaa~vv~~~~~~~ 335 (400) T protein:vir:10 307 YYIDTFMSEGAIPDRWEAVSVVTTKRQST 335 (400) T ss_pred HHHHHHHHhCCcccchhheEEEEecCCcc Confidence 23345666899999999999999988777 No 167 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=98.42 E-value=1.3e-07 Score=58.36 Aligned_cols=286 Identities=11% Similarity=0.037 Sum_probs=164.9 Q ss_pred cccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccc Q lcl|Aclame:pro 121 RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST 200 (419) Q Consensus 121 ~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 200 (419) .+....+.++...+..-+.+.+.|...-....++..++...+..+...+|...+-.... ..-..||++.+... T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~-------~~~~~EG~da~~~~ 73 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPG-------KNTRVEGEDATIKA 73 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCcc-------ccccccCccccccc Confidence 22222222223333445667788888878888888888877776667777654432211 11234777655543 Q ss_pred cce-eeEEeeeEEEEEeehhhHHHHhhH----HHHHHHHHHHHHHHHHHHHHHHHHhccCc---------ccccceeccc Q lcl|Aclame:pro 201 LSF-DTITTTLKTVAHWLPITRQAADDN----SQLMGYIQGRLTYGLRFLRDRQLLNGNGS---------TEMQGILTTP 266 (419) Q Consensus 201 ~~~-~~v~~~~~k~~~~~~vs~ell~d~----~~~~~~i~~~l~~a~~~~~d~~il~G~g~---------~~p~Gi~~~~ 266 (419) ..- ..+.=-..-+...+.||.-+...+ .+...|-...-...+.+-+|.++|+|.-. .+-.||++.. T Consensus 74 ~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i 153 (317) T protein:vir:88 74 GSFTTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYY 153 (317) T ss_pred ccCCEEeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHh Confidence 221 111112223344455665543322 24556666666778889999999998521 1334554321 Q ss_pred ---------cccccccc--cccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCc--c Q lcl|Aclame:pro 267 ---------GIGTYQQP--KPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIAN--V 333 (419) Q Consensus 267 ---------~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~--~ 333 (419) |....... ..+..+......+++.+++.++...+..+..+++++.....|.++...++.+...... . T Consensus 154 ~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~ 233 (317) T protein:vir:88 154 KTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDASDNR 233 (317) T ss_pred ccCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEcccCeE Confidence 11111000 1111222235677888888889889888889999999999998885433333221111 0 Q ss_pred ccCCCc---cccc-ceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccce Q lcl|Aclame:pro 334 QGEATP---RIWG-LNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAF 409 (419) Q Consensus 334 ~~~~~~---~l~G-~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~ 409 (419) ...... +=+| ++++.+.+||++.++++|+.+.-+-+-|. +..+.-... -+.....++..+...+++|+|. T Consensus 234 ~g~~v~~~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~-~~~e~laKt-----Gd~~k~~i~~E~tLe~~N~~a~ 307 (317) T protein:vir:88 234 IAQTVDVYESDFGKYTIRANRWFHENTLFVFDPKMHSLCYLRP-FFQHELAKT-----GDSEKRQLLVEYTFRVNNEKSG 307 (317) T ss_pred EEEEEEEEEeCCeEEEEEeCCCCCCCeEEEEcccccceeeccc-ceeeccCCC-----cccceeEEEEEEEEEEcCccce Confidence 000000 1123 57888999999999999998755544332 222211111 2445678888999999999999 Q ss_pred EEEEecCCCC Q lcl|Aclame:pro 410 VRVTFAAATT 419 (419) Q Consensus 410 ~~~~~~aa~~ 419 (419) +++..-+++= T Consensus 308 a~i~~l~~~~ 317 (317) T protein:vir:88 308 ALIRDVVAQL 317 (317) T ss_pred eEEEEecccC Confidence 9998766666 No 168 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.41 E-value=3.8e-08 Score=61.27 Aligned_cols=292 Identities=11% Similarity=0.014 Sum_probs=144.4 Q ss_pred HHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceeccc-Ccceeeeeeccccceeccccccceeec Q lcl|Aclame:pro 113 IDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNKAAVVP 191 (419) Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~ 191 (419) +.......+....+ .... ..+.-+.+..++.......+.++++.++.++. ++++.+|+.... ++.+.. T Consensus 1 Ms~~n~~t~~~~~~-sg~~-~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~G~s---------~~~~~~ 69 (401) T protein:vir:70 1 MSTPNNLTNVAVSA-SGEV-DSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGET---------ELQVLA 69 (401) T ss_pred CCCCcccccccccc-ccch-hHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeeee---------Eeeeec Confidence 00000000001111 0111 12223677788888888889999999999885 568999986432 333333 Q ss_pred CcccccccccceeeEEeeeEEEEE-eehhhHHHHhhH---HH-HHHHHHHHHHHHHHHHHHHHHHh-----cc----C-c Q lcl|Aclame:pro 192 EGTAKPQSTLSFDTITTTLKTVAH-WLPITRQAADDN---SQ-LMGYIQGRLTYGLRFLRDRQLLN-----GN----G-S 256 (419) Q Consensus 192 Eg~~~~~~~~~~~~v~~~~~k~~~-~~~vs~ell~d~---~~-~~~~i~~~l~~a~~~~~d~~il~-----G~----g-~ 256 (419) -|+...-..+..++..+....+-. ...|. -+++. -+ +-+.+.+++++++++..|+.++. |- + . T Consensus 70 pG~~ld~~~~~~dK~~ItID~lL~a~~~V~--dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~ 147 (401) T protein:vir:70 70 PGQSPAATSTQADKNQLVIDATVIARNTVA--HLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKR 147 (401) T ss_pred CCCCcCCCCcccccEEEEeCceeehhhhhh--hHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc Confidence 344433334455555444443311 11111 12222 14 56788899999999999986632 21 0 1 Q ss_pred ccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCc--EEEEehHHHHHHHHHhccC-Cceecc--CC Q lcl|Aclame:pro 257 TEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD--GVVVHPQDWESIELDQAPG-SGVFRV--IA 331 (419) Q Consensus 257 ~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~~kd~~-g~~~~~--~~ 331 (419) ..|.|.-.. ...............+..+.+.+..+...+...+.+.. ++++.|..|..|..- +.- .+-|-. .+ T Consensus 148 ~~p~~~~~G-~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~-d~L~nrd~~~s~~g 225 (401) T protein:vir:70 148 TNPRVKGHG-FSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDA-DRIVDKTYTISQSG 225 (401) T ss_pred cCCCcCCCc-eEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhc-CcccchhhccccCC Confidence 123222111 11122222223334455577777788877776666543 344556655555432 110 011110 12 Q ss_pred ccccCCCcccccceeEecCCCCcCc---------------E--EEEeccceE-EEEEecce-EEEEeecccchhhc--C- Q lcl|Aclame:pro 332 NVQGEATPRIWGLNVVSTVAIAQGT---------------A--LVGGFRQGA-TLWSRQGI-TVLMTDSHADFFTA--N- 389 (419) Q Consensus 332 ~~~~~~~~~l~G~pv~~~~~~~~~~---------------~--~~~d~~~~~-~~~~~~~~-~i~~~~~~~~~~~~--~- 389 (419) ....+....+.|+||+.++.+|.+. . +-+|+.... ++|.+..+ .++.-+-..+.|.. . T Consensus 226 ~~~~G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~ 305 (401) T protein:vir:70 226 ATIQGFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEK 305 (401) T ss_pred ccccceEEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhh Confidence 2233444578999999999998632 1 113444322 22222221 11111111111110 0 Q ss_pred cEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 390 TLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 390 ~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ...+-+..-++..+++|+|.++++.+-..| T Consensus 306 ~~~id~~~a~g~g~~RPeaa~vv~~k~~~~ 335 (401) T protein:vir:70 306 TYYIDTFMAEGAIPDRWEAVSVVTTKRNTT 335 (401) T ss_pred HHHHHHHHHhCCcccchhheEEEeecCccc Confidence 112335666899999999999986655533 No 169 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=98.40 E-value=4e-07 Score=55.63 Aligned_cols=265 Identities=12% Similarity=0.003 Sum_probs=125.2 Q ss_pred cccCCcccccchhhhHHHHHhhhhhhhHHhhcceec-----ccCcceeeeeeccccceeccccccceeecCccccccccc Q lcl|Aclame:pro 127 TITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQN-----ADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTL 201 (419) Q Consensus 127 ~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 201 (419) ..+..+..+-|+.+...+++.++...++..++..-. -.+..+++|+..... +.++...+-.+. T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~------------v~dg~~~~~~~~ 68 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVK------------SASGRTLVKQPM 68 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCcee------------ecccCCcccccc Confidence 223344555689999999999999888877765422 123467776632211 112333333344 Q ss_pred ceeeEEeee--EEEEEeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccc Q lcl|Aclame:pro 202 SFDTITTTL--KTVAHWLPITRQ-AADDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA 278 (419) Q Consensus 202 ~~~~v~~~~--~k~~~~~~vs~e-ll~d~~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 278 (419) +-..+.+.. +|.- -+.|+++ ...+..++...+.+....+++..+|..++.-- .+.+ + .. .. T Consensus 69 te~~v~l~id~~k~~-~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~-~~a~----~-------~~---gt 132 (418) T protein:vir:10 69 VDQTIPFKIAYQEHV-GLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTL-KKAF----H-------SS---GT 132 (418) T ss_pred ccceEEEEEeccccc-ceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH-hhcc----c-------cc---cc Confidence 444444444 3333 3445544 45556688888889999999999999887410 0000 0 00 00 Q ss_pred cchhhhHHHHHHHHHHhhhhhccCCc--EE-EEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcC Q lcl|Aclame:pro 279 PATDEPPLVDIRRAKTVAEIAGFPPD--GV-VVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG 355 (419) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~ 355 (419) ..+....|+++.++...+...+.+.. .| +++|..+..|.+-................+..+++.|+.|+.++.+|.. T Consensus 133 ~gt~~~~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~ 212 (418) T protein:vir:10 133 PGVRPGAFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMGYRGNVAAYEVYESQNLPKH 212 (418) T ss_pred CCcCcchHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhheeeeeeeeceEEEEecCCCcc Confidence 11122347888888888877777532 44 6899988777543211100000111233456678999999999999853 Q ss_pred cEEEEeccceEEEEEecceEEEEe--ecc-cchhhcCc-EEEEEE---EEeccEE-ecccceEEEEe------------- Q lcl|Aclame:pro 356 TALVGGFRQGATLWSRQGITVLMT--DSH-ADFFTANT-LVILAE---FRANLAV-YQPKAFVRVTF------------- 414 (419) Q Consensus 356 ~~~~~d~~~~~~~~~~~~~~i~~~--~~~-~~~~~~~~-~~~r~~---~r~d~~~-~~~~a~~~~~~------------- 414 (419) +.....-...+.+....+..+.+. ... ......+. +.|-.. ..+...+ .+++-|++... T Consensus 213 tag~~~~t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i 292 (418) T protein:vir:10 213 TVGDHGGTPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKI 292 (418) T ss_pred cccccccceeeecccccceeEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEe Confidence 211000000011111111111110 000 00011111 111110 0000011 01222322221 Q ss_pred cCCC---------------------------C Q lcl|Aclame:pro 415 AAAT---------------------------T 419 (419) Q Consensus 415 ~aa~---------------------------~ 419 (419) .++. . T Consensus 293 ~p~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a 324 (418) T protein:vir:10 293 SPSLNDGTATINNENGDPVSLTAYQNVTALPA 324 (418) T ss_pred ccccccccccccccccccccccCCCccccccc Confidence 1110 0 No 170 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=98.21 E-value=2.3e-07 Score=57.00 Aligned_cols=314 Identities=9% Similarity=-0.015 Sum_probs=153.2 Q ss_pred chhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccc----cchhhhHHHHHhhhhhhhH Q lcl|Aclame:pro 79 GTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPH----LPQLVPGIVPTTPDLPLLV 154 (419) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~p~~~~~~i~~~~~~~~~l 154 (419) -..+.-.....+.+.+..................+ + ..........++..... ..+.+...|++.+...... T Consensus 1 ~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~---a-~d~~~~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~~ 76 (339) T protein:vir:94 1 MSINNDRTDIKQLEKVGIIFDGYSPKSISSEVSAY---A-MDAVNLTPTLQTTANAGIPAWMTTFVDRRVIDIQLAPMAA 76 (339) T ss_pred CceechHHHHHHHHhhceeeccchhhhcchhhHhh---h-ccccccccccccccccchhhhhhhhhchhheeecccccch Confidence 00000000111111110000000000000011111 0 01111111111112222 3455556667778888888 Q ss_pred HhhcceecccCc---ceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-H-- Q lcl|Aclame:pro 155 ADLLDQQNADYN---VLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-S-- 228 (419) Q Consensus 155 ~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~-- 228 (419) +.++++.+.+.. .+.|...+ ..+.+.+.+.+++.|..+......+-..+.+...+.++..=+..+ . T Consensus 77 ~~l~pv~t~g~w~~~t~~y~~~e--------~~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g~~y~~~E~~~A~~~g 148 (339) T protein:vir:94 77 AKIFPEVKKGDWTTTYGVFIIAE--------PVGQVATYSDWSANGMSKANVNFESRQNYRYQTWTEYGDLEMATYGEAG 148 (339) T ss_pred hhhcccccCCCCcccEEEEeeee--------cccceEEcccccCCCcccccceeeEEeEEEEEEEEeecHHHHHHHHhhC Confidence 888888776532 34554433 345677888888888877544444444444444444553323222 1 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccc-ccccccccchhhhHHHHHHHHHHhhhhhcc----- Q lcl|Aclame:pro 229 -QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTY-QQPKPTAPATDEPPLVDIRRAKTVAEIAGF----- 301 (419) Q Consensus 229 -~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 301 (419) ++.+--....++++...+|+..++|+...+..|++|.+++... .........+....++|+..++..+...-. T Consensus 149 ~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~ 228 (339) T protein:vir:94 149 IDYVARQEISASLVMAKFANSSYLLGVAGIANYGLMNDPSLPAPVAATVNWATAAPEDIANDVVAMVGRLISQSGGLITG 228 (339) T ss_pred CChHHHHHHHHHHHHHHhhceEEeeeecccceEEEEeCCCccccccCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeee Confidence 5788888889999999999999999877788999999888553 334455667777889999988888754422 Q ss_pred -CCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCc---CcEEEEeccceEEEEEecceEEE Q lcl|Aclame:pro 302 -PPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQ---GTALVGGFRQGATLWSRQGITVL 377 (419) Q Consensus 302 -~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~---~~~~~~d~~~~~~~~~~~~~~i~ 377 (419) .+..++|.|..+..|... ...|..++ ..... .+-++.++....+.. +...++... ..+.....+. T Consensus 229 ~~~~~L~LP~~~~~~L~~~-n~~~~Tvl--~~lk~----n~pnl~i~~~~el~~a~g~~~~~~~~~----~~~~~~~~~~ 297 (339) T protein:vir:94 229 QERMVMALAPSALNNVNRT-NNFGLSAG--AKIAQ----TYPNIQFVAVPEFDTASGRLVQLWVPE----VNGQPTGEVA 297 (339) T ss_pred ccCcEEEecHHHHHhcccC-CcCCccHH--HHHHH----hcCCcEEEEccccccCCCceEEEEEEe----ccCCcceEEE Confidence 234688999998877543 22222111 00000 111233443333321 111111100 0000111121 Q ss_pred Eeecccc-hhhc--CcEEEEEEEEe-ccEEecccceEEEEec Q lcl|Aclame:pro 378 MTDSHAD-FFTA--NTLVILAEFRA-NLAVYQPKAFVRVTFA 415 (419) Q Consensus 378 ~~~~~~~-~~~~--~~~~~r~~~r~-d~~~~~~~a~~~~~~~ 415 (419) +...-.. ..+. -.+..-...|. +..+++|.||++++-= T Consensus 298 ~p~~~~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 298 FAEKLRSHSIERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred cchhhhccccEEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 1110000 0011 12233455564 4456789999998765 No 171 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=98.13 E-value=6.6e-07 Score=54.49 Aligned_cols=268 Identities=9% Similarity=0.044 Sum_probs=150.8 Q ss_pred cCCcccccc--hhhhHHHHHhhhhhhhHHhhcceecccCc---ceeeeeeccccceeccccccce--eecCc-ccccccc Q lcl|Aclame:pro 129 TNPNVPHLP--QLVPGIVPTTPDLPLLVADLLDQQNADYN---VLEYIRDTSGTAGAGSTWNKAA--VVPEG-TAKPQST 200 (419) Q Consensus 129 ~~~~~~~~p--~~~~~~i~~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~a~--~v~Eg-~~~~~~~ 200 (419) .++...++. +.+...|.+........+.++++.+.... ++.+... ...+.+. |++-+ .++|.-+ T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~--------d~~G~a~~~~i~~~a~dip~vd 72 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGA--------DEHGSLDDGLITVGTSTLDQVE 72 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeee--------eccCcccccccCCcCCccceee Confidence 111111111 22334444444455555566655443221 2222221 1223444 77655 6688888 Q ss_pred cceeeEEeeeEEEEEeehhhHHHHhhHH----HHHHHHHHHHHHHHHHHHHHHHHhccCc-ccccceeccccccccccc- Q lcl|Aclame:pro 201 LSFDTITTTLKTVAHWLPITRQAADDNS----QLMGYIQGRLTYGLRFLRDRQLLNGNGS-TEMQGILTTPGIGTYQQP- 274 (419) Q Consensus 201 ~~~~~v~~~~~k~~~~~~vs~ell~d~~----~~~~~i~~~l~~a~~~~~d~~il~G~g~-~~p~Gi~~~~~~~~~~~~- 274 (419) ..+++-....+.++..+.+|.+=++.+. ++..--...+.+++...+|+..+.|+-. .+..|++|.+++...... T Consensus 73 ~~~~~~~~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~ 152 (304) T protein:vir:52 73 VGFTPTRSYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKG 152 (304) T ss_pred cccceeEEEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecC Confidence 8899999999999988888866554432 4677677777888999999999999754 357899999998754333 Q ss_pred ----cccccchhhhHHHHHHHHHHhhhhhc---cCCcEEEEehHHHHHHHHHhccCCc-eec--cCCccccCCCcccccc Q lcl|Aclame:pro 275 ----KPTAPATDEPPLVDIRRAKTVAEIAG---FPPDGVVVHPQDWESIELDQAPGSG-VFR--VIANVQGEATPRIWGL 344 (419) Q Consensus 275 ----~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~l~~~kd~~g~-~~~--~~~~~~~~~~~~l~G~ 344 (419) ......+....++|+..++.++...- ..+..++|.|+.+..|.....++++ .++ +..+... ..|. T Consensus 153 ~~a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~~-----~~g~ 227 (304) T protein:vir:52 153 AAQNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLSA-----AAGR 227 (304) T ss_pred CccCCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhccc-----ccCC Confidence 22344577788999998888875332 2466899999999988654333322 111 1111000 1233 Q ss_pred eeE--ec--CCCC-----cCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEE--EEEEEEecc-EEecccceEEE Q lcl|Aclame:pro 345 NVV--ST--VAIA-----QGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLV--ILAEFRANL-AVYQPKAFVRV 412 (419) Q Consensus 345 pv~--~~--~~~~-----~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~--~r~~~r~d~-~~~~~~a~~~~ 412 (419) |+- .. .... .+..++++.+.-++-+. -.+.+.+-. ...+|... .-.+.|+++ .+++|.||+++ T Consensus 228 ~l~I~~v~~~~~~~g~~g~~r~vvY~~d~~~~~~~-vP~p~~~l~----~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~ 302 (304) T protein:vir:52 228 QVAIKALPSNYGTRVTDGKTRAMVYVNSKEHVIFD-VPMSPTVLD----AQPKGLLAFESGLRMAFGGVTFMEPDSALYV 302 (304) T ss_pred cceEEEecccccccCCCCceEEEEEecChhheEEe-cCccccccc----hhhcCCceEEecceeeeeeEEEEccceeeee Confidence 321 11 1111 12245555554444331 122222211 12334322 335666665 56779999999 Q ss_pred Ee Q lcl|Aclame:pro 413 TF 414 (419) Q Consensus 413 ~~ 414 (419) .. T Consensus 303 D~ 304 (304) T protein:vir:52 303 DY 304 (304) T ss_pred cC Confidence 99 No 172 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=98.11 E-value=1e-06 Score=53.37 Aligned_cols=276 Identities=9% Similarity=-0.062 Sum_probs=107.5 Q ss_pred hhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhh-------cceecccCcceeeeee Q lcl|Aclame:pro 101 DKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADL-------LDQQNADYNVLEYIRD 173 (419) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~-------~~~~~~~~~~~~~~~~ 173 (419) +...... +.-|..+... ++.+.+....... ....++.+.-++.|.- T Consensus 1 m~lsD~~--------------------------vfN~~~~~a~-~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~ 53 (325) T protein:vir:95 1 MALSDLA--------------------------VYSEYAYSAF-SETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFF 53 (325) T ss_pred Cchhhhh--------------------------hhhhhhhhhh-hhhhhhhHhhhhhcccceeEeccccccCceeecccc Confidence 0000000 0001111111 1111111111111 1111222322233321 Q ss_pred ccccceeccccccceeecCccccccccc-ceeeEEeeeEEEEEeeh--hhHHHHh-hHH-HHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 174 TSGTAGAGSTWNKAAVVPEGTAKPQSTL-SFDTITTTLKTVAHWLP--ITRQAAD-DNS-QLMGYIQGRLTYGLRFLRDR 248 (419) Q Consensus 174 ~~~~~~~~~~~~~a~~v~Eg~~~~~~~~-~~~~v~~~~~k~~~~~~--vs~ell~-d~~-~~~~~i~~~l~~a~~~~~d~ 248 (419) .... ........+.+.+..+..+. +..++......-.+.+. ++..++. +.+ .+.+.|.+++++...+.+=. T Consensus 54 ~~l~----g~~~~~~~~~~~~~vt~~kitt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~ 129 (325) T protein:vir:95 54 AKVT----GGLVRRRNAYGSGTVAEKVLKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLN 129 (325) T ss_pred cccc----ccccccccCCCCceeccceeccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHH Confidence 1100 00001112223333333332 23444444333333332 2232322 222 45555666666555444444 Q ss_pred HHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceec Q lcl|Aclame:pro 249 QLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFR 328 (419) Q Consensus 249 ~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~ 328 (419) .+|.+-.. .+....-....................+.++..++-+....-..|+||..++..|.+..-.+...++ T Consensus 130 ~~~~~l~~-----a~~~~~~~v~dis~~~~~~~~~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~ 204 (325) T protein:vir:95 130 VGLGSVYS-----ALSQVSDVVYDATANTDAADKLPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLF 204 (325) T ss_pred HHHHHHHH-----hhcccccceeeeecccCcccccccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhcccccccc Confidence 44432111 0000000001111111112222346778888888888777788999999999999987655443333 Q ss_pred cCCccccCCCcccccceeEecCCCCcCc------EEEEeccceEE-EEEecceEEEEeecccchhhcCcEEEEEEEEecc Q lcl|Aclame:pro 329 VIANVQGEATPRIWGLNVVSTVAIAQGT------ALVGGFRQGAT-LWSRQGITVLMTDSHADFFTANTLVILAEFRANL 401 (419) Q Consensus 329 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~------~~~~d~~~~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~ 401 (419) ...... .-++.+|++|+++|.||... ...+.|..+.. +.+..+......+..+ -.+-...+|.+. - T Consensus 205 ~~~g~~--~i~t~~G~~VIVdD~~p~~~~g~~~~ytty~lg~GAi~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~---t 277 (325) T protein:vir:95 205 TYGTVN--VVRDPFGKLLVMTDSPNLFAAGTPNVYHILGLVPGGVLIGQNNDFDANEETKNG--DENIIRTYQAEW---S 277 (325) T ss_pred ccCCcc--cccccCCcEEEEeCCCCCCCccCceeEEEEEEecCeEEecCCCCccccccccCc--ccceeeeeeeee---e Confidence 333222 23478999999999998532 11112222221 1111121111111111 011122333221 1 Q ss_pred EEecccceEEEEec--CCCC Q lcl|Aclame:pro 402 AVYQPKAFVRVTFA--AATT 419 (419) Q Consensus 402 ~~~~~~a~~~~~~~--aa~~ 419 (419) -++||..+..-+-. .+|| T Consensus 278 f~lhp~G~sw~~s~~g~sPt 297 (325) T protein:vir:95 278 YNIGVKGFAWDKANGGKSPT 297 (325) T ss_pred EEeecceeeeecccccCCcC Confidence 36789888874322 3444 No 173 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=98.00 E-value=1e-06 Score=53.43 Aligned_cols=307 Identities=10% Similarity=0.046 Sum_probs=155.2 Q ss_pred HHhHHHHHHHHHhhhh-----hhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHH----HHHhhhhhhhHHhhc Q lcl|Aclame:pro 88 FADSDGLREYRARDKR-----GQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGI----VPTTPDLPLLVADLL 158 (419) Q Consensus 88 ~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~----i~~~~~~~~~l~~~~ 158 (419) .-+...++.+...+.. .....+...+.. ......+...+++...+|+.+..- +++.+........++ T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~----da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~ 76 (336) T protein:vir:36 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAM----DAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELV 76 (336) T ss_pred CchHHHHHHHhhcCeeecchhhhhhhHHHHhhh----hhhhccCccccCCCcchHHHHHHhhccceEeeecchhhhhhhc Confidence 0000111111111000 000001111100 111222233333444566666653 345555555666677 Q ss_pred ceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhh-HHHHhhHH---HHHHHH Q lcl|Aclame:pro 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPIT-RQAADDNS---QLMGYI 234 (419) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs-~ell~d~~---~~~~~i 234 (419) ++.+++....+.... ...-..+.+.+.+.+...|..+......+...+.++..+.++ .|+.+-.. ++.+-- T Consensus 77 pv~t~g~W~~~~~~~-----~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~K 151 (336) T protein:vir:36 77 GESKKGDWTTLVAAF-----ITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASEL 151 (336) T ss_pred cccccCCccceeEEE-----eeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHH Confidence 765544322111111 111123466788888899999988788888899999999998 44544332 577888 Q ss_pred HHHHHHHHHHHHHHHHHhccCcccccceeccccccccccc-c-ccccchhhhHHHHHHHHHHhhhhhcc------CCcEE Q lcl|Aclame:pro 235 QGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQP-K-PTAPATDEPPLVDIRRAKTVAEIAGF------PPDGV 306 (419) Q Consensus 235 ~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~ 306 (419) ....++++.+.+|+-.+.|++..+..|++|.|.+...... + ....++....++|+..++..+..... .+..+ T Consensus 152 a~aA~~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL 231 (336) T protein:vir:36 152 NYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRM 231 (336) T ss_pred HHHHHHHHHHhhCcEEEEeccccceEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEE Confidence 8889999999999999999988889999999887643332 2 22334557789999988888765332 35678 Q ss_pred EEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEE---e-cceEEEEeecc Q lcl|Aclame:pro 307 VVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWS---R-QGITVLMTDSH 382 (419) Q Consensus 307 ~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~---~-~~~~i~~~~~~ 382 (419) +|.+..+..|..- ...|..++ ..... .+-++.++..+.+.... |+. .+++.. . ....+.+...- T Consensus 232 ~LP~~~~~~Ls~~-n~~g~Tvl--~~lk~----n~Pnl~i~t~pEl~~a~---g~~--~~l~~~~~~~~~t~~~~~p~~~ 299 (336) T protein:vir:36 232 GLPPTAMSDLSKT-NQYGLAAA--AKLKD----IFPKLEFVTIPEYDTAS---GRL--VQLWAPRVEGKDTATCGFTEKM 299 (336) T ss_pred EechHHHHhccCC-CccCccHH--HHHHH----hcCccEEEEccccccCC---Cce--EEEEEEecCCCcceeeecchhh Confidence 9999987776432 22221111 00000 11122233332221110 110 111111 0 00111111000 Q ss_pred c-chhhc--CcEEEEEEEEecc-EEecccceEEEEec Q lcl|Aclame:pro 383 A-DFFTA--NTLVILAEFRANL-AVYQPKAFVRVTFA 415 (419) Q Consensus 383 ~-~~~~~--~~~~~r~~~r~d~-~~~~~~a~~~~~~~ 415 (419) . ...+. -.+..-+..|..| .+++|.||++++-= T Consensus 300 ~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 300 RAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred hccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 0 00011 1223334555555 45779999998765 No 174 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=98.00 E-value=7.5e-06 Score=48.68 Aligned_cols=390 Identities=14% Similarity=0.122 Sum_probs=161.4 Q ss_pred CCccHHHHHHH-------------------------------HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH-HH Q lcl|Aclame:pro 1 MPPTPTLEEQR-------------------------------AALLARL-DDTSLTTEQVQEIVAEARGLADALQAE-SD 47 (419) Q Consensus 1 M~~~~~L~e~~-------------------------------~~l~~~~-~~~~~~~~~~~~~~~e~~~~~~~~~~~-~~ 47 (419) ||. .|+.+. ..++++. .+.+.....++++-......++.+..+ +. T Consensus 188 ~p~--~~~~~~~~~~~~~~~v~d~EPa~~~~pvqAaAP~~De~airAq~~aeeraRi~~I~~l~a~Fggr~~~l~~~~l~ 265 (652) T protein:vir:79 188 MPD--SIRNMITPPRNSAPRVQDDEPAASRTPVQAAAPVVDENSIRAQVLAEQKARVNGINDLFAMFGGRYQTLQAQCLA 265 (652) T ss_pred hHH--HHHHHhcccccccccccccccccccccccccCCcCchhHHHHHHHHHHHHHHHHHHHHHHhhccccchHHHHHhh Confidence 221 111100 0000000 000011111111111111111111111 00 Q ss_pred HHHHHHHHHHHHH-HHHHHHHhhccccccc--ccchhhhhhHHHHhHHHHHHHHHhhhhh--hhhHHHHHHHHHHhhhcc Q lcl|Aclame:pro 48 RAAARAALLRTAP-PAPKGPADGGTPLTPA--EAGTFRSLAQRFADSDGLREYRARDKRG--QFQVEMRDIDPNRLLSRD 122 (419) Q Consensus 48 ~~~~~~~~l~~~~-~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 122 (419) ...-..++.+..+ +.+.+...+.....+. .......+++...+.-..+.-.....+. -....++++.......+. T Consensus 266 d~~~s~e~ar~~il~~l~~~~~p~~~~~~~~~~~~~g~~~~d~~~~aL~~R~g~~~~~~~~~~~g~~L~elAr~~L~~~G 345 (652) T protein:vir:79 266 DPECSLEQAREKLLNEMGRESTPSNKNTPAHIYAGNGNFVGDGIRQALMARAGFEKTERDNVYNGMTLREYARMSLTERG 345 (652) T ss_pred ccCCCHHHHHHHHHHHHHhhcCCCCCCcceeEeeccchhhHHHHHHHHHhhcCCcccccCccccCccHHHHHHHHHHhhc Confidence 0000011111111 1111111110000000 0001111111111100000000000000 001122233332222222 Q ss_pred ccccc----------ccCCcccccchhhhHHHHHh-----hhhhhhHHhhcceeccc-Ccceeeeeeccccceecccccc Q lcl|Aclame:pro 123 APAGT----------ITNPNVPHLPQLVPGIVPTT-----PDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNK 186 (419) Q Consensus 123 ~~~~~----------~~~~~~~~~p~~~~~~i~~~-----~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 186 (419) ....+ .+.+. ...|..+.+.+... ......++..|+..++. .+..+..+. ..-+. T Consensus 346 ~~~~~~~~~~~v~~A~~hsT-sDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~l--------g~~~~ 416 (652) T protein:vir:79 346 IGVSSYNPMQMVGAAFTHST-SDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGM--------GGFSA 416 (652) T ss_pred cCCCCCCHHHHHHHHhhcCc-chHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeec--------CCCCC Confidence 11111 11111 12333333333222 22234555666655542 122222222 22345 Q ss_pred ceeecCcccccccccceeeEEeeeEEEEEeehhhHHH-HhhHHHHHHHHHHHHHHHHHHHHHHHHH---hccCc--cccc Q lcl|Aclame:pro 187 AAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQA-ADDNSQLMGYIQGRLTYGLRFLRDRQLL---NGNGS--TEMQ 260 (419) Q Consensus 187 a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~el-l~d~~~~~~~i~~~l~~a~~~~~d~~il---~G~g~--~~p~ 260 (419) ..-|.|+++.......=...++.+.+++..+.||+++ ++|--+...-|-..++++.++++++.++ .+++. +.-+ T Consensus 417 L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk 496 (652) T protein:vir:79 417 LRQVREGAEYKYVTTGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNV 496 (652) T ss_pred ccccCCCCccceeeecCccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCc Confidence 6678899998877666677889999999999999996 5777777888899999999999988554 34432 1333 Q ss_pred cee-ccccccccccccccccchhhhHHHHHHHHHHhhhhh----ccCCcEEEEehHHHHHHHHHhccCCceeccCCcccc Q lcl|Aclame:pro 261 GIL-TTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIA----GFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQG 335 (419) Q Consensus 261 Gi~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~ 335 (419) .++ +... .+..+ +++.....+...+.++..-... +..|..|++.+........+..+..- ...+.+. T Consensus 497 ~LF~hA~H---~Nl~~--~aa~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~v---~~a~~~~ 568 (652) T protein:vir:79 497 SLFDKAKH---ANVLE--SAAMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSV---KGADINA 568 (652) T ss_pred eeeccccc---ccccc--cccCCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCCC---ccccccc Confidence 344 2111 11111 1222233344444444333322 34566778888776666555432211 1112222 Q ss_pred CCCcccccc-eeEecCCCCcC---cEEEEeccc------eEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEec Q lcl|Aclame:pro 336 EATPRIWGL-NVVSTVAIAQG---TALVGGFRQ------GATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQ 405 (419) Q Consensus 336 ~~~~~l~G~-pv~~~~~~~~~---~~~~~d~~~------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~ 405 (419) +....+.|+ .|++++.+..+ ..++++... +|+-+ .++..++. ...|..|.+.|++...++.+++| T Consensus 569 ~~~Np~~~~~~~i~eprL~~~s~~~wylaa~~~~dtiev~yL~G-~~~P~ie~----~~gf~~dG~~~kvrlD~G~~~iD 643 (652) T protein:vir:79 569 GIINPVKDFATVIAEPRLDDNSQTTFYLAASKGSDTIEVAYLNG-VDTPYIDQ----MEGFSVDGVTTKVRIDAGVAPVD 643 (652) T ss_pred ccccccccccccccccccCCCCcccEEEecCCCCCeEEEEEecC-CCCCeeee----cCCCCcceEEEEEEEeccCceee Confidence 333344453 66667666432 233333221 11211 22333332 23499999999999999999999 Q ss_pred ccceEEEEe Q lcl|Aclame:pro 406 PKAFVRVTF 414 (419) Q Consensus 406 ~~a~~~~~~ 414 (419) =.++++.+- T Consensus 644 ~RG~~k~t~ 652 (652) T protein:vir:79 644 HRGLVKCTA 652 (652) T ss_pred ccceeeecC Confidence 999988877 No 175 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=98.00 E-value=5.7e-06 Score=49.34 Aligned_cols=276 Identities=14% Similarity=0.016 Sum_probs=118.6 Q ss_pred ccc-ccccCCcccccchhhhHHHHHhhhhhhhHHh---------hcceecccCcceeeeeeccccceeccccccceeecC Q lcl|Aclame:pro 123 APA-GTITNPNVPHLPQLVPGIVPTTPDLPLLVAD---------LLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPE 192 (419) Q Consensus 123 ~~~-~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 192 (419) ... ...+.-...++|+.+...+.+...+.+.|.+ +......++..+++|...... +...-+.+ T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~-------g~~~n~~~ 73 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLD-------SLEPNYGS 73 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCC-------CCccccCC Confidence 000 0001112345666666555554433333221 111123455566666543221 11111222 Q ss_pred ccc---ccccccce-eeEEeeeEEEEE--eehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHh---c----cCccc- Q lcl|Aclame:pro 193 GTA---KPQSTLSF-DTITTTLKTVAH--WLPITRQAADDNSQLMGYIQGRLTYGLRFLRDRQLLN---G----NGSTE- 258 (419) Q Consensus 193 g~~---~~~~~~~~-~~v~~~~~k~~~--~~~vs~ell~d~~~~~~~i~~~l~~a~~~~~d~~il~---G----~g~~~- 258 (419) ... .+-.+.+- .++-...+.-.+ .-.++..+- ..+..+.|.++++.--.+...+.+|. | ++++. T Consensus 74 d~~~~~~t~~kittg~~~a~v~~r~kaw~~~Dla~~ls--G~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~ 151 (367) T protein:vir:80 74 DNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELA--GSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNF 151 (367) T ss_pred CCCcccccccccccchheeeeehhcccchhhhHHHHhh--CchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccch Confidence 211 11112211 111111111111 122333322 13566666777665555555544443 1 11110 Q ss_pred ------------ccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCce Q lcl|Aclame:pro 259 ------------MQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGV 326 (419) Q Consensus 259 ------------p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~ 326 (419) +.+..+ -........+...........+.++...+-+....-++++||+.++..|++++--. T Consensus 152 ~~~~~~~~~~a~~~~~~~---~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~~l~~i~mHS~V~~~L~~~~li~--- 225 (367) T protein:vir:80 152 ATIKTRGRVPAEVLGTAG---DMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIE--- 225 (367) T ss_pred hhhhhhhccccccccccC---ceeeeeeccCCCccceecHHHHHHHHHHhccccccccEEEEchHHHHHHHhccccc--- Confidence 111111 01111111111122334566777887777777777889999999999999874210 Q ss_pred eccCCccccCCCcccccceeEecCCCCcC------cEEEEeccceEEEEEecceE--EEEeecccchhh--cCcEEEEEE Q lcl|Aclame:pro 327 FRVIANVQGEATPRIWGLNVVSTVAIAQG------TALVGGFRQGATLWSRQGIT--VLMTDSHADFFT--ANTLVILAE 396 (419) Q Consensus 327 ~~~~~~~~~~~~~~l~G~pv~~~~~~~~~------~~~~~d~~~~~~~~~~~~~~--i~~~~~~~~~~~--~~~~~~r~~ 396 (419) +++..-....-++++|++|++++.||.. ....+-|..+.+.+...+.. +++.+. ... .+..-+... T Consensus 226 -~i~~sd~~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GAi~~~~~~~~~~~E~~Rd---~~~~~~gG~d~L~~ 301 (367) T protein:vir:80 226 -FIPDSKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRR---ELRGNGSGLEYILE 301 (367) T ss_pred -cccCCCCccccceecceeEEEeCCCcccccCCCceEEEEEEecceeeecccCCccceecccc---hhhhcCCceEEEEe Confidence 1111112334578999999999999942 11111222233322222211 233222 222 123333433 Q ss_pred EEeccEEecccceEEEEec-----------------CCCC Q lcl|Aclame:pro 397 FRANLAVYQPKAFVRVTFA-----------------AATT 419 (419) Q Consensus 397 ~r~d~~~~~~~a~~~~~~~-----------------aa~~ 419 (419) .|. .+.||..|...+-. .+|| T Consensus 302 Rr~--~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt 339 (367) T protein:vir:80 302 RKE--WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) T ss_pred eee--EEeecceeeecccccccccccccccccccccCCCC Confidence 333 67888887765321 2344 No 176 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=97.99 E-value=7.7e-06 Score=48.62 Aligned_cols=284 Identities=7% Similarity=-0.020 Sum_probs=125.1 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhh-c Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADL-L 158 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~-~ 158 (419) ..+.+ +...+.+...+.-+... ...++.....+.+...+.+.......-..+ + T Consensus 1 ~~~~~---------------~~~~~~~~~~~~~~~~~-----------~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~ 54 (319) T protein:vir:97 1 MNKTI---------------KNATGMLKLNLQHFANK-----------SVEPGQTLLKNKHVGILERVTAVNAYSTPALI 54 (319) T ss_pred CCccc---------------ccccceeEeehhhhhcc-----------CCCcchHHHHHHHHHHHHHHHHHhhhhhhccc Confidence 00000 00001111111111111 122333344455555555433322211111 1 Q ss_pred c--eecccCcceeeeeeccccceeccccccceeecCcccccccccce--eeEEeeeEEEEEeehhhHHH-HhhHH-HH-- Q lcl|Aclame:pro 159 D--QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSF--DTITTTLKTVAHWLPITRQA-ADDNS-QL-- 230 (419) Q Consensus 159 ~--~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~--~~v~~~~~k~~~~~~vs~el-l~d~~-~~-- 230 (419) + ..-.+++.++||+.......-+ ....+ ....+++. ...+++-.+.-.+. | +.+ ...+. .+ T Consensus 55 N~~~e~~gg~tVkIp~i~~~gl~DY--~R~~g-------~~~g~vt~~~~t~tidqdR~~~F~-V-D~~D~~Etn~~l~a 123 (319) T protein:vir:97 55 SNDAIFMEGRSFTVMKGDTTELKDY--KRNAT-------NEFDHPKIEETTYFLDQEKYWGRF-V-DALDRKDTEGNIDI 123 (319) T ss_pred CcceEeccCcEEEEeeecccccccc--cCCCC-------cccCCcccceeEEEeecccccccc-c-chhhHhhhhchhhH Confidence 2 2234677899988764322111 11111 12223333 33333333332222 1 111 11222 23 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCc-EEEEe Q lcl|Aclame:pro 231 MGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD-GVVVH 309 (419) Q Consensus 231 ~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 309 (419) ...+.+.+...++-.+|...+..--.+ .........+....|+.+.+++..+.....+.. ..+|+ T Consensus 124 ~~i~~~~~~~~v~PEiDay~~skla~~--------------a~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vt 189 (319) T protein:vir:97 124 NYVVARQGAEVVAPYLDNLRFATLARN--------------KAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVS 189 (319) T ss_pred HHHHHHHHHHHhhhhhhHHHHHHHHhh--------------cccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeC Confidence 334455566666667787655421000 000111123445578888888888877766534 45689 Q ss_pred hHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCC--CCcCcEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 310 PQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVA--IAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 310 ~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~--~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) |..+..|.+-..-....-........+..+.|.|++|+.++. +..-.++++.. .+..... +--.+++.......| T Consensus 190 p~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~-~A~~~~~-k~~~~~~~~p~~~~~- 266 (319) T protein:vir:97 190 PTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVG-EVLASPI-QADLAKTNSNIPGMF- 266 (319) T ss_pred HHHHHHHHhhhhhhccccccccceeeeeceeecCeEEEEecccccccceEEEEcC-Ceeeeee-eeeeeeccCCCcccc- Confidence 999888865422111110111223356667899999997633 22333444443 3333322 212233222112222 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ...++...++|..+.+|++..++....+.. T Consensus 267 --a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~ 296 (319) T protein:vir:97 267 --GTLAEQLLYTGAFVPEHLQKYIFTIGGTEV 296 (319) T ss_pred --ceeeeeeeeeeeEEeccccceEEEeecCCc Confidence 357888999999999998555554332222 No 177 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=97.99 E-value=7.7e-06 Score=48.62 Aligned_cols=284 Identities=7% Similarity=-0.020 Sum_probs=125.1 Q ss_pred hhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhh-c Q lcl|Aclame:pro 80 TFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADL-L 158 (419) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~-~ 158 (419) ..+.+ +...+.+...+.-+... ...++.....+.+...+.+.......-..+ + T Consensus 1 ~~~~~---------------~~~~~~~~~~~~~~~~~-----------~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~ 54 (319) T protein:vir:94 1 MNKTI---------------KNATGMLKLNLQHFANK-----------SVEPGQTLLKNKHVGILERVTAVNAYSTPALI 54 (319) T ss_pred CCccc---------------ccccceeEeehhhhhcc-----------CCCcchHHHHHHHHHHHHHHHHHhhhhhhccc Confidence 00000 00001111111111111 122333344455555555433322211111 1 Q ss_pred c--eecccCcceeeeeeccccceeccccccceeecCcccccccccce--eeEEeeeEEEEEeehhhHHH-HhhHH-HH-- Q lcl|Aclame:pro 159 D--QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSF--DTITTTLKTVAHWLPITRQA-ADDNS-QL-- 230 (419) Q Consensus 159 ~--~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~--~~v~~~~~k~~~~~~vs~el-l~d~~-~~-- 230 (419) + ..-.+++.++||+.......-+ ....+ ....+++. ...+++-.+.-.+. | +.+ ...+. .+ T Consensus 55 N~~~e~~gg~tVkIp~i~~~gl~DY--~R~~g-------~~~g~vt~~~~t~tidqdR~~~F~-V-D~~D~~Etn~~l~a 123 (319) T protein:vir:94 55 SNDAIFMEGRSFTVMKGDTTELKDY--KRNAT-------NEFDHPKIEETTYFLDQEKYWGRF-V-DALDRKDTEGNIDI 123 (319) T ss_pred CcceEeccCcEEEEeeecccccccc--cCCCC-------cccCCcccceeEEEeecccccccc-c-chhhHhhhhchhhH Confidence 2 2234677899988764322111 11111 12223333 33333333332222 1 111 11222 23 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCc-EEEEe Q lcl|Aclame:pro 231 MGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD-GVVVH 309 (419) Q Consensus 231 ~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 309 (419) ...+.+.+...++-.+|...+..--.+ .........+....|+.+.+++..+.....+.. ..+|+ T Consensus 124 ~~i~~~~~~~~v~PEiDay~~skla~~--------------a~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vt 189 (319) T protein:vir:94 124 NYVVARQGAEVVAPYLDNLRFATLARN--------------KAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVS 189 (319) T ss_pred HHHHHHHHHHHhhhhhhHHHHHHHHhh--------------cccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeC Confidence 334455566666667787655421000 000111123445578888888888877766534 45689 Q ss_pred hHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCC--CCcCcEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 310 PQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVA--IAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 310 ~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~--~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) |..+..|.+-..-....-........+..+.|.|++|+.++. +..-.++++.. .+..... +--.+++.......| T Consensus 190 p~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~-~A~~~~~-k~~~~~~~~p~~~~~- 266 (319) T protein:vir:94 190 PTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVG-EVLASPI-QADLAKTNSNIPGMF- 266 (319) T ss_pred HHHHHHHHhhhhhhccccccccceeeeeceeecCeEEEEecccccccceEEEEcC-Ceeeeee-eeeeeeccCCCcccc- Confidence 999888865422111110111223356667899999997633 22333444443 3333322 212233222112222 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ...++...++|..+.+|++..++....+.. T Consensus 267 --a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~ 296 (319) T protein:vir:94 267 --GTLAEQLLYTGAFVPEHLQKYIFTIGGTEV 296 (319) T ss_pred --ceeeeeeeeeeeEEeccccceEEEeecCCc Confidence 357888999999999998555554332222 No 178 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=97.98 E-value=1.2e-06 Score=52.97 Aligned_cols=375 Identities=15% Similarity=0.113 Sum_probs=154.0 Q ss_pred CCccHHHHHH---HHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|Aclame:pro 1 MPPTPTLEEQ---RAALLARLDDTSLTTEQV--QEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTP 75 (419) Q Consensus 1 M~~~~~L~e~---~~~l~~~~~~~~~~~~~~--~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 75 (419) |++ +.|.|+ .++++...-.+++.++-. .+..++ -...++++.-+.++.-++.+.+..+.+.++. T Consensus 8 ~~K-~~l~EK~~~~a~~~E~~~~LKS~~~G~evknaied-l~K~~EL~~TlS~~~iEI~~~en~LNa~~E~--------- 76 (400) T protein:vir:93 8 MNK-PDLIEKQNRLAELKENNVSLKSQISGFEVKNAIED-LPKVQELEKTLSENSIEIIKIENELNAQEEK--------- 76 (400) T ss_pred ccc-chHHHHHHHHhhhhhhhhhhhhhhhcchhhhhhhh-chhHHHHHHhHhhcchhhhhhhhhhhhhhhh--------- Confidence 433 223333 222322222233332211 111111 1223344444444444443333333221111 Q ss_pred cccchhhhhh-HHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhH Q lcl|Aclame:pro 76 AEAGTFRSLA-QRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLV 154 (419) Q Consensus 76 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l 154 (419) +..|... +.....+....|..-.+...+..+.+..+..-. .+.|.+.+.....+|+-+...|...+..+.++ T Consensus 77 ---~KGK~kMt~~i~sq~A~~eF~~vL~~N~G~S~~k~AW~A~L----~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v 149 (400) T protein:vir:93 77 ---PKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKL----AENGVTITDTTFQLPRKLVESINTALLNTNPV 149 (400) T ss_pred ---hhhhHHHHHHHhhHHHHHHHHHHHhccCCchhhhhhhhhhH----hhcCcceeccchhccHHHHHHHHHhhhccCcc Confidence 1111111 111112222333333344444444333332222 23344445555677877777777777777666 Q ss_pred HhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhH-HHHhhH----HH Q lcl|Aclame:pro 155 ADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITR-QAADDN----SQ 229 (419) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~-ell~d~----~~ 229 (419) ...+.+..++. +.+..... +...|...-.|+.+.+...+|..-++.+ ++.++..|- ++..+. .. T Consensus 150 ~~vfHVT~~~~----~~V~~s~~-----s~~~Aq~HkdGqTK~eqa~~~~~~Tl~~--~~VY~~~S~Ae~~K~~~~sYse 218 (400) T protein:vir:93 150 FKVFHVTNVGA----LLVSRSFD-----SANEAQVHKDGQTKTEQAATLTIDTLEP--VMVYKLQSLAERVKRLQMSYSE 218 (400) T ss_pred eeeeeeccchh----hhHHhhhh-----hhhhhhhhccCCccccceeeeeeechhH--HHHHHHHHHHHHHHHhhhhHHH Confidence 65444433321 22211111 1235566667888877666665544444 333433332 233332 25 Q ss_pred HHHHHHHHHHHHHH-HHHHHHHHhccCcccccceeccccccccccccccc-cchhhhHHHHHHHHHHhhhhhccCCcEEE Q lcl|Aclame:pro 230 LMGYIQGRLTYGLR-FLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA-PATDEPPLVDIRRAKTVAEIAGFPPDGVV 307 (419) Q Consensus 230 ~~~~i~~~l~~a~~-~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (419) +.+||..+|++++. +..|++++-|+|++....+.....+......++.+ .++.+...+.+..++.-+.+..++. .++ T Consensus 219 l~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~~~Ttkaksagktpfadaieeavdfvrptagrr-yli 297 (400) T protein:vir:93 219 LYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRR-YLI 297 (400) T ss_pred HHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCce-EEE Confidence 79999999999999 89999999999998765554433322222111111 1111112233334443343332222 233 Q ss_pred Eeh-HHHHHHHHHhccCCce-eccCCccccCCCcccccce---eEecCCCCcCcEEEEeccceEEEEEecceEEEEeecc Q lcl|Aclame:pro 308 VHP-QDWESIELDQAPGSGV-FRVIANVQGEATPRIWGLN---VVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSH 382 (419) Q Consensus 308 ~~~-~~~~~l~~~kd~~g~~-~~~~~~~~~~~~~~l~G~p---v~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~ 382 (419) +.. ...+.|..++.+..+. ..+..+ +.....-.|+. |+.-.. .-..-++.|-. |.+ +-++++ .-+ T Consensus 298 vktedrkalldelrqatanahvriknd--daeiasevgvdeiivytgsk-alkptvlvdqk--yhi-dmqdlt----kvd 367 (400) T protein:vir:93 298 VKTEDRKALLDELRQATANAHVRIKND--DAEIASEVGVDEIIVYTGSK-ALKPTVLVDQK--YHI-DMQDLT----KVD 367 (400) T ss_pred EeccchHHHHHHHHhhccccceEeecc--hhhhhhhcCcceeeeeeccc-cccceeeeccc--ccc-chhhhh----hhh Confidence 333 3334445555443221 111111 00001111221 111111 11112333322 211 112221 011 Q ss_pred cchhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 383 ADFFTANTLVILAEFRANLAVYQPKAFVRVTFA 415 (419) Q Consensus 383 ~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 415 (419) .-.|.+|.--+.++..-.|-+.--+|-+++++. T Consensus 368 afewktnsnmilvetltsghvetynagavitvs 400 (400) T protein:vir:93 368 AFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred hheeccCCceEEEeecccCcceeeccceeEeeC Confidence 112455444445555555555444555555555 No 179 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=97.96 E-value=2.2e-06 Score=51.61 Aligned_cols=307 Identities=10% Similarity=0.055 Sum_probs=154.8 Q ss_pred HHhHHHHHHHHHhhhh-----hhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHH----HHhhhhhhhHHhhc Q lcl|Aclame:pro 88 FADSDGLREYRARDKR-----GQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIV----PTTPDLPLLVADLL 158 (419) Q Consensus 88 ~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i----~~~~~~~~~l~~~~ 158 (419) .-+...++.+...+.. .....+...+.. ......+...+++...+|..+..-| ++.+........++ T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~----da~d~~~~~~~~~~~~i~~~l~~~i~p~~~~~~~~p~~a~~l~ 76 (336) T protein:vir:10 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAM----DAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELV 76 (336) T ss_pred CchHHHHHHHhhcCeeecchhhhhhhhHHHhhh----hhhhccCccccCCCchhHHHHHhhcccceeeehhhhhhhhhhc Confidence 0000111111111000 000001111111 1112222333344455666655444 44445555566677 Q ss_pred ceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHH-HHhhHH---HHHHHH Q lcl|Aclame:pro 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQ-AADDNS---QLMGYI 234 (419) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~e-ll~d~~---~~~~~i 234 (419) ++.+++....+.... ...-..+.+.+.+.+...|..+......+...+.++..+.++.. +-.-.. ++.+-- T Consensus 77 pv~t~g~W~~~~~~~-----~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~K 151 (336) T protein:vir:10 77 GESKKGDWTTLVAAF-----ITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASEL 151 (336) T ss_pred cccccCCccceeEEE-----eeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHH Confidence 765544322111111 11112346678888889999998878888889999999999944 444332 578888 Q ss_pred HHHHHHHHHHHHHHHHHhccCcccccceeccccccccccc-cc-cccchhhhHHHHHHHHHHhhhhhcc------CCcEE Q lcl|Aclame:pro 235 QGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQP-KP-TAPATDEPPLVDIRRAKTVAEIAGF------PPDGV 306 (419) Q Consensus 235 ~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~ 306 (419) ....++++.+.+|+-.+.|++..+..|++|.|.+...... +. ...++....++|+..++..+..... .+..+ T Consensus 152 a~aA~~ale~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL 231 (336) T protein:vir:10 152 NYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRM 231 (336) T ss_pred HHHHHHHHHHhhCcEEEEeccccceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceE Confidence 8899999999999999999988889999999887643332 22 2334557789999988888765332 36678 Q ss_pred EEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEE---e-cceEEEEeecc Q lcl|Aclame:pro 307 VVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWS---R-QGITVLMTDSH 382 (419) Q Consensus 307 ~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~---~-~~~~i~~~~~~ 382 (419) +|.+..+..|..- ...|..++ ..... .+-++.++..+.+.... |+. .+++.. . ....+.+...- T Consensus 232 ~LP~~~~~~Ls~~-n~~g~Tvl--~~lk~----n~Pnl~i~t~pEl~~a~---G~~--~~l~~~~~~~~~t~~~~~p~~~ 299 (336) T protein:vir:10 232 GLPPTAMSDLSKT-NQYGLAAA--AKLKD----IFPKLEFVTIPEYDTAS---GRL--VQLWAPRVEGKDTATCGFTEKM 299 (336) T ss_pred EecHHHHHhccCC-CccCccHH--HHHHH----hcCccEEEEccccccCC---Cce--EEEEEEecCCCcceeeecchhh Confidence 9999987776432 22121111 00000 11122233322221110 110 111111 0 00111111000 Q ss_pred c-chhhc--CcEEEEEEEEecc-EEecccceEEEEec Q lcl|Aclame:pro 383 A-DFFTA--NTLVILAEFRANL-AVYQPKAFVRVTFA 415 (419) Q Consensus 383 ~-~~~~~--~~~~~r~~~r~d~-~~~~~~a~~~~~~~ 415 (419) . ...+. -.+..-+..|..| .+++|.||++++-= T Consensus 300 ~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 300 RAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred hccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 0 00011 1223334555555 45779999998765 No 180 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=97.92 E-value=2.2e-06 Score=51.57 Aligned_cols=304 Identities=11% Similarity=0.082 Sum_probs=155.9 Q ss_pred HHhHHHHHHHHHhhh---hh--hhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHH----HHhhhhhhhHHhhc Q lcl|Aclame:pro 88 FADSDGLREYRARDK---RG--QFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIV----PTTPDLPLLVADLL 158 (419) Q Consensus 88 ~~~~~~~~~~~~~~~---~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i----~~~~~~~~~l~~~~ 158 (419) .-+...++.+...+. .. ....++..+. .......+...+++...+|..+...| ++..........++ T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~a----~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~ 76 (336) T protein:vir:78 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYA----MDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELV 76 (336) T ss_pred CchHHHHHHHhccCeecchhhhhhhHHHHHHH----HhhhhhccccccCCCcchHHHHHHhcccceeeehhhhhhhhhhc Confidence 000001111110000 00 0000111111 11112222233344444565555433 44455555566666 Q ss_pred ceecccCc---ceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-H---HHH Q lcl|Aclame:pro 159 DQQNADYN---VLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-S---QLM 231 (419) Q Consensus 159 ~~~~~~~~---~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~---~~~ 231 (419) ++.+++.. .+.|+.. ...+.+.+.+-+...|..+......+-..+.++..+.++.+=+..+ . ++. T Consensus 77 ~v~t~g~W~~~~~~~~~~--------e~~G~a~~ygd~~D~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~ 148 (336) T protein:vir:78 77 GESKKGDWTTLVAAFITA--------EPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLA 148 (336) T ss_pred ccccCCCccccEEEEeee--------ecceeeEEeecccCCCeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcH Confidence 66555332 2223222 2335677888889999999999999999999999999996544433 2 578 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccc--cccchhhhHHHHHHHHHHhhhhhcc------CC Q lcl|Aclame:pro 232 GYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKP--TAPATDEPPLVDIRRAKTVAEIAGF------PP 303 (419) Q Consensus 232 ~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~------~~ 303 (419) +--....++++.+.+|+-.+.|++..+..|++|.+.+.......+ ....+....++|+..++..+...-. .+ T Consensus 149 ~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~ 228 (336) T protein:vir:78 149 SELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAV 228 (336) T ss_pred HHHHHHHHHHHHHhhCeEEEEeccccceEEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccc Confidence 888888899999999999999998889999999988764433222 2345567789999988887744331 23 Q ss_pred cEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEe----cceEEEEe Q lcl|Aclame:pro 304 DGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSR----QGITVLMT 379 (419) Q Consensus 304 ~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~----~~~~i~~~ 379 (419) ..++|.+..+..|... ...|-.++ ..... .+-++.|+..+.+... -|+- .+++... ....+.+. T Consensus 229 ~tL~Lp~~~~~~L~~~-n~~g~tv~--~~lk~----n~Pnl~i~t~pel~~A---gg~~--~~~~~~~~~~~~t~~~~~p 296 (336) T protein:vir:78 229 LHMGLPPTAMSDLSKT-NQYGLSAA--AKLKE----IFPKLEFVTIPEYDTA---SGRL--VQLWAPRVEGKDTATCGFT 296 (336) T ss_pred eEEEechHHHHhccCC-CccCccHH--HHHHH----hcCccEEEEccccccc---Ccce--EEEEEeeccCCcceeeecc Confidence 4688999988887532 22221111 00000 0112234333333211 0110 1111000 01111111 Q ss_pred ecc---cchhhcCcEEEEEEEEecc-EEecccceEEEEec Q lcl|Aclame:pro 380 DSH---ADFFTANTLVILAEFRANL-AVYQPKAFVRVTFA 415 (419) Q Consensus 380 ~~~---~~~~~~~~~~~r~~~r~d~-~~~~~~a~~~~~~~ 415 (419) ..- .-......+..-...|..| .+++|.||++++-= T Consensus 297 ~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 297 EKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred hhhhccceeecCceeEeccccceeeeeeeccchheeeccC Confidence 000 0000111223334555555 45779999988755 No 181 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=97.90 E-value=1.2e-05 Score=47.59 Aligned_cols=295 Identities=7% Similarity=-0.035 Sum_probs=128.2 Q ss_pred ccccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhh Q lcl|Aclame:pro 73 LTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPL 152 (419) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~ 152 (419) ...--..-.+...... +...+.+...+.-+.+ -...++....-+.+...+.+.....+ T Consensus 1 ~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~-----------~~~~~nt~~l~~k~~~~LD~~~~~~~ 58 (329) T protein:vir:10 1 MDGIFITGVKTMNKEI-----------KNATGKLKLNLQHFAN-----------KSVEPGDTLLKNKHVGILEKVTAANS 58 (329) T ss_pred CCceEEechhhhhhhh-----------hcccceeEEehhhhcC-----------CccCCchhHHHHHHHHHHHHHHHhhc Confidence 0000000000000000 0001111111111111 11234444555566666655443322 Q ss_pred hHHh-hcc--eecccCcceeeeeeccccceeccccccceeecCcccccccccc--eeeEEeeeEEEEEeehhhHHH-Hhh Q lcl|Aclame:pro 153 LVAD-LLD--QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLS--FDTITTTLKTVAHWLPITRQA-ADD 226 (419) Q Consensus 153 ~l~~-~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~--~~~v~~~~~k~~~~~~vs~el-l~d 226 (419) .-.. +++ ....+++.++||+....... ......+ ...++++ ....+++-.+.-.+. |. .+ ... T Consensus 59 ~s~~~~~N~~~e~~~g~tVkIp~i~~~gl~--DY~R~~g-------~~~g~vt~~~~t~tidqdR~~~F~-VD-~~D~dE 127 (329) T protein:vir:10 59 YSAPAVISNDAIFMQGRSFTVIKGDVTELK--DYKRNAT-------NEFDHPQIQETTYFLDQEKYWGRF-VD-ALDRRD 127 (329) T ss_pred eeeeeecccceeeccCcEEEEeeecccccc--cccCCCC-------ccccccccceeEEEeecccceeee-cc-hhhHhh Confidence 1111 122 23456788999987542211 1111111 1222333 333444444333332 11 11 112 Q ss_pred HH-H--HHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCC Q lcl|Aclame:pro 227 NS-Q--LMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPP 303 (419) Q Consensus 227 ~~-~--~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 303 (419) +. . +...+.+.....++..+|...+.---++ .........+....|+.+.+++..+.....+. T Consensus 128 tn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~--------------a~~~~~~~~t~~nay~~i~~a~~~Lde~~vp~ 193 (329) T protein:vir:10 128 TEGNIDINYVVAKQASEVVAPYLDNLRFATLARN--------------KAKHLTVGSGADAQYDAVLDVSVELDEIGAGA 193 (329) T ss_pred hhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhh--------------cccccccccCHHHHHHHHHHHHHHHHhcCCCC Confidence 21 2 3344556667777777887655411000 00011122344557888888888887765443 Q ss_pred c-EEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCC--CCcCcEEEEeccceEEEEEecceEEEEee Q lcl|Aclame:pro 304 D-GVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVA--IAQGTALVGGFRQGATLWSRQGITVLMTD 380 (419) Q Consensus 304 ~-~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~--~~~~~~~~~d~~~~~~~~~~~~~~i~~~~ 380 (419) . ..+++|..+..|.+...-....-........+..++|.|++|+.++. +..-.++++..+ +..... +--.+++.. T Consensus 194 ~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~~~~g~Vg~idG~~Ii~vps~~~k~in~ii~~~~-A~~~~~-K~~~~~~~~ 271 (329) T protein:vir:10 194 SRILFVTPKFYKGIKKFVIELPQGDNRQQVLGKGVQGELDGFTIVKVPSKMLQGVEAMAVIGE-VMASPI-QANEAKLNS 271 (329) T ss_pred CcEEEeCHHHHHHHHhhhhhhccccccccceeeeeeeeecCeEEEEecCCcccceeEEEEcCC-ceeeee-eeeeeeeeC Confidence 3 45689999888875321111000111223345667899999997643 233334444433 333222 212233322 Q ss_pred cccchhhcCcEEEEEEEEeccEEecccceEEEE-ecCCCC Q lcl|Aclame:pro 381 SHADFFTANTLVILAEFRANLAVYQPKAFVRVT-FAAATT 419 (419) Q Consensus 381 ~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~-~~aa~~ 419 (419) .... ++...++...++|..+.+|++..++. .+++++ T Consensus 272 p~~~---~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a~~ 308 (329) T protein:vir:10 272 NVPG---MFGTLAEQMLYTGAFVPEHLQKYIFTIGGKEVE 308 (329) T ss_pred CCCc---cchheeeeeeeeeeEEEccccCEEEEecccCcc Confidence 1121 23357889999999999998555443 333333 No 182 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=97.82 E-value=1.5e-05 Score=47.08 Aligned_cols=265 Identities=12% Similarity=-0.006 Sum_probs=118.3 Q ss_pred ccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc-------cCcceeeeeeccccceeccccccceeecCcccccccc Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA-------DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST 200 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 200 (419) +...--..+|+.+....+..+++..++..++..-.- .+..++|++........+. .+ .+..+...+ T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~-----~~--~~~~~~~~~ 73 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTP-----TG--DISGQNKNN 73 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeeccc-----Cc--ccCCcccCc Confidence 222212347899988888888888888777654221 2446666643221111110 01 111122223 Q ss_pred ccee--eEEeeeEEEEEeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhc-cCcccccceecccccccccccccc Q lcl|Aclame:pro 201 LSFD--TITTTLKTVAHWLPITRQAADDNSQLMGYIQGRLTYGLRFLRDRQLLNG-NGSTEMQGILTTPGIGTYQQPKPT 277 (419) Q Consensus 201 ~~~~--~v~~~~~k~~~~~~vs~ell~d~~~~~~~i~~~l~~a~~~~~d~~il~G-~g~~~p~Gi~~~~~~~~~~~~~~~ 277 (419) .+-. .+.++-+|.-.+--=..|...+..++++++... .++++..+|..++.- .+.. +.. . . T Consensus 74 l~e~~v~l~id~~k~va~~v~d~E~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~a-~~~-----------~---g 137 (423) T protein:vir:17 74 LISGKATGRVGNYITVAVEYQQLEEAIKLNQLEEILAPV-RQRIVTDLETELAHFMMNNG-ALS-----------L---G 137 (423) T ss_pred cccceeEEEeeceeeeeeeecHHHHhcChhHHHHHHHHH-HHHHHHHHHHHHHHHHhhcc-ccc-----------c---c Confidence 3223 355555555554433444554556777766555 699999999988752 1110 000 0 0 Q ss_pred ccchhhhHHHHHHHHHHhhhhhccCC--cEEEEehHHHHHHHHHh----ccCCceeccCCccccCCCcccccceeEecCC Q lcl|Aclame:pro 278 APATDEPPLVDIRRAKTVAEIAGFPP--DGVVVHPQDWESIELDQ----APGSGVFRVIANVQGEATPRIWGLNVVSTVA 351 (419) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~k----d~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~ 351 (419) ...+....|+++.++-..+...+.+. -..+++|..+..|++.. ...+.. -..-..++..+++.|+.|+.|+. T Consensus 138 t~~t~~~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~--~~alr~g~i~G~i~GFdvy~Snn 215 (423) T protein:vir:17 138 SPNTPITKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLV--RTAWENAQIPTNFGGIRALMSNG 215 (423) T ss_pred cCCcccccHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccc--hHHHhhccceeeecceEEEEeCC Confidence 01111124677777777776665542 34689999988876421 111000 00111122346899999999999 Q ss_pred CCcCcEEEEeccc------eEEE---EE--ecceEEE--EeecccchhhcCcEEEEE---EEEeccEEe------cccce Q lcl|Aclame:pro 352 IAQGTALVGGFRQ------GATL---WS--RQGITVL--MTDSHADFFTANTLVILA---EFRANLAVY------QPKAF 409 (419) Q Consensus 352 ~~~~~~~~~d~~~------~~~~---~~--~~~~~i~--~~~~~~~~~~~~~~~~r~---~~r~d~~~~------~~~a~ 409 (419) +|..+...+...- .+.. .. .....+. +....+..-..|.+.|-+ ..+....++ +.+-| T Consensus 216 ip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~ 295 (423) T protein:vir:17 216 LASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTA 295 (423) T ss_pred CccccccceeceeeecccccccccccccccceeeeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEE Confidence 9964322221110 0000 00 0000111 111111011112222211 111112111 22233 Q ss_pred EEEEecCCCC Q lcl|Aclame:pro 410 VRVTFAAATT 419 (419) Q Consensus 410 ~~~~~~aa~~ 419 (419) .+. +++++ T Consensus 296 ~v~--~~~~~ 303 (423) T protein:vir:17 296 TVT--ADANS 303 (423) T ss_pred EEE--ecccc Confidence 322 22211 No 183 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=97.77 E-value=2e-05 Score=46.36 Aligned_cols=269 Identities=12% Similarity=0.001 Sum_probs=115.4 Q ss_pred ccCCcccccchhhhHHHHHhhhhhhhHHhhcceec-----c--cCcceeeeeeccccceeccccccceeecCcccccccc Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQN-----A--DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST 200 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~-----~--~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 200 (419) +...--..+|+.+...+...++...++..++..-. . .+..+++++-......-+.. ..+..+.-.+ T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~-------~~~~~~~~~d 73 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPT-------GDISGQNKNN 73 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCC-------ccccccccCc Confidence 22221224689999888888888888877765421 1 24456665533222111110 1111222223 Q ss_pred ccee--eEEeeeEEEEEeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhc-cCcccccceecccccccccccccc Q lcl|Aclame:pro 201 LSFD--TITTTLKTVAHWLPITRQAADDNSQLMGYIQGRLTYGLRFLRDRQLLNG-NGSTEMQGILTTPGIGTYQQPKPT 277 (419) Q Consensus 201 ~~~~--~v~~~~~k~~~~~~vs~ell~d~~~~~~~i~~~l~~a~~~~~d~~il~G-~g~~~p~Gi~~~~~~~~~~~~~~~ 277 (419) .+-. .+.++-+|.-.+--=+.|+..+..++++++... .++++..+|..++.- .+.. +. . .. T Consensus 74 l~e~~v~l~id~~k~va~~v~d~E~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~-~~----------~-~g--- 137 (423) T protein:vir:10 74 LISGKATGRVGNYITVAVEYQQLEEAIKLNQLEEILAPV-RQRIVTDLETELAHFMMNNG-AL----------S-LG--- 137 (423) T ss_pred cccceeEEEeeceeeeeeeechHHHhcChhhHHHHHHHH-HHHHHHHHHHHHHHHHhhcc-cc----------c-cc--- Confidence 3333 355555555544433445554555677766555 699999999998752 1110 00 0 00 Q ss_pred ccchhhhHHHHHHHHHHhhhhhccCC--cEEEEehHHHHHHHHHhc--cCCceeccCCccccCCCcccccceeEecCCCC Q lcl|Aclame:pro 278 APATDEPPLVDIRRAKTVAEIAGFPP--DGVVVHPQDWESIELDQA--PGSGVFRVIANVQGEATPRIWGLNVVSTVAIA 353 (419) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~kd--~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~ 353 (419) ...+....|+++.++-..+...+.+. -..+++|..+..|++... ......--..-..++..+++.|+.|+.|+.+| T Consensus 138 t~~t~~~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip 217 (423) T protein:vir:10 138 SPNTPITKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLA 217 (423) T ss_pred cCCcccchHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEEEeCCCc Confidence 01111124677777766666655542 346899999888764211 00000000111112234689999999999999 Q ss_pred cCcEEEEecc----ceEEE-----EEecceEEEEe----ecccchhhcCcEEEEE---EEEeccEEe------cccceEE Q lcl|Aclame:pro 354 QGTALVGGFR----QGATL-----WSRQGITVLMT----DSHADFFTANTLVILA---EFRANLAVY------QPKAFVR 411 (419) Q Consensus 354 ~~~~~~~d~~----~~~~~-----~~~~~~~i~~~----~~~~~~~~~~~~~~r~---~~r~d~~~~------~~~a~~~ 411 (419) ..+...+..+ ....+ .+.....+.+. ...+..-.-|.+.|-+ ..+....++ +++-|++ T Consensus 218 ~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v 297 (423) T protein:vir:10 218 SRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATV 297 (423) T ss_pred cccccccccceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEE Confidence 6432211100 00000 00011111110 0000000011111111 011111111 1111111 Q ss_pred E-------------EecCC----------------CC Q lcl|Aclame:pro 412 V-------------TFAAA----------------TT 419 (419) Q Consensus 412 ~-------------~~~aa----------------~~ 419 (419) . ++.++ +. T Consensus 298 ~a~~~~~~~g~~tv~i~p~~i~~~~~~~~~~v~a~~a 334 (423) T protein:vir:10 298 TADANSDSGGDVTVTLSGVPIYDTTNPQYNSVSRQVE 334 (423) T ss_pred EeeeeeccCCceeeeccCccccccCCccccccccccc Confidence 1 11111 00 No 184 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=97.76 E-value=4.5e-06 Score=49.88 Aligned_cols=307 Identities=10% Similarity=0.057 Sum_probs=152.4 Q ss_pred HHhHHHHHHHHHhhh---hh--hhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHH----HHhhhhhhhHHhhc Q lcl|Aclame:pro 88 FADSDGLREYRARDK---RG--QFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIV----PTTPDLPLLVADLL 158 (419) Q Consensus 88 ~~~~~~~~~~~~~~~---~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i----~~~~~~~~~l~~~~ 158 (419) .-+...++.+...+. .. ....++..+. .......+...+++...+|..+..-| ++..........++ T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~a----~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~ 76 (336) T protein:vir:10 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYA----MDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELV 76 (336) T ss_pred CchHHHHHHHhccCeecchhhhhhhHHHHHHH----HhhhhhccccccCCCcchHHHHHhhcCcceeeeeechhchhhhc Confidence 000001111110000 00 0000111110 11112222333344444565555433 34444444555566 Q ss_pred ceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH-H---HHHHHH Q lcl|Aclame:pro 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-S---QLMGYI 234 (419) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~-~---~~~~~i 234 (419) ++.+.+....+.... ...-..+.+.+.+.....|..+.....-.-..+.++..+.++.+=+..+ . ++.+-- T Consensus 77 ~v~t~g~w~~~~~~~-----~~~e~~G~a~~ygd~~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~K 151 (336) T protein:vir:10 77 GESKKGDWTTLVAAF-----ITAEPTTKVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASEL 151 (336) T ss_pred ccccCCCcceeeEEE-----EeeeeeeeEEEccccCCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHH Confidence 665543322111111 1111234556667788899999888888888999999999996644433 2 578888 Q ss_pred HHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccc--cccchhhhHHHHHHHHHHhhhhhcc------CCcEE Q lcl|Aclame:pro 235 QGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKP--TAPATDEPPLVDIRRAKTVAEIAGF------PPDGV 306 (419) Q Consensus 235 ~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~ 306 (419) ....++++.+.+|.-.+.|++..+..|++|.+.+.......+ ....+....++|+..++..+...-. .+..+ T Consensus 152 a~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL 231 (336) T protein:vir:10 152 NYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHM 231 (336) T ss_pred HHHHHHHHHHhhCeEEEEeecccceEEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEE Confidence 888899999999999999998889999999988764433222 2345567789999988887754331 23468 Q ss_pred EEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEe----cceEEEEeecc Q lcl|Aclame:pro 307 VVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSR----QGITVLMTDSH 382 (419) Q Consensus 307 ~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~----~~~~i~~~~~~ 382 (419) +|.+..+..|... ...|-.++ .-... ..-++.|+..+.+... -|+- .+++... ....+.+...- T Consensus 232 ~Lp~~~~~~L~~~-n~~g~tv~--~~lk~----n~Pnl~i~t~pel~~A---gg~~--~~~~~~~~~~~~t~~~~~P~~f 299 (336) T protein:vir:10 232 GLPPTAMSDLSKT-NQYGLSAA--AKLKE----IFPKLEFVTIPEYDTA---SGRL--VQLWAPRVEGKDTATCGFTEKM 299 (336) T ss_pred EechHHHHhccCC-CccCccHH--HHHHH----hCCccEEEEccccccc---CCce--EEEEEecccCCcceeeecChhh Confidence 8999988887532 22221111 00000 0112234433333211 0110 1111000 01111111000 Q ss_pred ---cchhhcCcEEEEEEEEecc-EEecccceEEEEec Q lcl|Aclame:pro 383 ---ADFFTANTLVILAEFRANL-AVYQPKAFVRVTFA 415 (419) Q Consensus 383 ---~~~~~~~~~~~r~~~r~d~-~~~~~~a~~~~~~~ 415 (419) .-......+..-...|..| .+++|.||++++-= T Consensus 300 ~~lpvq~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 300 RAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred hccceeecCceeEeccccceeeeeeeccchheeeccC Confidence 0000111223334555555 45679999988755 No 185 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=97.74 E-value=3.6e-06 Score=50.40 Aligned_cols=376 Identities=15% Similarity=0.104 Sum_probs=155.0 Q ss_pred CCc--cHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 1 MPP--TPTLEEQRAALLARLDDTSLTTE--QVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPA 76 (419) Q Consensus 1 M~~--~~~L~e~~~~l~~~~~~~~~~~~--~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 76 (419) |.+ .-+-+.+.+++..-.-.++.... ++.+..++. ...++++.-+.++.-++.+.+..+.+.++ T Consensus 1 mnkpdliekqnrlaelkennvslksqisgfevknaiedl-~K~~ELe~TlSe~~iEI~k~en~LN~~eE----------- 68 (393) T protein:vir:16 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDL-PKVQELEKTLSENSIEIIKIENELNAQEE----------- 68 (393) T ss_pred CCCcchhhhhhhhhhhhhcccchhhhccchhhhhhhhhc-hhHHHHHHhHhhcchhhhhhhhhhhhhhh----------- Confidence 543 33334444444332222222211 111111111 22344444444444444433333322211 Q ss_pred ccchhhhhh-HHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHH Q lcl|Aclame:pro 77 EAGTFRSLA-QRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVA 155 (419) Q Consensus 77 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~ 155 (419) .+..|... +.....+....|..-.+...+..+.+..+..-. .+.|.+.+.....+|+-+...|...+..+.++. T Consensus 69 -~~KGK~kMt~~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L----~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~ 143 (393) T protein:vir:16 69 -KPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKL----AENGVTITDTTFQLPRKLVESINTALLNTNPVF 143 (393) T ss_pred -cchhhHHHHHHHhhHHHHHHHHHHHhccCCchhhhhhhhhhH----hhcCcceeccchhccHHHHHHHHHhhhccCcce Confidence 01111111 111112222333333344444444333332222 233444455556778777777777777776665 Q ss_pred hhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhH-HHHhhH----HHH Q lcl|Aclame:pro 156 DLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITR-QAADDN----SQL 230 (419) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~-ell~d~----~~~ 230 (419) ..+.+..++ .+.+..... +...|...-.|+.+.+...+|..-++.+ ++.++..|- ++..+. ..+ T Consensus 144 ~vfHVT~~~----~~~V~~s~~-----s~~eAq~HkdGqTK~eqa~~~~~~Tl~~--~~VY~~~S~Ae~~K~~~~sYsel 212 (393) T protein:vir:16 144 KVFHVTNVG----ALLVSRSFD-----SANEAQVHKDGQTKTEQAATLTIDTLEP--VMVYKLQSLAERVKRLQMSYSEL 212 (393) T ss_pred eeeeeccch----hhhHHhhhh-----hhhhhhhhccCCccccceeeeeeechhH--HHHHHHHHHHHHHHHhhhhHHHH Confidence 544433332 122211111 1235566667888877666655444443 334443333 233442 247 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHhccCcccccceeccccccccccccccc-cchhhhHHHHHHHHHHhhhhhccCCcEEEE Q lcl|Aclame:pro 231 MGYIQGRLTYGLR-FLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA-PATDEPPLVDIRRAKTVAEIAGFPPDGVVV 308 (419) Q Consensus 231 ~~~i~~~l~~a~~-~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (419) .+||..+|++++. +..|++++-|+|++....+.....+......++.+ .++.+...+.+..++.-+.+..++. .+++ T Consensus 213 ~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagktpfadaieeavdfvrptagrr-yliv 291 (393) T protein:vir:16 213 YNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRR-YLIV 291 (393) T ss_pred HHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCce-EEEE Confidence 8999999999999 89999999999998755554433322221111111 1111112233334443343332222 2334 Q ss_pred ehH-HHHHHHHHhccCCce-eccCCccccCCCcccccce---eEecCCCCcCcEEEEeccceEEEEEecceEEEEeeccc Q lcl|Aclame:pro 309 HPQ-DWESIELDQAPGSGV-FRVIANVQGEATPRIWGLN---VVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHA 383 (419) Q Consensus 309 ~~~-~~~~l~~~kd~~g~~-~~~~~~~~~~~~~~l~G~p---v~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~ 383 (419) ... ..+.|..++.+..+. ..+..+-+. ...-.|+. |+.-.. .-..-++.|-. |.+ +-++++ .-+. T Consensus 292 ktedrkalldelrqatananvriknddte--iasevgvdeiivytgsk-alkptvlvdqk--yhi-dmqdlt----kvda 361 (393) T protein:vir:16 292 KTEDRKALLDELRQATANANVRIKNDDTE--IASEVGVDEIIVYTGSK-ALKPTVLVDQK--YHI-DMQDLT----KVDA 361 (393) T ss_pred eccchHHHHHHHHhhhccCceeeeccchh--hhhhcCcceeeeeeccc-cccceeeeccc--ccc-chhhhh----hhhh Confidence 333 333444554433221 111111000 00111221 111111 11112333322 111 111211 0111 Q ss_pred chhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 384 DFFTANTLVILAEFRANLAVYQPKAFVRVTFA 415 (419) Q Consensus 384 ~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 415 (419) -.|.+|.--+.++..-.|-+.--+|-+++++. T Consensus 362 fewktnsnmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 362 FEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred heeccCCceEEEeecccCcceeeccceeEeeC Confidence 12455444445555555555444555555555 No 186 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=97.70 E-value=2.7e-05 Score=45.59 Aligned_cols=388 Identities=13% Similarity=0.071 Sum_probs=154.7 Q ss_pred CCccHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH-HHHHHHHhhcccccc-- Q lcl|Aclame:pro 1 MPPTPTLEEQ-RAALLARLDDTSLTTEQVQEIVAEARGLADALQA-ESDRAAARAALLRTAP-PAPKGPADGGTPLTP-- 75 (419) Q Consensus 1 M~~~~~L~e~-~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~-~~~~~~~~~~~l~~~~-~~~~~~~~~~~~~~~-- 75 (419) .|+.+.++.+ .++-.+++..+......... ...++.+ .+....--+++.++.+ +.+.....+...... T Consensus 258 ap~~adirA~~~aae~~r~aaI~a~fa~f~~-------~~a~l~a~~l~d~~~s~d~ar~~lL~~l~~~~~p~~~~~~~~ 330 (693) T protein:vir:95 258 APTEADIRARILAEESGRRSAITAAFGAFST-------GHAELLATCLNDMNITVDQAREKLLAAIGADTQPAAALSAGA 330 (693) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHHHhccC-------ChHHHHHHHHhhcCCCHHHHHHHHHHHHhhccCCCCCcCcCc Confidence 4443333222 12222222222222221111 0011100 0000000011111111 111110011000000 Q ss_pred -cccchhhhhhHHHHhHHHHHHHHHhhhh-hh-hhHHHHHHHHHHhhhccccccc----------ccCCcccccchhhhH Q lcl|Aclame:pro 76 -AEAGTFRSLAQRFADSDGLREYRARDKR-GQ-FQVEMRDIDPNRLLSRDAPAGT----------ITNPNVPHLPQLVPG 142 (419) Q Consensus 76 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~p~~~~~ 142 (419) ........+++.....-..+........ .. ....++++.......+.....+ .+.++ ...|..+.+ T Consensus 331 ~~~~~~g~~~~d~~~~al~~R~g~~~~~~~n~~~g~~L~elAr~~L~~rg~~~~~~~~~~~~~~a~~htT-SDFp~IL~~ 409 (693) T protein:vir:95 331 HIHAGNGNLVGDSVRASVLARIGRGERQADNAYNGMTLRELARASLVDRGIGVASLNAPQMVGLAFTHTS-SDFGLILLD 409 (693) T ss_pred cccCCchhHHHHHHHHHHHHhcCcccccCCccccCCcHHHHHHHHHHhcCCccCCCCHHHHHHHHHhcCc-chhHHHHHH Confidence 0000000111111100000000000000 00 0011222222222222211110 00011 112233333 Q ss_pred HHHHh-----hhhhhhHHhhcceeccc-CcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEe Q lcl|Aclame:pro 143 IVPTT-----PDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHW 216 (419) Q Consensus 143 ~i~~~-----~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~ 216 (419) .+... ......++..+...++. .+..+..+ ...-+...-|.|+++..-....=..-++.+.+++.. T Consensus 410 ~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~--------lg~~~~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~ 481 (693) T protein:vir:95 410 VANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVG--------LGEFSSLRQVREGAEYKYVTLGERGEQIILATYGEL 481 (693) T ss_pred HHHHHHHHHHHhhhhHHHHHhccCCCCcccccceee--------cCCCCChhhcCCCCceeeeecCCccceeehhhcCCe Confidence 22221 11223344444433332 11111111 112234556788888765555445567889999999 Q ss_pred ehhhHHH-HhhHHHHHHHHHHHHHHHHHHHHHHHHH---hccCc-ccccceeccccccccccccccccchhhhHHHHHHH Q lcl|Aclame:pro 217 LPITRQA-ADDNSQLMGYIQGRLTYGLRFLRDRQLL---NGNGS-TEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRR 291 (419) Q Consensus 217 ~~vs~el-l~d~~~~~~~i~~~l~~a~~~~~d~~il---~G~g~-~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 291 (419) +.||++. ++|--+...-|-..++++.++.++..++ .+++. ..-+.++..... +..++...+.+...+...+. T Consensus 482 ~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFhadH~---Nl~tga~sals~~sl~~a~~ 558 (693) T protein:vir:95 482 FSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFHADHS---NLLTGAASALSIDSLSKAKT 558 (693) T ss_pred eeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceeecccc---ccccccccccChHHHHHHHH Confidence 9999996 5777777777888899999999998555 34332 122333332111 11111111222223333333 Q ss_pred HHHhhh---------hhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccc-eeEecCCCCc--CcEE- Q lcl|Aclame:pro 292 AKTVAE---------IAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGL-NVVSTVAIAQ--GTAL- 358 (419) Q Consensus 292 ~~~~~~---------~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~-pv~~~~~~~~--~~~~- 358 (419) ++..-+ .-+..|..|++.+........+-.+.. ....+.+.+...-+.|+ +|+.++.+.+ ++.| T Consensus 559 am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~---~~~a~~~~~~~NP~~~~~~vi~~prL~~~s~~~Wy 635 (693) T protein:vir:95 559 QMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSES---VPGADVNSGIVNPIRAFAQVIGEPRLDDASATAWY 635 (693) T ss_pred HHHHhhcchhccCCceeecccceEEecchHHHHHHHHhcccc---ccccccccccccchhccccccccceecCCCCCceE Confidence 333222 223467778888777766666543321 11112223333335554 5666666642 2222 Q ss_pred E-Eeccc-----eEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 359 V-GGFRQ-----GATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFA 415 (419) Q Consensus 359 ~-~d~~~-----~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 415 (419) + .|... +|+-+ .++..++. .+.|..|.+.|++...++.+++|-.++.+-.-+ T Consensus 636 l~a~~~~dtie~~yL~G-~~~P~ie~----~~gf~~dG~~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 636 MAAKKGSDTIEVAYLDG-VDTPYLEQ----QEGFTVDGVASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred EecCCCCCeEEEEEecC-CCCCeEee----cCCCCcceEEEEEEEeccCceeeccccccCCCC Confidence 2 22111 11111 22333332 234999999999999999999998877776655 No 187 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=97.68 E-value=2.8e-05 Score=45.53 Aligned_cols=264 Identities=11% Similarity=-0.001 Sum_probs=114.3 Q ss_pred ccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc-------cCcceeeeeeccccceeccccccceeecCcccccccc Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA-------DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST 200 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 200 (419) +...--..+|+.+.......++...++.+++..-.- .+..+++++....+..-+.. +.+..+.-.+ T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~-------~~~~~~~~~~ 73 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTET-------GDITGKDKNG 73 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccC-------cCCCCccccc Confidence 222222347899999888888888888777654221 14466666543222111100 0111122222 Q ss_pred cceee--EEeeeEEEEEeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccc Q lcl|Aclame:pro 201 LSFDT--ITTTLKTVAHWLPITRQ-AADDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPT 277 (419) Q Consensus 201 ~~~~~--v~~~~~k~~~~~~vs~e-ll~d~~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~ 277 (419) .+-.+ +.++-+|+..+ .++.+ ...+..++++++... ..+++..+|..++..--.+.+ +..| T Consensus 74 ~~e~~v~l~id~~k~~a~-~v~d~e~~l~i~~~~~~l~~a-~~ala~~vd~~l~~~l~~~a~----~~vg---------- 137 (423) T protein:vir:35 74 LFSAKATGKVGKYITVAV-EWTQIEEALKLNQLDQILSPI-HERMVTDLETELAHFMMNNGA----LSLG---------- 137 (423) T ss_pred cccceeeEEeccceeccc-eeCHHHHHhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhccc----cccc---------- Confidence 22233 44444444443 44444 445556787776655 477889999988752100011 0000 Q ss_pred ccchhhhHHHHHHHHHHhhhhhccCC-c-EEEEehHHHHHHHHHhc--cCCceeccCCccccCCCcccccceeEecCCCC Q lcl|Aclame:pro 278 APATDEPPLVDIRRAKTVAEIAGFPP-D-GVVVHPQDWESIELDQA--PGSGVFRVIANVQGEATPRIWGLNVVSTVAIA 353 (419) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~l~~~kd--~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~ 353 (419) ...+....|+++.++-..+...+.+. . ..+++|..+..|++-.. .......-..-..++..+++.|+.|+.|+.+| T Consensus 138 t~~t~~~~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp 217 (423) T protein:vir:35 138 SPNTAIKKWADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGNFGGIRALMSNGLA 217 (423) T ss_pred cccCCcchHHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccchhHHHhhccceeeecceEEEEcCCCc Confidence 01111234677777777776666553 2 34889999888763211 00000000011112234789999999999999 Q ss_pred cCcEEEE------------------eccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEe---ccEE------ecc Q lcl|Aclame:pro 354 QGTALVG------------------GFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRA---NLAV------YQP 406 (419) Q Consensus 354 ~~~~~~~------------------d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~---d~~~------~~~ 406 (419) ..+...+ +.+..+.. +...+....+..-..|.+.|-+...+ ...+ .++ T Consensus 218 ~~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~~-----~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t~~~~~~~~t~~~ 292 (423) T protein:vir:35 218 SRKQGDFDGAITVKTAPNVDYLSVKDSYQFTVA-----LTGATPSKTGFLKAGDQLKFTSTHWLNQQSKQTLYNGSTAMS 292 (423) T ss_pred cccccccccceeeccccccccccccccccceee-----eeeeeeccCCcEEecceEEeeeeeeccccccceeecccCCce Confidence 6321110 11110010 00011000110111121111111110 0000 001 Q ss_pred cceEE-------------EEecCCCC Q lcl|Aclame:pro 407 KAFVR-------------VTFAAATT 419 (419) Q Consensus 407 ~a~~~-------------~~~~aa~~ 419 (419) .=|++ +++.+++. T Consensus 293 ~~~~V~~~~~~~a~g~~~v~i~p~~~ 318 (423) T protein:vir:35 293 FTATVLEETNSTASGDVTVKLSGVPI 318 (423) T ss_pred eEEEEeccccccccCceeEEcccccc Confidence 11111 22221111 No 188 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=97.67 E-value=3e-05 Score=45.39 Aligned_cols=268 Identities=11% Similarity=-0.051 Sum_probs=100.9 Q ss_pred ccccCCcc-cccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccc-eecccccc--ceeecCccccccccc Q lcl|Aclame:pro 126 GTITNPNV-PHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTA-GAGSTWNK--AAVVPEGTAKPQSTL 201 (419) Q Consensus 126 ~~~~~~~~-~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--a~~v~Eg~~~~~~~~ 201 (419) -.++-... .+-.+++..-.++.+.+......-+.--++.- .+.|....+.. +.+..++. -.-+.-.+..+.... T Consensus 1 ~~~t~~sdl~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l--~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~ki 78 (315) T protein:vir:96 1 MATTVNSDLVIYNDTAQTAYLERNMDNLAVFNENSRAAIGL--NSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKI 78 (315) T ss_pred CceeeecceeeehhhhhhhHHhhhHHHHHHhhhhcCCcccc--cccccccccccccccccccchhhcccCCCccccceec Confidence 11111111 12223333333333333322222111110000 00000000000 00000000 000011111111221 Q ss_pred -ceeeEEeeeEEEEEeehh--hHHHHh---hHH-HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccc Q lcl|Aclame:pro 202 -SFDTITTTLKTVAHWLPI--TRQAAD---DNS-QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQP 274 (419) Q Consensus 202 -~~~~v~~~~~k~~~~~~v--s~ell~---d~~-~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~ 274 (419) +..++.. +...+.-++ +...+. +.| ....-|..++..+..+.+=...+.|.- +.+. ..+... T Consensus 79 t~~~dvaV--k~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~-----aai~---~~t~~~- 147 (315) T protein:vir:96 79 AADEMVSV--KVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQ-----GAIG---SNAGMN- 147 (315) T ss_pred ccccceeE--EEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-----hhhc---cccccc- Confidence 1122222 122233333 333332 222 233334444444444444333333221 0000 000000 Q ss_pred cccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCceeccCCcc--ccCCCcccccceeEecCCC Q lcl|Aclame:pro 275 KPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANV--QGEATPRIWGLNVVSTVAI 352 (419) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~--~~~~~~~l~G~pv~~~~~~ 352 (419) ............+.++..++-+....-..|+||..++..|.+ +. -....+...+. .+.. +..+|+||++++.| T Consensus 148 --~~~~~a~~~~~~l~dA~~klGD~~~~l~~~vMHS~v~~~L~~-q~-L~~~~~~~~~~~~~~~~-~~~lGkrViVdD~~ 222 (315) T protein:vir:96 148 --VSGELATEGKKVLTKGLRTMGDKASSIAIWVMDSTSYFDIVD-EA-IDNKLYEEAGVVVYGGT-PGTLGKPVLVTDQC 222 (315) T ss_pred --ccccccccCHHHHHHHHHHhcccccCeeEEEEchHHHHHHHH-hh-hhhhcccccceeEecCc-CcccccEEEEECCC Confidence 112223345677788888888887788899999999999987 21 11111111111 1122 33459999999999 Q ss_pred CcCcEEEEeccceEEEE-EecceEEEEeecccchhhcCcEEEEEEEEecc-EEecccceEEEEec-CCCC Q lcl|Aclame:pro 353 AQGTALVGGFRQGATLW-SRQGITVLMTDSHADFFTANTLVILAEFRANL-AVYQPKAFVRVTFA-AATT 419 (419) Q Consensus 353 ~~~~~~~~d~~~~~~~~-~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~-~~~~~~a~~~~~~~-aa~~ 419 (419) |..+++... .+...+ ....+.... .+ ..++-.+....|.++ -.++|..|..-+.+ .+|| T Consensus 223 P~~~~~gl~--~GAi~~~~~~~~~~~~-~~-----~~g~e~l~~~~r~e~tf~l~p~G~sw~~~~~~sPt 284 (315) T protein:vir:96 223 PATKIFGLV--AGAVMITESQAPGMRS-YQ-----IDDQENLAIGFRAEGTANVEVLGYKWKTKTNVNPA 284 (315) T ss_pred Ccceeeeee--cceeeecCCCcccccc-cc-----CCCcceeEEEEeeeeEeeeeeeeEEeecCCCcCCC Confidence 986544432 222222 111110110 00 112333444445555 35778877774321 2344 No 189 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=97.64 E-value=4.2e-06 Score=50.06 Aligned_cols=186 Identities=14% Similarity=0.021 Sum_probs=101.5 Q ss_pred EEEeehhhHHHHhhH------HHHHHHHHHHHHHHHHHHHHHHHHh----ccCcccccceeccccccccccccccccchh Q lcl|Aclame:pro 213 VAHWLPITRQAADDN------SQLMGYIQGRLTYGLRFLRDRQLLN----GNGSTEMQGILTTPGIGTYQQPKPTAPATD 282 (419) Q Consensus 213 ~~~~~~vs~ell~d~------~~~~~~i~~~l~~a~~~~~d~~il~----G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 282 (419) +- -.-+|.-++.|- -++.+...++++.++++..|+.++. +..+..|..-- .+...... .......+ T Consensus 1 iD-~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~--~~g~~~~~-~a~~t~~~ 76 (221) T protein:vir:17 1 MD-DLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQ--DGGFSVNI-GAGNTNNA 76 (221) T ss_pred CC-cchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCccccc--ccCcceec-cccccCCH Confidence 11 123455555432 2588899999999999999998864 33222222111 11111111 11222345 Q ss_pred hhHHHHHHHHHHhhhhhccCCc--EEEEehHHHHHHHHHhccC-Cceecc--CCccccC-CCcccccceeEecCCCCcC- Q lcl|Aclame:pro 283 EPPLVDIRRAKTVAEIAGFPPD--GVVVHPQDWESIELDQAPG-SGVFRV--IANVQGE-ATPRIWGLNVVSTVAIAQG- 355 (419) Q Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~~kd~~-g~~~~~--~~~~~~~-~~~~l~G~pv~~~~~~~~~- 355 (419) ..+++.+.++...+...+.+.. .++++|..+..|.+..+.. .++.+. .+....+ ..+.+.|++|+.|+.+|.. T Consensus 77 ~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~ 156 (221) T protein:vir:17 77 QAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLY 156 (221) T ss_pred HHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCccc Confidence 5678888888888888877643 3567999888887542211 111111 1111222 3457899999999999973 Q ss_pred -cEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 356 -TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 356 -~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) +-+..+...... ..... ..+...|.+ .=+.+.||+|...+|+=..|| T Consensus 157 gt~~~~~ag~~~~----~~~~~---~~yr~~fs~----------~~glv~~~~Avgtvkl~~~~~ 204 (221) T protein:vir:17 157 GTNLVTDPGDATT----SGENN---GSYRPAITD----------RAGLVFHKEAADTVEVLLPPS 204 (221) T ss_pred ccccccCCccccc----ccccc---ccccccccc----------eEEEEEcchheeeeeeecCCC Confidence 222222111100 00000 000011211 227789999999999999998 No 190 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=97.60 E-value=3.9e-05 Score=44.75 Aligned_cols=273 Identities=10% Similarity=0.000 Sum_probs=111.3 Q ss_pred ccccCCcccccch--hhhHHHHHhhhhhhhHHh---------hcceecccCcceeeeeeccccceeccccccceeecCcc Q lcl|Aclame:pro 126 GTITNPNVPHLPQ--LVPGIVPTTPDLPLLVAD---------LLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGT 194 (419) Q Consensus 126 ~~~~~~~~~~~p~--~~~~~i~~~~~~~~~l~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~ 194 (419) -+.+.-....+|+ .+...+.+...+.+.|.+ +......++..+++|....... +....+.+... T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g-----~~e~n~~~dt~ 75 (349) T protein:vir:94 1 MAITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDT-----SIEPNYSNDVY 75 (349) T ss_pred CCceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCC-----CcccccCCCCc Confidence 1112223445565 355555444444333332 1111123455566664322110 00111111110 Q ss_pred --ccccccc-ceeeEEeeeEEEEEee--hhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccc Q lcl|Aclame:pro 195 --AKPQSTL-SFDTITTTLKTVAHWL--PITRQAADDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIG 269 (419) Q Consensus 195 --~~~~~~~-~~~~v~~~~~k~~~~~--~vs~ell~d~~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~ 269 (419) ..+-.+. +..++-.....--++. .++.++- ..+..+.|.++++.-..+...+.+|. -.+|++...... T Consensus 76 ~~~~t~~kit~~~~~a~~~~r~kaw~~~Dla~~ls--G~dpm~~Ia~~va~yW~r~~q~~Lia-----~L~Gvf~~~~~~ 148 (349) T protein:vir:94 76 QDIATPRAIQTGEMMARVAYLNEGFGQADLTVELT--SQNPLQSVASRLDNFWQRQAQRRLIA-----TALGLYNDNVSA 148 (349) T ss_pred ccccccccccccceeeeeeeeccccchhHHHHHhh--CchHHHHHHHHHHHHHhhHHHHHHHH-----HHHhhhcccccc Confidence 1111221 2223322222222222 2333332 12566667777776666666665553 111222211000 Q ss_pred cc-----ccccccccchhhhHHHHHHHHHHhhhhh-----ccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCc Q lcl|Aclame:pro 270 TY-----QQPKPTAPATDEPPLVDIRRAKTVAEIA-----GFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATP 339 (419) Q Consensus 270 ~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~ 339 (419) .. ..-.....+...+....+..+...+... ...-++++||+.++..|++.+--. ++++.-....-+ T Consensus 149 ~~~~~~~~~~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~----~i~~s~~~~~i~ 224 (349) T protein:vir:94 149 TDAYHEQNDMVVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID----FIRDAENNTMFA 224 (349) T ss_pred cccccccCceeEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhh----hccCcccCcccc Confidence 00 0000000111112234444555444332 234467999999999998764311 122222233456 Q ss_pred ccccceeEecCCCCcC--------cEEEEeccceEEEEEecce--EEEEeecccchhhcCcEEEEEEEEeccEEecccce Q lcl|Aclame:pro 340 RIWGLNVVSTVAIAQG--------TALVGGFRQGATLWSRQGI--TVLMTDSHADFFTANTLVILAEFRANLAVYQPKAF 409 (419) Q Consensus 340 ~l~G~pv~~~~~~~~~--------~~~~~d~~~~~~~~~~~~~--~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~ 409 (419) +++|++|++++.||-. +.|++. .+.+.+..... .+++.+.....-..++..+....+ .++||..+ T Consensus 225 ty~G~~VivDD~~Pv~~~g~~~~yttylfg--~GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~---~~~hp~G~ 299 (349) T protein:vir:94 225 TYQGYRVIVDDSMTVVGQDTSRKFISIIFG--QGAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKT---WLLHPFGY 299 (349) T ss_pred eecCcEEEEeCCCccccCCCCceEEEEEee--cceEEeecCCCCcceeeecccccCCcceeEEEEEeeE---EEeeeeee Confidence 8999999999999841 122333 22322222221 233333221000113334444433 35677776 Q ss_pred EEEEe----------cCCCC Q lcl|Aclame:pro 410 VRVTF----------AAATT 419 (419) Q Consensus 410 ~~~~~----------~aa~~ 419 (419) ..-+- +.+|| T Consensus 300 s~~~a~v~~~~~~~~~~sPt 319 (349) T protein:vir:94 300 SFTSAVITGNGTETIARSAS 319 (349) T ss_pred eecccccCCCccccccCCCC Confidence 66542 23455 No 191 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=97.53 E-value=5e-05 Score=44.17 Aligned_cols=300 Identities=9% Similarity=-0.066 Sum_probs=143.6 Q ss_pred hhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceec Q lcl|Aclame:pro 102 KRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAG 181 (419) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (419) ++...+..+.............+ .......+...+...+.....+++.+++.+++++|.--.....-... ..++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~----~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~-~g~ia 75 (337) T protein:vir:10 1 MRKETRQAYEKYAAQIAKLNDTG----DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSV-SGPIA 75 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChh----hhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeecc-Cccee Confidence 00000000110100000000000 11122233345566677788888999999999988654433322211 11111 Q ss_pred cccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCc-- Q lcl|Aclame:pro 182 STWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLNGNGS-- 256 (419) Q Consensus 182 ~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~G~g~-- 256 (419) +...+ +.+...|..-..++.-.+..++.-.-..|+.+.|+.. ++|...+++.+.++++.-+=.--++|+-. T Consensus 76 grt~t----~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~ 151 (337) T protein:vir:10 76 SRTDT----TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAA 151 (337) T ss_pred eeecC----CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeecc Confidence 11111 1122233333456777777777777788899988764 57999999999998876666666677542 Q ss_pred -----cccc------ceeccc-----------cccccccccccccchhhhHHHHHHHHHHh-hhhhccCC--cEEEEehH Q lcl|Aclame:pro 257 -----TEMQ------GILTTP-----------GIGTYQQPKPTAPATDEPPLVDIRRAKTV-AEIAGFPP--DGVVVHPQ 311 (419) Q Consensus 257 -----~~p~------Gi~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~~~~~ 311 (419) ..|. |++... +............+....+...+.+++.. +...+.+. -+.+|... T Consensus 152 ~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~d 231 (337) T protein:vir:10 152 TTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRE 231 (337) T ss_pred CCChhhCcCccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchh Confidence 1343 333110 00000000011111222222234455554 35555543 25667766 Q ss_pred HHHH-HHHHhccCCceeccCCccc-c--CCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 312 DWES-IELDQAPGSGVFRVIANVQ-G--EATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 312 ~~~~-l~~~kd~~g~~~~~~~~~~-~--~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) ..+. ...+-...+.+ ..... + ....++.|+|.+.-+++|.+.+++.-++...+++-++..+=.+.+.. . T Consensus 232 Lladk~~~l~n~~~~p---tE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p----~ 304 (337) T protein:vir:10 232 LLHDKYFPIVNATQAP---TERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP----E 304 (337) T ss_pred hhhHHhhHHhccCCCc---HHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEcc----c Confidence 5542 11121111110 00000 0 11357999999999999999999999988777665554444432221 2 Q ss_pred cCcEEEEEEEEeccEEecccceEEE---EecCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRV---TFAAA 417 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~---~~~aa 417 (419) +|.+.-+-..--++.|-+..+++.+ +++.+ T Consensus 305 r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:10 305 RDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred cccccchhhccceeeeeccccEEEEeceeecCC Confidence 2222222222233444444444443 44444 No 192 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=97.51 E-value=5.3e-05 Score=44.01 Aligned_cols=300 Identities=9% Similarity=-0.067 Sum_probs=143.4 Q ss_pred hhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceec Q lcl|Aclame:pro 102 KRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAG 181 (419) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (419) ++...+..+.............+ .......+...+...+.....+++.+++.+++++|.--.....-... ..++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~----~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~-~g~ia 75 (337) T protein:vir:79 1 MRKETRQAYEKYAAQIAKLNDTG----DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSV-SGPIA 75 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChh----hhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeecc-Cccee Confidence 00000000111100000000000 11112233335566677778888999999999988654433322211 11111 Q ss_pred cccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCc-- Q lcl|Aclame:pro 182 STWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLNGNGS-- 256 (419) Q Consensus 182 ~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~G~g~-- 256 (419) +...+ +.+...|..-..++.-.+..++.-.-..|+.+.|+.. ++|...+++.+.++++.-+=.--++|+-. T Consensus 76 grt~t----~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~ 151 (337) T protein:vir:79 76 SRTDT----TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAA 151 (337) T ss_pred eeecC----CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeecc Confidence 11111 1122233333456677777777777788899988764 57999999999998876666666677542 Q ss_pred -----cccc------ceeccc-----------cccccccccccccchhhhHHHHHHHHHHh-hhhhccCC--cEEEEehH Q lcl|Aclame:pro 257 -----TEMQ------GILTTP-----------GIGTYQQPKPTAPATDEPPLVDIRRAKTV-AEIAGFPP--DGVVVHPQ 311 (419) Q Consensus 257 -----~~p~------Gi~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~~~~~ 311 (419) ..|. |++... +............+....+...+.+++.. +...+.+. -+.+|... T Consensus 152 ~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~d 231 (337) T protein:vir:79 152 TTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGRE 231 (337) T ss_pred CCChhhCcCccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchh Confidence 1343 333110 00000000011112222222234455554 35555543 25667766 Q ss_pred HHHH-HHHHhccCCceeccCCccc-c--CCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 312 DWES-IELDQAPGSGVFRVIANVQ-G--EATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 312 ~~~~-l~~~kd~~g~~~~~~~~~~-~--~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) ..+. ...+-...+.+ ..... + ....++.|+|.+.-+++|.+.+++.-++...+++-++..+=.+.+.. . T Consensus 232 Lladk~~~l~n~~~~p---tE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p----~ 304 (337) T protein:vir:79 232 LLHDKYFPIVNATQAP---TERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP----E 304 (337) T ss_pred hhhHHhhHHhccCCCc---HHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEcc----c Confidence 5542 11121111110 00000 0 11357999999999999999999999988777665554444432221 2 Q ss_pred cCcEEEEEEEEeccEEecccceEEE---EecCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRV---TFAAA 417 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~---~~~aa 417 (419) +|.+.-+-..--++.|-+..+++.+ +++.+ T Consensus 305 r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 305 RDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred cccccchhhccceeeeeccccEEEEeceeecCC Confidence 2222222222233444444444443 44444 No 193 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=97.45 E-value=6.5e-05 Score=43.54 Aligned_cols=306 Identities=9% Similarity=-0.015 Sum_probs=142.5 Q ss_pred hhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceec Q lcl|Aclame:pro 102 KRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAG 181 (419) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (419) ++...+..+..+............. .....+.+...+...+....++++-+++.+++++|.--.....-... ..++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~--~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv-~g~ia 77 (355) T protein:vir:18 1 MRQETRFKFNAYLTQLAKLNGISVD--DVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGV-TGTIA 77 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChh--HccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeecc-Cccee Confidence 0000000111110000000000000 01122334445566677778888899999999988654433322111 11111 Q ss_pred cccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCc-- Q lcl|Aclame:pro 182 STWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLNGNGS-- 256 (419) Q Consensus 182 ~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~G~g~-- 256 (419) +...+. +-..-.|..-..++.-.+..++.-.-..|+.+.|+.. ++|...+++.+.++++.-+=.--++|+-. T Consensus 78 grtdT~---~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~ 154 (355) T protein:vir:18 78 STTDTS---GDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTTRAD 154 (355) T ss_pred eccccC---CCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeec Confidence 100000 0011123333446667777777777788888888764 57999999999988876666666677542 Q ss_pred -----cccc------ceecc-----cc-c-ccc---------ccccccccchhhhHHHHHHHHHHh-hhhhccCC--cEE Q lcl|Aclame:pro 257 -----TEMQ------GILTT-----PG-I-GTY---------QQPKPTAPATDEPPLVDIRRAKTV-AEIAGFPP--DGV 306 (419) Q Consensus 257 -----~~p~------Gi~~~-----~~-~-~~~---------~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~ 306 (419) ..|. |++.. +. + ... ........+....+...+.+++.. +...+.+. -+. T Consensus 155 ~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~dLVv 234 (355) T protein:vir:18 155 TSDRVKNPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDPKLVA 234 (355) T ss_pred cCChhhCcCccccchhHHHHHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEE Confidence 1343 33310 00 0 000 000001112222222333455554 34444443 256 Q ss_pred EEehHHHHH-HHHHhccCCceeccCCccc-c--CCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecc Q lcl|Aclame:pro 307 VVHPQDWES-IELDQAPGSGVFRVIANVQ-G--EATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSH 382 (419) Q Consensus 307 ~~~~~~~~~-l~~~kd~~g~~~~~~~~~~-~--~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~ 382 (419) +|.....+. .-.+-...+.+ ..... + ....++.|+|.+.-+++|.+.+++.-++...+++-++..+=.+.+.. T Consensus 235 ivG~dLla~k~~~l~n~~~~p---tE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p 311 (355) T protein:vir:18 235 IVGRKLLADKYFPLVNKQQEN---TESLAADIIISQKRIGNLPAVRVPYFPANAVFVTTLENLSIYFMDESHRRSIDENP 311 (355) T ss_pred EEchhhhHHHHhHHhhccCCh---HHHHHHHHHHHHHhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEecc Confidence 777665442 22222222111 11110 1 11358999999999999999999999988777665544443332221 Q ss_pred c-----chhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 383 A-----DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 383 ~-----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) . ++...| .+|.++.+.-+.... .+...+.+++.. T Consensus 312 ~r~rie~y~s~N-e~YvVEd~~~~a~ie--ni~~~~~~~~~~ 350 (355) T protein:vir:18 312 KKDRVENYESMN-IDYVVEAYAAGCLLE--NITLGDFTAPAA 350 (355) T ss_pred ccccccchhhhc-ceeeeeccccEEEEe--eeeecCCCCccc Confidence 1 222222 245554444444443 333332222112 No 194 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=97.35 E-value=4.7e-05 Score=44.29 Aligned_cols=331 Identities=12% Similarity=0.010 Sum_probs=147.8 Q ss_pred HHHHHhhcccccccccchhh-hhhHHHHhHHHHHHHHHhhhhhh--hhHHHHHHHHHHhhhccccc------ccccCCcc Q lcl|Aclame:pro 63 PKGPADGGTPLTPAEAGTFR-SLAQRFADSDGLREYRARDKRGQ--FQVEMRDIDPNRLLSRDAPA------GTITNPNV 133 (419) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~------~~~~~~~~ 133 (419) +.+.. ............. ...........++.+...+..-. ....+. ....+ ....... ......+. T Consensus 1 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~~~~~~~-~~~~a-md~~~~~~~~~~~~~l~~~~~ 76 (379) T protein:vir:10 1 MPQIS--KIHSSLNARQMTQMVMDSADVTLDNLKHLESYGIHLNGRKNKLFE-LMQFA-MDSNDIGPIPTPLSPLSPVSI 76 (379) T ss_pred CCCcc--eeeeecCccccchhhhccccccHHHHHHHHhcCccccchhhhhhh-hhhhh-hccccccccccccCccccccc Confidence 00000 0000000000000 00000001111222221111100 000000 00000 1111111 01111122 Q ss_pred cccchhhh---HHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeee Q lcl|Aclame:pro 134 PHLPQLVP---GIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTL 210 (419) Q Consensus 134 ~~~p~~~~---~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~ 210 (419) ..+|+.+. ..+++.+-....+.+++++.+++....+.... ...-..+.+.+.+-+.+.|..+......+-.. T Consensus 77 ~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~-----~v~e~~G~A~~ygd~~d~pl~d~~~~~~~r~v 151 (379) T protein:vir:10 77 PGLIQFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQ-----RVLEGLGTAQPYTDGGNMALMSWTPTFETRTV 151 (379) T ss_pred cchHHHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEE-----eeeeeeeeeEEeccccCCCeeeeeeeeeeeee Confidence 23344332 33444444555566666665543322111111 11112356777788888888887777777777 Q ss_pred EEEEEeehhhHHHHhhHH----HHHHHHHHHHHHHHHHHHHHHHHhccC--cccccceeccccccccccc-------ccc Q lcl|Aclame:pro 211 KTVAHWLPITRQAADDNS----QLMGYIQGRLTYGLRFLRDRQLLNGNG--STEMQGILTTPGIGTYQQP-------KPT 277 (419) Q Consensus 211 ~k~~~~~~vs~ell~d~~----~~~~~i~~~l~~a~~~~~d~~il~G~g--~~~p~Gi~~~~~~~~~~~~-------~~~ 277 (419) +.++..+.++..=+..+. ++..--....++++...+|+-.++|.+ ..+..|++|.+++....+. ... T Consensus 152 ~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l~a~~t~atg~~~~t~W 231 (379) T protein:vir:10 152 VRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPNLPAYVAVPNGAGGSPLW 231 (379) T ss_pred EEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcceEEEEeCCCCcccccccCCccccccc Confidence 888888888765343332 588888899999999999999999953 4467799999887643222 123 Q ss_pred ccchhhhHHHHHHHHHHhhhhhcc-------CCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecC Q lcl|Aclame:pro 278 APATDEPPLVDIRRAKTVAEIAGF-------PPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTV 350 (419) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~ 350 (419) ...+....++|+..++..+...-. .+..++|.+..+..|... +..|-.++ ..... .+-++.++..+ T Consensus 232 a~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~-n~~g~Tvl--~~lk~----n~Pnl~i~t~p 304 (379) T protein:vir:10 232 AQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP-TELGYSVA--QYMRE----SYPNVTFVSAP 304 (379) T ss_pred ccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc-cccCccHH--HHHHH----hcCCcEEEEcc Confidence 345677789999988877653311 122688899988887643 11121111 00000 11123344333 Q ss_pred CCCcCcEEEEeccceEEEEEe-cce------EEEE-eecccchhhc-------CcEEEEEEEEec-cEEecccceEEEEe Q lcl|Aclame:pro 351 AIAQGTALVGGFRQGATLWSR-QGI------TVLM-TDSHADFFTA-------NTLVILAEFRAN-LAVYQPKAFVRVTF 414 (419) Q Consensus 351 ~~~~~~~~~~d~~~~~~~~~~-~~~------~i~~-~~~~~~~~~~-------~~~~~r~~~r~d-~~~~~~~a~~~~~~ 414 (419) .+... -+.-+..+++.+. .+. .+.. .+. .|.. -.+..-...|.. ..+++|.||+++.- T Consensus 305 EL~~a---ggg~~~~~~~~~~~~~~~t~~~~~~~~~~p~---k~~~l~ve~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G 378 (379) T protein:vir:10 305 ELNDA---NGGSSAIYYYADAVENNGTDDGRTWLQVVPT---KMFTLGVEKKIKGYAEGYTNATAGAMLKRPFATYRQTG 378 (379) T ss_pred ccccc---CCCccEEEEEeeccCCCccCCcceEEEecch---hhhhccceecCceeEeccccceeeeeeecchhhheecC Confidence 33110 0110111111111 000 0000 010 0111 111223344444 45677999999998 Q ss_pred c Q lcl|Aclame:pro 415 A 415 (419) Q Consensus 415 ~ 415 (419) + T Consensus 379 ~ 379 (379) T protein:vir:10 379 A 379 (379) T ss_pred C Confidence 8 No 195 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=97.34 E-value=9e-05 Score=42.77 Aligned_cols=273 Identities=11% Similarity=0.024 Sum_probs=111.6 Q ss_pred ccccCCcccccch--hhhHHHHHhhhhhhhHHh---------hcceecccCcceeeeeeccccceeccccccceeecCc- Q lcl|Aclame:pro 126 GTITNPNVPHLPQ--LVPGIVPTTPDLPLLVAD---------LLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEG- 193 (419) Q Consensus 126 ~~~~~~~~~~~p~--~~~~~i~~~~~~~~~l~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg- 193 (419) -+.+.-....+|+ .+...+.+...+.+.|.+ +......++..+++|...... +.....+...+ T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~-----g~~e~nv~~D~~ 75 (349) T protein:vir:78 1 MAITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAID-----TSIEPNYSNDVY 75 (349) T ss_pred CCceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCC-----CCcccccCCCCc Confidence 1122223445665 355555544444333322 111112345556666432211 00011111111 Q ss_pred -cccccccc-ceeeEEeeeEEEEEee--hhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccc Q lcl|Aclame:pro 194 -TAKPQSTL-SFDTITTTLKTVAHWL--PITRQAADDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIG 269 (419) Q Consensus 194 -~~~~~~~~-~~~~v~~~~~k~~~~~--~vs~ell~d~~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~ 269 (419) ...+-.+. +..++-.....--++. .++.++- ..+..+.|.++++.-..+...+.+|. -.+|++...... T Consensus 76 ~~~~t~~kitt~~~~a~~~~r~kaw~~~Dla~~ls--G~dpm~~Ia~~va~yW~r~~q~~Lia-----~L~Gvf~~~~~a 148 (349) T protein:vir:78 76 QDIATPRAIQTGEMMARVAYLNEGFGQADLTVELT--SQNPLQSVASRLDNFWQRQAQRRLIA-----TALGLYNDNVSA 148 (349) T ss_pred ccccccccccccceeeeeeeeccccchhHHHHHhh--CchHHHHHHHHHHHHHhhHHHHHHHH-----HHHHhhcccccc Confidence 11121222 2333333333332322 2333332 23566667777776666665554443 111222110000 Q ss_pred c--c---ccccccccchhhhHHHHHHHHHHhhhhh-----ccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCc Q lcl|Aclame:pro 270 T--Y---QQPKPTAPATDEPPLVDIRRAKTVAEIA-----GFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATP 339 (419) Q Consensus 270 ~--~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~ 339 (419) . . ..-.....+........+.++...+.+. ...-++++||+.++..|++.+--. ++++.-....-+ T Consensus 149 ~~~~~~~~~~t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~----~i~~s~~~~~i~ 224 (349) T protein:vir:78 149 TDAYHEQNDMVVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID----FIRDAENNTMFA 224 (349) T ss_pred cchhhhcccceeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhh----hccCcccCcccc Confidence 0 0 0000000011112333444454444433 234468999999999998764311 122222233456 Q ss_pred ccccceeEecCCCCcC--------cEEEEeccceEEEEEecc--eEEEEeecccchhhcCcEEEEEEEEeccEEecccce Q lcl|Aclame:pro 340 RIWGLNVVSTVAIAQG--------TALVGGFRQGATLWSRQG--ITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAF 409 (419) Q Consensus 340 ~l~G~pv~~~~~~~~~--------~~~~~d~~~~~~~~~~~~--~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~ 409 (419) +++|++|++++.||.. +.|++. .+.+.+.... ..+++.+.....-..++..+....++ ++||..+ T Consensus 225 ty~G~~VivDD~~Pv~~~g~~~~yttylfg--~GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~---~~hp~G~ 299 (349) T protein:vir:78 225 TYQGYRVIVDDSMTVVGQGAQRKFISIIFG--QGAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKTW---LLHPFGY 299 (349) T ss_pred eecCeEEEEeCCCccccCCCCceEEEEEee--cceEEEccCCCccceeeecccccCCcceeEEEEEeeEE---Eeeeeee Confidence 8999999999999842 122333 2333222111 12333332210001234444444443 5667766 Q ss_pred EEEEe----------cCCCC Q lcl|Aclame:pro 410 VRVTF----------AAATT 419 (419) Q Consensus 410 ~~~~~----------~aa~~ 419 (419) ..-+- +.+|| T Consensus 300 s~~~a~v~~~~~~~~~~sPt 319 (349) T protein:vir:78 300 RFTSAVITGNGTETIARSAS 319 (349) T ss_pred eeccccccCCccccccCCCC Confidence 66532 23455 No 196 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=97.30 E-value=0.0001 Score=42.48 Aligned_cols=303 Identities=11% Similarity=-0.005 Sum_probs=140.0 Q ss_pred hhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceec Q lcl|Aclame:pro 102 KRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAG 181 (419) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (419) ++...+..+............. ......+.+...+...+.....+++.+++.+++++|.--.....-.. ...++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv----~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg-~~g~ia 75 (338) T protein:vir:11 1 MRNETRKQFDAYLAQLAKLNGV----NSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIG-VSGTIA 75 (338) T ss_pred CCHHHHHHHHHHHHHHHHHhCC----CcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeec-cCcccc Confidence 1111111111111110000111 11223344555666677778888899999999998865433322211 111111 Q ss_pred cccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCc-- Q lcl|Aclame:pro 182 STWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLNGNGS-- 256 (419) Q Consensus 182 ~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~G~g~-- 256 (419) +...+.. -++-.|..-..++.-.+..++.-.-..|+.+.|+.. ++|...+++.+.++++.-+=.--++|.-. T Consensus 76 grtdT~~---~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~ 152 (338) T protein:vir:11 76 SRTDTTG---DGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAA 152 (338) T ss_pred ccccCCC---CCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeecc Confidence 1000000 011122211245666677777777778888888764 57999999999998877666666677641 Q ss_pred -----cccc------ceecc-----c------cccccccccc-cccchhhhHHHHHHHHHHh-hhhhccCCc--EEEEeh Q lcl|Aclame:pro 257 -----TEMQ------GILTT-----P------GIGTYQQPKP-TAPATDEPPLVDIRRAKTV-AEIAGFPPD--GVVVHP 310 (419) Q Consensus 257 -----~~p~------Gi~~~-----~------~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~--~~~~~~ 310 (419) ..|. |++.. + +..+...... ...+....+...+.+++.. +...+.+.. +.+|.. T Consensus 153 ~Td~~~nPllqDVNkGWlQ~~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~ 232 (338) T protein:vir:11 153 TTNRAANPLLQDVNIGWFQQYRNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGR 232 (338) T ss_pred CCChhhCcCccccchhHHHHHHhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEch Confidence 1343 33210 0 0000000000 1111122222233345554 355555432 567776 Q ss_pred HHHHHH-HHHhccCCceeccCCcc-cc--CCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchh Q lcl|Aclame:pro 311 QDWESI-ELDQAPGSGVFRVIANV-QG--EATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFF 386 (419) Q Consensus 311 ~~~~~l-~~~kd~~g~~~~~~~~~-~~--~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~ 386 (419) ...+.- ..+-.....+ .... .+ ....++.|+|.+.-+++|.+.+++.-++...+++-++..+=.+.+.. T Consensus 233 dLladk~~~l~n~~~~p---tE~~Aa~~~~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p---- 305 (338) T protein:vir:11 233 ELVHDKYFPMVNKDQPA---TEKIATDLILSQKRMGGLPPVEVPYVPEKGLMVTTLKNLSLYWQIGGRRRYLKEVP---- 305 (338) T ss_pred hhhHHHHhHHHhcCCCh---HHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEecc---- Confidence 655421 1122111110 0000 01 11457999999999999999999999988777665554444332221 Q ss_pred hcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 387 TANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 387 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .+|.+.-+-..--++.|-++.+++.+.-..-.- T Consensus 306 ~r~rie~y~s~Ne~YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 306 EKNRIENYESSNDAYVVEDYGLGCLVENIEVAE 338 (338) T ss_pred ccccccchhhhccceeeeccccEEEeecceecC Confidence 122222222222333333444333332100000 No 197 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=97.21 E-value=0.00013 Score=41.88 Aligned_cols=301 Identities=12% Similarity=-0.031 Sum_probs=140.3 Q ss_pred hhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceec Q lcl|Aclame:pro 102 KRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAG 181 (419) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (419) ++...+..+..+.......... ......+.+...+...+.....+++.+++.++++++.--.....-.. ...++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv----~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg-~~g~ia 75 (339) T protein:vir:79 1 MRNDTRRLFAAYKAAIAKLNGV----ERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLG-VSGPVA 75 (339) T ss_pred CChHHHHHHHHHHHHHHHHhCc----ccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeec-cCccee Confidence 1110111111111110000000 01122334445566677777888889999999998865443332221 111111 Q ss_pred cccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCc-- Q lcl|Aclame:pro 182 STWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLNGNGS-- 256 (419) Q Consensus 182 ~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~G~g~-- 256 (419) +...+. -+.-.|..-..++.-.+...+.-.-..|+.+.|+.. ++|...+++.+.++++.-+=.--+||+-. T Consensus 76 grtdt~----~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~ 151 (339) T protein:vir:79 76 STTDTT----QQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAA 151 (339) T ss_pred ecccCC----CCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeec Confidence 111111 111222222356666777777777778888888764 57888899999888876665566677532 Q ss_pred -----cccc------ceecc-----c------c-ccccccccccccchhhhHHHHHHHHHHh-hhhhccCCc--EEEEeh Q lcl|Aclame:pro 257 -----TEMQ------GILTT-----P------G-IGTYQQPKPTAPATDEPPLVDIRRAKTV-AEIAGFPPD--GVVVHP 310 (419) Q Consensus 257 -----~~p~------Gi~~~-----~------~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~--~~~~~~ 310 (419) ..|. |++.. + + ...........++....+...+.+++.. +.+.+.+.. +.+|.. T Consensus 152 ~Td~~~nPllqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~ 231 (339) T protein:vir:79 152 TSDRVANPMLQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGR 231 (339) T ss_pred CCChhhCcCccccchhHHHHHHhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEch Confidence 1333 33210 0 0 0000000001111222223333455543 455555432 556666 Q ss_pred HHHHH-HHHHhccCCceeccCCccc-c--CCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchh Q lcl|Aclame:pro 311 QDWES-IELDQAPGSGVFRVIANVQ-G--EATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFF 386 (419) Q Consensus 311 ~~~~~-l~~~kd~~g~~~~~~~~~~-~--~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~ 386 (419) ...+. ...+-...+.+ ..... + ....++-|+|.+.-+++|.+.+++.-++...+++-++..+=.+.+.. T Consensus 232 dLla~k~~~l~n~~~~p---tE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p---- 304 (339) T protein:vir:79 232 NLLSDKYFPLVNRDRDP---VQQIAADLIISQKRIGNLPAIRVPYFPANGLLVTRLDNLSIYYQEGGRRRTILDNA---- 304 (339) T ss_pred hhhhhHhhhHhhcCCCh---HHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEecc---- Confidence 65542 11222111111 00000 1 12357899999999999999999999988777665554444332221 Q ss_pred hcCcEEEEEEEEeccEEecccceEEE---EecCCC Q lcl|Aclame:pro 387 TANTLVILAEFRANLAVYQPKAFVRV---TFAAAT 418 (419) Q Consensus 387 ~~~~~~~r~~~r~d~~~~~~~a~~~~---~~~aa~ 418 (419) .+|.+.-+-..--++.|-++.+++.+ +++.+. T Consensus 305 ~r~rie~y~s~Ne~YvVEd~~~~a~iEni~~~~aa 339 (339) T protein:vir:79 305 KRDRIENYESSNDAYVIEDLACAAMAENIALAAAA 339 (339) T ss_pred ccccccchhhccceeeeeccccEEEeeeeecccCC Confidence 12222222222223333334333333 222222 No 198 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=97.18 E-value=0.00014 Score=41.71 Aligned_cols=308 Identities=10% Similarity=-0.018 Sum_probs=142.4 Q ss_pred hhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceec Q lcl|Aclame:pro 102 KRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAG 181 (419) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (419) ++...+..+..+............. .. .....+...+...+....++++-+++.+++++|.--.....-.. ...++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~-~~-~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg-v~g~ia 77 (355) T protein:vir:98 1 MRPETRFKFNAYLTRVAELNNISTD-DV-SKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVG-VTGTIA 77 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChh-Hc-cceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeec-cCcccc Confidence 1111111111111111000000000 01 12233444555667777888889999999998865433332211 011111 Q ss_pred cccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCc-- Q lcl|Aclame:pro 182 STWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLNGNGS-- 256 (419) Q Consensus 182 ~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~G~g~-- 256 (419) +...+. +-..-.|..-..++.-.+..++.-.-..|+.+.|+.. ++|...+++.+.++++.-+=.--+||+-. T Consensus 78 grtdT~---~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~ 154 (355) T protein:vir:98 78 STTDTS---GDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRAD 154 (355) T ss_pred ccccCC---CCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeec Confidence 100000 0011123333446667777777777788888888764 57999999999988876666666677541 Q ss_pred -----cccc------ceecc-----cc-c-ccc---------ccccccccchhhhHHHHHHHHHHhh-hhhccCC--cEE Q lcl|Aclame:pro 257 -----TEMQ------GILTT-----PG-I-GTY---------QQPKPTAPATDEPPLVDIRRAKTVA-EIAGFPP--DGV 306 (419) Q Consensus 257 -----~~p~------Gi~~~-----~~-~-~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~ 306 (419) ..|. |++.. +. + ... ........+....+...+.+++..+ ...+.+. -+. T Consensus 155 ~Td~~~nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~dLVv 234 (355) T protein:vir:98 155 TSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVA 234 (355) T ss_pred cCChhhCcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCCEEE Confidence 1343 33310 00 0 000 0000011122222222334456543 4444443 256 Q ss_pred EEehHHHHH-HHHHhccCCce-eccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeeccc- Q lcl|Aclame:pro 307 VVHPQDWES-IELDQAPGSGV-FRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHA- 383 (419) Q Consensus 307 ~~~~~~~~~-l~~~kd~~g~~-~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~- 383 (419) +|.....+. .-.+-.....+ --..... -....++.|+|.+.-+++|.+.+++.-++...+++-++..+=.+.+... T Consensus 235 ivG~dLla~k~~~l~n~~~~ptE~~Aa~~-i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r 313 (355) T protein:vir:98 235 IVGRKLLADKYFPLVNKQQENSESLAADI-IISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKK 313 (355) T ss_pred EEchhhhHHHhhhHhhccCCcHHHHHHHH-HHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEecccc Confidence 777665442 22222221111 0000000 0123589999999999999999999999887776655444433322211 Q ss_pred ----chhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 384 ----DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 384 ----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ++...| .+|.++.+.-+.... .+...+.+++.. T Consensus 314 ~rie~y~s~N-e~YvVEd~~~~a~ie--nI~~~~~~~~~~ 350 (355) T protein:vir:98 314 DRVENYESMN-IDYVVEVYAAGCLLE--NITLGDFTAPAA 350 (355) T ss_pred ccccchhhhc-ceeeeeccccEEEee--ceeeeCCCCCcc Confidence 222222 245555544444443 333322222211 No 199 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=97.08 E-value=4.4e-05 Score=44.45 Aligned_cols=314 Identities=8% Similarity=-0.006 Sum_probs=138.5 Q ss_pred ccccccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhh Q lcl|Aclame:pro 71 TPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDL 150 (419) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~ 150 (419) .....+..+......+.+.+ ...+.+. +.. ........+++..=-+.+.+.|..+.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~e-----------------~~~KS~~----tg~-g~~p~~q~~~~AlR~EsL~~~i~~Lt~~ 58 (463) T protein:vir:95 1 MTIEKNLSDVQQKYADQFQE-----------------DVVKSFQ----TGY-GITPDTQIDAGALRREILDDQITMLTWT 58 (463) T ss_pred CCcccccchHHHHHHhhhhH-----------------HHHHHhh----cCC-ccCCccccCcchhhhhhhhhhhheeeec Confidence 11111111111111111110 0011110 000 0111112222223233444444333332 Q ss_pred hhhH--HhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHH-HhhH Q lcl|Aclame:pro 151 PLLV--ADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQA-ADDN 227 (419) Q Consensus 151 ~~~l--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~el-l~d~ 227 (419) ...+ ..-+...++.+-.-+|..... ....+.+.+++|++..+.+++++.+.....|-++....+|.-+ +.++ T Consensus 59 ~~~f~~~~~i~k~~a~STV~~y~~~~~-----~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~ 133 (463) T protein:vir:95 59 NEDLIFYRDISRRPAQSTVVKYDQYLR-----HGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNN 133 (463) T ss_pred ccchhhhhhcCCchhhhhhhhheeeec-----cCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcc Confidence 2222 222233333332223322221 1223567899999999999999999999999999998888775 3344 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHhccCc----ccccceeccccccccccccc-cccchhhhHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 228 -SQLMGYIQGRLTYGLRFLRDRQLLNGNGS----TEMQGILTTPGIGTYQQPKP-TAPATDEPPLVDIRRAKTVAEIAGF 301 (419) Q Consensus 228 -~~~~~~i~~~l~~a~~~~~d~~il~G~g~----~~p~Gi~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (419) .+.+..+.++-...++..++.+.+.|+-. ++|.|+ ...|+...-.... ...-+.....+.+..+-..+..+|. T Consensus 134 ~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gl-eFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fG 212 (463) T protein:vir:95 134 IADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGL-EFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFG 212 (463) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCcccc-chhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccC Confidence 48899999999999999999999999853 233443 2333333222222 2222334445556666666778888 Q ss_pred CCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEe--cCC----CCc----CcEEEEeccceEEEE-- Q lcl|Aclame:pro 302 PPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVS--TVA----IAQ----GTALVGGFRQGATLW-- 369 (419) Q Consensus 302 ~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~--~~~----~~~----~~~~~~d~~~~~~~~-- 369 (419) +++-.+|+..+.+.|..-.-...+ .+.+++.... ..|+||-- +.. ++. +.....|...-..-. T Consensus 213 t~TD~~lp~~vka~f~~~~l~~qr-v~~~~N~~~~----~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap 287 (463) T protein:vir:95 213 TATDAYMPIGVHADFVNSILGRQM-QLMQDNSGNV----NTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAP 287 (463) T ss_pred ChhheecchHHHHHHHHHhcCceE-EEEcCCCCce----eeeeeccceeeeeeeeeeCCceecCCcccccchhhcCCCCc Confidence 888899999999998855433333 3333433221 33544421 110 000 000011111000000 Q ss_pred EecceEEEEeec-ccchh-hcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 370 SRQGITVLMTDS-HADFF-TANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 370 ~~~~~~i~~~~~-~~~~~-~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ...-.+..+... .+..+ ..|...+. .++=..=.+-++....-++++++ T Consensus 288 ~~~~~tatv~~~~~~~~~~~~~~a~~~--Y~vv~~s~~geS~pS~ivtaT~a 337 (463) T protein:vir:95 288 QPAKVTATVETKQKGAFENEEDRAGLS--YKVVVNSDDAQSAPSEEVTATVS 337 (463) T ss_pred cCceeEEEEeeccCCCCCCcccccceE--EEEEEECCCCCcccchheeeeee Confidence 000011111110 11111 11111110 00001112233433333333333 No 200 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=97.08 E-value=4.4e-05 Score=44.45 Aligned_cols=314 Identities=8% Similarity=-0.006 Sum_probs=138.5 Q ss_pred ccccccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhh Q lcl|Aclame:pro 71 TPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDL 150 (419) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~ 150 (419) .....+..+......+.+.+ ...+.+. +.. ........+++..=-+.+.+.|..+.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~e-----------------~~~KS~~----tg~-g~~p~~q~~~~AlR~EsL~~~i~~Lt~~ 58 (463) T protein:vir:99 1 MTIEKNLSDVQQKYADQFQE-----------------DVVKSFQ----TGY-GITPDTQIDAGALRREILDDQITMLTWT 58 (463) T ss_pred CCcccccchHHHHHHhhhhH-----------------HHHHHhh----cCC-ccCCccccCcchhhhhhhhhhhheeeec Confidence 11111111111111111110 0011110 000 0111112222223233444444333332 Q ss_pred hhhH--HhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHH-HhhH Q lcl|Aclame:pro 151 PLLV--ADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQA-ADDN 227 (419) Q Consensus 151 ~~~l--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~el-l~d~ 227 (419) ...+ ..-+...++.+-.-+|..... ....+.+.+++|++..+.+++++.+.....|-++....+|.-+ +.++ T Consensus 59 ~~~f~~~~~i~k~~a~STV~~y~~~~~-----~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~ 133 (463) T protein:vir:99 59 NEDLIFYRDISRRPAQSTVVKYDQYLR-----HGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNN 133 (463) T ss_pred ccchhhhhhcCCchhhhhhhhheeeec-----cCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcc Confidence 2222 222233333332223322221 1223567899999999999999999999999999998888775 3344 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHhccCc----ccccceeccccccccccccc-cccchhhhHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 228 -SQLMGYIQGRLTYGLRFLRDRQLLNGNGS----TEMQGILTTPGIGTYQQPKP-TAPATDEPPLVDIRRAKTVAEIAGF 301 (419) Q Consensus 228 -~~~~~~i~~~l~~a~~~~~d~~il~G~g~----~~p~Gi~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (419) .+.+..+.++-...++..++.+.+.|+-. ++|.|+ ...|+...-.... ...-+.....+.+..+-..+..+|. T Consensus 134 ~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gl-eFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fG 212 (463) T protein:vir:99 134 IADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGL-EFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFG 212 (463) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCcccc-chhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccC Confidence 48899999999999999999999999853 233443 2333333222222 2222334445556666666778888 Q ss_pred CCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEe--cCC----CCc----CcEEEEeccceEEEE-- Q lcl|Aclame:pro 302 PPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVS--TVA----IAQ----GTALVGGFRQGATLW-- 369 (419) Q Consensus 302 ~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~--~~~----~~~----~~~~~~d~~~~~~~~-- 369 (419) +++-.+|+..+.+.|..-.-...+ .+.+++.... ..|+||-- +.. ++. +.....|...-..-. T Consensus 213 t~TD~~lp~~vka~f~~~~l~~qr-v~~~~N~~~~----~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap 287 (463) T protein:vir:99 213 TATDAYMPIGVHADFVNSILGRQM-QLMQDNSGNV----NTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAP 287 (463) T ss_pred ChhheecchHHHHHHHHHhcCceE-EEEcCCCCce----eeeeeccceeeeeeeeeeCCceecCCcccccchhhcCCCCc Confidence 888899999999998855433333 3333433221 33544421 110 000 000011111000000 Q ss_pred EecceEEEEeec-ccchh-hcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 370 SRQGITVLMTDS-HADFF-TANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 370 ~~~~~~i~~~~~-~~~~~-~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ...-.+..+... .+..+ ..|...+. .++=..=.+-++....-++++++ T Consensus 288 ~~~~~tatv~~~~~~~~~~~~~~a~~~--Y~vv~~s~~geS~pS~ivtaT~a 337 (463) T protein:vir:99 288 QPAKVTATVETKQKGAFENEEDRAGLS--YKVVVNSDDAQSAPSEEVTATVS 337 (463) T ss_pred cCceeEEEEeeccCCCCCCcccccceE--EEEEEECCCCCcccchheeeeee Confidence 000011111110 11111 11111110 00001112233433333333333 No 201 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=97.05 E-value=0.00019 Score=40.96 Aligned_cols=300 Identities=9% Similarity=-0.076 Sum_probs=142.7 Q ss_pred hhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceec Q lcl|Aclame:pro 102 KRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAG 181 (419) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (419) ++...+..+............ .......+.+...+...+.....+++.+++.++++++.--.....-.. ...++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ng----v~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg-~~g~ia 75 (337) T protein:vir:78 1 MRKETRQAYEKYAAQIAKLND----TGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLS-VSGPIA 75 (337) T ss_pred CChHHHHHHHHHHHHHHHhcC----hhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecc-cCccee Confidence 000000001000000000000 001122333444566677777888889999999998865443332211 111111 Q ss_pred cccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCc-- Q lcl|Aclame:pro 182 STWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLNGNGS-- 256 (419) Q Consensus 182 ~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~G~g~-- 256 (419) +...+ +-+...|..-..++.-.+...+.-.-..|+.+.|+.. ++|...+++.+.++++.-+=.--+||+-. T Consensus 76 grtdt----~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~ 151 (337) T protein:vir:78 76 SRTDT----TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAA 151 (337) T ss_pred eeecC----CCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeecc Confidence 11111 1112223223446666677777766778888888764 57888899999888876665566677532 Q ss_pred -----cccc------ceecc-----c-cccccc-----cccccccchhhhHHHHHHHHHHh-hhhhccCC--cEEEEehH Q lcl|Aclame:pro 257 -----TEMQ------GILTT-----P-GIGTYQ-----QPKPTAPATDEPPLVDIRRAKTV-AEIAGFPP--DGVVVHPQ 311 (419) Q Consensus 257 -----~~p~------Gi~~~-----~-~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~~~~~ 311 (419) ..|. |++.. + -+.+.. .......+....+...+.+++.. +...+.+. -+.+|... T Consensus 152 ~Td~~~nPllqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~d 231 (337) T protein:vir:78 152 TTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRE 231 (337) T ss_pred CCChhhCcCccccchHHHHHHHhcchhhhhccccccCCceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchh Confidence 1332 33310 0 000000 00011122222233333455654 45555543 25567766 Q ss_pred HHHHHH-HHhccCCceeccCCccc-c--CCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 312 DWESIE-LDQAPGSGVFRVIANVQ-G--EATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 312 ~~~~l~-~~kd~~g~~~~~~~~~~-~--~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) ..+.-. .+-...+.+ ..... + ....++-|+|.+.-+++|.+.+++.-++...+++-++..+=.+.+.. . T Consensus 232 Lladk~~~l~n~~~~p---tE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p----~ 304 (337) T protein:vir:78 232 LLHDKYFPIVNATQAP---TERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP----E 304 (337) T ss_pred hhHHHHHHHHhcCCCc---HHHHHHHHHHHhhhhcCcceEEccccCCCceEEeechhcEEEEecCcEEEEEEecc----c Confidence 654311 111111111 00000 0 12357899999999999999999999988777665554444433222 2 Q ss_pred cCcEEEEEEEEeccEEecccceEEE---EecCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRV---TFAAA 417 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~---~~~aa 417 (419) +|.+.-.-..--++.|-+..+++.+ +++.+ T Consensus 305 r~rie~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 305 RDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred cccccchhhccceeeeeccccEEEEeceeecCC Confidence 2222222222233444444444443 44444 No 202 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=96.99 E-value=0.00022 Score=40.61 Aligned_cols=267 Identities=12% Similarity=0.020 Sum_probs=114.1 Q ss_pred ccCCcccccchhhhHHHHHhhhhhhhHHhhcceec-----c--cCcceeeeeeccccceeccccccceeecCcccccccc Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQN-----A--DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST 200 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~-----~--~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 200 (419) +...-...+|+.+...+...++...++..++..-. . .+..+++++-...... ..+.+-..+... .+ T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~-----d~~~~~~t~~~~--~~ 73 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSE-----RTMDGDITGKSK--NS 73 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeee-----cccCcccCcccc--cc Confidence 22222237899999999999999888887766532 1 2445666543211110 001110111111 11 Q ss_pred cce--eeEEeeeEEEEEeehhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccc Q lcl|Aclame:pro 201 LSF--DTITTTLKTVAHWLPITRQAADDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA 278 (419) Q Consensus 201 ~~~--~~v~~~~~k~~~~~~vs~ell~d~~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 278 (419) ..- ..+.++-+|...+--=+.|+..+..+++++++. -.++++..+|..+...-....+..+ | . T Consensus 74 l~e~~v~l~id~~k~~a~~v~d~E~~l~i~~~~~~l~~-A~~aLA~~vd~~ia~~~~~~~~~~v----g----------t 138 (423) T protein:vir:10 74 LISAKATGEVGNYITVAVEYRQIEEALKLNQLDQILVP-INERMVTDLETELALFMMKHGALSL----G----------S 138 (423) T ss_pred cccceEEEEecceeeeeeeeChHHHhcChhHHHHHHHH-HHHHHHHHHHHHHHHHhhhcccccc----c----------c Confidence 111 234555555544433344455455678775544 4789999999988642211111000 0 0 Q ss_pred cchhhhHHHHHHHHHHhhhhhccCC--cEEEEehHHHHHHHHH----hccCCceeccCCccccCCCcccccceeEecCCC Q lcl|Aclame:pro 279 PATDEPPLVDIRRAKTVAEIAGFPP--DGVVVHPQDWESIELD----QAPGSGVFRVIANVQGEATPRIWGLNVVSTVAI 352 (419) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~----kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~ 352 (419) ..+....|+++..+-..+...+.+. -..+++|..+..|.+- ...++.. -..-..++..+++.|+.++.|+.+ T Consensus 139 ~~t~~~a~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~--~~alr~~~i~G~~~GFdi~~Sn~v 216 (423) T protein:vir:10 139 PNTPIKKWSDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLV--RTAWENAQISGNFGGIRALMSNGL 216 (423) T ss_pred cccccccHHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccc--hHHHHhcccceeecceEEEEecCC Confidence 1111223677777766666555543 3468999999888642 1111110 011122234578999999999999 Q ss_pred Cc---CcEE-EEeccceEEEEEecc--------eEEEEeecc-cchhh-cCcEEEEE---EEEeccEEe------cccce Q lcl|Aclame:pro 353 AQ---GTAL-VGGFRQGATLWSRQG--------ITVLMTDSH-ADFFT-ANTLVILA---EFRANLAVY------QPKAF 409 (419) Q Consensus 353 ~~---~~~~-~~d~~~~~~~~~~~~--------~~i~~~~~~-~~~~~-~~~~~~r~---~~r~d~~~~------~~~a~ 409 (419) |. ++.- .+-.+ +...+.+.. ......... ..... -|.+.|-+ ..++...++ +++-| T Consensus 217 p~~T~g~~~ga~~~~-~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~ 295 (423) T protein:vir:10 217 ASRTQGAFGGKLTVK-GTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTA 295 (423) T ss_pred cccccccccceeeee-eeeEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEE Confidence 83 3211 00000 011000000 000000000 00000 01111110 111111110 11111 Q ss_pred EE-------------EEecCCCC Q lcl|Aclame:pro 410 VR-------------VTFAAATT 419 (419) Q Consensus 410 ~~-------------~~~~aa~~ 419 (419) ++ +++.+++- T Consensus 296 ~V~~~~~~~a~~~~tv~i~p~~~ 318 (423) T protein:vir:10 296 TVMEDANAHSSGDVTVKISGVPI 318 (423) T ss_pred EEEecccccccCceEEEeccccc Confidence 11 12211110 No 203 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=96.97 E-value=8.8e-05 Score=42.82 Aligned_cols=333 Identities=11% Similarity=-0.001 Sum_probs=143.0 Q ss_pred HHHHHhhcccccccccchhhhhhHHHHhHHHHHHHHHhhh---hhhhhHHHHHHHHH------HhhhcccccccccCCcc Q lcl|Aclame:pro 63 PKGPADGGTPLTPAEAGTFRSLAQRFADSDGLREYRARDK---RGQFQVEMRDIDPN------RLLSRDAPAGTITNPNV 133 (419) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~ 133 (419) +.+.. ......... ..+............+.+...+. ........+.+... ...........++. + T Consensus 1 ~~~~~--~~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~--~ 75 (382) T protein:vir:96 1 MSHIS--KTHSRLAGR-HAKPFDLKNVTHEAVAALGRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTP--S 75 (382) T ss_pred CCCcc--eeeeecCCc-cccchhhhcccHHHHHHHhccccccCcccchhHhhhhhhhhhhhhhcccccccCCccccC--C Confidence 00000 000000000 00000000000111111111000 00000011111110 00111112222222 2 Q ss_pred cccchhh----hHHHHHhhhhhhhHHhhcceecccCc---ceeeeeeccccceeccccccceeecCcccccccccceeeE Q lcl|Aclame:pro 134 PHLPQLV----PGIVPTTPDLPLLVADLLDQQNADYN---VLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTI 206 (419) Q Consensus 134 ~~~p~~~----~~~i~~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v 206 (419) .-+|..+ ...+++.+........++++.+++.. .+.|+..+ ..+.+.+.+-+++.|..+...... T Consensus 76 ~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e--------~~G~A~~ygd~~D~Pl~d~~~~~~ 147 (382) T protein:vir:96 76 IPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVE--------PAGTAVEYGDHTNIPLTSWNANFE 147 (382) T ss_pred ccHHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeee--------cccceEEeecccCCCcccccccee Confidence 2234443 34555566666666777777664332 22333222 235677888888888888766666 Q ss_pred EeeeEEEEEeehhh-HHHHhhHH---HHHHHHHHHHHHHHHHHHHHHHHhccC--c-ccccceeccccccccc--ccccc Q lcl|Aclame:pro 207 TTTLKTVAHWLPIT-RQAADDNS---QLMGYIQGRLTYGLRFLRDRQLLNGNG--S-TEMQGILTTPGIGTYQ--QPKPT 277 (419) Q Consensus 207 ~~~~~k~~~~~~vs-~ell~d~~---~~~~~i~~~l~~a~~~~~d~~il~G~g--~-~~p~Gi~~~~~~~~~~--~~~~~ 277 (419) +-..+.+...+.++ .|+.+-+. ++.+--....++++...+|+-++.|+- . ++..|++|.+.+.... ....+ T Consensus 148 ~r~v~~~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~~~W 227 (382) T protein:vir:96 148 RRTIVRGELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPSQGW 227 (382) T ss_pred EEEEEEEEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCCCCc Confidence 66667777667775 44444332 566667888888999999999999962 2 3467999999875432 23345 Q ss_pred ccchhhhHHHHHHHHHHhhhhhcc---C----CcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecC Q lcl|Aclame:pro 278 APATDEPPLVDIRRAKTVAEIAGF---P----PDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTV 350 (419) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~---~----~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~ 350 (419) ...+....++|+..++..+...-. . +..++|.+..+..|... ...|-.++ ..... .+-++.++... T Consensus 228 a~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~-n~~g~Tvl--~~lk~----n~Pnl~i~t~p 300 (382) T protein:vir:96 228 ATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVT-TPYGISVS--DWIEQ----TYPKMRIVSAP 300 (382) T ss_pred ccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhcccc-CccCccHH--HHHHH----hcCCcEEEEcc Confidence 667788889999988888754331 1 22577898877666432 11111100 00000 01122233222 Q ss_pred CCC-cCcEEEEeccceEEEEEecceEEEEeecccchhhc--------Cc-------EEEEEE-EEeccEEecccceEEEE Q lcl|Aclame:pro 351 AIA-QGTALVGGFRQGATLWSRQGITVLMTDSHADFFTA--------NT-------LVILAE-FRANLAVYQPKAFVRVT 413 (419) Q Consensus 351 ~~~-~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~--------~~-------~~~r~~-~r~d~~~~~~~a~~~~~ 413 (419) .+. ++..-.+...-.++..+.-...+..+......|.+ -. +..-.. ...+..+++|.||++++ T Consensus 301 eL~~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~~ 380 (382) T protein:vir:96 301 ELSGVQMQGKTPEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYL 380 (382) T ss_pred ccccccCCCccceeEEEEecchhhhhcccccccCcceeccccceeeeccceeecceeEeccccceeeeEEEcchhhhhcc Confidence 221 00000000000011100000000000000001110 00 001111 22556678899998886 Q ss_pred ec Q lcl|Aclame:pro 414 FA 415 (419) Q Consensus 414 ~~ 415 (419) -= T Consensus 381 GI 382 (382) T protein:vir:96 381 GI 382 (382) T ss_pred CC Confidence 55 No 204 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=96.90 E-value=0.00027 Score=40.17 Aligned_cols=306 Identities=9% Similarity=-0.064 Sum_probs=141.8 Q ss_pred hhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceec Q lcl|Aclame:pro 102 KRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAG 181 (419) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (419) ++...+..+..+..........+......+-.+.+...+...+.....+++.+++.+++++|.--.....-.. ...++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg-~~g~ia 79 (342) T protein:vir:10 1 MKDLTLEKYNAYLARQAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLD-SAHTVA 79 (342) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecc-cCcccc Confidence 1111111111111111111111111111122334445566677777888889999999998865443332211 111111 Q ss_pred cccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCc-- Q lcl|Aclame:pro 182 STWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLNGNGS-- 256 (419) Q Consensus 182 ~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~G~g~-- 256 (419) +...+. +-+.-.|..-..++.-.+...+.-.-..|+.+.|+.. ++|...+++.+.++++.-+=.--+||+-. T Consensus 80 grtdT~---~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~ 156 (342) T protein:vir:10 80 STTDTS---GDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAA 156 (342) T ss_pred cccccC---CCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeecc Confidence 110000 0111222222456666777777777778888888764 57888899999888876665566677532 Q ss_pred -----cccc------ceecc-----c-----cccccccccccccchhhhHHHHHHHHHHh-hhhhccCC--cEEEEehHH Q lcl|Aclame:pro 257 -----TEMQ------GILTT-----P-----GIGTYQQPKPTAPATDEPPLVDIRRAKTV-AEIAGFPP--DGVVVHPQD 312 (419) Q Consensus 257 -----~~p~------Gi~~~-----~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~~~~~~ 312 (419) ..|. |++.. + +...........++....+...+.+++.. +...+.+. -+.+|.... T Consensus 157 ~Td~~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~~~~~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dL 236 (342) T protein:vir:10 157 TSDRNSNPLLQDVAKGWLQKMREDAKERVMNGESTDNQVLVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITGRKL 236 (342) T ss_pred CCChhhCcCccccchHHHHHHHhhhhhhhcccceeccceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 1332 33311 0 00000000011112222222333455654 45555543 256677666 Q ss_pred HHHH-HHHhccCCceeccCCcc-cc--CCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhc Q lcl|Aclame:pro 313 WESI-ELDQAPGSGVFRVIANV-QG--EATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTA 388 (419) Q Consensus 313 ~~~l-~~~kd~~g~~~~~~~~~-~~--~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~ 388 (419) .+.- ..+-...+.+ .... .+ ....++-|+|.+.-+++|.+.+++.-++...+++-++..+=.+.+.. .+ T Consensus 237 ladk~~~l~n~~~~p---tE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p----~r 309 (342) T protein:vir:10 237 LADKYFPIVNQQNAP---TEELAADIVISQKRIGGLKAVRVPFFPANAILITKLENLAIYVQEGTTRKHIENVP----KK 309 (342) T ss_pred hHHHHHHHHhcCCCh---HHHHHHHHHHhhhhhcCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEecc----cc Confidence 5521 1121111110 0000 01 12357899999999999999999999988777665554444332221 12 Q ss_pred CcEEEEEEEEeccEEecccceEEEE---ecCCC Q lcl|Aclame:pro 389 NTLVILAEFRANLAVYQPKAFVRVT---FAAAT 418 (419) Q Consensus 389 ~~~~~r~~~r~d~~~~~~~a~~~~~---~~aa~ 418 (419) |.+.-+-..--++.|-++.+++.+. ++-+- T Consensus 310 ~rie~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~ 342 (342) T protein:vir:10 310 DRIETYESENIDYVVEDYGCAALIENITLKDKE 342 (342) T ss_pred ccccchhhhccceeeeccccEEEeecceecCCC Confidence 2222222222333334444444332 11111 No 205 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=96.84 E-value=0.0003 Score=39.86 Aligned_cols=299 Identities=8% Similarity=-0.047 Sum_probs=131.9 Q ss_pred hhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeecccccee Q lcl|Aclame:pro 101 DKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGA 180 (419) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (419) ..+..+....... .............+..+.+...+...+.....+++.+++.+++++|.--.....-.. ...++ T Consensus 1 mtr~~~~~y~~~~----A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg-~~g~i 75 (336) T protein:vir:37 1 MNKQAYYALAAAL----AKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGA-TEKGV 75 (336) T ss_pred CcHHHHHHHHHHH----HHHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeec-cCccc Confidence 1111111111110 000111100011112344555666777778888899999999998864433322211 11111 Q ss_pred ccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHHHHHHHHHHHHHHHHHH--HHHHHH--HhccC- Q lcl|Aclame:pro 181 GSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNSQLMGYIQGRLTYGLRF--LRDRQL--LNGNG- 255 (419) Q Consensus 181 ~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~~~~~~i~~~l~~a~~~--~~d~~i--l~G~g- 255 (419) .+... .+..| .++.++.-.+..++.-.-..|+.+.|+..+.+..+..+.+...+.+ ++|+-. +||+- T Consensus 76 agrtd-------t~R~~-~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~df~~~~~~~~~~r~iALD~i~IGfnG~s~ 147 (336) T protein:vir:37 76 TGRKQ-------TGRNL-ANLDHTQNGFELAETDSGIIVPWALFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSV 147 (336) T ss_pred ccccC-------CCccc-cccCcCCcccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHHhhchhhhcccceee Confidence 11101 11222 2245666667777777777889998887654444333333333333 344444 45542 Q ss_pred ---cccccc------eecc-----cc-ccccc------cccccccchhhhHHHHHHHHHHhhhhhccCCc--EEEEehHH Q lcl|Aclame:pro 256 ---STEMQG------ILTT-----PG-IGTYQ------QPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD--GVVVHPQD 312 (419) Q Consensus 256 ---~~~p~G------i~~~-----~~-~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 312 (419) +++|.| ++.. +. +.+.. ......++....+...+.+++..+...+.+.. +.+|.... T Consensus 148 A~~TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dL 227 (336) T protein:vir:37 148 ADNTTKADLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADL 227 (336) T ss_pred ccCCCCCcccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhcCchHHhcCCCeEEEEchhh Confidence 234433 3210 00 00000 00001111122222234455666666555432 45666554 Q ss_pred HHHH-HHHhccCCceeccCCccc---cCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhc Q lcl|Aclame:pro 313 WESI-ELDQAPGSGVFRVIANVQ---GEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTA 388 (419) Q Consensus 313 ~~~l-~~~kd~~g~~~~~~~~~~---~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~ 388 (419) .+.= ..+-...+.. +..... -....++.|+|.+.-+++|.+.+++.-+++..+++-++..+=.+.+.. .+ T Consensus 228 la~~~~~l~~~~~~~--PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p----~r 301 (336) T protein:vir:37 228 VSKETKLIQQKHGLT--PTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRNDE----DK 301 (336) T ss_pred hhhhhhhhhhhcCCC--HHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEcc----cc Confidence 4321 1111111110 000000 112457899999999999999999999988777665544443332221 12 Q ss_pred CcEEEEEEEEeccEEecccceEEE-----EecCCC Q lcl|Aclame:pro 389 NTLVILAEFRANLAVYQPKAFVRV-----TFAAAT 418 (419) Q Consensus 389 ~~~~~r~~~r~d~~~~~~~a~~~~-----~~~aa~ 418 (419) |.+.-+-..--++.|-++.+++.+ ++.+-+ T Consensus 302 ~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~e~ 336 (336) T protein:vir:37 302 KGLVTSYYRQEGYVVEDLGLMTAIDHTKVKLNGEV 336 (336) T ss_pred ccccchhhhcceeeeeccccEEEeeeeeeeecCcC Confidence 222222222223333344443333 333333 No 206 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=96.78 E-value=0.00034 Score=39.57 Aligned_cols=308 Identities=6% Similarity=-0.130 Sum_probs=143.1 Q ss_pred HHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeecccc Q lcl|Aclame:pro 98 RARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGT 177 (419) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 177 (419) +.+.++...+..+............... . .....+.+...+...+.....+++-+++.+++++|.--........ .. T Consensus 1 m~~~M~~~tr~~~~~y~~~~A~~ngv~~-~-~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg-~~ 77 (358) T protein:vir:78 1 MSQTLTVQAEQRLNKYCDALAKAYGIDI-S-KLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVG-VG 77 (358) T ss_pred CcccccHHHHHHHHHHHHHHHHHhCCCh-h-HccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeec-CC Confidence 1111111111111111111111010000 0 0122344445566667788888899999999998865443332211 11 Q ss_pred ceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHH------HHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 178 AGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS------QLMGYIQGRLTYGLRFLRDRQLL 251 (419) Q Consensus 178 ~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~------~~~~~i~~~l~~a~~~~~d~~il 251 (419) .++... ..- ..|.....++.-.+...+.-.-..|+.+.|+..+ +|...+++.+.++++.-+=.--+ T Consensus 78 g~iagr------t~t--r~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGf 149 (358) T protein:vir:78 78 QLYTGR------KKG--GRFKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGW 149 (358) T ss_pred ccccee------cCC--CccccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecc Confidence 111111 111 2233334566667777777777788888887643 68888888888888766555666 Q ss_pred hccCc-------cccc------ceecc-----cc-cc-----ccccccc-cccchhhhHHHHHHHHHHh-hhhhccCC-- Q lcl|Aclame:pro 252 NGNGS-------TEMQ------GILTT-----PG-IG-----TYQQPKP-TAPATDEPPLVDIRRAKTV-AEIAGFPP-- 303 (419) Q Consensus 252 ~G~g~-------~~p~------Gi~~~-----~~-~~-----~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~-- 303 (419) ||+-. ..|. |++.. ++ +. +...... ...+....+...+.+++.. +...+.+. T Consensus 150 NGts~A~~Td~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~d 229 (358) T protein:vir:78 150 NGVSAADDTDPTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPR 229 (358) T ss_pred cceeeccCCChhhCcCccccchHHHHHHHhhchhhhhccccccCceeecCCCCCccccHHHHHHHHHhccCChHHhcCCC Confidence 77532 1332 33310 00 00 0000001 1111222222233345543 44544443 Q ss_pred cEEEEehHHHHH-HHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecc Q lcl|Aclame:pro 304 DGVVVHPQDWES-IELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSH 382 (419) Q Consensus 304 ~~~~~~~~~~~~-l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~ 382 (419) -+.+|.....+. .-.+-...+.+ -.-........++-|+|.+.-+++|.+.+++.-++...+++-++..+=.+.+.. T Consensus 230 LVvivG~dLla~k~~~l~n~~~~p--TE~~Aa~~i~k~iGGlpa~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p 307 (358) T protein:vir:78 230 LVVLVGTDLVAAAQAKLYSEATKP--SEQIAAQQLAKSIAGRKAYIPPFFPGKRMVVTTLDNLHCYTQRGTRKRKADDNQ 307 (358) T ss_pred EEEEEchhhhhHHhhhHhhcCCCc--HHHHHHHHHHHHhCCCeEEEccccCCCceEEeeccccEEEEecCcEEEEEEecc Confidence 256677666552 22222211111 000011122357899999999999999999999988777665554444332221 Q ss_pred c-----chhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 383 A-----DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 383 ~-----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) . ++...| .+|.++.+.-+.....-.|.....++.+. T Consensus 308 ~r~riE~y~s~N-e~YvVEd~~~~a~iE~i~v~~~~~pa~~~ 348 (358) T protein:vir:78 308 DSKSFDNQYWRM-EGYALGEHKAYGGFEEADIEIGADPAVLA 348 (358) T ss_pred ccccccchhhhc-ceeeeeccccEEEEeeeeeeeCCCCCccc Confidence 1 112222 23444444444443333333222222222 No 207 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=96.69 E-value=0.00041 Score=39.16 Aligned_cols=299 Identities=9% Similarity=-0.028 Sum_probs=131.1 Q ss_pred hhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeecccccee Q lcl|Aclame:pro 101 DKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGA 180 (419) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (419) ..+..+....... .............+..+.+...+...+.....+++.+++.+++++|.--.....-... ..++ T Consensus 1 mtr~~~~~y~~~~----A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~-~g~i 75 (336) T protein:vir:37 1 MNKQAYYALAAAL----AKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGAT-EKGV 75 (336) T ss_pred CcHHHHHHHHHHH----HHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeecc-Cccc Confidence 1111111111110 0000100000011123445556666777888889999999999988644333222111 1111 Q ss_pred ccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHHHHHHHHHHHHHHHHHH--HHHHHHH--hccC- Q lcl|Aclame:pro 181 GSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNSQLMGYIQGRLTYGLRF--LRDRQLL--NGNG- 255 (419) Q Consensus 181 ~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~~~~~~i~~~l~~a~~~--~~d~~il--~G~g- 255 (419) .....+ + .......++.-.+..++.-.-..|+.+.|+..+.+..+..+.+...+.+ ++|+-.| +|+- T Consensus 76 agrtdt------~--r~r~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~ 147 (336) T protein:vir:37 76 TGRKQT------G--RNLATLDHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSV 147 (336) T ss_pred ccccCC------C--CCccccCCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHHhcchhhhcccceee Confidence 111111 1 1111223555666666666677888898887654444333333333333 3455444 5542 Q ss_pred ---cccccc------eecc-----cc-ccccc------cccccccchhhhHHHHHHHHHHhhhhhccCCc--EEEEehHH Q lcl|Aclame:pro 256 ---STEMQG------ILTT-----PG-IGTYQ------QPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD--GVVVHPQD 312 (419) Q Consensus 256 ---~~~p~G------i~~~-----~~-~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 312 (419) +++|.| ++.. +. +.+.. ......++....+...+.+++..+...+.+.. +.+|.... T Consensus 148 A~~TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dL 227 (336) T protein:vir:37 148 ATNTTKTDLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADL 227 (336) T ss_pred ccCCCCccccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhccchHHhcCCCeEEEEchhh Confidence 234433 3210 00 00000 00001111122222234455665666555432 45666554 Q ss_pred HHHH-HHHhccCCceeccCCccc---cCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhc Q lcl|Aclame:pro 313 WESI-ELDQAPGSGVFRVIANVQ---GEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTA 388 (419) Q Consensus 313 ~~~l-~~~kd~~g~~~~~~~~~~---~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~ 388 (419) .+.= ..+-...+.. +..... -....++.|+|.+.-+++|.+.+++.-++...+++-++..+=.+.+.. .+ T Consensus 228 la~~~~~l~~~~~~~--PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p----~r 301 (336) T protein:vir:37 228 VSKETKLIQQKHGLT--PTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRNDE----DK 301 (336) T ss_pred hhhhhhhhhhhcCCC--HHHHHHHHHHHHHHhhCCceEEEccccCCCceEEeeccccEEEEecCcEEEEEEEcc----cc Confidence 4321 1111111110 000000 113457899999999999999999999988777665554444332221 12 Q ss_pred CcEEEEEEEEeccEEecccceEEEE-----ecCCC Q lcl|Aclame:pro 389 NTLVILAEFRANLAVYQPKAFVRVT-----FAAAT 418 (419) Q Consensus 389 ~~~~~r~~~r~d~~~~~~~a~~~~~-----~~aa~ 418 (419) |.+.-+-..--++.|-++.+++.+. +.+-+ T Consensus 302 ~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~e~ 336 (336) T protein:vir:37 302 KGLVTSYYRQEGYVVEDLGLMTAIDHTKVKLNGEV 336 (336) T ss_pred ccccchhhhcceeeeeccccEEEeeeeeeeccccC Confidence 2222222222333444444444432 32333 No 208 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=96.47 E-value=0.0006 Score=38.27 Aligned_cols=305 Identities=12% Similarity=-0.031 Sum_probs=139.9 Q ss_pred hhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceec Q lcl|Aclame:pro 102 KRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAG 181 (419) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (419) ++...+..+..+............. . ....+.+...+...+.....+++.+++.++++++.--.....-.. ...++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~-d-~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg-~~g~ia 77 (357) T protein:vir:56 1 MRQETRFKFNAYLSRVAELNGIDAG-D-VSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIG-VTGSIA 77 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChH-H-hcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecc-cCcccc Confidence 1111111111111111000000000 0 112233444556667777888889999999998865443332221 111111 Q ss_pred cccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCc-- Q lcl|Aclame:pro 182 STWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLNGNGS-- 256 (419) Q Consensus 182 ~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~G~g~-- 256 (419) +...+.. ..|. .|..-..++.-.+...+.-.-..|+.+.|+.. ++|...+++.+.++++.-+=.--+||+-. T Consensus 78 grtdT~~-~~~R--~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~ 154 (357) T protein:vir:56 78 STTDTAG-GTER--QPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAE 154 (357) T ss_pred ccccCCC-CCCc--ccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeec Confidence 1111100 0111 22111345666777777777778888888764 57888888888888876655566677532 Q ss_pred -----cccc------ceecc-----c-----------cccccccccccccchhhhHHHHHHHHHHh-hhhhccCC--cEE Q lcl|Aclame:pro 257 -----TEMQ------GILTT-----P-----------GIGTYQQPKPTAPATDEPPLVDIRRAKTV-AEIAGFPP--DGV 306 (419) Q Consensus 257 -----~~p~------Gi~~~-----~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~ 306 (419) .+|. |++.. + |-..........++....+...+.+++.. +...+.+. -+. T Consensus 155 ~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVv 234 (357) T protein:vir:56 155 TSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVV 234 (357) T ss_pred cCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEE Confidence 1332 33310 0 00000000011112222222333455654 45555543 255 Q ss_pred EEehHHHHH-HHHHhccCCceeccCCcccc---CCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecc Q lcl|Aclame:pro 307 VVHPQDWES-IELDQAPGSGVFRVIANVQG---EATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSH 382 (419) Q Consensus 307 ~~~~~~~~~-l~~~kd~~g~~~~~~~~~~~---~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~ 382 (419) +|.....+. .-.+-...+.+ ...... ....++-|+|.+.-+++|.+.+++.-++...+++-++..+=.+.+.. T Consensus 235 ivG~dLla~k~~~l~n~~~~p---TE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p 311 (357) T protein:vir:56 235 IVGRQLLADKYFPIVNKEQDN---SEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENP 311 (357) T ss_pred EEchhhhhhhhhhHhhccCCh---HHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEecc Confidence 666665542 11222121111 111001 11357899999999999999999999988777665554444332221 Q ss_pred c-----chhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 383 A-----DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 383 ~-----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) . ++...| .+|.++.+.-+..... +.+..++.+. T Consensus 312 ~r~riE~y~s~N-e~YvVEd~~~~a~iE~---i~i~~~~~~~ 349 (357) T protein:vir:56 312 KLDRVENYESMN-IDYVVEDYAAGCLVEK---IKVGDFSTPA 349 (357) T ss_pred ccccccchhhhc-ceeeeeccccEEEeee---eeeccCCCCc Confidence 1 112222 2444444444444332 1122222222 No 209 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=96.44 E-value=0.00062 Score=38.17 Aligned_cols=304 Identities=7% Similarity=-0.124 Sum_probs=140.6 Q ss_pred HHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeecccc Q lcl|Aclame:pro 98 RARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGT 177 (419) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 177 (419) +.+.++...+..+.............+ .......+...+...+.....+++-+++.+++++|.--.....-.. .. T Consensus 1 m~~~m~~~tr~~~~~y~~~~A~~ngv~----~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg-~~ 75 (341) T protein:vir:27 1 MSQILTQSAREYMDNFAQQLAKSYGVS----NVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVG-VS 75 (341) T ss_pred CcccccHHHHHHHHHHHHHHHHHcCcc----cccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeecc-cc Confidence 111111111111111111111111110 1112233434556777778888889999999988865433322211 11 Q ss_pred ceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH------HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 178 AGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN------SQLMGYIQGRLTYGLRFLRDRQLL 251 (419) Q Consensus 178 ~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~------~~~~~~i~~~l~~a~~~~~d~~il 251 (419) .++..... .+..|. ++.++...+...+.-.-+.|+.+.|+.. ++|...+.+.+.++++.-+=.--+ T Consensus 76 g~iagrtd-------t~R~~r-~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGf 147 (341) T protein:vir:27 76 GLYTGRKA-------GGRFTK-QVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGW 147 (341) T ss_pred cceeeccC-------CCceec-ccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcc Confidence 11111111 122222 2356666667766666677778776542 568888888888888766666666 Q ss_pred hccCc-------cccc------ceecc-----c-cccccccccccccchhhhHHHHHHHHHHhh-hhhccCCc--EEEEe Q lcl|Aclame:pro 252 NGNGS-------TEMQ------GILTT-----P-GIGTYQQPKPTAPATDEPPLVDIRRAKTVA-EIAGFPPD--GVVVH 309 (419) Q Consensus 252 ~G~g~-------~~p~------Gi~~~-----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~~ 309 (419) +|.-. ..|. |++.. + -+.+.........+....+...+.+++..+ ...+.+.. +.+|. T Consensus 148 nGts~A~~Td~~anPllqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG 227 (341) T protein:vir:27 148 NGVSAEADTDPSANPLGQDVNEGWIAFVKNRKASQVVDVDVYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVG 227 (341) T ss_pred cceeeccCCChhhcccccccchhHHHHHHhhcccceeccceeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEc Confidence 77541 1333 33321 0 011111111111222222222344555543 45544432 56677 Q ss_pred hHHHHH-HHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeeccc-chhh Q lcl|Aclame:pro 310 PQDWES-IELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHA-DFFT 387 (419) Q Consensus 310 ~~~~~~-l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~-~~~~ 387 (419) ....+. .-.+-.....+ -..........++.|+|.+.-+++|.+.+++.-++...+++-++..+=.+.+... +-++ T Consensus 228 ~dLla~k~~~l~n~~~~p--tE~~Aa~~i~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie 305 (341) T protein:vir:27 228 SGLIGAAQAKLYDKADKP--SEQIAAQKLDKTIAGRPAYVPPFLPDNAMVVTIPENLQVLTQHGTAQRKAKHESDRKRSK 305 (341) T ss_pred hhhhhhhhhhhhccCCCC--HHHHHHHHHHHhhCCCeEEEccccCCCceEEeeccceEEEEecCcEEEEEEecccccccc Confidence 665542 22221111110 0000111223589999999999999999999999887766655444443322221 1111 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .-+-+|+++.+ .....-.|..+|.+++.- T Consensus 306 ~yes~YvVEdy---g~~~~~~~~~vkl~~~~~ 334 (341) T protein:vir:27 306 THTGAWKVTQW---VCWKRSPLTTQKKSTSAL 334 (341) T ss_pred chhhhheeehh---hhhhhccccccccCcccc Confidence 11113444332 222223344455544433 No 210 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=96.34 E-value=0.00013 Score=41.96 Aligned_cols=306 Identities=16% Similarity=0.130 Sum_probs=132.3 Q ss_pred hhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc Q lcl|Aclame:pro 84 LAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA 163 (419) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~ 163 (419) ........+....|..-.++..+..+.+..++.-. .+.|-+.+.....+|+-+...|...+..+.++...+.+..+ T Consensus 1 mtn~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L----~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~ 76 (318) T protein:vir:86 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKL----AENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV 76 (318) T ss_pred CcchhhhhHHHHHHHHHHhccCCchhhhhhhhhhh----hhcCceeeccchhccHHHHHHHHHhhhccCcceeeeeeccc Confidence 11111112222333333344444444444333322 23344445555677887777777777777666654444333 Q ss_pred cCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhH-HHHhhH----HHHHHHHHHHH Q lcl|Aclame:pro 164 DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITR-QAADDN----SQLMGYIQGRL 238 (419) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~-ell~d~----~~~~~~i~~~l 238 (419) + .+.+..... +...|...-.|+.+.+...+|..-++.+ ++.++..|- ++..+. ..+.+||..+| T Consensus 77 ~----~~~V~~s~~-----s~AeAq~HkdGqTK~eqa~~~~~~Tl~~--~~VY~~~S~Ae~~K~~~~sYsel~N~i~~EL 145 (318) T protein:vir:86 77 G----ALLVSRSFD-----SSAEAQVHKDGQTKTEQAATLTIDTLEP--VMVYKLQSLAERVKRLQMSYSELYNLIVAEL 145 (318) T ss_pred h----hhhhhhhhh-----hhhhhhhhccCCccccceeeeeeechhH--HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHH Confidence 2 222211111 1234556667888877666655444444 333443332 344443 25789999999 Q ss_pred HHHHH-HHHHHHHHhccCcccccceeccccccccccccccc-cchhhhHHHHHHHHHHhhhhhccCCcEEEEehHH-HHH Q lcl|Aclame:pro 239 TYGLR-FLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA-PATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQD-WES 315 (419) Q Consensus 239 ~~a~~-~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 315 (419) ++++. +..|++++-|+|++....+.....+......++.+ .++.+.....+..++.-+.+..++. ..++...+ .+. T Consensus 146 tQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagttpfanaieeavdfvrptagrr-ylivkaedrkal 224 (318) T protein:vir:86 146 TQAIVNKIVDLALVEGDGSNGFKSIDKEADVKKIKKITTKAKSAGTTPFANAIEEAVDFVRPTAGRR-YLIVKAEDRKAL 224 (318) T ss_pred HHHHHHHHHHhhheeecCCCCccchhhHHHHHHHHHHhhhhhccCCCchhhHHHHHHhhhccCCCce-EEEEeecchHHH Confidence 99999 89999999999998755554433322221111111 1111111112223333333322221 23444433 333 Q ss_pred HHHHhccCCce-eccCCccccCCCcccccce---eEecCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcE Q lcl|Aclame:pro 316 IELDQAPGSGV-FRVIANVQGEATPRIWGLN---VVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTL 391 (419) Q Consensus 316 l~~~kd~~g~~-~~~~~~~~~~~~~~l~G~p---v~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 391 (419) |..++.+..+. ..+..+-+.. ..-.|+. |+.-.. .-..-++.|-. |.+ +-++++ .-+.-.|.+|.- T Consensus 225 ldelrqatanahvriknddtei--asevgvdeiivytgsk-alkptvlvdqk--yhi-dmqdlt----kvdafewktnsn 294 (318) T protein:vir:86 225 LDELRQATANAHVRIKNDDTEI--ASEVGVDEIIVYTGSK-ALKPTVLVDQK--YHI-DMQDLT----KVDAFEWKTNSN 294 (318) T ss_pred HHHHHhhcccceeEEeccchhh--hhhcCcceeeeeeccc-cccceeeeccc--eec-chhhhh----hhhcceeccCCc Confidence 44555433221 1111111100 0111221 111111 11112333322 221 112221 011112455444 Q ss_pred EEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 392 VILAEFRANLAVYQPKAFVRVTFA 415 (419) Q Consensus 392 ~~r~~~r~d~~~~~~~a~~~~~~~ 415 (419) -+.++..-.+-+.--+|-+++++. T Consensus 295 milvetltsghvetynagavitvs 318 (318) T protein:vir:86 295 MILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred eEEEeecccCcceeecCceeEEeC Confidence 445555555555444555555555 No 211 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=96.34 E-value=0.00072 Score=37.81 Aligned_cols=304 Identities=12% Similarity=-0.024 Sum_probs=139.0 Q ss_pred hhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceec Q lcl|Aclame:pro 102 KRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAG 181 (419) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (419) ++...+..+..+............. .....+.+...+...+.....+++.+++.++++++.--.....-.. ...++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~--d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg-~~g~ia 77 (357) T protein:vir:20 1 MRQETRFKFNAYLSRVAELNGIDAG--DVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIG-VTGSIA 77 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChH--HhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecc-cCcccc Confidence 1111111111111111000000000 0112233444556667777888889999999998865443332211 111111 Q ss_pred cccccceeecCcccccccc-cceeeEEeeeEEEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCc- Q lcl|Aclame:pro 182 STWNKAAVVPEGTAKPQST-LSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLNGNGS- 256 (419) Q Consensus 182 ~~~~~a~~v~Eg~~~~~~~-~~~~~v~~~~~k~~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~G~g~- 256 (419) +...+. -+......+ ..++.-.+...+.-.-..|+.+.|+.. ++|...+++.+.++++.-+=.--+||+-. T Consensus 78 grtdT~----~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A 153 (357) T protein:vir:20 78 STTDTA----GGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRA 153 (357) T ss_pred ccccCC----CCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeee Confidence 111110 011111122 345666777777777778888888764 57888888888888876655566677532 Q ss_pred ------cccc------ceecc-----c-----------cccccccccccccchhhhHHHHHHHHHHh-hhhhccCC--cE Q lcl|Aclame:pro 257 ------TEMQ------GILTT-----P-----------GIGTYQQPKPTAPATDEPPLVDIRRAKTV-AEIAGFPP--DG 305 (419) Q Consensus 257 ------~~p~------Gi~~~-----~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~ 305 (419) .+|. |++.. + |-..........++....+...+.+++.. +...+.+. -+ T Consensus 154 ~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLV 233 (357) T protein:vir:20 154 ETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLV 233 (357) T ss_pred ccCChhhCcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEE Confidence 1332 33310 0 00000000011112222222333455654 45555543 25 Q ss_pred EEEehHHHHH-HHHHhccCCceeccCCcccc---CCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeec Q lcl|Aclame:pro 306 VVVHPQDWES-IELDQAPGSGVFRVIANVQG---EATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDS 381 (419) Q Consensus 306 ~~~~~~~~~~-l~~~kd~~g~~~~~~~~~~~---~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~ 381 (419) .+|.....+. .-.+-...+.+ ...... ....++-|+|.+.-+++|.+.+++.-++...+++-++..+=.+.+. T Consensus 234 vivG~dLla~k~~~l~n~~~~p---tE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~ 310 (357) T protein:vir:20 234 VIVGRQLLADKYFPIVNKEQDN---SEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEEN 310 (357) T ss_pred EEEchhhhhhhhhhHhhccCCh---HHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEec Confidence 5666665542 11221121111 111001 1135789999999999999999999998877766555444433222 Q ss_pred cc-----chhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 382 HA-----DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 382 ~~-----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .. ++...| .+|.++.+.-+..... +.+..++.|. T Consensus 311 p~r~riE~y~s~N-e~YvVEd~~~~a~iE~---i~~~~~~~p~ 349 (357) T protein:vir:20 311 PKLDRVENYESMN-IDYVVEDYAAGCLVEK---IKVGDFSTPA 349 (357) T ss_pred cccccccchhhhc-ceeeeeccccEEEeee---eeeccccCCc Confidence 11 111222 2344443333333321 1111222222 No 212 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=96.01 E-value=0.00026 Score=40.23 Aligned_cols=335 Identities=10% Similarity=-0.051 Sum_probs=142.4 Q ss_pred HHHH--Hhhccccc---ccccchhhhh----hHHHHhHHHHHHHHHhhhhhhhhHH-HHHHHHHHhh-hcccccccccCC Q lcl|Aclame:pro 63 PKGP--ADGGTPLT---PAEAGTFRSL----AQRFADSDGLREYRARDKRGQFQVE-MRDIDPNRLL-SRDAPAGTITNP 131 (419) Q Consensus 63 ~~~~--~~~~~~~~---~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~~~~~ 131 (419) +.+. .+..-..+ +..-...+.. .....+.+.+....... ......+ +......... ........ +. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~-~~~~~~~~~~~~~~~~~a~da~~~~~~--t~ 77 (388) T protein:vir:99 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHA-TVKRQIELLHEGGVATQAFDSAYVAPT--TQ 77 (388) T ss_pred CCCccceeeecCCcccchhhhhcCCcceeeechhhHhhhhcceeccCc-cchhhhhhhhhhhhhhcccCccccccc--cc Confidence 0000 00000000 0000000000 00000000000000000 0000000 0000000000 11111111 22 Q ss_pred cccccchhhh----HHHHHhhhhhhhHHhhcceecccCc---ceeeeeeccccceeccccccceeecCccccccccccee Q lcl|Aclame:pro 132 NVPHLPQLVP----GIVPTTPDLPLLVADLLDQQNADYN---VLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFD 204 (419) Q Consensus 132 ~~~~~p~~~~----~~i~~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 204 (419) ++.-+|-.+. ..+++.+........++++.+++.. .+.|+.. ...+.+.+.+-+.+.|..+.... T Consensus 78 ~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~--------e~~G~A~~ygd~~D~Pl~d~~~~ 149 (388) T protein:vir:99 78 ASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIV--------EPAGTAMEYGDLTNIPLSSWNVN 149 (388) T ss_pred CcccHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeee--------ecceeEEEeecccCCCceeccce Confidence 2233444443 3444444555555566666664322 2222222 12346677788888888887777 Q ss_pred eEEeeeEEEEEeehhhHHHHhhHH----HHHHHHHHHHHHHHHHHHHHHHHhcc-Cc--ccccceecccccccccccc-- Q lcl|Aclame:pro 205 TITTTLKTVAHWLPITRQAADDNS----QLMGYIQGRLTYGLRFLRDRQLLNGN-GS--TEMQGILTTPGIGTYQQPK-- 275 (419) Q Consensus 205 ~v~~~~~k~~~~~~vs~ell~d~~----~~~~~i~~~l~~a~~~~~d~~il~G~-g~--~~p~Gi~~~~~~~~~~~~~-- 275 (419) ..+-..+.++..+.++.+=+..+. ++..--.....+++...+|+-.|+|. |. .+..|++|.|.+......+ T Consensus 150 ~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v~at~~ 229 (388) T protein:vir:99 150 FERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTP 229 (388) T ss_pred eeeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCcccccccccC Confidence 777777777777878765444332 57888888999999999999999994 32 3678999998875443222 Q ss_pred ----ccccchhhhHHHHHHHHHHhhhhhcc---C----CcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccc Q lcl|Aclame:pro 276 ----PTAPATDEPPLVDIRRAKTVAEIAGF---P----PDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGL 344 (419) Q Consensus 276 ----~~~~~~~~~~~~~~~~~~~~~~~~~~---~----~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~ 344 (419) .....+....++|+..++..+...-. . +..++|.+..+..|... ...|-.++ ..... ..-++ T Consensus 230 ~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tvl--~~lk~----n~Pnl 302 (388) T protein:vir:99 230 GGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVR--DWLKQ----TYPRV 302 (388) T ss_pred CcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhcccc-CcCCccHH--HHHHH----hcCCc Confidence 23345677789999988888744322 1 12577888888777432 11121110 00000 11122 Q ss_pred eeEecCCCC------cCcE-EEEecc-ce-EEEEEecceEEE--Eeeccc-chhhcCc--EEEEEEEE-eccEEecccce Q lcl|Aclame:pro 345 NVVSTVAIA------QGTA-LVGGFR-QG-ATLWSRQGITVL--MTDSHA-DFFTANT--LVILAEFR-ANLAVYQPKAF 409 (419) Q Consensus 345 pv~~~~~~~------~~~~-~~~d~~-~~-~~~~~~~~~~i~--~~~~~~-~~~~~~~--~~~r~~~r-~d~~~~~~~a~ 409 (419) .++.-..+. .+.. ++.... .. .........+.. +..... ...+... +..-...| .+..+++|.|| T Consensus 303 ~i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~~~~~~rt~Gv~ir~P~Ai 382 (388) T protein:vir:99 303 RVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAV 382 (388) T ss_pred EEEEecccccccccCCceeEEEEecccccccccCccCcceeEEecccccccccceecCceeEeccccceeeeEEeccchh Confidence 233222211 1111 111100 00 000000000000 000000 0001111 11222334 44566779999 Q ss_pred EEEEec Q lcl|Aclame:pro 410 VRVTFA 415 (419) Q Consensus 410 ~~~~~~ 415 (419) ++++-= T Consensus 383 ~~~~GI 388 (388) T protein:vir:99 383 VRLIGL 388 (388) T ss_pred heeccC Confidence 988755 No 213 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=95.86 E-value=0.0013 Score=36.33 Aligned_cols=308 Identities=12% Similarity=-0.034 Sum_probs=139.3 Q ss_pred hhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceec Q lcl|Aclame:pro 102 KRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAG 181 (419) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (419) ++...+..+..+............. . ....+.+...+...+.....+++.+++.++++++.--.....-.. ...++. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~-d-~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg-~~g~ia 77 (357) T protein:vir:60 1 MRQETRFKFNAYLSRVAELNGIDAG-D-VSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIG-VTGSIA 77 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChH-H-hcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecc-cCcccc Confidence 1111111111111111100000000 0 112233444556667777888889999999998865443332211 111111 Q ss_pred cccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCc-- Q lcl|Aclame:pro 182 STWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLNGNGS-- 256 (419) Q Consensus 182 ~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~---~~~~~~i~~~l~~a~~~~~d~~il~G~g~-- 256 (419) +...+.. ..|. .|..-..++.-.+...+.-.-..|+.+.|+.. ++|...+++.+.++++.-+=.--+||+-. T Consensus 78 grtdT~~-~~~R--~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~ 154 (357) T protein:vir:60 78 STTDTAG-GTER--QPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAE 154 (357) T ss_pred cccccCC-CCCc--ccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeec Confidence 1111100 0111 22212345666777777777778888888764 57888888888888876665566677532 Q ss_pred -----cccc------ceecc-----c-----------cccccccccccccchhhhHHHHHHHHHHh-hhhhccCC--cEE Q lcl|Aclame:pro 257 -----TEMQ------GILTT-----P-----------GIGTYQQPKPTAPATDEPPLVDIRRAKTV-AEIAGFPP--DGV 306 (419) Q Consensus 257 -----~~p~------Gi~~~-----~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~ 306 (419) .+|. |++.. + |-..........++....+...+.+++.. +...+.+. -+. T Consensus 155 ~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVv 234 (357) T protein:vir:60 155 TSDRSSNQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVV 234 (357) T ss_pred cCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEE Confidence 1332 33310 0 00000000011112222222333455654 45555543 255 Q ss_pred EEehHHHHH-HHHHhccCCceeccCCccc-c--CCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeecc Q lcl|Aclame:pro 307 VVHPQDWES-IELDQAPGSGVFRVIANVQ-G--EATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSH 382 (419) Q Consensus 307 ~~~~~~~~~-l~~~kd~~g~~~~~~~~~~-~--~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~ 382 (419) +|.....+. ...+-...+.+ ..... + ....++-|+|.+.-+++|.+.+++.-++...+++-++..+=.+.+.. T Consensus 235 ivG~dLla~k~~~l~n~~~~p---TE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p 311 (357) T protein:vir:60 235 IVGRQLLADKYFPIVNREQDN---SEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENP 311 (357) T ss_pred EEchhhhhHHhhhHhhcCCCh---HHHHHHHHHHHhhhhcCcceEEccccCCCceEEeeccccEEEEecCcEEEEEEecc Confidence 666665542 11221111111 00000 1 12357899999999999999999999988777665554444332221 Q ss_pred c-----chhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 383 A-----DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 383 ~-----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) . ++...| .+|.++.+.-+.....-.|+...-++... T Consensus 312 ~r~riE~y~s~N-e~YvVEd~~~~a~iE~i~~~~~~~pa~~~ 352 (357) T protein:vir:60 312 KLDRVENYESMN-IDYVVEDYAAGCLVEKIKVGDFSTPAKAT 352 (357) T ss_pred ccccccchhhhc-ceeeeeccccEEEeeeeeeccCcccccCC Confidence 1 112222 24444444444433321111111111111 No 214 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=95.55 E-value=0.0019 Score=35.55 Aligned_cols=296 Identities=10% Similarity=0.051 Sum_probs=140.7 Q ss_pred HHHHhhhcccccccccCCcccccchhh--hHHHHHhhhhhhhHHhhcceecccCc---ceeeeeeccccceeccccccce Q lcl|Aclame:pro 114 DPNRLLSRDAPAGTITNPNVPHLPQLV--PGIVPTTPDLPLLVADLLDQQNADYN---VLEYIRDTSGTAGAGSTWNKAA 188 (419) Q Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~p~~~--~~~i~~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~a~ 188 (419) ..+..........+.....+.-..+.+ ...++.... ...+.+++...|+..+ .+++-+-...... ..-..-+ T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~-~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~--~~pl~eG 77 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARK-EQYFMPLASVTNMPKHYGKTIKVYEYVPLLDD--RNINDQG 77 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhh-hhhhhhcccccccccccCCeEEEEeccccccc--ccchhcC Confidence 000000011111111111122222222 444554444 4777888888777433 3333221111100 0000111 Q ss_pred eecCcccc-----------------------------cccccceeeEEeeeEEEEEeehhhHHHHhh-H-HHHHHHHHHH Q lcl|Aclame:pro 189 VVPEGTAK-----------------------------PQSTLSFDTITTTLKTVAHWLPITRQAADD-N-SQLMGYIQGR 237 (419) Q Consensus 189 ~v~Eg~~~-----------------------------~~~~~~~~~v~~~~~k~~~~~~vs~ell~d-~-~~~~~~i~~~ 237 (419) .-++|+++ .....+-..+..+.++++.++.+|+++... . +.+.+.|..+ T Consensus 78 v~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~e 157 (401) T protein:vir:95 78 IDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRE 157 (401) T ss_pred CCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHH Confidence 11222211 111223345667899999999999998763 3 3566554333 Q ss_pred ----HHHHHHHHHHHHHHhccCcccccceeccccccc-cccccccccchhhhHHHHHHHHHHhhhhh------------- Q lcl|Aclame:pro 238 ----LTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGT-YQQPKPTAPATDEPPLVDIRRAKTVAEIA------------- 299 (419) Q Consensus 238 ----l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------- 299 (419) -+......+-+.+|++-+.- -.+|..+ .........+.....++++..+...|... T Consensus 158 ll~g~~~~t~d~i~~dll~ag~~v------iyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~ 231 (401) T protein:vir:95 158 LMNGATQITEAVLQKDLLAAAGTV------LYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSR 231 (401) T ss_pred HhhhhhhhHHHHHHHHHHhhcCee------ecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhh Confidence 33333555666777643210 1112111 12222233344455677776666555421 Q ss_pred -c----cCCcE-EEEehHHHHHHHHHhccCCceeccC-------CccccCCCcccccceeEecCCCC--------cC--- Q lcl|Aclame:pro 300 -G----FPPDG-VVVHPQDWESIELDQAPGSGVFRVI-------ANVQGEATPRIWGLNVVSTVAIA--------QG--- 355 (419) Q Consensus 300 -~----~~~~~-~~~~~~~~~~l~~~kd~~g~~~~~~-------~~~~~~~~~~l~G~pv~~~~~~~--------~~--- 355 (419) + ..++. -+||+.....|+.++|-.|.+-|.. .....+..+.+.++.+++++.+- ++ T Consensus 232 ~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~ 311 (401) T protein:vir:95 232 MIDTKVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGAN 311 (401) T ss_pred ccCccccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCcccccccc Confidence 1 11222 3679999999998888777655542 22223445677889999888742 10 Q ss_pred ------------------cEEEEeccceEEEEEecce----EEEEeeccc----chhhcCcEEEEE-EEEeccEEecccc Q lcl|Aclame:pro 356 ------------------TALVGGFRQGATLWSRQGI----TVLMTDSHA----DFFTANTLVILA-EFRANLAVYQPKA 408 (419) Q Consensus 356 ------------------~~~~~d~~~~~~~~~~~~~----~i~~~~~~~----~~~~~~~~~~r~-~~r~d~~~~~~~a 408 (419) .+++|....+...+...+. .+.+..... ..=--+|+++.. .++..+.+++++= T Consensus 312 ~~y~~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e~ 391 (401) T protein:vir:95 312 PGYRTSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPER 391 (401) T ss_pred cccccccccCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheeccce Confidence 1233333333333333332 222222110 000124555553 4567778899999 Q ss_pred eEEEEecCCC Q lcl|Aclame:pro 409 FVRVTFAAAT 418 (419) Q Consensus 409 ~~~~~~~aa~ 418 (419) +++++..+.. T Consensus 392 m~~ies~a~~ 401 (401) T protein:vir:95 392 LALIKTVAPL 401 (401) T ss_pred eEEEEeecCC Confidence 9999888877 No 215 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=94.67 E-value=0.0017 Score=35.84 Aligned_cols=309 Identities=16% Similarity=0.122 Sum_probs=136.9 Q ss_pred hhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecc Q lcl|Aclame:pro 84 LAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNA 163 (419) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~ 163 (419) .............|..-..+.....+.+..++.... +.|.+.+.....+|.-....|...+....++...+.+.++ T Consensus 1 mtnfiesqnavteffdvlkknsgkseiknawnakla----engvtitdttfqlprklvesintallntnpvfkvfhvtnv 76 (318) T protein:vir:94 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLA----ENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV 76 (318) T ss_pred CccchhhhhhHHHHHHHHhcccChhhhhhhhhhhhh----hCCceeecchhhhHHHHHHhhhhhhccCCcceeeeeehhh Confidence 000011112223344444444445555544443332 2333334444555655555565555555555444444444 Q ss_pred cCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHH--HhhH-HHHHHHHHHHHHH Q lcl|Aclame:pro 164 DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQA--ADDN-SQLMGYIQGRLTY 240 (419) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~el--l~d~-~~~~~~i~~~l~~ 240 (419) +.- .+..++ ...+.+...-.|+.+++...++.-=++.+.-+.....+.... ++.+ ..+...|..++.+ T Consensus 77 gal----lvsrsf-----dssneaqvhkdgqtkteqaatltidtlepvmvyklqslaervkrlqmsyselynlivaeltq 147 (318) T protein:vir:94 77 GAL----LVSRSF-----DSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 147 (318) T ss_pred hhe----eeeccc-----cccchhhhhcccccccccceeeeecccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHH Confidence 332 222121 234567788889999988877666666665555444444443 2333 4688899999999 Q ss_pred HHHHHH-HHHHHhccCcccccceecccccccccccccc-ccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHH-HHHHH Q lcl|Aclame:pro 241 GLRFLR-DRQLLNGNGSTEMQGILTTPGIGTYQQPKPT-APATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQD-WESIE 317 (419) Q Consensus 241 a~~~~~-d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~ 317 (419) ++..++ |.+++-|+|+++.+.|.....+......+.. ..++.+...+.+..++.-+.+..++. ..++...+ .+.|. T Consensus 148 aivnkivdlalvegdgtngfksidkeadvkkikkittkaksagktpfadaieeavdfvrptagrr-ylivktedrkalld 226 (318) T protein:vir:94 148 AIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRR-YLIVKTEDRKALLD 226 (318) T ss_pred HHHhhhhheeeeecCCcchhhhhchhhhHHHHHHhhhhhhhcCCCchhHHHHHHHhhhccCCCce-EEEEeccchHHHHH Confidence 988774 6788889999887777654433322211111 11111112233334443333332222 23343333 33444 Q ss_pred HHhccCCce-eccCCccccCCCcccccce-eEe-cCCCCcCcEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEE Q lcl|Aclame:pro 318 LDQAPGSGV-FRVIANVQGEATPRIWGLN-VVS-TVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVIL 394 (419) Q Consensus 318 ~~kd~~g~~-~~~~~~~~~~~~~~l~G~p-v~~-~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r 394 (419) .++.+..+. ..+..+-+. ...-.|+. +++ +..-....-++.|-. |.+ +-++++- -+.-.|.+|.--+. T Consensus 227 elrqatananvriknddte--iasevgvdeiivytgskavkptvlvdqk--yhi-dmqdltk----vdafewktnsnmil 297 (318) T protein:vir:94 227 ELRQATANANVRIKNDDTE--IASEVGVDEIIVYTGSKAVKPTVLVDQK--YHI-DMQDLTK----VDAFEWKTNSNMIL 297 (318) T ss_pred HHHhhhcccceEEeccchh--hhhhcCcceeEEeeccccccceeEeccc--eec-chhhhhh----hhceeeccCCceEE Confidence 554433221 111111000 00111221 111 111111112333322 221 1122210 11112445444445 Q ss_pred EEEEeccEEecccceEEEEec Q lcl|Aclame:pro 395 AEFRANLAVYQPKAFVRVTFA 415 (419) Q Consensus 395 ~~~r~d~~~~~~~a~~~~~~~ 415 (419) ++..-.+-+.--+|-+++++. T Consensus 298 vetltsghvetynagavitvs 318 (318) T protein:vir:94 298 VETLTSGHVETYNAGAVITVS 318 (318) T ss_pred EEecccCcceeecCceeEEeC Confidence 555555555444555555555 No 216 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=94.63 E-value=0.0039 Score=33.80 Aligned_cols=285 Identities=11% Similarity=0.005 Sum_probs=131.6 Q ss_pred hhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhc---ceecccC-cceeeeeeccccce Q lcl|Aclame:pro 104 GQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL---DQQNADY-NVLEYIRDTSGTAG 179 (419) Q Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~---~~~~~~~-~~~~~~~~~~~~~~ 179 (419) ... ..+.+........+.. ...+..-.....++.|. ++.+.++ .++..|..- T Consensus 1 mp~-~~lsel~t~tl~~rs~------------------~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l~y----- 56 (321) T protein:vir:34 1 MPF-PNISDIITTTIESRSG------------------VIADNVTKNNAILARLAKRGKPRLVSGGYTILEELSF----- 56 (321) T ss_pred CCC-chHHHHHHHHHHhhcc------------------hhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEEee----- Confidence 000 0111111111111100 00000011111222222 2223332 344443322 Q ss_pred eccccccceeec-CcccccccccceeeEEeeeEEEEEeehhhHH-HHhhHH--HHHHHHH---HHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 180 AGSTWNKAAVVP-EGTAKPQSTLSFDTITTTLKTVAHWLPITRQ-AADDNS--QLMGYIQ---GRLTYGLRFLRDRQLLN 252 (419) Q Consensus 180 ~~~~~~~a~~v~-Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~e-ll~d~~--~~~~~i~---~~l~~a~~~~~d~~il~ 252 (419) ....++.|-. +..-...-...|..-++..+.+++-+.||-. +++.+. .+.+++. +...+.+...++..+.. T Consensus 57 --~~~s~~~wy~Gyd~l~~~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~s 134 (321) T protein:vir:34 57 --SGNSNGGWYSGYDVLPTAPQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYG 134 (321) T ss_pred --ccCcceeEEEeeeeeccchhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhc Confidence 1233555544 2222233335789999999999999988876 444432 3444444 45556677777777775 Q ss_pred -ccC--cccccceecccc----------cccccc-------ccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHH Q lcl|Aclame:pro 253 -GNG--STEMQGILTTPG----------IGTYQQ-------PKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQD 312 (419) Q Consensus 253 -G~g--~~~p~Gi~~~~~----------~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (419) |+| ..+..|+--... +..... ....+..+.......+..+..+.-.....|+.|+++..- T Consensus 135 dGTa~g~~~i~GL~~lv~~~p~tGtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~PDlii~~~~~ 214 (321) T protein:vir:34 135 DGTAFGGRAINGLDGAVPVDPTVGTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMPDLIMSGNDA 214 (321) T ss_pred cccccccchhhhhhhhcccCCCCceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCccEEEechHH Confidence 554 334444432111 110000 000111122222222223333344445567789999998 Q ss_pred HHHHHHHhccCCceeccCCccccC-CCcccccceeEecC----CCCcCcEEEEeccceEEEEEecceEEEEeecccchhh Q lcl|Aclame:pro 313 WESIELDQAPGSGVFRVIANVQGE-ATPRIWGLNVVSTV----AIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) Q Consensus 313 ~~~l~~~kd~~g~~~~~~~~~~~~-~~~~l~G~pv~~~~----~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~ 387 (419) |..++.......++- .......| ..-...|..|+.+. .+|++++++.|.+...+.....+...-+.+..-..+- T Consensus 215 y~~y~~s~q~~qR~~-~~~~a~~Gf~~Lky~~~div~D~~~g~~~pan~~yfiNT~yl~~r~h~~~~~~pi~p~r~~~~N 293 (321) T protein:vir:34 215 WTTYSNSLQVLQRFT-SAEEANLGFRSLKFLSTDVVLDGGIGGFAGANTMYFLNTKYLHFRPHKDRNMVPLSPSRRAAFN 293 (321) T ss_pred HHHHHHhhheeeeec-ccccccccceeeeeeeEEEEEeCCCCCCccccceeeeecceEEEEEcCCCceeecCcccccccc Confidence 888876544333321 11111111 12246688888887 6899999999988544433333333333332211233 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 388 ANTLVILAEFRANLAVYQPKAFVRVTFA 415 (419) Q Consensus 388 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 415 (419) +|.+.-.+.++....+-++.+=.+++-- T Consensus 294 qdA~~q~I~~~GnL~~sn~~~~~vL~~~ 321 (321) T protein:vir:34 294 QDAEAQILAWAGNLTCSGAQFQGRLIAE 321 (321) T ss_pred hhHHhhhhhhhheeeeecccceeEEeeC Confidence 4444445566666666667666666555 No 217 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=94.53 E-value=0.0042 Score=33.64 Aligned_cols=304 Identities=7% Similarity=-0.071 Sum_probs=130.8 Q ss_pred hhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceec Q lcl|Aclame:pro 102 KRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAG 181 (419) Q Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (419) ++...+..+.............+......+..+.+...+...+.....+++-+++.++++++.--...+.-. +...... T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~-~~sg~~t 79 (343) T protein:vir:98 1 MNKTAQELFYSLIGDAAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLR-SNRKRHY 79 (343) T ss_pred CChHHHHHHHHHHHHHHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEe-ecCcccc Confidence 111111111111111100011000000111223444556677777888889999999998885322222111 1000000 Q ss_pred cccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH---HH-HHHHHHHHHHHHHHHHHHHHHHhccC-- Q lcl|Aclame:pro 182 STWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQ-LMGYIQGRLTYGLRFLRDRQLLNGNG-- 255 (419) Q Consensus 182 ~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~---~~-~~~~i~~~l~~a~~~~~d~~il~G~g-- 255 (419) ....+ -+..... ...+.-.+...+.-.-..|+.+.|+.. ++ |...+++.+.++++.-+=.--+||+- T Consensus 80 ~r~~t-----~~~~~~~--~~~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A 152 (343) T protein:vir:98 80 GAHDR-----RTPIQQR--WTRQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVG 152 (343) T ss_pred Ccccc-----CCCcccc--ccCCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeec Confidence 00000 0000000 011112355555555567888877754 56 88888888888876655555567753 Q ss_pred --ccccc------ceecc-----c------cccccccccccccchhhhHHHHHHHHHHhhhhhccCC--cEEEEehHHHH Q lcl|Aclame:pro 256 --STEMQ------GILTT-----P------GIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPP--DGVVVHPQDWE 314 (419) Q Consensus 256 --~~~p~------Gi~~~-----~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 314 (419) +++|. |++.. + +...........++....+...+.++...+...+.+. -+.+|.....+ T Consensus 153 ~~T~nPllqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla 232 (343) T protein:vir:98 153 TDTSDPNLADVNKGWIQFVRENKATQILTQGATSGEIRLFGEGADYVNLDELAYDLKQGLDARHRDAGDLVFLVGADLVA 232 (343) T ss_pred cCCCCcchhhcchHHHHHHHhcchhhhhccceeccceeEecCCCCcccHHHHHHHHHhcCchHHhcCCCEEEEEchhhhh Confidence 23443 22210 0 0000000000111112222222234444455555543 24566655543 Q ss_pred HHH-HHhccCCceeccCCccc---cCCCcccccceeEecCCCCcCcEEEEeccceEEEEEecceEEEEeeccc-----ch Q lcl|Aclame:pro 315 SIE-LDQAPGSGVFRVIANVQ---GEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHA-----DF 385 (419) Q Consensus 315 ~l~-~~kd~~g~~~~~~~~~~---~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~~~~~~-----~~ 385 (419) .-. .+-...++. +..... -....++-|+|.+.-+++|.+.+++.-++...+++-++..+=.+.+... ++ T Consensus 233 ~~~~~l~n~~~~~--ptEk~Aa~~~~~~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y 310 (343) T protein:vir:98 233 KEASLVYKGNGLI--ATEKAALNTHDLMKSFGGMPAMIVPNMPPRAAIVTSLSNLSIYTQEGSMRRGMKDDDDKKAVRDS 310 (343) T ss_pred hhhhhhhhhcCCC--hHHHHHHHHHHHHHhhCCCeeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccch Confidence 321 111111211 000000 0123578999999999999999999999887776655544443322211 11 Q ss_pred hhcCcEEEEEEEEeccEEecccceEEEEecCC-CC Q lcl|Aclame:pro 386 FTANTLVILAEFRANLAVYQPKAFVRVTFAAA-TT 419 (419) Q Consensus 386 ~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa-~~ 419 (419) ...| .+|.++.+.-+.....-.|+ +++. .+ T Consensus 311 ~s~N-e~YvVEd~~~~a~iE~i~v~---~~~~~g~ 341 (343) T protein:vir:98 311 YYRN-EAYAVEDCGKFMAVDFTKVK---LSSGKGT 341 (343) T ss_pred hhhc-ceeeeeccccEEEeeeeeee---ecCCCCC Confidence 1122 23444433333333333222 2222 22 No 218 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=94.40 E-value=0.0045 Score=33.43 Aligned_cols=276 Identities=11% Similarity=0.061 Sum_probs=139.0 Q ss_pred cccccCCcccccchhhhHHHHHhhhhhhhHHhhcc-eeccc-CcceeeeeeccccceeccccccceeecCcccccccccc Q lcl|Aclame:pro 125 AGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLD-QQNAD-YNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLS 202 (419) Q Consensus 125 ~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 202 (419) ...+...-...+-+.+++.|...+-+...-..+++ +...+ +..+.||..... ...--.|-+...-.... T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~tiGs~---------~~~~~~E~~~~~~~~i~ 71 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTIGSV---------TLQEAEEDTPLIYNPIE 71 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEecccCce---------eeeccccCCCeeecccc Confidence 11122233455567778777665555433333344 33332 345666543221 11111233344434456 Q ss_pred eeeEEeeeEEEEEe-ehhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHHHHHh-cc----Ccccccceecccccccccc Q lcl|Aclame:pro 203 FDTITTTLKTVAHW-LPITRQAADDNS---QLMGYIQGRLTYGLRFLRDRQLLN-GN----GSTEMQGILTTPGIGTYQQ 273 (419) Q Consensus 203 ~~~v~~~~~k~~~~-~~vs~ell~d~~---~~~~~i~~~l~~a~~~~~d~~il~-G~----g~~~p~Gi~~~~~~~~~~~ 273 (419) .++|++....+++- ..||+.|-+|+- ++...+..+-++++....+..+|. |+ |...|.-+ .|....-+ T Consensus 72 TGEIt~~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~v---NG~PH~~V 148 (313) T protein:vir:95 72 TGEITFQITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNV---NGFPHVIV 148 (313) T ss_pred cceEEEEEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCccc---ccccceEE Confidence 68888988888775 489999999875 466777777788888888877774 32 12233322 23333333 Q ss_pred ccccccchhhhHHHHHHHHHHhhhhhccC--CcEEEEehHHHHHHHHHhc----cCCceeccCCc--cccC-CCcccccc Q lcl|Aclame:pro 274 PKPTAPATDEPPLVDIRRAKTVAEIAGFP--PDGVVVHPQDWESIELDQA----PGSGVFRVIAN--VQGE-ATPRIWGL 344 (419) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~kd----~~g~~~~~~~~--~~~~-~~~~l~G~ 344 (419) .+. +.....+.++..+-+....+..+ +-.+++.|.....|..+.. -...+.|+... .-++ ....+.|. T Consensus 149 ~~~---T~~~~~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~ 225 (313) T protein:vir:95 149 SAE---TNGVFALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGW 225 (313) T ss_pred ecc---CCceehhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhh Confidence 322 22333456666665555555443 3368999999999887642 11222233222 1111 12346788 Q ss_pred eeEecCCCCc---------CcEEEEeccceE-------EEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccc Q lcl|Aclame:pro 345 NVVSTVAIAQ---------GTALVGGFRQGA-------TLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKA 408 (419) Q Consensus 345 pv~~~~~~~~---------~~~~~~d~~~~~-------~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a 408 (419) .+.+|+-+.. +.++.|+..-.+ .++-|..+.-.-+. ..++-..+.++.+ +|++..+.+.+. T Consensus 226 Di~~SN~L~~AN~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~-~~~~~~~~~~~~~--~R~G~Gi~R~~~ 302 (313) T protein:vir:95 226 DILTSNRLHVANYNDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPKSEGE-RNKDRARDEHVVR--CRYGFGIQRLDT 302 (313) T ss_pred hhhhhhhhhhccccccccccCceeeeeeeeeecccccceeeeeccccccccc-cccccccccceee--eeecccceeecc Confidence 8888876632 233443322111 01111111111111 0112223444444 477777777665 Q ss_pred eEEEE-ecCCC Q lcl|Aclame:pro 409 FVRVT-FAAAT 418 (419) Q Consensus 409 ~~~~~-~~aa~ 418 (419) ++.+- -+++- T Consensus 303 L~~~~~~A~~~ 313 (313) T protein:vir:95 303 LGLLATSATAY 313 (313) T ss_pred eeEEEeccccC Confidence 55443 22222 No 219 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=93.35 E-value=0.0079 Score=32.11 Aligned_cols=305 Identities=10% Similarity=0.005 Sum_probs=133.0 Q ss_pred ccccccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhh Q lcl|Aclame:pro 71 TPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDL 150 (419) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~ 150 (419) ...............+... ....+++. +.. .........++..=-+.+.+.|..+.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~-e~~~KS~~--------------------tg~-g~~p~~q~~~gAlR~esL~~~i~~Lt~~ 58 (462) T protein:vir:96 1 MHKDTNLTAEQNKYADKFQ-EEVMKSYQ--------------------TGY-GITPDTQVDAGALRREILDDQITMLTWT 58 (462) T ss_pred Cccccccchhhhhhhchhh-HHHHHHHh--------------------cCC-CcCCccccccchhhhhhhhhhhheeeec Confidence 1111111111111111110 01111110 001 0011111112222224444444443333 Q ss_pred hhh--HHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHH-Hh-h Q lcl|Aclame:pro 151 PLL--VADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQA-AD-D 226 (419) Q Consensus 151 ~~~--l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~el-l~-d 226 (419) ... +..-+...++.+-.-+|..... ....+.+.+++|++..+.+++++.+.....|-++..-.+|-.. +. . T Consensus 59 ~~~~~~~~~i~k~~a~sTv~~y~~~~~-----~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~ 133 (462) T protein:vir:96 59 QDDLIFYREISRRPAQSTVQKYDVYLR-----HGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNN 133 (462) T ss_pred ccchhhhhhcCCchhhhhhhhheeeec-----cCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccc Confidence 222 2222333333332223322221 1123467899999999999999999999999999887777764 33 3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccce---eccccccccccccc-cccchhhhHHHHHHHHHHhhhhhccC Q lcl|Aclame:pro 227 NSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGI---LTTPGIGTYQQPKP-TAPATDEPPLVDIRRAKTVAEIAGFP 302 (419) Q Consensus 227 ~~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi---~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (419) ..+.+....++-...++..++.+.+.|+-.=.|.+. +...|+...-.... ....+..+..+.+..+-..+..++.+ T Consensus 134 ~~d~~~~~~~dai~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~aa~~i~~~fGt 213 (462) T protein:vir:96 134 IQDPMQILTEDAIAVVAKTIEWASFYGDASLTADPTGQGLEFDGLAKLIDKDNVIDAKGESLTETLLNRSAVLIGKSFGT 213 (462) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccccccchhhhhhhcCCCceeecCCCCccHHHHhhhhhhcccccCC Confidence 357889999999999999999999999854222111 23334322222221 12223334455555555666788888 Q ss_pred CcEEEEehHHHHHHHHHhccCCceeccCCccccCC------------------CcccccceeEecC------CCCcCcEE Q lcl|Aclame:pro 303 PDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEA------------------TPRIWGLNVVSTV------AIAQGTAL 358 (419) Q Consensus 303 ~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~------------------~~~l~G~pv~~~~------~~~~~~~~ 358 (419) ++-.+|+..+.+.|..-.-...+ .+.+++..... +.++++.|-+... .+|+-. T Consensus 214 ~TD~~~p~~v~a~f~~~~l~~qr-v~~~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~i~~~~~~~~p~ap~~~-- 290 (462) T protein:vir:96 214 ATDAYMPIGVHADFVNSVLGRQM-QLMQDNSGNVNAGYNVQGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPA-- 290 (462) T ss_pred hhheecchHHHHHHHHhhcCceE-EEEcCCCCceeeeeeccceeeeeeeeeeCCceecCcccccccccccCCCCCCCC-- Confidence 88899999999998854332222 23333322111 1112222222111 111100 Q ss_pred EEeccceEEEEEecceEEEEeecccchhh----cCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 359 VGGFRQGATLWSRQGITVLMTDSHADFFT----ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 359 ~~d~~~~~~~~~~~~~~i~~~~~~~~~~~----~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .++..+.......|. ..+..|++...-+..=--|..++-++++++.- T Consensus 291 --------------~vsaTv~t~~~g~f~~~~d~~~y~Y~V~avs~dgeS~PS~~VtaTva~~~~ 341 (462) T protein:vir:96 291 --------------TVKATVETGKKGLFTDEHDRAELTYKVVVNSDDAQSAPSEAVTATVNNATD 341 (462) T ss_pred --------------ceeEEEEeCCCCCCCCccCceeEEEEEEEECCCCccccceeeEeeeecccc Confidence 000000000000010 11222222222221111233333333322211 No 220 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=93.13 E-value=0.0087 Score=31.89 Aligned_cols=269 Identities=11% Similarity=0.016 Sum_probs=123.9 Q ss_pred HHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHH-HHhhhhhhhHHhhcceecccC Q lcl|Aclame:pro 87 RFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIV-PTTPDLPLLVADLLDQQNADY 165 (419) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i-~~~~~~~~~l~~~~~~~~~~~ 165 (419) ....... +.... .-+...+ ..........+.+|+..+... T Consensus 1 m~it~~~----------------l~~l~-----------------------~~~~~~~~~~y~~a~~~~~~~a~~~~sdf 41 (302) T protein:vir:10 1 MLINKQS----------------LNAAF-----------------------VAIKTIFNNAFAAAPTTWQKIAMEVPSNT 41 (302) T ss_pred CcccHHH----------------HHHHH-----------------------HHHHHHHHHHHHhhhhhhhceeeecCCCc Confidence 0000000 00000 0000000 011111234455566666444 Q ss_pred cceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHh-hHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 166 NVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAAD-DNSQLMGYIQGRLTYGLRF 244 (419) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~-d~~~~~~~i~~~l~~a~~~ 244 (419) ..-++........ =-.|++| .+-..++=...++..++++..+.||++.|. |.-.+..-+...|+++.++ T Consensus 42 ~~~~~~~lg~~p~-------l~e~~Ge---~~~~~l~~~~~~i~~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~ 111 (302) T protein:vir:10 42 SSNDYKWLSTFPK-------MRRWIGA---KVVKNLKAYKYVVENEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQ 111 (302) T ss_pred ceeeceecCCCCC-------ccccccc---eeeccccccceeEEeecccceecccHHhhcccccchhHHHHHHHHHHHHh Confidence 4444444332211 1234444 334445556678999999999999999875 5667888899999999999 Q ss_pred HHHHHHHh----ccCcc--cccceeccc---------cccccccccccccchhhhHHHHHHHHHHhhhhhcc-----CCc Q lcl|Aclame:pro 245 LRDRQLLN----GNGST--EMQGILTTP---------GIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGF-----PPD 304 (419) Q Consensus 245 ~~d~~il~----G~g~~--~p~Gi~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~ 304 (419) .+|+.++. |.+.. .-+-++... .+...... ..........+.....++........ .|. T Consensus 112 ~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~~~~~-~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~ 190 (302) T protein:vir:10 112 LPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGTAPLS-NASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPN 190 (302) T ss_pred hHHHHHHHHHhccCCCcccCCcceecccccccccccccccchhhh-hcccccchHHHHHHHHHHHHHhhhcccccccCCC Confidence 99987664 21110 111122111 00000000 01122334445555566655543333 456 Q ss_pred EEEEehHHHHHHHHHhccCCceeccCCccccCCCccccc-ceeEecCCCCcCcE-EEEeccceE---EEEEecceEEEEe Q lcl|Aclame:pro 305 GVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWG-LNVVSTVAIAQGTA-LVGGFRQGA---TLWSRQGITVLMT 379 (419) Q Consensus 305 ~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G-~pv~~~~~~~~~~~-~~~d~~~~~---~~~~~~~~~i~~~ 379 (419) .+|+.|.....-+.+-.+ +.. .. +....+.| ..+++++.+..++. ++.+....+ ++-.+++..+... T Consensus 191 ~LiVp~~le~~A~~ll~~-~~~--~~-----g~~Np~~g~~~~vv~p~L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~ 262 (302) T protein:vir:10 191 VLLVGPALEDVAKMLLTN-PKL--AD-----NTPNPYVGTAELVVDGRIESDTAWFLLDTTKPVKPFIFQPRKQPEFVSQ 262 (302) T ss_pred EEEecchhHHHHHHHhhc-ccc--CC-----CCcceeccceEEEEeeccCCCCceEEEecCCccceEEEcCccccEEEec Confidence 677777776666554322 111 11 11122223 46777777765543 333322222 2222333444432 Q ss_pred ecccchhhcCcEEEEEEEEecc------EEecccceEEEEecCC Q lcl|Aclame:pro 380 DSHADFFTANTLVILAEFRANL------AVYQPKAFVRVTFAAA 417 (419) Q Consensus 380 ~~~~~~~~~~~~~~r~~~r~d~------~~~~~~a~~~~~~~aa 417 (419) ..|..+.+.++....+++ ....++....-+-+++ T Consensus 263 ----~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g~~~ 302 (302) T protein:vir:10 263 ----VNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGSTGTGA 302 (302) T ss_pred ----cCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccCccCC Confidence 236667777666555553 2333333233333333 No 221 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=93.03 E-value=0.009 Score=31.79 Aligned_cols=302 Identities=10% Similarity=-0.021 Sum_probs=125.2 Q ss_pred ccccccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhh Q lcl|Aclame:pro 71 TPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDL 150 (419) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~ 150 (419) .++..++........+. ......+++ ...-.....+..+++..=-+.+.+.|..+... T Consensus 1 ~~~~~~~~~~~~~~~~~-~~e~~~Ks~---------------------~agy~~~p~~q~~~~AlR~EsL~~~i~~L~~~ 58 (468) T protein:vir:63 1 MPKNNKEEEVKEVNLNS-VQEDALKSF---------------------TTGYGITPDTQTDAGALRREFLDDQISMLTWT 58 (468) T ss_pred CCCCcchhhccccChhH-HHHHHHHHH---------------------HcCcccCCccccCcchhhhhhhhhhhheeeec Confidence 11111111100000000 000111111 00001111112222223233344444333332 Q ss_pred hhhH--HhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHH--Hhh Q lcl|Aclame:pro 151 PLLV--ADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQA--ADD 226 (419) Q Consensus 151 ~~~l--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~el--l~d 226 (419) ...+ ..-+...+..+-.-+|..... ....+.+.+++|++..+.+++++.+.....|-++....+|.-+ .++ T Consensus 59 ~~~f~~~~di~k~~a~stv~~y~~~~~-----~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~ 133 (468) T protein:vir:63 59 ENDLTFYKDIAKKPATSTVAKYDVYMQ-----HGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNN 133 (468) T ss_pred ccchhhhhhcccchhhhhhhhheeeec-----cCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcc Confidence 2222 111222233222222322221 1123467899999999999999999999999999988887764 233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCcc-----cccceecccccccccc-ccccccchhhhHHHHHHHHHHhhhhhc Q lcl|Aclame:pro 227 NSQLMGYIQGRLTYGLRFLRDRQLLNGNGST-----EMQGILTTPGIGTYQQ-PKPTAPATDEPPLVDIRRAKTVAEIAG 300 (419) Q Consensus 227 ~~~~~~~i~~~l~~a~~~~~d~~il~G~g~~-----~p~Gi~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 300 (419) -.+.+....++-...++..++.+.+.|+-.- .+.||- ..|+...-. ...-...+.....+++..+.......+ T Consensus 134 i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glq-fDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gf 212 (468) T protein:vir:63 134 IQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLE-FDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGY 212 (468) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCcccccc-ccceeEEecCCceeccCCCccCHHHHHHHhhhccccc Confidence 3578899999999999999999999998531 223331 112111111 111111222233444445555556678 Q ss_pred cCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCc-CcE------EEEeccceEEEEEecc Q lcl|Aclame:pro 301 FPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQ-GTA------LVGGFRQGATLWSRQG 373 (419) Q Consensus 301 ~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~-~~~------~~~d~~~~~~~~~~~~ 373 (419) ..+.-++|+..+.+.|....-... +.++.++. .....|+|| ...++. |.+ ++++... +-.+..+ T Consensus 213 G~~td~~~~~~v~a~~~~~~L~~q-~~v~~~n~----~~~~~G~~v--~g~~sa~G~I~l~gs~il~~~~~--l~~~~~~ 283 (468) T protein:vir:63 213 GTPTDAYMPVGVQADFVNQQLSKQ-TQLVRDNG----NNVSVGFNI--QGFHSARGFIKLHGSTVMENEQI--LDERILA 283 (468) T ss_pred cChhhhhcchhHHhhhhhhhcCce-EEEEcCCC----Cceeeeecc--cceecceeeeeecCceeeccccC--CCccccc Confidence 888888899888877743321111 12222221 223456666 222221 221 2222110 0000000 Q ss_pred eEEEEeecccchhhcCcEEEEEEEEeccEE--------ecccceEEEEecC--CCC Q lcl|Aclame:pro 374 ITVLMTDSHADFFTANTLVILAEFRANLAV--------YQPKAFVRVTFAA--ATT 419 (419) Q Consensus 374 ~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~--------~~~~a~~~~~~~a--a~~ 419 (419) .... ..-..+ -+....+.+- .+.-+|+.+.-.. ++| T Consensus 284 ~~~A--------psp~~v--saT~~~~~~g~~~~~~~a~y~Y~v~~vs~~GES~pS 329 (468) T protein:vir:63 284 LPTA--------PQPAKV--TATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIAS 329 (468) T ss_pred cccc--------ccCCcc--ceeeecccCCcccCCCcceEEEEEEEECCCCccccc Confidence 0000 000000 0111111111 1111222221111 111 No 222 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=92.90 E-value=0.0096 Score=31.66 Aligned_cols=308 Identities=8% Similarity=0.003 Sum_probs=123.1 Q ss_pred cchhhhhhHHH--HhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhhhH- Q lcl|Aclame:pro 78 AGTFRSLAQRF--ADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLV- 154 (419) Q Consensus 78 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l- 154 (419) ....|...... ...+. ++ +.+..-........+++..=-+.+.+.|..+.+....+ T Consensus 1 ~~~~~n~~~~~~~~~e~~----------------~K-----s~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~ 59 (464) T protein:vir:80 1 MTEKKNTERQLTSVQEEV----------------IK-----GFTTGYGITPESQTDAAALRREFLDDQITMLTWADGDLS 59 (464) T ss_pred CCcchhhHhhcCcccHHH----------------HH-----HHHhCCccCcccccCcchhhhhhhhhhhheeeecccchh Confidence 11111110000 00001 11 11111111111122233333344444444433332222 Q ss_pred -HhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhH--HHHhhHHHHH Q lcl|Aclame:pro 155 -ADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITR--QAADDNSQLM 231 (419) Q Consensus 155 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~--ell~d~~~~~ 231 (419) ..-+...++.+-.-+|..... ....+.+.+++|++..+.+++++.+.....+-+...-.+|- .+.+...+-+ T Consensus 60 f~~di~k~~a~STV~~y~~~~~-----~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~lvn~~~d~~ 134 (464) T protein:vir:80 60 FYRDITKRPATSTVAKYDVYLA-----HGRVGHTRFTREIGVAPISDPNLRQKTVNMKYVSDTKNMSIATGLVNNIEDPM 134 (464) T ss_pred hhhhcCCchhhhhhhhhheeec-----cCccccccccccccccccCCCceEEEEEEeeeeecceeeeeehhhhcchhhHH Confidence 222333333332223322221 11234678999999999999999999988776655433443 3445444666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCcc-----cccceeccccccccccc-cccccchhhhHHHHHHHHHHhhhhhccCCcE Q lcl|Aclame:pro 232 GYIQGRLTYGLRFLRDRQLLNGNGST-----EMQGILTTPGIGTYQQP-KPTAPATDEPPLVDIRRAKTVAEIAGFPPDG 305 (419) Q Consensus 232 ~~i~~~l~~a~~~~~d~~il~G~g~~-----~p~Gi~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (419) ....++-...++..++.+.+.|+-.= .+.|+- ..|+...-.. ......+..+..+.+..+-..+..+|.+++- T Consensus 135 ~~~~~dai~~va~tiE~a~FyGds~l~~~~~~~~gle-FDGl~~lI~~~NViDarG~~Ls~~~ln~Aa~~i~~~fGt~TD 213 (464) T protein:vir:80 135 RILTDDAISVVAKTIEWASFYGDSDLSENPDAGSGLE-FDGLAKLIDKHNVLDAKGASLTEALLNQASVLVGKGYGTPTD 213 (464) T ss_pred HHHHHHHHHHHHHHHHHHHhhhccccCCCCCCccccc-hhhhHhhcCCCceeecCCCCcCHHHHhhhhhhhhcccCChhh Confidence 67788889999999999999998531 122221 2232221111 1111222334455566666667778888888 Q ss_pred EEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeE--ec----------CCCCcCcEEEEeccceEE--EEEe Q lcl|Aclame:pro 306 VVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVV--ST----------VAIAQGTALVGGFRQGAT--LWSR 371 (419) Q Consensus 306 ~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~--~~----------~~~~~~~~~~~d~~~~~~--~~~~ 371 (419) .+|+..+.+.+....-.. ++.+..++... ...|+||- .+ ..|.... ..|.+.... .... T Consensus 214 ~~lp~~v~a~f~n~~l~~-q~~~~~~n~~~----~~~G~~v~~f~sa~G~i~L~~s~~m~~~~--~ld~~~~~~~~apaa 286 (464) T protein:vir:80 214 AYMPIGVQADFVNQQLDR-QVQVISDNGQN----ATMGFNVKGFNSARGFIRLHGSTVMELEQ--ILDENRMQLPNAPQK 286 (464) T ss_pred cccchhHHHHHHhhhcCc-eeEEEcCCCCc----ceeeeecccccccccceeccCccccCccc--ccccccccCCCCcCC Confidence 999999988864322111 11122211111 12233321 00 0000000 000000000 0000 Q ss_pred cceEEEEeecccchhh-cC---cEEEEEEEE-----------eccEEecccceEEEEecCCCC Q lcl|Aclame:pro 372 QGITVLMTDSHADFFT-AN---TLVILAEFR-----------ANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 372 ~~~~i~~~~~~~~~~~-~~---~~~~r~~~r-----------~d~~~~~~~a~~~~~~~aa~~ 419 (419) ..++..++......|. .+ ...|++... .+..+.....-+.++++..+. T Consensus 287 psvt~tv~~~~~g~f~~~~~~~~~~Ykv~~vn~~GeS~ps~~~~~ti~~~~~~V~l~it~~~~ 349 (464) T protein:vir:80 287 ATVKATLEAGTKGKFRDEDLTIDTEYKVVVVSDDAESAPSDVASVVIDDKKKQVKLEITINNM 349 (464) T ss_pred ceeEEEecCCcccCCccccccceeEEEEEEECCCCccccceeeeeeecCcccEEEEEEEeCCc Confidence 0011111111000111 11 111221111 111222222233333332111 No 223 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=91.93 E-value=0.014 Score=30.82 Aligned_cols=301 Identities=9% Similarity=-0.015 Sum_probs=125.4 Q ss_pred ccccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhhh Q lcl|Aclame:pro 73 LTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPL 152 (419) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~ 152 (419) ..........+.......+...|++ ...-.....+..+++..=-+.+.+.|..+..... T Consensus 1 ~~~~~~~~~~~~n~~~~~e~~~Ks~---------------------~agy~~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~ 59 (467) T protein:vir:80 1 MPKNNKEEVKEVNLNSVQEDALKSF---------------------TTGYGITPDTQTDAGALRREFLDDQISMLTWTEN 59 (467) T ss_pred CCCcchhhhhhcccccCHHHHHHHH---------------------HcccccCCccccCcchhhhhhhhhhhheeecccc Confidence 1000011100000000001111111 0000111111222333333444444443333332 Q ss_pred hHH--hhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHH--HhhHH Q lcl|Aclame:pro 153 LVA--DLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQA--ADDNS 228 (419) Q Consensus 153 ~l~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~el--l~d~~ 228 (419) .+- .-+...+..+-.-+|..... ....+.+.+++|++..+.+++++.+.....|-++....+|.-+ .++-. T Consensus 60 ~f~~~~di~k~~a~stv~~y~~~~~-----~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~ 134 (467) T protein:vir:80 60 DLTFYKDIAKKPATSTVAKYDVYMQ-----HGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQ 134 (467) T ss_pred chhhhhhcccchhhhhhhhheeeec-----cCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchh Confidence 221 11222233222222322221 1123467899999999999999999999999999988877764 23335 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCcc-----cccceecccccccccc-ccccccchhhhHHHHHHHHHHhhhhhccC Q lcl|Aclame:pro 229 QLMGYIQGRLTYGLRFLRDRQLLNGNGST-----EMQGILTTPGIGTYQQ-PKPTAPATDEPPLVDIRRAKTVAEIAGFP 302 (419) Q Consensus 229 ~~~~~i~~~l~~a~~~~~d~~il~G~g~~-----~p~Gi~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (419) +.+....++-...++..++.+.+.|+-.- .+.||- ..|+...-. ...-...+.....+++..+.......+.. T Consensus 135 d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glq-fDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~ 213 (467) T protein:vir:80 135 DPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLE-FDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGT 213 (467) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCcccccc-ccceeEEecCCceeccCCCccCHHHHHHHhhhccccccC Confidence 78899999999999999999999998531 223331 112111111 11111122223344444555555667888 Q ss_pred CcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCc-CcE------EEEeccceEEEEEecceE Q lcl|Aclame:pro 303 PDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQ-GTA------LVGGFRQGATLWSRQGIT 375 (419) Q Consensus 303 ~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~-~~~------~~~d~~~~~~~~~~~~~~ 375 (419) +.-++|+..+.+.|....-... +.++.++. .....|+|| ...++. |.+ ++++... +-.+..+.. T Consensus 214 ~td~~~p~~v~a~~~~~~L~~q-~~v~~~n~----~~~~~G~~v--~g~~sa~G~I~l~gs~il~~~~~--l~~~~~~~~ 284 (467) T protein:vir:80 214 PTDAYMPVGVQADFVNQQLSKQ-TQLVRDNG----NNVSVGFNI--QGFHSARGFIKLHGSTVMENEQI--LDERILALP 284 (467) T ss_pred hhhhhcchhHHhhhhhhhcCce-EEEEcCCC----Cceeeeecc--cceecceeeeeecCceeeccccC--CCccccccc Confidence 8888899888877743321111 12222221 223456666 222221 221 2222110 000000000 Q ss_pred EEEeecccchhhcCcEEEEEEEEeccEE--------ecccceEEEEecC--CCC Q lcl|Aclame:pro 376 VLMTDSHADFFTANTLVILAEFRANLAV--------YQPKAFVRVTFAA--ATT 419 (419) Q Consensus 376 i~~~~~~~~~~~~~~~~~r~~~r~d~~~--------~~~~a~~~~~~~a--a~~ 419 (419) .. ..-..+ -+....+.+- .+.-+|+.+.-.. ++| T Consensus 285 ~A--------psp~~v--saT~~~~~~g~~~~~~~a~y~Y~v~~vs~~GES~pS 328 (467) T protein:vir:80 285 TA--------PQPAKV--TATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIAS 328 (467) T ss_pred cc--------ccCCcc--ceeeecccCCcccCCCcceEEEEEEEECCCCccccc Confidence 00 000000 0111111111 1111222221111 111 No 224 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=91.20 E-value=0.01 Score=31.45 Aligned_cols=313 Identities=10% Similarity=0.017 Sum_probs=129.5 Q ss_pred hhhHHHHhHHHHHHHHHhhhhhhhhH----HHHHHHHHHhhhcccccccccC-----CcccccchhhhHHHHHhhhhhhh Q lcl|Aclame:pro 83 SLAQRFADSDGLREYRARDKRGQFQV----EMRDIDPNRLLSRDAPAGTITN-----PNVPHLPQLVPGIVPTTPDLPLL 153 (419) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~p~~~~~~i~~~~~~~~~ 153 (419) .+.+....+--.+.|....+.-.+.. .++..........+.+.|..+. +++..=-+...+.+..+.+.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~a~t~gy~~~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ 80 (514) T protein:vir:10 1 MYTQDKTKDIMKKSFFGGDRAVAFDTNKEDILNENLPENVKKSAFTAGHSITPDTQTDGAANRIESLNRDLKVTTWGERD 80 (514) T ss_pred CCccchhhHHHhhhhcccceeeeecCcHHHHHHHhcchhhhhhhhccccccCCccccCccchhhhhhccceeEeeecCcc Confidence 00000000111111211111111111 1111111111112222222221 22222222222222222222111 Q ss_pred --HHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHH-HhhH-HH Q lcl|Aclame:pro 154 --VADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQA-ADDN-SQ 229 (419) Q Consensus 154 --l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~el-l~d~-~~ 229 (419) +..-+...++.+-.-.|.... .....+.+.+++|++-.+..++.+....+..+-++....+|.-+ +.++ .+ T Consensus 81 ftf~~~i~k~~a~STV~ey~~~~-----~~G~~G~~~f~~E~gi~~~~d~~~~rk~~~~k~l~~~~~vS~~~~l~n~i~d 155 (514) T protein:vir:10 81 FTLYNDIAKQPVDNTVLKYTQYY-----SHGRTGHSLFQPEIGIGDVNNPNERQRTINIKYIVDTHVTSIALQRANTIVD 155 (514) T ss_pred hhhhhhcCCchhhHHHhhhhhhc-----ccCcccccccccccccCcCCCcceEEEEEeeeeeeeeeeeeehhhhccchhh Confidence 111122223222222222111 11223467899999999999999999999999998887777665 3333 47 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCc---c------cccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhc Q lcl|Aclame:pro 230 LMGYIQGRLTYGLRFLRDRQLLNGNGS---T------EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAG 300 (419) Q Consensus 230 ~~~~i~~~l~~a~~~~~d~~il~G~g~---~------~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 300 (419) .+....++-...++..++.+.++|+-. + +.-||.+.... .......+..+..+.+..+--.+...+ T Consensus 156 ~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~lI~~-----~NvIDarG~~Ls~~~ln~aA~~i~~gf 230 (514) T protein:vir:10 156 SLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFKLIAP-----ENHIDLRGGRLSPAALNMAARKIGEGF 230 (514) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHHhhcC-----CCeEecCCCCccHHHHhhhhhhhhccc Confidence 888888999999999999999998742 1 22344433221 111112223344455555555566678 Q ss_pred cCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeE--ecCCCCcCcEEEEeccceEEEEEecceEEEE Q lcl|Aclame:pro 301 FPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVV--STVAIAQGTALVGGFRQGATLWSRQGITVLM 378 (419) Q Consensus 301 ~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~--~~~~~~~~~~~~~d~~~~~~~~~~~~~~i~~ 378 (419) .+++-.+|+..+.+.+..-.....+. +++.++.+ ...|.||- .+.. |.+-+ . .-.++++ ...+.. T Consensus 231 Gt~TD~ylp~~vka~f~~~~~~~qRV-~~~~n~~~----~~~G~~v~~f~s~~---G~I~L-~---gs~im~~-~n~L~~ 297 (514) T protein:vir:10 231 GTPTDAYMPIGIKADFVNQHLNGQRV-MLPGQTGG----MTTGLDIDKFLSAH---GSIRI-Q---GSTIMDS-DNKLDF 297 (514) T ss_pred CChhheeCchHHHHHHhhcccCcceE-EeecCccc----eeeeeeccceeEec---cceee-c---CCeeecc-cccCcc Confidence 88888899999988777554443332 22222221 12344331 1110 11100 0 0011111 111111 Q ss_pred eecccc-hhhcCcEEEEEEEEeccEEec--------------ccceEE--EEecCCCC Q lcl|Aclame:pro 379 TDSHAD-FFTANTLVILAEFRANLAVYQ--------------PKAFVR--VTFAAATT 419 (419) Q Consensus 379 ~~~~~~-~~~~~~~~~r~~~r~d~~~~~--------------~~a~~~--~~~~aa~~ 419 (419) ...... .-.-+.+..-..- ..+...+ .++-+. .++.+... T Consensus 298 ~~~~~~~Ap~~~~va~svT~-~~~g~~~~ad~t~~~g~~~~~~~~g~~~sYaVv~~n~ 354 (514) T protein:vir:10 298 DRPVSPTAPTAPQLSATVTP-DGGGLWHEADKTDSKGEVILNKEVGVEQSYVAVMVSR 354 (514) T ss_pred CCccCCcCCCCCcceEEEec-CcccccCcccccccccccccccccceeEEEEEEEECC Confidence 110000 0011222222211 1122222 222221 23333333 No 225 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=90.93 E-value=0.018 Score=30.11 Aligned_cols=281 Identities=14% Similarity=0.083 Sum_probs=118.4 Q ss_pred ccCCcccccchhhhHHHHHhhhhhh-hHH-hhcceecccCcceeeeeeccccceeccccccceeecCccccccc-cccee Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPL-LVA-DLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFD 204 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~-~l~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~ 204 (419) +..-..+.-+..+...|..+..... .+. .+++..++.+-.+.+....... ...+.+++.+...+-. ...+. T Consensus 1 M~~i~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~------~~~a~~v~~~~~~~~~~r~~~~ 74 (348) T protein:vir:27 1 MGLIYDKVTASNIAGYFNALQENVSSTLGESIFPARKQLGTKLSYIKGASGQ------SVALKAAAFDTNVTIRDRVSAE 74 (348) T ss_pred CcchhhhcCHHHHHHHHHhccchhhhhhHhhcCCCccccceeEEEEeeccCc------eeEeeeecCCCCcceeccccee Confidence 1111112222333333333332221 222 3455444444333332221111 1123455555443332 23456 Q ss_pred eEEeeeEEEEEeehhhHHHH------hh--HHH----HHHHH---HHHHHHHHHHHHHHHHH----hcc----Ccccccc Q lcl|Aclame:pro 205 TITTTLKTVAHWLPITRQAA------DD--NSQ----LMGYI---QGRLTYGLRFLRDRQLL----NGN----GSTEMQG 261 (419) Q Consensus 205 ~v~~~~~k~~~~~~vs~ell------~d--~~~----~~~~i---~~~l~~a~~~~~d~~il----~G~----g~~~p~G 261 (419) ..++.+-.++-...++..-+ .+ ++. +...| ..++.+++.+.+|..+. +|. |.+.... T Consensus 75 ~~~~~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~~~~~ 154 (348) T protein:vir:27 75 MHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKD 154 (348) T ss_pred eeeeecCccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCCeeEE Confidence 66666666665555553321 11 111 22222 23344555555555333 331 1111111 Q ss_pred ee-ccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHH---HhccCCcee----ccCCcc Q lcl|Aclame:pro 262 IL-TTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIEL---DQAPGSGVF----RVIANV 333 (419) Q Consensus 262 i~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~---~kd~~g~~~----~~~~~~ 333 (419) +- ..+.....+....... .+.+.+.|+.+.+..+...+..+...+|++.+|..|++ +++.-.... .+.... T Consensus 155 vdfg~~~~~~~t~~~~W~~-~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~ 233 (348) T protein:vir:27 155 IDYGVKPDHKKQVSKSWAE-PGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSAVTKAE 233 (348) T ss_pred EeecCCcccceeeeeccCC-CCCCHHHHHHHHHHHHHhcCCcccEEEECHHHHHHHhcCHHHHHHhcccCccccccCHHH Confidence 10 0011111112222222 34567788888877777777788889999999999875 332211110 010000 Q ss_pred ccCCCcccccceeEec------------CCCCcCcEEEEeccc-eEEEEEecceEEE-------------Eeeccc---- Q lcl|Aclame:pro 334 QGEATPRIWGLNVVST------------VAIAQGTALVGGFRQ-GATLWSRQGITVL-------------MTDSHA---- 383 (419) Q Consensus 334 ~~~~~~~l~G~pv~~~------------~~~~~~~~~~~d~~~-~~~~~~~~~~~i~-------------~~~~~~---- 383 (419) ....-+++.|.+|++- ..+|++.++++.... +...+ |...+ +..... T Consensus 234 ~~~~~~~~~g~~i~~yd~~y~d~~G~~~~~~p~~~vvl~~~~~~G~~~y---G~~~e~~~~~~~~~~~~~~~~~~~~~~~ 310 (348) T protein:vir:27 234 LENYIADNFGVSIVLENGTYRNDKGEVSKFYPDGHLTLIPNGPLGNTVF---GTTPEESDLFADNTVNAEVEIVDNGIAV 310 (348) T ss_pred HHHHHHhhcCceEEEEeeEEEcCCCcCcccccCCeEEEEcCCcceeEEe---ccCcchhhhhhccccccceeeeCCeeEE Confidence 0001113456666542 234666666654332 22211 11100 000000 Q ss_pred chhh-cC--cEEEEEEEEeccEEecccceEEEEecCCC Q lcl|Aclame:pro 384 DFFT-AN--TLVILAEFRANLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 384 ~~~~-~~--~~~~r~~~r~d~~~~~~~a~~~~~~~aa~ 418 (419) ..|. .| .....+..+.=-.+.+|+++.++++-+++ T Consensus 311 ~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:27 311 TTTKTTDPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred EeeecCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 0011 11 23333444444456779999999999999 No 226 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=89.93 E-value=0.024 Score=29.50 Aligned_cols=281 Identities=14% Similarity=0.092 Sum_probs=119.5 Q ss_pred ccCCcccccchhhhHHHHHhhhhh-hhH-HhhcceecccCcceeeeeeccccceeccccccceeecCcccccccc-ccee Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLP-LLV-ADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST-LSFD 204 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~-~~l-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~-~~~~ 204 (419) +..--.+.-+..+...|....... ..+ ..+++..++.+-.+.+....... ...+.+++.+...+... ..+. T Consensus 1 M~~i~d~f~~~~l~~~i~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~------~~~a~~v~~~~~~~~~~r~~~~ 74 (348) T protein:vir:96 1 MGLIYDKVTASNIAGYFNTLQENVDSTLGESIFPARKQLGTKLSYIKGASGQ------SVALKAAAFDTNVTIRDRVSAE 74 (348) T ss_pred CcchhhccCHHHHHHHHHhcccchhhhhhhhcCCCccccceeEEEEeecCCc------eeEeeeecCCCCcceeccccee Confidence 111111222233333333333222 122 24456555544333332221110 11245666665444332 4566 Q ss_pred eEEeeeEEEEEeehhhHHHH------hhH---H---HHHHHHH---HHHHHHHHHHHHH----HHHhcc----Ccccccc Q lcl|Aclame:pro 205 TITTTLKTVAHWLPITRQAA------DDN---S---QLMGYIQ---GRLTYGLRFLRDR----QLLNGN----GSTEMQG 261 (419) Q Consensus 205 ~v~~~~~k~~~~~~vs~ell------~d~---~---~~~~~i~---~~l~~a~~~~~d~----~il~G~----g~~~p~G 261 (419) ...+.+-.++-...++..-+ .++ + .+...+. ..+.+++.+.+|. ++.+|. |.+.... T Consensus 75 ~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~~~~~~~~ 154 (348) T protein:vir:96 75 IHDEQMPFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFTSDGVNKD 154 (348) T ss_pred eeeeecCccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEeecCCeeEE Confidence 66666666665555543211 111 1 1223332 2344455555554 333332 1111111 Q ss_pred ee-ccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHH---Hhcc----CCceeccCCcc Q lcl|Aclame:pro 262 IL-TTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIEL---DQAP----GSGVFRVIANV 333 (419) Q Consensus 262 i~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~---~kd~----~g~~~~~~~~~ 333 (419) +- ..+.....+....... .+.+.+.|+..++..+...+..+...+|++.+|..|+. +++. ++....+.... T Consensus 155 vdfg~~~~~~~t~~~~W~~-~~adp~~di~~~~~~~~~~G~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~ 233 (348) T protein:vir:96 155 IDYGVKADHKKQVSKSWAE-PGATPLADLEDAIETARELGLNPERAIMNAKTFGLIRKAASTVKAIKPLAGDGSSVTKAE 233 (348) T ss_pred EeccCCcccceeeccccCC-CCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHHhccCCccccccHHH Confidence 10 0011111122222332 34567788888777777777788889999999999864 3321 11111111010 Q ss_pred ccCCCcccccceeEec------------CCCCcCcEEEEeccc-eEEEEEecceEEE-------------Eeecc----c Q lcl|Aclame:pro 334 QGEATPRIWGLNVVST------------VAIAQGTALVGGFRQ-GATLWSRQGITVL-------------MTDSH----A 383 (419) Q Consensus 334 ~~~~~~~l~G~pv~~~------------~~~~~~~~~~~d~~~-~~~~~~~~~~~i~-------------~~~~~----~ 383 (419) ....-.+..|+++++- ..+|++.++++.... +...+ +...+ ..... - T Consensus 234 ~~~~~~~~~g~~i~~y~~~y~d~~G~~~~~~p~~~v~l~~~~~~G~~~y---g~~~e~~~~~~~~~~~~~~~~~~~~~~~ 310 (348) T protein:vir:96 234 LQNYVADNYGVEIVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVF---GTTPEESDLFADNTVNADVEIVDSGIAV 310 (348) T ss_pred HHHHHhhhcCceEEEEccEEEecCCcEeccccCCeEEEEcCCCceeEEe---ccChhhhhhhhcccccccceecCCeeEE Confidence 0011123456666542 234666666654322 11111 11000 00000 0 Q ss_pred chh-hcC--cEEEEEEEEeccEEecccceEEEEecCCC Q lcl|Aclame:pro 384 DFF-TAN--TLVILAEFRANLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 384 ~~~-~~~--~~~~r~~~r~d~~~~~~~a~~~~~~~aa~ 418 (419) ..| +.| ...+.+..+.=-.+.+|+++.++++-+++ T Consensus 311 ~~~~~~dP~~~~~~~~s~plPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:96 311 TTTKTTDPVNVQTKVSMVALPSFERLGDVYMLTVIPGV 348 (348) T ss_pred EeeecCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 001 111 22333444444456779999999999999 No 227 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=89.61 E-value=0.025 Score=29.33 Aligned_cols=284 Identities=13% Similarity=0.061 Sum_probs=114.6 Q ss_pred ccCCcccccchhhhHHHHHhhhhh-hhH-HhhcceecccCcceeeeeeccccceeccccccceeecCccccccc-cccee Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLP-LLV-ADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQS-TLSFD 204 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~-~~l-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~ 204 (419) +..--.+.-+..+...|..+.... ..+ ..+++..++.............. ...+.+++.+...+-. ...+. T Consensus 1 M~~l~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~------~~~a~~v~~~~~~~~~~r~~~~ 74 (348) T protein:vir:49 1 MGLIYDKVTASNIAGYFNALQENVDSTLGESIFPARKQLGTKLSYITGASGQ------SVALKAAAFDTNVTVRDRVSAE 74 (348) T ss_pred CcchhhhcCHHHHHHHHHhccccchhhhHhhcCCCccccCceeEEEEeecCc------eeeeeeecCCCCcceeccccee Confidence 100011111222223332222111 112 23345444433333222211111 1134455555443332 24456 Q ss_pred eEEeeeEEEEEeehhhHHHH------hh--HHH----HHHHHH---HHHHHHHHHHHHHHHH----hcc----Ccccccc Q lcl|Aclame:pro 205 TITTTLKTVAHWLPITRQAA------DD--NSQ----LMGYIQ---GRLTYGLRFLRDRQLL----NGN----GSTEMQG 261 (419) Q Consensus 205 ~v~~~~~k~~~~~~vs~ell------~d--~~~----~~~~i~---~~l~~a~~~~~d~~il----~G~----g~~~p~G 261 (419) ...+.+-.++-...++..-+ .+ ++. +...|. ..+.+++.+.+|.... +|. |.+.... T Consensus 75 ~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~i~~~g~~~~ 154 (348) T protein:vir:49 75 MHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKD 154 (348) T ss_pred eeeeecCccccccccCHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCCceEE Confidence 66666666655555543321 11 111 222222 2334455555555333 331 1111111 Q ss_pred e-eccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHH---Hhcc----CCceeccCCcc Q lcl|Aclame:pro 262 I-LTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIEL---DQAP----GSGVFRVIANV 333 (419) Q Consensus 262 i-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~---~kd~----~g~~~~~~~~~ 333 (419) + +..+.-...+....... .+.+.+.|+.+.+..+...+..+...+|++.+|..|+. +++. ++....+.... T Consensus 155 vdyg~~~~~~~t~~~~W~~-~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~ 233 (348) T protein:vir:49 155 IDYGVKPDHKKQVSKSWAE-PGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSSVTKAE 233 (348) T ss_pred EeecCCcccceeeeeccCC-CCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHhhccCcccccccHHH Confidence 1 00111111112222222 34567888888877777777788899999999999864 2211 11111111000 Q ss_pred ccCCCcccccceeEec------------CCCCcCcEEEEeccc---eEEEEEecce----------EEEEee--cccchh Q lcl|Aclame:pro 334 QGEATPRIWGLNVVST------------VAIAQGTALVGGFRQ---GATLWSRQGI----------TVLMTD--SHADFF 386 (419) Q Consensus 334 ~~~~~~~l~G~pv~~~------------~~~~~~~~~~~d~~~---~~~~~~~~~~----------~i~~~~--~~~~~~ 386 (419) ....-..+.|.+|++- ..+|++.++++.... .+++...... .+.... ..-..| T Consensus 234 ~~~~~~~~~g~~i~~y~~~y~d~dG~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (348) T protein:vir:49 234 LDNYIADNFGVTVVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDNGIAVTTT 313 (348) T ss_pred HHHHHHhhcCceEEEEeeEEEecCCcEeeeecCCeEEEecCCCcceeEEecChhhhhhccccccccceeecCCeEEEeee Confidence 0011123446666542 234566666654322 1111100000 000000 000011 Q ss_pred hc-C--cEEEEEEEEeccEEecccceEEEEecCCC Q lcl|Aclame:pro 387 TA-N--TLVILAEFRANLAVYQPKAFVRVTFAAAT 418 (419) Q Consensus 387 ~~-~--~~~~r~~~r~d~~~~~~~a~~~~~~~aa~ 418 (419) .+ | .....+....=-.+.+|+++.++++-+++ T Consensus 314 ~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:49 314 KTTDPVNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred ecCCCceEEEEEeeeccccccCCCcEEEEEEecCC Confidence 11 1 22333344444456789999999999999 No 228 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=88.40 E-value=0.033 Score=28.74 Aligned_cols=264 Identities=9% Similarity=-0.012 Sum_probs=120.6 Q ss_pred ccCCcccccchhhhHHHHHhhhhhhhHHhhcce------ecccCcceeeeeeccccceeccccccceeecCccccccccc Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ------QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTL 201 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 201 (419) +.. .-..+.+...+.+.....+....++.. .-.+++.++||+.......-+. ..+..|..+ ..+. T Consensus 1 MA~---~n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~-R~~~g~~~g-----~~~~ 71 (299) T protein:vir:79 1 MAA---LNYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSN-RDTIAVAQR-----NYDN 71 (299) T ss_pred Ccc---chhHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccc-cCCCccccc-----ccCc Confidence 211 112367777777766665554444322 1234678999987643221111 111122111 1233 Q ss_pred ceeeEEeeeEEEEEeehhhHHH-HhhH-H--HHHHHHHHHHHHHHHHHHHHHHHhc--cCcccccceecccccccccccc Q lcl|Aclame:pro 202 SFDTITTTLKTVAHWLPITRQA-ADDN-S--QLMGYIQGRLTYGLRFLRDRQLLNG--NGSTEMQGILTTPGIGTYQQPK 275 (419) Q Consensus 202 ~~~~v~~~~~k~~~~~~vs~el-l~d~-~--~~~~~i~~~l~~a~~~~~d~~il~G--~g~~~p~Gi~~~~~~~~~~~~~ 275 (419) ++...+++-.+.-.+. |. .+ .+.+ . .+...+.+.....++-.+|...+.. ++.. ...... T Consensus 72 ~~~t~~ldqdr~~~f~-vD-~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~------------~~g~~~ 137 (299) T protein:vir:79 72 AWEPKVLTNQRKWSTL-VH-PADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWT------------ALGNTA 137 (299) T ss_pred ceeEEEeeccccceec-cc-hhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhh------------hcCCcc Confidence 4555555555554432 11 11 1111 1 2334444445555666667765542 1110 000111 Q ss_pred ccccchhhhHHHHHHHHHHhhhhhccCC--cEEEEehHHHHHHHHHhcc--CCceeccCCccccCCCcccccceeEe--c Q lcl|Aclame:pro 276 PTAPATDEPPLVDIRRAKTVAEIAGFPP--DGVVVHPQDWESIELDQAP--GSGVFRVIANVQGEATPRIWGLNVVS--T 349 (419) Q Consensus 276 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~kd~--~g~~~~~~~~~~~~~~~~l~G~pv~~--~ 349 (419) .....+....|+.+.+++..+.....+. -..+++|..+..|.+...- .... .......++..+.|.|+||+. + T Consensus 138 ~~~~~T~~n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~-~~~~~~~~g~Vg~idG~~Ii~Vps 216 (299) T protein:vir:79 138 DTTVLTTTNVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNI-KDAGTSLNRQTTDIDTVKIIKVPS 216 (299) T ss_pred cccccCHHHHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhccccc-ccccceeeeeeeeecceEEEEech Confidence 1222344567899999999998877654 3457899998887753210 1110 111123345667899999986 3 Q ss_pred CCCCcC----------------cEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecc-cceEEE Q lcl|Aclame:pro 350 VAIAQG----------------TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQP-KAFVRV 412 (419) Q Consensus 350 ~~~~~~----------------~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~-~a~~~~ 412 (419) +.|... ..+++.. .+..-..+ --.+.+..... .+++...+.-..++|.=+.+. ..-+.+ T Consensus 217 ~r~~t~~~~~~G~~~~~~ak~in~ii~~~-~a~~~~~K-~~~~~~~~P~~--~~~~~~~~~~r~y~d~~v~~nk~~~i~~ 292 (299) T protein:vir:79 217 NLMKTAYDFTTGWKVGAGAKQIFMSLVHP-SAIITPVS-YQFSKLDEPTA--VTEGKYFYFEESFEDVFILNKKADAIQF 292 (299) T ss_pred hhcCccceeccCccccCcccccceEEEcC-CeeeeeEe-eeeEEeecCCC--CCccceeeeeeeeeeeeeeccccCeEEE Confidence 334321 1222222 22221111 11222222211 223223445566677666653 333344 Q ss_pred EecCCCC Q lcl|Aclame:pro 413 TFAAATT 419 (419) Q Consensus 413 ~~~aa~~ 419 (419) .+.+|=. T Consensus 293 ~~~~a~~ 299 (299) T protein:vir:79 293 VVEGAGA 299 (299) T ss_pred EeeecCC Confidence 4444444 No 229 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=87.12 E-value=0.041 Score=28.20 Aligned_cols=351 Identities=12% Similarity=0.069 Sum_probs=134.0 Q ss_pred HHHHHHHHHHHHhhccc--ccccccchhhhhhHHHHhHHHHHHHHHhh-hhhhhhHHHHHHHHHHhhhcc---------- Q lcl|Aclame:pro 56 LRTAPPAPKGPADGGTP--LTPAEAGTFRSLAQRFADSDGLREYRARD-KRGQFQVEMRDIDPNRLLSRD---------- 122 (419) Q Consensus 56 l~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~---------- 122 (419) +...+.-.++...-.+. ...-.....+.+...+.+... +.+.... .+.... .+.+ .....+.. T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~llenq~-~~~~~~~~~~~~~~--~~~~-~~~l~ea~~~~~~~~~~~ 76 (528) T protein:vir:80 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQE-ADFAVDPIYKDEKV--VEAF-GGFIAEAEVAGDHGYDAS 76 (528) T ss_pred CcchHHHHHhhhHhhcCCccchhcchhhhhhhhhhhhhhh-HHhhccccccchHH--HHhh-hhhccccccccccCCccc Confidence 11111111111111111 110001111111111111100 0000000 000000 0000 01111110 Q ss_pred -cccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccce---------------------- Q lcl|Aclame:pro 123 -APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAG---------------------- 179 (419) Q Consensus 123 -~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---------------------- 179 (419) ...+..+......-|.+++ +.++.-......+++.+-||.++..-|.-+...... T Consensus 77 ~i~es~~t~~v~~~~P~Li~--lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS 154 (528) T protein:vir:80 77 QIAAGQTTGAITNVGPAVIG--MVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFHS 154 (528) T ss_pred cccccccccccccCCchhhh--HHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCccccccccccccccccccccc Confidence 0011111111122232222 122223345556889999998774322211100000 Q ss_pred --------------------------------------------------------ec-------------------ccc Q lcl|Aclame:pro 180 --------------------------------------------------------AG-------------------STW 184 (419) Q Consensus 180 --------------------------------------------------------~~-------------------~~~ 184 (419) .. ... T Consensus 155 ~~~t~~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~ 234 (528) T protein:vir:80 155 SLAAKGAAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAF 234 (528) T ss_pred cccccccccccccccccccccccccccccceeccccccccccccccccccccCccccCCccccccccccccccccccccc Confidence 00 000 Q ss_pred ccceeecC---------cccccccccceeeEEeeeEEEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 185 NKAAVVPE---------GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-----QLMGYIQGRLTYGLRFLRDRQL 250 (419) Q Consensus 185 ~~a~~v~E---------g~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-----~~~~~i~~~l~~a~~~~~d~~i 250 (419) +-+.-.+| +...++...++++++..++.-+-....|-||.+|-- |.++.|.+-|+..|...||+.| T Consensus 235 Gm~Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINRei 314 (528) T protein:vir:80 235 GMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREI 314 (528) T ss_pred ccchhhhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHH Confidence 00011222 123667777788888888888888999999999852 5789999999999999999999 Q ss_pred Hhc---cCccccccee----ccccccccccccc-----cccchhhhHHHHHHHHHHhhh--hhccCCcEEEEehHHHHHH Q lcl|Aclame:pro 251 LNG---NGSTEMQGIL----TTPGIGTYQQPKP-----TAPATDEPPLVDIRRAKTVAE--IAGFPPDGVVVHPQDWESI 316 (419) Q Consensus 251 l~G---~g~~~p~Gi~----~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~l 316 (419) |.= .-.-+-+|+. ...|+........ ........++-.+-.+.+.+. ..+...+.+++|+.+...| T Consensus 315 i~~i~~~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L 394 (528) T protein:vir:80 315 VDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNIL 394 (528) T ss_pred HhhhhheeeeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHH Confidence 631 1111112221 1112211111100 001111112222223333333 3334556788999999888 Q ss_pred HHHh-----ccCCceeccCCccccC-CCcccc-cceeEecCCCCcCcEEEEeccce-----EEEEEecceEEEEeecccc Q lcl|Aclame:pro 317 ELDQ-----APGSGVFRVIANVQGE-ATPRIW-GLNVVSTVAIAQGTALVGGFRQG-----ATLWSRQGITVLMTDSHAD 384 (419) Q Consensus 317 ~~~k-----d~~g~~~~~~~~~~~~-~~~~l~-G~pv~~~~~~~~~~~~~~d~~~~-----~~~~~~~~~~i~~~~~~~~ 384 (419) ...- ...|....+..+.+.. ..+.|. |++|+++...+.+.+++|--... .++..--.+...... +.. T Consensus 395 ~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~-dp~ 473 (528) T protein:vir:80 395 ASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRAT-DPQ 473 (528) T ss_pred hhccccccccccccccccccCCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEee-CCc Confidence 7531 1111111111111111 123444 68999999998887766632110 111100111111111 122 Q ss_pred hhhcCcEEEEEEEEeccEEecccceEEEEecCC---CC Q lcl|Aclame:pro 385 FFTANTLVILAEFRANLAVYQPKAFVRVTFAAA---TT 419 (419) Q Consensus 385 ~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa---~~ 419 (419) .|+ . .+-+..|++..+ +| |+....-+. .+ T Consensus 474 sfq-P--~~g~~tRY~l~~-NP--~~~~~~~~~~~r~~ 505 (528) T protein:vir:80 474 SFH-P--VLGFKTRYGIGI-NP--FADSKSQAPSARIT 505 (528) T ss_pred ccc-c--eeeeeeeeceee-cC--cccccCCccccccc Confidence 343 2 233344555433 44 221111000 00 No 230 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=81.24 E-value=0.088 Score=26.38 Aligned_cols=349 Identities=12% Similarity=-0.007 Sum_probs=129.1 Q ss_pred HHHHHHHHhhccccc----ccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhccc-----------c Q lcl|Aclame:pro 60 PPAPKGPADGGTPLT----PAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDA-----------P 124 (419) Q Consensus 60 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~ 124 (419) ..-.++...-.+... +-.....+.+...+.+... +.+ ......+-.. +.+.......+... . T Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~l~enq~-~~~-~~~~~~~~~~-~~~~~~~~l~e~~~~~~~~~~~~~ia 77 (514) T protein:vir:56 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQD-RDI-NNDPMYRDPQ-LVEAFNAGLNEAVVNGDHGYDPANIA 77 (514) T ss_pred CchhhhhhHHhcccccccccccchhhhhhhhhhhhhHH-HHH-hcCCcccchh-hhhhhhcccccccccccccccccccc Confidence 011111111111110 0000111111111111110 111 1110000000 00000001111100 0 Q ss_pred cccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccc-----------------eeccccc-- Q lcl|Aclame:pro 125 AGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTA-----------------GAGSTWN-- 185 (419) Q Consensus 125 ~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~-- 185 (419) .+..+......-|.+++ +.+..-......+++.+-||+++..-+.-+..... ..+++.. T Consensus 78 ~s~~t~~v~~~~P~ll~--lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~tg~EAf~~~nEadt~fSG~~~~ 155 (514) T protein:vir:56 78 QGVTTGAVTNIGPTVMG--MVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAA 155 (514) T ss_pred cccccccccccchhHHH--HHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCcccccccccccccCcCccccccc Confidence 11111111112232222 11222334455688888888887643322211100 0000000 Q ss_pred --------------------------------------------------------------------cceeecC----- Q lcl|Aclame:pro 186 --------------------------------------------------------------------KAAVVPE----- 192 (419) Q Consensus 186 --------------------------------------------------------------------~a~~v~E----- 192 (419) -..-.+| T Consensus 156 ~~~~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta~aEal~~l 235 (514) T protein:vir:56 156 STIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENF 235 (514) T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhhhhhhhhhhhhhcccC Confidence 0001112 Q ss_pred ----cccccccccceeeEEeeeEEEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHH---hccCc---- Q lcl|Aclame:pro 193 ----GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-----QLMGYIQGRLTYGLRFLRDRQLL---NGNGS---- 256 (419) Q Consensus 193 ----g~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-----~~~~~i~~~l~~a~~~~~d~~il---~G~g~---- 256 (419) +..+++...++++++..++.-+-...+|-||.+|-- |.++.|.+-|+..|...||+.|| +-.-+ T Consensus 236 ggs~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~~~atv~~~ 315 (514) T protein:vir:56 236 NGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKS 315 (514) T ss_pred CCCcccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeehhc Confidence 223566677778888888888888899999999852 57899999999999999999995 32211 Q ss_pred ccccceecccccccccccccccc-chhhhHHHHHH----HHHHhh--hhhccCCcEEEEehHHHHHHHHHh--ccCCcee Q lcl|Aclame:pro 257 TEMQGILTTPGIGTYQQPKPTAP-ATDEPPLVDIR----RAKTVA--EIAGFPPDGVVVHPQDWESIELDQ--APGSGVF 327 (419) Q Consensus 257 ~~p~Gi~~~~~~~~~~~~~~~~~-~~~~~~~~~~~----~~~~~~--~~~~~~~~~~~~~~~~~~~l~~~k--d~~g~~~ 327 (419) ..-.|+-+ .|+..........+ -.....+..+. +..+.+ .......+.+++|+.+...|...- +..-..- T Consensus 316 ~~~~~~~~-~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g 394 (514) T protein:vir:56 316 GWTQGAGA-AGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQG 394 (514) T ss_pred cccccccc-ccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhhhhccccccC Confidence 11122211 12111111110000 01111222221 111222 123345677899999998887421 0000000 Q ss_pred ccCC-ccccCC----Ccccc-cceeEecCCCCcCcEEEEeccce-----EEEEEecceEEEEeecccchhhcCcEEEEEE Q lcl|Aclame:pro 328 RVIA-NVQGEA----TPRIW-GLNVVSTVAIAQGTALVGGFRQG-----ATLWSRQGITVLMTDSHADFFTANTLVILAE 396 (419) Q Consensus 328 ~~~~-~~~~~~----~~~l~-G~pv~~~~~~~~~~~~~~d~~~~-----~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~ 396 (419) +... ...+.. .+.|. |++|+++...+.+.+++|--... .++..--.+..... .+...|+ . .+-+. T Consensus 395 ~~~~~~~~d~~~~~~aG~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~-~dp~sfq-P--~~g~~ 470 (514) T protein:vir:56 395 MQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRG-SDSKNFQ-P--VIGFK 470 (514) T ss_pred ccccccccccCcceEEEEecCceEEEecCCCCcceEEEEEecCcceecceeeccccccccccc-cCCcccc-c--eeeee Confidence 0000 000111 13343 68999999999877666532110 00000000000000 1122332 2 23334 Q ss_pred EEeccEEecccc--eEEE-Ee------cCCCC Q lcl|Aclame:pro 397 FRANLAVYQPKA--FVRV-TF------AAATT 419 (419) Q Consensus 397 ~r~d~~~~~~~a--~~~~-~~------~aa~~ 419 (419) .|++..+ +|-+ -... .+ .+.-+ T Consensus 471 tRY~l~~-NPy~~~~~~~~~~~~~~~~~a~~~ 501 (514) T protein:vir:56 471 TRYGVQV-NPFADPTASATKVGNGAPVAASMG 501 (514) T ss_pred eeeceee-CCCCCccccccccCCcchhhhccc Confidence 4555443 3421 0000 00 00000 No 231 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=76.39 E-value=0.14 Score=25.33 Aligned_cols=351 Identities=10% Similarity=-0.007 Sum_probs=130.2 Q ss_pred HHHHHHHHHHHHhhcccccccc--cchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhccccc-ccccCCc Q lcl|Aclame:pro 56 LRTAPPAPKGPADGGTPLTPAE--AGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPA-GTITNPN 132 (419) Q Consensus 56 l~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 132 (419) +-+.+.-.++...-.+.....+ ....+.+...+-+.. ..................+............ +.++.+. T Consensus 1 ~~~~e~l~~kW~plLe~~~~~~i~~~~k~~i~a~llENQ--e~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~t~~v 78 (468) T protein:vir:10 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQ--ERFLREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGL 78 (468) T ss_pred CcchHHHHHhhhHhhcCCccchhccchhhhhhhhhhhhH--HHHHhccccccchhhHhhcCCcccchhhhhhhhcccccc Confidence 1111111111111111111101 111111111111110 0001111000000000000000000000000 1111111 Q ss_pred ccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccce-------------eccc---------------- Q lcl|Aclame:pro 133 VPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAG-------------AGST---------------- 183 (419) Q Consensus 133 ~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~---------------- 183 (419) ...-|.+++ +.+.........+++.+-||.++..-+.-....... .+++ T Consensus 79 ~~~~P~Li~--l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~g~EAf~nEadt~fSg~~~~~~~~~~~~~~~~ 156 (468) T protein:vir:10 79 AGFDPVLIS--LVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAG 156 (468) T ss_pred cccCchhhh--hHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCCCCccceeccccccccccccccccccccccccc Confidence 112233322 112222344556889999998886554332210000 0000 Q ss_pred ------cccc------------------eeecC-----cccccccccceeeEEeeeEEEEEeehhhHHHHhhHH-----H Q lcl|Aclame:pro 184 ------WNKA------------------AVVPE-----GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-----Q 229 (419) Q Consensus 184 ------~~~a------------------~~v~E-----g~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-----~ 229 (419) +... .-.+| +..+++...++++++..++.-+-...+|-||.+|-- | T Consensus 157 ~~~~~~g~~~~~~~~a~~~~~~~g~gMsTa~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLD 236 (468) T protein:vir:10 157 VGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLD 236 (468) T ss_pred cccCCCCCcccccccccccccccccccchHHHhhcCCCCcccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCC Confidence 0000 00111 233566777778888888888888899999999842 5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhc----cCcccccceeccccccccccccccccchhhhHHHHH----HHHHHhh--hhh Q lcl|Aclame:pro 230 LMGYIQGRLTYGLRFLRDRQLLNG----NGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDI----RRAKTVA--EIA 299 (419) Q Consensus 230 ~~~~i~~~l~~a~~~~~d~~il~G----~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~--~~~ 299 (419) .++.|.+-|+..|...+|+.||.- ...++-.|+. ..|+....... .+......+..+ ......+ ... T Consensus 237 AEtELaNILStEImlEINReii~~l~~va~~~k~~g~~-~~Gv~d~~~~~--~~rw~~e~~k~L~~~i~~ean~i~~~T~ 313 (468) T protein:vir:10 237 AEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVA-NAGIFDLDVDS--NGRWSVEKFKGLLFQVERDANAIAQETR 313 (468) T ss_pred hhHHHHHHHHHHHHHHhcHHHHHhHhhhhhheeccccc-ccccccccccc--cchhHHHHHHHHHHHHHHHHHHHHHhhc Confidence 789999999999999999988862 1112222321 12222211111 011111111111 2222222 333 Q ss_pred ccCCcEEEEehHHHHHHHH---HhccCC--ceeccCCccccC----CCcccc-cceeEecCCCC----cCcEEEEeccc- Q lcl|Aclame:pro 300 GFPPDGVVVHPQDWESIEL---DQAPGS--GVFRVIANVQGE----ATPRIW-GLNVVSTVAIA----QGTALVGGFRQ- 364 (419) Q Consensus 300 ~~~~~~~~~~~~~~~~l~~---~kd~~g--~~~~~~~~~~~~----~~~~l~-G~pv~~~~~~~----~~~~~~~d~~~- 364 (419) .+..+.+++|+.+...|.. +....+ .+.-....-.+. ..+.|. |++|+++.... .+.+++|--.. T Consensus 314 rg~gn~ii~S~~Va~~L~~sG~l~~~~~~~~~~~~~~~~~D~tg~~~~G~l~~r~~vy~D~Ya~~~s~~dY~~vG~KG~~ 393 (468) T protein:vir:10 314 RGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTS 393 (468) T ss_pred cccccEEEechhHHHHHhhcCcceecccccccccccccccccCcceEEEEecCceEEEEccccccCCccceEEEEEecCc Confidence 4556678999999999885 321111 110000000111 123343 68999997653 34444432110 Q ss_pred ---e-EEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEE-EEecCCCC Q lcl|Aclame:pro 365 ---G-ATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVR-VTFAAATT 419 (419) Q Consensus 365 ---~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~-~~~~aa~~ 419 (419) + .++..-..+...... +...|+ . .+-+..|++.. .+|-+-.. ++-.. |. T Consensus 394 ~~d~glfyaPYv~l~~~~~~-dp~sfq-P--~~g~~tRY~l~-~NP~~~~~~~~~g~-~~ 447 (468) T protein:vir:10 394 PYDAGLFYCPYVPLQMVRSI-DPNTFQ-P--KIGFKTRYGMV-SNPFVTTNGLYNGT-PD 447 (468) T ss_pred ceeceeeecccccccccccc-CCCccc-c--eeeeeeeecee-ecccceeccccCCC-cc Confidence 0 000000001111000 112232 2 23334455543 34432211 11111 22 No 232 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=76.01 E-value=0.14 Score=25.26 Aligned_cols=292 Identities=13% Similarity=0.033 Sum_probs=116.4 Q ss_pred cccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHhhhhh--hh Q lcl|Aclame:pro 76 AEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLP--LL 153 (419) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~--~~ 153 (419) -+-..-|.+.+ ...+++.+....+ +.+=-+...+.+..+.+.. .. T Consensus 1 ~~~~~~~~~~~-----a~~~al~~a~~~g----------------------------~AlR~EsLd~~l~~lt~~~~~ft 47 (470) T protein:vir:10 1 MPYEHLKHLDE-----ATLKALNAAGQVA----------------------------ESLEREDLEPEVTQLNVLDTPLT 47 (470) T ss_pred CChhHhhhhhH-----HHHHHHHHhhhcc----------------------------hhhhhhhhccceeEeeecCccch Confidence 00000000000 0011111111000 0010111111111111111 11 Q ss_pred HHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHH---hhH-HH Q lcl|Aclame:pro 154 VADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAA---DDN-SQ 229 (419) Q Consensus 154 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell---~d~-~~ 229 (419) +..-....++.+-.-.|-.... +.+...-....|++-.+.+++++.+.....|-++....||.-.+ +.+ .+ T Consensus 48 f~~~i~k~~a~STV~ey~~~~~-----rhG~~g~s~~~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d 122 (470) T protein:vir:10 48 DLLSKNAVKAKAYEHEYNVVTA-----RHDKIGYAAFREGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQ 122 (470) T ss_pred hhhhcCCchhhhHhhhhhhhcc-----ccccccceeecccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccc Confidence 1111222222221112211111 00111222458999999999999999999999999999997743 333 48 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccC---c---ccccceecccccccccc---c-cccccchhhhHHHHHHHHHHhh--h Q lcl|Aclame:pro 230 LMGYIQGRLTYGLRFLRDRQLLNGNG---S---TEMQGILTTPGIGTYQQ---P-KPTAPATDEPPLVDIRRAKTVA--E 297 (419) Q Consensus 230 ~~~~i~~~l~~a~~~~~d~~il~G~g---~---~~p~Gi~~~~~~~~~~~---~-~~~~~~~~~~~~~~~~~~~~~~--~ 297 (419) ++..+.++..-.++.+++.+++.||- + +.+.|+- ..|+..... . ......+.....+.+..+-..+ . T Consensus 123 ~~~~~~~dai~~ia~tiE~a~FyGDs~l~s~~~g~~~gle-FDGl~~lId~~~~~NViDarG~~Ls~~~L~~aa~~I~~~ 201 (470) T protein:vir:10 123 IDELVKREKMIAVANEFEYLAFYGDNLLGDDVPGSPNNLQ-QDGIINIIKRGAPQNVLDAGGRPLSIDLLWEAESRVVST 201 (470) T ss_pred hHHHHHHHHHHHHHHHHHhhhhhhccccccccCcccCcee-ccchhhhccCCCCccccccCCCCccHHHHHHHHhhhccc Confidence 99999999999999999999999964 1 1122222 222222111 1 1111222333455555655555 4 Q ss_pred hhccCCcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeE--ecCCCCcCcEEEEeccceEEEEE--ecc Q lcl|Aclame:pro 298 IAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVV--STVAIAQGTALVGGFRQGATLWS--RQG 373 (419) Q Consensus 298 ~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~--~~~~~~~~~~~~~d~~~~~~~~~--~~~ 373 (419) .++.+++-.+|+..+.+.|..-.....+. +.+.+... ...|+||- ++. .|.+.+- .+ .+++ +.- T Consensus 202 ~~fGt~TD~~lp~~vka~f~~~~~~~qRv-~~~~N~~~----~~~G~~v~~f~sa---~G~I~L~-~s---~~m~~~~k~ 269 (470) T protein:vir:10 202 QAFANPTAVFISYVDKLNLQASFYQISRV-MTTADRRA----GLLGADAQSYIGV---RGEHSLY-PS---QFLGDFHKF 269 (470) T ss_pred ccccChhhhccchhHHHHHHHhhcCceEE-EEecCCCc----eeeeeeccceeee---eeeeeec-cc---ccccchhhc Confidence 57888888899999999988766554443 33333222 12455542 111 1111110 00 0000 000 Q ss_pred eEEEEeecccchhhcCcEEEEEEEEeccEEec---------ccceEEEEecCCCC Q lcl|Aclame:pro 374 ITVLMTDSHADFFTANTLVILAEFRANLAVYQ---------PKAFVRVTFAAATT 419 (419) Q Consensus 374 ~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~---------~~a~~~~~~~aa~~ 419 (419) -...+..+.. .+.-+++.+-+..-.++.... ++-+....++.+.. T Consensus 270 ~p~~l~~~v~-~~aAP~~~~tv~~t~~~~a~~~~sk~g~~~~~~v~sy~y~v~~~ 323 (470) T protein:vir:10 270 NPARFGAEVG-DFAAPSNSWTVSTTDNFVTLPYNSGLGDPANTTVYSYAFKAANF 323 (470) T ss_pred CcccCCcccC-CcccCceeEEeecCCCceeecccCCCCcccCcceeEEEEEEEEe Confidence 0000000000 000111111111111111000 01111111111111 No 233 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=75.85 E-value=0.14 Score=25.23 Aligned_cols=354 Identities=11% Similarity=0.045 Sum_probs=131.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH--HHHHHHHH---HHHHHHHHHHHHHHHHhhcccccccccchhhhhhHHHHhHHHHHH Q lcl|Aclame:pro 22 SLTTEQVQEIVAEARGLADALQ--AESDRAAA---RAALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLRE 96 (419) Q Consensus 22 ~~~~~~~~~~~~e~~~~~~~~~--~~~~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (419) ..+.+ +|.++=..+.+.-+ -+|..... ...-|+++.+. .+.....+ + .....+ T Consensus 1 ~~~~~---~l~~kw~p~l~~~~~~~~i~~~~~~~~~a~llenq~~~----~~~~~~~~-----------~----~~~~~~ 58 (524) T protein:vir:98 1 MSKKN---ELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKD----AETDPVYR-----------D----EKIVES 58 (524) T ss_pred CcchH---HHHHHhHHHhcCCcCcchhcchhhHHHHHHHHhhHHHH----HhcCcccc-----------c----hHHHHh Confidence 11111 11111111111000 01100000 00001111110 00000000 0 000000 Q ss_pred HHHhhhhhhhhHHHHHHHHHHhhh---cccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeee Q lcl|Aclame:pro 97 YRARDKRGQFQVEMRDIDPNRLLS---RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRD 173 (419) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 173 (419) |.. .+.+......+. .....+..+......-|.+++ +.+..-......+++.+-||.++..-+.-+ T Consensus 59 ~~~---------~l~ea~~~~~~~~~~~~i~~s~~t~~v~~~~P~Li~--lvRra~p~LIa~DIwGVQPMTgPTGLIFAm 127 (524) T protein:vir:98 59 FGG---------FLAEAEIAGDHNYDQTNIASGKSSGAITNIGPAVIG--MVRRAIPNLIAFDICGVQPMTGPTGQVFAL 127 (524) T ss_pred hhc---------cccccccccccccccccccccccccccccccchhhh--HHHHHHHhhhhhhhheeccCCchhhhhhhh Confidence 000 000000000000 000111111111222232222 111122344455778888887765333111 Q ss_pred ccccc---------e--------------eccc---------------------------------------------c- Q lcl|Aclame:pro 174 TSGTA---------G--------------AGST---------------------------------------------W- 184 (419) Q Consensus 174 ~~~~~---------~--------------~~~~---------------------------------------------~- 184 (419) ..... . .+++ + T Consensus 128 RsrY~n~~~~~gteA~~nEAf~~~ye~dt~fSG~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tgt 207 (524) T protein:vir:98 128 RAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGA 207 (524) T ss_pred heeecCCCCCcccccccccccccccccccccCCccccccccccccccccccccccccccccccceeccccccCccccccc Confidence 11000 0 0000 0 Q ss_pred -----------------------ccceeecC---------cccccccccceeeEEeeeEEEEEeehhhHHHHhhHH---- Q lcl|Aclame:pro 185 -----------------------NKAAVVPE---------GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS---- 228 (419) Q Consensus 185 -----------------------~~a~~v~E---------g~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~---- 228 (419) +-..-.+| +..+++...++++++..++.-+-...+|-||.+|-- T Consensus 208 ~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aEaL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHG 287 (524) T protein:vir:98 208 DPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHG 287 (524) T ss_pred ccccccccccccccccceeecccccchhhhhhhccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcC Confidence 00000112 233566777778888888888888899999999852 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHhccC---cccccceecc----ccccccccccc-----cccchhhhHHHHHHHHHHh Q lcl|Aclame:pro 229 -QLMGYIQGRLTYGLRFLRDRQLLNGNG---STEMQGILTT----PGIGTYQQPKP-----TAPATDEPPLVDIRRAKTV 295 (419) Q Consensus 229 -~~~~~i~~~l~~a~~~~~d~~il~G~g---~~~p~Gi~~~----~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~ 295 (419) |.++.|.+-|+..|...||+.||.=-- .-+..|+.+. .|+........ ........++-.+-++.+. T Consensus 288 LDAEtELsNILSTEImlEINReii~~i~~~a~~~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~ 367 (524) T protein:vir:98 288 MDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANE 367 (524) T ss_pred CChHHHHHHHHHHHHHHHhhHHHHHHHhhhheeceeecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHH Confidence 578999999999999999999984100 1123333221 12211111110 0001111122222233333 Q ss_pred hh--hhccCCcEEEEehHHHHHHHHHh----ccCCceeccCCcc-ccCC----Ccccc-cceeEecCCCCcCcEEEEecc Q lcl|Aclame:pro 296 AE--IAGFPPDGVVVHPQDWESIELDQ----APGSGVFRVIANV-QGEA----TPRIW-GLNVVSTVAIAQGTALVGGFR 363 (419) Q Consensus 296 ~~--~~~~~~~~~~~~~~~~~~l~~~k----d~~g~~~~~~~~~-~~~~----~~~l~-G~pv~~~~~~~~~~~~~~d~~ 363 (419) +. ..+...+.+++|+.+...|..+- +..+ +.+... .+.+ -+.|. |++|+++...+.+.+++|--. T Consensus 368 I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~~~s~---~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG 444 (524) T protein:vir:98 368 IARQTGRGAGNFIIASRNVVSALARIDSGITPASQ---GLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKG 444 (524) T ss_pred HHHhhccccccEEEEchHHHHHHhhhhcccccccc---hhhcccccCCccceEEEEecCceEEEecCCCCcceEEEEeeC Confidence 32 33335677899999998888531 1111 011110 0111 13333 689999999988876665321 Q ss_pred ce-----EEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 364 QG-----ATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 364 ~~-----~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .. .++..--.+..... .+...|+ . .+-+..|++..+ +|- +....- ++. T Consensus 445 ~~~~~~glfyaPYv~l~~~~~-~dp~sfq-P--~~g~~tRY~l~~-NP~--~~~~~~-~~~ 497 (524) T protein:vir:98 445 DNEMDAGIYYAPYVALTPLRG-SDPKNFQ-P--VMGFKTRYGIGI-NPF--ANSRSQ-APA 497 (524) T ss_pred Ccccccceeeccccccccccc-cCCcccc-c--eeeeeeeeceee-cCc--ccccCC-ccc Confidence 10 00000000000000 1122232 2 233344555433 442 211111 110 No 234 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=68.75 E-value=0.2 Score=24.47 Aligned_cols=109 Identities=13% Similarity=0.027 Sum_probs=61.4 Q ss_pred EEehHHHHHHHHHhccCCceecc----CCccccCCCcccccceeEecCCCCcCcEEEEeccceEEE----------EEec Q lcl|Aclame:pro 307 VVHPQDWESIELDQAPGSGVFRV----IANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATL----------WSRQ 372 (419) Q Consensus 307 ~~~~~~~~~l~~~kd~~g~~~~~----~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~----------~~~~ 372 (419) +++...|+++...-..++ ++. .+...+..+-+++|+.-+++.++|.++.++.|.....-+ ...+ T Consensus 1 vvsdlqfA~~~g~~v~~~--aLpRE~aNp~ltG~lpV~~~GltWl~tpnlpg~~a~vlDst~lGgmaDE~l~~Pgya~~~ 78 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDK--ALPREQANIVLTGSLPVSAYGLTWVTSRHITGTDPWLFDVEQLGGMADEKLLSPEFAPAG 78 (123) T ss_pred CcchhhHHHHhcchhccc--ccccccCCceEecCcceeeeceeeeecCCCCCCccceeehhhhccccccccCCCcccCCC Confidence 444444555443322221 111 111223344467788889999999988888775442211 1123 Q ss_pred ceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCC Q lcl|Aclame:pro 373 GITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) Q Consensus 373 ~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa 417 (419) +..++++...++.=.+|+..+|+..----.+..|.|.++++-.-- T Consensus 79 ~~Gvevkt~Red~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 79 NTGVEASTERAHQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred CcceeEEeeccccCCCCceEEeeeecceeEEecCccceEEeeecC Confidence 344555555443334787777775444445677999999976655 No 235 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=67.85 E-value=0.25 Score=23.92 Aligned_cols=359 Identities=14% Similarity=0.066 Sum_probs=130.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH---HHHHHHHHHHHHHHHhhcccccccccchhhhhhHHHHhHHHHHH Q lcl|Aclame:pro 21 TSLTTEQVQEIVAEARGLADAL-QAESDRAAAR---AALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLRE 96 (419) Q Consensus 21 ~~~~~~~~~~~~~e~~~~~~~~-~~~~~~~~~~---~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (419) +.-..++ |.++=..+.+.- .-++.....+ ..-|+++....+ .....+ ...+.+ T Consensus 1 ~~~~~~~---l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~----~~~~~~----------------~~~~~e 57 (529) T protein:vir:10 1 MSLKTKE---ILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSK----TDPVYR----------------DDKLIE 57 (529) T ss_pred CccchHH---HHHHhhHhhcCCccchhcchhhhhhhhhhhhhHHHHhh----cccccc----------------hhhhhh Confidence 1100111 111111111100 0001000000 000111110000 000000 000000 Q ss_pred HHHhhhhhhhhHHHHHHHHHHhhh---cccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeee Q lcl|Aclame:pro 97 YRARDKRGQFQVEMRDIDPNRLLS---RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRD 173 (419) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 173 (419) .... .+.+......+. .....+..+......-|.+++ +.+..-......+++.+-||.++..-+.-. T Consensus 58 ~~~~--------~l~e~~~~~~~~~~~~~ia~s~~t~~v~~~~P~Li~--lvRra~p~LIa~DIwGVQPMTgPTGLIFAM 127 (529) T protein:vir:10 58 AFGQ--------SLMEAEVAGDHGYDPTNIAAGQSSGAITNIGPAVIG--MVRRAIPSLIAFDIAGVQPMTGPTGQVFAL 127 (529) T ss_pred hhhh--------ccchhhcccccccccccccccccccccccccchhhh--hHHHHHHhHHhhhhheeccCCchhhhhhhh Confidence 0000 000000000000 000111111111222233222 111122334455778887877765333111 Q ss_pred cccc--------------------------------------------------------------------------ce Q lcl|Aclame:pro 174 TSGT--------------------------------------------------------------------------AG 179 (419) Q Consensus 174 ~~~~--------------------------------------------------------------------------~~ 179 (419) .... .. T Consensus 128 RsrY~~~~~~~~g~eaf~~~~e~dt~~SG~~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~ 207 (529) T protein:vir:10 128 RSVYGKDPLAAGAKEAFHPMYAPDAWHSGLAAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVT 207 (529) T ss_pred eeeecCCcCCCcccccccccccccccccccccccccccccccccccccccceeeccccceeeeccccccccccccccccc Confidence 0000 00 Q ss_pred ecc-----------------------ccccceeecC---------cccccccccceeeEEeeeEEEEEeehhhHHHHhhH Q lcl|Aclame:pro 180 AGS-----------------------TWNKAAVVPE---------GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN 227 (419) Q Consensus 180 ~~~-----------------------~~~~a~~v~E---------g~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~ 227 (419) ... +.+-..-.+| +..+++...++++++..++.-+-...+|-||.+|- T Consensus 208 ~g~~~tg~~~~~~~~~~~a~~~~~~~~~gmsTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDL 287 (529) T protein:vir:10 208 VGTNETGAALDALVSAKIAAGELAEIAEGMATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDL 287 (529) T ss_pred ccccccCCccccccccccccccccccccccchhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHH Confidence 000 0000001122 23366777778888888888888899999999985 Q ss_pred H-----HHHHHHHHHHHHHHHHHHHHHHHhc---cCcccccceec----cccccccccccc-----cccchhhhHHHHHH Q lcl|Aclame:pro 228 S-----QLMGYIQGRLTYGLRFLRDRQLLNG---NGSTEMQGILT----TPGIGTYQQPKP-----TAPATDEPPLVDIR 290 (419) Q Consensus 228 ~-----~~~~~i~~~l~~a~~~~~d~~il~G---~g~~~p~Gi~~----~~~~~~~~~~~~-----~~~~~~~~~~~~~~ 290 (419) - |.++.|.+-|+..|...||+.||.= +-.-+..|+.. ..|+........ ........++-.+- T Consensus 288 KAvHGLDAEtELsNILStEImlEINReii~~i~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~ 367 (529) T protein:vir:10 288 RAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQID 367 (529) T ss_pred HHhcCCChHHHHHHHHHHHHHHHhhHHHHHHhhhhceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHH Confidence 2 5789999999999999999999971 00001122211 112111111110 00011111222222 Q ss_pred HHHHhhh--hhccCCcEEEEehHHHHHHHHH--hccCCceeccCCccccCC----Ccccc-cceeEecCCCCcCcEEEEe Q lcl|Aclame:pro 291 RAKTVAE--IAGFPPDGVVVHPQDWESIELD--QAPGSGVFRVIANVQGEA----TPRIW-GLNVVSTVAIAQGTALVGG 361 (419) Q Consensus 291 ~~~~~~~--~~~~~~~~~~~~~~~~~~l~~~--kd~~g~~~~~~~~~~~~~----~~~l~-G~pv~~~~~~~~~~~~~~d 361 (419) ++.+.+. ..+...+.+++|+++...|... .+..+..-...+...+.. .+.|. |++|+++...+.+.+++|- T Consensus 368 ~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~ 447 (529) T protein:vir:10 368 KEANEIARQTGRGAGNFIIASRNVVSALALVDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGY 447 (529) T ss_pred HHHHHHHHhhccccceEEEEchHHHHHHhhhccccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEEEEE Confidence 3333332 3333566788999999888742 111111000000011111 23443 6899999999888766663 Q ss_pred ccce-----EEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecC---------CCC Q lcl|Aclame:pro 362 FRQG-----ATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAA---------ATT 419 (419) Q Consensus 362 ~~~~-----~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~a---------a~~ 419 (419) -... .++..--.+..... .+...|+ . .+-+..|++..+ +|- +..+.-+ ... T Consensus 448 KG~~~~~~glfy~PYv~l~~~~~-~dp~sfq-P--~~g~~tRY~l~~-NP~--~~~~~~~~~~r~~~g~~~~ 512 (529) T protein:vir:10 448 RGANNLDAGIYYCPYVALTPLRG-SDPKNFQ-P--VMGFKTRYAIGV-NPF--AESRTQAPTSRISNGMPGA 512 (529) T ss_pred eCCcccccceeeccccccccccc-cCCCccc-c--eeeeeeeeceee-cCc--cccccccccccccCCcchh Confidence 2110 11110001111111 1122233 2 233344555433 442 2111110 001 No 236 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=65.16 E-value=0.29 Score=23.55 Aligned_cols=259 Identities=8% Similarity=-0.003 Sum_probs=109.9 Q ss_pred ccCCcccccchhhhHHHHHhhhhhhhHHhhcc--eecccCcceeeeeeccccceeccccccceeecCcccccccccceee Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPLLVADLLD--QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDT 205 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~ 205 (419) ++- .+ -+.+...+.+.....+..-.+.. +.-.++++++||+.......-+. -..+| ..+.-+.+... T Consensus 1 Mai--n~--a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY~--R~~g~-----~~g~v~~~~et 69 (290) T protein:vir:78 1 MAI--NY--VDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAHT--RNKGY-----NEGSASNTNKS 69 (290) T ss_pred Cch--hH--HHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCcccccc--cCCCc-----ccCccccceee Confidence 000 00 12333333333333222222211 12235678888886542211111 11111 11112233344 Q ss_pred EEeeeEEEEEeehhhHHH-HhhH---HHHHHHHHHHHHHHHHHHHHHHHHh----ccCcccccceecccccccccccccc Q lcl|Aclame:pro 206 ITTTLKTVAHWLPITRQA-ADDN---SQLMGYIQGRLTYGLRFLRDRQLLN----GNGSTEMQGILTTPGIGTYQQPKPT 277 (419) Q Consensus 206 v~~~~~k~~~~~~vs~el-l~d~---~~~~~~i~~~l~~a~~~~~d~~il~----G~g~~~p~Gi~~~~~~~~~~~~~~~ 277 (419) .++.-.+.-.+. |. .+ ++.+ ..+.+.+.+.....++-.+|...+. +.++.+ .... T Consensus 70 ~tl~qdR~~~F~-vD-~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~---------------~~~~ 132 (290) T protein:vir:78 70 YTIDFDRDVEFF-VD-VMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNS---------------NSVA 132 (290) T ss_pred EEeeccccceee-cc-ccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccC---------------cccc Confidence 444444433322 11 11 1111 2466677777777888888877553 111110 0001 Q ss_pred ccchhhhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHHHhccCCce--eccCCccccCCCcccccceeEecCC---C Q lcl|Aclame:pro 278 APATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGV--FRVIANVQGEATPRIWGLNVVSTVA---I 352 (419) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~kd~~g~~--~~~~~~~~~~~~~~l~G~pv~~~~~---~ 352 (419) ...+....|+.+..++.++......+-..+|+|..+..|...+.-.... ........++..+.|.|++|+..+. | T Consensus 133 ~t~t~~n~~~~i~~~~~~ldevp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~ 212 (290) T protein:vir:78 133 EEITKDNVFTKLKAAIRKVKKYGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRF 212 (290) T ss_pred cccCHHHHHHHHHHHHHHHHhcCCCCeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCcEEEEecccchh Confidence 1223455677777777777654444445679999998776432111100 0001111245567899999986442 1 Q ss_pred ------CcC----------cEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 353 ------AQG----------TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAA 416 (419) Q Consensus 353 ------~~~----------~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~a 416 (419) -+| ..+++..+ +..-..+-. .+.+..... .-.-|...|.-..++|.=+.+.+.=.+..-++ T Consensus 213 ~t~~~f~~G~~~~~~ak~in~ii~~~~-a~i~~~K~~-~~~~~~P~~-~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~ 289 (290) T protein:vir:78 213 YDTFDFTDGYKPAAGAKKLNFLLVNKG-SVVGGAKHA-SIYLHAPGS-VGQGDGWLYQYRVYHDIFVLDQQKDGVIASTE 289 (290) T ss_pred hhhhhhcccccccCCccceeEEEEcCC-ceeeeeeee-EEEeeCCCC-CcCcceeeeeeeeeeeeeeeccccCeeEEEee Confidence 011 12222221 222221111 233333221 11224456667777887777653222222222 Q ss_pred C Q lcl|Aclame:pro 417 A 417 (419) Q Consensus 417 a 417 (419) + T Consensus 290 ~ 290 (290) T protein:vir:78 290 V 290 (290) T ss_pred C Confidence 2 No 237 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=64.53 E-value=0.3 Score=23.46 Aligned_cols=357 Identities=12% Similarity=0.057 Sum_probs=130.5 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHH---HHHHHHHHHHHHHHHhhcccccccccchhhhhhHHHHhHHHHHHHHHh Q lcl|Aclame:pro 25 TEQVQEIVAEARGLADAL-QAESDRAAAR---AALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLREYRAR 100 (419) Q Consensus 25 ~~~~~~~~~e~~~~~~~~-~~~~~~~~~~---~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 100 (419) ....+.|.++=..+.+.- .-++.....+ ..-|+++.+..... . ...+ .....++ T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~----~-----------~~~~----~~~~~~~--- 58 (528) T protein:vir:66 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVD----P-----------IYKD----EKVVEAF--- 58 (528) T ss_pred CcchHHHHHHhHHhhcCCCcchhcchhhhhhhhhhhhhhHHHhhcc----c-----------chhh----HHHHHhh--- Confidence 111111111111111100 0001000000 00011111000000 0 0000 0000000 Q ss_pred hhhhhhhHHHHHHHHHHhhh---cccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceee------- Q lcl|Aclame:pro 101 DKRGQFQVEMRDIDPNRLLS---RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEY------- 170 (419) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~------- 170 (419) ...+........+. .....+..+.....+-|.+++ +.+..-......+++.+-||.++..-+ T Consensus 59 ------~~~l~ea~~~~~~~~~~~~i~es~~t~~v~~~~P~Li~--lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y 130 (528) T protein:vir:66 59 ------GGFIAEAEVAGDHGYDASQIAAGQTTGAITNVGPAVIG--MVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVY 130 (528) T ss_pred ------hhhhhhhcccccccccchhccccccccccccCchhHHH--HHHHHHHhhhhhhhheeecCCchhhhheeeeeee Confidence 00000000000000 000011111111222232222 112222344555778888887741100 Q ss_pred eeec-----------------------------------------------cccceec---------------------- Q lcl|Aclame:pro 171 IRDT-----------------------------------------------SGTAGAG---------------------- 181 (419) Q Consensus 171 ~~~~-----------------------------------------------~~~~~~~---------------------- 181 (419) +... ....... T Consensus 131 ~~~~~~~~~~eAfh~~~g~ea~fsea~t~~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~ 210 (528) T protein:vir:66 131 GGDPLKSGAREAFHPMYAPDAFHSSLAAKEATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTGDSVTPQK 210 (528) T ss_pred cCCcccccccccccccccccccccccccccccccCCccceeecccccccccccceeeecccccceeeeccccccccccCc Confidence 0000 0000000 Q ss_pred ---------------------cccccceeecC---------cccccccccceeeEEeeeEEEEEeehhhHHHHhhHH--- Q lcl|Aclame:pro 182 ---------------------STWNKAAVVPE---------GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS--- 228 (419) Q Consensus 182 ---------------------~~~~~a~~v~E---------g~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~--- 228 (419) ...+-..-.+| +...++...+++++++.++.-+-...+|-||.+|-- T Consensus 211 ~g~~~g~~~~~~~~a~~~~~~~~~Gm~Ta~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIH 290 (528) T protein:vir:66 211 VGSESEDEVVMKLIEEGKLAEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVH 290 (528) T ss_pred ccccccccccccccccccceecccccchhhhhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhc Confidence 00000011122 123677778888888888888889999999999852 Q ss_pred --HHHHHHHHHHHHHHHHHHHHHHHh---ccCcccccceec----ccccccccccccc-----ccchhhhHHHHHHHHHH Q lcl|Aclame:pro 229 --QLMGYIQGRLTYGLRFLRDRQLLN---GNGSTEMQGILT----TPGIGTYQQPKPT-----APATDEPPLVDIRRAKT 294 (419) Q Consensus 229 --~~~~~i~~~l~~a~~~~~d~~il~---G~g~~~p~Gi~~----~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~ 294 (419) |.+..|.+-|+..|...||+.||. -.-.-+-+|+.. ..|+......... .......++-.+-.+.+ T Consensus 291 GLDAEtELsNILStEImlEINREii~~i~~~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~~i~~~an 370 (528) T protein:vir:66 291 GMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAA 370 (528) T ss_pred CCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHHHHHHHHH Confidence 578999999999999999999963 111111122221 1121111111000 00011112222223333 Q ss_pred hhh--hhccCCcEEEEehHHHHHHHHHh-----ccCCceeccCCccccC-CCcccc-cceeEecCCCCcCcEEEEeccce Q lcl|Aclame:pro 295 VAE--IAGFPPDGVVVHPQDWESIELDQ-----APGSGVFRVIANVQGE-ATPRIW-GLNVVSTVAIAQGTALVGGFRQG 365 (419) Q Consensus 295 ~~~--~~~~~~~~~~~~~~~~~~l~~~k-----d~~g~~~~~~~~~~~~-~~~~l~-G~pv~~~~~~~~~~~~~~d~~~~ 365 (419) .+. ..+...+.+++|+.+...|...- +..|....+..+.+.. ..+.|. |++|+++...+.+.+++|--... T Consensus 371 ~I~~~T~r~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~ 450 (528) T protein:vir:66 371 EIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDN 450 (528) T ss_pred HHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCCceeEEEecCceEEEecCCCCcceEEEEEeCCc Confidence 333 33345567889999998887531 1111111111111111 123454 68999999998887766632110 Q ss_pred -----EEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 366 -----ATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 366 -----~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .++..--........ +...|+ . .+-+..|++..+ +|- +. ..+-.+. T Consensus 451 ~~~~glfyaPYv~l~~~~~~-dp~sfq-P--~~g~~tRY~l~v-NP~--~~-~~~~~~~ 501 (528) T protein:vir:66 451 EMDAGIYYAPYVALTPLRAT-DPQSFH-P--VLGFKTRYGIGI-NPF--AD-SKSQEPS 501 (528) T ss_pred ccccceeecccccceeeEee-CCcccc-c--eeeeeeeeceee-cCc--cc-ccCcccc Confidence 111100011111111 122343 2 233344555433 442 11 1111111 No 238 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=64.00 E-value=0.31 Score=23.39 Aligned_cols=333 Identities=14% Similarity=0.038 Sum_probs=130.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhc Q lcl|Aclame:pro 42 LQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSR 121 (419) Q Consensus 42 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 121 (419) +.+.. . -++|. +...++.+... ++..+...++......+ +.+ ....+. T Consensus 1 ~~~~~--~---~e~l~---------------------~kw~p~l~~~~-~~~~~~~~a~llenq~~-~~~----~~l~e~ 48 (523) T protein:vir:59 1 MSQPK--I---NEQLI---------------------EKWQPLLEGCR-NDWERHTLATLLENQYR-EAK----KHLMET 48 (523) T ss_pred CCcch--h---hHHHH---------------------HhhhhhhcccC-ChhHHHHHHHHhhhhhH-HHH----Hhhhhh Confidence 00000 0 00000 00011111000 00011111111111000 000 011111 Q ss_pred ccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeecccc------------------------ Q lcl|Aclame:pro 122 DAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGT------------------------ 177 (419) Q Consensus 122 ~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~------------------------ 177 (419) ..... ..+.+++++ +.+.......-.+++.|-||+++..-|.-+..+. T Consensus 49 ~~~~~--~~~~~~~~~------~v~r~~p~l~a~DIWGVQPMTGPTGLIFAMRSRY~~q~gteA~yg~~~~~~~~a~~~~ 120 (523) T protein:vir:59 49 TQTTE--VDGWNLALP------IVRRVFANLRATDLVSVQPLSLPTGLVFYLDFKSPELPGNGSVYGGTGLTTDTATGGL 120 (523) T ss_pred hhccc--cccccchhh------hhhhHhhhhhhhhccccccCCCCcceeEEEEeeccCCCCcccccCccccCcccccccc Confidence 11111 111112221 2222223334445566666655442221111000 Q ss_pred ---ce--------------------------------------------------------------------------- Q lcl|Aclame:pro 178 ---AG--------------------------------------------------------------------------- 179 (419) Q Consensus 178 ---~~--------------------------------------------------------------------------- 179 (419) .. T Consensus 121 ~ean~~~s~~~~~~~~~~d~~~sg~~~~~~~a~stg~A~a~~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~ 200 (523) T protein:vir:59 121 YDENARLSRREYETTITVDLATAQQATMRDVGFDTGIASLVSSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENT 200 (523) T ss_pred cccccccccccccCccCCCcccccccccccccccccchhhccccceeeeecccccccccccccccccccccccccccccc Confidence 00 Q ss_pred ------ec---cc--------------ccccee-----------------------------------ecCccccccccc Q lcl|Aclame:pro 180 ------AG---ST--------------WNKAAV-----------------------------------VPEGTAKPQSTL 201 (419) Q Consensus 180 ------~~---~~--------------~~~a~~-----------------------------------v~Eg~~~~~~~~ 201 (419) .. .. ...... -.++...++... T Consensus 201 fa~s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~F 280 (523) T protein:vir:59 201 VAYPLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINL 280 (523) T ss_pred ccchhhccccccccccccccccccccccccccccCCCcccccccccccccccccchhhccccccccccccccccccceee Confidence 00 00 000000 012234556666 Q ss_pred ceeeEEeeeEEEEEeehhhHHHHhhH-H-----HHHHHHHHHHHHHHHHHHHHHHHhc----cCcccccceecccccccc Q lcl|Aclame:pro 202 SFDTITTTLKTVAHWLPITRQAADDN-S-----QLMGYIQGRLTYGLRFLRDRQLLNG----NGSTEMQGILTTPGIGTY 271 (419) Q Consensus 202 ~~~~v~~~~~k~~~~~~vs~ell~d~-~-----~~~~~i~~~l~~a~~~~~d~~il~G----~g~~~p~Gi~~~~~~~~~ 271 (419) +++++++.++.-+-...+|-||.+|- + |.++.|.+-|+..|...||+.||.- .-.++-.|+.+ .|+... T Consensus 281 sIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~~~-~g~~~~ 359 (523) T protein:vir:59 281 ELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGFWS-EVVGEY 359 (523) T ss_pred EEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccccc-cceeee Confidence 77788888888888889999999984 2 3788999999999999999988862 11122222211 111111 Q ss_pred ccccc---ccc-------chhhhHHHHHHHHHHhhh--hhccCCcEEEEehHHHHHHHHHhccCCceeccCCcccc-CCC Q lcl|Aclame:pro 272 QQPKP---TAP-------ATDEPPLVDIRRAKTVAE--IAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQG-EAT 338 (419) Q Consensus 272 ~~~~~---~~~-------~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~-~~~ 338 (419) ..... ... .....++-.+-+..+.+. ..+...+.+++|+++...|...---.++... ....++ ... T Consensus 360 ~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~~-~~~~~~~~~~ 438 (523) T protein:vir:59 360 YDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGNDN-RDGGTGIFYV 438 (523) T ss_pred cccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCCcc-ccccccceeE Confidence 11000 000 000111122222223232 3333567789999999887642100000000 000000 012 Q ss_pred cccc-cceeEecCCCCcCcEEEEeccc-----e-EEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEE Q lcl|Aclame:pro 339 PRIW-GLNVVSTVAIAQGTALVGGFRQ-----G-ATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVR 411 (419) Q Consensus 339 ~~l~-G~pv~~~~~~~~~~~~~~d~~~-----~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~ 411 (419) +.|. |++|+++...+.+.+++|--.. + .++..-..+.......+...|+ =.+-+..|++..|.+|.+... T Consensus 439 g~l~~~~~vy~d~~~~~dy~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~q---p~~~~~tRY~l~v~nP~~~~~ 515 (523) T protein:vir:59 439 GMVQGRYRLYKNIYQNQPVIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFS---YRRGLMTRYALEVVRPEFYGL 515 (523) T ss_pred EEecCceEEEecCCCCcceEEEEecccCCcccccceecccchhhcccccccCCccc---ceeeeeeehhheecchhHhhh Confidence 3444 5799999999888777663221 0 1111000010000001112343 244566799998888866554 Q ss_pred EEecCCCC Q lcl|Aclame:pro 412 VTFAAATT 419 (419) Q Consensus 412 ~~~~aa~~ 419 (419) +-+.---. T Consensus 516 ~~~~~~~~ 523 (523) T protein:vir:59 516 LYVKLLQP 523 (523) T ss_pred hhhhhcCC Confidence 43322111 No 239 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=52.55 E-value=0.55 Score=22.00 Aligned_cols=346 Identities=9% Similarity=-0.031 Sum_probs=83.8 Q ss_pred CCc----cHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MPP----TPTLEEQRAALLARLDDTSLTTE---------QVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPA 67 (419) Q Consensus 1 M~~----~~~L~e~~~~l~~~~~~~~~~~~---------~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 67 (419) |.. +++|++++++++++.+++....+ +.....++++...+.+++++++++.+.+.++.......... T Consensus 12 ~~el~~~l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~~l~~~~~~~~~~~ 91 (397) T protein:vir:96 12 IKERSSEIDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQDLEDELAKAADPT 91 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh Confidence 222 34555555555555554433221 22233445566677777788777777766665544333322 Q ss_pred hhcccccccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhh-HHHHH Q lcl|Aclame:pro 68 DGGTPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVP-GIVPT 146 (419) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~-~~i~~ 146 (419) ...................... .+....+... .+..............+....+... ....+. ..... T Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~vp~~~-~~~i~~~~~~~~ 160 (397) T protein:vir:96 92 DQKPKDGEKRKMKKFKVTEEEL-AEKRSAINAF---------VKSKGAEKRDGFTSVEGGALIPQEL-LQPQLEPKDIVD 160 (397) T ss_pred hhhhHHHHHHHHHHHhhhhHHH-HHHHHHHHHH---------HHhhhhhhhhcccccccccchhHHH-HHHHHHhhhhhh Confidence 2111111100000000000000 0000000000 0000000000111111111111110 111110 00111 Q ss_pred hhhhhhh--HHh--h-cceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhH Q lcl|Aclame:pro 147 TPDLPLL--VAD--L-LDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITR 221 (419) Q Consensus 147 ~~~~~~~--l~~--~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ 221 (419) ....... +.. + .++....+....+..+.+ -..|.........++.--.+...-.-..--+.+ T Consensus 161 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~-------------~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~d 227 (397) T protein:vir:96 161 LSKYVRSVPVNSASGKFPVISKSGSKMATVQQLE-------------KNPQLANPKMVEIDYSVATRRGYIPISQEMIDD 227 (397) T ss_pred HHHhhhhccccccceeEEEEeccCCccccccccc-------------cccccccccccceeecHhHhhcchhhHHHHHhh Confidence 1111111 100 0 111111122221111111 011111111111222111111100000001111 Q ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc--CcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhh Q lcl|Aclame:pro 222 QAADDNSQLMGYIQGRLTYGLRFLRDRQLLNGN--GSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIA 299 (419) Q Consensus 222 ell~d~~~~~~~i~~~l~~a~~~~~d~~il~G~--g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 299 (419) ...+-...+.+.+...++.+....+-...=.+. |......|...-.... .......-......+..+ ..+.+. T Consensus 228 s~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~d~~~~~~~~~~-~~~~~a~~v~n~~~~~~l----~~lkd~ 302 (397) T protein:vir:96 228 ASYDVTGLIADEIQDQSLNTKNADIAAVLKTATAKSVVGVDGLKDLINKEI-KKVYDVKLFISASMYSEL----DKLKDK 302 (397) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccchHHHHHHHHHhh-hhhcCcEEEEcHHHHHHH----HHhhcc Confidence 111111123344444444444444333222222 2222222221111000 000000111122223332 333333 Q ss_pred ccCCcEEEEehHHHHHHHHHhccC-----CceeccCCccccCCCccccc-ceeEecCCCCcCcEEEEecc---------- Q lcl|Aclame:pro 300 GFPPDGVVVHPQDWESIELDQAPG-----SGVFRVIANVQGEATPRIWG-LNVVSTVAIAQGTALVGGFR---------- 363 (419) Q Consensus 300 ~~~~~~~~~~~~~~~~l~~~kd~~-----g~~~~~~~~~~~~~~~~l~G-~pv~~~~~~~~~~~~~~d~~---------- 363 (419) .. .+++.|.. .+.. |.++.+..+ ..++.-.| .++++-++ .. .+++++.. T Consensus 303 ~G---~~~~~~~~-------~~~~~~~l~G~pv~~~~~---~~~~~~~~~~~~~~gd~-~~-~~~~~~~~~~~~~~~~~~ 367 (397) T protein:vir:96 303 NG---RYLLQDSI-------TAASGKQLLGKEVVVLDD---DVIGKSVGNVVGFIGDA-KA-FASFFDRKQVSVSWVDNN 367 (397) T ss_pred CC---CeEeccCc-------cCCCcccccccceEEecc---cccCCCCCceEEEEeeh-hc-ceEeEeecceEEEEeccc Confidence 22 23443321 1111 222211111 00010011 12222111 00 01112111 Q ss_pred ---ceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccc Q lcl|Aclame:pro 364 ---QGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKA 408 (419) Q Consensus 364 ---~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a 408 (419) ..+..+.|-+..+. ....|.. + -+.+ | T Consensus 368 ~~~~~~~~~~r~d~~~~----~~~a~~~------~----~~~~----a 397 (397) T protein:vir:96 368 IYGQLLAGIIRYDVKAT----DKKAGFY------V----TFTI----G 397 (397) T ss_pred ccceeEEEEEEEccEEe----cccceEE------E----Eeec----C Confidence 11111111111111 1111211 1 1111 1 No 240 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=52.42 E-value=0.55 Score=21.99 Aligned_cols=348 Identities=14% Similarity=0.014 Sum_probs=124.1 Q ss_pred HHHHHHHHHHHHHHHHHhhcccccc--cccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhh--cc-ccc Q lcl|Aclame:pro 51 ARAALLRTAPPAPKGPADGGTPLTP--AEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLS--RD-APA 125 (419) Q Consensus 51 ~~~~~l~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~ 125 (419) -++. ..+.-.++...-.+.... -.....+.+...+.+.. .+.+... .....+......-.. .. ... T Consensus 1 ~~~~---~~e~l~~kw~p~l~~~~~~~i~~~~~~~v~a~l~enq-~~~~~~~-----~~~l~e~~~~~~~~~~~~~~i~~ 71 (470) T protein:vir:10 1 MQMF---NSEYLQEKWAPILDYDGLDPIKDSHRRSVTAVLLENQ-EKELREE-----RNFLSEAPNVNTNSGATAGFSAD 71 (470) T ss_pred CCcc---hhHHHHHhhhhhhcCCccchhcchhhhhhhhhhhhhh-HHHHhhc-----cchhhhhhhcccccccccccccc Confidence 0000 000000111100111000 00000111111111100 0100000 000000000000000 00 011 Q ss_pred ccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccc------cce-------ecccccc-----c Q lcl|Aclame:pro 126 GTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSG------TAG-------AGSTWNK-----A 187 (419) Q Consensus 126 ~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~------~~~-------~~~~~~~-----a 187 (419) +..+......-|.+++ +.+.........+++.+-||+++..-+.-.... +.. .+++... . T Consensus 72 st~t~~v~~~~P~Li~--lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~sG~EaffnEA~T~fSG~~~~~~~~~ 149 (470) T protein:vir:10 72 ATAAGPVAGFDPVLIS--LIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQSGTEALFNEADTAFSGQPDGLDDTS 149 (470) T ss_pred ccccccccccCchhhh--hHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCCCccceeeecCCcccCcccccccccc Confidence 1111222222233333 222233445566889999999887655432210 000 0000000 0 Q ss_pred ----------------------------e----------------eecC------cccccccccceeeEEeeeEEEEEee Q lcl|Aclame:pro 188 ----------------------------A----------------VVPE------GTAKPQSTLSFDTITTTLKTVAHWL 217 (419) Q Consensus 188 ----------------------------~----------------~v~E------g~~~~~~~~~~~~v~~~~~k~~~~~ 217 (419) . -.+| +...++...+++++++.++.-+-.. T Consensus 150 ~~~~~~a~~~g~~~~~~~gt~~~~~~~~~~~a~~~~y~~~~GMsTa~aE~lg~s~~~~f~EMaFsIeK~tVtAKSRaLKA 229 (470) T protein:vir:10 150 GFTATGANNVGLGTTAQQGSNPGLLNSTAAQTNATDYNVGQGMRTDSAEDLGDGTGDQFNQMAFSIEKVTVTAKSRALKA 229 (470) T ss_pred cccccccccccccccccccccccccccccccccccccccccccchHHhhhcCCCCCcccceeeeEEEEEEEEeeccceec Confidence 0 0001 2335666677778888888888888 Q ss_pred hhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHHhcc----Ccccccceeccccccccccccc--c-ccchhhhH Q lcl|Aclame:pro 218 PITRQAADDNS-----QLMGYIQGRLTYGLRFLRDRQLLNGN----GSTEMQGILTTPGIGTYQQPKP--T-APATDEPP 285 (419) Q Consensus 218 ~vs~ell~d~~-----~~~~~i~~~l~~a~~~~~d~~il~G~----g~~~p~Gi~~~~~~~~~~~~~~--~-~~~~~~~~ 285 (419) .+|-||.+|-- |.++.|.+-|+..|...+|+.||.-= -.++..|+ ...|+........ . ........ T Consensus 230 eYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~k~~~~-~~~Gv~Dl~~~~~gr~~~e~~~~l~ 308 (470) T protein:vir:10 230 EYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIYNVAEPGAQANV-AAAGTFDLDTDSNGRWSVEKFKGLI 308 (470) T ss_pred cccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhhhceeccc-cccceEEeecccchhHHHHHHHHHH Confidence 99999999852 57899999999999999999888621 11122222 1122211111110 0 00001111 Q ss_pred HHHHHHHHHh-hhhhccCCcEEEEehHHHHHHHHH--hccC-CceeccCCccccC-CCcccc-cceeEecCCCCcC---- Q lcl|Aclame:pro 286 LVDIRRAKTV-AEIAGFPPDGVVVHPQDWESIELD--QAPG-SGVFRVIANVQGE-ATPRIW-GLNVVSTVAIAQG---- 355 (419) Q Consensus 286 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~--kd~~-g~~~~~~~~~~~~-~~~~l~-G~pv~~~~~~~~~---- 355 (419) +..-..+-.. ....-...+.+++|+.+...|... .+.. |-.-....+.+.. ..+.|. |++|+++..+..+ T Consensus 309 ~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~a~ 388 (470) T protein:vir:10 309 FQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGILQGKYRVYIDPFSASGGAAA 388 (470) T ss_pred HHHHHHHHHHHHhhccccceEEEEchhHHhHhhhccccccccccccccccCCCCceEEEEecCceEEEeeccccccCccc Confidence 1111111111 123334556788999999887531 1100 0000001111100 123443 5899999865432 Q ss_pred --cEEEEeccceEEEEEecceEEEEeecc---------cchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 356 --TALVGGFRQGATLWSRQGITVLMTDSH---------ADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 356 --~~~~~d~~~~~~~~~~~~~~i~~~~~~---------~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .+++|- + +-.-+ ...+-..++. ...|+ . .+-+..|++..+ +|-+-..-.-.+..+ T Consensus 389 ~dy~~vG~-K-G~~~~---~~glfy~PYv~l~~~~~~dp~sfq-P--~~g~~tRY~l~~-NP~~~~~~~~~~~i~ 454 (470) T protein:vir:10 389 TQYYVVGY-K-GSSPY---DAGLFYCPYVPLQMVRAVGQDTFQ-P--KIGFKTRYGLVE-NPFSQGTTQGLGTLT 454 (470) T ss_pred ccEEEEEE-e-cCcce---ecceeeccccccccCCCCCCcccc-c--eeeeeeeeceee-cCcccCCCccccccc Confidence 233321 1 00000 0001111111 11232 1 222334544432 333211000000011 No 241 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=51.96 E-value=0.57 Score=21.93 Aligned_cols=282 Identities=12% Similarity=0.051 Sum_probs=117.6 Q ss_pred cccccCCcccccchhhhHHHHHhh----hhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccc Q lcl|Aclame:pro 125 AGTITNPNVPHLPQLVPGIVPTTP----DLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST 200 (419) Q Consensus 125 ~~~~~~~~~~~~p~~~~~~i~~~~----~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 200 (419) .+.+. .....-|..+...|...+ .....+..+++..++.+-.+.+....... ...+.+++.+...+-.. T Consensus 1 M~~~~-~~d~~~~~~l~~~i~~~~~~~~~~~~l~~~~fp~~~~~~~~~~~~~~~~~~------~~~a~~~~~~~~~~~~~ 73 (348) T protein:vir:98 1 MSWTL-DTEFIEPTQLTGLIREALRDLQVNRFRLARWLPNVDVDDITFEFLRGGGGL------AETASYRSWDTESKIGR 73 (348) T ss_pred Ccchh-hhhccCHHHHHHHHHHHhhccCcchhhHHhcCCCccccceEEEEEeccCCc------eeeeeeecCCCccceee Confidence 00101 111222333333333322 12223345565555444333333221111 11245566665555443 Q ss_pred -cceeeEEeeeEEEEEeehhhHH-HHh--hHH--HHHHHHHH---HHHHHHHHHHHH----HHHhcc----Ccccccce- Q lcl|Aclame:pro 201 -LSFDTITTTLKTVAHWLPITRQ-AAD--DNS--QLMGYIQG---RLTYGLRFLRDR----QLLNGN----GSTEMQGI- 262 (419) Q Consensus 201 -~~~~~v~~~~~k~~~~~~vs~e-ll~--d~~--~~~~~i~~---~l~~a~~~~~d~----~il~G~----g~~~p~Gi- 262 (419) ..+...++.+-.++-...++.+ ++. ..+ .+..+|.+ ++.+++.+.+|. ++.+|. |.+. .+ T Consensus 74 r~g~~~~~~~~~~i~~~~~i~~~d~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~~--~vD 151 (348) T protein:vir:98 74 REGLAKVMGELPPISEKIPLNEYDRLRLRKLSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQQ--TVD 151 (348) T ss_pred cccceeeeeeccccccccccCHHHHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCce--EEc Confidence 3567777777777766666664 222 221 34444443 345555555553 444442 1111 11 Q ss_pred eccccccccccccccccchhhhHHHHHHHHHHhhhh-hccCCcEEEEehHHHHHHHH---HhccC-Cc-----eeccCCc Q lcl|Aclame:pro 263 LTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEI-AGFPPDGVVVHPQDWESIEL---DQAPG-SG-----VFRVIAN 332 (419) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~l~~---~kd~~-g~-----~~~~~~~ 332 (419) +..+.....+.....+...+.+.+.|+.+.+..+.. .+..+..++|++.+|..|+. +++.- ++ ...+... T Consensus 152 yg~~~~~~~t~~~~Ws~~~~adp~~di~~~~~~~~~~~G~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~~~~~~~ 231 (348) T protein:vir:98 152 FGRIGSHSVVAAVLWSVHATATPISDLESWVATYEDTNGQSPGVILMPKAAVSHMRQCEEVIRQVFPLAPSGTAPMVSVE 231 (348) T ss_pred cccCcccccccccccCCCCCCCHHHHHHHHHHHHHHccCCcceEEEeCHHHHHHHhcCHHHHHHHhccCccccccccCHH Confidence 011111111222222223345667888888776655 46678889999999999863 33211 10 0011100 Q ss_pred cccCCCcccccce-eEecC-----------CCCcCcEEEEeccceE-------EEEEecceEEEEee---------ccc- Q lcl|Aclame:pro 333 VQGEATPRIWGLN-VVSTV-----------AIAQGTALVGGFRQGA-------TLWSRQGITVLMTD---------SHA- 383 (419) Q Consensus 333 ~~~~~~~~l~G~p-v~~~~-----------~~~~~~~~~~d~~~~~-------~~~~~~~~~i~~~~---------~~~- 383 (419) ..... -.-+|.| |++.+ .+|++.++++-..... ++....|.+.+... ..+ T Consensus 232 ~~~~~-~~~~g~~~i~~~d~~~~~~g~~~~~~p~~~i~l~p~~~~~~~~~~~~~G~t~~G~~~e~~~~~~~~~~~~~~~i 310 (348) T protein:vir:98 232 QLNTV-LSSMGLPPIEVYDAKVAVDGVSTRITPANAIALLPEPGATDAAQPTELGATLLGTTAESLEDDYALAPGEQPGI 310 (348) T ss_pred HHHHH-HHhhCCeEEEEeeeEEEcCCceeceecCCeEEEEecCCcccccccccccceecccchhhhccccccceeccCce Confidence 00000 0113443 33322 2355555553211100 00000000000000 000 Q ss_pred --chhh-cC--cEEEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 384 --DFFT-AN--TLVILAEFRANLAVYQPKAFVRVTFAA 416 (419) Q Consensus 384 --~~~~-~~--~~~~r~~~r~d~~~~~~~a~~~~~~~a 416 (419) ..|. .| .....+..+.=-.+.+|++++++++=+ T Consensus 311 ~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 348 (348) T protein:vir:98 311 VAATWKTKDPVRLWTHAAAVGIPVLREPNLTFKAQVLA 348 (348) T ss_pred eeeeeeecCCcEEEEEEeeeeeccccCCCcEEEEEEeC Confidence 0011 11 233444555445667899999998888 No 242 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=47.39 E-value=0.7 Score=21.42 Aligned_cols=360 Identities=11% Similarity=0.019 Sum_probs=131.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHH-H-HHHHHHHHHHHHHHhhcccccccccchhhhhhHHHHhHHHHHH Q lcl|Aclame:pro 21 TSLTTEQVQEIVAEARGLADA--LQAESDRAAAR-A-ALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLRE 96 (419) Q Consensus 21 ~~~~~~~~~~~~~e~~~~~~~--~~~~~~~~~~~-~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (419) +. ....+.|.++=..+.+. +. ++..-... . .-+++++... +.. +.-.. ...... T Consensus 1 ~~--~~~~~~l~~kw~p~l~~~~~~-~i~~~~~~~~a~~~enq~~~~----~~~------~~~~~---------~~~~~~ 58 (521) T protein:vir:10 1 MT--IKTKAELLNKWKPLLEGEGLP-EIANSKQAIIAKIFENQEKDF----QTA------PEYKD---------EKIAQA 58 (521) T ss_pred CC--cchhHHHHHhhhhhhccCCCC-ccccchhhhhhhhhhhhhhhh----hhc------cccch---------hHHHHH Confidence 00 00011111111111110 00 01000000 0 0011110000 000 00000 000000 Q ss_pred HHHhhhhhhhhHHHHHHHHHHhhh---cccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeee Q lcl|Aclame:pro 97 YRARDKRGQFQVEMRDIDPNRLLS---RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRD 173 (419) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 173 (419) +... +.+......+. .....+..+......-|.+++ +.+..-......+++.+-||.++..-+.-+ T Consensus 59 ~~~~---------l~e~~~~~~~~~~~~~i~es~~t~~v~~~~P~Li~--lvRra~p~LIa~DIwGVQPMTgPTGLIFAM 127 (521) T protein:vir:10 59 FGSF---------LTEAEIGGDHGYNATNIAAGQTSGAVTQIGPAVMG--MVRRAIPNLIAFDICGVQPMNSPTGQVFAL 127 (521) T ss_pred Hhhh---------hhhhcccCccccccccccccccccccccCCchhhh--HHHHHHhhhhhhhceeeccCCchhhhheee Confidence 0000 00000000000 000011111111112232222 111122344556788888888776433222 Q ss_pred ccccce-------------------eccccc------------------------------------------------- Q lcl|Aclame:pro 174 TSGTAG-------------------AGSTWN------------------------------------------------- 185 (419) Q Consensus 174 ~~~~~~-------------------~~~~~~------------------------------------------------- 185 (419) ...... .+++.. T Consensus 128 RsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~ 207 (521) T protein:vir:10 128 RAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAK 207 (521) T ss_pred eeeccCCccccccccccchhccccccccccccccccccccccccccccccccccccccccceecccccccCCCccccccc Confidence 111000 000000 Q ss_pred ------------c--------ceeecC---------cccccccccceeeEEeeeEEEEEeehhhHHHHhhHH-----HHH Q lcl|Aclame:pro 186 ------------K--------AAVVPE---------GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-----QLM 231 (419) Q Consensus 186 ------------~--------a~~v~E---------g~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-----~~~ 231 (419) . ..-.+| +...++...++++++..++.-+-...+|-||.+|-- |.+ T Consensus 208 ~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAE 287 (521) T protein:vir:10 208 LDAEIKKQMEAGALVEIAEGMATSIAELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDAD 287 (521) T ss_pred ccccccccccccceeecccccchhhHhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChH Confidence 0 000111 123566777778888888888888899999999852 578 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCc---ccccceecc----ccccccccccccc-cchhhh----HHHHHHHHHHhhh-- Q lcl|Aclame:pro 232 GYIQGRLTYGLRFLRDRQLLNGNGS---TEMQGILTT----PGIGTYQQPKPTA-PATDEP----PLVDIRRAKTVAE-- 297 (419) Q Consensus 232 ~~i~~~l~~a~~~~~d~~il~G~g~---~~p~Gi~~~----~~~~~~~~~~~~~-~~~~~~----~~~~~~~~~~~~~-- 297 (419) +.|.+-|+..|...||+.||.=--. -+.+|+... .|+.......... .-.... ++-.+-.....+. T Consensus 288 tELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~ 367 (521) T protein:vir:10 288 AELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQ 367 (521) T ss_pred HHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHh Confidence 9999999999999999999841100 122333221 1211111111000 001111 1222222222222 Q ss_pred hhccCCcEEEEehHHHHHHHHHh--c---cCC-ceeccCCcccc-CCCcccc-cceeEecCCCCcCcEEEEeccce---- Q lcl|Aclame:pro 298 IAGFPPDGVVVHPQDWESIELDQ--A---PGS-GVFRVIANVQG-EATPRIW-GLNVVSTVAIAQGTALVGGFRQG---- 365 (419) Q Consensus 298 ~~~~~~~~~~~~~~~~~~l~~~k--d---~~g-~~~~~~~~~~~-~~~~~l~-G~pv~~~~~~~~~~~~~~d~~~~---- 365 (419) ..-...+.+++|+++...|...- + +.| ..-|.. +.+. -.-+.|. |++|+++...+.+.+++|--... T Consensus 368 T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~-d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~ 446 (521) T protein:vir:10 368 TGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATGFNT-DTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDA 446 (521) T ss_pred cccccceEEEEchHHHHHHhhcccccccccccccccccc-cCCCceEEEEecCceEEEecCCCCcceEEEEEeCCccccc Confidence 22245567899999998887531 0 000 011111 1111 1123444 68999999998887766632110 Q ss_pred -EEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceE-------EEEe-----cCCCC Q lcl|Aclame:pro 366 -ATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFV-------RVTF-----AAATT 419 (419) Q Consensus 366 -~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~-------~~~~-----~aa~~ 419 (419) .++..--.+..... .+...|+ . .+-+..|++..+ +|-+-. +++- .+.-+ T Consensus 447 glfyaPYv~l~~~~~-~dp~sfq-P--~~g~~tRY~l~~-NP~~~~~~~~~~~~i~~~~~~~~a~~~ 508 (521) T protein:vir:10 447 GIYYAPYVALTPLRG-SDPKNFQ-P--VMGFKTRYGIGI-NPFAESAAQAPASRIQSGMPSILNSLG 508 (521) T ss_pred ceeeccccccccccc-cCCcccc-c--eeeeeeeeceee-cCcccccCCccceeecccchhhhcccc Confidence 01100000111111 1122343 2 233344555433 442111 0000 00111 No 243 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=45.13 E-value=0.78 Score=21.17 Aligned_cols=361 Identities=12% Similarity=0.037 Sum_probs=136.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH--HHHHHHHHHHHHHHHhhcccccccccchhhhhhHHHHhHHHH Q lcl|Aclame:pro 18 LDDTSLTTEQVQEIVAEARGLADALQ-AESDRAAAR--AALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGL 94 (419) Q Consensus 18 ~~~~~~~~~~~~~~~~e~~~~~~~~~-~~~~~~~~~--~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 94 (419) +..++. .+.|.++=..+.+.-- .++..-... ..-|+++.+..+. ... . .+.... T Consensus 1 ~~~~~~----~e~l~~kw~p~l~~~~~~~~~~~~~~~~a~l~enq~~~~~~----~~~-----------~----~~~~~~ 57 (522) T protein:vir:69 1 MTTIKT----KAQLVDKWKELLEGEGLPEIANSKQAIIAKIFENQEKDFEV----SPE-----------Y----KDEKIA 57 (522) T ss_pred CCccch----HHHHHHhhHHHhcCCCCCccccchhhhhhhhhhhhhHHhhc----ccc-----------c----chhHHH Confidence 111111 1111111111111000 000000000 0001111000000 000 0 000000 Q ss_pred HHHHHhhhhhhhhHHHHHHHHHHhhh---cccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeee Q lcl|Aclame:pro 95 REYRARDKRGQFQVEMRDIDPNRLLS---RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYI 171 (419) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~ 171 (419) .+| ...+.+......+. .....+.++....-+-|.++. +.++.-......+++.+-||+++..-+. T Consensus 58 ~~~---------~~~l~ea~~~~~~~~~~~~i~es~~t~~v~~~~P~li~--lvrRa~p~LIa~DIwGVQPMTgPTGLIF 126 (522) T protein:vir:69 58 QAF---------GSFLTEAEIGGDHGYNAQNIAAGQTSGAVTQIGPAVMG--MVRRAIPNLIAFDICGVQPMNSPTGQVF 126 (522) T ss_pred Hhh---------hhhhhhhccccccCCCcccccccccccccccccchHHH--HHHHHHhhhhhhhceeeccCCchhhhhe Confidence 000 00111110000000 000011111111112232222 1222233445567888888887764322 Q ss_pred eeccccce-------------------ecccc------------------------------------------------ Q lcl|Aclame:pro 172 RDTSGTAG-------------------AGSTW------------------------------------------------ 184 (419) Q Consensus 172 ~~~~~~~~-------------------~~~~~------------------------------------------------ 184 (419) -+...... .+++. T Consensus 127 AMRsrY~~q~~~~~~~eaf~~~neadt~fSG~~~~t~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t~~~~ 206 (522) T protein:vir:69 127 ALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGQGAAKKFPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSSADDA 206 (522) T ss_pred eeeeeccCCcccCccccccccccccccccccccccccccccccccccccccccccccccccceeeecccCCcCCCCCccc Confidence 11110000 00000 Q ss_pred ---------------------ccceeecC---------cccccccccceeeEEeeeEEEEEeehhhHHHHhhHH-----H Q lcl|Aclame:pro 185 ---------------------NKAAVVPE---------GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-----Q 229 (419) Q Consensus 185 ---------------------~~a~~v~E---------g~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-----~ 229 (419) +-..-.+| +...++...++++++..++.-+-...+|-||.+|-- | T Consensus 207 ~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLD 286 (522) T protein:vir:69 207 AKLDAEIIKQMEAGALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMD 286 (522) T ss_pred ccccchhccccccccceeeccccchhhhhhcccCCCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCC Confidence 00001122 124677778888888888888889999999999852 5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccC-cc--cccceec----cccccccccccccc-cchhhh----HHHHHHHHHHhhh Q lcl|Aclame:pro 230 LMGYIQGRLTYGLRFLRDRQLLNGNG-ST--EMQGILT----TPGIGTYQQPKPTA-PATDEP----PLVDIRRAKTVAE 297 (419) Q Consensus 230 ~~~~i~~~l~~a~~~~~d~~il~G~g-~~--~p~Gi~~----~~~~~~~~~~~~~~-~~~~~~----~~~~~~~~~~~~~ 297 (419) .++.|.+-|+..|...||+.||.=-- +. +.+|+.+ ..|+.......... +-.... ++-.+-.....+. T Consensus 287 AEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~ 366 (522) T protein:vir:69 287 ADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIA 366 (522) T ss_pred hHHHHHHHHHHHHHHHhhHHHHhhhhhhheeeccccccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHH Confidence 78999999999999999999984100 00 1223321 12222111111100 001111 1222222333332 Q ss_pred --hhccCCcEEEEehHHHHHHHHHh-----ccCC-ceeccCCcccc-CCCcccc-cceeEecCCCCcCcEEEEeccce-- Q lcl|Aclame:pro 298 --IAGFPPDGVVVHPQDWESIELDQ-----APGS-GVFRVIANVQG-EATPRIW-GLNVVSTVAIAQGTALVGGFRQG-- 365 (419) Q Consensus 298 --~~~~~~~~~~~~~~~~~~l~~~k-----d~~g-~~~~~~~~~~~-~~~~~l~-G~pv~~~~~~~~~~~~~~d~~~~-- 365 (419) ......+.+++|+++...|...- .+.| ..-+.. +.+. -.-+.|. |++|+++...+.+.+++|--... T Consensus 367 ~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~-d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~ 445 (522) T protein:vir:69 367 RQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLASGFNT-DTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGANEM 445 (522) T ss_pred HhcccccccEEEEchhHHHHHhhcccccccccccccccccc-cCCCceEEEEecCceEEEecCCCCcceEEEEEeCCccc Confidence 23335667899999998887531 0111 011111 1111 1123444 68999999998887766632110 Q ss_pred ---EEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccc-------eEEEEecCCCC Q lcl|Aclame:pro 366 ---ATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKA-------FVRVTFAAATT 419 (419) Q Consensus 366 ---~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a-------~~~~~~~aa~~ 419 (419) .++..-........ .+...|+ . .+-+..|++..+ +|-+ .+++. ...|+ T Consensus 446 ~~glfyaPYv~l~~~~~-~dp~sfq-P--~~g~~tRY~l~v-NP~~~~~~~~~~~ri~-~g~p~ 503 (522) T protein:vir:69 446 DAGIYYAPYVALTPLRG-SDPKNFQ-P--VMGFKTRYGIGV-NPFAESSLQAPGARIQ-SGMPS 503 (522) T ss_pred ccceeeccccccccccc-cCCcccc-c--eeeeeeeeceee-cCcccccCCcccceee-cccch Confidence 11110001111111 1222343 2 233444665543 3321 11222 22332 No 244 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=44.49 E-value=0.8 Score=21.10 Aligned_cols=270 Identities=12% Similarity=0.016 Sum_probs=110.3 Q ss_pred cCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEe Q lcl|Aclame:pro 129 TNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITT 208 (419) Q Consensus 129 ~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~ 208 (419) -+.+-.++.......-+........-..+++.+|+.....+|+...... .... .-.-++-++....-+++....++ T Consensus 1 ~~~~~~~~dp~LT~~A~gy~n~~~Ia~~l~P~vpV~~~~~~~~~f~~~e-~F~~---~~t~r~~~~~~~~v~~~~~~~~~ 76 (309) T protein:vir:99 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQ-GFTV---PETLVGRKSKPNEVEFSATDETG 76 (309) T ss_pred CCCCCcCcCHhHHHHHhhccChhhhhhhcCCccccCccccceeeechhh-cccc---cchhhccCCCcceEeecccCcee Confidence 1111111111222222222222333346688889888778887754311 0000 01112333333333344444555 Q ss_pred eeEEEEEeehhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHHHHHh--ccCcccccc-eeccccccccccccccccchh Q lcl|Aclame:pro 209 TLKTVAHWLPITRQAADDNS---QLMGYIQGRLTYGLRFLRDRQLLN--GNGSTEMQG-ILTTPGIGTYQQPKPTAPATD 282 (419) Q Consensus 209 ~~~k~~~~~~vs~ell~d~~---~~~~~i~~~l~~a~~~~~d~~il~--G~g~~~p~G-i~~~~~~~~~~~~~~~~~~~~ 282 (419) .....+-..+|..+-+.+++ +.++.-.+.+.+.+....|..+-. -+..+-|.+ -.+.+| +... ...+ T Consensus 77 ~~~~~~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~~~~k~~Lsg------t~~w-sd~~ 149 (309) T protein:vir:99 77 STEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSG------ADQW-SDPT 149 (309) T ss_pred eecccceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhcCCCceEEecC------cccc-CCCC Confidence 55555666777777766543 567777777777776666543322 111111222 001111 1111 1234 Q ss_pred hhHHHHHHHHHHhhhhhccCCcEEEEehHHHHHHHH---Hhcc-CCceeccCCccccCCCcccccc-eeEecCCC----- Q lcl|Aclame:pro 283 EPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIEL---DQAP-GSGVFRVIANVQGEATPRIWGL-NVVSTVAI----- 352 (419) Q Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~---~kd~-~g~~~~~~~~~~~~~~~~l~G~-pv~~~~~~----- 352 (419) .+.+.++..++..+ .+.++..+|...+|..|+. +... .++.. -.+.++-..-..|+|+ .|++.... T Consensus 150 SDPi~~i~~~~~~~---g~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~-~~g~it~~~la~l~~ve~V~vg~a~~n~a~ 225 (309) T protein:vir:99 150 SNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLG-DEGMVPMAFLQELLELDAIYIGEARLNIAR 225 (309) T ss_pred CCcHHHHHHHHHhh---CCCcceEEechHHHHHHhhCHHHHHHhcCCCc-cccccCHHHHHHHhCcceEEeecceeeccc Confidence 55666776666554 5688899999999988765 2111 11110 0111111111235565 35543222 Q ss_pred C-c---------CcE-EEE---------eccceEEE---EEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccce Q lcl|Aclame:pro 353 A-Q---------GTA-LVG---------GFRQGATL---WSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAF 409 (419) Q Consensus 353 ~-~---------~~~-~~~---------d~~~~~~~---~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~ 409 (419) + + +.+ ++. +.+.+|.. ....+. +. .+.. =..+...+|+...+.-.+.-+.+- T Consensus 226 ~g~~~~~~~iwg~~~~L~y~~~~~~~~~~ps~G~t~~~~~r~~g~-~~-d~~~---~~~g~~~vr~~~~~k~~i~~~d~G 300 (309) T protein:vir:99 226 PGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGS-IA-DPNI---GLRGGQRVRVGESVKELVTAPDLG 300 (309) T ss_pred cccccccccccCCcEEEEEcCCCCCCcccccccceeecccccCCc-ee-eeee---ccCCceEEEEeccccchhcchhcc Confidence 1 0 011 110 11111111 011111 11 0100 012223345544444444444433 Q ss_pred EEEEecCCC Q lcl|Aclame:pro 410 VRVTFAAAT 418 (419) Q Consensus 410 ~~~~~~aa~ 418 (419) ..++-+.|- T Consensus 301 ~li~~~va~ 309 (309) T protein:vir:99 301 FFFENAVAA 309 (309) T ss_pred hhhhhcccC Confidence 333222222 No 245 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=42.89 E-value=0.87 Score=20.92 Aligned_cols=379 Identities=9% Similarity=-0.011 Sum_probs=103.4 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQV-------QEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPL 73 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~-------~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 73 (419) |.++.+|++++.++.++.+++....++. .+..++.+...+.++++++++..+++.++........... .... T Consensus 4 ~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 82 (408) T protein:vir:10 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEE-KGPL 82 (408) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-cccc Confidence 6679999999888877776665543221 1223445555666677777777766655544322211111 1111 Q ss_pred cccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHh-hhhh- Q lcl|Aclame:pro 74 TPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTT-PDLP- 151 (419) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~-~~~~- 151 (419) ..............+.+. ...........+.+.. .......++.+-+.. +.+..+....... +... T Consensus 83 ~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~a~-----~~~t~~~gg~~vP~~-~~~~Ii~~~~~~~~l~~~~ 150 (408) T protein:vir:10 83 NKSENELKDKFVKDFVNM------VRNPMAFMNTVSSKTE-----TSGSDSAAGLTIPQD-IRTMINTLVRQYDSLQQYV 150 (408) T ss_pred ccchhhhHHHHHHHHHHH------hhcchhhhhhhhhhhh-----hcccccCCceeccHh-HHHHHHHHHHhhchhhhhc Confidence 111111111111111111 1111101111111111 111111122222221 1222222111111 0111 Q ss_pred --hhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHHH Q lcl|Aclame:pro 152 --LLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNSQ 229 (419) Q Consensus 152 --~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~~ 229 (419) .++......+++. ...+......+.+.+ +-+.|.........+|..-.+...-.-..--+.+....-... T Consensus 151 ~~~~~~~~~~~~~~~------~~~~~~~~a~~v~E~--~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~ 222 (408) T protein:vir:10 151 RVESVSTSNGSRVYE------KWTDVTPLTVMDAED--GKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAW 222 (408) T ss_pred ceeeccCCcceEEEe------eccccccceeeecCc--cccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHH Confidence 1111111111110 000111111111111 112232222234455555454433221111122211111223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCcc---cccceeccc--cccccccccccccchhhhHHHHHHHHHHhhhhhccCCc Q lcl|Aclame:pro 230 LMGYIQGRLTYGLRFLRDRQLLNGNGST---EMQGILTTP--GIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD 304 (419) Q Consensus 230 ~~~~i~~~l~~a~~~~~d~~il~G~g~~---~p~Gi~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 304 (419) +.+.|...++..+...+=...=.|.... ....++..- .+.. .......-......+. .+..+.....+ T Consensus 223 i~~~l~~~~~~~~~~~il~g~g~~~~~~~~~~~~~l~~~~~~~~~~-~~~~~a~~v~n~~~~~----~l~~lkd~~G~-- 295 (408) T protein:vir:10 223 LSSWIAKKVVVTRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDP-AIIATSSLLTNQSGLN----KLALVKTAEGK-- 295 (408) T ss_pred HHHHHHHHHHHHHHHHHhhcccccccccccccHHHHHHHHHHhhhh-hhccCCEEEEcHHHHH----HHHHhhccCCc-- Confidence 4444555555555544433333333221 122222110 0000 0000000011222222 23334443322 Q ss_pred EEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEEE--------EEecceEE Q lcl|Aclame:pro 305 GVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATL--------WSRQGITV 376 (419) Q Consensus 305 ~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~~--------~~~~~~~i 376 (419) +++.+..... .-..=.|.++.+..+..-+.. .-.-.++++-+. ..-+++++....-+- +.+....+ T Consensus 296 -~i~~~~~~~~--~~~~l~G~PV~~~~~~~~~~~-~~~~~~i~~gd~--~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~ 369 (408) T protein:vir:10 296 -YLLEPDPTKP--NSYLIKGKQVIVVADRWLPNT-GSTVYPLYYGDM--SQAITLFDRENMSLLPTNIGAGAFETDTTKI 369 (408) T ss_pred -eEeccCcCCC--CCceecceeeEEecccccCcc-CCCceEEEEEeh--hccEEEEEecceEEEEcccccchhhcCceEE Confidence 3332211000 000002332222111000000 000122332221 011222222111110 11111111 Q ss_pred EEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 377 LMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 377 ~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) ......+ .-..+--+|. .+.+...-| -+-.+.++++| T Consensus 370 r~~~r~d-~~v~~~~a~~---~~~~~~~~~--~~~~~~~~~~~ 406 (408) T protein:vir:10 370 RVIDRFD-VKATDSEALV---AGSFSAIAD--QVGNFKTTTST 406 (408) T ss_pred EEEEeec-cEEeccccEE---EEEeecccc--CCCCCCCCCcc Confidence 1111000 0000000111 112111111 11112222222 No 246 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=42.03 E-value=0.9 Score=20.83 Aligned_cols=361 Identities=14% Similarity=0.055 Sum_probs=127.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH---HHHHHHHHHHHHHHHhhcccccccccchhhhhhHHHHhHHHHHH Q lcl|Aclame:pro 21 TSLTTEQVQEIVAEARGLADAL-QAESDRAAAR---AALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLRE 96 (419) Q Consensus 21 ~~~~~~~~~~~~~e~~~~~~~~-~~~~~~~~~~---~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (419) +....++ |.++=..+.+.- .-++.....+ ..-|+++.+..... ...+ ...+.+ T Consensus 1 ~~~~~~~---l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~----~~~~----------------~~~~~e 57 (529) T protein:vir:10 1 MSLKNKE---ILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSD----PVYR----------------DDKLIE 57 (529) T ss_pred CcccHHH---HHHHhHHHhcCCccchhccchhhhhhhhhhhhhHHHHhhc----cccc----------------hhhhhh Confidence 0000011 111111111100 0000000000 00011111000000 0000 000000 Q ss_pred HHHhhhhhhhhHHHHHHHHHHhhh---cccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeee Q lcl|Aclame:pro 97 YRARDKRGQFQVEMRDIDPNRLLS---RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRD 173 (419) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 173 (419) .... .+.+......+. .....+..+......-|.+++ +.+..-......+++.+-||.++..-|.-+ T Consensus 58 ~~~~--------~l~~~~~~~~~~~~~~~i~est~t~~v~~~~P~Li~--lvRra~p~LIa~DIwGVQPMTgPTGLIFAM 127 (529) T protein:vir:10 58 AFGQ--------SLMEAEVAGDHGYDPTNIAAGQSSGAITNIGPAVIG--MVRRAIPSLIAFDIAGVQPMTGPTGQVFAL 127 (529) T ss_pred hhhc--------ccchhhccccccccccccccccccccccccCchhhh--hHHHHHhhhhhheeeeeecCCchhhhhhhh Confidence 0000 000000000000 000011111111122232222 112122334455777888877653222111 Q ss_pred ccc--------------------------------------------------------------------------cce Q lcl|Aclame:pro 174 TSG--------------------------------------------------------------------------TAG 179 (419) Q Consensus 174 ~~~--------------------------------------------------------------------------~~~ 179 (419) ... ... T Consensus 128 RsrY~~~~~~~~~~eaf~~~y~Pda~~sga~~~ga~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~ 207 (529) T protein:vir:10 128 RSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVT 207 (529) T ss_pred heeecCCccccccccccccccccccccccccccccccccCccccccccccccccccCcceeeeecccceecccccccccc Confidence 000 000 Q ss_pred ecc-----------------------ccccceeecC---------cccccccccceeeEEeeeEEEEEeehhhHHHHhhH Q lcl|Aclame:pro 180 AGS-----------------------TWNKAAVVPE---------GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN 227 (419) Q Consensus 180 ~~~-----------------------~~~~a~~v~E---------g~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~ 227 (419) ... +.+-..-.+| +..+++...+++++++.++.-+-...+|-||.+|- T Consensus 208 ~g~~~~~~~~~~~~~~~~a~~~~~~~~~Gm~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDL 287 (529) T protein:vir:10 208 VGTNETGEALDKLINAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDL 287 (529) T ss_pred cCccccCcccccccccccccccccccccccchhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHH Confidence 000 0000001112 12366777788888888888888899999999985 Q ss_pred H-----HHHHHHHHHHHHHHHHHHHHHHHhcc----Ccccccceec---cccccccccccccc-cchh----hhHHHHHH Q lcl|Aclame:pro 228 S-----QLMGYIQGRLTYGLRFLRDRQLLNGN----GSTEMQGILT---TPGIGTYQQPKPTA-PATD----EPPLVDIR 290 (419) Q Consensus 228 ~-----~~~~~i~~~l~~a~~~~~d~~il~G~----g~~~p~Gi~~---~~~~~~~~~~~~~~-~~~~----~~~~~~~~ 290 (419) - |.++.|.+-|+..|...||+.||.-= -.++..|+-. ..|+.......... .-.. ..++-.+- T Consensus 288 KAVHGLDAEtELsNILStEImlEINReii~~l~~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~ 367 (529) T protein:vir:10 288 RAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQID 367 (529) T ss_pred HHhcCCChHHHHHHHHHHHHHHHhhHHHHHhHhhhhhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHH Confidence 2 57899999999999999999888621 0011111110 11222111111000 0001 11222222 Q ss_pred HHHHhhh--hhccCCcEEEEehHHHHHHHHH--hccCCceeccCCccccC----CCcccc-cceeEecCCCCcCcEEEEe Q lcl|Aclame:pro 291 RAKTVAE--IAGFPPDGVVVHPQDWESIELD--QAPGSGVFRVIANVQGE----ATPRIW-GLNVVSTVAIAQGTALVGG 361 (419) Q Consensus 291 ~~~~~~~--~~~~~~~~~~~~~~~~~~l~~~--kd~~g~~~~~~~~~~~~----~~~~l~-G~pv~~~~~~~~~~~~~~d 361 (419) ++.+.+. ..+...+.+++|+++...|... .+.....-...+...+. ..+.|. |++|+++...+.+.+++|- T Consensus 368 ~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~ 447 (529) T protein:vir:10 368 KEANEIARQTGRGAGNFIIASRNVVSALALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGY 447 (529) T ss_pred HHHHHHHHhhccccceEEEEchHHHHHHHhhhhhccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEE Confidence 3333332 3334566788999999888742 11111000111111111 123443 5899999999887766663 Q ss_pred ccce-----EEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEe-------c------CCCC Q lcl|Aclame:pro 362 FRQG-----ATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTF-------A------AATT 419 (419) Q Consensus 362 ~~~~-----~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~-------~------aa~~ 419 (419) -... .++..--.+..... .+...|+ . .+-+..|++..+ +|-+-..-.. . +-.+ T Consensus 448 KG~~~~~~glfy~PYv~l~~~~~-~dp~sfq-P--~~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ag~n 518 (529) T protein:vir:10 448 RGANNLDAGIYYCPYVALTPLRG-SDPKNFQ-P--VMGFKTRYAIGV-NPFAESRTQAPQGRITSGMPGVNSVGKN 518 (529) T ss_pred eCCcccccceeeccccccccccc-cCCCccc-c--eeeeeeeeceee-cCccccccccccccccCCcchhhhcCcc Confidence 2110 01100001111110 1122232 2 223334554432 3322111000 0 0000 No 247 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=41.25 E-value=0.93 Score=20.74 Aligned_cols=266 Identities=13% Similarity=0.036 Sum_probs=113.3 Q ss_pred ccCCcccccchhhhHHHHHhhhhhhhHHhh-cc---eecccCcceeeeeeccccceeccccccceeecCcccccccccce Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPLLVADL-LD---QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSF 203 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~-~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 203 (419) +. ...-..+.+...+.+.....+.--.+ .+ +.-.+++.+++|+.......-+.-....+|... +-+.++ T Consensus 1 Ma--ntl~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~g~~~~~g-----~v~~~~ 73 (312) T protein:vir:10 1 MA--NTLAYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGSANAYVGG-----DVKFEY 73 (312) T ss_pred CC--cchhHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeecccccccccccCCccccc-----cccccc Confidence 11 11122344555554444332211111 00 112456788888765322111111011112110 112233 Q ss_pred eeEEeeeEEEEEeehhhHHH-HhhH---HHHHHHHHHHHHHHHHHHHHHHHHhcc--Ccccccceecccccccccccccc Q lcl|Aclame:pro 204 DTITTTLKTVAHWLPITRQA-ADDN---SQLMGYIQGRLTYGLRFLRDRQLLNGN--GSTEMQGILTTPGIGTYQQPKPT 277 (419) Q Consensus 204 ~~v~~~~~k~~~~~~vs~el-l~d~---~~~~~~i~~~l~~a~~~~~d~~il~G~--g~~~p~Gi~~~~~~~~~~~~~~~ 277 (419) ...+|.-.+.-.+. |. .+ ++.+ ..+...+.+.....+.=.+|...+.-= +... .......... T Consensus 74 et~tl~qDR~~~F~-vD-~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~---------~~~~~~~~~~ 142 (312) T protein:vir:10 74 ETKTMTQDRGRKFT-LD-AMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIG---------IKGDTNVEYS 142 (312) T ss_pred eeEEeeecccceee-cc-ccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhc---------cccccccccc Confidence 33343333332221 11 11 1112 134555556666666677777766410 0000 0000011112 Q ss_pred ccchhhhHHHHHHHHHHhhhhhccC-CcEEEEehHHHHHHHHHhccCCceecc---CCccccCCCcccccceeEecCCCC Q lcl|Aclame:pro 278 APATDEPPLVDIRRAKTVAEIAGFP-PDGVVVHPQDWESIELDQAPGSGVFRV---IANVQGEATPRIWGLNVVSTVAIA 353 (419) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~~kd~~g~~~~~---~~~~~~~~~~~l~G~pv~~~~~~~ 353 (419) ...+....++.+..++..+.....+ +-..+|+|..+..|.+- ....+-. .....+...+.|.|+||+.. | T Consensus 143 ~~~T~~ni~~~i~~~~~~lde~~vp~~rvl~vTp~~~~lLk~~---~~~~~~~~~~~~~~i~~~V~~iDgv~Ii~V---P 216 (312) T protein:vir:10 143 YSVNSSTIINKIKTGIKIIRENGYNGPLVCHLTYDSMFAIEEK---VLEKLTAVTFAQGGIQTQVPSIDGCALIKT---P 216 (312) T ss_pred cccCHHHHHHHHHHHHHHHHHccCCCceEEEeChHHHHHHhhh---hhceecccccccceeeeeeeeecccEEEEc---h Confidence 2234566788888888888887655 34567888887666542 1111111 11112344567999999863 3 Q ss_pred cCcEE-EEeccce--------------------EEEEEecceE--------EEEeecccchhhcCcEEEEEEEEeccEEe Q lcl|Aclame:pro 354 QGTAL-VGGFRQG--------------------ATLWSRQGIT--------VLMTDSHADFFTANTLVILAEFRANLAVY 404 (419) Q Consensus 354 ~~~~~-~~d~~~~--------------------~~~~~~~~~~--------i~~~~~~~~~~~~~~~~~r~~~r~d~~~~ 404 (419) .+-.. ..+|..+ ++++. .... +.+.... ..-..|...|.-..++|.=+. T Consensus 217 s~r~~t~~~f~dG~t~~~~~gg~~~~~~ak~INfiiv~-~~a~i~~~K~~~~~if~P~-~~~~~d~~~~~~R~Y~D~fv~ 294 (312) T protein:vir:10 217 QNRMYSSILLNDGTTSNQTAGGYLKGTKALDTNFIIAP-VDVPLAITKQDKMRIFDPE-TNQTANAWSMDYRRYHDLWVT 294 (312) T ss_pred hhhccceeeeccCcccccccCceeecCcccccceEEeC-CceeeceeeeeeeeeeCCC-CCCCcceeeeeeeeeeeeeee Confidence 32110 0111111 11111 1111 2222111 111223456666777887776 Q ss_pred cc-cceEEEEecCCCC Q lcl|Aclame:pro 405 QP-KAFVRVTFAAATT 419 (419) Q Consensus 405 ~~-~a~~~~~~~aa~~ 419 (419) +. ..-+.+.++.|.+ T Consensus 295 ~nk~~~Iyv~~k~a~~ 310 (312) T protein:vir:10 295 DNKANSVYANFKDAKP 310 (312) T ss_pred ccccCeEEEEeecccC Confidence 64 3444567777666 No 248 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=40.86 E-value=0.95 Score=20.70 Aligned_cols=349 Identities=10% Similarity=-0.002 Sum_probs=85.1 Q ss_pred CCccHHHHHHHHHHHHHHHHHH----HHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTS----LTTEQVQEIVA---EARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPL 73 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~----~~~~~~~~~~~---e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 73 (419) ..-.++++++.+++++++++.. ...++..++.+ ..+...+.++++++.++...+................... T Consensus 5 ~~l~~~~~~~~~e~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~ 84 (394) T protein:vir:10 5 QTLFNEVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPVDNAQPNGTDL 84 (394) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhcccccch Confidence 3333555555555555554322 22344444333 3444555666666665544433222222211111111111 Q ss_pred cccccchh-hhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHh-hhhh Q lcl|Aclame:pro 74 TPAEAGTF-RSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTT-PDLP 151 (419) Q Consensus 74 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~-~~~~ 151 (419) ........ +............. .... .. ...+.++...+.. +..+.+....... +... T Consensus 85 ~~~~~~~~~~~~~~~l~~~~~~~----------~~~~--------~~-~t~~~gg~~vP~~-~~~~ii~~~~~~~~l~~~ 144 (394) T protein:vir:10 85 KKKPIDAKKKAINDFIHSHGKVI----------DNAA--------GH-VTSTEAGVLIPEE-IIYDPTAEVNSVVDLSTL 144 (394) T ss_pred hhhHHHHHHHHHHHHHhccchhh----------hhhh--------cc-cccccCceeccHH-HHHHHHHHHHhhhhhhhh Confidence 11111111 11111111000000 0000 00 0011111111111 1111111111000 0000 Q ss_pred ---hhHHhhcceecc---cCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHh Q lcl|Aclame:pro 152 ---LLVADLLDQQNA---DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAAD 225 (419) Q Consensus 152 ---~~l~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~ 225 (419) .++-..-..+|+ .++...+..+. +-..|.........++.--.+...-.-..--+-+.... T Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~E~-------------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~ 211 (394) T protein:vir:10 145 VTKTPVTTPKGTYPILKRATDRFSSVAEL-------------AENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVD 211 (394) T ss_pred ceeeeccCCceEEEEEecCCCcccccccc-------------ccccccccccceeEEeeeeeeEeeehhHHHHHhhhhHH Confidence 011101011111 12222221111 11122222222334444444433322111112221111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccc---cc--ccccccchhhhHHHHHHHHHHhhhhhc Q lcl|Aclame:pro 226 DNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTY---QQ--PKPTAPATDEPPLVDIRRAKTVAEIAG 300 (419) Q Consensus 226 d~~~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~---~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 300 (419) -...+...|...++..+-.++=...=.|...+.... .....+... .. .....-......+ ..+..+.+.. T Consensus 212 l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~~~~-~~~d~l~~~~~~~~~~~~~a~~vmn~~~~----~~l~~lkd~~ 286 (394) T protein:vir:10 212 LTSLVGQSINEKSVNTYNAMIAPVLQSFTAKATTTD-TLVDSLKHILNVDLDPAYSRALVVTQSLF----NTLDTLKDKN 286 (394) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccccccccc-ccHHHHHHHHHhhhhhhccCEEEecHHHH----HHHHHhhccC Confidence 112344445555555555554433333332211111 111111000 00 0000001111222 2333444443 Q ss_pred cCCcEEEEehHHHHHHH-HHh-ccCCceeccCCccccCCCccccc-ceeEecCCCCcCcEEEEe-------------ccc Q lcl|Aclame:pro 301 FPPDGVVVHPQDWESIE-LDQ-APGSGVFRVIANVQGEATPRIWG-LNVVSTVAIAQGTALVGG-------------FRQ 364 (419) Q Consensus 301 ~~~~~~~~~~~~~~~l~-~~k-d~~g~~~~~~~~~~~~~~~~l~G-~pv~~~~~~~~~~~~~~d-------------~~~ 364 (419) .+ +++.|....... ... .=.|.+..+..+ ...+.-.| .++++-++ .. .+++++ |.. T Consensus 287 G~---~i~~~~~~~~~~~~~~~~L~G~PV~~~~~---~~~~~~~~~~~i~~gd~-s~-~~~~~~~~~~~v~~~~~~~~~~ 358 (394) T protein:vir:10 287 GR---YLLHDASDSITDGTAKGTVLGVPVYVVGD---ALLGSAAGDQKAFVGDL-KR-GVLFADRQQVTLAWEDSKIYGR 358 (394) T ss_pred CC---eeeeccccccccCCcccccccceeEEecc---cccCCCCCceEEEEeec-cc-cEEEEeecceEEEEecccccce Confidence 32 233222110000 000 001222211111 00011111 12222111 00 011111 111 Q ss_pred eEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 365 GATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 365 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .+..+.|-+..+. ..+.+.+ +..-..++.+| T Consensus 359 ~~~~~~r~d~~~~---------~~~ai~~---------------~~~~~~~~~~~ 389 (394) T protein:vir:10 359 YLGAAFRFGVKQA---------DSNAGYF---------------VTNTDAASGST 389 (394) T ss_pred eEEEEEEeccEEe---------ccccEEE---------------EEeecccCCCC Confidence 1111111111111 1111111 11112222222 No 249 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=40.20 E-value=0.98 Score=20.63 Aligned_cols=371 Identities=7% Similarity=-0.097 Sum_probs=115.7 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQV-----QEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTP 75 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~-----~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 75 (419) ...+++|+++++++.+++..+....++. .+..++...+.++++++++.++++.+.++.................. T Consensus 4 ~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (421) T protein:vir:13 4 FERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGGRVI 83 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Confidence 8888888888888887776555443221 11233344556777777777777776666655544333322221111 Q ss_pred --cccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhH----HHHHhhh Q lcl|Aclame:pro 76 --AEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPG----IVPTTPD 149 (419) Q Consensus 76 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~----~i~~~~~ 149 (419) ......+.... .....+...........+.... . ..++..-+.. +.+..+.. ..+..+- T Consensus 84 ~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~ra~~t----------~-~~gg~liP~~-~~~~Ii~~~~~~~~l~~l~ 148 (421) T protein:vir:13 84 INGDSKEEKRSLQ---LSAMSKTIRGIQLSEEERDIMS----------S-TNNGAVIPQE-FVNEFEKLKEGYPSLKEHC 148 (421) T ss_pred cccchhHHHHHHH---HHHHHHhhhccchhHHHhhccc----------c-CCcceecchh-hHHHHHHHHHhhhhhhhhc Confidence 11111111111 1111111111111111111100 0 1111111111 11111111 1111111 Q ss_pred hhhhHHhhcceeccc--CcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhH Q lcl|Aclame:pro 150 LPLLVADLLDQQNAD--YNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN 227 (419) Q Consensus 150 ~~~~l~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~ 227 (419) ...++......+|+. ...-.+-.. . ....+.+. .......++.--.+...-.-..--+.+.-..-. T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~---------~--E~~~~~~s-~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~ 216 (421) T protein:vir:13 149 HVIPVNRNAGKMPVRAGASVDKLANL---------A--KDTELVKA-MLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFL 216 (421) T ss_pred eeeeccCCceEEEEeecCCccceeec---------c--cccccccc-ccceeEEEeeeeeeEeehhhhHHHHhhhHHHHH Confidence 111111111112211 111001000 0 11111221 112223333333333221111111111111101 Q ss_pred HHHHHHHHHHHHHHHHHHHH---HHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCc Q lcl|Aclame:pro 228 SQLMGYIQGRLTYGLRFLRD---RQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD 304 (419) Q Consensus 228 ~~~~~~i~~~l~~a~~~~~d---~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 304 (419) ..|...|.+.+...+-..+- ..++...+......|...-............=......+. .+..+.+.... T Consensus 217 ~~i~~~la~~~~~~~~~~i~~~~~g~~~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~----~l~~lkd~~G~-- 290 (421) T protein:vir:13 217 EFVNEEFAEFAVNTENAEIVKQAKAVLAEETINDYAGLVKTINSLVPNARKRAIIVTNSDGRA----YLDGLMDKQGR-- 290 (421) T ss_pred HHHHHHHHHHHHHHhhhhHhhhhhhccccccccchHHHHHHHHHhhhhhcCCCEEEEcHHHHH----HHHHhhcCCCc-- Confidence 12223333333333332222 2333434433333332211100000000000011122222 23334443222 Q ss_pred EEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEecCCCCcCcEEEEeccceEE------EEEecceEEEE Q lcl|Aclame:pro 305 GVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGAT------LWSRQGITVLM 378 (419) Q Consensus 305 ~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~d~~~~~~------~~~~~~~~i~~ 378 (419) +++.+..... -..=.|.+..+..+...+.. .-.++++-++ ...+++++....-+ .|.+..+.+.. T Consensus 291 -~i~~~~~~~~---~~tl~G~pV~~~~~~~~~~~---~~~~~~~gd~--~~~~~~~~~~~~~v~~~~~~~f~~~~~~~r~ 361 (421) T protein:vir:13 291 -PLLKELSDGG---DLVFKGRPVIELEESIFDVG---DETKFIVSDF--KTLIKFMDRKQYLIDQSKEAGYTKNETIARI 361 (421) T ss_pred -eeecCcCCCC---CceecceeeEEeccccccCC---CceEEEEEec--cccEEEEEecceEEEeecccccccCeeEEEE Confidence 3333211000 00001222221111111100 0122333222 11133333332111 12222222222 Q ss_pred eecccchhhcCcEEEEEEEEeccEEecccceEEEEecCCCC Q lcl|Aclame:pro 379 TDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) Q Consensus 379 ~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~aa~~ 419 (419) .. .-|...+.-....-+.+..+.+|+...-++++| T Consensus 362 ~~------r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~~~~ 396 (421) T protein:vir:13 362 IE------RFDVNSPLDKSSDAEKIRKFGVIVKLQEVLKSS 396 (421) T ss_pred Ee------eecceeecchhhheeeecccceeeccccccCCC Confidence 11 112223333334456777889999986655555 No 250 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=40.07 E-value=0.99 Score=20.61 Aligned_cols=368 Identities=9% Similarity=0.017 Sum_probs=131.9 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH---HHHHHHHHHHHHHHHhhcccccccccchhhhhhHHHHhHHHHHHHH Q lcl|Aclame:pro 23 LTTEQVQEIVAEARGLADAL-QAESDRAAAR---AALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLREYR 98 (419) Q Consensus 23 ~~~~~~~~~~~e~~~~~~~~-~~~~~~~~~~---~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 98 (419) -..+ .|.++=..+.+.- .-++.....+ ..-|+++.+.......+. -... ...+ .+..+. T Consensus 1 ~~~~---~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~-------~~~~-~~~~------~~~~~~ 63 (534) T protein:vir:10 1 MSKK---SLLKKWQPLVESEGMPAIASMKRKDIVARIFENQDEDIAHNEGGV-------YTDQ-VVVN------SMVDVK 63 (534) T ss_pred Cchh---HHHHHhHHhhcCCccccccchhhhhhhhhhhhhHHHHHhhhcccc-------cchh-hhhh------hhhccc Confidence 0000 0111111111100 0011111110 001111111110000000 0000 0000 000000 Q ss_pred HhhhhhhhhHHHHHHHHHHhhh---cccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeecc Q lcl|Aclame:pro 99 ARDKRGQFQVEMRDIDPNRLLS---RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTS 175 (419) Q Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 175 (419) ..... ..+.+......+. .....+..+......-|.+++ +.+..-......+++.+-||.++..-+.-+.. T Consensus 64 ~~~~~----~~l~ea~~~~~~g~~~~~ia~s~~s~~v~~~~P~Li~--lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRs 137 (534) T protein:vir:10 64 GRIEE----ARLAEANIGGDHGYDATKIASGETSGSITNVGPAVMG--LVRRAIPQLIAFDICGVQPMTSSTGQVFTLRA 137 (534) T ss_pred cchhh----ccccccccccccccccccccccccccccccccchhhh--HHHHHHHhhhhhhhheeccCCchhhhheeeee Confidence 00000 0000000000000 000011111111122233222 11212234455678888888877543322221 Q ss_pred ccc--e-----------------eccccc---------------------------------c----------------- Q lcl|Aclame:pro 176 GTA--G-----------------AGSTWN---------------------------------K----------------- 186 (419) Q Consensus 176 ~~~--~-----------------~~~~~~---------------------------------~----------------- 186 (419) ... . .+++.. . T Consensus 138 rY~n~~~~~s~~EAf~ne~~adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~ 217 (534) T protein:vir:10 138 IYGGNSQDANAREAFHPTYGPDADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPAD 217 (534) T ss_pred eecCCCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCC Confidence 100 0 000000 0 Q ss_pred ------------------------ceeecC---------cccccccccceeeEEeeeEEEEEeehhhHHHHhhHH----- Q lcl|Aclame:pro 187 ------------------------AAVVPE---------GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS----- 228 (419) Q Consensus 187 ------------------------a~~v~E---------g~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~----- 228 (419) ..-.+| +..+++...++++++..++.-+-...+|-||.+|-- T Consensus 218 ~~~ag~~~~~~~~~~~~y~~~~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGL 297 (534) T protein:vir:10 218 QTEAGLAYKWLLANGYAVETSSAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGL 297 (534) T ss_pred ccccccccccccccccceecccccchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCC Confidence 000112 123666777888888888888888999999999852 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcc----Cccccccee---ccccccccccccccc-cchhhhHHHHHH----HHHHhh Q lcl|Aclame:pro 229 QLMGYIQGRLTYGLRFLRDRQLLNGN----GSTEMQGIL---TTPGIGTYQQPKPTA-PATDEPPLVDIR----RAKTVA 296 (419) Q Consensus 229 ~~~~~i~~~l~~a~~~~~d~~il~G~----g~~~p~Gi~---~~~~~~~~~~~~~~~-~~~~~~~~~~~~----~~~~~~ 296 (419) |.++.|.+-|+..|...||+.||.-= -.++-.|+- ...|+.......... .-.....+..+. +.-+.+ T Consensus 298 DAEtELsNILSTEImlEINReii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i 377 (534) T protein:vir:10 298 DADSELSSILANEIMHEINREMVLWINATAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEI 377 (534) T ss_pred ChHHHHHHHHHHHHHHHhhHHHHHHHhhhhheeecccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHH Confidence 57899999999999999999888621 111111111 111221111111100 001111122221 222222 Q ss_pred --hhhccCCcEEEEehHHHHHHHHH--hcc---CCceeccCCccccC-CCcccc-cceeEecCCCCcCcEEEEeccce-- Q lcl|Aclame:pro 297 --EIAGFPPDGVVVHPQDWESIELD--QAP---GSGVFRVIANVQGE-ATPRIW-GLNVVSTVAIAQGTALVGGFRQG-- 365 (419) Q Consensus 297 --~~~~~~~~~~~~~~~~~~~l~~~--kd~---~g~~~~~~~~~~~~-~~~~l~-G~pv~~~~~~~~~~~~~~d~~~~-- 365 (419) .......+.+++|+++...|... .+. .|...-...+.+.. ..++|. |++|+++...+.+.+++|--... T Consensus 378 ~~~T~rg~~n~~v~S~~Va~~L~~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~ 457 (534) T protein:vir:10 378 ARQTGRGQGNFIICSRNVAAALGHTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVYIDQYAVEDYFTVGYKGASEM 457 (534) T ss_pred HHhhccccccEEEEchhHHHHHhhccchhccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCccc Confidence 22234567789999999888642 110 11110111111110 123444 68999999999877666532110 Q ss_pred ---EEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccc-------eEEEEecCCCC Q lcl|Aclame:pro 366 ---ATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKA-------FVRVTFAAATT 419 (419) Q Consensus 366 ---~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a-------~~~~~~~aa~~ 419 (419) .++..--.+.... ..+...|+ . .+-+..|++..+ +|-+ +.++.-. .+. T Consensus 458 ~~glfyaPYv~l~~~~-~~dp~sfq-P--~~g~~tRY~l~~-NP~~~~~~~~~~~~i~~g-~~~ 515 (534) T protein:vir:10 458 DAGLYYCPYVALTPLR-GTDPKNFQ-P--VLGFKTRYGVKL-HPMADATQNKGFAKISNG-MPQ 515 (534) T ss_pred ccceeecccccccccc-ccCCcccc-c--eeeeeeeeceee-cCcccccCCccccccccC-Ccc Confidence 0010000000000 11222343 2 233444665543 4421 1111110 011 No 251 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=36.83 E-value=1.2 Score=20.25 Aligned_cols=117 Identities=15% Similarity=0.088 Sum_probs=10.7 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHH--HHHHHHHHHHHH-----HHHHHHHHhh Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADA----LQAESD--RAAARAALLRTA-----PPAPKGPADG 69 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~----~~~~~~--~~~~~~~~l~~~-----~~~~~~~~~~ 69 (419) .+....++++.++.+....+.....++.+...+..+...+. .+.+.. +++.+..+++.. ....+..... T Consensus 576 ~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E~q~~q~e~e~~~~~~~~~~~e~~~~~ 655 (705) T protein:vir:88 576 WTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELKKQEAVLQQREMALKE 655 (705) T ss_pred hhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111222222222111111111111100000000000000 000000 000000000000 0000000000 Q ss_pred cc-cccccccchhhhhhHHHHhHHHHHHHHHhhhh-hhhhHHHHHHHHHHhhhc Q lcl|Aclame:pro 70 GT-PLTPAEAGTFRSLAQRFADSDGLREYRARDKR-GQFQVEMRDIDPNRLLSR 121 (419) Q Consensus 70 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 121 (419) .. ..........+...+.....+ ........ .............. ..+ T Consensus 656 a~~~~~~~~~e~e~~~~e~e~~~e---~~q~~~~~~~~~~~~~~~k~~~~-~rr 705 (705) T protein:vir:88 656 AELQLERDRFTWERARNEAEYHLE---ATQARAAYIGDGKVPETKKPTKA-VRR 705 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHhHHHHHHHHHH-hcC Confidence 00 000000000000000000000 00000000 00000000000000 011 No 252 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=31.95 E-value=1.5 Score=19.68 Aligned_cols=356 Identities=9% Similarity=-0.049 Sum_probs=97.7 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--c Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTT--EQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTP--A 76 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~--~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~--~ 76 (419) .+.+.+|++.++++.++.+++.... ++.....++.++.+++++.++++++++++..+.................. . T Consensus 11 ~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (400) T protein:vir:38 11 KKQLDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSGKKPDHPEEHS 90 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhh Confidence 2233344444444433333322221 22233456677788889999999888887766665544333322211111 1 Q ss_pred ccchhhhhhHHHHh-HHHHH----HHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhh----HHHHHh Q lcl|Aclame:pro 77 EAGTFRSLAQRFAD-SDGLR----EYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVP----GIVPTT 147 (419) Q Consensus 77 ~~~~~~~~~~~~~~-~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~----~~i~~~ 147 (419) .............. ..... ...............+.. .........++...+.. +.+..+. ...+.. T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~gg~~vP~~-~~~~ii~~~~~~~~l~~ 166 (400) T protein:vir:38 91 YRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDA---VNAGVKAADAASTIPET-ISNTPQRELQTVVDLKP 166 (400) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHH---HhhcccccCCcccccHH-HHHHHHHHHHhhhhhhh Confidence 11111111111111 11111 111111111111111111 11111222233333321 1122211 111111 Q ss_pred hhhhhhHHhhcceecc---cCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHH Q lcl|Aclame:pro 148 PDLPLLVADLLDQQNA---DYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAA 224 (419) Q Consensus 148 ~~~~~~l~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell 224 (419) .-...++...-..+|+ .++...+..+.+ -..+.........++.--.+...-.-..--+.+... T Consensus 167 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~-------------~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~ 233 (400) T protein:vir:38 167 FTNVFQASTQKGTYPTVANATTKMVTVAELE-------------KNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAI 233 (400) T ss_pred cceeEeccCcceEEEEEecCCCccccccccc-------------cccccccccceeeEeehhheeeehhhHHHHHhhhHH Confidence 1111111111112222 222222221111 111211222223344333333211111111222111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhccCc--ccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccC Q lcl|Aclame:pro 225 DDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGS--TEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFP 302 (419) Q Consensus 225 ~d~~~~~~~i~~~l~~a~~~~~d~~il~G~g~--~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (419) .--..+...+.+.+..++..++-...=.|.+. ....++....... ........-......+.. +..+.+.... T Consensus 234 ~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~a~~v~~~~~~~~----l~~lkd~~G~ 308 (400) T protein:vir:38 234 DLVGLIAQNGQQIKVNTTNGAVATLLKGFTAKTISSVDDLKHINNVD-LDPAYSRVIIASQSFYNF----LDTVKDGNGR 308 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccccccccccccHHHHHHHHHhh-hhhhhCcEEEEcHHHHHH----HHHhhccCCC Confidence 11123444455555555555554333333322 2222222211100 000000111112222222 3334443322 Q ss_pred CcEEEEehHHHHHHHHHhccCCceeccCCccccCCCc---ccc-------------cceeEecCCCCcCcEEEEeccceE Q lcl|Aclame:pro 303 PDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATP---RIW-------------GLNVVSTVAIAQGTALVGGFRQGA 366 (419) Q Consensus 303 ~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~---~l~-------------G~pv~~~~~~~~~~~~~~d~~~~~ 366 (419) +++.|.....-- ..=.|.++.+..+...+..+ -|+ |+.|..+++.... ..+ T Consensus 309 ---~i~~~~~~~~~~--~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~--------~~~ 375 (400) T protein:vir:38 309 ---YLLQDSILTPSG--KSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMVRWVDDQIYG--------QFL 375 (400) T ss_pred ---eeeecCcCCCCc--cccccceeEEecccccCCCCceEEEEEeccccEEEEeecceEEEEecccccc--------eeE Confidence 344332100000 00012222211110000000 011 1111111111110 111 Q ss_pred EEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccc Q lcl|Aclame:pro 367 TLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKA 408 (419) Q Consensus 367 ~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a 408 (419) ..+.|-+..+. ..+ +|.... +-|.| T Consensus 376 ~~~~r~d~~~~---------~~~--a~~~l~------~~~~a 400 (400) T protein:vir:38 376 QAGMRFGVSVA---------DEK--AGYFLT------YTPKA 400 (400) T ss_pred EEEEEeccEEe---------ccc--ceEEEE------eecCC Confidence 11111111111 111 111111 12333 No 253 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=31.53 E-value=1.5 Score=19.63 Aligned_cols=361 Identities=13% Similarity=0.040 Sum_probs=127.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH---HHHHHHHHHHHHHHHhhcccccccccchhhhhhHHHHhHHHHHH Q lcl|Aclame:pro 21 TSLTTEQVQEIVAEARGLADAL-QAESDRAAAR---AALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLRE 96 (419) Q Consensus 21 ~~~~~~~~~~~~~e~~~~~~~~-~~~~~~~~~~---~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (419) +....++ |.++=..+.+.- .-++.....+ ..-|+++.+.... ....+ ...+.+ T Consensus 1 ~~~~~~~---l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~----~~~~~----------------~~~~~e 57 (529) T protein:vir:10 1 MSLKNKE---ILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKS----DPVYR----------------DDKLIE 57 (529) T ss_pred CccchHH---HHHHhhHhhcCCccchhccchhhhhhhhhhhhhHHHHhc----ccccc----------------hhhhhh Confidence 1000111 111111111100 0000000000 0001111100000 00000 000000 Q ss_pred HHHhhhhhhhhHHHHHHHHHHhh---hcccccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeee Q lcl|Aclame:pro 97 YRARDKRGQFQVEMRDIDPNRLL---SRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRD 173 (419) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 173 (419) .... .+.+......+ ......+..+......-|.+++ +.+..-......+++.+-||.++..-|.-+ T Consensus 58 ~~~~--------~l~e~~~~~~~~~~~~~i~~st~t~~v~~~~P~Li~--lvRra~p~LIa~DIwGVQPMTgPTGLIFAM 127 (529) T protein:vir:10 58 AFGQ--------SLMEAEVAGDHGYDPTNIAAGQSSGAITNIGPAVIG--MVRRAIPSLIAFDIAGVQPMTGPTGQVFAL 127 (529) T ss_pred hhhc--------cchhhcccccccccccccccccccccccccCchhhh--hHHHHHHhhhhhhhheeccCCchhhhhhee Confidence 0000 00000000000 0000011111111222233222 112222344555778888887665332211 Q ss_pred ccccce--------------------------------------------------------------e----------c Q lcl|Aclame:pro 174 TSGTAG--------------------------------------------------------------A----------G 181 (419) Q Consensus 174 ~~~~~~--------------------------------------------------------------~----------~ 181 (419) ...... . . T Consensus 128 RsrY~~~~~~~~~~eaf~~~~~pda~~sga~~~ga~t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~~ 207 (529) T protein:vir:10 128 RSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVT 207 (529) T ss_pred eeeecCCcccccccccccccccccccccccccccccccccccccccccccccccccccceeeecccCceeeccccccccc Confidence 110000 0 0 Q ss_pred ccc-------------------------ccceeecC---------cccccccccceeeEEeeeEEEEEeehhhHHHHhhH Q lcl|Aclame:pro 182 STW-------------------------NKAAVVPE---------GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN 227 (419) Q Consensus 182 ~~~-------------------------~~a~~v~E---------g~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~ 227 (419) .+. +-..-.+| +..+++...+++++++.++.-+-...+|-||.+|- T Consensus 208 ~g~~~t~~~~~~~~~~~~a~~~~~~~~~GmsTa~aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDL 287 (529) T protein:vir:10 208 VGTNETGEALDKLINAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDL 287 (529) T ss_pred cCccccCcccccccccccccccccccccchhhhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHH Confidence 000 00000112 12356667778888888888888899999999985 Q ss_pred H-----HHHHHHHHHHHHHHHHHHHHHHHhcc----Ccccccceec---cccccccccccccc-cchh----hhHHHHHH Q lcl|Aclame:pro 228 S-----QLMGYIQGRLTYGLRFLRDRQLLNGN----GSTEMQGILT---TPGIGTYQQPKPTA-PATD----EPPLVDIR 290 (419) Q Consensus 228 ~-----~~~~~i~~~l~~a~~~~~d~~il~G~----g~~~p~Gi~~---~~~~~~~~~~~~~~-~~~~----~~~~~~~~ 290 (419) - |.++.|.+-|+..|...||+.||.-= -.++..|+-+ ..|+.......... .-.. ..++-.+- T Consensus 288 KAVHGLDAEtELsNILStEImlEINReii~~l~~~a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~ 367 (529) T protein:vir:10 288 RAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQID 367 (529) T ss_pred HHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHH Confidence 2 57899999999999999999888621 0011111111 11222221111000 0001 11222222 Q ss_pred HHHHhhh--hhccCCcEEEEehHHHHHHHHH--hccCCceeccCCccccC----CCcccc-cceeEecCCCCcCcEEEEe Q lcl|Aclame:pro 291 RAKTVAE--IAGFPPDGVVVHPQDWESIELD--QAPGSGVFRVIANVQGE----ATPRIW-GLNVVSTVAIAQGTALVGG 361 (419) Q Consensus 291 ~~~~~~~--~~~~~~~~~~~~~~~~~~l~~~--kd~~g~~~~~~~~~~~~----~~~~l~-G~pv~~~~~~~~~~~~~~d 361 (419) ++.+.+. ..+...+.+++|+++...|... .......-...+...+. ..+.|. |++|+++...+.+.+++|- T Consensus 368 ~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~ 447 (529) T protein:vir:10 368 KEANEIARQTGRGAGNFIIASRNVVSALALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGY 447 (529) T ss_pred HHHHHHHHhhccccceEEEEchHHHHHHHhhcccccccccccccccccccCCceEEEEecCceEEEecCCCCcceEEEEE Confidence 3333332 3333566788999999888742 11000000011111111 123443 5899999998887766653 Q ss_pred ccce-----EEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceEEEEe-------c------CCCC Q lcl|Aclame:pro 362 FRQG-----ATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTF-------A------AATT 419 (419) Q Consensus 362 ~~~~-----~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~-------~------aa~~ 419 (419) -... .++..-........ .+...|+ . .+-+..|++..+ +|-+-..-.. . +-.+ T Consensus 448 KG~~~~~~glfy~PYv~l~~~~~-~dp~sfq-P--~~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ag~n 518 (529) T protein:vir:10 448 RGANNLDAGIYYCPYVALTPLRG-FDPKNFQ-P--VMGFKTRYAIGV-NPFAESRTQAPQGRITSGMPGVNSVGKN 518 (529) T ss_pred eCCcccccceeeccccccccccc-cCCCccc-c--eeeeeeeeceee-cCccccccccccccccCCcchhhhcCcc Confidence 2110 00000000000000 1112232 2 222334554432 3322111100 0 0000 No 254 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=28.60 E-value=1.7 Score=19.27 Aligned_cols=361 Identities=11% Similarity=0.029 Sum_probs=129.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHH-HH-HHHHHHHHHHHHHHhhcccccccccchhhhhhHHHHhHHHHHH Q lcl|Aclame:pro 21 TSLTTEQVQEIVAEARGLADA--LQAESDRAAA-RA-ALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLRE 96 (419) Q Consensus 21 ~~~~~~~~~~~~~e~~~~~~~--~~~~~~~~~~-~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (419) +. ....+.|.++=..+.+. +. ++..-.. -. .-|++++... +.... ..+ .....+ T Consensus 1 ~~--~~~~~~l~~kw~p~l~~~~~~-~i~~~~~~~~a~~~enq~~~~----~~~~~-----------~~~----~~~~~~ 58 (521) T protein:vir:72 1 MT--IKTKAELLNKWKPLLEGEGLP-EIANSKQAIIAKIFENQEKDF----QTAPE-----------YKD----EKIAQA 58 (521) T ss_pred CC--cchhHHHHHhhhhhhccCCCC-ccccchhhhhhhhhhhhhhhh----hhccc-----------ccc----hHHHHH Confidence 00 00011111111111110 00 0000000 00 0011110000 00000 000 000000 Q ss_pred HHHhhhhhhhhHHHHHHHHHHhhh--cc-cccccccCCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeee Q lcl|Aclame:pro 97 YRARDKRGQFQVEMRDIDPNRLLS--RD-APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRD 173 (419) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 173 (419) +... +.+......+. .. ...+..+......-|.+++ +.+..-......+++.+-||.++..-|.-. T Consensus 59 ~~~~---------l~e~~~~~~~~~~~~~iaes~~t~~v~~~~P~Li~--lvRra~p~LIa~DIwGVQPMTgPTGLIFAM 127 (521) T protein:vir:72 59 FGSF---------LTEAEIGGDHGYNATNIAAGQTSGAVTQIGPAVMG--MVRRAIPNLIAFDICGVQPMNSPTGQVFAL 127 (521) T ss_pred Hhhh---------hhhhcccCccccCcccccccccccccccCCchhhh--HHHHHHhhhhhhhceeeccCCchhhhheee Confidence 0000 00000000000 00 0011111111112232222 111122344555788888888775433221 Q ss_pred ccccc-------------------eeccc---------------------------------------------ccc--- Q lcl|Aclame:pro 174 TSGTA-------------------GAGST---------------------------------------------WNK--- 186 (419) Q Consensus 174 ~~~~~-------------------~~~~~---------------------------------------------~~~--- 186 (419) ..... ..+++ ... T Consensus 128 RsrY~~q~~~~~g~ea~~~e~~~da~fSG~~~~~~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~ 207 (521) T protein:vir:72 128 RAVYGKDPVAAGAKEAFHPMYGPDAMFSGQGAAKKFPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAK 207 (521) T ss_pred eeeecCCCCCcccccccchhcccccccccccccccccccccccccccccccccccccccccccccccccccCCCCCCccc Confidence 11100 00000 000 Q ss_pred ---------------------ceeecC---------cccccccccceeeEEeeeEEEEEeehhhHHHHhhHH-----HHH Q lcl|Aclame:pro 187 ---------------------AAVVPE---------GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-----QLM 231 (419) Q Consensus 187 ---------------------a~~v~E---------g~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~-----~~~ 231 (419) ..-.+| +...++...++++++..++.-+-...+|-||.+|-- |.+ T Consensus 208 t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAE 287 (521) T protein:vir:72 208 LDAEIKKQMEAGALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDAD 287 (521) T ss_pred cccccccccccCceeeeecccchhhhhhhcccCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChH Confidence 001122 122566667778888888888888899999999852 578 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCc---ccccceecc----ccccccccccccc-cchhhh----HHHHHHHHHHhhh-- Q lcl|Aclame:pro 232 GYIQGRLTYGLRFLRDRQLLNGNGS---TEMQGILTT----PGIGTYQQPKPTA-PATDEP----PLVDIRRAKTVAE-- 297 (419) Q Consensus 232 ~~i~~~l~~a~~~~~d~~il~G~g~---~~p~Gi~~~----~~~~~~~~~~~~~-~~~~~~----~~~~~~~~~~~~~-- 297 (419) +.|.+-|+..|...||+.||.=--. -+..|+... .|+.......... .-.... ++-.+-.....+. T Consensus 288 tELaNILSTEImlEINReii~~i~~sa~~g~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~ 367 (521) T protein:vir:72 288 AELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQ 367 (521) T ss_pred HHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHh Confidence 9999999999999999999841100 122333221 1211111111000 001111 1222222222222 Q ss_pred hhccCCcEEEEehHHHHHHHHHh--c-cCC---ceeccCCccccCCCccc-ccceeEecCCCCcCcEEEEeccce----- Q lcl|Aclame:pro 298 IAGFPPDGVVVHPQDWESIELDQ--A-PGS---GVFRVIANVQGEATPRI-WGLNVVSTVAIAQGTALVGGFRQG----- 365 (419) Q Consensus 298 ~~~~~~~~~~~~~~~~~~l~~~k--d-~~g---~~~~~~~~~~~~~~~~l-~G~pv~~~~~~~~~~~~~~d~~~~----- 365 (419) ..-...+.+++|+++...|...- + +.+ ..-|.......-..+.| .|++|+++...+.+.+++|--... T Consensus 368 T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~g 447 (521) T protein:vir:72 368 TGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATGFSTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAG 447 (521) T ss_pred cccccceEEEEchHHHHHHhhcccccccccccccccccccCCCceEEEEccCceEEEecCCCCcceEEEEEeCCcccccc Confidence 22245567899999988887531 1 000 00011111111112234 368999999998887766632110 Q ss_pred EEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecccceE-------EEE-----ecCCCC Q lcl|Aclame:pro 366 ATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFV-------RVT-----FAAATT 419 (419) Q Consensus 366 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~-------~~~-----~~aa~~ 419 (419) .++..--.+..... .+...|+ . .+-+..|++..+ +|-+-. +++ ..+.-+ T Consensus 448 lfyaPYv~l~~~~~-~dp~sfq-P--~~g~~tRY~l~~-NP~~~~~~~~~a~~i~~~~~~~~a~~~ 508 (521) T protein:vir:72 448 IYYAPYVALTPLRG-SDPKNFQ-P--VMGFKTRYGIGI-NPFAESAAQAPASRIQSGMPSILNSLG 508 (521) T ss_pred eeeccccccccccc-cCCcccc-c--eeeeeeeeceee-cCcccccCcccceeecCcChhhhcCcc Confidence 01100000111110 1122343 2 233344555433 342111 110 001111 No 255 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=26.69 E-value=1.9 Score=19.03 Aligned_cols=377 Identities=10% Similarity=-0.004 Sum_probs=96.5 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 1 MPPTPTLEEQRAALLARLDDTSLTTEQV-------QEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPL 73 (419) Q Consensus 1 M~~~~~L~e~~~~l~~~~~~~~~~~~~~-------~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 73 (419) |..+.+|++++.++.++++++....++. .+..++.+..++.++++.+++..++...+............ ... T Consensus 4 ~m~i~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 82 (408) T protein:vir:74 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK-GPL 82 (408) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-ccc Confidence 6688999888888777776655543221 12234555666677777777776665544433222111111 111 Q ss_pred cccccchhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhhcccccccccCCcccccchhhhHHHHHh-hhhhh Q lcl|Aclame:pro 74 TPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTT-PDLPL 152 (419) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~-~~~~~ 152 (419) ..............+.. +...........+.+.. .......++..-+.. +.+..+....... +.... T Consensus 83 ~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~a~-----~~~~~~~gg~~vP~~-~~~~Ii~~~~~~~~l~~~~ 150 (408) T protein:vir:74 83 NKSENELKDKFVKDFVN------MVRNPMAFLNTVSSKTE-----TSGSDSAAGLTIPQD-IRTMINTLVRQYDSLQQYV 150 (408) T ss_pred cchhhhhHHHHHHHHHH------HHhcchhhhhhhhhhhh-----cccccCCCceeechh-HhhHHHHHHhhhcchhhhc Confidence 11111111111111111 11111111111111111 111111122222221 1222222111111 01111 Q ss_pred ---hHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhhHHH Q lcl|Aclame:pro 153 ---LVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNSQ 229 (419) Q Consensus 153 ---~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d~~~ 229 (419) ++-.....+++.. + .+......+... .+-..|.........++.--++...-.-..--+.+.-..-... T Consensus 151 ~~~~~~~~~~~~~~~~----~--~~~~~~~~~v~E--~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~ 222 (408) T protein:vir:74 151 RVESVSTSSGSRVYEK----W--TDVTPLKAMDEE--DGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAW 222 (408) T ss_pred ceeeccCCcceEEEEe----e--cCCccccccccc--ccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHH Confidence 1111111111100 0 000111111111 1112333333333444444443332221111121111111112 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcc---Ccccccceeccc--cccccccccccccchhhhHHHHHHHHHHhhhhhccCCc Q lcl|Aclame:pro 230 LMGYIQGRLTYGLRFLRDRQLLNGN---GSTEMQGILTTP--GIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD 304 (419) Q Consensus 230 ~~~~i~~~l~~a~~~~~d~~il~G~---g~~~p~Gi~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 304 (419) +.+.|...++..+..++=.--=.|. +.....++...- .+... ......-......+. .+..+..... T Consensus 223 i~~~l~~~~~~~~d~~il~G~G~~~~~~~~~~~~~i~~~~~~~l~~~-~~~~a~~v~n~~~~~----~l~~lkd~~G--- 294 (408) T protein:vir:74 223 LSSWIAKKVVVTRNQAIIAAMGTVPKKPTIANFDDVITMINTSVDPA-IIATSSLLTNQSGLN----KLALVKTAEG--- 294 (408) T ss_pred HHHHHHHHHHHHHHHHHhhcccccccccccccHHHHHHHHHHhhhhh-hcCCCEEEEcHHHHH----HHHHhhcCCC--- Confidence 3333333333333333222111111 111222222110 00000 000000011122222 2333443332 Q ss_pred EEEEehHHHHHHHHHhccCCceeccCCccccCCCccccc-ceeEecCCCCcCcEEEEeccceEEE--------EEecceE Q lcl|Aclame:pro 305 GVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWG-LNVVSTVAIAQGTALVGGFRQGATL--------WSRQGIT 375 (419) Q Consensus 305 ~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G-~pv~~~~~~~~~~~~~~d~~~~~~~--------~~~~~~~ 375 (419) .+++.+.....-- ..=.|.+..+..+..-+... .+ .++++-++ ...+++++....-+- +...... T Consensus 295 ~~l~~~~~~~~~~--~~l~G~pV~~~~~~~~~~~~--~~~~~i~~gd~--~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~ 368 (408) T protein:vir:74 295 KYLLEPDPTKPNS--YLIKGKQVIVVADRWLPNSG--STVYPLYYGDM--SQAITLFDRENMSLLPTNIGAGAFETDTTK 368 (408) T ss_pred ceEeccCcCCCCC--ceecceeeEEecCccccccc--CCcceEEEEeh--hccEEEEEecceEEEEeccccchhhcceee Confidence 2343332100000 00012222211110000000 01 11222211 011122221111110 1111111 Q ss_pred EEEeeccc-chhhcCcEEEEEEEEeccEEecc--cceEEEEecCC Q lcl|Aclame:pro 376 VLMTDSHA-DFFTANTLVILAEFRANLAVYQP--KAFVRVTFAAA 417 (419) Q Consensus 376 i~~~~~~~-~~~~~~~~~~r~~~r~d~~~~~~--~a~~~~~~~aa 417 (419) +......+ .....+ +|+ .+.+...-+ .++..-+.++. T Consensus 369 ~r~~~r~d~~~~~~~--a~~---~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:74 369 IRVIDRFDVKATDSE--ALV---AGSFTAIADQVGNFKTTTSTAV 408 (408) T ss_pred EEEEEeeCcEEeccc--ceE---EEEeecccCCCCCCCCCccccC Confidence 11111000 001111 121 222222221 11111111111 No 256 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=24.37 E-value=2.2 Score=18.72 Aligned_cols=335 Identities=12% Similarity=0.031 Sum_probs=123.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc--chhhhhhHHHHhHHHHHHHHHhhhhhhhhHHHHHHHHHHhh Q lcl|Aclame:pro 42 LQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEA--GTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLL 119 (419) Q Consensus 42 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 119 (419) +.. +.|. ++...-......... ...+.+...+.+. ..+..+ +.+........ T Consensus 1 ms~---------~~l~------~~w~~~l~~~~~~~i~~~~~~~~~~~~~en------q~~~~~-----~~~~~l~ea~~ 54 (462) T protein:vir:10 1 MSI---------QQLQ------EKWAPVLNHESVPEIKDSYKKGVVAQLLEN------QENAIR-----EEGQVLNETLQ 54 (462) T ss_pred Cch---------HHHH------HHhhhhhcccccchhhhhhHHHHHHHHhhh------HHHHHH-----hcccchhcccc Confidence 000 0111 111111111110000 0001111111110 000000 00111111111 Q ss_pred hccccccccc-CCcccccchhhhHHHHHhhhhhhhHHhhcceecccCcceeeeeecccc------------ce------- Q lcl|Aclame:pro 120 SRDAPAGTIT-NPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGT------------AG------- 179 (419) Q Consensus 120 ~~~~~~~~~~-~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~------------~~------- 179 (419) ........++ ......-|.+++ +.++.-...+..+++.+-||.++..-|.-..... .. T Consensus 55 ~~g~~~~~~~t~~~~~~~P~Li~--l~Rra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~~~~~~~nq~gtEAlfnEadt 132 (462) T protein:vir:10 55 TTGYTTGDTATGPVAGFDPVLIS--LIRRSMPQLIAYDVAGVQPMTGPTGLIFAMRSFYGSERRPANSDFREALFNEPNA 132 (462) T ss_pred ccCCCcCcccccccccccchhhh--HHHHHHhhhhhhcceeeecCCcchhhhheeeeeccCCccccccccchhhhccCCc Confidence 1111111111 111112233332 1222223445568888888888764332211110 00 Q ss_pred eccc----------------------c--------ccc-------------eeecC-------cccccccccceeeEEee Q lcl|Aclame:pro 180 AGST----------------------W--------NKA-------------AVVPE-------GTAKPQSTLSFDTITTT 209 (419) Q Consensus 180 ~~~~----------------------~--------~~a-------------~~v~E-------g~~~~~~~~~~~~v~~~ 209 (419) .+++ + ..+ .-.+| +...++...++++++.. T Consensus 133 ~fSg~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~GM~Ta~aE~lg~~s~n~~f~EMaFsIeK~tVt 212 (462) T protein:vir:10 133 GFSGGAGTGLSNYDPTASSSAVNDAEGANPGLLNDSPAGTYEVTGDATGMATATAEALDDSSASTAFREMGFSIEKVTVT 212 (462) T ss_pred CccccccccccccccccccccccccccccceeecCCCccceecccccccccchhccccCCccCCcchhhceeEEEEEEEe Confidence 0000 0 000 00111 12456777778888888 Q ss_pred eEEEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHHhcc----Ccccccceeccccccccccccccccc Q lcl|Aclame:pro 210 LKTVAHWLPITRQAADDNS-----QLMGYIQGRLTYGLRFLRDRQLLNGN----GSTEMQGILTTPGIGTYQQPKPTAPA 280 (419) Q Consensus 210 ~~k~~~~~~vs~ell~d~~-----~~~~~i~~~l~~a~~~~~d~~il~G~----g~~~p~Gi~~~~~~~~~~~~~~~~~~ 280 (419) ++.-+-...+|-||.+|-- |.++.|.+-|+..|...||+.||.-= -.++..|+ ...|+...... . T Consensus 213 AKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~l~~~a~~~k~~~~-~~~Gv~dl~~~-----~ 286 (462) T protein:vir:10 213 AKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEILAEINREVVRTIYVNAVKGAIANT-ATDGIFDLDVD-----S 286 (462) T ss_pred eeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhhheeeecccc-cccceeeeccc-----c Confidence 8888888899999999842 57899999999999999999888621 11121111 11121111111 1 Q ss_pred hhhhHHHHHHHHHHhh---------hhhccCCcEEEEehHHHHHHHHH---hcc---CCceec-cCCccccCCCcccc-c Q lcl|Aclame:pro 281 TDEPPLVDIRRAKTVA---------EIAGFPPDGVVVHPQDWESIELD---QAP---GSGVFR-VIANVQGEATPRIW-G 343 (419) Q Consensus 281 ~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~l~~~---kd~---~g~~~~-~~~~~~~~~~~~l~-G 343 (419) .+-..++.+..++..+ ...-...+.+++|+++...|... +-. .+.... ...+......+.|. | T Consensus 287 ~gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r 366 (462) T protein:vir:10 287 NGRWSVEKFKGLLFQIERDSNAIGQETRRGKGNILICSADVASALGMAGVLDYAPGLQGNSALTGVDDTSSTLVGTLNGR 366 (462) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHhccccceEEEEchhHHHHhhhccchhccccccccccccccccccceeEEEecCc Confidence 1112233333332222 23334556789999999887532 100 011000 00111111223444 5 Q ss_pred ceeEecCCCC----cCcEEEEeccceEEEEEecceEEEEeec---------ccchhhcCcEEEEEEEEeccEEecccceE Q lcl|Aclame:pro 344 LNVVSTVAIA----QGTALVGGFRQGATLWSRQGITVLMTDS---------HADFFTANTLVILAEFRANLAVYQPKAFV 410 (419) Q Consensus 344 ~pv~~~~~~~----~~~~~~~d~~~~~~~~~~~~~~i~~~~~---------~~~~~~~~~~~~r~~~r~d~~~~~~~a~~ 410 (419) ++|+++.... .+.+++|- + +-.-++ ..+-..++ +...|+ . .+-+..|++..+ +|-+-. T Consensus 367 ~~vy~D~Y~~~ns~~dy~~vG~-K-G~~~~~---~glfy~PYv~l~~~~~~dp~sfq-P--~~g~~tRY~l~~-NP~t~~ 437 (462) T protein:vir:10 367 IKVYVDPYSSNVADKHFYVAGY-K-GTSPYD---AGLFYCPYVPLQQVRAINPNTFQ-P--KIGFKTRYGMVS-NPFSGG 437 (462) T ss_pred eEEEEecccCCCcccceEEEEE-e-CCcccc---cceeeccccccccccccCCcccc-c--eeeeeeeeeeee-cCCCCC Confidence 8999997653 33333332 1 000000 01111111 112232 1 222333443322 332100 Q ss_pred EEEecCCCC Q lcl|Aclame:pro 411 RVTFAAATT 419 (419) Q Consensus 411 ~~~~~aa~~ 419 (419) .-.-.+..+ T Consensus 438 ~~~~~~~~~ 446 (462) T protein:vir:10 438 LTQGSGALT 446 (462) T ss_pred cCCcccccc Confidence 000000011 No 257 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=24.14 E-value=2.2 Score=18.69 Aligned_cols=375 Identities=7% Similarity=-0.038 Sum_probs=118.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccchhhhh Q lcl|Aclame:pro 5 PTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGTFRSL 84 (419) Q Consensus 5 ~~L~e~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (419) .+|+|.+++++++.++.....+...+..+...+....++++++++..+++.+++++...+.................... T Consensus 1 M~l~el~~~~~~~~~~~~a~l~~~~~~~~~~~ee~~~~~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~~~~~~~~ 80 (434) T protein:vir:62 1 MNLKEILNASLTRTKSRLAELQGKVEKNEVRSEELAAVKAEVEQLTKEIQTISEELAKLEEKEKEEDPAKKKDDDPEKKE 80 (434) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhhhhc Confidence 99999999998888776554443322222223445667788888888888887776655443322221111111011000 Q ss_pred hHHHHhHHHHHHHHHhhhhhhhhHHHHHH-HHH------H-hhhcc---ccccccc-------CCcccccchhhhHHHHH Q lcl|Aclame:pro 85 AQRFADSDGLREYRARDKRGQFQVEMRDI-DPN------R-LLSRD---APAGTIT-------NPNVPHLPQLVPGIVPT 146 (419) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~------~-~~~~~---~~~~~~~-------~~~~~~~p~~~~~~i~~ 146 (419) ..................+.......... ... . ..... ...+... +.+.....-.++..+.. T Consensus 81 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~GG~lvP~~~~~ 160 (434) T protein:vir:62 81 DPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGLVTGNGSVTIPDFLSK 160 (434) T ss_pred chhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhcccccccceecchhhHH Confidence 00000111111111111111110000000 000 0 00000 0000000 00000001122333222 Q ss_pred hhhhhhhHHhhcceecccCcceeeeeeccccceeccccccceeecCcccccccccceeeEEeeeEEEEEeehhhHHHHhh Q lcl|Aclame:pro 147 TPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADD 226 (419) Q Consensus 147 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~vs~ell~d 226 (419) . .+..+-...++..-.-.++.......+.......+.|+.+.++... .+....++...++..+-.-..--+.+ T Consensus 161 ~-----Ii~~l~~~~~i~~~~~~~~~~~~~~~p~~~~~~~a~~~~~~~e~~~--~~~~~~~f~~v~~~~~k~~~~~~iS~ 233 (434) T protein:vir:62 161 E-----IITYAQEENFLRRLGTGVKTKENIKYPVLVKKAEAQGHKNERTNNE--MPETDIEFDEIELSPTEFDALATVTK 233 (434) T ss_pred H-----HHHhhhhhhhhhhhcceeccCCceEEEEEecCCcccceeccccccc--ccccccceeeEEeeheeeEeehhhHH Confidence 1 2222212222211111123333334555666777888887765443 33456666666665443222111221 Q ss_pred H-HHHHH-HHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccccchhhhHHHHHHHHHHhhhhhccCCc Q lcl|Aclame:pro 227 N-SQLMG-YIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD 304 (419) Q Consensus 227 ~-~~~~~-~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 304 (419) . ..-.. -|...+...++..+-..+=. -|++..|.............. ... ... T Consensus 234 ell~ds~~~l~~~i~~~la~~~~~~~d~--------~~l~G~G~~~~~~g~~~~~~~---------------~~~--~~~ 288 (434) T protein:vir:62 234 KLLARTGLPIEQIVMDELKKAYVRKETQ--------YMVNGDEANNINDGALAKKAV---------------EFK--TDE 288 (434) T ss_pred HHHhcchHHHHHHHHHHHHHHHHHHHHH--------HHhccCCCCccccceeecccc---------------ccc--ccc Confidence 1 11122 36777777777777766532 233322211110000000000 000 000 Q ss_pred EEEEehHHHHHHHHH----hc---cCCceeccCCccccCC--CcccccceeEecC-CCCcCc--EEE------Eeccc-- Q lcl|Aclame:pro 305 GVVVHPQDWESIELD----QA---PGSGVFRVIANVQGEA--TPRIWGLNVVSTV-AIAQGT--ALV------GGFRQ-- 364 (419) Q Consensus 305 ~~~~~~~~~~~l~~~----kd---~~g~~~~~~~~~~~~~--~~~l~G~pv~~~~-~~~~~~--~~~------~d~~~-- 364 (419) ..++..|..+ .. .+.. ++..+...... -..=.|.|++... ....+. .+. .+... T Consensus 289 -----~~~~d~l~~l~~~l~~~~~~~a~-~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~ 362 (434) T protein:vir:62 289 -----KNLYDALVKMKNTPVKEVRKKAR-WVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIP 362 (434) T ss_pred -----cchhhHHHHHHhhcchhhhcCCE-EEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCc Confidence 0112222222 11 1111 11111100000 0001477776532 221111 111 11100 Q ss_pred ------eEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecc-cceEEEEecCCCC Q lcl|Aclame:pro 365 ------GATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQP-KAFVRVTFAAATT 419 (419) Q Consensus 365 ------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~-~a~~~~~~~aa~~ 419 (419) .+++++.....|- .......+......|....++++.+..- .+-.+. -+-+++ T Consensus 363 ~~~~~~~i~~Gdfs~~~i~-~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~-~~~~~~ 422 (434) T protein:vir:62 363 DSPDTPVFYFGDFSKFYIQ-DVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIH-SPFEVP 422 (434) T ss_pred cCCCceEEEEeeccceEEE-EeeceeEEEeehhhhcccCceEEEEEeeecceeec-Ccccce Confidence 0111111111000 0000000000000011111111111110 111111 122333 No 258 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=23.49 E-value=2.3 Score=18.60 Aligned_cols=261 Identities=13% Similarity=0.020 Sum_probs=106.6 Q ss_pred ccCCcccccchhhhHHHHHhhhhhhhHHhhcc------eecccCcceeeeeeccccceeccccccceeecCccccccccc Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPLLVADLLD------QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTL 201 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~l~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 201 (419) ++- ..-+.+...+.+.....+....++. +...+++.+++|+....... ....-+.+| .....+. T Consensus 1 Mai----n~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl-~dY~R~~g~-----~~g~v~~ 70 (285) T protein:vir:79 1 MTV----VLDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDA-TAYKRGQDN-----ARKTISV 70 (285) T ss_pred Ccc----hhhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccc-cccccccCc-----cccccce Confidence 110 1123344445444444333333322 23345678999986321111 111111111 1111122 Q ss_pred ceeeEEeeeEEEEEeehhhHHHH-hhHH--HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccccc Q lcl|Aclame:pro 202 SFDTITTTLKTVAHWLPITRQAA-DDNS--QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTA 278 (419) Q Consensus 202 ~~~~v~~~~~k~~~~~~vs~ell-~d~~--~~~~~i~~~l~~a~~~~~d~~il~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 278 (419) ++...++.-.+.-.+. |. .+- +.+. .+...+.+.......-.+|...+.-= ....+ ..... T Consensus 71 ~~et~tl~~DR~~~f~-iD-~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskl--------a~~a~------~~~~~ 134 (285) T protein:vir:79 71 GKETVKLTHEDWFGYD-LD-QFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKL--------FDSAA------KKATD 134 (285) T ss_pred eeeEEEeeccccceec-cc-ccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHH--------Hhhcc------ccccc Confidence 2233333322222211 11 010 1111 12222223233344445665544310 00000 00111 Q ss_pred cchhhhHHHHHHHHHHhhhhhccCCc-EEEEehHHHHHHHHHhccCCc----eeccCCccccCCCccccc-ceeEec--C Q lcl|Aclame:pro 279 PATDEPPLVDIRRAKTVAEIAGFPPD-GVVVHPQDWESIELDQAPGSG----VFRVIANVQGEATPRIWG-LNVVST--V 350 (419) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~kd~~g~----~~~~~~~~~~~~~~~l~G-~pv~~~--~ 350 (419) +.+....++.+..++..+.....+.+ ..+|+|..+..|.+.+.-... .-+..+ ..+...+.|.| .|++.. + T Consensus 135 ~~T~~nv~~~i~~~~~~lde~~vp~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~-~i~~~V~~lDg~v~ii~Vps~ 213 (285) T protein:vir:79 135 SITKDNALDAYDTAEAYMFDNEVPGGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVIN-GIDRRVAQLDGGVPIVRVSSD 213 (285) T ss_pred ccCHHHHHHHHHHHHHHHHHcCCCCceEEEEChHHHHHHHhhhhhheecccccceecc-ceeeeeccccceeEEEEcchh Confidence 23455678888888888888766433 456899988877754321110 001111 11234567888 898864 4 Q ss_pred CCCcCc------EEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecc-cceEEEEecCCC Q lcl|Aclame:pro 351 AIAQGT------ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQP-KAFVRVTFAAAT 418 (419) Q Consensus 351 ~~~~~~------~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~-~a~~~~~~~aa~ 418 (419) .|...+ .++...+ +..-..+... +.+..... .-.-|...|.-..++|.=+.+. ..-+.+..+++. T Consensus 214 r~kt~~~~k~Infiiv~~~-a~i~~~K~~~-~~~f~P~~-~~~~d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~~ 285 (285) T protein:vir:79 214 RLKGLGITNHVNFILTPLS-AIAPIVKYDS-VSVIDPST-DRSGNRWTIKGLSYYDAIVLDNAKKGIYVAATAGV 285 (285) T ss_pred hccCcCcchhccEEEecCc-eeccceeeee-eEeECCCC-CCCcceeeeeeeeeeeeeehhhccceeeeeecccC Confidence 443211 2222222 2222222221 22222111 1123445566667777777664 344455566666 No 259 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=23.18 E-value=2.3 Score=18.56 Aligned_cols=264 Identities=9% Similarity=0.000 Sum_probs=109.0 Q ss_pred ccCCcccccchhhhHHHHHhhhhhhhH-Hhh-----cc-eecccCcceeeeeeccccceeccccccceeecCcccccccc Q lcl|Aclame:pro 128 ITNPNVPHLPQLVPGIVPTTPDLPLLV-ADL-----LD-QQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST 200 (419) Q Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~l-~~~-----~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 200 (419) ++- ..-+.+...+++.......- ..+ .. +.-.+++.++||+.+........ .-...|...| .-+ T Consensus 1 Mai----nya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY-~R~~g~~~~g----~v~ 71 (346) T protein:vir:10 1 MTI----NYAEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDR-QRRTITTPVA----NYS 71 (346) T ss_pred Ccc----hhHHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccc-cccCCccccc----ccc Confidence 110 01133444444433222111 011 11 12246678999987522111111 1111111111 112 Q ss_pred cceeeEEeeeEEEEEeehhhHHH-HhhH---HHHHHHHHHHHHHHHHHHHHHHHHh----ccCcccccceeccccccccc Q lcl|Aclame:pro 201 LSFDTITTTLKTVAHWLPITRQA-ADDN---SQLMGYIQGRLTYGLRFLRDRQLLN----GNGSTEMQGILTTPGIGTYQ 272 (419) Q Consensus 201 ~~~~~v~~~~~k~~~~~~vs~el-l~d~---~~~~~~i~~~l~~a~~~~~d~~il~----G~g~~~p~Gi~~~~~~~~~~ 272 (419) .++...++.-.+.-.+. |. .+ ++.+ ..+...+.+.....+.=.+|...+. +.+... + T Consensus 72 ~~~et~tl~qDR~~~F~-vD-~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~--------~----- 136 (346) T protein:vir:10 72 NDWDSYELKNERYWSTL-VD-PSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAH--------D----- 136 (346) T ss_pred cceeEEEeeccccceec-cc-ccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhc--------c----- Confidence 23333444333332221 11 11 1111 1233334444444455566765443 111100 0 Q ss_pred cccccccchhhhHHHHHHHHHHhhhhhccC--CcEEEEehHHHHHHHHHhccCCceeccCCccccCCCcccccceeEec- Q lcl|Aclame:pro 273 QPKPTAPATDEPPLVDIRRAKTVAEIAGFP--PDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVST- 349 (419) Q Consensus 273 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~kd~~g~~~~~~~~~~~~~~~~l~G~pv~~~- 349 (419) ........+....|+.+..++..+.....+ +-..+|+|..+..|.+...-+...-........+..+.|.|+||+.. T Consensus 137 ~~~~~~a~T~~ni~~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~~i~~~V~siDGv~Ii~VP 216 (346) T protein:vir:10 137 GGITTNTLDEKNILPAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPNNIQRTVYSLDDVTIRVVP 216 (346) T ss_pred ccccccccCHHHHHHHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheeccccccccccceeeeeecCeEEEEcc Confidence 000111234556788888898888777654 33467899998876643321110001111223455678999999863 Q ss_pred -CCCCc------C----------cEEEEeccceEEEEEecceEEEEeecccchhhcCcEEEEEEEEeccEEecc-cceEE Q lcl|Aclame:pro 350 -VAIAQ------G----------TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQP-KAFVR 411 (419) Q Consensus 350 -~~~~~------~----------~~~~~d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~-~a~~~ 411 (419) +-|.. | ..+++.. .+..-..+- -.+.+.... .-..|...|.-..++|.=+.+. ..-+. T Consensus 217 s~r~~t~~~f~~G~~~~t~ak~INfiiv~~-~A~ia~~K~-~~~~if~P~--~~~~g~~l~~~R~Y~D~fv~~nk~~~Iy 292 (346) T protein:vir:10 217 SDLMQTAYDFSDGSKIIDTAKQIEMFLIYN-GVQIAPEKY-SFVGFDQPS--AATSGNYLYYEQSYDDVLLLNTKTKGIQ 292 (346) T ss_pred hhhcccchhhccCccccCCccceeEEEECC-ceeeeeeee-eeeEeeCCC--CCcccceeeeeeeeeeeeeeccccceEE Confidence 33321 1 0122221 122211111 112222222 1234445666677788777664 34444 Q ss_pred EEecCCCC Q lcl|Aclame:pro 412 VTFAAATT 419 (419) Q Consensus 412 ~~~~aa~~ 419 (419) +.++.++. T Consensus 293 v~~~~a~~ 300 (346) T protein:vir:10 293 FVVSDKPK 300 (346) T ss_pred Eeeecccc Confidence 45544443 Done!