Query lcl|NC_010583.1_cdsid_YP_001837083.1 [gene=AGC_0160] [protein=major head protein precursor] [protein_id=YP_001837083.1] [location=complement(102062..103438)] Match_columns 458 No_of_seqs 140 out of 1098 Neff 10.4 Searched_HMMs 1612 Date Thu Nov 7 14:36:33 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_160 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_160_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:104256 Length: 458 100.0 7.8E-84 4.8E-87 476.2 46.4 458 1-458 1-458 (458) 2 protein:vir:100247 Length: 425 100.0 2.9E-69 1.8E-72 396.4 39.9 411 1-458 7-424 (425) 3 protein:vir:485 Length: 407 # 100.0 3.1E-69 1.9E-72 396.2 37.9 393 14-458 1-400 (407) 4 protein:vir:4456 Length: 401 # 100.0 7.1E-68 4.4E-71 388.7 37.6 394 17-458 1-401 (401) 5 protein:vir:8102 Length: 543 # 100.0 1.5E-61 9.1E-65 354.1 43.9 438 1-458 55-542 (543) 6 protein:vir:4511 Length: 409 # 100.0 7.5E-63 4.6E-66 361.2 35.5 394 3-458 1-406 (409) 7 protein:vir:100135 Length: 418 100.0 1.5E-61 9.5E-65 354.0 38.7 408 5-458 1-415 (418) 8 protein:vir:7855 Length: 497 # 100.0 1.1E-60 7.1E-64 349.2 41.1 429 5-458 1-493 (497) 9 protein:vir:101650 Length: 497 100.0 1.1E-60 7.1E-64 349.2 41.1 429 5-458 1-493 (497) 10 protein:vir:6242 Length: 390 # 100.0 1.2E-61 7.5E-65 354.5 35.2 381 3-458 1-389 (390) 11 protein:vir:95376 Length: 425 100.0 2.8E-60 1.7E-63 347.1 42.4 411 3-458 1-421 (425) 12 protein:vir:1328 Length: 392 # 100.0 2E-61 1.2E-64 353.3 35.9 382 25-458 1-391 (392) 13 protein:vir:6212 Length: 434 # 100.0 8.9E-61 5.5E-64 349.8 39.0 421 3-458 1-431 (434) 14 protein:vir:105038 Length: 428 100.0 1E-60 6.3E-64 349.5 36.1 397 5-458 1-428 (428) 15 protein:vir:8420 Length: 477 # 100.0 1.3E-59 8E-63 343.4 38.1 433 1-458 1-471 (477) 16 protein:vir:1433 Length: 435 # 100.0 2.6E-60 1.6E-63 347.2 33.5 402 3-458 1-433 (435) 17 protein:vir:4600 Length: 415 # 100.0 3.4E-59 2.1E-62 341.2 37.5 400 5-458 1-404 (415) 18 protein:vir:4700 Length: 415 # 100.0 3.4E-59 2.1E-62 341.2 37.5 400 5-458 1-404 (415) 19 protein:vir:1268 Length: 397 # 100.0 4.6E-59 2.8E-62 340.4 38.1 384 1-458 1-397 (397) 20 protein:vir:80376 Length: 435 100.0 2.4E-59 1.5E-62 341.9 34.6 403 3-458 1-433 (435) 21 protein:vir:4339 Length: 395 # 100.0 1.3E-58 7.8E-62 338.0 38.2 388 5-458 1-395 (395) 22 protein:vir:97053 Length: 390 100.0 8.1E-59 5E-62 339.1 36.8 378 14-456 1-390 (390) 23 protein:vir:79987 Length: 415 100.0 1.4E-58 8.4E-62 337.8 37.7 401 5-458 1-404 (415) 24 protein:vir:98339 Length: 415 100.0 1.4E-58 8.4E-62 337.8 37.7 401 5-458 1-404 (415) 25 protein:vir:81100 Length: 415 100.0 1.4E-58 8.4E-62 337.8 37.7 401 5-458 1-404 (415) 26 protein:vir:9410 Length: 415 # 100.0 2.5E-58 1.5E-61 336.4 38.0 401 5-458 1-404 (415) 27 protein:vir:10364 Length: 390 100.0 2.7E-58 1.7E-61 336.2 36.5 378 14-456 1-390 (390) 28 protein:vir:81070 Length: 390 100.0 2.3E-58 1.4E-61 336.5 35.9 384 14-456 1-390 (390) 29 protein:vir:1084 Length: 437 # 100.0 1.1E-56 6.8E-60 327.4 44.6 412 3-458 1-427 (437) 30 protein:vir:80128 Length: 466 100.0 1.5E-57 9.2E-61 332.1 38.6 418 3-458 1-448 (466) 31 protein:vir:98635 Length: 377 100.0 1.5E-58 9.6E-62 337.5 31.3 363 1-458 1-377 (377) 32 protein:vir:4953 Length: 397 # 100.0 2E-57 1.2E-60 331.4 36.2 372 14-458 1-385 (397) 33 protein:vir:81160 Length: 371 100.0 1.6E-57 1E-60 331.9 35.2 356 30-458 1-371 (371) 34 protein:vir:1886 Length: 385 # 100.0 2.1E-57 1.3E-60 331.3 35.5 377 14-458 1-384 (385) 35 protein:vir:191 Length: 385 # 100.0 2.1E-57 1.3E-60 331.3 35.5 377 14-458 1-384 (385) 36 protein:vir:3870 Length: 400 # 100.0 1.3E-56 8E-60 327.0 38.8 395 3-458 1-399 (400) 37 protein:vir:102119 Length: 404 100.0 2.8E-57 1.7E-60 330.6 34.8 390 17-458 1-400 (404) 38 protein:vir:4997 Length: 397 # 100.0 5.8E-57 3.6E-60 328.9 35.9 372 14-458 1-385 (397) 39 protein:vir:7409 Length: 408 # 100.0 9.9E-57 6.2E-60 327.6 36.8 382 1-458 1-393 (408) 40 protein:vir:4830 Length: 397 # 100.0 1.2E-56 7.4E-60 327.2 36.9 375 14-458 1-385 (397) 41 protein:vir:1025 Length: 408 # 100.0 7.2E-57 4.5E-60 328.4 35.7 380 1-458 1-393 (408) 42 protein:vir:3991 Length: 404 # 100.0 3.6E-56 2.2E-59 324.6 36.6 382 1-458 1-393 (404) 43 protein:vir:4092 Length: 390 # 100.0 1.5E-56 9.6E-60 326.6 34.3 358 45-458 1-368 (390) 44 protein:vir:94673 Length: 419 100.0 7.7E-56 4.8E-59 322.7 37.6 406 1-458 1-417 (419) 45 protein:vir:105004 Length: 392 100.0 5.2E-56 3.2E-59 323.7 36.1 368 1-458 1-384 (392) 46 protein:vir:102873 Length: 392 100.0 5.2E-56 3.2E-59 323.7 36.1 368 1-458 1-384 (392) 47 protein:vir:107593 Length: 392 100.0 5.2E-56 3.2E-59 323.7 36.1 368 1-458 1-384 (392) 48 protein:vir:102082 Length: 392 100.0 5.2E-56 3.2E-59 323.7 36.1 368 1-458 1-384 (392) 49 protein:vir:3845 Length: 395 # 100.0 2E-55 1.3E-58 320.4 37.5 373 3-458 1-383 (395) 50 protein:vir:1383 Length: 421 # 100.0 1.7E-55 1E-58 320.9 36.9 378 3-458 1-383 (421) 51 protein:vir:81227 Length: 413 100.0 4.1E-55 2.6E-58 318.7 38.8 395 17-458 1-410 (413) 52 protein:vir:100172 Length: 394 100.0 2.5E-55 1.6E-58 319.9 37.1 380 5-458 1-384 (394) 53 protein:vir:100884 Length: 389 100.0 4.7E-55 2.9E-58 318.4 37.5 378 5-458 1-382 (389) 54 protein:vir:9704 Length: 394 # 100.0 9.6E-55 6E-58 316.7 36.5 387 1-458 2-390 (394) 55 protein:vir:962 Length: 397 # 100.0 5.5E-54 3.4E-57 312.6 40.4 393 3-458 1-397 (397) 56 protein:vir:93881 Length: 387 100.0 4.7E-54 2.9E-57 312.9 33.7 373 5-458 1-381 (387) 57 protein:vir:2685 Length: 387 # 100.0 3E-54 1.9E-57 314.0 32.0 374 5-458 1-381 (387) 58 protein:vir:96978 Length: 387 100.0 3E-54 1.9E-57 314.0 32.0 374 5-458 1-381 (387) 59 protein:vir:94424 Length: 387 100.0 3E-54 1.9E-57 314.0 32.0 374 5-458 1-381 (387) 60 protein:vir:7771 Length: 330 # 100.0 2.4E-55 1.5E-58 320.0 25.4 299 143-458 1-323 (330) 61 protein:vir:4226 Length: 326 # 100.0 3.1E-55 1.9E-58 319.4 25.8 305 143-458 1-323 (326) 62 protein:vir:5739 Length: 366 # 100.0 7.8E-55 4.8E-58 317.2 27.1 340 88-458 1-366 (366) 63 protein:vir:93616 Length: 645 100.0 5.5E-54 3.4E-57 312.6 31.5 422 1-458 165-642 (645) 64 protein:vir:9361 Length: 402 # 100.0 9.2E-54 5.7E-57 311.3 32.0 388 8-458 1-396 (402) 65 protein:vir:80684 Length: 315 100.0 5.4E-55 3.3E-58 318.1 25.1 284 162-458 1-306 (315) 66 protein:vir:41 Length: 299 # N 100.0 6.9E-55 4.3E-58 317.5 24.8 282 156-458 1-298 (299) 67 protein:vir:101607 Length: 379 100.0 1.1E-52 6.6E-56 305.5 36.3 373 3-458 1-379 (379) 68 protein:vir:105905 Length: 304 100.0 8.6E-55 5.3E-58 317.0 24.4 286 143-457 1-304 (304) 69 protein:vir:94142 Length: 304 100.0 8.6E-55 5.3E-58 317.0 24.4 286 143-457 1-304 (304) 70 protein:vir:78223 Length: 333 100.0 2.4E-54 1.5E-57 314.5 26.9 308 137-458 1-332 (333) 71 protein:vir:100632 Length: 381 100.0 3E-54 1.9E-57 314.0 27.4 348 51-458 1-368 (381) 72 protein:vir:96762 Length: 632 100.0 5.1E-53 3.2E-56 307.3 32.7 425 1-457 183-632 (632) 73 protein:vir:8187 Length: 311 # 100.0 4.9E-54 3E-57 312.9 25.7 282 164-458 1-310 (311) 74 protein:vir:95963 Length: 395 100.0 7.3E-53 4.6E-56 306.4 31.8 359 48-458 1-376 (395) 75 protein:vir:9643 Length: 377 # 100.0 2E-53 1.3E-56 309.5 28.4 352 48-458 1-377 (377) 76 protein:vir:78350 Length: 383 100.0 2.4E-53 1.5E-56 309.1 28.2 366 48-458 1-375 (383) 77 protein:vir:101291 Length: 381 100.0 3.5E-53 2.1E-56 308.2 28.9 347 51-458 1-368 (381) 78 protein:vir:9509 Length: 381 # 100.0 3.5E-53 2.1E-56 308.2 28.9 347 51-458 1-368 (381) 79 protein:vir:78640 Length: 352 100.0 7.6E-53 4.7E-56 306.3 29.8 344 45-458 1-346 (352) 80 protein:vir:78523 Length: 338 100.0 2.6E-53 1.6E-56 308.9 27.1 306 137-458 1-335 (338) 81 protein:vir:2430 Length: 318 # 100.0 2.3E-53 1.4E-56 309.2 26.6 296 143-458 1-313 (318) 82 protein:vir:104085 Length: 320 100.0 6.3E-53 3.9E-56 306.8 27.0 300 143-458 1-317 (320) 83 protein:vir:97148 Length: 324 100.0 6.2E-53 3.8E-56 306.8 26.4 298 124-458 1-315 (324) 84 protein:vir:9574 Length: 300 # 100.0 4.3E-53 2.7E-56 307.7 24.9 283 163-458 1-300 (300) 85 protein:vir:1638 Length: 298 # 100.0 5.7E-53 3.5E-56 307.0 25.5 281 166-457 1-298 (298) 86 protein:vir:9759 Length: 303 # 100.0 5.8E-53 3.6E-56 307.0 25.3 281 164-458 1-303 (303) 87 protein:vir:2504 Length: 305 # 100.0 8.7E-53 5.4E-56 306.0 25.7 283 162-458 1-298 (305) 88 protein:vir:9309 Length: 324 # 100.0 2.6E-52 1.6E-55 303.4 26.7 298 124-458 1-315 (324) 89 protein:vir:78830 Length: 324 100.0 2.5E-52 1.5E-55 303.5 26.6 298 124-458 1-315 (324) 90 protein:vir:96392 Length: 324 100.0 2.5E-52 1.5E-55 303.5 26.6 298 124-458 1-315 (324) 91 protein:vir:103955 Length: 324 100.0 6.9E-52 4.3E-55 301.1 26.3 298 124-458 1-315 (324) 92 protein:vir:99749 Length: 324 100.0 9E-52 5.6E-55 300.4 26.4 297 127-458 1-315 (324) 93 protein:vir:2344 Length: 397 # 100.0 6.6E-52 4.1E-55 301.2 24.6 289 147-458 1-306 (397) 94 protein:vir:94771 Length: 298 100.0 1.5E-51 9.4E-55 299.2 25.4 278 166-457 1-298 (298) 95 protein:vir:4856 Length: 293 # 100.0 1.3E-51 7.8E-55 299.6 24.4 271 157-458 1-281 (293) 96 protein:vir:96223 Length: 324 100.0 3E-51 1.8E-54 297.6 26.2 298 124-458 1-315 (324) 97 protein:vir:99920 Length: 311 100.0 1.9E-51 1.2E-54 298.7 24.5 285 162-458 1-311 (311) 98 protein:vir:95763 Length: 297 100.0 7.1E-51 4.4E-54 295.5 24.1 279 144-458 1-296 (297) 99 protein:vir:4197 Length: 314 # 100.0 1.1E-42 6.8E-46 250.6 23.7 295 149-458 1-313 (314) 100 protein:vir:4159 Length: 315 # 100.0 3.6E-42 2.2E-45 247.8 22.3 300 137-455 1-315 (315) 101 protein:vir:97397 Length: 517 100.0 7.1E-37 4.4E-40 218.8 28.1 389 1-458 120-516 (517) 102 protein:vir:3158 Length: 321 # 100.0 1E-33 6.3E-37 201.5 23.2 295 134-458 1-312 (321) 103 protein:vir:4074 Length: 480 # 100.0 4.9E-32 3E-35 192.2 21.7 365 1-458 107-477 (480) 104 protein:vir:3033 Length: 272 # 100.0 8.8E-32 5.4E-35 190.8 21.6 265 162-458 1-269 (272) 105 protein:vir:9820 Length: 272 # 100.0 8.8E-32 5.4E-35 190.8 21.6 265 162-458 1-269 (272) 106 protein:vir:93742 Length: 274 99.9 4.3E-23 2.7E-26 143.2 19.8 265 162-458 1-270 (274) 107 protein:vir:3613 Length: 272 # 99.9 3.5E-23 2.2E-26 143.7 18.9 267 162-458 1-272 (272) 108 protein:vir:96123 Length: 274 99.8 1.3E-21 8E-25 135.1 19.6 265 162-458 1-270 (274) 109 protein:vir:80930 Length: 278 99.8 2.2E-21 1.4E-24 133.8 20.1 271 162-458 1-277 (278) 110 protein:vir:96833 Length: 275 99.8 8.1E-21 5E-24 130.7 18.8 267 159-458 1-271 (275) 111 protein:vir:79928 Length: 393 99.8 1.4E-20 8.7E-24 129.4 19.8 360 52-458 1-381 (393) 112 protein:vir:97433 Length: 274 99.8 3.1E-20 1.9E-23 127.6 19.9 265 162-458 1-270 (274) 113 protein:vir:94494 Length: 274 99.8 3.1E-20 1.9E-23 127.6 19.9 265 162-458 1-270 (274) 114 protein:vir:105334 Length: 276 99.8 2.3E-20 1.4E-23 128.3 19.1 266 162-458 1-270 (276) 115 protein:vir:94933 Length: 330 99.8 4.4E-20 2.7E-23 126.7 18.0 309 133-458 1-329 (330) 116 protein:vir:1239 Length: 274 # 99.7 4.3E-19 2.7E-22 121.3 19.0 265 162-458 1-270 (274) 117 protein:vir:96262 Length: 274 99.7 2.2E-18 1.3E-21 117.4 19.4 265 162-458 1-270 (274) 118 protein:vir:95898 Length: 274 99.7 2.2E-18 1.3E-21 117.4 19.4 265 162-458 1-270 (274) 119 protein:vir:95107 Length: 270 99.7 2.4E-18 1.5E-21 117.2 18.1 261 161-458 1-265 (270) 120 protein:vir:97255 Length: 310 99.6 5.4E-16 3.4E-19 104.3 20.6 289 161-458 1-310 (310) 121 protein:vir:739 Length: 231 # 99.6 1.7E-16 1E-19 107.0 15.4 231 196-458 1-231 (231) 122 protein:vir:102605 Length: 273 99.5 3.5E-15 2.2E-18 99.8 18.8 266 168-458 1-273 (273) 123 protein:vir:105822 Length: 273 99.5 3.5E-15 2.2E-18 99.8 18.8 266 168-458 1-273 (273) 124 protein:vir:7990 Length: 273 # 99.5 2.6E-15 1.6E-18 100.6 17.7 266 168-458 1-273 (273) 125 protein:vir:93858 Length: 400 99.4 7.2E-14 4.4E-17 92.6 19.7 386 12-456 1-400 (400) 126 protein:vir:99424 Length: 360 99.3 1.9E-12 1.2E-15 84.9 19.9 302 134-458 1-360 (360) 127 protein:vir:8885 Length: 347 # 99.2 3E-13 1.9E-16 89.2 13.1 299 143-458 1-346 (347) 128 protein:vir:8324 Length: 410 # 99.2 4.2E-12 2.6E-15 82.9 18.0 397 14-456 1-410 (410) 129 protein:vir:94711 Length: 347 99.2 7.9E-13 4.9E-16 86.9 13.7 297 143-458 1-346 (347) 130 protein:vir:103323 Length: 364 99.2 2.2E-11 1.3E-14 79.1 20.9 297 137-458 1-339 (364) 131 protein:vir:80213 Length: 334 99.1 4E-12 2.5E-15 83.1 15.3 295 143-458 1-332 (334) 132 protein:vir:94622 Length: 341 99.1 1.8E-12 1.1E-15 85.0 13.3 288 137-458 1-339 (341) 133 protein:vir:94576 Length: 347 99.1 2.5E-12 1.6E-15 84.2 13.9 299 140-458 1-347 (347) 134 protein:vir:6324 Length: 335 # 99.1 1E-11 6.2E-15 80.9 16.8 299 137-458 1-328 (335) 135 protein:vir:78935 Length: 335 99.1 1.1E-11 6.8E-15 80.7 16.4 293 137-458 1-328 (335) 136 protein:vir:2201 Length: 345 # 99.1 1E-11 6.4E-15 80.8 15.8 298 143-458 1-345 (345) 137 protein:vir:78739 Length: 332 99.1 3.2E-12 2E-15 83.6 12.9 296 137-456 1-332 (332) 138 protein:vir:3364 Length: 347 # 99.1 1.2E-11 7.3E-15 80.5 15.4 302 143-458 1-345 (347) 139 protein:vir:100057 Length: 375 99.1 9.2E-11 5.7E-14 75.6 19.7 303 143-458 1-370 (375) 140 protein:vir:108211 Length: 318 99.1 9.7E-12 6E-15 81.0 14.3 278 158-458 1-317 (318) 141 protein:vir:1541 Length: 347 # 99.0 5.2E-11 3.2E-14 77.0 17.3 302 139-458 1-345 (347) 142 protein:vir:10450 Length: 344 99.0 2.6E-11 1.6E-14 78.6 14.2 303 143-458 1-344 (344) 143 protein:vir:5974 Length: 324 # 99.0 1.7E-10 1E-13 74.2 17.6 272 162-458 1-289 (324) 144 protein:vir:80180 Length: 381 98.9 4E-10 2.5E-13 72.1 17.2 293 143-458 1-305 (381) 145 protein:vir:95318 Length: 328 98.9 1.8E-10 1.1E-13 74.0 14.4 236 143-404 1-328 (328) 146 protein:vir:1583 Length: 351 # 98.8 1.4E-09 8.6E-13 69.2 17.6 274 162-458 1-296 (351) 147 protein:vir:99675 Length: 324 98.8 5.4E-10 3.4E-13 71.4 13.9 253 195-458 1-296 (324) 148 protein:vir:102944 Length: 330 98.8 2.2E-09 1.3E-12 68.1 16.8 275 162-458 1-295 (330) 149 protein:vir:102655 Length: 322 98.7 4.7E-09 2.9E-12 66.3 17.4 292 143-458 1-321 (322) 150 protein:vir:103759 Length: 330 98.7 9.7E-10 6E-13 70.0 13.4 236 143-404 1-330 (330) 151 protein:vir:103285 Length: 296 98.7 4E-09 2.5E-12 66.6 16.5 279 161-456 1-296 (296) 152 protein:vir:3136 Length: 322 # 98.7 2.5E-09 1.5E-12 67.8 14.6 284 161-458 1-318 (322) 153 protein:vir:107388 Length: 331 98.6 8.1E-09 5E-12 65.0 16.2 237 143-404 1-331 (331) 154 protein:vir:98525 Length: 331 98.6 8.1E-09 5E-12 65.0 16.2 237 143-404 1-331 (331) 155 protein:vir:107826 Length: 331 98.6 8.1E-09 5E-12 65.0 16.2 237 143-404 1-331 (331) 156 protein:vir:97031 Length: 402 98.6 4.2E-09 2.6E-12 66.5 13.9 295 137-458 1-339 (402) 157 protein:vir:105645 Length: 400 98.6 1E-08 6.5E-12 64.3 15.3 290 137-458 1-333 (400) 158 protein:vir:107687 Length: 319 98.5 4.7E-08 2.9E-11 60.7 18.2 299 143-456 1-319 (319) 159 protein:vir:80068 Length: 301 98.5 5.2E-08 3.2E-11 60.5 17.3 277 164-456 1-301 (301) 160 protein:vir:7324 Length: 335 # 98.4 1.5E-08 9.3E-12 63.5 12.7 235 143-406 1-335 (335) 161 protein:vir:7019 Length: 401 # 98.3 7.5E-08 4.6E-11 59.7 14.6 296 137-458 1-333 (401) 162 protein:vir:104342 Length: 314 98.3 2.7E-07 1.7E-10 56.6 17.1 296 126-456 1-314 (314) 163 protein:vir:79642 Length: 329 98.3 3.4E-07 2.1E-10 56.0 17.0 310 114-458 1-328 (329) 164 protein:vir:99075 Length: 392 98.2 4.8E-07 3E-10 55.2 17.1 266 168-458 1-316 (392) 165 protein:vir:80446 Length: 367 98.0 3.1E-06 1.9E-09 50.8 17.6 279 127-458 1-339 (367) 166 protein:vir:93966 Length: 400 97.9 1.3E-06 7.9E-10 52.9 13.8 380 12-456 1-400 (400) 167 protein:vir:9927 Length: 295 # 97.9 2.4E-06 1.5E-09 51.4 14.5 261 161-458 1-288 (295) 168 protein:vir:5255 Length: 304 # 97.8 2.9E-06 1.8E-09 51.0 14.1 276 167-455 1-304 (304) 169 protein:vir:8843 Length: 317 # 97.8 4E-06 2.5E-09 50.2 14.3 287 159-458 1-315 (317) 170 protein:vir:9875 Length: 296 # 97.5 2.5E-05 1.5E-08 45.9 15.6 270 143-458 1-295 (296) 171 protein:vir:108303 Length: 418 97.5 5.3E-05 3.3E-08 44.0 18.8 259 165-458 1-282 (418) 172 protein:vir:1663 Length: 393 # 97.5 9E-06 5.6E-09 48.2 12.8 375 12-456 1-393 (393) 173 protein:vir:94989 Length: 349 97.5 5.8E-05 3.6E-08 43.8 18.2 270 159-458 1-317 (349) 174 protein:vir:861 Length: 318 # 97.3 1.1E-05 7.1E-09 47.7 11.6 298 126-456 1-318 (318) 175 protein:vir:95131 Length: 325 97.2 0.00013 8.3E-08 41.8 18.1 276 162-458 1-292 (325) 176 protein:vir:94070 Length: 339 97.2 8.1E-05 5E-08 43.0 14.5 315 122-456 1-339 (339) 177 protein:vir:78387 Length: 349 97.1 0.00016 9.7E-08 41.5 18.6 270 159-458 1-317 (349) 178 protein:vir:1781 Length: 221 # 97.1 2.8E-05 1.7E-08 45.6 11.4 188 239-458 1-202 (221) 179 protein:vir:106647 Length: 303 97.0 6.7E-05 4.1E-08 43.5 13.2 265 143-458 1-296 (303) 180 protein:vir:3643 Length: 336 # 96.8 9.4E-05 5.8E-08 42.7 12.1 312 124-456 1-336 (336) 181 protein:vir:96792 Length: 315 96.7 0.00037 2.3E-07 39.4 15.9 264 159-458 1-281 (315) 182 protein:vir:101557 Length: 336 96.7 0.00013 8.1E-08 41.9 12.2 312 124-456 1-336 (336) 183 protein:vir:105522 Length: 423 96.5 0.00053 3.3E-07 38.5 17.8 265 168-458 1-332 (423) 184 protein:vir:105374 Length: 423 96.5 0.00056 3.5E-07 38.4 18.1 266 168-458 1-336 (423) 185 protein:vir:78558 Length: 336 96.5 0.00022 1.3E-07 40.7 12.1 312 92-456 1-336 (336) 186 protein:vir:106734 Length: 336 96.3 0.00029 1.8E-07 40.0 11.9 311 92-456 1-336 (336) 187 protein:vir:174 Length: 423 # 96.3 0.00078 4.8E-07 37.6 17.4 266 168-458 1-336 (423) 188 protein:vir:3525 Length: 423 # 96.1 0.00098 6.1E-07 37.1 17.5 266 168-458 1-336 (423) 189 protein:vir:95875 Length: 401 96.1 0.00075 4.6E-07 37.7 12.9 293 139-458 1-400 (401) 190 protein:vir:107732 Length: 379 95.2 0.0021 1.3E-06 35.2 12.4 333 105-456 1-379 (379) 191 protein:vir:94870 Length: 318 95.0 0.002 1.2E-06 35.4 11.5 303 126-456 1-318 (318) 192 protein:vir:94800 Length: 319 93.7 0.0067 4.2E-06 32.5 18.5 286 119-458 1-294 (319) 193 protein:vir:97331 Length: 319 93.7 0.0067 4.2E-06 32.5 18.5 286 119-458 1-294 (319) 194 protein:vir:270 Length: 341 # 91.1 0.017 1.1E-05 30.2 12.1 294 127-458 1-332 (341) 195 protein:vir:79548 Length: 652 91.1 0.017 1.1E-05 30.2 22.4 420 1-455 136-652 (652) 196 protein:vir:79171 Length: 337 87.7 0.037 2.3E-05 28.4 15.1 294 131-458 1-337 (337) 197 protein:vir:95451 Length: 313 85.8 0.05 3.1E-05 27.7 13.7 282 161-458 1-312 (313) 198 protein:vir:79008 Length: 299 85.8 0.05 3.1E-05 27.7 20.6 269 168-458 1-299 (299) 199 protein:vir:104011 Length: 337 85.0 0.056 3.5E-05 27.5 15.8 294 131-458 1-337 (337) 200 protein:vir:94673 Length: 419 83.9 0.065 4E-05 27.1 23.8 376 1-447 5-419 (419) 201 protein:vir:78186 Length: 337 83.8 0.066 4.1E-05 27.1 14.7 294 131-458 1-337 (337) 202 protein:vir:1829 Length: 355 # 83.1 0.072 4.4E-05 26.9 15.7 300 131-458 1-350 (355) 203 protein:vir:107120 Length: 329 82.8 0.074 4.6E-05 26.8 20.9 298 115-458 1-307 (329) 204 protein:vir:98856 Length: 343 82.2 0.079 4.9E-05 26.6 14.9 302 131-458 1-341 (343) 205 protein:vir:2016 Length: 357 # 82.1 0.08 4.9E-05 26.6 13.8 302 131-458 1-350 (357) 206 protein:vir:5694 Length: 357 # 81.7 0.083 5.1E-05 26.5 13.7 302 131-458 1-350 (357) 207 protein:vir:98566 Length: 355 81.5 0.085 5.3E-05 26.4 15.1 300 131-458 1-348 (355) 208 protein:vir:3783 Length: 336 # 80.8 0.092 5.7E-05 26.3 16.3 294 134-458 1-335 (336) 209 protein:vir:99576 Length: 388 80.1 0.098 6.1E-05 26.1 8.9 340 88-456 1-388 (388) 210 protein:vir:6061 Length: 357 # 79.6 0.1 6.4E-05 26.0 13.9 302 131-458 1-350 (357) 211 protein:vir:100331 Length: 342 79.4 0.11 6.5E-05 25.9 14.2 294 131-448 1-342 (342) 212 protein:vir:99311 Length: 463 78.1 0.12 7.3E-05 25.7 11.8 295 125-458 1-318 (463) 213 protein:vir:95603 Length: 463 78.1 0.12 7.3E-05 25.7 11.8 295 125-458 1-318 (463) 214 protein:vir:96079 Length: 382 77.6 0.12 7.6E-05 25.6 11.0 333 105-456 1-382 (382) 215 protein:vir:95512 Length: 693 77.1 0.13 7.9E-05 25.5 23.4 425 1-456 168-693 (693) 216 protein:vir:79157 Length: 339 75.9 0.14 8.7E-05 25.2 14.6 295 131-457 1-339 (339) 217 protein:vir:3746 Length: 336 # 75.8 0.14 8.8E-05 25.2 16.2 294 134-458 1-335 (336) 218 protein:vir:96666 Length: 462 75.7 0.14 8.9E-05 25.2 15.0 306 117-458 1-339 (462) 219 protein:vir:1153 Length: 338 # 75.4 0.15 9.1E-05 25.1 15.5 294 131-452 1-338 (338) 220 protein:vir:99888 Length: 309 73.4 0.17 0.00011 24.8 10.7 275 167-458 1-308 (309) 221 protein:vir:103886 Length: 302 68.2 0.24 0.00015 24.0 16.8 268 161-458 1-302 (302) 222 protein:vir:103463 Length: 521 68.2 0.24 0.00015 24.0 14.1 359 48-458 1-493 (521) 223 protein:vir:104256 Length: 458 66.0 0.27 0.00017 23.7 27.4 417 1-449 12-458 (458) 224 protein:vir:98143 Length: 524 65.3 0.29 0.00018 23.6 16.5 357 67-458 1-497 (524) 225 protein:vir:106286 Length: 534 60.3 0.38 0.00023 22.9 13.9 359 51-458 1-517 (534) 226 protein:vir:80986 Length: 528 59.9 0.38 0.00024 22.9 18.0 357 70-458 1-502 (528) 227 protein:vir:8420 Length: 477 # 56.1 0.46 0.00029 22.4 25.7 407 1-451 8-477 (477) 228 protein:vir:8846 Length: 705 # 55.4 0.48 0.0003 22.3 14.8 122 1-136 567-705 (705) 229 protein:vir:78920 Length: 290 51.0 0.59 0.00037 21.8 19.5 266 159-455 1-290 (290) 230 protein:vir:93696 Length: 364 48.8 0.66 0.00041 21.6 14.1 292 162-458 1-361 (364) 231 protein:vir:6601 Length: 528 # 46.9 0.72 0.00045 21.4 18.0 357 70-458 1-502 (528) 232 protein:vir:102335 Length: 312 45.3 0.78 0.00048 21.2 19.2 273 168-458 1-308 (312) 233 protein:vir:78777 Length: 358 41.6 0.92 0.00057 20.8 15.3 299 127-458 1-346 (358) 234 protein:vir:80835 Length: 464 40.3 0.98 0.00061 20.6 11.6 300 117-458 1-331 (464) 235 protein:vir:96442 Length: 418 36.1 1.2 0.00074 20.2 11.2 318 117-458 1-407 (418) 236 protein:vir:100851 Length: 514 34.5 1.3 0.0008 20.0 9.2 318 94-458 1-383 (514) 237 protein:vir:100603 Length: 529 33.9 1.3 0.00082 19.9 17.1 349 70-458 1-503 (529) 238 protein:vir:78148 Length: 123 33.0 1.4 0.00086 19.8 6.1 106 345-458 1-123 (123) 239 protein:vir:103181 Length: 457 32.6 1.4 0.00088 19.8 18.3 336 71-458 1-438 (457) 240 protein:vir:107947 Length: 519 29.4 1.7 0.001 19.4 15.7 353 58-458 1-504 (519) 241 protein:vir:8846 Length: 705 # 28.3 1.8 0.0011 19.2 16.6 138 1-145 559-705 (705) 242 protein:vir:108295 Length: 711 27.7 1.8 0.0011 19.2 10.8 100 1-105 593-711 (711) 243 protein:vir:4456 Length: 401 # 27.4 1.8 0.0011 19.1 20.8 367 1-449 1-401 (401) 244 protein:vir:9950 Length: 714 # 26.9 1.9 0.0012 19.1 12.9 113 1-121 581-714 (714) 245 protein:vir:817 Length: 714 # 26.9 1.9 0.0012 19.1 12.9 113 1-121 581-714 (714) 246 protein:vir:2764 Length: 714 # 26.9 1.9 0.0012 19.1 12.9 113 1-121 581-714 (714) 247 protein:vir:10117 Length: 714 26.9 1.9 0.0012 19.1 12.9 113 1-121 581-714 (714) 248 protein:vir:3296 Length: 714 # 26.9 1.9 0.0012 19.1 12.9 113 1-121 581-714 (714) 249 protein:vir:101039 Length: 529 20.5 2.8 0.0017 18.2 17.3 346 58-458 1-529 (529) No 1 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=7.8e-84 Score=476.22 Aligned_cols=458 Identities=90% Similarity=1.245 Sum_probs=391.7 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) |||+|+++++++++.++++..+.+..+++..+.++.+.++++.+..+++++..+...|.++.+++..++++.+.++.++. T Consensus 1 ~~~~~~~~~~e~~~~e~a~~~~~~~~~~k~~e~~~~~ke~~~~~l~~~~e~~~k~~~E~~~~le~~~ee~k~l~ee~~~~ 80 (458) T protein:vir:10 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAQEAERMRKEQEEKELARMNDLVSKAVGEDRKRLEEALELVKSLDEKSKKS 80 (458) T ss_pred CccchhhhhhhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999998888888888888888999999999999998888899999999999999998888777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhh Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKA 160 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a 160 (458) .+......++....+....++........+.+.......................+.++...+.++...+.........+ T Consensus 81 ~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a 160 (458) T protein:vir:10 81 NELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKA 160 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhh Confidence 76666666666555555555554444444444444333333333333444444455566666666666666655555556 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccce Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLT 240 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~ 240 (458) ...+++.+.|+.++|+++++.|++.+++.++|+++|+++|++++...+|+..+.+.++|++|++..++.+....++++|+ T Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~ 240 (458) T protein:vir:10 161 VNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALK 240 (458) T ss_pred hhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecCCcceeecccccccccccccccccccce Confidence 66666777899999999999999999999999999999999999999999999999999999999999988889999999 Q ss_pred eeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhh Q lcl|NC_010583. 241 EISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSV 320 (458) Q Consensus 241 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~ 320 (458) +|++.++|++++++||+++|+|+.++|.+||.++|++++++++|.+||+|+|+++|+||++.+.......+...+..... T Consensus 241 ~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~ 320 (458) T protein:vir:10 241 EIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKADGSV 320 (458) T ss_pred eeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeecccccccc Confidence 99999999999999999999999999999999999999999999999999999999999998877766666666666677 Q ss_pred HHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCC Q lcl|NC_010583. 321 LVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASA 400 (458) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~ 400 (458) ..++++++++++.+...|+.++.|+||+.+|..|.+++|++|+|++++.+...+..+.+++|||+||+++++||+.+... T Consensus 321 ~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~ 400 (458) T protein:vir:10 321 LVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANSA 400 (458) T ss_pred cccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEccccccccCCc Confidence 78889999999999999999999999999999999999999999998887777777888999999999999999876555 Q ss_pred ceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 401 EFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 401 ~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .++++.|.++|.++++.++++.+|+|+.+|++.|+++.|+|+.+++|+|||+.++||| T Consensus 401 ~~~~~~f~~~~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 401 EFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred ceEEEEecccEEEEEeeceEEEeecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 5555555578999999999999999999999999999999999999999999999999 No 2 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=2.9e-69 Score=396.36 Aligned_cols=411 Identities=16% Similarity=0.128 Sum_probs=292.0 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) ..+|+..|- ....+..+.+.|.|++...+++++.++...+.++..++..+.++++...... T Consensus 7 ~~~~~~~~~------------------~~~~~~~~~l~e~ra~~~~e~~~l~~~~~~~~~~~k~~~~~~~~~~~~~~~~- 67 (425) T protein:vir:10 7 IAVLTAALT------------------GPVGAVPRGIISVRAEGPTEVKALIENLQKAFHDFKAEHTKQLDAVKAGLPT- 67 (425) T ss_pred HHhhHHHhh------------------hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc- Confidence 111221111 1222233455555555555555555444332221111111111111111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhh Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKA 160 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a 160 (458) .+ ...+++++..++..++...+......... +............+++.+|..++++++. .++ T Consensus 68 ~e-~~~~~~~~~~ei~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~af~~~l~~~e~---------~~a 129 (425) T protein:vir:10 68 SD-ALAKVDKVSADLEALQAAVDEANIKIAAA--------QMGANGVKPLRDPEYTEAFKAHVKRGDV---------QAA 129 (425) T ss_pred HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhh--------hcccccccccccHHHHHHHHHHhhhhhh---------HHH Confidence 00 11122222223332222221111100000 0111122233344566777777765421 112 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccce Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLT 240 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~ 240 (458) ...++.+.||++||+++++.|++.++..++|+++|+++|++++..++|+..+++.++|++|++..++ ...++|+ T Consensus 130 -l~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~-----~~~~~f~ 203 (425) T protein:vir:10 130 -LNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNMGGTTSGWVGEASQRPQ-----TNAATFQ 203 (425) T ss_pred -hhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEcCCcceeeecccccccc-----ccccccc Confidence 2345667889999999999999999999999999999999999999999999999999999865543 2357999 Q ss_pred eeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccce------eecc Q lcl|NC_010583. 241 EISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKV------VTEA 314 (458) Q Consensus 241 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~------~~~~ 314 (458) +|++++++++++++||+|+|+|+.++|++||.++|++++++++|.+||+|||+++|.||++......... .... T Consensus 204 ~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~ 283 (425) T protein:vir:10 204 PLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVV 283 (425) T ss_pred eeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeeccccccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999999999999998654322111 1112 Q ss_pred ccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccccc Q lcl|NC_010583. 315 KADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFP 394 (458) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~ 394 (458) .......+.++++++++..+.+.|+.++.|+||+.++..|.+++|.+|+|+|++... .+.+++|+|+||+++++|| T Consensus 284 ~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~----~g~~~~l~G~PV~~~~~~p 359 (425) T protein:vir:10 284 NSGAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYV----AGQPATLAGYPVTEVPDMP 359 (425) T ss_pred cccccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHHhhcCCCceeeccCcc----CCCCceecceeeEEecCcC Confidence 223345567888999999999999999999999999999999999999999976433 4566899999999999999 Q ss_pred ccccCCceEEEEEece-EEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 395 AKAASAEFAVIVYKDN-FVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 395 ~~~~~~~~~~~~~~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..+++...+++++++. |+++++.++++..++|+.+|++.|+++.|+|+++++|+||++++++|| T Consensus 360 ~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as 424 (425) T protein:vir:10 360 DVAANSTPILFGDFQQTYLIIDRIGVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAAS 424 (425) T ss_pred CccCCccEEEEEehhccEEEEEecceEEEecccccCCcEEEEEEEEeccEeecccceEEEEeecc Confidence 8888777788888875 899999999999999999999999999999999999999999999999 No 3 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=3.1e-69 Score=396.19 Aligned_cols=393 Identities=19% Similarity=0.173 Sum_probs=277.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 14 LGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQE 93 (458) Q Consensus 14 ~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~ 93 (458) |.++.+.-+.+ ++......+++++..+. .++.+++..+...+++.+ ...++.++. T Consensus 1 l~~~k~l~~~i---~e~~~~~~~~k~~~~~~-----------~~~~e~~~~~l~~~~e~~-----------~~~~~~~e~ 55 (407) T protein:vir:48 1 MADVKDVEQVA---QELQRKFDDFKEKNDKR-----------IDAIEQEKGKLAGEVETL-----------NGKLAELEN 55 (407) T ss_pred CchHHHHHHHH---HHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHH-----------HHHHHHHHH Confidence 22222211100 01111111111111100 011111111111111111 111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccc Q lcl|NC_010583. 94 TIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEA 173 (458) Q Consensus 94 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ 173 (458) ....+..... ...+...... .....+++.+|.++++++... +......++.. .++.+.||++ T Consensus 56 ~~~~~~~~~~--------------~~~~~~~~~~-~~~~~e~~~a~~~~l~~g~~~--~~~~~e~~a~~-~~t~~~gG~~ 117 (407) T protein:vir:48 56 LKSDLEAELA--------------EVKRPAGGTQ-NKVASEHKEAFIGFMRKGRED--GLRELERKALQ-VGNDEDGGYA 117 (407) T ss_pred HHHHHHHHHH--------------Hhhccccccc-cchhhHHHHHHHHHHhccchh--hhhHHHHHhhh-cccCCCCccc Confidence 1111110000 0001111111 122334566777777765432 22233334443 4455678999 Q ss_pred cchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeee Q lcl|NC_010583. 174 YETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKS 253 (458) Q Consensus 174 ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~ 253 (458) ||++++++|++.++..++|+++|+++|++++...+|+..+++.++|++|+...++ .+.++|++|++.++++++++ T Consensus 118 iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-----~~~~~f~~i~~~~~k~~~~~ 192 (407) T protein:vir:48 118 IPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNLGGTTSGWVGETDARPE-----TATSKLGLIEPFMGEIYGNP 192 (407) T ss_pred ccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCcceeeecccccccc-----cccccceeEEeeeeeeEeeh Confidence 9999999999999999999999999999999999999999999999999865443 34579999999999999999 Q ss_pred hhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccee------eccccchhhHHHHHHH Q lcl|NC_010583. 254 FITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVV------TEAKADGSVLVTAKTI 327 (458) Q Consensus 254 ~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~------~~~~~~~~~~~~~~~~ 327 (458) +||+|+|+||.++|++||.++|+++++.++|.+|++|+|+++|.||++.......... ..........++++++ T Consensus 193 ~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i 272 (407) T protein:vir:48 193 QATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAI 272 (407) T ss_pred hhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeecccccccccccccccccccccccccccChHHH Confidence 9999999999999999999999999999999999999999999999976543322111 1122233445668889 Q ss_pred HHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEE Q lcl|NC_010583. 328 SKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVY 407 (458) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~ 407 (458) +++.+.+.+.|+.++.|+||+.++..|++++|.+|||+|++... .+.+++|+|+||+++++||..+++...+++++ T Consensus 273 ~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~----~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd 348 (407) T protein:vir:48 273 IKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDNDGNYLWRPGIE----LGQPSSLAGYGIVENEQMPDIAADAKAIAFGN 348 (407) T ss_pred HHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccCCceeeccCcC----CCCCceecceeeEEecCcCCccCCccEEEEEe Confidence 99999999999999999999999999999999999999876433 35668999999999999999888888888888 Q ss_pred ec-eEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 408 KD-NFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 408 ~~-~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ++ .|.++++.++++.+|+|+.+|++.|+++.|+|+++++|+||++++++|| T Consensus 349 ~~~~~~i~~~~~~~i~~d~~~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~aa 400 (407) T protein:vir:48 349 FKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLMKIGAA 400 (407) T ss_pred ccccEEEEEeeceEEEeeccccCCcEEEEEEEEeccEEecccceEEEEeecc Confidence 86 5899999999999999999999999999999999999999999999999 No 4 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=7.1e-68 Score=388.72 Aligned_cols=394 Identities=17% Similarity=0.163 Sum_probs=276.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 17 LAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIV 96 (458) Q Consensus 17 ~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~ 96 (458) |.-+++.+ ++...++.++.+. +++..++..++.+++..+..++++.+ ...+.+++.... T Consensus 1 m~~~lk~l------~~~~~el~~~~~~----~k~~~~~~~~~~e~~~~~l~~~~~~l-----------~~~~~~~~~~~~ 59 (401) T protein:vir:44 1 MAVDIKDV------EQVAQELQQKFDD----FKAKNDKRVEAIEQEKGKLAGQVETL-----------NGKLSELENLKS 59 (401) T ss_pred CCccHHHH------HHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHH Confidence 22222222 1112222222211 11111111111111111111111111 111111111111 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccch Q lcl|NC_010583. 97 GLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYET 176 (458) Q Consensus 97 ~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~ 176 (458) .+..... .. .+.... .......+++.+|.++++++... +......++. ..++.+.||++||+ T Consensus 60 ~~~~~~~-------~~-------~~~~~~-~~~~~~~e~~~a~~~~lr~~~~~--~~~~~e~~a~-~~~~~~~GG~~iP~ 121 (401) T protein:vir:44 60 DLEKELL-------EL-------KRPARG-AQNKVAAEHKDAFVGFLRKGRED--GLRDLERKAL-QVGTDEDGGYAVPE 121 (401) T ss_pred HHHHHHH-------Hh-------hccccc-cccchhHHHHHHHHHHHhhhhhh--hhHHHHHHHh-hcCCCCCCceeccH Confidence 1110000 00 000011 11222334566777777654322 2222223333 34556678999999 Q ss_pred hHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhh Q lcl|NC_010583. 177 IFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFIT 256 (458) Q Consensus 177 ~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is 256 (458) ++.++|++.++..++|+++|+++|++++...+|+..+++.++|++|+...++ .+.++|++|++.++|++++++|| T Consensus 122 ~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~-----~~~~~~~~v~~~~~k~~~~~~iS 196 (401) T protein:vir:44 122 ELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGGTASGWVGETDTRSQ-----TATSRLGLIEPFMGEIYGNPQAT 196 (401) T ss_pred hHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCccceeeccccccCc-----cccccceeeeeehhheeeehhhh Confidence 9999999999999999999999999999999999999999999999865443 34579999999999999999999 Q ss_pred HHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccee------eccccchhhHHHHHHHHHH Q lcl|NC_010583. 257 DETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVV------TEAKADGSVLVTAKTISKL 330 (458) Q Consensus 257 ~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~ 330 (458) +|+|+||.++|++||.++|++++++++|.+||+|+|+++|.||++.......... ...........++++++++ T Consensus 197 ~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~ 276 (401) T protein:vir:44 197 QKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKL 276 (401) T ss_pred HHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeccccccccccccccccccccccccccccCHHHHHHH Confidence 9999999999999999999999999999999999999999999976543322111 1111223444668889999 Q ss_pred HhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEec- Q lcl|NC_010583. 331 RRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKD- 409 (458) Q Consensus 331 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~- 409 (458) ++.+.+.|+.++.|+||+.++..|++++|.+|+|+|++... .+.+++|+|+||++++++|..+++...+++++++ T Consensus 277 ~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~----~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~ 352 (401) T protein:vir:44 277 IYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLE----LGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKR 352 (401) T ss_pred HHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcC----CCCCceecceeeEEecCcCCccCCccEEEEeehhc Confidence 99999999999999999999999999999999999866433 3566899999999999999888878777888886 Q ss_pred eEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 410 NFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 410 ~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .|.++++.++++.+++|+.+|++.||++.|+|+++++|+||++++++|| T Consensus 353 ~~~i~~~~~~~~~~~~~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 353 GYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred cEEEEEecceEEeeeccccCCcEEEEEEEEeccEEecccceEEEEeecC Confidence 5899999999999999999999999999999999999999999999999 No 5 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=1.5e-61 Score=354.10 Aligned_cols=438 Identities=12% Similarity=0.049 Sum_probs=260.4 Q ss_pred CcchHHHHHHHHHHH---------------HHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHH--H-------- Q lcl|NC_010583. 1 MTIDINKLKEELGLG---------------DLAKSLEGLTAAQKAAEAKRLRE---EQEEKELARMNDL--V-------- 52 (458) Q Consensus 1 ~~~~~~~~~~~~~~~---------------~~~~~~~~l~~~~~~~~~~~~~~---e~~~~~~~~~~~~--~-------- 52 (458) +...++++++..... ++.++++.|...++.+..+...+ ..+.....+++.. . T Consensus 55 ~~~~~e~l~~~~~~~~~e~~~~~~~~~e~~el~~~~~~l~~~e~~~~~~e~~~~~~~~~~~~~~e~r~e~~a~~~~~~~~ 134 (543) T protein:vir:81 55 VHARMEQIAELDKPTDEENEEFRALGAEFDSLVNHMSRLERAAELARVRSTHEQIGKPQSGGQRRMRVEAGSSQGGRGDY 134 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHhhHHH Confidence 233333333222110 11111111110000000000000 0000000000000 0 Q ss_pred -HHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhc Q lcl|NC_010583. 53 -SKA------VGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYG 125 (458) Q Consensus 53 -~~~------~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ 125 (458) .+. ..+.+.+.....++++.+.+....... +...+....+..+..+........................ T Consensus 135 ~~~~~~~~~~l~e~~~~~~~~~~e~k~~~e~~~~e~~---e~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~~~ 211 (543) T protein:vir:81 135 DRDAILEPDSIEDCRFRDPWNLSEMRTFGRDAEEVKG---ELRARALSAIEKMQGASDNVRAAATKIIERFDDEDSTLAR 211 (543) T ss_pred HHhhhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 000 000111111111111111111100000 0000000111111111111111000000000000000000 Q ss_pred chhhhhhHHHHHHHHHhhhccch--hHHHHHHHHhhhhhcccccccCccccchhHHHHHH-HHHHhccchhhhcceeeec Q lcl|NC_010583. 126 TQDAFEDEVEKLVLLSYMMEKDV--FETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRII-RDLQKELVVGALFDELPMS 202 (458) Q Consensus 126 ~~~~~~~~~~~~a~~~~~~~~~~--~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii-~~~~~~~~l~~~~~~~~~~ 202 (458) ..........+.++.+.+++... ...............+.+.+.||++||+++.+.|| ..++..++|+++++++++ T Consensus 212 ~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~- 290 (543) T protein:vir:81 212 QCLATSSPAYLRAWSKMARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA- 290 (543) T ss_pred hhhhhhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccC- Confidence 11111122233333333332211 11111111222223345667789999999998876 667889999999998765 Q ss_pred cCceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 203 SKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVS 282 (458) Q Consensus 203 ~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~ 282 (458) ++.+.+|+..+++.++|++|++.. ++++++|++|++++++++++++||+++++|+ +++.+||...|+++++++ T Consensus 291 ~g~~~~~~~~~~~~a~~v~Eg~~~------~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~ 363 (543) T protein:vir:81 291 TGDVWHGVSSAAVQWSWDAEFEEV------SDDSPEFGQPEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDEL 363 (543) T ss_pred CcceEEEEecCCcceeecccCccc------cccccccceeeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHH Confidence 567899999999999999987644 4678999999999999999999999999998 599999999999999999 Q ss_pred HHHHHhccCCCC-ccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccc Q lcl|NC_010583. 283 IEEAFMSGNGTG-QPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEE 361 (458) Q Consensus 283 ~d~~~l~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~ 361 (458) +|.+||+|+|++ +|.||++.......... .......++.++.+++..+.+.|..++.|+||+.++..|.+++|++ T Consensus 364 ~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~ 439 (543) T protein:vir:81 364 EAVTLTTGTGQGNQPTGIVTALAGTAAEIA----PVTAETFALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQG 439 (543) T ss_pred HHHHHhccCCCCcccccchhhccccccccc----ccccccccHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCC Confidence 999999999985 89999987654332221 2223345677888999999999999999999999999999999999 Q ss_pred cccccccccccccccccCCeeecccceeccccccc-----ccCCceEEEEEeceEEEEecceeEEeeccc------ccCC Q lcl|NC_010583. 362 WQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAK-----AASAEFAVIVYKDNFVMPRQRAVTVERERQ------AGKQ 430 (458) Q Consensus 362 ~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~-----~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~------~~~~ 430 (458) |+|+|++. ..+.+++|+|+||+++++||.. .++...+++|+|++|.|+++.+++|..++| +.+| T Consensus 440 G~~l~~~~-----~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~ 514 (543) T protein:vir:81 440 GAGLWTTI-----GNGEPSQLLGRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGTNRRPNG 514 (543) T ss_pred CceeccCc-----CCCCCccccceeeEEeccccccccccccCCcceEEEeeccceeEEeecccEEEEeccccccchhhcC Confidence 99998643 2345678999999999999853 345556778899999999999999887654 3468 Q ss_pred ceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 431 RDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 431 ~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ++.|+++.|+|+++.+|+||+++++++| T Consensus 515 ~~~~~~~~r~d~~v~~~~A~~~l~~~~~ 542 (543) T protein:vir:81 515 SRGWFAYYRMGADVVNPNAFRLLNVETA 542 (543) T ss_pred ceEEEEEEeeccEeecccceEEEEeccc Confidence 9999999999999999999999999999 No 6 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=7.5e-63 Score=361.18 Aligned_cols=394 Identities=15% Similarity=0.164 Sum_probs=276.1 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 3 IDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAE 82 (458) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e 82 (458) |.|.+|+++.. .+.++.+.+.++.+ . ...++|.++++++...+++.+.+++++..+ T Consensus 1 M~l~eL~e~r~--~l~~e~~~l~~k~~----~------------------~~~t~e~~~~~~~~~~e~~~l~~~i~~~e~ 56 (409) T protein:vir:45 1 MKLHELKQKRN--TIATDMRALNEKIG----D------------------NAWTEEQRTEWNKAKSELEALDERIAREEE 56 (409) T ss_pred CCHHHHHHHHH--HHHHHHHHHHHHhh----c------------------CCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55677777654 33344333321100 0 012334555555556666665555443322 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccch--hHHHHHHHHhhh Q lcl|NC_010583. 83 LFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDV--FETEHGKAHIKA 160 (458) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~--~~~~~~~~~~~a 160 (458) ..... ........... +. ...........+.++.+|.++++++.. ...+........ T Consensus 57 ~~~~~----~~~~~~~~~~~---------~~--------~~~~~~~~~~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~ 115 (409) T protein:vir:45 57 LRRQD----QAYIESNEEEQ---------RQ--------NLDPENNSQQDEKRAQVFDKWMRHGASELTSEERKALRELR 115 (409) T ss_pred HHHHH----HHHHhhhhhhh---------cc--------cCCCCCcchhhHHHHHHHHHHHHhhhhhccHHHHHHHHHHh Confidence 11110 00000000000 00 000111122233444556666655322 222332222223 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecC--CCccccccccccccccccccccccc Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPE--AGRATWVDASKFGTDETVGDEVKGQ 238 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~--~~~a~~v~e~~~~~e~~~~~~~~~~ 238 (458) ....++.+.||++||+++.++|++.+++.++|+++|+++|++++...+++..+ ...+.|++|++ ..++++++ T Consensus 116 a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~------~~~~~~~~ 189 (409) T protein:vir:45 116 AQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEVGVLLGENE------EAGEEDTD 189 (409) T ss_pred hccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCccccccccccc------cccccccc Confidence 33445666789999999999999999999999999999999877655444433 24567887765 45567889 Q ss_pred ceeeeeehhhee-eeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC---ccccccccccccccceeecc Q lcl|NC_010583. 239 LTEISFKTYKLA-AKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTG---QPKGLLKLAADDGAKVVTEA 314 (458) Q Consensus 239 f~~v~~~~~k~~-~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~---~p~Gi~~~~~~~~~~~~~~~ 314 (458) |+++++.++|++ ++++||+++|+|+.++|++||.++|++++++++|.+||+|+|++ +|+||++....... T Consensus 190 f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~------ 263 (409) T protein:vir:45 190 FGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQ------ 263 (409) T ss_pred cceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccc------ Confidence 999999999985 67899999999999999999999999999999999999999975 79999976543211 Q ss_pred ccchhhHHHHHHHHHHHhhhhhhhccccee--EechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccc Q lcl|NC_010583. 315 KADGSVLVTAKTISKLRRKLGRHGLKLSKL--VLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEY 392 (458) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~ 392 (458) .......+++++.++...+.+.|..++.| +||+.++..|++++|++|+|+++.... .+.+.+|||+||+++++ T Consensus 264 -~~~~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~----~~~~~~l~G~PV~~~~~ 338 (409) T protein:vir:45 264 -TAAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIV----GVAPASVLNVPYVIDQE 338 (409) T ss_pred -cccccccchHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCCCceeeccCcC----CCCCceecceeeEEecC Confidence 11223355678889999999999888876 669999999999999999999865433 35567999999999999 Q ss_pred ccccccCCceEEEEEeceEEEEecceeEEe--ecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 393 FPAKAASAEFAVIVYKDNFVMPRQRAVTVE--RERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 393 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~--~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ||..+++...+++++++.|.++++.++.+. .++|+.+|++.||++.|+|+++++|+||++++.++| T Consensus 339 ~p~~~~~~~~i~~Gd~~~~~i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s 406 (409) T protein:vir:45 339 IDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKGS 406 (409) T ss_pred cCCccCCccEEEEeehhhhheeeccceEEEEeecccccCCcEEEEEEEEeccEeechhheEEEEeccC Confidence 998888777788899999999998887664 578999999999999999999999999999999988 No 7 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=1.5e-61 Score=354.00 Aligned_cols=408 Identities=14% Similarity=0.070 Sum_probs=266.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELF 84 (458) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~ 84 (458) |..+++-.++.+..... ++.++..+++++.++....+++++.++...+. .+...+.++.+...+.. T Consensus 1 ~~~~~~~~~~~~~~~~~------~el~~~~~e~~~~l~~~~~e~~~~~e~~~~e~--------~~~~~~~~e~~~~~~~l 66 (418) T protein:vir:10 1 MSHMNEPRQFGRKSGGD------SHPEQVLETVTKELKRIGDEVKSAGEKALAEA--------KRAGDLGVETKATVDEL 66 (418) T ss_pred CCCchhHHHHHHHhccH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HhhhhhhHHHHHHHHHH Confidence 22222222211111110 11111122222222222222222222211111 11111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcc Q lcl|NC_010583. 85 AQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGS 164 (458) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~ 164 (458) ..+...+...+..+..... ......... .............+...+..++........+............ T Consensus 67 ~~~~~~l~~~~~~~e~~~~-------~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 137 (418) T protein:vir:10 67 LIKQGELQARLLEAEQKLA-------RGGGSAELE--TPKTLGQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVG 137 (418) T ss_pred HHHHHHHHHHHHHHHHHHh-------hcccccccc--hhhhhhHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhcc Confidence 1222222222211111111 000000000 0000011111223334455555544444444433333334445 Q ss_pred cccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecC-CCcccccccccccccccccccccccceeee Q lcl|NC_010583. 165 SSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPE-AGRATWVDASKFGTDETVGDEVKGQLTEIS 243 (458) Q Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~ 243 (458) ++++.+|.+||+++++.|++.+++.++|+++++++|++++.+.+|+..+ ++.+.|++|++ .+++++++|++|+ T Consensus 138 ~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~------~~~~~~~~f~~v~ 211 (418) T protein:vir:10 138 SGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTVETGFTNNAAAVAEGA------QKPTSDLKFNLKN 211 (418) T ss_pred CCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeEEEEecCCCceeeeccCc------cccccccceeeEE Confidence 5667788899999999999999999999999999999999999999877 57889998875 4456789999999 Q ss_pred eehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC-ccccccccccccccceeeccccchhhHH Q lcl|NC_010583. 244 FKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTG-QPKGLLKLAADDGAKVVTEAKADGSVLV 322 (458) Q Consensus 244 ~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 322 (458) +.+++++++++||+++|+|++ ++++||.++|++++++++|.+||+|+|++ .|.||++.+...... ....... T Consensus 212 ~~~~k~~~~~~is~ell~ds~-~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~------~~~~~~~ 284 (418) T protein:vir:10 212 QPVRTIAHLFKASRQILDDAP-ALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPS------ITLANAT 284 (418) T ss_pred EeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccc------ccccccc Confidence 999999999999999999985 89999999999999999999999999986 599999876543222 1122234 Q ss_pred HHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCce Q lcl|NC_010583. 323 TAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEF 402 (458) Q Consensus 323 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 402 (458) .+.++.+++..+...+..+..|+||+.++..|++++|++|+|+++. +..+.+++|+|+||+++++||.+ . T Consensus 285 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~-----~~~~~~~~l~G~pV~~~~~~p~~-----~ 354 (418) T protein:vir:10 285 PIDKIRLALLQAVLAEFPATGIVLNPIDWASIELTKDSQGRYIVGN-----PVNGTTPRLWNLPVVETQAMTAN-----E 354 (418) T ss_pred cHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccc-----cccCCCceecceeeEEcCCCCCC-----c Confidence 4667888888888899999999999999999999999999999843 23345679999999999999853 4 Q ss_pred EEEEEece-EEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 403 AVIVYKDN-FVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 403 ~~~~~~~~-~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +++++++. |+++++.++++..++ +|.+|++.||++.|+|+++++|+||++++++++ T Consensus 355 ~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~ 415 (418) T protein:vir:10 355 FLVGAFSMAAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPESFVTGALVEQ 415 (418) T ss_pred EEEeeccceEEEEEecceEEEEecccchhhhcCceEEEEEEeeccEEecccceEEEEeccC Confidence 67788875 889999999887654 477999999999999999999999999999998 No 8 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=1.1e-60 Score=349.21 Aligned_cols=429 Identities=17% Similarity=0.097 Sum_probs=249.2 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKE-ELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVG-----EDRKRLEEALDLVKNLDEKSK 78 (458) Q Consensus 5 ~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~-----e~~~~~~~~~~~i~~~~e~~~ 78 (458) |..... +..+.++.++++.+..+.... ..++++...+...+++.+..+... +..++.++..++++++...+. T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~--~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~ 78 (497) T protein:vir:78 1 MPSTAQLEAQGRQLAKSIKDINADETKT--AAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222222 112233344444433222111 111111111222222222111111 111111222222222221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHh Q lcl|NC_010583. 79 KSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHI 158 (458) Q Consensus 79 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 158 (458) +..........+.........+............ ......................+..+... ...... T Consensus 79 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~ 147 (497) T protein:vir:78 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGT-KFDVSFNVSAKAADPGTAAAELMGAFADG----------ETAPAA 147 (497) T ss_pred HHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhh-hhhhhhhhhhhhhhhHHHHHHHHHHHhhh----------hhhHHH Confidence 1100000000000000000000000000000000 00000000000000000000111111111 011111 Q ss_pred hhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCC-Ccccccccccccccccccccccc Q lcl|NC_010583. 159 KAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEA-GRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 159 ~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) ......++++.||++||+++.++||+.+++.++|+++++++|++++.+.||+..++ +.++|++|++ .++++++ T Consensus 148 ~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~------~~~~s~~ 221 (497) T protein:vir:78 148 IGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAG------TYPFSSE 221 (497) T ss_pred HHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCc------ccccccc Confidence 11122445567888999999999999999999999999999999999999998764 6889999875 4456789 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) +|+.|++.+||++++++||+|||+|++ ++++||.++|++++++++|.+||+|+|+++|.||++.+.............+ T Consensus 222 ~f~~i~~~~~k~a~~~~iS~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~ 300 (497) T protein:vir:78 222 EFARVYEQVGKVANALTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGA 300 (497) T ss_pred cceeeEeeeeeeEeecHhHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhh Confidence 999999999999999999999999986 6999999999999999999999999999999999987654322211110000 Q ss_pred ------------------------------------------------hhhHHHHHHHHHHHhhhh-hhhcccceeEech Q lcl|NC_010583. 318 ------------------------------------------------GSVLVTAKTISKLRRKLG-RHGLKLSKLVLIV 348 (458) Q Consensus 318 ------------------------------------------------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 348 (458) .........+...+..+. ..++.+..|+||+ T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~ 380 (497) T protein:vir:78 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNP 380 (497) T ss_pred hhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEch Confidence 001111122222333322 2344556899999 Q ss_pred hHHHHHHhhhcccccccccccccccc--ccccCCeeecccceecccccccccCCceEEEEEece--EEEEecceeEEeec Q lcl|NC_010583. 349 SMDAYYDLLEDEEWQDVAQVGNDAVK--LQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDN--FVMPRQRAVTVERE 424 (458) Q Consensus 349 ~~~~~l~~~~d~~~~~~~~~~~~~~~--~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~--~~i~~~~~~~i~~~ 424 (458) .++..|++++|++|+|+|+....... ..+.+.+|||+||++++.||.+ .+++++|+. |.|+++.+++|..+ T Consensus 381 ~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~-----~~~~Gd~~~~~~~i~~r~~~~v~~~ 455 (497) T protein:vir:78 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG-----TILVGHFAPSVIQTARREGVTMQMT 455 (497) T ss_pred HHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCC-----ceEEeecccceEEEEEecccEEEee Confidence 99999999999999999976543322 2234568999999999999853 346677764 67889999999875 Q ss_pred ----ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 425 ----RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 425 ----~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ++|.+|++.||++.|+|+.|++|+||++++++++ T Consensus 456 ~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:78 456 NSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred cccchhhhcCcEEEEEEEeecceeeccccEEEEEecCC Confidence 4588999999999999999999999999999998 No 9 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=1.1e-60 Score=349.21 Aligned_cols=429 Identities=17% Similarity=0.097 Sum_probs=249.2 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKE-ELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVG-----EDRKRLEEALDLVKNLDEKSK 78 (458) Q Consensus 5 ~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~-----e~~~~~~~~~~~i~~~~e~~~ 78 (458) |..... +..+.++.++++.+..+.... ..++++...+...+++.+..+... +..++.++..++++++...+. T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~--~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~ 78 (497) T protein:vir:10 1 MPSTAQLEAQGRQLAKSIKDINADETKT--AAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222222 112233344444433222111 111111111222222222111111 111111222222222221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHh Q lcl|NC_010583. 79 KSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHI 158 (458) Q Consensus 79 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 158 (458) +..........+.........+............ ......................+..+... ...... T Consensus 79 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~ 147 (497) T protein:vir:10 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGT-KFDVSFNVSAKAADPGTAAAELMGAFADG----------ETAPAA 147 (497) T ss_pred HHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhh-hhhhhhhhhhhhhhhHHHHHHHHHHHhhh----------hhhHHH Confidence 1100000000000000000000000000000000 00000000000000000000111111111 011111 Q ss_pred hhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCC-Ccccccccccccccccccccccc Q lcl|NC_010583. 159 KAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEA-GRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 159 ~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) ......++++.||++||+++.++||+.+++.++|+++++++|++++.+.||+..++ +.++|++|++ .++++++ T Consensus 148 ~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~------~~~~s~~ 221 (497) T protein:vir:10 148 IGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAG------TYPFSSE 221 (497) T ss_pred HHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCc------ccccccc Confidence 11122445567888999999999999999999999999999999999999998764 6889999875 4456789 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) +|+.|++.+||++++++||+|||+|++ ++++||.++|++++++++|.+||+|+|+++|.||++.+.............+ T Consensus 222 ~f~~i~~~~~k~a~~~~iS~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~ 300 (497) T protein:vir:10 222 EFARVYEQVGKVANALTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGA 300 (497) T ss_pred cceeeEeeeeeeEeecHhHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhh Confidence 999999999999999999999999986 6999999999999999999999999999999999987654322211110000 Q ss_pred ------------------------------------------------hhhHHHHHHHHHHHhhhh-hhhcccceeEech Q lcl|NC_010583. 318 ------------------------------------------------GSVLVTAKTISKLRRKLG-RHGLKLSKLVLIV 348 (458) Q Consensus 318 ------------------------------------------------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 348 (458) .........+...+..+. ..++.+..|+||+ T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~ 380 (497) T protein:vir:10 301 TSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNP 380 (497) T ss_pred hhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEch Confidence 001111122222333322 2344556899999 Q ss_pred hHHHHHHhhhcccccccccccccccc--ccccCCeeecccceecccccccccCCceEEEEEece--EEEEecceeEEeec Q lcl|NC_010583. 349 SMDAYYDLLEDEEWQDVAQVGNDAVK--LQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDN--FVMPRQRAVTVERE 424 (458) Q Consensus 349 ~~~~~l~~~~d~~~~~~~~~~~~~~~--~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~--~~i~~~~~~~i~~~ 424 (458) .++..|++++|++|+|+|+....... ..+.+.+|||+||++++.||.+ .+++++|+. |.|+++.+++|..+ T Consensus 381 ~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~-----~~~~Gd~~~~~~~i~~r~~~~v~~~ 455 (497) T protein:vir:10 381 RDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG-----TILVGHFAPSVIQTARREGVTMQMT 455 (497) T ss_pred HHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCC-----ceEEeecccceEEEEEecccEEEee Confidence 99999999999999999976543322 2234568999999999999853 346677764 67889999999875 Q ss_pred ----ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 425 ----RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 425 ----~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ++|.+|++.||++.|+|+.|++|+||++++++++ T Consensus 456 ~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:10 456 NSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred cccchhhhcCcEEEEEEEeecceeeccccEEEEEecCC Confidence 4588999999999999999999999999999998 No 10 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=1.2e-61 Score=354.55 Aligned_cols=381 Identities=17% Similarity=0.175 Sum_probs=256.8 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 3 IDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSK-----AVGEDRKRLEEALDLVKNLDEKS 77 (458) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~-----~~~e~~~~~~~~~~~i~~~~e~~ 77 (458) ||.-++++.. +++++..++++.+.++ .++|.++++++...+++.+.+++ T Consensus 1 m~~~~l~~l~--------------------------e~r~~~~~e~~~L~~~~~~~~lt~e~~~~~~~l~~e~~~l~~~i 54 (390) T protein:vir:62 1 MDATTLSANF--------------------------EARERATAELRTLTDEFAGKEMTDEAREKEERLITAVSDYDARI 54 (390) T ss_pred CChhHHHHHH--------------------------HHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHH Confidence 3332222221 1111112222222211 23345555555555555555554 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHH Q lcl|NC_010583. 78 KKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAH 157 (458) Q Consensus 78 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~ 157 (458) ++..+.... . . ............. . .......... ..+++.+.... .+.... T Consensus 55 ~~~~~~~~~----~-------~-~~~~~~~~~~~~~---~-----~~~~~~~~~~-------~~~~r~~~~~~-~r~~~~ 106 (390) T protein:vir:62 55 KRGIEAIKA----I-------D-PVTSLLSGLQGSG---S-----GAQRSADVDD-------DATLRAGNLGE-ARSFEF 106 (390) T ss_pred HHHHHHHHH----H-------H-HHHHHHhhccccc---c-----cchhhcchHH-------HHHHhhhhhhh-hHHHHh Confidence 432221110 0 0 0000000000000 0 0000000000 01111111100 000111 Q ss_pred hhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccC-ceEEEEecCCCccccccccccccccccccccc Q lcl|NC_010583. 158 IKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSK-ILTMLVEPEAGRATWVDASKFGTDETVGDEVK 236 (458) Q Consensus 158 ~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~ 236 (458) .......++..+|++++|+.+...|++.++..++++++|++++++++ .+.+|+.++.+.+.|++|++. .++++ T Consensus 107 ~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~------~~~~~ 180 (390) T protein:vir:62 107 APEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSASIVGETAE------IPESY 180 (390) T ss_pred hhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeeccccc------ccccc Confidence 11111233444556666666556666778888889999999998765 478999999999999998764 44578 Q ss_pred ccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeecccc Q lcl|NC_010583. 237 GQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKA 316 (458) Q Consensus 237 ~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~ 316 (458) ++|++|++++++++++++||+|+|+||.+++++||..+|+++++.++|.+||+|+| +|.||++........... T Consensus 181 ~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G--~p~Gi~~~~~~~~~~~~~---- 254 (390) T protein:vir:62 181 PATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTG--QPRGILTDASPATATFLA---- 254 (390) T ss_pred cceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCC--ccccccccccccccceec---- Confidence 99999999999999999999999999999999999999999999999999999987 699999876543322222 Q ss_pred chhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccccccc Q lcl|NC_010583. 317 DGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAK 396 (458) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~ 396 (458) ......++++++++.+++.+.|..++.|+||+.++..|++++|.+|+|+|+++.. .+.+.+|+|+||++++++|. T Consensus 255 ~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~----~g~~~~l~G~Pv~~~~~~p~- 329 (390) T protein:vir:62 255 TDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQYLWQSGLT----VGAPSLFNGKVVETDDGMPA- 329 (390) T ss_pred ccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCCeeecCCcC----CCccceecccceEEecCCCC- Confidence 2223456778899999999999999999999999999999999999999976543 35567999999999999985 Q ss_pred ccCCceEEEEEeceEEEEecceeEEee--cccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 397 AASAEFAVIVYKDNFVMPRQRAVTVER--ERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 397 ~~~~~~~~~~~~~~~~i~~~~~~~i~~--~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..++++++++|.++++.++++.. +.+|.+|++.||++.|+|+++++|+||+++++++| T Consensus 330 ----~~i~~gd~s~~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~~ 389 (390) T protein:vir:62 330 ----DKILFADLSKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPG 389 (390) T ss_pred ----ccEEEeeccceeEEeecceEEEeeccccccCCcEEEEEEEEeCcEeechhheEEEEeecC Confidence 34667999999999999998875 67899999999999999999999999999999999 No 11 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=2.8e-60 Score=347.11 Aligned_cols=411 Identities=13% Similarity=0.195 Sum_probs=255.8 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 3 IDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAE 82 (458) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e 82 (458) |.|..++....++.+..++.+|+... .++.++++.....++. ...++..+.+++.++.++....++.+..+ T Consensus 1 ~~~~~~~~~~el~~~~~~l~el~~~~------~el~~~~~el~~~~e~---ak~eee~~~l~~ei~~le~e~~~l~~~~~ 71 (425) T protein:vir:95 1 MALRQLMLTKKIEQRKAALDELVKRE------QELQAKAAELEQAIEE---AQTEEEVSAVEEEVAKLEDERNELNEKKS 71 (425) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHH---hhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66666655444445555555443331 1111111111100100 01111111112222222221111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhh Q lcl|NC_010583. 83 LFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVN 162 (458) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~ 162 (458) ....+...+.+++..+..... ..... +........ .....+..+.+.+................... T Consensus 72 ~le~~~~~~~~~l~~~~~~~~----~~~~~--------~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 138 (425) T protein:vir:95 72 KLEGEIAQLEDELEQINSKQP----SNQSR--------QKMQGSKGD-VVEMNRLQVREMLKTGEYYKRSEVVEFYEKFR 138 (425) T ss_pred HHHHHHHHHHHHHHHhhhhcc----chhhh--------hhhhhhhhh-HHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHH Confidence 111111111111111110000 00000 000000000 00001111111111111111111111112222 Q ss_pred cccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccceee Q lcl|NC_010583. 163 GSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEI 242 (458) Q Consensus 163 ~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v 242 (458) ..+++++||++||+++.+.|++.+++.++|+++++++|++ +...+|+..+.+.++|++|++..++ .+.++|++| T Consensus 139 ~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-g~~~ip~~~~~~~a~~v~E~~~~~~-----~~~~~f~~i 212 (425) T protein:vir:95 139 NLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVK-GTTRILVDTDTSPATWIEQSGALPT-----GDVGTIASI 212 (425) T ss_pred hhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecC-ceeEEEEecCCcccccccccccccc-----cccccccee Confidence 3345567899999999999999999999999999999975 5679999999999999999865443 234789999 Q ss_pred eeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCC--Cccccccccccccccceeeccccchhh Q lcl|NC_010583. 243 SFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGT--GQPKGLLKLAADDGAKVVTEAKADGSV 320 (458) Q Consensus 243 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~--~~p~Gi~~~~~~~~~~~~~~~~~~~~~ 320 (458) ++++++++++++||+++|+|++++|++||.++|++++++++|.+||+|+|+ ++|.||++........+ ...+ T Consensus 213 ~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~------~~~~ 286 (425) T protein:vir:95 213 DFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVT------VEAD 286 (425) T ss_pred eeeheeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccccccc------cccc Confidence 999999999999999999999999999999999999999999999999986 48999998654433221 1223 Q ss_pred HHHHHHHHHHHhhhhhhhc--ccceeEechhHH----HHHHhhhccccccccccccccccccccCCeeecccceeccccc Q lcl|NC_010583. 321 LVTAKTISKLRRKLGRHGL--KLSKLVLIVSMD----AYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFP 394 (458) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~----~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~ 394 (458) ..++.++.++...+...+. .+..|+||+.++ ..+..++|.+|+|+++.+. +..++|+|+||++++.+| T Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~~------~~~~~l~G~pvv~~~~~~ 360 (425) T protein:vir:95 287 NNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLPN------LRTPDLLGLRVVFNNFLD 360 (425) T ss_pred cchHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeeccCC------CCCccccceeeEEcCcCC Confidence 3455667777766666553 566799999875 3467789999999986432 234689999999999998 Q ss_pred ccccCCceEEEEEeceEEEEecceeEEee--cccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 395 AKAASAEFAVIVYKDNFVMPRQRAVTVER--ERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 395 ~~~~~~~~~~~~~~~~~~i~~~~~~~i~~--~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .. .+++|++++|.++++.++++.+ +.+|.+|++.||++.|+|+++++|+||+++++++. T Consensus 361 ~~-----~i~~Gd~~~~~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~ 421 (425) T protein:vir:95 361 DD-----TVLFGEFEQYTLVERENITIDSSTHVKFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDP 421 (425) T ss_pred Cc-----cEEEEecccEEEEeecceEEEeecccccccCceEEEEEEeeCcEeecccceEEEEecCc Confidence 53 4677899999999999888765 56789999999999999999999999999999997 No 12 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=2e-61 Score=353.35 Aligned_cols=382 Identities=17% Similarity=0.153 Sum_probs=260.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 25 TAAQKAAEAKRLREEQEEKELARMNDLVSKA-----VGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQ 99 (458) Q Consensus 25 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~-----~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~ 99 (458) |+. ...+++.+++++..++++.+.++. ++|.++++++...+++.+.+.+++..+..+... ... T Consensus 1 m~~----~~l~~l~e~r~~~~~e~~~l~~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~--------~~~ 68 (392) T protein:vir:13 1 MDA----TTLSANFEARERATAELRSLTDEFAGKEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATD--------AVT 68 (392) T ss_pred CCH----HHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHH Confidence 111 122344444555555555555433 234444455555555555554443222211100 000 Q ss_pred HHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHH Q lcl|NC_010583. 100 DEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFS 179 (458) Q Consensus 100 ~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~ 179 (458) ...... +.. . ........ ..+..+++.+...+. +........ ..++.+++|.++|+++. T Consensus 69 ~~~~~~----~~~---------~--~~~~~~~~----~~~~~~~r~g~~~~~-~~~~~~~~~-~~~t~~~~g~~~~~~~~ 127 (392) T protein:vir:13 69 SLLSGL----QGS---------G--SGAQRSAD----HDDDAVLRAGNLGEA-RSFEFAPEK-RDGTKAGNPNVLSRTLY 127 (392) T ss_pred HHhccc----CCc---------c--cchhhhhh----HHHHHHHhccchhhh-HHHHhhhhh-hcccccCCCccccccch Confidence 000000 000 0 00000000 001111222111110 101111111 12233344445566666 Q ss_pred HHHHHH-HHhccchhhhcceeeeccC-ceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhH Q lcl|NC_010583. 180 TRIIRD-LQKELVVGALFDELPMSSK-ILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITD 257 (458) Q Consensus 180 ~~ii~~-~~~~~~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ 257 (458) ..+|.. +...+++++++++++++++ .+.+|+..+.+.++|++|++. +++++++|++|++.++|++++++||+ T Consensus 128 ~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~------~~~~~~~f~~v~~~~~k~~~~~~iS~ 201 (392) T protein:vir:13 128 GQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGETAE------IPESYPATTQRSMGGFKYGFASVVSY 201 (392) T ss_pred HHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeeccccc------ccccccceeeEEeeeeeEEeeehhHH Confidence 666654 5556678888999988655 478999999999999998764 45578999999999999999999999 Q ss_pred HHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhh Q lcl|NC_010583. 258 ETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRH 337 (458) Q Consensus 258 ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 337 (458) ++|+|+.++|++||.++|++++++++|.+||+|||+++|.||++......... ........++++++++.+.+.+. T Consensus 202 ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~~----~~~~~~~~~~d~l~~~~~~l~~~ 277 (392) T protein:vir:13 202 EFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAAF----GEADADSKVSDALIDLFHEVPSA 277 (392) T ss_pred HHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccccccccc----cccccccccHHHHHHHHHhhhhh Confidence 99999999999999999999999999999999999999999998765432221 11223445677888999999999 Q ss_pred hcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecc Q lcl|NC_010583. 338 GLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQR 417 (458) Q Consensus 338 ~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 417 (458) |+.++.|+||+.++..|.+++|++|+|+|++.. ..+.+.+|+|+||++++++|+ +.+++++|++|.++++. T Consensus 278 ~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~----~~g~~~~l~G~Pv~~~~~~~~-----~~i~~Gdf~~~~i~~~~ 348 (392) T protein:vir:13 278 YRKNAKFVVNDLRAAQMRKLKDANGQYLWQSAL----TVGAPDTFNGKVVETDDGMPA-----DKVLFADLSKYRVRFAG 348 (392) T ss_pred hhcCCEEEEcHHHHHHHHHhhccCCceeecCCc----CCCCCceecceeeEEcCCCCC-----CcEEEeeccceeEEeec Confidence 999999999999999999999999999986543 335567999999999999985 34678999999999999 Q ss_pred eeEEee--cccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 418 AVTVER--ERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 418 ~~~i~~--~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ++++.+ +.||.+|++.||++.|+|+++++|+||+++++++| T Consensus 349 ~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~a 391 (392) T protein:vir:13 349 SLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPA 391 (392) T ss_pred ceEEEeeccccccCCcEEEEEEEEeccEEecccceEEEEeecc Confidence 998865 67899999999999999999999999999999999 No 13 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=8.9e-61 Score=349.81 Aligned_cols=421 Identities=13% Similarity=0.084 Sum_probs=262.7 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 3 IDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAE 82 (458) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e 82 (458) |.|+++++++..+ ..+....+..+........+..++...+.++++...+. ++++++.+++..+...+ T Consensus 1 M~l~el~~~~~~~-~~~~~a~l~~~~~~~~~~~ee~~~~~~e~~~l~~~~~~--------l~~~i~~le~~~~~~~~--- 68 (434) T protein:vir:62 1 MNLKEILNASLTR-TKSRLAELQGKVEKNEVRSEELAAVKAEVEQLTKEIQT--------ISEELAKLEEKEKEEDP--- 68 (434) T ss_pred CCHHHHHHHHHHH-HHHHHHHHHHHHhccCccHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHH--- Confidence 7788887766632 11111111111111000000000011111111111111 11111111111110000 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhhhhhhhh-hcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhh Q lcl|NC_010583. 83 LFAQTVEKQQETIVGL-QDEIKSLLAAREGRSFVGDSVAKAL-YGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKA 160 (458) Q Consensus 83 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~e~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a 160 (458) ................ ...........+.+........... ..........+.+.+|.++++.+.. ..+ .++ T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~-~~e-----~~a 142 (434) T protein:vir:62 69 AKKKDDDPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNID-EKE-----ARA 142 (434) T ss_pred HhhhcchhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccc-hhh-----hhh Confidence 0000000000000000 0000000111111111111111111 1111222333456677777654322 111 112 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccce Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLT 240 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~ 240 (458) . +.++++||++||+++.+.|++.++++++|+++|++++++ +..++|+...++.+.|..+. .+++..+.++++|+ T Consensus 143 ~--~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-~~~~~p~~~~~~~a~~~~~~---~e~~~~~~~~~~f~ 216 (434) T protein:vir:62 143 L--GLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTK-ENIKYPVLVKKAEAQGHKNE---RTNNEMPETDIEFD 216 (434) T ss_pred h--cccccccceecchhhHHHHHHhhhhhhhhhhhcceeccC-CceEEEEEecCCcccceecc---ccccccccccccee Confidence 2 234456899999999999999999999999999999876 46789999988888887543 34556778899999 Q ss_pred eeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcc-ccccccccccccceeeccccchh Q lcl|NC_010583. 241 EISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQP-KGLLKLAADDGAKVVTEAKADGS 319 (458) Q Consensus 241 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p-~Gi~~~~~~~~~~~~~~~~~~~~ 319 (458) +|++.+|+++++++||+++|+|+.++|++||.++|++++++++|.+||+|+|+++| .|+++..... .... T Consensus 217 ~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~---------~~~~ 287 (434) T protein:vir:62 217 EIELSPTEFDALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVE---------FKTD 287 (434) T ss_pred eEEeeheeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeeccccc---------cccc Confidence 99999999999999999999999999999999999999999999999999999875 5665432211 1122 Q ss_pred hHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccC Q lcl|NC_010583. 320 VLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAAS 399 (458) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~ 399 (458) ....+++++++.+.+.+.|+.++.|+||+.++..|++++|++|+|+|++... ...+.+.+|+|+||++++++|.+.++ T Consensus 288 ~~~~~d~l~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~--~~~g~~~tl~G~pV~~~~~~~~~~~~ 365 (434) T protein:vir:62 288 EKNLYDALVKMKNTPVKEVRKKARWVLNTAALTKIETMKTDDGFPLLRPFNQ--AEGGIGYTLLGFPVEEEDAIDIPDSP 365 (434) T ss_pred ccchhhHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCEeeccCCC--ccCCCCceecceeeEEecCccCccCC Confidence 3356788999999999999999999999999999999999999999976433 23355678999999999999976655 Q ss_pred Cce-EEEEEeceEEEEecce-eEE--eecccccCCceEEEEEEeeccEEec-ccceEEE--EeecC Q lcl|NC_010583. 400 AEF-AVIVYKDNFVMPRQRA-VTV--ERERQAGKQRDAYYVTQRVNLQRYF-ENGVVSG--AYAAA 458 (458) Q Consensus 400 ~~~-~~~~~~~~~~i~~~~~-~~i--~~~~~~~~~~~~~~~~~r~d~~~~~-~~afv~l--~~aaa 458 (458) ... +++++|++|+|+++.+ +++ ..+.|+.+|+|.||++.|+|+++++ |.+++++ +.++| T Consensus 366 ~~~~i~~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~ 431 (434) T protein:vir:62 366 DTPVFYFGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAP 431 (434) T ss_pred CceEEEEeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEeccC Confidence 444 4568899998988754 444 4578899999999999999999886 8776655 32333 No 14 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=1e-60 Score=349.48 Aligned_cols=397 Identities=13% Similarity=0.130 Sum_probs=253.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSK------AVGEDRKRLEEALDLVKNLDEKSK 78 (458) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~------~~~e~~~~~~~~~~~i~~~~e~~~ 78 (458) |.++++ +++++.+..++++++.++ ..+++.+++++...+++.+.++++ T Consensus 1 M~kl~~--------------------------L~e~r~~l~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~e~~~l~~~i~ 54 (428) T protein:vir:10 1 MPQIEE--------------------------LRRQRAGINEQIQALATIEATNGTLTAEQLTEFAGLQQQFTDISAKMD 54 (428) T ss_pred CchHHH--------------------------HHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHH Confidence 333222 122222222222222221 112233333444444444443333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccch-----hHHHH Q lcl|NC_010583. 79 KSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDV-----FETEH 153 (458) Q Consensus 79 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~-----~~~~~ 153 (458) +.....+. ................. ....+.. ........+........++.. ..... T Consensus 55 ~~e~~e~~-~~~~~~~~~~~~~~~~~-~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 117 (428) T protein:vir:10 55 RMEATERA-AALVAKPVKATQHGPAV-IVKAEPK---------------QYTGAGMTRMVMSIAAAQGNLQDAAKFASDE 117 (428) T ss_pred HHHHHHHH-HHHHhhhhhchhhcccc-ccccccc---------------hhhhHHHHHHHHHHHHhhhhHHHHHHHhhhh Confidence 22111000 00000000000000000 0000000 000000000000000000000 00000 Q ss_pred HHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhh-cceeeeccCceEEEEecCCCccccccccccccccccc Q lcl|NC_010583. 154 GKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGAL-FDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVG 232 (458) Q Consensus 154 ~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~ 232 (458) ...........++++.||++||+++.++||+.+++.++|+++ ++++|++++..++|+.++++.++|++|++ .+ T Consensus 118 ~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g~~~~p~~~~~~~a~~v~Eg~------~~ 191 (428) T protein:vir:10 118 LNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNGNMSLPRLAGGATASYTGENQ------DA 191 (428) T ss_pred hhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCcceEEEEEeCCcceeeeccCc------cc Confidence 000111112233445688999999999999999999999999 68899999999999999999999999875 44 Q ss_pred ccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC-cccccccccccccccee Q lcl|NC_010583. 233 DEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTG-QPKGLLKLAADDGAKVV 311 (458) Q Consensus 233 ~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~-~p~Gi~~~~~~~~~~~~ 311 (458) ++++++|++|++.+++++++++||+|+|+||.++|++||.++|++++++++|.+||+|+|++ +|+||++.+...+.... T Consensus 192 ~~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~ 271 (428) T protein:vir:10 192 KVSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLP 271 (428) T ss_pred cccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccc Confidence 56789999999999999999999999999999999999999999999999999999999985 89999987765443332 Q ss_pred ecc--ccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeeccccee Q lcl|NC_010583. 312 TEA--KADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVV 389 (458) Q Consensus 312 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~ 389 (458) ... ..+......+.+...+.......+..++.|+||+.++..|.+++|++|+|+++.. ..++|+|+||++ T Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~--------~~g~l~G~pv~~ 343 (428) T protein:vir:10 272 WAADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRDGNGNKVYPEM--------AQGMLKGYPIQR 343 (428) T ss_pred ccccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhccCCceeccCC--------CCCeeeceeeEE Confidence 222 2222222233344444455555667788999999999999999999999998532 235899999999 Q ss_pred cccccccc---cCCceEEEEEeceEEEEecceeEEeeccc-------------ccCCceEEEEEEeeccEEecccceEEE Q lcl|NC_010583. 390 SEYFPAKA---ASAEFAVIVYKDNFVMPRQRAVTVERERQ-------------AGKQRDAYYVTQRVNLQRYFENGVVSG 453 (458) Q Consensus 390 ~~~~~~~~---~~~~~~~~~~~~~~~i~~~~~~~i~~~~~-------------~~~~~~~~~~~~r~d~~~~~~~afv~l 453 (458) ++++|... .+...++++++++|.+++++++++..+++ |..|++.||++.|+|+++.+|+||+++ T Consensus 344 ~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~ 423 (428) T protein:vir:10 344 TSAIPANLGEGGKESEIYFADFNDVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLG 423 (428) T ss_pred eccccccccCCCccceEEEEecceEEEEEecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEE Confidence 99998632 23445778899999999999999987654 678999999999999999999999998 Q ss_pred EeecC Q lcl|NC_010583. 454 AYAAA 458 (458) Q Consensus 454 ~~aaa 458 (458) +-..= T Consensus 424 t~~~~ 428 (428) T protein:vir:10 424 TGVLF 428 (428) T ss_pred eccCC Confidence 86666 No 15 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=1.3e-59 Score=343.43 Aligned_cols=433 Identities=15% Similarity=0.092 Sum_probs=249.4 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRK-RLEEALDLVKNLDEKSKK 79 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~-~~~~~~~~i~~~~e~~~~ 79 (458) |.=+++.++. ...+++++++...++++.+.++...+.++ ..++..++++...++++. T Consensus 1 ~~k~~eem~~----------------------~i~eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ 58 (477) T protein:vir:84 1 MEKHLEELRA----------------------LRAAAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKA 58 (477) T ss_pred CchHHHHHHH----------------------HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHH Confidence 1111111111 12222223333333333333222211111 111111111111111111 Q ss_pred HHHHHH---HHHHHHHH---HHHHHHHHHHHH-------HHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhcc Q lcl|NC_010583. 80 SAELFA---QTVEKQQE---TIVGLQDEIKSL-------LAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEK 146 (458) Q Consensus 80 ~~e~~~---~~~~~~~~---~~~~~~~~~~~~-------~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~ 146 (458) ..+..+ +.++++.. .......+.... ...............+......................... T Consensus 59 ei~~le~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 138 (477) T protein:vir:84 59 ELDKVEDLDEQIRELESEIERSGKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDV 138 (477) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhh Confidence 110000 00000000 000000000000 00000000000000000000000000000000000000000 Q ss_pred ch---hHHHHHHHHhhhhhcccccccCccccchh-HHHHHHHHHHhccchhhhcceeeec--cCceEEEEecCCC-cccc Q lcl|NC_010583. 147 DV---FETEHGKAHIKAVNGSSSVSMSSEAYETI-FSTRIIRDLQKELVVGALFDELPMS--SKILTMLVEPEAG-RATW 219 (458) Q Consensus 147 ~~---~~~~~~~~~~~a~~~~~~~~~g~~~ip~~-~~~~ii~~~~~~~~l~~~~~~~~~~--~~~~~~p~~~~~~-~a~~ 219 (458) .. ............. .+++++.||++||++ +.+.|++.+++.++|+++++++|++ ++.+.+|+..+++ .+.| T Consensus 139 ~~~~~~~~~~~~~~~~~~-~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~ 217 (477) T protein:vir:84 139 ESDKEIRKIAKVGEEYRD-LDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQ 217 (477) T ss_pred hhhhhHHHHHHhhhhhcc-ccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeee Confidence 00 0000011111111 233444566666655 5688999999999999999888765 4567899876664 4678 Q ss_pred cccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-Ccccc Q lcl|NC_010583. 220 VDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGT-GQPKG 298 (458) Q Consensus 220 v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~-~~p~G 298 (458) ++|++..+. ..+++++++|++|+++++|++++++||+++|+||.+++++||.++|+++++.++|.+||+|+|+ ++|.| T Consensus 218 ~~Eg~~~~~-~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~G 296 (477) T protein:vir:84 218 AADNAALTA-PSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVG 296 (477) T ss_pred eccCccccc-ccccccccceeeEEEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccce Confidence 999876654 3577889999999999999999999999999999999999999999999999999999999996 58999 Q ss_pred ccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhccc-ceeEechhHHHHHHhhhcccccccccccccc----- Q lcl|NC_010583. 299 LLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKL-SKLVLIVSMDAYYDLLEDEEWQDVAQVGNDA----- 372 (458) Q Consensus 299 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~----- 372 (458) |++......................+..++++...+...+..+ ..|+|||.++.+|++++|++|+|+|++.... T Consensus 297 i~~~~~~~~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~ 376 (477) T protein:vir:84 297 VRATAGITQVTATSAGSALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLG 376 (477) T ss_pred eeeccccccccccccccchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccc Confidence 9987655433332222223334456677788887777777654 4799999999999999999999999765432 Q ss_pred ----ccccccCCeeecccceeccccccc-cc--CCceEEEEEeceEEEEecceeEE--eecccccCCceEEEEEEeeccE Q lcl|NC_010583. 373 ----VKLQGQVGRIYGLPVVVSEYFPAK-AA--SAEFAVIVYKDNFVMPRQRAVTV--ERERQAGKQRDAYYVTQRVNLQ 443 (458) Q Consensus 373 ----~~~~~~~~~l~G~pv~~~~~~~~~-~~--~~~~~~~~~~~~~~i~~~~~~~i--~~~~~~~~~~~~~~~~~r~d~~ 443 (458) .+..+.+++|+|+||++++++|+. ++ +...+++++|+.++++.+ ++++ ..+.|+.++.+.|++..++++. T Consensus 377 ~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~~-~~~~~~~~~~~~~~~~~~~~v~~~~~~~ 455 (477) T protein:vir:84 377 VLTEVASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFES-SVRMRALQETRAENLSVLLQVYGYLAFT 455 (477) T ss_pred cccccccccccchhcccceEecCcccccccccCCcceEEEEEeceEEEEee-ceeEEeccccccccceeeeeehhhhhhh Confidence 234556789999999999999963 22 334567888888888764 4444 4556788899999998888875 Q ss_pred E-ecccceEEEEeecC Q lcl|NC_010583. 444 R-YFENGVVSGAYAAA 458 (458) Q Consensus 444 ~-~~~~afv~l~~aaa 458 (458) . ++|+|||.+|.+|. T Consensus 456 ~~r~~~afv~~t~~~~ 471 (477) T protein:vir:84 456 AARFPQSVVEIGGTAL 471 (477) T ss_pred hhccccceEEeecccc Confidence 4 56999999999998 No 16 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=2.6e-60 Score=347.25 Aligned_cols=402 Identities=14% Similarity=0.148 Sum_probs=258.3 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 3 IDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAE 82 (458) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e 82 (458) |+|++|+++.. ++.++.+.|..+. .+.+.+.+ +.++++++..++++.+..++++..+ T Consensus 1 M~i~eL~e~r~--~~~~~~~~l~~~~---~e~~~lt~------------------ee~~~~~~l~~ei~~l~~~I~~~e~ 57 (435) T protein:vir:14 1 MNVNELRRERA--AVNQRVQALAQIE---VGGTALSV------------------EQQAEFDQLSSKFSELTAQIERAEA 57 (435) T ss_pred CCHHHHHHHHH--HHHHHHHHHHHHH---hccCCCCH------------------HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88889888775 4455555443221 11111111 1222223333333333333322111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhh--ccchhH-------HHH Q lcl|NC_010583. 83 LFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMM--EKDVFE-------TEH 153 (458) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~--~~~~~~-------~~~ 153 (458) ..+. ........ +......... ... ... ......+.......+ +...+. ++.... ... T Consensus 58 ~~~~-~~~~~~~~----~~~~~~~~~~-~~~----~~~-~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 124 (435) T protein:vir:14 58 AERM-AAAAAVPV----DPNPTAVAAP-AAA----PVH-AQPKALEVKGAKMAR--MVRALAAARGDAQLASKLAIERGF 124 (435) T ss_pred HHHH-HHhhcccc----cchhhhhhhc-ccc----ccc-cccchhhhhHHHHHH--HHHHHHhhcchhhHHHHHHHhhhh Confidence 1000 00000000 0000000000 000 000 000000000000000 000000 000000 000 Q ss_pred HHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhh-cceeeeccCceEEEEecCCCccccccccccccccccc Q lcl|NC_010583. 154 GKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGAL-FDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVG 232 (458) Q Consensus 154 ~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~ 232 (458) ... .......++...||++||+++.++|++.+++.++++++ ++++|++++..++|+.++++.++|++|++. + T Consensus 125 ~~~-~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~------~ 197 (435) T protein:vir:14 125 GEE-VAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTD------I 197 (435) T ss_pred hhh-hhhhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCCceEEEEEeCCcceeeeccCcc------c Confidence 011 11223345566789999999999999999999999998 788999999999999999999999988754 4 Q ss_pred ccccccceeeeeehhheeeeehhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhccCCC-Cccccccccccccccc Q lcl|NC_010583. 233 DEVKGQLTEISFKTYKLAAKSFITDETEEDAI--FSLLPLLRKRLIEAHAVSIEEAFMSGNGT-GQPKGLLKLAADDGAK 309 (458) Q Consensus 233 ~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~la~~~~~~~d~~~l~G~g~-~~p~Gi~~~~~~~~~~ 309 (458) ++++++|++|++.+++++++++||+|+|+|+. ++|++||.++|++++++++|.+|++|+|+ ++|.||++........ T Consensus 198 ~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~ 277 (435) T protein:vir:14 198 PTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVI 277 (435) T ss_pred cccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccee Confidence 56789999999999999999999999999984 46999999999999999999999999998 5799999865444332 Q ss_pred eeeccccchhhHHHHHHHHHHHhhhhhh--hcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccc Q lcl|NC_010583. 310 VVTEAKADGSVLVTAKTISKLRRKLGRH--GLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPV 387 (458) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv 387 (458) ...... .......++.+++..+... ++.++.|+||+.++..|++++|++|+|+|+.. ..++|+|+|| T Consensus 278 ~~~~~~---~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~--------~~g~l~G~Pv 346 (435) T protein:vir:14 278 TASDAS---TLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPEL--------ANGMLKGYPV 346 (435) T ss_pred cccccc---chhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhccCCceeccCC--------CCCeeeccee Confidence 222221 1222233445554444432 45678899999999999999999999998421 2358999999 Q ss_pred eecccccccc---cCCceEEEEEeceEEEEecceeEEeeccc-------------ccCCceEEEEEEeeccEEecccceE Q lcl|NC_010583. 388 VVSEYFPAKA---ASAEFAVIVYKDNFVMPRQRAVTVERERQ-------------AGKQRDAYYVTQRVNLQRYFENGVV 451 (458) Q Consensus 388 ~~~~~~~~~~---~~~~~~~~~~~~~~~i~~~~~~~i~~~~~-------------~~~~~~~~~~~~r~d~~~~~~~afv 451 (458) ++++++|... .+...+++++++.|.+++++++++.+++| |.+|++.||++.|+|+++++|+||+ T Consensus 347 ~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~ 426 (435) T protein:vir:14 347 GKTTQVPINLGETGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIA 426 (435) T ss_pred EeeccccccccCCCccceEEEeecccEEEEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceE Confidence 9999999632 23345788999999999999999988765 6689999999999999999999999 Q ss_pred EEEeecC Q lcl|NC_010583. 452 SGAYAAA 458 (458) Q Consensus 452 ~l~~aaa 458 (458) +++-++- T Consensus 427 ~l~~~~~ 433 (435) T protein:vir:14 427 VLAGVAW 433 (435) T ss_pred EEecCCC Confidence 9999888 No 17 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=3.4e-59 Score=341.15 Aligned_cols=400 Identities=11% Similarity=0.055 Sum_probs=256.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELF 84 (458) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~ 84 (458) |+.+++ +.+..++++++.....++++.++.+. ..+..++..++++.+..++++..+.. T Consensus 1 mk~~~e-------------------m~~~l~el~~~~~~~~~e~~~~~~~~---~~e~~~~~~~ev~~l~~~i~~~~~~~ 58 (415) T protein:vir:46 1 MKTKEE-------------------LQSEISDIKRQIDLKVKYATRALNND---ELEKAEKLEQEITDLRSQIQEKQEEL 58 (415) T ss_pred CchHHH-------------------HHHHHHHHHHHHHHHHHHHHHHhchh---hHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222222 11122222222222222222222111 11112222222222222222111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhc Q lcl|NC_010583. 85 AQTVEKQQETIVGLQDEIKSLLAAREGRSFVG-DSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNG 163 (458) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~-~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~ 163 (458) +................. ..+.+.... ..................++..|..+.... ...... T Consensus 59 ----~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~ 122 (415) T protein:vir:46 59 ----DKLKEKDRTSENNQQSVE-VNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETR-----------NDIQGG 122 (415) T ss_pred ----HHHHHHHHhhhhcccccc-cchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhh-----------hhhhhc Confidence 000000000000000000 000000000 000000000000111111222222222111 111122 Q ss_pred ccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEe--cCCCccccccccccccccccccccccccee Q lcl|NC_010583. 164 SSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVE--PEAGRATWVDASKFGTDETVGDEVKGQLTE 241 (458) Q Consensus 164 ~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~--~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~ 241 (458) ..++.+|+.+||+++.+.|++.+++.++|+++++++|++++..++|+. .+...++|++|++..++ .+.++|+. T Consensus 123 ~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~-----~~~~~~~~ 197 (415) T protein:vir:46 123 SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE-----LAVKPFFQ 197 (415) T ss_pred cccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeeccccccccc-----ccccceee Confidence 334567889999999999999999999999999999999888887765 45567889988764443 35789999 Q ss_pred eeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhH Q lcl|NC_010583. 242 ISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVL 321 (458) Q Consensus 242 v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 321 (458) |++.+++++++++||+++++|+.++|++||.++|++++++++|.+|++|+|++.|.++.......... ...... T Consensus 198 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~------~~~~~~ 271 (415) T protein:vir:46 198 LAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK------LEVKKA 271 (415) T ss_pred EEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccce------eccccc Confidence 99999999999999999999999999999999999999999999999999998876665443222111 112233 Q ss_pred HHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCc Q lcl|NC_010583. 322 VTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAE 401 (458) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~ 401 (458) .+++++.++++.+...++.++.|+||+.++..|++++|++|+|++++.. ..+.+++|+|+||++++++|.+.++.. T Consensus 272 ~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~----~~~~~~~l~G~pV~~~~~~~~~~~~~~ 347 (415) T protein:vir:46 272 KSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV----KEKTQQRLLGAKIEILPDEVLGQKGNN 347 (415) T ss_pred cchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCc----CCCCCccccceeeEEeccccccCCCcc Confidence 4567788888898888999999999999999999999999999986533 345568999999999999998777777 Q ss_pred eEEEEEece-EEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 402 FAVIVYKDN-FVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 402 ~~~~~~~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+++++++. |.++++.++++..+++ .++++.+|++.|+|+++++|+||++++++++ T Consensus 348 ~~~~gd~~~~~~~~~~~~~~v~~~~~-~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:46 348 TLIIGNLKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEEEehhccEEEEeecceEEEeecc-ccCceEEEEEEEeccEEeccccEEEEEeecc Confidence 788888885 7889999999988765 5778899999999999999999999999998 No 18 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=3.4e-59 Score=341.15 Aligned_cols=400 Identities=11% Similarity=0.055 Sum_probs=256.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELF 84 (458) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~ 84 (458) |+.+++ +.+..++++++.....++++.++.+. ..+..++..++++.+..++++..+.. T Consensus 1 mk~~~e-------------------m~~~l~el~~~~~~~~~e~~~~~~~~---~~e~~~~~~~ev~~l~~~i~~~~~~~ 58 (415) T protein:vir:47 1 MKTKEE-------------------LQSEISDIKRQIDLKVKYATRALNND---ELEKAEKLEQEITDLRSQIQEKQEEL 58 (415) T ss_pred CchHHH-------------------HHHHHHHHHHHHHHHHHHHHHHhchh---hHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222222 11122222222222222222222111 11112222222222222222111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhc Q lcl|NC_010583. 85 AQTVEKQQETIVGLQDEIKSLLAAREGRSFVG-DSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNG 163 (458) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~-~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~ 163 (458) +................. ..+.+.... ..................++..|..+.... ...... T Consensus 59 ----~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~ 122 (415) T protein:vir:47 59 ----DKLKEKDRTSENNQQSVE-VNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETR-----------NDIQGG 122 (415) T ss_pred ----HHHHHHHHhhhhcccccc-cchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhh-----------hhhhhc Confidence 000000000000000000 000000000 000000000000111111222222222111 111122 Q ss_pred ccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEe--cCCCccccccccccccccccccccccccee Q lcl|NC_010583. 164 SSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVE--PEAGRATWVDASKFGTDETVGDEVKGQLTE 241 (458) Q Consensus 164 ~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~--~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~ 241 (458) ..++.+|+.+||+++.+.|++.+++.++|+++++++|++++..++|+. .+...++|++|++..++ .+.++|+. T Consensus 123 ~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~-----~~~~~~~~ 197 (415) T protein:vir:47 123 SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE-----LAVKPFFQ 197 (415) T ss_pred cccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeeccccccccc-----ccccceee Confidence 334567889999999999999999999999999999999888887765 45567889988764443 35789999 Q ss_pred eeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhH Q lcl|NC_010583. 242 ISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVL 321 (458) Q Consensus 242 v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 321 (458) |++.+++++++++||+++++|+.++|++||.++|++++++++|.+|++|+|++.|.++.......... ...... T Consensus 198 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~------~~~~~~ 271 (415) T protein:vir:47 198 LAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK------LEVKKA 271 (415) T ss_pred EEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccce------eccccc Confidence 99999999999999999999999999999999999999999999999999998876665443222111 112233 Q ss_pred HHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCc Q lcl|NC_010583. 322 VTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAE 401 (458) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~ 401 (458) .+++++.++++.+...++.++.|+||+.++..|++++|++|+|++++.. ..+.+++|+|+||++++++|.+.++.. T Consensus 272 ~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~----~~~~~~~l~G~pV~~~~~~~~~~~~~~ 347 (415) T protein:vir:47 272 KSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV----KEKTQQRLLGAKIEILPDEVLGQKGNN 347 (415) T ss_pred cchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCc----CCCCCccccceeeEEeccccccCCCcc Confidence 4567788888898888999999999999999999999999999986533 345568999999999999998777777 Q ss_pred eEEEEEece-EEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 402 FAVIVYKDN-FVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 402 ~~~~~~~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+++++++. |.++++.++++..+++ .++++.+|++.|+|+++++|+||++++++++ T Consensus 348 ~~~~gd~~~~~~~~~~~~~~v~~~~~-~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:47 348 TLIIGNLKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEEEehhccEEEEeecceEEEeecc-ccCceEEEEEEEeccEEeccccEEEEEeecc Confidence 788888885 7889999999988765 5778899999999999999999999999998 No 19 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=4.6e-59 Score=340.43 Aligned_cols=384 Identities=11% Similarity=0.032 Sum_probs=258.9 Q ss_pred CcchHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELG--LGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSK 78 (458) Q Consensus 1 ~~~~~~~~~~~~~--~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~ 78 (458) |-|.|.+..+++. .+++.++.+.+ ..+. ..+++++..++++.+.++++ T Consensus 1 ~~~~m~k~l~el~~~~~~~~~~~~~~---------------------------~~~~---~~ee~~~~~~e~~~l~~~i~ 50 (397) T protein:vir:12 1 MPMQMSKKEIALRQQFTEKKQQADKA---------------------------LQEG---NTDEARALLDEVKQLKNQIE 50 (397) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHH---------------------------hhhh---hHHHHHHHHHHHHHHHHHHH Confidence 7777665333222 11111221111 1100 01111122222333322222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHH- Q lcl|NC_010583. 79 KSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAH- 157 (458) Q Consensus 79 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~- 157 (458) +..+........... ......... .... ...............++.+|.++++.+......+.... T Consensus 51 ~~~~~~~~~~~~~~~----~~~~~~~~~----~~~~-----~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 117 (397) T protein:vir:12 51 LMTEGRSLDVPDLPG----GVNFVPEQE----RNPE-----GQRSQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDS 117 (397) T ss_pred HHHHHHHHHHHHHHH----Hhhhhhhhh----hhhc-----ccccccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhh Confidence 211111111111111 011110000 0000 00011112223334456677777766554433332221 Q ss_pred -hhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeecc--CceEEEEecCCCccccccccccccccccccc Q lcl|NC_010583. 158 -IKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSS--KILTMLVEPEAGRATWVDASKFGTDETVGDE 234 (458) Q Consensus 158 -~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~--~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~ 234 (458) ......+++++.||++||+++.+.|++.+++.++|+++++++|+++ +.+.+|+..+.+.++|++|++..++ . T Consensus 118 ~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-----~ 192 (397) T protein:vir:12 118 PEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFSPVEELGNLPE-----I 192 (397) T ss_pred hhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCcceeeecccccccc-----c Confidence 1222345567778999999999999999999999999999999875 4556677778888999999864432 3 Q ss_pred ccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeecc Q lcl|NC_010583. 235 VKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEA 314 (458) Q Consensus 235 ~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~ 314 (458) +.++|+.|++.+++++++++||+++++|+.+++++||.+.|++++++++|.+|++|+|+++|.|+++ T Consensus 193 ~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~~~~g~~~------------- 259 (397) T protein:vir:12 193 DQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIASLKKVDIDG------------- 259 (397) T ss_pred ccccceeEEeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc------------- Confidence 5689999999999999999999999999999999999999999999999999999999999988754 Q ss_pred ccchhhHHHHHHHHHHH-hhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccc- Q lcl|NC_010583. 315 KADGSVLVTAKTISKLR-RKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEY- 392 (458) Q Consensus 315 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~- 392 (458) +.++..+. ..+.+.+..++.|+||+.++..|++++|++|+|++++.. ..+.+++|+|+||+++++ T Consensus 260 ---------~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~----~~g~~~~l~G~pv~~~~~~ 326 (397) T protein:vir:12 260 ---------LDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGRYLLQPDP----TNPTKKLLDGRPVVPFTNR 326 (397) T ss_pred ---------HHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccCCceeecccc----cCCCCccccceeeEEeccc Confidence 22344434 467788899999999999999999999999999986543 345668999999997765 Q ss_pred ccccccCCceEEEEEece-EEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 393 FPAKAASAEFAVIVYKDN-FVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 393 ~~~~~~~~~~~~~~~~~~-~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +|..+.+...+++++|+. |.++++.++++..+. +|.+|++.||++.|+|+++++|+||+++++++= T Consensus 327 ~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 327 VLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred ccccCCCccEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 455556666778888885 668888888886543 477999999999999999999999999999999 No 20 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=2.4e-59 Score=341.94 Aligned_cols=403 Identities=13% Similarity=0.122 Sum_probs=256.0 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 3 IDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAE 82 (458) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e 82 (458) |+|++|+++.. .+.++.+.|..++ .+.+.+.+.. ++++++..++++.+.+++++..+ T Consensus 1 M~l~eL~~~r~--~~~~~~~~l~~~~---~e~~~l~~ee------------------~~~~~~l~~ei~~l~~~i~~~e~ 57 (435) T protein:vir:80 1 MNVNELRRERA--AVNQRVQALAQIE---VGGTALSVEQ------------------QAEFDQLSSKFNELTAQIERAEA 57 (435) T ss_pred CCHHHHHHHHH--HHHHHHHHHHHHH---hccCCCCHHH------------------HHHHHHHHHHHHHHHHHHHHHHH Confidence 77888888775 3445555443221 1111121111 22222222223333222222111 Q ss_pred HHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHH Q lcl|NC_010583. 83 LFAQTVEKQQETIVGLQ--------DEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHG 154 (458) Q Consensus 83 ~~~~~~~~~~~~~~~~~--------~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~ 154 (458) ..+. ..+......... ..........+.+........+....... ..+.++....++ ... T Consensus 58 ~e~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~------~~~ 125 (435) T protein:vir:80 58 AERM-AAAAAVPVDPNPAAVTASAAAPVYAQPKAPEVKGAKMARMVRALAAARG-----DAQLASKLAIER------GFG 125 (435) T ss_pred HHHH-HHhhcccccchhhhhccccccccccccchhhhhHHHHHHHHHHHHhccc-----hhHHHHHHHHhh------hhh Confidence 0000 000000000000 00000000000000000000000000000 000000000100 001 Q ss_pred HHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhh-cceeeeccCceEEEEecCCCcccccccccccccccccc Q lcl|NC_010583. 155 KAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGAL-FDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGD 233 (458) Q Consensus 155 ~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~ 233 (458) .... .....++...||++||+++.++|++.+++.++|+++ ++++|++++..++|+.++++.+.|++|+. .++ T Consensus 126 ~~~~-~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~~~~p~~~~~~~a~~v~E~~------~~~ 198 (435) T protein:vir:80 126 EEVA-MSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADT------DIP 198 (435) T ss_pred hhhh-hhhcccCCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCceEEEEEeCCcceeeeccCc------ccc Confidence 1111 112345566789999999999999999999999998 78999999999999999999999998875 445 Q ss_pred cccccceeeeeehhheeeeehhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhccCCC-Cccccccccccccccce Q lcl|NC_010583. 234 EVKGQLTEISFKTYKLAAKSFITDETEEDAI--FSLLPLLRKRLIEAHAVSIEEAFMSGNGT-GQPKGLLKLAADDGAKV 310 (458) Q Consensus 234 ~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~la~~~~~~~d~~~l~G~g~-~~p~Gi~~~~~~~~~~~ 310 (458) +++++|++|++.+++++++++||+++|+|+. +++++||.++|++++++++|.+||+|+|+ ++|.||++.+....... T Consensus 199 ~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~ 278 (435) T protein:vir:80 199 TTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVIT 278 (435) T ss_pred ccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceee Confidence 6789999999999999999999999999984 47999999999999999999999999997 57999998765433322 Q ss_pred eeccccchhhHHHHHHHHHHHhhhhh--hhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccce Q lcl|NC_010583. 311 VTEAKADGSVLVTAKTISKLRRKLGR--HGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVV 388 (458) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~ 388 (458) .. ..........++.+++..+.. .++.++.|+||+.++..|++++|++|+|+|+.. ..++|+|+||+ T Consensus 279 ~~---~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~--------~~~~l~G~pv~ 347 (435) T protein:vir:80 279 AS---DGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPEL--------ANGMLKGYPVG 347 (435) T ss_pred cc---cccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCceeccCC--------CCCeEeeeeeE Confidence 21 112222223345555555443 355678999999999999999999999998421 23589999999 Q ss_pred eccccccc---ccCCceEEEEEeceEEEEecceeEEeeccc-------------ccCCceEEEEEEeeccEEecccceEE Q lcl|NC_010583. 389 VSEYFPAK---AASAEFAVIVYKDNFVMPRQRAVTVERERQ-------------AGKQRDAYYVTQRVNLQRYFENGVVS 452 (458) Q Consensus 389 ~~~~~~~~---~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~-------------~~~~~~~~~~~~r~d~~~~~~~afv~ 452 (458) +++++|.. ..+...+++++++.|+++++.++++..+++ |.+|++.||++.|+|+++++|+||++ T Consensus 348 ~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~ 427 (435) T protein:vir:80 348 KTTQVPINLGEAGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAV 427 (435) T ss_pred EeccccccccCCCCcceEEEEEcccEEEEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEE Confidence 99999963 223446778899999999999999987654 56899999999999999999999999 Q ss_pred EEeecC Q lcl|NC_010583. 453 GAYAAA 458 (458) Q Consensus 453 l~~aaa 458 (458) ++-.+= T Consensus 428 l~~~~~ 433 (435) T protein:vir:80 428 LSGVAW 433 (435) T ss_pred EeccCC Confidence 887655 No 21 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=1.3e-58 Score=338.02 Aligned_cols=388 Identities=15% Similarity=0.101 Sum_probs=257.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELF 84 (458) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~ 84 (458) |..+.+++. ++.++++.+.++ .+++.+...++...+.. ..++.++..++...+++.+...+.+ T Consensus 1 m~~~~k~l~--el~~~~~~~~~~------~~~~~e~~~~~~~~~~~----~~~e~~~~~~~~~~~~~~~~~~~~~----- 63 (395) T protein:vir:43 1 MSDFEKQIG--ELNASLKQVGDQ------IKSQAEQVNTQIANFGE----MNKETRAKVDELLTAQGELQARLSA----- 63 (395) T ss_pred ChhHHHHHH--HHHHHHHHHHHH------HHHHHHHHHHHHHHHhh----hhHHHHHHHHHHHHHHHHHHHHHHH----- Confidence 333332222 333333222111 11111111111111111 1112222222222222222111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcc Q lcl|NC_010583. 85 AQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGS 164 (458) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~ 164 (458) ........ +..... ...................+..+++.+........ + .. T Consensus 64 -------------~~~~~~~~----~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~--~~ 115 (395) T protein:vir:43 64 -------------AEQAMLAN----EKRDGG----EEAPKTAGQMVAESLKEQGVTSSLRGSHRVSMPRS-----A--IT 115 (395) T ss_pred -------------HHHHHHhh----hccccc----cchhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhh-----h--hc Confidence 00000000 000000 00000111111222334445555555443332221 1 23 Q ss_pred cccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCC-Ccccccccccccccccccccccccceeee Q lcl|NC_010583. 165 SSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEA-GRATWVDASKFGTDETVGDEVKGQLTEIS 243 (458) Q Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~e~~~~~e~~~~~~~~~~f~~v~ 243 (458) ++...+|.++|++++++|++.+++.++|+++++++|++++...+|+..+. +.++|++|++ .+++++++|++|+ T Consensus 116 ~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~------~~~~~~~~~~~i~ 189 (395) T protein:vir:43 116 SIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYVRETGFVNNAAPVSEGT------QKPYSDLTFELEN 189 (395) T ss_pred ccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEEEEEecCCCceeeecCCc------cccccccceeEEE Confidence 34455677889999999999999999999999999999999999998764 6889998875 4456789999999 Q ss_pred eehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcc-ccccccccccccceeeccccchhhHH Q lcl|NC_010583. 244 FKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQP-KGLLKLAADDGAKVVTEAKADGSVLV 322 (458) Q Consensus 244 ~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p-~Gi~~~~~~~~~~~~~~~~~~~~~~~ 322 (458) +++++++++++||+++|+|++ ++++||.+.|++++++++|.+||+|+|+++| .||++......... ........ T Consensus 190 ~~~~k~~~~~~is~ell~d~~-~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~----~~~~~~~~ 264 (395) T protein:vir:43 190 APVRTIAHLFKASRQILDDAS-ALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPS----GVVVTAEQ 264 (395) T ss_pred EeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccc----ccccccch Confidence 999999999999999999986 7999999999999999999999999998765 89998765433222 12223345 Q ss_pred HHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCce Q lcl|NC_010583. 323 TAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEF 402 (458) Q Consensus 323 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 402 (458) .+.++.+++..+.+.+..+++|+|||.++..|.+++|++|+|+++. +..+.+++|+|+||++++++|.+ . T Consensus 265 ~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~-----~~~~~~~~l~G~pVv~~~~~~~~-----~ 334 (395) T protein:vir:43 265 RIDRIRLAILQAQLAEFPASGIVLNPIDWALIELNKDAENRYIIGS-----PQNGTTPTLWRLPVVETQAITQD-----E 334 (395) T ss_pred hHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhccCCceeccc-----cccCCCceecceeeEEcCCCCCC-----c Confidence 6778888899999999999999999999999999999999999853 23345678999999999999853 3 Q ss_pred EEEEEece-EEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 403 AVIVYKDN-FVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 403 ~~~~~~~~-~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +++++++. |.++++.++++..++ +|.+|++.||++.|+|+++++|+||+++++++| T Consensus 335 ~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 335 FLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred EEEEeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 56777764 778888888887654 477999999999999999999999999999999 No 22 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=8.1e-59 Score=339.07 Aligned_cols=378 Identities=16% Similarity=0.137 Sum_probs=255.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 14 LGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSK------AVGEDRKRLEEALDLVKNLDEKSKKSAELFAQT 87 (458) Q Consensus 14 ~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~------~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~ 87 (458) |.+|.++++ +.+.+...+++.+.++ ...+.++.+++...+++.+.+++++..+. T Consensus 1 m~~~~~~l~----------------~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~---- 60 (390) T protein:vir:97 1 MTDITAKLE----------------ATLANVTDSLKAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQR---- 60 (390) T ss_pred ChHHHHHHH----------------HHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Confidence 444443332 1112222222222211 12233333444334443333322221100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhccccc Q lcl|NC_010583. 88 VEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSV 167 (458) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~ 167 (458) ...... ...... ..................+......+............... .++++ T Consensus 61 ~~~~~~--------------~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 118 (390) T protein:vir:97 61 VAELEG--------------NGAGGD-------VQHVSVGDMFVASEQFQASTGRWNDRSARATMNIKAALNTA-STDAA 118 (390) T ss_pred HHHHHh--------------cccccc-------cccccchhhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhh-hcccc Confidence 000000 000000 00000000111111122222222222222221222222222 34456 Q ss_pred ccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCC-Ccccccccccccccccccccccccceeeeeeh Q lcl|NC_010583. 168 SMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEA-GRATWVDASKFGTDETVGDEVKGQLTEISFKT 246 (458) Q Consensus 168 ~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~ 246 (458) ..+|.++|+++++.|++.++..++|+++++++|++++.+++|+..+. +.+.|++|++ .+++++++|++|++.+ T Consensus 119 ~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~------~~~~~~~~~~~i~~~~ 192 (390) T protein:vir:97 119 GSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGA------LKPESSLKFAKKTDTT 192 (390) T ss_pred cccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEEEecCCcceeeecCCc------cccccccceeEEEEee Confidence 66788899999999999999999999999999999999999999874 6789998875 4456789999999999 Q ss_pred hheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCc-cccccccccccccceeeccccchhhHHHHH Q lcl|NC_010583. 247 YKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQ-PKGLLKLAADDGAKVVTEAKADGSVLVTAK 325 (458) Q Consensus 247 ~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (458) ++++++++||+++++|++ +++++|.++|++++++++|.+||+|+|+++ |.||++.+....... .......+. T Consensus 193 ~k~~~~~~is~ell~ds~-~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~------~~~~~~~~d 265 (390) T protein:vir:97 193 HVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPT------TIAGATRVD 265 (390) T ss_pred eeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeeccccccccc------cccccchHH Confidence 999999999999999985 799999999999999999999999999865 999998764433221 122334456 Q ss_pred HHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEE Q lcl|NC_010583. 326 TISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVI 405 (458) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~ 405 (458) ++.+++..+.+.+..++.|+|||.++..|++++|++|+|+++... .+.+++|+|+||++++.+|. +.+++ T Consensus 266 ~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~-----~~~~~~l~G~pV~~~~~~~~-----~~~~~ 335 (390) T protein:vir:97 266 QLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNAR-----GTLTPTLWGLPVVATQAMAP-----GEFLV 335 (390) T ss_pred HHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcc-----CCCCceecceeeEEcCCCCC-----CcEEE Confidence 788888899999999999999999999999999999999986532 23456899999999999985 24677 Q ss_pred EEec-eEEEEecceeEEeec---ccccCCceEEEEEEeeccEEecccceEEEEee Q lcl|NC_010583. 406 VYKD-NFVMPRQRAVTVERE---RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYA 456 (458) Q Consensus 406 ~~~~-~~~i~~~~~~~i~~~---~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~a 456 (458) ++++ .|.++++.++++..+ .+|.+|++.||++.|+|+++++|+|||++++| T Consensus 336 gd~~~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 336 GAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred EeccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 8887 588899999988753 47899999999999999999999999999999 No 23 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=1.4e-58 Score=337.83 Aligned_cols=401 Identities=11% Similarity=0.037 Sum_probs=254.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELF 84 (458) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~ 84 (458) |+.+++. .+..++++++..+..++++..+.+ +..+++++...+++.+..++.+..+.. T Consensus 1 mk~~~el-------------------~~~l~el~~~~~~~~~e~~~~l~~---~~~~~~~~~~~e~~~l~~~i~~~~~~~ 58 (415) T protein:vir:79 1 MKTKEEL-------------------QSEISDIKRQIDLKVKYATRALNN---DELEKAEKLEQEITDLRSQIQEKQEEL 58 (415) T ss_pred CchHHHH-------------------HHHHHHHHHHHHHHHHHHHHHhch---HHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222221 111112222222222222221111 111122222222222222221111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcc Q lcl|NC_010583. 85 AQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGS 164 (458) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~ 164 (458) .+.......... .........+........................++.+|....+.+ ....... T Consensus 59 ~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~ 123 (415) T protein:vir:79 59 DKLKEKDGTSEN----NQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETR-----------NDIQGGS 123 (415) T ss_pred HHHHHHHhhhhh----cccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhh-----------hhhhhcc Confidence 000000000000 0000000000000000000000000001111112223333222211 1111123 Q ss_pred cccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEE--ecCCCcccccccccccccccccccccccceee Q lcl|NC_010583. 165 SSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLV--EPEAGRATWVDASKFGTDETVGDEVKGQLTEI 242 (458) Q Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~--~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v 242 (458) .++++||++||+++.+.|++.+++.++|+++++++|++++..++|+ ..+...++|++|++..++ .+.++|++| T Consensus 124 ~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~-----~~~~~~~~v 198 (415) T protein:vir:79 124 LKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE-----LAVKPFFQL 198 (415) T ss_pred ccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCc-----ccccceeeE Confidence 4556789999999999999999999999999999999877666554 556677899988865443 346899999 Q ss_pred eeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHH Q lcl|NC_010583. 243 SFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLV 322 (458) Q Consensus 243 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 322 (458) ++.+++++++++||+++++||.++|++||.++|++++++++|.+|++|+|++.|.++......... ........ T Consensus 199 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~------~~~~~~~~ 272 (415) T protein:vir:79 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGK------KLEVKKAK 272 (415) T ss_pred EeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccc------cccccccc Confidence 999999999999999999999999999999999999999999999999999887665544322221 11222334 Q ss_pred HHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCce Q lcl|NC_010583. 323 TAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEF 402 (458) Q Consensus 323 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 402 (458) .++++.+++..+...+..++.|+||+.++..|++++|++|+|++++.. ..+.+++|+|+||++++++|.+.++... T Consensus 273 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~----~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:79 273 SLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV----KEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred chhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCc----CCCCCceecceeeEEecccccCCCCccE Confidence 577888889899888899999999999999999999999999986543 3355679999999999999987777777 Q ss_pred EEEEEece-EEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 403 AVIVYKDN-FVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 403 ~~~~~~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +++++|+. |.++++.++++..++| ..+.+.+|++.|+|+++++|+||++++++++ T Consensus 349 ~~~Gd~~~~~~~~~~~~~~v~~~~~-~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:79 349 LIIGNLKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEEehhccEEEEeecceEEEEecc-ccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 88888875 7789999999988765 4677889999999999999999999999999 No 24 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=1.4e-58 Score=337.83 Aligned_cols=401 Identities=11% Similarity=0.037 Sum_probs=254.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELF 84 (458) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~ 84 (458) |+.+++. .+..++++++..+..++++..+.+ +..+++++...+++.+..++.+..+.. T Consensus 1 mk~~~el-------------------~~~l~el~~~~~~~~~e~~~~l~~---~~~~~~~~~~~e~~~l~~~i~~~~~~~ 58 (415) T protein:vir:98 1 MKTKEEL-------------------QSEISDIKRQIDLKVKYATRALNN---DELEKAEKLEQEITDLRSQIQEKQEEL 58 (415) T ss_pred CchHHHH-------------------HHHHHHHHHHHHHHHHHHHHHhch---HHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222221 111112222222222222221111 111122222222222222221111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcc Q lcl|NC_010583. 85 AQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGS 164 (458) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~ 164 (458) .+.......... .........+........................++.+|....+.+ ....... T Consensus 59 ~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~ 123 (415) T protein:vir:98 59 DKLKEKDGTSEN----NQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETR-----------NDIQGGS 123 (415) T ss_pred HHHHHHHhhhhh----cccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhh-----------hhhhhcc Confidence 000000000000 0000000000000000000000000001111112223333222211 1111123 Q ss_pred cccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEE--ecCCCcccccccccccccccccccccccceee Q lcl|NC_010583. 165 SSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLV--EPEAGRATWVDASKFGTDETVGDEVKGQLTEI 242 (458) Q Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~--~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v 242 (458) .++++||++||+++.+.|++.+++.++|+++++++|++++..++|+ ..+...++|++|++..++ .+.++|++| T Consensus 124 ~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~-----~~~~~~~~v 198 (415) T protein:vir:98 124 LKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE-----LAVKPFFQL 198 (415) T ss_pred ccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCc-----ccccceeeE Confidence 4556789999999999999999999999999999999877666554 556677899988865443 346899999 Q ss_pred eeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHH Q lcl|NC_010583. 243 SFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLV 322 (458) Q Consensus 243 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 322 (458) ++.+++++++++||+++++||.++|++||.++|++++++++|.+|++|+|++.|.++......... ........ T Consensus 199 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~------~~~~~~~~ 272 (415) T protein:vir:98 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGK------KLEVKKAK 272 (415) T ss_pred EeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccc------cccccccc Confidence 999999999999999999999999999999999999999999999999999887665544322221 11222334 Q ss_pred HHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCce Q lcl|NC_010583. 323 TAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEF 402 (458) Q Consensus 323 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 402 (458) .++++.+++..+...+..++.|+||+.++..|++++|++|+|++++.. ..+.+++|+|+||++++++|.+.++... T Consensus 273 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~----~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:98 273 SLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV----KEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred chhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCc----CCCCCceecceeeEEecccccCCCCccE Confidence 577888889899888899999999999999999999999999986543 3355679999999999999987777777 Q ss_pred EEEEEece-EEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 403 AVIVYKDN-FVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 403 ~~~~~~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +++++|+. |.++++.++++..++| ..+.+.+|++.|+|+++++|+||++++++++ T Consensus 349 ~~~Gd~~~~~~~~~~~~~~v~~~~~-~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:98 349 LIIGNLKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEEehhccEEEEeecceEEEEecc-ccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 88888875 7789999999988765 4677889999999999999999999999999 No 25 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=1.4e-58 Score=337.83 Aligned_cols=401 Identities=11% Similarity=0.037 Sum_probs=254.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELF 84 (458) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~ 84 (458) |+.+++. .+..++++++..+..++++..+.+ +..+++++...+++.+..++.+..+.. T Consensus 1 mk~~~el-------------------~~~l~el~~~~~~~~~e~~~~l~~---~~~~~~~~~~~e~~~l~~~i~~~~~~~ 58 (415) T protein:vir:81 1 MKTKEEL-------------------QSEISDIKRQIDLKVKYATRALNN---DELEKAEKLEQEITDLRSQIQEKQEEL 58 (415) T ss_pred CchHHHH-------------------HHHHHHHHHHHHHHHHHHHHHhch---HHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222221 111112222222222222221111 111122222222222222221111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcc Q lcl|NC_010583. 85 AQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGS 164 (458) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~ 164 (458) .+.......... .........+........................++.+|....+.+ ....... T Consensus 59 ~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~ 123 (415) T protein:vir:81 59 DKLKEKDGTSEN----NQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETR-----------NDIQGGS 123 (415) T ss_pred HHHHHHHhhhhh----cccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhh-----------hhhhhcc Confidence 000000000000 0000000000000000000000000001111112223333222211 1111123 Q ss_pred cccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEE--ecCCCcccccccccccccccccccccccceee Q lcl|NC_010583. 165 SSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLV--EPEAGRATWVDASKFGTDETVGDEVKGQLTEI 242 (458) Q Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~--~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v 242 (458) .++++||++||+++.+.|++.+++.++|+++++++|++++..++|+ ..+...++|++|++..++ .+.++|++| T Consensus 124 ~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~-----~~~~~~~~v 198 (415) T protein:vir:81 124 LKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE-----LAVKPFFQL 198 (415) T ss_pred ccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCc-----ccccceeeE Confidence 4556789999999999999999999999999999999877666554 556677899988865443 346899999 Q ss_pred eeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHH Q lcl|NC_010583. 243 SFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLV 322 (458) Q Consensus 243 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 322 (458) ++.+++++++++||+++++||.++|++||.++|++++++++|.+|++|+|++.|.++......... ........ T Consensus 199 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~------~~~~~~~~ 272 (415) T protein:vir:81 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGK------KLEVKKAK 272 (415) T ss_pred EeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccc------cccccccc Confidence 999999999999999999999999999999999999999999999999999887665544322221 11222334 Q ss_pred HHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCce Q lcl|NC_010583. 323 TAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEF 402 (458) Q Consensus 323 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 402 (458) .++++.+++..+...+..++.|+||+.++..|++++|++|+|++++.. ..+.+++|+|+||++++++|.+.++... T Consensus 273 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~----~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:81 273 SLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV----KEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred chhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCc----CCCCCceecceeeEEecccccCCCCccE Confidence 577888889899888899999999999999999999999999986543 3355679999999999999987777777 Q ss_pred EEEEEece-EEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 403 AVIVYKDN-FVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 403 ~~~~~~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +++++|+. |.++++.++++..++| ..+.+.+|++.|+|+++++|+||++++++++ T Consensus 349 ~~~Gd~~~~~~~~~~~~~~v~~~~~-~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:81 349 LIIGNLKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEEehhccEEEEeecceEEEEecc-ccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 88888875 7789999999988765 4677889999999999999999999999999 No 26 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=2.5e-58 Score=336.39 Aligned_cols=401 Identities=11% Similarity=0.038 Sum_probs=255.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELF 84 (458) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~ 84 (458) |+.+++. .+..+++.++.....++++..+. ++..++++...++++.+..++.+..+.. T Consensus 1 mk~~~el-------------------~~~l~el~~~~~~~~~~~~~~~~---~~~~e~~~~~~~ei~~l~~~i~~~~~~~ 58 (415) T protein:vir:94 1 MKTKEEL-------------------QSEISDIKRQIDLKVKYATRALN---NDELEKAEKLEQEITDLRSQIQEKQEEL 58 (415) T ss_pred CChHHHH-------------------HHHHHHHHHHHHHHHHHHHHHhc---hhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222221 11111222222222222222111 1112222222222332222222111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcc Q lcl|NC_010583. 85 AQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGS 164 (458) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~ 164 (458) .+..+........ .................. ...............++.+|.+.++.+. ...... T Consensus 59 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~e~~~~~~~~~~~~-----------~~~~~~ 123 (415) T protein:vir:94 59 DKLKEKDGTSENN-QQSVEVNEASTYRNQANI---NDLGISIQNTKVTSQEVRDFTEYLETRN-----------DIQGGS 123 (415) T ss_pred HHHHHHHHhhhhc-cccccccchhhHHHHHHH---HHHHhhhhhhhhhHHHHHHHHHHhhhhh-----------hhhhhc Confidence 0000000000000 000000000000000000 0000000111111122233333322211 111223 Q ss_pred cccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEE--ecCCCcccccccccccccccccccccccceee Q lcl|NC_010583. 165 SSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLV--EPEAGRATWVDASKFGTDETVGDEVKGQLTEI 242 (458) Q Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~--~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v 242 (458) .+++.|+++||+++.+.|++.+++.++|+++++++|++++..++|+ ..+.+.+.|++|++..++ .+.++|++| T Consensus 124 ~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~-----~~~~~~~~i 198 (415) T protein:vir:94 124 LKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE-----LAVKPFFQL 198 (415) T ss_pred cccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceeccccccccc-----cccccceee Confidence 4556789999999999999999999999999999999877666554 456678899988865443 346899999 Q ss_pred eeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHH Q lcl|NC_010583. 243 SFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLV 322 (458) Q Consensus 243 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 322 (458) ++.+++++++++||+++++|+.++|++||.++|++++++++|.+|++|+|++.|.++.......... ....... T Consensus 199 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~------~~~~~~~ 272 (415) T protein:vir:94 199 AYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK------LEVKKAK 272 (415) T ss_pred EeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccc------ccccccc Confidence 9999999999999999999999999999999999999999999999999998876665543322211 1122234 Q ss_pred HHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCce Q lcl|NC_010583. 323 TAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEF 402 (458) Q Consensus 323 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 402 (458) .++++.++++.+...++.++.|+||+.++..|++++|++|+|++.+.. ..+.+++|+|+||++++++|.+..+... T Consensus 273 ~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~----~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:94 273 SLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDV----KEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred chHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCc----CCCCCceecceeeEEecccccCCCCccE Confidence 567788888888888889999999999999999999999999986533 3456689999999999999987777777 Q ss_pred EEEEEece-EEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 403 AVIVYKDN-FVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 403 ~~~~~~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +++++++. |+++++.++++.++++ ..+++.+|++.|+|+++++|+||++++++++ T Consensus 349 i~~gd~~~~~~~~~~~~~~v~~~~~-~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 404 (415) T protein:vir:94 349 LIIGNLKDAIVLFDRSQYQASWTDY-MHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEEehhccEEEEeecceEEEEecc-ccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 88888875 7889999999988765 5778899999999999999999999999998 No 27 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=2.7e-58 Score=336.21 Aligned_cols=378 Identities=16% Similarity=0.137 Sum_probs=250.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 14 LGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSK------AVGEDRKRLEEALDLVKNLDEKSKKSAELFAQT 87 (458) Q Consensus 14 ~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~------~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~ 87 (458) |.++.++++.. +.+...+++.+.++ .++|.++.+++..++++.+..++.+..+ . T Consensus 1 m~e~~~~l~~~----------------~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~~~----~ 60 (390) T protein:vir:10 1 MTDITSKLEAT----------------LANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQ----R 60 (390) T ss_pred ChHHHHHHHHH----------------HHHHHHHHHHHHHHHHhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHH----H Confidence 55554444311 11111112221111 2233334444444444443332222111 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhccccc Q lcl|NC_010583. 88 VEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSV 167 (458) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~ 167 (458) .+.+ ..... ... . ......+.........++......+..................++. T Consensus 61 ~~~~-------~~~~~-------~~~----~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 119 (390) T protein:vir:10 61 VAEL-------EGNGA-------GGD----V---QHVSVGDLFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAG 119 (390) T ss_pred HHHH-------Hhhcc-------ccc----c---cccchhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhccccc Confidence 0000 00000 000 0 0000000000111111222222222111111111122223333333 Q ss_pred ccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCC-Ccccccccccccccccccccccccceeeeeeh Q lcl|NC_010583. 168 SMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEA-GRATWVDASKFGTDETVGDEVKGQLTEISFKT 246 (458) Q Consensus 168 ~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~ 246 (458) . +|.++|+++.+.||+.+++.++|+++++++|++++.+++|+.++. +.+.|++|++ .+++++++|++|++.+ T Consensus 120 ~-~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~------~~~~~~~~~~~i~~~~ 192 (390) T protein:vir:10 120 S-AGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGA------LKPESSLKFAKKTDTT 192 (390) T ss_pred c-cccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCCcceeeecCCc------cccccccceeEEEEee Confidence 3 445566677889999999999999999999999999999998875 6789998875 4456789999999999 Q ss_pred hheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCc-cccccccccccccceeeccccchhhHHHHH Q lcl|NC_010583. 247 YKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQ-PKGLLKLAADDGAKVVTEAKADGSVLVTAK 325 (458) Q Consensus 247 ~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (458) ++++++++||+++|+|++ ++.+||.++|++++++++|.+||+|+|+++ |.||++.+.....+.. ......+. T Consensus 193 ~k~~~~~~is~ell~d~~-~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~------~~~~~~~~ 265 (390) T protein:vir:10 193 HVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTT------IAGATRVD 265 (390) T ss_pred EEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCCcccccccccccccccccc------ccccchHH Confidence 999999999999999986 899999999999999999999999999864 9999987654332211 11223456 Q ss_pred HHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEE Q lcl|NC_010583. 326 TISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVI 405 (458) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~ 405 (458) .+.+++..+.+.++.++.|+|||.++..|.+++|++|+|+|+... .+.+++|+|+||++++.+|.+ .+++ T Consensus 266 ~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~-----~~~~~~l~G~pv~~~~~~p~~-----~~~~ 335 (390) T protein:vir:10 266 QLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNAR-----GTLTPTLWGLPVVATQAMAPG-----EFLV 335 (390) T ss_pred HHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCc-----CcCCceecceeeEEcCCCCCC-----cEEE Confidence 678888889999999999999999999999999999999996543 233568999999999999852 4567 Q ss_pred EEec-eEEEEecceeEEeec---ccccCCceEEEEEEeeccEEecccceEEEEee Q lcl|NC_010583. 406 VYKD-NFVMPRQRAVTVERE---RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYA 456 (458) Q Consensus 406 ~~~~-~~~i~~~~~~~i~~~---~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~a 456 (458) ++++ .|.++++.++++..+ .+|.+|++.||++.|+|+++++|+||+++++| T Consensus 336 gdf~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 336 GAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EeccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 8887 577889998888653 46889999999999999999999999999999 No 28 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=2.3e-58 Score=336.55 Aligned_cols=384 Identities=17% Similarity=0.129 Sum_probs=253.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 14 LGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQE 93 (458) Q Consensus 14 ~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~ 93 (458) |.++.++++.. +.++.++..+..+.... ..+..++.++.+++..++++.+.+++.+..+. ....+ T Consensus 1 m~~l~~~l~~~---------~~~~~~~~~~~~e~~~~-~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~----~~~~~- 65 (390) T protein:vir:81 1 MTDITSKLEAT---------LANVTDSLRAFGERAVR-DGELNASARSKVDELFATVGNLSAEVQAARQR----VAELE- 65 (390) T ss_pred ChHHHHHHHHH---------HHHHHHHHHHHHHHHHh-hcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHH- Confidence 44444443311 11111111111111000 00122333444444444444443333221110 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccc Q lcl|NC_010583. 94 TIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEA 173 (458) Q Consensus 94 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ 173 (458) ..........+.. .+..........+...................... ..++++.+|.+ T Consensus 66 -----------------~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~g~~ 124 (390) T protein:vir:81 66 -----------------GNGAGGDVQHVSV---GDMFVASEQFQASAGRWNDRSARATMNIKAALNTA-STDAAGSAGAL 124 (390) T ss_pred -----------------hcccccccccccc---hhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhh-ccccccCCcce Confidence 0000000000000 00000011111121211111111111111111122 23345566777 Q ss_pred cchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCC-Ccccccccccccccccccccccccceeeeeehhheeee Q lcl|NC_010583. 174 YETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEA-GRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAK 252 (458) Q Consensus 174 ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~-~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~ 252 (458) +|+++...|++.+++.++|+++++++|++++.+++|+..+. +.+.|++|++ ..++++++|+++++.+++++++ T Consensus 125 ~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~------~~~~~~~~~~~i~~~~~k~~~~ 198 (390) T protein:vir:81 125 TTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGA------LKPESSLKFAKKTDTTHVIAHT 198 (390) T ss_pred echhhhHHHHHHHhhhhhhhhhcceeeccCCceEEEEEecCCcceeeecCCc------ccccccceeeEEEEeeeEEEEe Confidence 78888999999999999999999999999999999998875 5788998875 4456789999999999999999 Q ss_pred ehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCc-cccccccccccccceeeccccchhhHHHHHHHHHHH Q lcl|NC_010583. 253 SFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQ-PKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLR 331 (458) Q Consensus 253 ~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (458) ++||+++|+|++ ++++||.++|++++++++|.+||+|+|+++ |.||++.+.....+. .......+.++.+++ T Consensus 199 ~~is~ell~d~~-~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~------~~~~~~~~~~~~~~~ 271 (390) T protein:vir:81 199 MKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPT------TIAGATRVDQLRLAM 271 (390) T ss_pred ehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeeccccccccc------ccccchhHHHHHHHH Confidence 999999999985 799999999999999999999999999875 999998765433222 122334456788888 Q ss_pred hhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEece- Q lcl|NC_010583. 332 RKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDN- 410 (458) Q Consensus 332 ~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~- 410 (458) ..+.+.+..++.|+|||.++..|++++|++|+|+|+... .+.+++|+|+||++++.+|.+ .+++++++. T Consensus 272 ~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~-----~~~~~~l~G~pv~~~~~~p~~-----~~~~gd~~~~ 341 (390) T protein:vir:81 272 LQASLAEYNPSGIVINPIDWAAIELAKDANNQYLIGNAR-----GTLTPTLWGLPVVATQAMAPG-----EFLVGAFDLA 341 (390) T ss_pred HhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcc-----cccCceecceeeEEcCCCCCC-----cEEEEehhce Confidence 899999999999999999999999999999999986432 334568999999999999853 467788874 Q ss_pred EEEEecceeEEeec---ccccCCceEEEEEEeeccEEecccceEEEEee Q lcl|NC_010583. 411 FVMPRQRAVTVERE---RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYA 456 (458) Q Consensus 411 ~~i~~~~~~~i~~~---~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~a 456 (458) |.++++.++++..+ .+|.+|++.||++.|+|+++++|+|||++++| T Consensus 342 ~~~~~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 342 AQIFDQWDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EEEEEecceEEEEecccchhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 78899999998764 46889999999999999999999999999999 No 29 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=1.1e-56 Score=327.37 Aligned_cols=412 Identities=12% Similarity=0.070 Sum_probs=245.0 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 3 IDINKLKEELGLGDLAKSLEGLTAA-QKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSA 81 (458) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~ 81 (458) |.|++|++++.- +.++++..... ++.........+..+....+++.+.++ ..+.++++++.....+...++.+... T Consensus 1 Mki~elk~el~~--~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~e-i~el~~~l~~~~~~~~~~~e~~~~~~ 77 (437) T protein:vir:10 1 MKIEKLKKDLAT--KTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDE-IKEIRSNIEVLEQASALKVEEKRDDS 77 (437) T ss_pred CCHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 778889887763 22333221110 000000000000001111111111111 11111111111111111111111100 Q ss_pred HHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhc-------chhhhhhHHHHHHHHHhhhccchhHHH Q lcl|NC_010583. 82 ELFAQ--TVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYG-------TQDAFEDEVEKLVLLSYMMEKDVFETE 152 (458) Q Consensus 82 e~~~~--~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~-------~~~~~~~~~~~~a~~~~~~~~~~~~~~ 152 (458) +.... .......+...............+..........+.... .........+...|..++.. T Consensus 78 ~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------- 150 (437) T protein:vir:10 78 DLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKT------- 150 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHh------- Confidence 00000 000000000000000000000000000000000000000 00000000011111111111 Q ss_pred HHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecC-CCcccccccccccccccc Q lcl|NC_010583. 153 HGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPE-AGRATWVDASKFGTDETV 231 (458) Q Consensus 153 ~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~e~~~~~e~~~ 231 (458) ...+ .....+.+.||++||+++...|. .++..+.|+.++++++++++...+|+... .+.++|++|++..++ T Consensus 151 ---~e~~-~~~~~~~~~~g~lvp~~~~~~i~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e--- 222 (437) T protein:vir:10 151 ---GEVR-DVTGIALKDGKVIIPETILTPEK-EVHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTK--- 222 (437) T ss_pred ---hhhh-hhhhcccccccccchHHHHHHHH-HhhhhhhhhhcceeEeeccCceeeEEeeccccccccccccccccc--- Confidence 1111 12344566788999999987654 56788899999999999999999999864 477899988765443 Q ss_pred cccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccee Q lcl|NC_010583. 232 GDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVV 311 (458) Q Consensus 232 ~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~ 311 (458) .++++|++|++.+++++++++||+++|+|+.++|.+||.++|+++++.+++.+|++|+|++.|.+..+. T Consensus 223 --~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~--------- 291 (437) T protein:vir:10 223 --NATPVITPILWDLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKKTTSTY--------- 291 (437) T ss_pred --cccccceeeeeehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc--------- Confidence 356899999999999999999999999999999999999999999999999999999998766432110 Q ss_pred eccccchhhHHHHHHHHHHH-hhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceec Q lcl|NC_010583. 312 TEAKADGSVLVTAKTISKLR-RKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVS 390 (458) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~ 390 (458) .+.++.+++ ..+.+.|..++.|+||+.++..|++++|++|+|+|++... .+.+++|+|+||+++ T Consensus 292 -----------~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~----~~~~~~l~G~pv~~~ 356 (437) T protein:vir:10 292 -----------LLGDLKKVLNVTLKPQDSAAASIVMSQSAYNLFDMATDAMGRPLLQPNVT----AATGYTLLGKTVVIV 356 (437) T ss_pred -----------chhhHHHHHHhhhhhhhhcCCEEEEcHHHHHHHHHhhccCCCeeeccCcc----CCCCcccccceeEEe Confidence 112233333 3677888899999999999999999999999999866433 355679999999998 Q ss_pred ccc--cccccCCceEEEEEec-eEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 391 EYF--PAKAASAEFAVIVYKD-NFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 391 ~~~--~~~~~~~~~~~~~~~~-~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +++ |.++++...+++|+|+ .|.++++.++++..++++..+.+.+++..|+|+++++|+|||+++.... T Consensus 357 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~ 427 (437) T protein:vir:10 357 DDKLFPSASAGDVNIVVAPLKKAVINFKLTEITGQFQDTYDIWYKQLGIFLRQNVVQASKDLIVNLTGKLK 427 (437) T ss_pred cccccCCcCCCceEEEEeeccccEEEEeeeceEEEEecccccccceeeEEEEEccEEecccceEEEEeecc Confidence 765 6677777778888887 4789999999998888888888999999999999999999999885433 No 30 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=1.5e-57 Score=332.13 Aligned_cols=418 Identities=13% Similarity=0.129 Sum_probs=244.1 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 3 IDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAE 82 (458) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e 82 (458) |.++.++-...++.+..++..|+.+.+.-+. ...+.+.+ +.+ ...++..+..++.++.++....++.+... T Consensus 1 ~~~~~~~l~~~~~~~~~~l~el~e~~~~l~k--~~~el~~~-l~e------a~~~ee~~~~ee~i~~l~~~~~el~e~~~ 71 (466) T protein:vir:80 1 MALRQLMLAKKIEQRKAALAELLEQEKALQK--RSEELEAA-IDE------ANTDEEIAVVEDEINKLEGEKTELEEKKS 71 (466) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHH-HHh------hhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6666666444445555555544332111110 00010000 000 01111112222222222222222222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhcc----chhHHH---HHH Q lcl|NC_010583. 83 LFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEK----DVFETE---HGK 155 (458) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~----~~~~~~---~~~ 155 (458) ...++++.+++++..+........... ............ ........+.+.+... ...... ... T Consensus 72 ~l~~ei~~le~el~e~~~~~~~~~~~~---~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 142 (466) T protein:vir:80 72 KLEGEIKELENELEQLNNKEPKNNSEP---AQVSGARTQQFV------GGETRMKGFFRNMPYEQRAALIARSEVKEFLA 142 (466) T ss_pred HHHHHHHHHHHHHHHHHHhhhccCchh---HHHHhhhhhHHh------hHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHH Confidence 122222222222222111110000000 000000000000 0000000110000000 000000 000 Q ss_pred HHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccc Q lcl|NC_010583. 156 AHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEV 235 (458) Q Consensus 156 ~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~ 235 (458) ..........+.++|+++||+++.+.|++.++.+++|++++++.|+++ ..++|+....+.+.|++|++. ++++ T Consensus 143 ~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g-~~~~~~~~~~~~a~wv~E~~~------~~~~ 215 (466) T protein:vir:80 143 QVRTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLKG-TARQNIAGAIPEGVWTEAVAN------LNEL 215 (466) T ss_pred HHHHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhhhheeeeecCc-eeEeeeecCCcceeecccccc------cccc Confidence 111112223345667789999999999999999999999999999864 568899888888999988764 4456 Q ss_pred cccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccc Q lcl|NC_010583. 236 KGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAK 315 (458) Q Consensus 236 ~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 315 (458) +|+|++|++.+|+++++++||++||+||.+++++||+.+|+++++.++|.+||+|+|+++|.||++.............. T Consensus 216 ~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~ 295 (466) T protein:vir:80 216 SLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTK 295 (466) T ss_pred cccccceeecceeeeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccc Confidence 89999999999999999999999999999999999999999999999999999999999999999865332211111111 Q ss_pred cchhhHHH-----------------HHHHHHHHhhhhhhh-cccceeEechhHHHHHHhhh---cccccccccccccccc Q lcl|NC_010583. 316 ADGSVLVT-----------------AKTISKLRRKLGRHG-LKLSKLVLIVSMDAYYDLLE---DEEWQDVAQVGNDAVK 374 (458) Q Consensus 316 ~~~~~~~~-----------------~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~~---d~~~~~~~~~~~~~~~ 374 (458) ......+. +.++..........+ .+...|+||+.++..|..++ +.+|.+.+... T Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~----- 370 (466) T protein:vir:80 296 APAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLN----- 370 (466) T ss_pred cccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccccccCC----- Confidence 11111111 111222222222333 34446999999999998887 44444443221 Q ss_pred ccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeecc--cccCCceEEEEEEeeccEEecccceEE Q lcl|NC_010583. 375 LQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERER--QAGKQRDAYYVTQRVNLQRYFENGVVS 452 (458) Q Consensus 375 ~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~--~~~~~~~~~~~~~r~d~~~~~~~afv~ 452 (458) ....|+|+||+++++||.+ .+++++++.|.++++.++++.+++ +|.+|++.||++.|+||++++|+||++ T Consensus 371 ---~~~~i~G~pvv~s~~~~~~-----~~~~g~~~~y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~ 442 (466) T protein:vir:80 371 ---NTMPIVGGDIVILDFIPDN-----DIIGGYGSLYLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVA 442 (466) T ss_pred ---CcccccccceeecCccCcc-----ceeeeccccEEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEE Confidence 1124999999999999863 378889999999999999998765 477999999999999999999999999 Q ss_pred EEeecC Q lcl|NC_010583. 453 GAYAAA 458 (458) Q Consensus 453 l~~aaa 458 (458) ++++.. T Consensus 443 ~~~~~~ 448 (466) T protein:vir:80 443 VNIANA 448 (466) T ss_pred EEecCC Confidence 998887 No 31 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=1.5e-58 Score=337.52 Aligned_cols=363 Identities=17% Similarity=0.095 Sum_probs=253.8 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) |+|.++++.+..+. .++ +.++.+... ..++.. +.+++... .+.++..+. T Consensus 1 M~i~~k~~~~~~~~---~~~---l~~~~~~~~----~~ee~~------------------~~~~~~~~---~~~~~~~~~ 49 (377) T protein:vir:98 1 MAINLKELPKYREA---VAE---LSAKISAGA----TSEEQE------------------KLFEAAFT---TMGDEILAK 49 (377) T ss_pred CCCcHHHHHHHHHH---HHH---HHHHHHhhh----hhHHHH------------------HHHHHHHH---hHHHHHHHH Confidence 99988876544331 111 111100000 000000 00000000 011100000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhh Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKA 160 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a 160 (458) .. .+ . +.. ... .+ ..... ..+++..+. . T Consensus 50 ~~------~e----~---~~~---~~~--~~--------------~~~~l-t~ee~~~~~-------------------~ 77 (377) T protein:vir:98 50 NE------EE----M---ERM---FDL--RD--------------KNREL-TAEEIKFFN-------------------D 77 (377) T ss_pred HH------HH----H---HHH---HHh--cc--------------CCccc-CHHHHHHHH-------------------H Confidence 00 00 0 000 000 00 00000 111121111 1 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccce Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLT 240 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~ 240 (458) ....++.++||++||+++.+.|++.+...++|+++|+++++++ ..++|+..+.+.+.|++|++.. .++++|+|+ T Consensus 78 ~~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~~~~~~~~~~a~w~~e~~~~-----~~~~~~~f~ 151 (377) T protein:vir:98 78 IDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIFGEI-----KGQLKQAFK 151 (377) T ss_pred HHhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCc-ceEEEEecCCcceeEeeccccc-----CcccCccce Confidence 1234566788999999999999999999999999999999864 5799999999999999987533 345789999 Q ss_pred eeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhh Q lcl|NC_010583. 241 EISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSV 320 (458) Q Consensus 241 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~ 320 (458) +|++.+|+++++++||++||+||.+++++||+++|+++++++++.+|++|+|+++|.||++.............. .... T Consensus 152 ~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~-~~~~ 230 (377) T protein:vir:98 152 EQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRD-ITTY 230 (377) T ss_pred eEeecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccccccc-cccc Confidence 999999999999999999999999999999999999999999999999999999999999865432221111111 1122 Q ss_pred HHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccccccccc----------ccccccccccCCeeecccc--e Q lcl|NC_010583. 321 LVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQV----------GNDAVKLQGQVGRIYGLPV--V 388 (458) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~----------~~~~~~~~~~~~~l~G~pv--~ 388 (458) ......+.++.+.++..|+.+++|+||..++..+++++|.+|+++|.. ........|.+.+++|+|+ + T Consensus 231 ~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv 310 (377) T protein:vir:98 231 KTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITIL 310 (377) T ss_pred cchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCccccccCCCceEE Confidence 223356778888999999999999999999999999999999999942 2222334577788999995 5 Q ss_pred ecccccccccCCceEEEEEeceEEEEecceeEEeec--ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 389 VSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERE--RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 389 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~--~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .++++|. ..+++++|+.|.|+++.++++.++ .+|.+|++.|++..|+|+++++|+||++++++.- T Consensus 311 ~s~~~p~-----~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 311 ESLAVET-----GKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ecCCCCc-----ccEEEEEecceeEEeecceEEEeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 6777774 346789999999999999999874 4688999999999999999999999999999999 No 32 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=2e-57 Score=331.44 Aligned_cols=372 Identities=11% Similarity=0.005 Sum_probs=247.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 14 LGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAV-GE--DRKRLEEALDLVKNLDEKSKKSAELFAQTVEK 90 (458) Q Consensus 14 ~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~-~e--~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~ 90 (458) |+.+.+ | .+++.++.++. +.+.+..+... ++ ..+++++..++++.+.++ .+...+.... T Consensus 1 Mk~~~e----l------~~~~~~~~~~~----~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~----~e~~~~~~~~ 62 (397) T protein:vir:49 1 MKTSNE----L------HDLWVAQGDKV----ENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMK----RDMFKEQYTE 62 (397) T ss_pred CchHHH----H------HHHHHHHHHHH----HHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHH----HHHHHHHHHH Confidence 221111 1 01111111111 11111111000 00 001111222222221111 1111111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccC Q lcl|NC_010583. 91 QQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMS 170 (458) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g 170 (458) . ..... . ......+..............+.+|.++++++..... .....++.+.| T Consensus 63 ~-------~~~~~---~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~--------~~~~~~t~~~g 117 (397) T protein:vir:49 63 A-------RANEV---A-------NMSEEEKKPLTKSEEEVKAGFVKDFKNLVRGRYQNLL--------DSKTDASGSDA 117 (397) T ss_pred H-------HHHhh---h-------ccccccccccccchhHHHHHHHHHHHHHHhcchhHHH--------HHhhccccccC Confidence 0 00000 0 0000000011111112223445667666665432211 11234556778 Q ss_pred ccccchhHHHHHHHHHHhccchhhhcceeeeccCc--eEEEEecC-CCcccccccccccccccccccccccceeeeeehh Q lcl|NC_010583. 171 SEAYETIFSTRIIRDLQKELVVGALFDELPMSSKI--LTMLVEPE-AGRATWVDASKFGTDETVGDEVKGQLTEISFKTY 247 (458) Q Consensus 171 ~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~~p~~~~-~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~ 247 (458) |++||+++.+.|++.+++.++|+++|+++|+++.. ..+|.... .+.+.|++|++..++ .+.++|++|+++++ T Consensus 118 g~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-----~~~~~~~~i~~~~~ 192 (397) T protein:vir:49 118 GLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIAD-----VDDPKLSLIKYTIK 192 (397) T ss_pred cccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeeecCcccccc-----ccccceeeEEeeee Confidence 99999999999999999999999999999887544 45555544 467899998864443 35789999999999 Q ss_pred heeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHHHHHHH Q lcl|NC_010583. 248 KLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTI 327 (458) Q Consensus 248 k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (458) +++++++||+++++||.+++++||.++|++++++++|.+|++|+|++.+.+.. ..++++ T Consensus 193 k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~~~~~---------------------~~~d~i 251 (397) T protein:vir:49 193 RYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAALPTKPTL---------------------TKWDDI 251 (397) T ss_pred eEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc---------------------ccHHHH Confidence 99999999999999999999999999999999999999999999987654321 135578 Q ss_pred HHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccc--ccccccCCceEEE Q lcl|NC_010583. 328 SKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEY--FPAKAASAEFAVI 405 (458) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~--~~~~~~~~~~~~~ 405 (458) .++.+.+.+.+..++.|+||+.++..|++++|++|+|++++.. ..+.+++|+|+||+++++ +|+...+...+++ T Consensus 252 ~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~----~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~ 327 (397) T protein:vir:49 252 IDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDV----KSPTGYSIDGFAVKEVADRWLANGTGGAMPLYF 327 (397) T ss_pred HHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeeccCc----CCCCCceecceeeEEecccccccccCCceeEEE Confidence 8889999999999999999999999999999999999986643 335567999999998654 6666677777888 Q ss_pred EEec-eEEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 406 VYKD-NFVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 406 ~~~~-~~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ++++ +|.++++.++++..++ +|.+|++.||++.|+|+++++|+||++++++++ T Consensus 328 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:49 328 GDLKQAVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAI 385 (397) T ss_pred eeccceEEEEeecceEEEEeccccchhhcCceeEEEEeeeCcEEecccceEEEEeecc Confidence 8887 5789999999988754 578999999999999999999999999999998 No 33 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=1.6e-57 Score=331.93 Aligned_cols=356 Identities=14% Similarity=0.080 Sum_probs=249.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 30 AAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAR 109 (458) Q Consensus 30 ~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 109 (458) +..+.+++.+++....++++.+..+. ..+++++..++++.+.+.+.+..+..++. . T Consensus 1 M~k~l~~l~e~~~~~~~e~~~~~~~~---~~e~~~~~~~ei~~l~~~i~~~~~~~~~~------------------~--- 56 (371) T protein:vir:81 1 MPKELRELLEQINNKKEEARKLLAEN---KIEEAKKLKEEIVALQEKFDVAKELYEEQ------------------K--- 56 (371) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------H--- Confidence 12223444444444444444443221 11222333333333322222111100000 0 Q ss_pred HHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhc Q lcl|NC_010583. 110 EGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKE 189 (458) Q Consensus 110 e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~ 189 (458) +..... ............++.+|.++++.+. .++ ...++.+.||++||+++.+.|++.+++. T Consensus 57 --~~~~~~-----~~~~~~~~~~~~~~~~~~~~l~~~~----------~~a-~~~~t~~~gg~~vP~~~~~~ii~~~~~~ 118 (371) T protein:vir:81 57 --QTIEDK-----EPLKPTVQVKENEVEAFVNHIRTRF----------RNA-MSEGSNQDGGYTVPQDIQTRINELRESK 118 (371) T ss_pred --Hhhccc-----cccccchhhHHHHHHHHHHHHHHHH----------HHh-hccCCCccCceeecHhHHHHHHHHHHhh Confidence 000000 0000011112234556666554321 122 2344566789999999999999999999 Q ss_pred cchhhhcceeeeccCceE--EEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHH Q lcl|NC_010583. 190 LVVGALFDELPMSSKILT--MLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSL 267 (458) Q Consensus 190 ~~l~~~~~~~~~~~~~~~--~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~ 267 (458) ++|+++++++|++++... +++..+.+.++|++|++..++ .+.++|++|++.++|++++++||+|+++|+.++| T Consensus 119 s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-----~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l 193 (371) T protein:vir:81 119 DALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGE-----KATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAI 193 (371) T ss_pred hhhhhhceeeeccCCceeEEEEeecCCcceeeecccccccc-----ccccceeeEEeeeeEEEEeehhhHHHHhhhhHHH Confidence 999999999998876655 455556678899998764432 3578999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHHHHHHHHH-HHhhhhhhhcccceeEe Q lcl|NC_010583. 268 LPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISK-LRRKLGRHGLKLSKLVL 346 (458) Q Consensus 268 ~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 346 (458) ++||.+.|++++++++|.+|++|+|++.|.|+.+. .++.. +...+.+.+..++.|+| T Consensus 194 ~~~i~~~l~~a~~~~~~~~i~~g~g~~~~~~~~~~----------------------~~i~~~~~~~l~~~~~~~a~~vm 251 (371) T protein:vir:81 194 VNTLVRWIGDESRVTRNGLIINVLNTKAKTAIADL----------------------DGLKQIINVQLDPVFRSTSSVIV 251 (371) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccccccccH----------------------HHHHHHHHhhcchhhhcCCEEEE Confidence 99999999999999999999999999988776431 12222 23466778888999999 Q ss_pred chhHHHHHHhhhccccccccccccccccccccCCeeecccceeccccccc-------ccCCceEEEEEec-eEEEEecce Q lcl|NC_010583. 347 IVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAK-------AASAEFAVIVYKD-NFVMPRQRA 418 (458) Q Consensus 347 ~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~-------~~~~~~~~~~~~~-~~~i~~~~~ 418 (458) |+.++..|.+++|++|+|+|++.. ..+.+++|+|+||++++++|.+ .++...+++|+++ +|.++++.+ T Consensus 252 n~~~~~~L~~lkd~~g~~l~~~~~----~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~ 327 (371) T protein:vir:81 252 NQDAFNWLDTLKDQNGQYLLQPSI----SSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQR 327 (371) T ss_pred cHHHHHHHHHhhccCCCeeeeccc----CCCCCceecceeEEEecccccCccccccccCCcceEEEEehhceEEEEeecc Confidence 999999999999999999986643 3356689999999999999843 2344556777776 478889999 Q ss_pred eEEeeccc----ccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 419 VTVERERQ----AGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 419 ~~i~~~~~----~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +++..+++ |.+|++.||++.|+|+++++|+||+++++++| T Consensus 328 ~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 328 TEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred eEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 99887654 67899999999999999999999999999999 No 34 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=2.1e-57 Score=331.33 Aligned_cols=377 Identities=18% Similarity=0.153 Sum_probs=246.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 14 LGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQE 93 (458) Q Consensus 14 ~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~ 93 (458) |.+ + .+++++.+...++++.+.++... +.++..++.+.+.+...+ ..+..++..+ T Consensus 1 M~~----l-------------~el~~~~~~~~~e~~~l~~~~~~----e~~~~~~~~~~l~~~~~~----~~~~~~~~~~ 55 (385) T protein:vir:18 1 MSE----L-------------ALIQKAIEESQQKMTQLFDAQKA----EIESTGQVSKQLQSDLMK----VQEELTKSGT 55 (385) T ss_pred ChH----H-------------HHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHH----HHHHHHHHHH Confidence 221 1 12222223333333333222111 111111111111111111 1111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccc Q lcl|NC_010583. 94 TIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEA 173 (458) Q Consensus 94 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ 173 (458) .......... . ................ ...+..+.... ..... . ..+.....++ +.+|.+ T Consensus 56 ~~~~~~~~~~-------~----~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~-~-----~~~~~~~~~~-~~~g~~ 115 (385) T protein:vir:18 56 RLFDLEQKLA-------S----GAENPGEKKSFSERAA-EELIKSWDGKQ-GTFGA-K-----TFNKSLGSDA-DSAGSL 115 (385) T ss_pred HHHHHHHHhh-------c----cccccchhhhhHHHHH-HHHHHHHHHhh-ccchh-h-----HHHhhhcccc-ccCCce Confidence 1110000000 0 0000000000000000 11111111111 11011 1 1112222333 344556 Q ss_pred cchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecC-CCcccccccccccccccccccccccceeeeeehhheeee Q lcl|NC_010583. 174 YETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPE-AGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAK 252 (458) Q Consensus 174 ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~ 252 (458) +|+++...|++.++..++|+++++++|++++.+.+|+..+ .+.+.|++|++ .+++++++|+++++.+++++++ T Consensus 116 i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~------~~~~~~~~~~~~~~~~~k~~~~ 189 (385) T protein:vir:18 116 IQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKA------LKPESDITFSKQTANVKTIAHW 189 (385) T ss_pred ecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCc------cccccccceeEEEEeeeeEEEe Confidence 7778899999999999999999999999999999999876 56788888765 4556789999999999999999 Q ss_pred ehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCc-cccccccccccccceeeccccchhhHHHHHHHHHHH Q lcl|NC_010583. 253 SFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQ-PKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLR 331 (458) Q Consensus 253 ~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (458) ++||+++|+|++ ++++||..+|+++++.++|.+||+|+|+++ |.||++.+...... ........++++.+++ T Consensus 190 ~~is~ell~d~~-~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~------~~~~~~~~~d~i~~~~ 262 (385) T protein:vir:18 190 VQASRQVMDDAP-MLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTS------LNATGDTRADIIAHAI 262 (385) T ss_pred ehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccc------ccccccchHHHHHHHH Confidence 999999999885 699999999999999999999999999876 58998866443221 1222334677888899 Q ss_pred hhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEec-e Q lcl|NC_010583. 332 RKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKD-N 410 (458) Q Consensus 332 ~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~-~ 410 (458) ..+...+..++.|+||+.++..|++++|++|+|+++.. ..+.+++|+|+||++++++|. +.+++++++ . T Consensus 263 ~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~-----~~~~~~~l~G~pV~~~~~~p~-----~~~~~gd~~~~ 332 (385) T protein:vir:18 263 YQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGP-----QAFTSNIMWGLPVVPTKAQAA-----GTFTVGGFDMA 332 (385) T ss_pred HhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCc-----ccCCCceecceeeEEcCcCCC-----CcEEEeecccE Confidence 99999999999999999999999999999999998542 345568999999999999985 346677776 5 Q ss_pred EEEEecceeEEeec----ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 411 FVMPRQRAVTVERE----RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 411 ~~i~~~~~~~i~~~----~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |.++++.++++..+ .+|.+|++.||++.|+|+++++|+||+++++++| T Consensus 333 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:18 333 SQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred EEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 88999999888653 4578999999999999999999999999999999 No 35 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=2.1e-57 Score=331.33 Aligned_cols=377 Identities=18% Similarity=0.153 Sum_probs=246.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 14 LGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQE 93 (458) Q Consensus 14 ~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~ 93 (458) |.+ + .+++++.+...++++.+.++... +.++..++.+.+.+...+ ..+..++..+ T Consensus 1 M~~----l-------------~el~~~~~~~~~e~~~l~~~~~~----e~~~~~~~~~~l~~~~~~----~~~~~~~~~~ 55 (385) T protein:vir:19 1 MSE----L-------------ALIQKAIEESQQKMTQLFDAQKA----EIESTGQVSKQLQSDLMK----VQEELTKSGT 55 (385) T ss_pred ChH----H-------------HHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHH----HHHHHHHHHH Confidence 221 1 12222223333333333222111 111111111111111111 1111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccc Q lcl|NC_010583. 94 TIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEA 173 (458) Q Consensus 94 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ 173 (458) .......... . ................ ...+..+.... ..... . ..+.....++ +.+|.+ T Consensus 56 ~~~~~~~~~~-------~----~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~-~-----~~~~~~~~~~-~~~g~~ 115 (385) T protein:vir:19 56 RLFDLEQKLA-------S----GAENPGEKKSFSERAA-EELIKSWDGKQ-GTFGA-K-----TFNKSLGSDA-DSAGSL 115 (385) T ss_pred HHHHHHHHhh-------c----cccccchhhhhHHHHH-HHHHHHHHHhh-ccchh-h-----HHHhhhcccc-ccCCce Confidence 1110000000 0 0000000000000000 11111111111 11011 1 1112222333 344556 Q ss_pred cchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecC-CCcccccccccccccccccccccccceeeeeehhheeee Q lcl|NC_010583. 174 YETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPE-AGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAK 252 (458) Q Consensus 174 ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~ 252 (458) +|+++...|++.++..++|+++++++|++++.+.+|+..+ .+.+.|++|++ .+++++++|+++++.+++++++ T Consensus 116 i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~------~~~~~~~~~~~~~~~~~k~~~~ 189 (385) T protein:vir:19 116 IQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKA------LKPESDITFSKQTANVKTIAHW 189 (385) T ss_pred ecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCc------cccccccceeEEEEeeeeEEEe Confidence 7778899999999999999999999999999999999876 56788888765 4556789999999999999999 Q ss_pred ehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCc-cccccccccccccceeeccccchhhHHHHHHHHHHH Q lcl|NC_010583. 253 SFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQ-PKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLR 331 (458) Q Consensus 253 ~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (458) ++||+++|+|++ ++++||..+|+++++.++|.+||+|+|+++ |.||++.+...... ........++++.+++ T Consensus 190 ~~is~ell~d~~-~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~------~~~~~~~~~d~i~~~~ 262 (385) T protein:vir:19 190 VQASRQVMDDAP-MLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTS------LNATGDTRADIIAHAI 262 (385) T ss_pred ehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccc------ccccccchHHHHHHHH Confidence 999999999885 699999999999999999999999999876 58998866443221 1222334677888899 Q ss_pred hhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEec-e Q lcl|NC_010583. 332 RKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKD-N 410 (458) Q Consensus 332 ~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~-~ 410 (458) ..+...+..++.|+||+.++..|++++|++|+|+++.. ..+.+++|+|+||++++++|. +.+++++++ . T Consensus 263 ~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~-----~~~~~~~l~G~pV~~~~~~p~-----~~~~~gd~~~~ 332 (385) T protein:vir:19 263 YQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGP-----QAFTSNIMWGLPVVPTKAQAA-----GTFTVGGFDMA 332 (385) T ss_pred HhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCc-----ccCCCceecceeeEEcCcCCC-----CcEEEeecccE Confidence 99999999999999999999999999999999998542 345568999999999999985 346677776 5 Q ss_pred EEEEecceeEEeec----ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 411 FVMPRQRAVTVERE----RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 411 ~~i~~~~~~~i~~~----~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |.++++.++++..+ .+|.+|++.||++.|+|+++++|+||+++++++| T Consensus 333 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:19 333 SQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred EEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 88999999888653 4578999999999999999999999999999999 No 36 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=1.3e-56 Score=326.99 Aligned_cols=395 Identities=12% Similarity=0.050 Sum_probs=243.4 Q ss_pred chHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 3 IDINKLKEELGL--GDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 3 ~~~~~~~~~~~~--~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) |.|++..+++.. .++.++.+. ..++.+++.++...+. . .....+.++++++...+++++.+..... T Consensus 1 ~~l~e~i~e~~~~l~el~~~~~~------~~~e~r~~~e~~~~~~--~----~~~~~e~~~~~~~l~~ei~~l~e~~~~~ 68 (400) T protein:vir:38 1 MTLDEKLAAVKKQLDEKRSALPA------MKTELRSLLEGEDSEE--N----LKKAEGVRAKYDKAGKEIKDLEEKRDLY 68 (400) T ss_pred CChHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHhhccch--H----HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 555554443331 122121111 1111111111111000 0 0011122333333333343333332221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhh Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKA 160 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a 160 (458) ................ . ........... ...+.......... ............... ....... T Consensus 69 ~~~~~~~~~~~~~~~~----~----~~~~~~~~~~~-~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~----~~~~~~~ 132 (400) T protein:vir:38 69 EAALKGNEQSSGKKPD----H----PEEHSYRDALN-AYLHTRGRNTDGVN---FEKTDVGTFAVLRAV----PTDASDA 132 (400) T ss_pred HHHHHHHhhccccccc----c----hhhhhHHHHHH-HHHhhHHHHHHHHH---HHHHHHHHHhhhhhh----hHHHHHH Confidence 1111100000000000 0 00000000000 00000000000000 000000000000011 1111222 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecC-CCcccccccccccccccccccccccc Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPE-AGRATWVDASKFGTDETVGDEVKGQL 239 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~e~~~~~e~~~~~~~~~~f 239 (458) .....+.+.||++||+++.+.|++.+++.++|+++++++|++++..++|+... .+.+.|++|++..+ +.++++| T Consensus 133 ~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~-----~~~~~~f 207 (400) T protein:vir:38 133 VNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNP-----AMAKPEF 207 (400) T ss_pred HhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEEecCCCcccccccccccc-----ccccccc Confidence 33344566789999999999999999999999999999999999999999874 46688988876443 3467999 Q ss_pred eeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchh Q lcl|NC_010583. 240 TEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGS 319 (458) Q Consensus 240 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 319 (458) ++|++.+++++++++||+|||+||.+++++||.+.|+++++.+++.+|++|+|++.+.|+.+ T Consensus 208 ~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~------------------ 269 (400) T protein:vir:38 208 KPVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFTAKTISS------------------ 269 (400) T ss_pred eeeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc------------------ Confidence 99999999999999999999999999999999999999999999999999999876655432 Q ss_pred hHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccC Q lcl|NC_010583. 320 VLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAAS 399 (458) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~ 399 (458) +.++.++..... ....+++|+|||.++..|.+++|++|+|+|++.. ..+.+++|+|+||++++++|...++ T Consensus 270 ----~~~~~~~~~~~~-~~~~~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~----~~~~~~~l~G~pv~~~~~~~~~~~g 340 (400) T protein:vir:38 270 ----VDDLKHINNVDL-DPAYSRVIIASQSFYNFLDTVKDGNGRYLLQDSI----LTPSGKSVLGMPIAVVSDDTLGAAG 340 (400) T ss_pred ----HHHHHHHHHhhh-hhhhCcEEEEcHHHHHHHHHhhccCCCeeeecCc----CCCCccccccceeEEecccccCCCC Confidence 122333333222 2234689999999999999999999999986533 3355679999999999999988777 Q ss_pred CceEEEEEece-EEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 400 AEFAVIVYKDN-FVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 400 ~~~~~~~~~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ...+++++++. |.++++.++++..+++. .+...||++.|+|+++++|+||+.++++++ T Consensus 341 ~~~~~~gd~s~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~ 399 (400) T protein:vir:38 341 EAHAFLGDIKRAILFANRADFMVRWVDDQ-IYGQFLQAGMRFGVSVADEKAGYFLTYTPK 399 (400) T ss_pred ceEEEEEeccccEEEEeecceEEEEeccc-ccceeEEEEEEeccEEecccceEEEEeecC Confidence 77888888885 78888999988776553 445689999999999999999999999988 No 37 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=2.8e-57 Score=330.62 Aligned_cols=390 Identities=15% Similarity=0.093 Sum_probs=249.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 17 LAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIV 96 (458) Q Consensus 17 ~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~ 96 (458) |.+++ +++.++++...++++.+.++... .-+++++..++++.+.+++.+..+.... ..... T Consensus 1 M~k~l-------------~el~~~~~~~~~e~~~~~~~~~~-~~ee~~~~~~e~~~l~~~i~~~~~~~~~-----~~~~~ 61 (404) T protein:vir:10 1 MSKEL-------------RELLNQLDSKNKELNSLLNKDGV-TAEELNKTSNEIDILQAKIEAQKRKENI-----ENNFN 61 (404) T ss_pred CcHHH-------------HHHHHHHHHHHHHHHHHHhhcCC-CHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHh Confidence 22222 23333333334444444332111 0011222233333333222211111100 00000 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccch Q lcl|NC_010583. 97 GLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYET 176 (458) Q Consensus 97 ~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~ 176 (458) . ..........+.. .. .......+......+...............++ ...++.+.||++||+ T Consensus 62 ~--~~~~~~~~~~~~~---------~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a-~~~~~~~~gg~~vP~ 124 (404) T protein:vir:10 62 E--DNVKSLNTGKEEN---------VI-----YNGALFVRAIADNLLKQKNQRGLNLSEKEINA-ISENIDEDGGYAVPE 124 (404) T ss_pred h--hhccccccccchh---------hH-----HHHHHHHHHHHHHHHHHHHhhhhcchhhHHhh-hccccCCCCceeech Confidence 0 0000000000000 00 00000000000001111000000011111222 234456778999999 Q ss_pred hHHHHHHHHHHhccchhhhcceeeec--cCceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeeh Q lcl|NC_010583. 177 IFSTRIIRDLQKELVVGALFDELPMS--SKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSF 254 (458) Q Consensus 177 ~~~~~ii~~~~~~~~l~~~~~~~~~~--~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~ 254 (458) ++.+.|++.++..++|+++++++|++ ++.+.||+..+.+.++|++|++..++. ..+++|++|++++++++++++ T Consensus 125 ~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~----~~~~~f~~i~~~~~k~~~~~~ 200 (404) T protein:vir:10 125 DIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLSENQQIPTN----GDNGKLERFNFKLKDLADFMS 200 (404) T ss_pred hHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCcceeecccccccccc----ccccceeeeEeeheeeEeeeh Confidence 99999999999999999999998876 456778888888999999998755432 246899999999999999999 Q ss_pred hhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCc-cccccccccccccceeeccccchhhHHHHHHHHHHH-h Q lcl|NC_010583. 255 ITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQ-PKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLR-R 332 (458) Q Consensus 255 is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~-p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 332 (458) ||+++|+|+.++|.+||.+.|++++++++|.+||+|+|+++ |.||++....... +..+. ..+.++...+ . T Consensus 201 iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~---~~~~~-----~~~~~~~~~~~~ 272 (404) T protein:vir:10 201 IPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKI---TLPKS-----PALKDFKKCKNV 272 (404) T ss_pred hhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeecccccee---ecccc-----ccHHHHHHHHHh Confidence 99999999999999999999999999999999999999865 6888865543221 11111 1223344333 3 Q ss_pred hhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceec-ccccccccCCceEEEEEec-e Q lcl|NC_010583. 333 KLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVS-EYFPAKAASAEFAVIVYKD-N 410 (458) Q Consensus 333 ~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~-~~~~~~~~~~~~~~~~~~~-~ 410 (458) .+.+.+..++.|+||+.++..|++++|++|+|++++.. ..+.+++|+|+||+++ +.+|..+.+...+++++++ . T Consensus 273 ~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~----~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~ 348 (404) T protein:vir:10 273 ELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQPDP----KDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEA 348 (404) T ss_pred hhhccccCCCEEEEcHHHHHHHHHhhccCCceeeccCc----CCCCCccccceeeEEecccccCCCCCccEEEEEecccc Confidence 67788888899999999999999999999999986543 3455679999999854 5566666667777888887 5 Q ss_pred EEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 411 FVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 411 ~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |.++++.++++..++ .|.+|++.||++.|+|+++.+|+||+++++++| T Consensus 349 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~a 400 (404) T protein:vir:10 349 YKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPVE 400 (404) T ss_pred EEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecc Confidence 789999998887643 367999999999999999999999999999999 No 38 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=5.8e-57 Score=328.90 Aligned_cols=372 Identities=12% Similarity=0.013 Sum_probs=247.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 14 LGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDR---KRLEEALDLVKNLDEKSKKSAELFAQTVEK 90 (458) Q Consensus 14 ~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~---~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~ 90 (458) |+.+.+-. ++..++. +....+++..++...+.. +++++..++++.+.+. .+...+ T Consensus 1 Mk~~~eL~----------~~~~~~~----~~~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~----~~~~~~---- 58 (397) T protein:vir:49 1 MKTSNELH----------DLWIAQG----DKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMK----RDLFKE---- 58 (397) T ss_pred CchHHHHH----------HHHHHHH----HHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHH----HHHHHH---- Confidence 22211111 1111111 111111111111110000 1111111111111111 111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccC Q lcl|NC_010583. 91 QQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMS 170 (458) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g 170 (458) .....+...... .....+.............++.+|.++++.+.... ......++.+.| T Consensus 59 ---~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~--------~~~~~~~t~~~g 117 (397) T protein:vir:49 59 ---QYTEARANEVAN----------MSEEEKKPLTKNEEEVKANFVKDFKNLVRGRYQNL--------LDSKTDGSGSDA 117 (397) T ss_pred ---HHHHHHHhhhhc----------ccccccccccchhhHHHHHHHHHHHHHhhcchhhH--------HHhhhccCCccC Confidence 111000000000 00000111111122233345666777766543221 112234556678 Q ss_pred ccccchhHHHHHHHHHHhccchhhhcceeeeccCceE--EEEecC-CCcccccccccccccccccccccccceeeeeehh Q lcl|NC_010583. 171 SEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILT--MLVEPE-AGRATWVDASKFGTDETVGDEVKGQLTEISFKTY 247 (458) Q Consensus 171 ~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~--~p~~~~-~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~ 247 (458) |++||+++.+.|++.+++.++|+++++++|++++..+ +|+..+ .+.+.|++|++..++ .+.++|+.|++.++ T Consensus 118 g~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-----~~~~~~~~v~~~~~ 192 (397) T protein:vir:49 118 GLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGGQIGQ-----NDDPKLSLIRYAIK 192 (397) T ss_pred cceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeecccccccc-----ccccceeeeEeeee Confidence 9999999999999999999999999999998876555 444433 467889998864443 23589999999999 Q ss_pred heeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHHHHHHH Q lcl|NC_010583. 248 KLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTI 327 (458) Q Consensus 248 k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (458) +++++++||+++|+|+.++|++||.++|++++++++|.+||+|+|++.|.+.. .+++++ T Consensus 193 k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~~~~~~---------------------~~~d~i 251 (397) T protein:vir:49 193 RYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGTLPNKPTL---------------------AKWDDI 251 (397) T ss_pred eeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc---------------------cCHHHH Confidence 99999999999999999999999999999999999999999999998664211 234578 Q ss_pred HHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecc--cccccccCCceEEE Q lcl|NC_010583. 328 SKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSE--YFPAKAASAEFAVI 405 (458) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~--~~~~~~~~~~~~~~ 405 (458) .++...+.+.+..++.|+||+.++..|++++|++|+|++++.. ..+.+++|+|+||++++ .+|+..++...+++ T Consensus 252 ~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~----~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~ 327 (397) T protein:vir:49 252 IDLQAKVDPAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDV----KSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYF 327 (397) T ss_pred HHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeecccc----cCCCCceecceeeEEecccccccccCCceeEEE Confidence 8888899999999999999999999999999999999986543 33556799999999855 46666777777888 Q ss_pred EEec-eEEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 406 VYKD-NFVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 406 ~~~~-~~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ++++ .|.++++.++++..++ +|.+|++.||++.|+|+++++|+||+++++++. T Consensus 328 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:49 328 GDLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAI 385 (397) T ss_pred eeccceEEEEeecccEEEEeccccchhhcCeeeEEEEEeeccEEecccceEEEEeccc Confidence 8887 5889999999998765 478999999999999999999999999998887 No 39 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=9.9e-57 Score=327.61 Aligned_cols=382 Identities=12% Similarity=0.048 Sum_probs=244.8 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) |+|-|+- +++. +++.+..+.+.. ..++..++.+......++ ..+.++++++..++++.+.+. T Consensus 1 m~~~m~i--~el~-~~~~~~~~~~~~---~~~e~~~~~~~~~~~~e~--------i~e~~~~~~~~~~~~~~~~~~---- 62 (408) T protein:vir:74 1 MGVKLTV--NQLN-EAWIASGDKVTD---FNDQINMALNDDNFSAEA--------MSELKNKRDNEKVRRDALREQ---- 62 (408) T ss_pred CChhhhH--HHHH-HHHHHHHHHHHH---HHHHHHHHHhhhcccHHH--------HHHHHHHHHHHHHHHHHHHHH---- Confidence 6555521 2221 122221111110 000000000000000000 011111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhh Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKA 160 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a 160 (458) .... +....... ....+................+|.++++...... .....++ T Consensus 63 -------~~~~-------~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~a 115 (408) T protein:vir:74 63 -------LVEA-------QAEQVVNM----------REEEKGPLNKSENELKDKFVKDFVNMVRNPMAFL---NTVSSKT 115 (408) T ss_pred -------HHHH-------HHHHHhhc----------cccccccccchhhhhHHHHHHHHHHHHhcchhhh---hhhhhhh Confidence 1100 00000000 0000000001111111223444555554432211 1112222 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEE--ecC-CCcccccccccccccccccccccc Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLV--EPE-AGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~--~~~-~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) . ..++.+.||++||+++.+.|++.+++.++|+++++++|++++...+++ ..+ ++.+.|++|++..++ .+++ T Consensus 116 ~-~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~-----~~~~ 189 (408) T protein:vir:74 116 E-TSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPD-----LDNP 189 (408) T ss_pred h-cccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCccccccccccccccc-----cccc Confidence 2 345567789999999999999999999999999999998876655544 433 355678888754432 3579 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) +|+.|++++++++++++||+|+++|+.++|++||.++|++++++++|.+||+|+|++.|.+... T Consensus 190 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~~~~~~---------------- 253 (408) T protein:vir:74 190 RLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKKPTIA---------------- 253 (408) T ss_pred ceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc---------------- Confidence 9999999999999999999999999999999999999999999999999999999987654221 Q ss_pred hhhHHHHHHHHHHH-hhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccc--cc Q lcl|NC_010583. 318 GSVLVTAKTISKLR-RKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEY--FP 394 (458) Q Consensus 318 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~--~~ 394 (458) ++.++.+++ ..+.+.|+.++.|+||+.++..|.+++|++|+|+++... ..+.+++|+|+||+++++ +| T Consensus 254 -----~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~----~~~~~~~l~G~pV~~~~~~~~~ 324 (408) T protein:vir:74 254 -----NFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDP----TKPNSYLIKGKQVIVVADRWLP 324 (408) T ss_pred -----cHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceEeccCc----CCCCCceecceeeEEecCcccc Confidence 123444443 577888999999999999999999999999999987543 335567999999998764 77 Q ss_pred ccccCCceEEEEEec-eEEEEecceeEEeeccc----ccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 395 AKAASAEFAVIVYKD-NFVMPRQRAVTVERERQ----AGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 395 ~~~~~~~~~~~~~~~-~~~i~~~~~~~i~~~~~----~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..+++...+++++++ .|.++++.++++..+++ |.+|++.||++.|+|+++++|+||+++++++. T Consensus 325 ~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 393 (408) T protein:vir:74 325 NSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAI 393 (408) T ss_pred cccCCcceEEEEehhccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEecccceEEEEeecc Confidence 777777778888887 47899999999987654 67999999999999999999999999999776 No 40 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=1.2e-56 Score=327.17 Aligned_cols=375 Identities=11% Similarity=-0.022 Sum_probs=245.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 14 LGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQE 93 (458) Q Consensus 14 ~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~ 93 (458) |+.+.+- .+.+.++.++.+...++++..... .....+++++...+++.+.++++.. T Consensus 1 Mk~~~el----------~~~~~~~~~~i~~~~~~~~~~~~~-~~~~~ee~~~l~~ei~~~~~~~~~~------------- 56 (397) T protein:vir:48 1 MKTSNEL----------HDLWVAQGDKVENLNEKLNVAMLD-DSVTAEELQAIKNERDTAKMKRDMF------------- 56 (397) T ss_pred CchHHHH----------HHHHHHHHHHHHHHHHHHHHhhcc-hhhhHHHHHHHHHHHHHHHHHHHHH------------- Confidence 2222111 111111111110000001000000 0000111111122222221111111 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccc Q lcl|NC_010583. 94 TIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEA 173 (458) Q Consensus 94 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ 173 (458) ++............. ....+.............++..|..+++.+.... ......++++.||++ T Consensus 57 -----~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~t~~~gg~~ 120 (397) T protein:vir:48 57 -----KEQYTEARANEVVNM---SEEEKKPLTKSEEEVKAGFVKDFKNLVRGRYQNL--------LDSKTDASGSDAGLT 120 (397) T ss_pred -----HHHHHHHHHhhhhhh---hhhccccccchhhHHHHHHHHHHHHHHhhhhhHH--------HHHhhccCCcccccc Confidence 110000000000000 0000001111112222334555666555432211 111234455678999 Q ss_pred cchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEe---cCCCcccccccccccccccccccccccceeeeeehhhee Q lcl|NC_010583. 174 YETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVE---PEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLA 250 (458) Q Consensus 174 ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~---~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~ 250 (458) ||+++++.|++.+++.++|+++++++|++++...+|+. ...+.++|++|++..++ .+.++|++|++++++++ T Consensus 121 iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-----~~~~~~~~v~~~~~k~~ 195 (397) T protein:vir:48 121 IPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGT-----NDDPKLYPIRYAIKRYA 195 (397) T ss_pred ccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeecccccccc-----ccccceeeEEeeheeee Confidence 99999999999999999999999999988776666654 23466889988864443 34689999999999999 Q ss_pred eeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHHHHHHHHHH Q lcl|NC_010583. 251 AKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKL 330 (458) Q Consensus 251 ~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (458) ++++||+++|+|+.+++++||.++|++++++++|.+|++|+|++.+.+.. .+++++.++ T Consensus 196 ~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~~~~~---------------------~~~d~i~~~ 254 (397) T protein:vir:48 196 GISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIATLPTKPTL---------------------TKWDDIIDL 254 (397) T ss_pred eehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc---------------------ccHHHHHHH Confidence 99999999999999999999999999999999999999999987664311 234567888 Q ss_pred HhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccc--ccccccCCceEEEEEe Q lcl|NC_010583. 331 RRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEY--FPAKAASAEFAVIVYK 408 (458) Q Consensus 331 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~--~~~~~~~~~~~~~~~~ 408 (458) ...+.+.+..++.|+||+.++..|++++|++|+|+++... ..+.+++|+|+||++++. +|....+...++++++ T Consensus 255 ~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~----~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~ 330 (397) T protein:vir:48 255 QAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYLMERDV----KSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDL 330 (397) T ss_pred HHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCceeeccCc----CCCCCceeccceeEEecccccCCcCCCceEEEEEec Confidence 8899999999999999999999999999999999986543 335668999999998654 5666667777888888 Q ss_pred c-eEEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 409 D-NFVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 409 ~-~~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) + .|.++++.++++..++ +|.+|++.||++.|+|+++++|+||++++++++ T Consensus 331 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:48 331 KQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVATDTESFVPASFKAI 385 (397) T ss_pred cceEEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEEecccceEEEEeccc Confidence 7 4778999998887654 588999999999999999999999999999998 No 41 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=7.2e-57 Score=328.37 Aligned_cols=380 Identities=13% Similarity=0.058 Sum_probs=249.0 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLV--SKAVGEDRKRLEEALDLVKNLDEKSK 78 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~--~~~~~e~~~~~~~~~~~i~~~~e~~~ 78 (458) |+|.|. + +++ .++...+.. +.++.+.++.+ .+.+.. .+...+.++++++..++++.+ T Consensus 1 m~~~m~-l-~el-----~~~~~~~~~--~~~~~~~~~~~-------~~~~~~~~~ee~~~~~~~~~~~~~~~~~~----- 59 (408) T protein:vir:10 1 MGVKLT-V-NQL-----NEAWIASGD--KVTDFNDQINM-------ALNDDNFSAEAMSELKNKRDNEKVRRDAL----- 59 (408) T ss_pred CCcccc-H-HHH-----HHHHHHHHH--HHHHHHHHHHH-------HhhcccccHHHHHHHHHHHHHHHHHHHHH----- Confidence 888774 3 222 222221111 11111111100 000000 000001111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHh Q lcl|NC_010583. 79 KSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHI 158 (458) Q Consensus 79 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 158 (458) ..++.......... .....+.............+..+|.++++....... .... T Consensus 60 -------------~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 113 (408) T protein:vir:10 60 -------------REQLVEAQAEQVVN----------MREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMN---TVSS 113 (408) T ss_pred -------------HHHHHHHHHHHHhc----------cccccccccccchhhhHHHHHHHHHHHhhcchhhhh---hhhh Confidence 11111111000000 000000111111222223345566666655432111 1122 Q ss_pred hhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEe--c-CCCcccccccccccccccccccc Q lcl|NC_010583. 159 KAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVE--P-EAGRATWVDASKFGTDETVGDEV 235 (458) Q Consensus 159 ~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~--~-~~~~a~~v~e~~~~~e~~~~~~~ 235 (458) ++ ...++.+.||++||+++++.|++.+++.++|+++|+++|+++....+|+. . ..+.+.|++|++..++ .+ T Consensus 114 ~a-~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-----~~ 187 (408) T protein:vir:10 114 KT-ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPD-----LD 187 (408) T ss_pred hh-hhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeecCcccccc-----cc Confidence 23 33455667899999999999999999999999999999998776666654 3 3466789888864443 35 Q ss_pred cccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccc Q lcl|NC_010583. 236 KGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAK 315 (458) Q Consensus 236 ~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 315 (458) .++|++|++.+++++++++||+++|+|+.++|.+||.+.|++++++++|.+|++|+|++.+.+-. T Consensus 188 ~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~~~~~--------------- 252 (408) T protein:vir:10 188 NPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTI--------------- 252 (408) T ss_pred CcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc--------------- Confidence 68999999999999999999999999999999999999999999999999999999987653211 Q ss_pred cchhhHHHHHHHHHHH-hhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecc--c Q lcl|NC_010583. 316 ADGSVLVTAKTISKLR-RKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSE--Y 392 (458) Q Consensus 316 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~--~ 392 (458) .++.++.+++ ..+.+.|..++.|+||+.++..|++++|++|+|+|++.. ..+.+.+|+|+||++++ . T Consensus 253 ------~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~----~~~~~~~l~G~PV~~~~~~~ 322 (408) T protein:vir:10 253 ------AKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDP----TKPNSYLIKGKQVIVVADRW 322 (408) T ss_pred ------ccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCc----CCCCCceecceeeEEecccc Confidence 1233455544 567788899999999999999999999999999987643 33556799999999965 4 Q ss_pred ccccccCCceEEEEEece-EEEEecceeEEeeccc----ccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 393 FPAKAASAEFAVIVYKDN-FVMPRQRAVTVERERQ----AGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 393 ~~~~~~~~~~~~~~~~~~-~~i~~~~~~~i~~~~~----~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +|+.+++...+++++++. |.++++.++++..+++ |.+|++.||++.|+|+++++|+||++++++++ T Consensus 323 ~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~ 393 (408) T protein:vir:10 323 LPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) T ss_pred cCccCCCceEEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEeeccEEeccccEEEEEeecc Confidence 677667777778888875 7899999999987654 67899999999999999999999999999997 No 42 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=3.6e-56 Score=324.56 Aligned_cols=382 Identities=13% Similarity=0.034 Sum_probs=247.8 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) |+|.|.- ++|.++.+.+..+ .+.+.++.......-+.. .+...+..+.+++...+++.+.+.+ T Consensus 1 ~~~~m~l-------~el~~~~~~~~~~------~~~~~~~~~~~~~~~~~~-~ee~~~~~~~~~~~~~~~~~~~~~~--- 63 (404) T protein:vir:39 1 MGVKLTV-------NQLNEAWIASGDK------VTDFNDQINMALNDDNFS-AEAMSELKNKRDNEKVRRDALREQL--- 63 (404) T ss_pred CChHHHH-------HHHHHHHHHHHHH------HHHHHHHHHHHhcccccc-HHHHHHHHHHHHHHHHHHHHHHHHH--- Confidence 7766532 2233333322111 111111100000000000 0000011111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhh Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKA 160 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a 160 (458) +.. ....... ....................+.+|.++++.+..... ....++ T Consensus 64 --------~~~-------~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~e~~a 115 (404) T protein:vir:39 64 --------VEA-------QAEQVVN----------MREEEKGPLNKSEYELKDKFVKEFVNMVRNPMAFLN---TVSSKT 115 (404) T ss_pred --------HHH-------HHHHHhc----------cccccccccccchhhhHHHHHHHHHHHHhcchhhhh---hhhhhh Confidence 000 0000000 000000111111222233455667777665432211 112222 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEe--c-CCCcccccccccccccccccccccc Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVE--P-EAGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~--~-~~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) ...++.+.||++||+++.+.|++.+++.++|+++++++|++++...+|+. . ..+.+.|++|++..++ .+.+ T Consensus 116 -~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-----~~~~ 189 (404) T protein:vir:39 116 -ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPD-----LDNP 189 (404) T ss_pred -hhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCcccccc-----cccc Confidence 23455677899999999999999999999999999999988776666543 3 3467889988864442 3578 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) +|++|++++++++++++||+++++|+.++|++||.++|++++++++|.+||+|+|++.|.+... T Consensus 190 ~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~~~~~~~---------------- 253 (404) T protein:vir:39 190 RLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKKPTIA---------------- 253 (404) T ss_pred ceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc---------------- Confidence 9999999999999999999999999999999999999999999999999999999987654321 Q ss_pred hhhHHHHHHHHHHH-hhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccc--cc Q lcl|NC_010583. 318 GSVLVTAKTISKLR-RKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEY--FP 394 (458) Q Consensus 318 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~--~~ 394 (458) .+.++.+++ ..+.+.|..++.|+||+.++..|.+++|++|+|+++... ..+.+++|+|+||+++++ +| T Consensus 254 -----~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~----~~~~~~~l~G~pV~~~~~~~~~ 324 (404) T protein:vir:39 254 -----KFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDP----TKPNSYLIKGKKVIVVADRWLP 324 (404) T ss_pred -----cHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceeeccCc----CCCCcceecceeEEEecccccC Confidence 123344443 356677888999999999999999999999999986543 334567999999999765 55 Q ss_pred ccccCCceEEEEEec-eEEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 395 AKAASAEFAVIVYKD-NFVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 395 ~~~~~~~~~~~~~~~-~~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +.+.+...+++++++ .|.++++.++++..++ +|.+|++.||++.|+|+++.+|+||++++++++ T Consensus 325 ~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 393 (404) T protein:vir:39 325 NSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAI 393 (404) T ss_pred ccCCCccEEEEEeccccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEecccceEEEEeecc Confidence 555566667788877 4788999999988765 467999999999999999999999999998888 No 43 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=1.5e-56 Score=326.55 Aligned_cols=358 Identities=15% Similarity=0.113 Sum_probs=231.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhh Q lcl|NC_010583. 45 LARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALY 124 (458) Q Consensus 45 ~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 124 (458) +++++++..+..+ .++++ .+.++...+. .+..+...+..... ..+..........+....... T Consensus 1 ik~L~e~~~e~~e-~~~~~---~~~~~~~~~~-~e~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~----- 63 (390) T protein:vir:40 1 MNNLDKKDSETLN-ISTAF---LNAIKEGATE-AEQVTAFTNMAEQI-------QNNIIAQARKEVNREMNDNNV----- 63 (390) T ss_pred CchHHHHHHHHHH-HHHHH---HHHHhhhhhH-HHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHH----- Confidence 2222222111111 11111 1111111100 00000000000111 000000000000000000000 Q ss_pred cchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccC Q lcl|NC_010583. 125 GTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSK 204 (458) Q Consensus 125 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 204 (458) .. .+....+..+.+.. .......+++++||++||+++.++|++.++..++|+++|+++|++++ T Consensus 64 --------------~~--~~~~~~l~~~~r~~-~~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~ 126 (390) T protein:vir:40 64 --------------LA--SRGANALTSDESKY-YNEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTAT 126 (390) T ss_pred --------------HH--hcCchhccHHHHHH-HHHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCc Confidence 00 00000000111111 11223345567889999999999999999999999999999999999 Q ss_pred ceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 205 ILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIE 284 (458) Q Consensus 205 ~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d 284 (458) ...+|+..+.+.+.|++|++..+ +.++++|++|++++|+++++++||+++|+|++++|++||+++|++++++++| T Consensus 127 ~~~i~~~~~~~~a~~~~E~~~~~-----~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~ 201 (390) T protein:vir:40 127 TEWIISVGDVATAWWGPLCAEIK-----EVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLE 201 (390) T ss_pred eeEEEEEcCCcceeeeccccccC-----ccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999875433 3568999999999999999999999999999999999999999999999999 Q ss_pred HHHhccCCCCccccccccccccccceeeccccch----hhHHHHHHHHHHHhhhhhhhcccceeEechhHH-H---HHHh Q lcl|NC_010583. 285 EAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADG----SVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMD-A---YYDL 356 (458) Q Consensus 285 ~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~---~l~~ 356 (458) .+||+|+|+++|.||++.............+... .....+..+...+......+..++.|+||+.++ . .+.. T Consensus 202 ~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~ 281 (390) T protein:vir:40 202 AGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATS 281 (390) T ss_pred hhhhcccCCCccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhh Confidence 9999999999999999865432221111111111 111112222222323333456778899999874 2 4457 Q ss_pred hhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeecc--cccCCceEE Q lcl|NC_010583. 357 LEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERER--QAGKQRDAY 434 (458) Q Consensus 357 ~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~--~~~~~~~~~ 434 (458) ++|.+|+|++.. .++|+||+++++||.+ .+++++++.|.++++.++++.+++ +|.+|++.| T Consensus 282 ~~d~~G~~v~~~------------~~~g~pvv~~~~~p~~-----~i~~Gd~s~~~i~~~~~~~v~~~~~~~f~~~~~~~ 344 (390) T protein:vir:40 282 YMTPQGVWVTGI------------LPVPLEIVQSVAVPVG-----KAVAGRAKDYFMGIGSEQVIRTSTEYRLLDDETLY 344 (390) T ss_pred ccCCCCcccccc------------CCCceeEEEcCCCCCC-----cEEEEeeceEEEEeecceEEEecchhhhhcCcEEE Confidence 899999987522 3579999999999853 367899999999999999988765 689999999 Q ss_pred EEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 435 YVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 435 ~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |++.|+|+++++|+||++++++++ T Consensus 345 r~~~r~dg~v~~~~A~~~l~~~~~ 368 (390) T protein:vir:40 345 YAKQYANGRPKDNSSFLVFDITGL 368 (390) T ss_pred EEEEEeCCEEecccceEEEEeecc Confidence 999999999999999999999998 No 44 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=7.7e-56 Score=322.75 Aligned_cols=406 Identities=14% Similarity=0.085 Sum_probs=248.8 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) |+ ..+..+++. .++.+....+ ....++......+++.+. ++..++.++...+++.+.+... T Consensus 1 m~--~~~~lee~~-a~l~~~~~~~----------~~~~~~~~~~~~e~~~~~----~~~~~~~~~~~~~~~~~~~~~~-- 61 (419) T protein:vir:94 1 MP--PTPTLEEQR-AALLARLDDT----------SLTTEQVQEIVAEARGLA----DALQAESDRAAARAALLRTAPP-- 61 (419) T ss_pred CC--HHHHHHHHH-HHHHHHHHHH----------HHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHH-- Confidence 33 222222221 1111111000 000111111111111111 1111111111111111111000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchh----HHHHHHH Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVF----ETEHGKA 156 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~----~~~~~~~ 156 (458) ................ ....+. .............+.....++... ....... T Consensus 62 ----------------~~~~~~~~~~~~~~~~----~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 118 (419) T protein:vir:94 62 ----------------APKGPADGGTPLTPAE----AGTFRS---LAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRL 118 (419) T ss_pred ----------------HHHHHhhhhccccccc----cccccc---hhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHh Confidence 0000000000000000 000000 000000000000111111111000 0001111 Q ss_pred HhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCccccc--cccccccccccccc Q lcl|NC_010583. 157 HIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWV--DASKFGTDETVGDE 234 (458) Q Consensus 157 ~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v--~e~~~~~e~~~~~~ 234 (458) ........+...+++.++|+.+...|+..+.....|+++++++|++++...||+.++.....|. ....|++|++.+++ T Consensus 119 ~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~ 198 (419) T protein:vir:94 119 LSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ 198 (419) T ss_pred hccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccc Confidence 1122233344556677888888888888888889999999999999999999998765433221 22344455567778 Q ss_pred ccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeecc Q lcl|NC_010583. 235 VKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEA 314 (458) Q Consensus 235 ~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~ 314 (458) ++++|++|++++++++++++||+++++|++ ++++||..+|++++++++|.+||+|+|+++|+||++.......... .. T Consensus 199 ~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~-~~ 276 (419) T protein:vir:94 199 STLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQP-KP 276 (419) T ss_pred cccceeeEEeeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccc-cc Confidence 899999999999999999999999999985 7999999999999999999999999999999999987654433322 22 Q ss_pred ccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccccc Q lcl|NC_010583. 315 KADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFP 394 (458) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~ 394 (458) .........+.++.++++.+...+..+++|+||+.++..|.+++|.+|++++.. .....+.+++|+|+||++++++| T Consensus 277 ~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~~~~~~~---~~~~~~~~~~l~G~pV~~~~~~~ 353 (419) T protein:vir:94 277 TAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVI---ANVQGEATPRIWGLNVVSTVAIA 353 (419) T ss_pred ccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcCCCceeec---CCcccCCCccccceeeEEcCCCC Confidence 334445567888999999999999999999999999999999999887765422 22344566899999999999998 Q ss_pred ccccCCceEEEEEec-eEEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 395 AKAASAEFAVIVYKD-NFVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 395 ~~~~~~~~~~~~~~~-~~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) . ..+++++++ .|.++++.++++..++ +|.+|++.||++.|+|+++++|+||++++++|| T Consensus 354 ~-----~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa 417 (419) T protein:vir:94 354 Q-----GTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) T ss_pred C-----ccEEEeeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccccEEEEEeccC Confidence 5 235677776 4788999999887644 478999999999999999999999999999999 No 45 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=5.2e-56 Score=323.67 Aligned_cols=368 Identities=12% Similarity=0.055 Sum_probs=247.2 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) |.-.|.++++++. .+.+++ +.+.+ ++..+++++..++++.+.+++++. T Consensus 1 M~k~l~el~~~~~--~~~~e~---------------------------~~~~~---~~~~~e~~~~~~e~~~l~~~i~~~ 48 (392) T protein:vir:10 1 MSKELRELLAKLE--GKKEEV---------------------------RSLMG---EDKVAEAEQMMEEVRSLQKKIDLQ 48 (392) T ss_pred CcHHHHHHHHHHH--HHHHHH---------------------------HHHhh---HHHHHHHHHHHHHHHHHHHHHHHH Confidence 4433333333322 222222 22211 111222233333333333332221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHH---HH Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGK---AH 157 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~---~~ 157 (458) .+..+.+.+. . . . . .............++.++.+.++.+......... .. T Consensus 49 ~~~~~~~~~~------------~--~----~----~-----~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 101 (392) T protein:vir:10 49 RSLDEAETEE------------R--N----N----G-----REVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDL 101 (392) T ss_pred HHHHHHHHHH------------h--h----c----c-----ccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhh Confidence 1110000000 0 0 0 0 0000111112233455566666554432222211 11 Q ss_pred hhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCc--eEEEEecCCCcccccccccccccccccccc Q lcl|NC_010583. 158 IKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKI--LTMLVEPEAGRATWVDASKFGTDETVGDEV 235 (458) Q Consensus 158 ~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~ 235 (458) .......++.++||++||+++.+.|++.+++.++|++++++++++++. ..+|+..+.+.++|++|++..++ .+ T Consensus 102 ~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-----~~ 176 (392) T protein:vir:10 102 EQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPE-----TD 176 (392) T ss_pred hhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccc-----cc Confidence 222233455677899999999999999999999999999999987655 45666777788999999865443 34 Q ss_pred cccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccc Q lcl|NC_010583. 236 KGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAK 315 (458) Q Consensus 236 ~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 315 (458) .++|+.|++.+++++++++||+++|+||.++|.+||.+.|++++++++|.+|++|+|++.+.|..+ T Consensus 177 ~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~-------------- 242 (392) T protein:vir:10 177 NPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS-------------- 242 (392) T ss_pred cccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccC-------------- Confidence 689999999999999999999999999999999999999999999999999999999876654321 Q ss_pred cchhhHHHHHHHHHHH-hhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceec-ccc Q lcl|NC_010583. 316 ADGSVLVTAKTISKLR-RKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVS-EYF 393 (458) Q Consensus 316 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~-~~~ 393 (458) +.++.+++ ..+.+.|..++.|+||+.++..|++++|++|+|+|++... .+.+++|+|+|++++ +.+ T Consensus 243 --------~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~----~~~~~tllG~~~v~~~~~~ 310 (392) T protein:vir:10 243 --------LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPT----QKNKKLFAGTNPVVVVSNR 310 (392) T ss_pred --------HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCcc----CCccccccCcccEEEeccc Confidence 23444433 4778888899999999999999999999999999966433 355678999876653 332 Q ss_pred -c---ccccCCceEEEEEece-EEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 394 -P---AKAASAEFAVIVYKDN-FVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 394 -~---~~~~~~~~~~~~~~~~-~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) | ...++...+++++|+. |.++++.++++..++ +|.+|++.||++.|+|+++++|+||++++++++ T Consensus 311 ~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 311 FLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred ccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2 2344566667788775 788999999988764 478999999999999999999999999999888 No 46 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=5.2e-56 Score=323.67 Aligned_cols=368 Identities=12% Similarity=0.055 Sum_probs=247.2 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) |.-.|.++++++. .+.+++ +.+.+ ++..+++++..++++.+.+++++. T Consensus 1 M~k~l~el~~~~~--~~~~e~---------------------------~~~~~---~~~~~e~~~~~~e~~~l~~~i~~~ 48 (392) T protein:vir:10 1 MSKELRELLAKLE--GKKEEV---------------------------RSLMG---EDKVAEAEQMMEEVRSLQKKIDLQ 48 (392) T ss_pred CcHHHHHHHHHHH--HHHHHH---------------------------HHHhh---HHHHHHHHHHHHHHHHHHHHHHHH Confidence 4433333333322 222222 22211 111222233333333333332221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHH---HH Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGK---AH 157 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~---~~ 157 (458) .+..+.+.+. . . . . .............++.++.+.++.+......... .. T Consensus 49 ~~~~~~~~~~------------~--~----~----~-----~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 101 (392) T protein:vir:10 49 RSLDEAETEE------------R--N----N----G-----REVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDL 101 (392) T ss_pred HHHHHHHHHH------------h--h----c----c-----ccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhh Confidence 1110000000 0 0 0 0 0000111112233455566666554432222211 11 Q ss_pred hhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCc--eEEEEecCCCcccccccccccccccccccc Q lcl|NC_010583. 158 IKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKI--LTMLVEPEAGRATWVDASKFGTDETVGDEV 235 (458) Q Consensus 158 ~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~ 235 (458) .......++.++||++||+++.+.|++.+++.++|++++++++++++. ..+|+..+.+.++|++|++..++ .+ T Consensus 102 ~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-----~~ 176 (392) T protein:vir:10 102 EQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPE-----TD 176 (392) T ss_pred hhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccc-----cc Confidence 222233455677899999999999999999999999999999987655 45666777788999999865443 34 Q ss_pred cccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccc Q lcl|NC_010583. 236 KGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAK 315 (458) Q Consensus 236 ~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 315 (458) .++|+.|++.+++++++++||+++|+||.++|.+||.+.|++++++++|.+|++|+|++.+.|..+ T Consensus 177 ~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~-------------- 242 (392) T protein:vir:10 177 NPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS-------------- 242 (392) T ss_pred cccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccC-------------- Confidence 689999999999999999999999999999999999999999999999999999999876654321 Q ss_pred cchhhHHHHHHHHHHH-hhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceec-ccc Q lcl|NC_010583. 316 ADGSVLVTAKTISKLR-RKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVS-EYF 393 (458) Q Consensus 316 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~-~~~ 393 (458) +.++.+++ ..+.+.|..++.|+||+.++..|++++|++|+|+|++... .+.+++|+|+|++++ +.+ T Consensus 243 --------~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~----~~~~~tllG~~~v~~~~~~ 310 (392) T protein:vir:10 243 --------LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPT----QKNKKLFAGTNPVVVVSNR 310 (392) T ss_pred --------HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCcc----CCccccccCcccEEEeccc Confidence 23444433 4778888899999999999999999999999999966433 355678999876653 332 Q ss_pred -c---ccccCCceEEEEEece-EEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 394 -P---AKAASAEFAVIVYKDN-FVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 394 -~---~~~~~~~~~~~~~~~~-~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) | ...++...+++++|+. |.++++.++++..++ +|.+|++.||++.|+|+++++|+||++++++++ T Consensus 311 ~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 311 FLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred ccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2 2344566667788775 788999999988764 478999999999999999999999999999888 No 47 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=5.2e-56 Score=323.67 Aligned_cols=368 Identities=12% Similarity=0.055 Sum_probs=247.2 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) |.-.|.++++++. .+.+++ +.+.+ ++..+++++..++++.+.+++++. T Consensus 1 M~k~l~el~~~~~--~~~~e~---------------------------~~~~~---~~~~~e~~~~~~e~~~l~~~i~~~ 48 (392) T protein:vir:10 1 MSKELRELLAKLE--GKKEEV---------------------------RSLMG---EDKVAEAEQMMEEVRSLQKKIDLQ 48 (392) T ss_pred CcHHHHHHHHHHH--HHHHHH---------------------------HHHhh---HHHHHHHHHHHHHHHHHHHHHHHH Confidence 4433333333322 222222 22211 111222233333333333332221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHH---HH Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGK---AH 157 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~---~~ 157 (458) .+..+.+.+. . . . . .............++.++.+.++.+......... .. T Consensus 49 ~~~~~~~~~~------------~--~----~----~-----~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 101 (392) T protein:vir:10 49 RSLDEAETEE------------R--N----N----G-----REVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDL 101 (392) T ss_pred HHHHHHHHHH------------h--h----c----c-----ccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhh Confidence 1110000000 0 0 0 0 0000111112233455566666554432222211 11 Q ss_pred hhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCc--eEEEEecCCCcccccccccccccccccccc Q lcl|NC_010583. 158 IKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKI--LTMLVEPEAGRATWVDASKFGTDETVGDEV 235 (458) Q Consensus 158 ~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~ 235 (458) .......++.++||++||+++.+.|++.+++.++|++++++++++++. ..+|+..+.+.++|++|++..++ .+ T Consensus 102 ~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-----~~ 176 (392) T protein:vir:10 102 EQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPE-----TD 176 (392) T ss_pred hhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccc-----cc Confidence 222233455677899999999999999999999999999999987655 45666777788999999865443 34 Q ss_pred cccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccc Q lcl|NC_010583. 236 KGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAK 315 (458) Q Consensus 236 ~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 315 (458) .++|+.|++.+++++++++||+++|+||.++|.+||.+.|++++++++|.+|++|+|++.+.|..+ T Consensus 177 ~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~-------------- 242 (392) T protein:vir:10 177 NPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS-------------- 242 (392) T ss_pred cccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccC-------------- Confidence 689999999999999999999999999999999999999999999999999999999876654321 Q ss_pred cchhhHHHHHHHHHHH-hhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceec-ccc Q lcl|NC_010583. 316 ADGSVLVTAKTISKLR-RKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVS-EYF 393 (458) Q Consensus 316 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~-~~~ 393 (458) +.++.+++ ..+.+.|..++.|+||+.++..|++++|++|+|+|++... .+.+++|+|+|++++ +.+ T Consensus 243 --------~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~----~~~~~tllG~~~v~~~~~~ 310 (392) T protein:vir:10 243 --------LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPT----QKNKKLFAGTNPVVVVSNR 310 (392) T ss_pred --------HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCcc----CCccccccCcccEEEeccc Confidence 23444433 4778888899999999999999999999999999966433 355678999876653 332 Q ss_pred -c---ccccCCceEEEEEece-EEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 394 -P---AKAASAEFAVIVYKDN-FVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 394 -~---~~~~~~~~~~~~~~~~-~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) | ...++...+++++|+. |.++++.++++..++ +|.+|++.||++.|+|+++++|+||++++++++ T Consensus 311 ~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 311 FLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred ccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2 2344566667788775 788999999988764 478999999999999999999999999999888 No 48 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=5.2e-56 Score=323.67 Aligned_cols=368 Identities=12% Similarity=0.055 Sum_probs=247.2 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) |.-.|.++++++. .+.+++ +.+.+ ++..+++++..++++.+.+++++. T Consensus 1 M~k~l~el~~~~~--~~~~e~---------------------------~~~~~---~~~~~e~~~~~~e~~~l~~~i~~~ 48 (392) T protein:vir:10 1 MSKELRELLAKLE--GKKEEV---------------------------RSLMG---EDKVAEAEQMMEEVRSLQKKIDLQ 48 (392) T ss_pred CcHHHHHHHHHHH--HHHHHH---------------------------HHHhh---HHHHHHHHHHHHHHHHHHHHHHHH Confidence 4433333333322 222222 22211 111222233333333333332221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHH---HH Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGK---AH 157 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~---~~ 157 (458) .+..+.+.+. . . . . .............++.++.+.++.+......... .. T Consensus 49 ~~~~~~~~~~------------~--~----~----~-----~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 101 (392) T protein:vir:10 49 RSLDEAETEE------------R--N----N----G-----REVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDL 101 (392) T ss_pred HHHHHHHHHH------------h--h----c----c-----ccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhh Confidence 1110000000 0 0 0 0 0000111112233455566666554432222211 11 Q ss_pred hhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCc--eEEEEecCCCcccccccccccccccccccc Q lcl|NC_010583. 158 IKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKI--LTMLVEPEAGRATWVDASKFGTDETVGDEV 235 (458) Q Consensus 158 ~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~ 235 (458) .......++.++||++||+++.+.|++.+++.++|++++++++++++. ..+|+..+.+.++|++|++..++ .+ T Consensus 102 ~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-----~~ 176 (392) T protein:vir:10 102 EQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPE-----TD 176 (392) T ss_pred hhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccc-----cc Confidence 222233455677899999999999999999999999999999987655 45666777788999999865443 34 Q ss_pred cccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccc Q lcl|NC_010583. 236 KGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAK 315 (458) Q Consensus 236 ~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 315 (458) .++|+.|++.+++++++++||+++|+||.++|.+||.+.|++++++++|.+|++|+|++.+.|..+ T Consensus 177 ~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~~~~~-------------- 242 (392) T protein:vir:10 177 NPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQAIKS-------------- 242 (392) T ss_pred cccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccC-------------- Confidence 689999999999999999999999999999999999999999999999999999999876654321 Q ss_pred cchhhHHHHHHHHHHH-hhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceec-ccc Q lcl|NC_010583. 316 ADGSVLVTAKTISKLR-RKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVS-EYF 393 (458) Q Consensus 316 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~-~~~ 393 (458) +.++.+++ ..+.+.|..++.|+||+.++..|++++|++|+|+|++... .+.+++|+|+|++++ +.+ T Consensus 243 --------~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~----~~~~~tllG~~~v~~~~~~ 310 (392) T protein:vir:10 243 --------LDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPT----QKNKKLFAGTNPVVVVSNR 310 (392) T ss_pred --------HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCcc----CCccccccCcccEEEeccc Confidence 23444433 4778888899999999999999999999999999966433 355678999876653 332 Q ss_pred -c---ccccCCceEEEEEece-EEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 394 -P---AKAASAEFAVIVYKDN-FVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 394 -~---~~~~~~~~~~~~~~~~-~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) | ...++...+++++|+. |.++++.++++..++ +|.+|++.||++.|+|+++++|+||++++++++ T Consensus 311 ~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 311 FLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred ccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2 2344566667788775 788999999988764 478999999999999999999999999999888 No 49 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=2e-55 Score=320.40 Aligned_cols=373 Identities=12% Similarity=0.073 Sum_probs=241.3 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 3 IDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAE 82 (458) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e 82 (458) |+|++|++++. ++.++++.+.++.+..... +..+.... ..+.. ++..++++.+.+... T Consensus 1 M~~~eL~~~~~--~~~~~~~~l~e~~~~~~~~-~~~~~~~~------------~~ee~---~~l~~~i~~~~~~~~---- 58 (395) T protein:vir:38 1 MNINQLKDAFD--MAGQKVQDLEDKRAQFAID-LGNDASSH------------SVDDI---NKLNASLKNAKMAQE---- 58 (395) T ss_pred CCHHHHHHHHH--HHHHHHHHHHHHHHHHHHH-HhhhHHHH------------HHHHH---HHHHHHHHHHHHHHH---- Confidence 77788887765 4455555443221110000 00000000 00011 111111111111000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhh Q lcl|NC_010583. 83 LFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVN 162 (458) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~ 162 (458) ...+.... ....... . .... ..................+.+.++. .... T Consensus 59 ~~~~~~~~----------~~~~~~~-~-~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~ 107 (395) T protein:vir:38 59 LAKSAYED----------ARANLNA-E-PVNK------KPLPVKDGKPDAQAMKNQFVKDFKN-------------LVTS 107 (395) T ss_pred HHHHHHHH----------HHhhhhh-c-cccc------cccchhhhhHHHHHHHHHHHHHHHH-------------HHhh Confidence 00000000 0000000 0 0000 0000000000001111122211110 0111 Q ss_pred cccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEE--ecC-CCcccccccccccccccccccccccc Q lcl|NC_010583. 163 GSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLV--EPE-AGRATWVDASKFGTDETVGDEVKGQL 239 (458) Q Consensus 163 ~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~--~~~-~~~a~~v~e~~~~~e~~~~~~~~~~f 239 (458) ..++++.||++||+++.+.|++.+++.++|+++|+++|++++...+++ ..+ .+.+.|++|++..++ .+.++| T Consensus 108 ~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-----~~~~~f 182 (395) T protein:vir:38 108 GTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESALIGD-----NDDPEL 182 (395) T ss_pred ccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCcccccccccccccc-----ccccce Confidence 234456689999999999999999999999999999998876655554 333 456789988865443 346899 Q ss_pred eeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchh Q lcl|NC_010583. 240 TEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGS 319 (458) Q Consensus 240 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 319 (458) +.|++++++++++++||+++++|+.++|++||.++|++++++++|.+|++|+|++.+.+... T Consensus 183 ~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~~~------------------ 244 (395) T protein:vir:38 183 TVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPKKPTIS------------------ 244 (395) T ss_pred eeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc------------------ Confidence 99999999999999999999999999999999999999999999999999999876532111 Q ss_pred hHHHHHHHHHHH-hhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccccccc-c Q lcl|NC_010583. 320 VLVTAKTISKLR-RKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAK-A 397 (458) Q Consensus 320 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~-~ 397 (458) .+.++.+++ ..+...++.++.|+||+.++..|.+++|++|+|+|++.. ..+.+++|+|+||+++++++.. . T Consensus 245 ---~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~----~~~~~~~l~G~pV~~~~~~~~~~~ 317 (395) T protein:vir:38 245 ---QFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDADGRYLMQPDV----TSPDKYLIDGKPVIRIADKWLPDV 317 (395) T ss_pred ---cHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeeccCc----CCCCcceeccceeEEecccccCcC Confidence 122344443 367788889999999999999999999999999986643 3456679999999999887644 3 Q ss_pred cCCceEEEEEec-eEEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 398 ASAEFAVIVYKD-NFVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 398 ~~~~~~~~~~~~-~~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+...+++++++ .|.++++.++++..++ +|.+|++.||++.|+|+++.+|+||++++++++ T Consensus 318 ~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 383 (395) T protein:vir:38 318 SGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTV 383 (395) T ss_pred CCcceEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecc Confidence 445556778877 4889999998887653 578999999999999999999999999999988 No 50 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=1.7e-55 Score=320.88 Aligned_cols=378 Identities=11% Similarity=0.053 Sum_probs=247.1 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 3 IDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAE 82 (458) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e 82 (458) |.|++..+++. +++. ++.++.....++++.+..+... ++.++..++++.+.++++... T Consensus 1 Mn~~e~lkel~-----~~~~-------------el~~~~~~~~~~~~~~~~e~~~---~e~~~~~~e~~~l~~~i~~~~- 58 (421) T protein:vir:13 1 MNLFERLKELR-----AKKK-------------ELEEKRCGIVEEIRSLAKEKKE---EEARSKALEREKIEARMEIIE- 58 (421) T ss_pred CCHHHHHHHHH-----HHHH-------------HHHHHHHHHHHHHHHHhhccch---HHHHHHHHHHHHHHHHHHHHH- Confidence 44443332222 1111 1111111111122221111110 111122222222222211111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhh Q lcl|NC_010583. 83 LFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVN 162 (458) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~ 162 (458) +..+....... ...... .....................+.+|.++++........+ T Consensus 59 ---~~~~~~~~~~~---~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r--------- 114 (421) T protein:vir:13 59 ---EEIESVMTAID---EERKNT---------NFTGGRVIINGDSKEEKRSLQLSAMSKTIRGIQLSEEER--------- 114 (421) T ss_pred ---HHHHHHHHHHH---HHHhhh---------cccccccccccchhHHHHHHHHHHHHHhhhccchhHHHh--------- Confidence 01111110000 000000 000000000011111112233445556555433221111 Q ss_pred cccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccc--cccccccccccccccccccce Q lcl|NC_010583. 163 GSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATW--VDASKFGTDETVGDEVKGQLT 240 (458) Q Consensus 163 ~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~--v~e~~~~~e~~~~~~~~~~f~ 240 (458) ...+++.||++||+++.+.|++.+++.++|+++|+++|++++...+|+....+.+.| ++|+ +.+++++++|+ T Consensus 115 a~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~------~~~~~s~~~f~ 188 (421) T protein:vir:13 115 DIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVRAGASVDKLANLAKD------TELVKAMLKTQ 188 (421) T ss_pred hccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEeecCCccceeecccc------cccccccccee Confidence 123455689999999999999999999999999999999999999999887766544 5554 45667889999 Q ss_pred eeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhh Q lcl|NC_010583. 241 EISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSV 320 (458) Q Consensus 241 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~ 320 (458) +|++.+++++++++||+++|+|+.++|++||.++|++++..++|..++ +.|+|+++.... T Consensus 189 ~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~-----~~~~g~~~~~~~--------------- 248 (421) T protein:vir:13 189 PMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIV-----KQAKAVLAEETI--------------- 248 (421) T ss_pred EEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHh-----hhhhhccccccc--------------- Confidence 999999999999999999999999999999999999999999887665 467887643211 Q ss_pred HHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCC Q lcl|NC_010583. 321 LVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASA 400 (458) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~ 400 (458) .+++++++++..+...++.++.|+||+.++..|.+++|++|+|+|+. +..+.+++|||+||++++++|.+..+. T Consensus 249 -~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~-----~~~~~~~tl~G~pV~~~~~~~~~~~~~ 322 (421) T protein:vir:13 249 -NDYAGLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKE-----LSDGGDLVFKGRPVIELEESIFDVGDE 322 (421) T ss_pred -cchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecC-----cCCCCCceecceeeEEeccccccCCCc Confidence 13557888888888999999999999999999999999999999964 233556799999999999999877777 Q ss_pred ceEEEEEece-EEEEecceeEEeec--ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 401 EFAVIVYKDN-FVMPRQRAVTVERE--RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 401 ~~~~~~~~~~-~~i~~~~~~~i~~~--~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..+++++++. |+++++.++++..+ .+|.+|++.||++.|+|+++++|+||+.++...- T Consensus 323 ~~~~~gd~~~~~~~~~~~~~~v~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 383 (421) T protein:vir:13 323 TKFIVSDFKTLIKFMDRKQYLIDQSKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRKF 383 (421) T ss_pred eEEEEEeccccEEEEEecceEEEeecccccccCeeEEEEEeeecceeecchhhheeeeccc Confidence 7788888875 88999999888764 5688999999999999999999999876555432 No 51 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=4.1e-55 Score=318.74 Aligned_cols=395 Identities=19% Similarity=0.161 Sum_probs=236.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 17 LAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIV 96 (458) Q Consensus 17 ~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~ 96 (458) |.++.... ..+...+...+++.+.++.... ....++..++++.+.+...+..+......... T Consensus 1 ~~ke~~~~------------~~~~~~~~~~e~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 62 (413) T protein:vir:81 1 MVKEAGDA------------PTNAQVAEIAEVKSMVEQFKAD-EDAKRERAKSVKANQDFLRELQEATAGSVDSE----- 62 (413) T ss_pred ChhhHHHH------------HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHH----- Confidence 22221110 0000111111111111111110 01111111222222111111111100000000 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccch Q lcl|NC_010583. 97 GLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYET 176 (458) Q Consensus 97 ~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~ 176 (458) .............. ......+............. ............+.... .......++++.++.++|+ T Consensus 63 ---~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~vp~ 132 (413) T protein:vir:81 63 ---KSGELTRKGEGYKS-IGEFFAKRAGDQIKQQAGGA-----QLNYSVGEYVAPRVKAA-SDPASTATLTDEFQGGYGT 132 (413) T ss_pred ---HhhhHhhhhhhhhh-hhhhhhhhhhhHHHHHHHHH-----HhhhhhhhhhhhHHHhh-hhhhhhcccccccccccch Confidence 00000000000000 00000000000000000000 00000011111111111 1122334556778899999 Q ss_pred hHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCC----Cccccccccccccccccccccc-ccceeeeeehhheee Q lcl|NC_010583. 177 IFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEA----GRATWVDASKFGTDETVGDEVK-GQLTEISFKTYKLAA 251 (458) Q Consensus 177 ~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~----~~a~~v~e~~~~~e~~~~~~~~-~~f~~v~~~~~k~~~ 251 (458) ++.+.|++.+++.++|+++++++|++++...+|+.... ..+.|++|++..+ +++ ++|+.|++.++++++ T Consensus 133 ~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~------~~~~~~f~~i~~~~~k~~~ 206 (413) T protein:vir:81 133 TWNRNIIYRRREKLVVADLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKP------YMRFADFDIVTESLSKIAG 206 (413) T ss_pred hhHHHHHHHHhhhhhHHhhcceeeccCCceeEEEeccccccccccceecCccccc------ccCcccceeeEeeeeeEEE Confidence 99999999999999999999999999999999998764 3468888875443 444 789999999999999 Q ss_pred eehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcc-ccccccccccccceeeccccchhhHHHHHHHHHH Q lcl|NC_010583. 252 KSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQP-KGLLKLAADDGAKVVTEAKADGSVLVTAKTISKL 330 (458) Q Consensus 252 ~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p-~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (458) +++||+++|+|++ .|.+||+..|++++++++|.+||+|+|+++| .||++.+....... .+.. ..+.++..+ T Consensus 207 ~~~iS~ell~ds~-~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~---~~~~----~~~~~i~~~ 278 (413) T protein:vir:81 207 LTKITDEMIEDYD-FLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAV---SNKD----ELADSIYKA 278 (413) T ss_pred eehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccccccccc---cccc----hhHHHHHHH Confidence 9999999999996 5999999999999999999999999998765 89988664432211 1111 122333333 Q ss_pred Hhhhhh-hhcccceeEechhHHHHHHhhhccccccccccccccc---cccccCCeeecccceecccccccccCCceEEEE Q lcl|NC_010583. 331 RRKLGR-HGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAV---KLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIV 406 (458) Q Consensus 331 ~~~~~~-~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~---~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 406 (458) +..+.. .......|+||+.++..|++++|++|+|++....... +..+..++|||+||++++++|. +.++++ T Consensus 279 ~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~-----~~~~~g 353 (413) T protein:vir:81 279 MTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPV-----GKPVVG 353 (413) T ss_pred HHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCc-----ccEEEE Confidence 333322 2234456999999999999999999999997655433 2334557899999999999985 346778 Q ss_pred Eec-eEEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 407 YKD-NFVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 407 ~~~-~~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +++ .|.++++.++++..++ +|.+|++.||++.|+|+++.+|+||++++++++ T Consensus 354 d~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 410 (413) T protein:vir:81 354 AFRSAASVLRKGGVRIDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVAEV 410 (413) T ss_pred ecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEecCC Confidence 887 5888998888886543 578999999999999999999999999999999 No 52 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=2.5e-55 Score=319.90 Aligned_cols=380 Identities=12% Similarity=0.090 Sum_probs=244.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELF 84 (458) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~ 84 (458) |+++++.+. ++.+..+.+ +..+.++........ +...+..++++....+++.+.++ T Consensus 1 M~~l~~l~~--~~~~~~~e~---------~~~~~~~~~~~~~~~-----ee~~~~~~~~~~~~~~~~~l~~~-------- 56 (394) T protein:vir:10 1 MDKLQTLFN--EVSAKCADL---------NAQLNAKLQDENASV-----DDFQKIKDDLTAAKARRDAINDQ-------- 56 (394) T ss_pred ChHHHHHHH--HHHHHHHHH---------HHHHHHHHhhhhccH-----HHHHHHHHHHHHHHHHHHHHHHH-------- Confidence 333322222 111111111 000000000000000 00000111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcc Q lcl|NC_010583. 85 AQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGS 164 (458) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~ 164 (458) ++.+ +...+........ ... ...............++.+|..+++.+.... ...... T Consensus 57 ---i~~~-------e~~~~~~~~~~~~---~~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~--------~~~~~~ 113 (394) T protein:vir:10 57 ---IKDL-------EAENKANSDPDKP---VDN--AQPNGTDLKKKPIDAKKKAINDFIHSHGKVI--------DNAAGH 113 (394) T ss_pred ---HHHH-------HHHHHhhcchhhh---hhh--hcccccchhhhHHHHHHHHHHHHHhccchhh--------hhhhcc Confidence 1111 1000000000000 000 0000111122223345566777766543221 112234 Q ss_pred cccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecC-CCcccccccccccccccccccccccceeee Q lcl|NC_010583. 165 SSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPE-AGRATWVDASKFGTDETVGDEVKGQLTEIS 243 (458) Q Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~ 243 (458) .+++.||++||++++++|++.+++.++|+++|+++|++++...+|+... .+.+.|++|++..+ ..++++|++|+ T Consensus 114 ~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~-----~~~~~~~~~v~ 188 (394) T protein:vir:10 114 VTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENP-----ALAEPEFEQVD 188 (394) T ss_pred cccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEecCCCcccccccccccc-----ccccccceeEE Confidence 5667789999999999999999999999999999999999999998775 46678988876443 24678999999 Q ss_pred eehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHHH Q lcl|NC_010583. 244 FKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVT 323 (458) Q Consensus 244 ~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (458) +.+++++++++||+|+|+||.+++++||.+.|++++++++|.+|++|+|++.|.++.+.. + T Consensus 189 l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~~~~~-------------------~ 249 (394) T protein:vir:10 189 WSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFTAKATTTDT-------------------L 249 (394) T ss_pred eeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc-------------------c Confidence 999999999999999999999999999999999999999999999999988776543221 1 Q ss_pred HHHHHHHHh-hhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccc-cccccCCc Q lcl|NC_010583. 324 AKTISKLRR-KLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYF-PAKAASAE 401 (458) Q Consensus 324 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~-~~~~~~~~ 401 (458) ++++.+++. .+... .+++|+||+.++..|.+++|++|||+|++........+.+++|+|+||++++++ +...++.. T Consensus 250 ~d~l~~~~~~~~~~~--~~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~ 327 (394) T protein:vir:10 250 VDSLKHILNVDLDPA--YSRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQ 327 (394) T ss_pred HHHHHHHHHhhhhhh--ccCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCcccccccceeEEecccccCCCCCce Confidence 223444333 23333 357899999999999999999999999887766666677889999999987754 33344555 Q ss_pred eEEEEEece-EEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 402 FAVIVYKDN-FVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 402 ~~~~~~~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+++++++. |.++++.++++..+++.. ....|+++.|+|+++++|+||+.++++++ T Consensus 328 ~i~~gd~s~~~~~~~~~~~~v~~~~~~~-~~~~~~~~~r~d~~~~~~~ai~~~~~~~~ 384 (394) T protein:vir:10 328 KAFVGDLKRGVLFADRQQVTLAWEDSKI-YGRYLGAAFRFGVKQADSNAGYFVTNTDA 384 (394) T ss_pred EEEEeeccccEEEEeecceEEEEecccc-cceeEEEEEEeccEEeccccEEEEEeecc Confidence 677888875 788889999988766543 34568999999999999999999999888 No 53 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=4.7e-55 Score=318.44 Aligned_cols=378 Identities=12% Similarity=0.077 Sum_probs=242.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELF 84 (458) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~ 84 (458) |++|++.+. ++.++++.+ +..+.+.........+ + ..+..+++++...+++.+.+++.. T Consensus 1 meeL~~~~~--~~~~~~~e~---------~~~l~~~~~~~~~~~e----~-~~~l~~ei~~~~~~~~~l~~~~~~----- 59 (389) T protein:vir:10 1 MDKLQTLFN--DVSAKCADL---------NAQLNAKLQDENASVD----D-FQKIKDDLTAAKARRDAINDQIKA----- 59 (389) T ss_pred ChHHHHHHH--HHHHHHHHH---------HHHHHHHHHhHhhhHH----H-HHHHHHHHHHHHHHHHHHHHHHHH----- Confidence 444444333 222222211 1011110000000000 0 001111111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcc Q lcl|NC_010583. 85 AQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGS 164 (458) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~ 164 (458) .+.............. ...............++.++..+++.+... ...... T Consensus 60 -------------~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~---------~~~~~~ 111 (389) T protein:vir:10 60 -------------LEAEKPAEPKTEPKDD------GSKKGTDLSKKPIDAKKKAINDFIHSHGKV---------IDATSK 111 (389) T ss_pred -------------HHHHHHhhhhcccccc------ccccccccchhHHHHHHHHHHHHhhcchhh---------hhhhcc Confidence 1100000000000000 000011111222233455666666554321 112234 Q ss_pred cccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecC-CCcccccccccccccccccccccccceeee Q lcl|NC_010583. 165 SSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPE-AGRATWVDASKFGTDETVGDEVKGQLTEIS 243 (458) Q Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~ 243 (458) ++++.||++||+++...|++.++++++|+++|+++|++++...+|+... ...+.|++|++..+ +.++++|++|+ T Consensus 112 ~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~-----~~~~~~~~~i~ 186 (389) T protein:vir:10 112 VTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENP-----KLAEPEFNKVD 186 (389) T ss_pred cccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEecCCCcccccccccccc-----ccccccceeee Confidence 5567789999999999999999999999999999999999999998875 45557777765433 34689999999 Q ss_pred eehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHHH Q lcl|NC_010583. 244 FKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVT 323 (458) Q Consensus 244 ~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (458) +.+++++++++||+++|+||.++|++||.+.|++++++++|.+|++|+|++.|.|..+. .. T Consensus 187 ~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~~~~~-------------------~~ 247 (389) T protein:vir:10 187 WSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFTAKKTTTD-------------------TL 247 (389) T ss_pred eeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc-------------------cc Confidence 99999999999999999999999999999999999999999999999987766543221 11 Q ss_pred HHHHHHHHh-hhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccc-cccccCCc Q lcl|NC_010583. 324 AKTISKLRR-KLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYF-PAKAASAE 401 (458) Q Consensus 324 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~-~~~~~~~~ 401 (458) +.++.++++ .+.+.+ +++|+||+.++..|.+++|++|+|+|+++.......+.+++|||+||++++++ +...++.. T Consensus 248 ~d~l~~~~~~~~~~~~--~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 325 (389) T protein:vir:10 248 VDSLKHILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQ 325 (389) T ss_pred HHHHHHHHHhhhhhhh--CcEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccccccceeEEecccccCCCCCce Confidence 233444333 333333 67899999999999999999999999887766666677789999999877654 44445555 Q ss_pred eEEEEEece-EEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 402 FAVIVYKDN-FVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 402 ~~~~~~~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+++++++. |.++++.++++.++++. .....+|++.|+|+++++|+||++++++++ T Consensus 326 ~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~ 382 (389) T protein:vir:10 326 KAFVGDLKRGVLFTDRQQVTLAWEDSK-IYGKYLGAAFRFGVQKADSKAGYFVTNTDV 382 (389) T ss_pred EEEEeeccccEEEEeecceEEEeeccc-cccceEEEEEEeccEEecccceEEEEeecc Confidence 677888885 88999999999887653 344578999999999999999999998877 No 54 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=9.6e-55 Score=316.72 Aligned_cols=387 Identities=10% Similarity=0.013 Sum_probs=229.5 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) |.-.|+++++++. ++.++++.+ .++.++.+ .++..++.++..++++.+.+++.+. T Consensus 2 ~~~~l~el~~~l~--e~~~~i~~~-----~~e~~~~~------------------~~~~~~~~~~l~~eie~l~~ei~~l 56 (394) T protein:vir:97 2 FEEKIKEIKATIA--DLNNTIVTK-----TAQVKNAL------------------ESDDLEAARSIKAEVEQAKANLVEA 56 (394) T ss_pred cHHHHHHHHHHHH--HHHHHHHHH-----HHHHHHhh------------------chhhHHHHHHHHHHHHHHHHHHHHH Confidence 2223333332222 222222111 00001110 1111111222222222222222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhh Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKA 160 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a 160 (458) .+...+.......................+.+.... ..................+....+ .......... T Consensus 57 ~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~ 126 (394) T protein:vir:97 57 ENDLKLYESSVEVGGAENIGGKEVTQEEKTYRESVN----DFIRSKGKIVNDSLRFEGKDEVLM------PINETTPVEP 126 (394) T ss_pred HHHHHHHHHHhhhhccccccccccchhhHHHHHHHH----HHHHHHHHHhhhhhhhhhHHHHHH------HHHhhhhhhh Confidence 110000000000000000000000000000000000 000000000000000000000000 0011111112 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecC-CCcccccccccccccccccccccccc Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPE-AGRATWVDASKFGTDETVGDEVKGQL 239 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~e~~~~~e~~~~~~~~~~f 239 (458) .....+...||++||+++.+.|++.+++.++|+++|+++|++++...+|+... +..+.|++|++..++ .+.++| T Consensus 127 ~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~-----~~~~~~ 201 (394) T protein:vir:97 127 QKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPA-----LAKPDF 201 (394) T ss_pred hccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEecCCCccceecccccccc-----cccccc Confidence 22344566789999999999999999999999999999999999999998764 467899998865442 357899 Q ss_pred eeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchh Q lcl|NC_010583. 240 TEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGS 319 (458) Q Consensus 240 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 319 (458) +.|++.+++++++++||+++|+|+.++|++||.++|++++++++|.+|++|.|++.+.|.. T Consensus 202 ~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~~~------------------- 262 (394) T protein:vir:97 202 KDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVK------------------- 262 (394) T ss_pred eeEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc------------------- Confidence 9999999999999999999999999999999999999999999999999998876554321 Q ss_pred hHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccC Q lcl|NC_010583. 320 VLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAAS 399 (458) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~ 399 (458) .+.++.+++.... .+..++.|+||+.++..|.+++|++|+|+|++.. ..+.+++|+|+||+++++.+ .+ T Consensus 263 ---~~~~~~~~~~~~~-~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~----~~~~~~~l~G~pv~~~~~~~---~~ 331 (394) T protein:vir:97 263 ---NLDEIKALLNGGF-DPAYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDI----TAVSGKVLLGKPVFVLSDEV---LG 331 (394) T ss_pred ---cHHHHHHHHHhhh-hhhhCCEEEEcHHHHHHHHHhhccCCCeeeecCc----CCCCCceeccceeEEecccc---cC Confidence 1233444443322 2345688999999999999999999999986533 34556799999999976543 24 Q ss_pred CceEEEEEece-EEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 400 AEFAVIVYKDN-FVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 400 ~~~~~~~~~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ...+++++++. |.++++.++++..+++- .+...||++.|+|+++.+|+||++++++++ T Consensus 332 ~~~~~~gd~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 390 (394) T protein:vir:97 332 ANKAFIGDFKRGVLFADRKDLGLRWADNE-IYGQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) T ss_pred CccEEEeeccccEEEEEecceEEEEeccc-ccceeEEEEEEEccEEecccceEEEEeccc Confidence 45577888775 78999999988876543 445689999999999999999999999888 No 55 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=5.5e-54 Score=312.57 Aligned_cols=393 Identities=12% Similarity=0.091 Sum_probs=246.3 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 3 IDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDL-VSKAVGEDRKRLEEALDLVKNLDEKSKKSA 81 (458) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~ 81 (458) |..+.++-+..++++.+++++|..+.+. +.++.+.....+... .++...+..++.++...+++.+.+++.+ T Consensus 1 m~~k~~~l~~~~~el~~~l~eL~e~~~~------l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~-- 72 (397) T protein:vir:96 1 MALKQLILNKQIKERSSEIDKLLSQRSD------LEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAE-- 72 (397) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Confidence 6666555555566666666655333111 111111111111100 0000111112222222222222222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhh Q lcl|NC_010583. 82 ELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAV 161 (458) Q Consensus 82 e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~ 161 (458) ..+....+........+.........+.+.... .. .........+.++..+++.. .... T Consensus 73 --~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~------~~--~~~~~~~~~~~~~~~~~~~~-----------~~~~ 131 (397) T protein:vir:96 73 --LQKEKQDLEDELAKAADPTDQKPKDGEKRKMKK------FK--VTEEELAEKRSAINAFVKSK-----------GAEK 131 (397) T ss_pred --HHHHHHHHHHHHHhhhhhhhhhhHHHHHHHHHH------Hh--hhhHHHHHHHHHHHHHHHhh-----------hhhh Confidence 111111211111111111111110000000000 00 00000111122222222111 0111 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecC-CCcccccccccccccccccccccccce Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPE-AGRATWVDASKFGTDETVGDEVKGQLT 240 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~e~~~~~e~~~~~~~~~~f~ 240 (458) ....+...|++++|+++.+.|++ ++...+++++|+++|++++...+|+... +..++|++|++..+ ..++++|+ T Consensus 132 ~~~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~-----~~~~~~~~ 205 (397) T protein:vir:96 132 RDGFTSVEGGALIPQELLQPQLE-PKDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKNP-----QLANPKMV 205 (397) T ss_pred hhcccccccccchhHHHHHHHHH-hhhhhhHHHhhhhccccccceeEEEEeccCCcccccccccccc-----cccccccc Confidence 22445667889999999999987 5778889999999999999999998764 46678888876443 34689999 Q ss_pred eeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhh Q lcl|NC_010583. 241 EISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSV 320 (458) Q Consensus 241 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~ 320 (458) +|+++++++++++++|+++|+||.+++++||.+.|+++++.+++.+|++|+|+++|.|+.+ T Consensus 206 ~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~------------------- 266 (397) T protein:vir:96 206 EIDYSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTATAKSVVG------------------- 266 (397) T ss_pred ceeecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc------------------- Confidence 9999999999999999999999999999999999999999999999999999988766532 Q ss_pred HHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccc-cccccC Q lcl|NC_010583. 321 LVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYF-PAKAAS 399 (458) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~-~~~~~~ 399 (458) ++++.++++..... ..+++|+||+.++..|.+++|++|+|+|++.. ..+.+++|+|+||++++++ +...++ T Consensus 267 ---~d~~~~~~~~~~~~-~~~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~----~~~~~~~l~G~pv~~~~~~~~~~~~~ 338 (397) T protein:vir:96 267 ---VDGLKDLINKEIKK-VYDVKLFISASMYSELDKLKDKNGRYLLQDSI----TAASGKQLLGKEVVVLDDDVIGKSVG 338 (397) T ss_pred ---hHHHHHHHHHhhhh-hcCcEEEEcHHHHHHHHHhhccCCCeEeccCc----cCCCcccccccceEEecccccCCCCC Confidence 22344444433232 34688999999999999999999999986543 3355679999999987664 444556 Q ss_pred CceEEEEEece-EEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 400 AEFAVIVYKDN-FVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 400 ~~~~~~~~~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ...+++|+|+. |+++++.++++.++++ ..+.+.+|++.|+|+++++|+||+++++++| T Consensus 339 ~~~~~~gd~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 339 NVVGFIGDAKAFASFFDRKQVSVSWVDN-NIYGQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred ceEEEEeehhcceEeEeecceEEEEecc-cccceeEEEEEEEccEEecccceEEEEeecC Confidence 66778888885 7899999999988665 3456789999999999999999999999999 No 56 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=4.7e-54 Score=312.93 Aligned_cols=373 Identities=12% Similarity=0.069 Sum_probs=231.3 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKEE-LGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAEL 83 (458) Q Consensus 5 ~~~~~~~-~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~ 83 (458) |.++++. ..+.++.++++.+.+ ++.++.... +...|..+..++..+.++...+.++++. T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~---------~~~~~~~~~---------~~~~ee~~~~~~~~~~l~~~~~~l~~~~-- 60 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKND---------ELSQKATDP---------NIDMEDIKQLETEKAGLQQRFNIVERQV-- 60 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHH---------HHHHHHhcc---------CcCHHHHHHHHHHHHHHHHHHHHHHHHH-- Confidence 4443331 112233333222211 111111000 0000111111111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhc----cchhHHHHHHHHhh Q lcl|NC_010583. 84 FAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMME----KDVFETEHGKAHIK 159 (458) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~----~~~~~~~~~~~~~~ 159 (458) +..+.+........ .. .. ... .. .+....++..++++ .............. T Consensus 61 -----~~~e~~~~~~~~~~-------~~---~~----~~~-~~-----~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~ 115 (387) T protein:vir:93 61 -----KDIEEKEKAKVKDT-------GE---AY----QSL-ND-----HEKMVKAKAEFYRHAILPNEFEKPSMEAQRLL 115 (387) T ss_pred -----HHHHHHHHHhhhhc-------cc---cC----CCc-ch-----hhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHH Confidence 11110000000000 00 00 000 00 00000111111111 11111111222233 Q ss_pred hhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEec-CCCccccccccccccccccccccccc Q lcl|NC_010583. 160 AVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEP-EAGRATWVDASKFGTDETVGDEVKGQ 238 (458) Q Consensus 160 a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~e~~~~~e~~~~~~~~~~ 238 (458) .....++.++||++||+++.++|++.++.+++|+++|+++++++ ..+|+.. +...++|++|++. .++++++ T Consensus 116 ~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~E~~~------~~~~~~~ 187 (387) T protein:vir:93 116 HALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVET------AKELKLK 187 (387) T ss_pred HhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCC--ceEEEEeecCCccccccCccc------ccccccc Confidence 34456677889999999999999999999999999999988765 4567654 5577889988754 4556899 Q ss_pred ceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHH-HHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 239 LTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE-AFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 239 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~-~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) |++|++++++++++++||+|||+||.+++++||.++|+++++++++. .|.+|+|+++|.|++....... + T Consensus 188 f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~---v------ 258 (387) T protein:vir:93 188 GDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKE---V------ 258 (387) T ss_pred cceeeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccccc---c------ Confidence 99999999999999999999999999999999999999999999766 5667889999999986543211 1 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHH-HHhhhccccccccccccccccccccCCeeecccceeccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAY-YDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAK 396 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~ 396 (458) .....+++++++++.+.+.|+.++.|+||+.++.. +.+++|.++ +++. +.+.+|+|+||+++++++ T Consensus 259 -~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~-~~~~---------~~~~~llG~PV~~~~~~~-- 325 (387) T protein:vir:93 259 -EGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTT-NFFD---------TPAEKVFGKPVVFTDAAV-- 325 (387) T ss_pred -cccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCC-cccc---------cCCccccccceEEecCCC-- Confidence 11224677899999999999999999999998755 556666554 4431 345689999999988764 Q ss_pred ccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 397 AASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 397 ~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+++|+|+.|++. ..++.+..+.++.++++.|++..|+|+++++|+||+.+++++| T Consensus 326 -----~~~~GDf~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~~ 381 (387) T protein:vir:93 326 -----KPIVGDFNYFGIN-YDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred -----ceeeeehhhhhee-hhhheeeecccccCCceeEEEEeeeCceeechhheEEEEeecC Confidence 3577888877664 4456777777888999999999999999999999999999888 No 57 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=3e-54 Score=314.00 Aligned_cols=374 Identities=11% Similarity=0.056 Sum_probs=230.4 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKEEL-GLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAEL 83 (458) Q Consensus 5 ~~~~~~~~-~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~ 83 (458) |.++++.. .+.++.++++.+. .++.++.... +...|.-...++..+.++...+.++++. T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~---------~el~e~~~~~---------~~~~eei~~~~~~~~~l~~~~~~l~~~~-- 60 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKN---------DELSQKATDP---------NIDMEDIKQLETEKAGLQQRFNIVERQV-- 60 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHH---------HHHHHHHhcc---------CcCHHHHHHHHHHHHHHHHHHHHHHHHH-- Confidence 33333311 1123333322221 1111111100 0000110111111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhc----cchhHHHHHHHHhh Q lcl|NC_010583. 84 FAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMME----KDVFETEHGKAHIK 159 (458) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~----~~~~~~~~~~~~~~ 159 (458) +..+.+....... ..... . ..... +....++..+++. .............. T Consensus 61 -----~~~e~~~~~~~~~----------~~~~~----~-~~~~~-----~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~ 115 (387) T protein:vir:26 61 -----QDIEEKEKAKVKD----------KGEAY----Q-SLSDN-----EKMVKAKAEFYRHAILPNEFEKPSMEAQRLL 115 (387) T ss_pred -----HHHHHHHHhhhhh----------ccccC----C-CCchh-----HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHH Confidence 1110000000000 00000 0 00000 0000011111111 11111111112222 Q ss_pred hhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEec-CCCccccccccccccccccccccccc Q lcl|NC_010583. 160 AVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEP-EAGRATWVDASKFGTDETVGDEVKGQ 238 (458) Q Consensus 160 a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~e~~~~~e~~~~~~~~~~ 238 (458) .....++.++||++||+++.++|++.++.+++|+++++++++++ ..+|+.. +..+++|++|++. .++++|+ T Consensus 116 ~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~------~~~~~~~ 187 (387) T protein:vir:26 116 HALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVET------AKELKAK 187 (387) T ss_pred hhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCcccccccccc------ccccccc Confidence 33345566778999999999999999999999999999988765 4567654 4577899888754 4556899 Q ss_pred ceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHH-HHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 239 LTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE-AFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 239 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~-~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) |++|++.+++++++++||+|||+||.+++++||.++|+++++++++. .|.+|+|+++|.|++....... ++ T Consensus 188 f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~---~~----- 259 (387) T protein:vir:26 188 GDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKE---VE----- 259 (387) T ss_pred cceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccccc---cc----- Confidence 99999999999999999999999999999999999999999999765 5667889999999986543221 11 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKA 397 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 397 (458) ....+++++++++.+.+.|+.++.|+||+.++..+..+.+..|++++. +.+.+|+|+||+++++++ T Consensus 260 --~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~---------~~~~~llG~PV~~~~~~~--- 325 (387) T protein:vir:26 260 --GADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD---------TPAEKVFGKPVVFTDAAV--- 325 (387) T ss_pred --ccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc---------cCCccccccceEEecCCC--- Confidence 123467889999999999999999999999987766666666777662 335689999999998764 Q ss_pred cCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 398 ASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 398 ~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+++|+|+.|++. ..++.+..+.++..+++.|+++.|+|+++++|+||++++.+|| T Consensus 326 ----~~~~GDf~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:26 326 ----KPIVGDFNYFGIN-YDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred ----ceeeechhhhhhh-hhhhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 3577888876554 3455666666677899999999999999999999999999998 No 58 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=3e-54 Score=314.00 Aligned_cols=374 Identities=11% Similarity=0.056 Sum_probs=230.4 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKEEL-GLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAEL 83 (458) Q Consensus 5 ~~~~~~~~-~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~ 83 (458) |.++++.. .+.++.++++.+. .++.++.... +...|.-...++..+.++...+.++++. T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~---------~el~e~~~~~---------~~~~eei~~~~~~~~~l~~~~~~l~~~~-- 60 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKN---------DELSQKATDP---------NIDMEDIKQLETEKAGLQQRFNIVERQV-- 60 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHH---------HHHHHHHhcc---------CcCHHHHHHHHHHHHHHHHHHHHHHHHH-- Confidence 33333311 1123333322221 1111111100 0000110111111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhc----cchhHHHHHHHHhh Q lcl|NC_010583. 84 FAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMME----KDVFETEHGKAHIK 159 (458) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~----~~~~~~~~~~~~~~ 159 (458) +..+.+....... ..... . ..... +....++..+++. .............. T Consensus 61 -----~~~e~~~~~~~~~----------~~~~~----~-~~~~~-----~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~ 115 (387) T protein:vir:96 61 -----QDIEEKEKAKVKD----------KGEAY----Q-SLSDN-----EKMVKAKAEFYRHAILPNEFEKPSMEAQRLL 115 (387) T ss_pred -----HHHHHHHHhhhhh----------ccccC----C-CCchh-----HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHH Confidence 1110000000000 00000 0 00000 0000011111111 11111111112222 Q ss_pred hhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEec-CCCccccccccccccccccccccccc Q lcl|NC_010583. 160 AVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEP-EAGRATWVDASKFGTDETVGDEVKGQ 238 (458) Q Consensus 160 a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~e~~~~~e~~~~~~~~~~ 238 (458) .....++.++||++||+++.++|++.++.+++|+++++++++++ ..+|+.. +..+++|++|++. .++++|+ T Consensus 116 ~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~------~~~~~~~ 187 (387) T protein:vir:96 116 HALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVET------AKELKAK 187 (387) T ss_pred hhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCcccccccccc------ccccccc Confidence 33345566778999999999999999999999999999988765 4567654 4577899888754 4556899 Q ss_pred ceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHH-HHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 239 LTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE-AFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 239 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~-~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) |++|++.+++++++++||+|||+||.+++++||.++|+++++++++. .|.+|+|+++|.|++....... ++ T Consensus 188 f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~---~~----- 259 (387) T protein:vir:96 188 GDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKE---VE----- 259 (387) T ss_pred cceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccccc---cc----- Confidence 99999999999999999999999999999999999999999999765 5667889999999986543221 11 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKA 397 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 397 (458) ....+++++++++.+.+.|+.++.|+||+.++..+..+.+..|++++. +.+.+|+|+||+++++++ T Consensus 260 --~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~---------~~~~~llG~PV~~~~~~~--- 325 (387) T protein:vir:96 260 --GADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD---------TPAEKVFGKPVVFTDAAV--- 325 (387) T ss_pred --ccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc---------cCCccccccceEEecCCC--- Confidence 123467889999999999999999999999987766666666777662 335689999999998764 Q ss_pred cCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 398 ASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 398 ~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+++|+|+.|++. ..++.+..+.++..+++.|+++.|+|+++++|+||++++.+|| T Consensus 326 ----~~~~GDf~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:96 326 ----KPIVGDFNYFGIN-YDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred ----ceeeechhhhhhh-hhhhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 3577888876554 3455666666677899999999999999999999999999998 No 59 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=3e-54 Score=314.00 Aligned_cols=374 Identities=11% Similarity=0.056 Sum_probs=230.4 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 5 INKLKEEL-GLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAEL 83 (458) Q Consensus 5 ~~~~~~~~-~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~ 83 (458) |.++++.. .+.++.++++.+. .++.++.... +...|.-...++..+.++...+.++++. T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~---------~el~e~~~~~---------~~~~eei~~~~~~~~~l~~~~~~l~~~~-- 60 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKN---------DELSQKATDP---------NIDMEDIKQLETEKAGLQQRFNIVERQV-- 60 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHH---------HHHHHHHhcc---------CcCHHHHHHHHHHHHHHHHHHHHHHHHH-- Confidence 33333311 1123333322221 1111111100 0000110111111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhc----cchhHHHHHHHHhh Q lcl|NC_010583. 84 FAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMME----KDVFETEHGKAHIK 159 (458) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~----~~~~~~~~~~~~~~ 159 (458) +..+.+....... ..... . ..... +....++..+++. .............. T Consensus 61 -----~~~e~~~~~~~~~----------~~~~~----~-~~~~~-----~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~ 115 (387) T protein:vir:94 61 -----QDIEEKEKAKVKD----------KGEAY----Q-SLSDN-----EKMVKAKAEFYRHAILPNEFEKPSMEAQRLL 115 (387) T ss_pred -----HHHHHHHHhhhhh----------ccccC----C-CCchh-----HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHH Confidence 1110000000000 00000 0 00000 0000011111111 11111111112222 Q ss_pred hhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEec-CCCccccccccccccccccccccccc Q lcl|NC_010583. 160 AVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEP-EAGRATWVDASKFGTDETVGDEVKGQ 238 (458) Q Consensus 160 a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~e~~~~~e~~~~~~~~~~ 238 (458) .....++.++||++||+++.++|++.++.+++|+++++++++++ ..+|+.. +..+++|++|++. .++++|+ T Consensus 116 ~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~------~~~~~~~ 187 (387) T protein:vir:94 116 HALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVET------AKELKAK 187 (387) T ss_pred hhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCcccccccccc------ccccccc Confidence 33345566778999999999999999999999999999988765 4567654 4577899888754 4556899 Q ss_pred ceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHH-HHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 239 LTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE-AFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 239 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~-~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) |++|++.+++++++++||+|||+||.+++++||.++|+++++++++. .|.+|+|+++|.|++....... ++ T Consensus 188 f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~---~~----- 259 (387) T protein:vir:94 188 GDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKE---VE----- 259 (387) T ss_pred cceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccccc---cc----- Confidence 99999999999999999999999999999999999999999999765 5667889999999986543221 11 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKA 397 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 397 (458) ....+++++++++.+.+.|+.++.|+||+.++..+..+.+..|++++. +.+.+|+|+||+++++++ T Consensus 260 --~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~---------~~~~~llG~PV~~~~~~~--- 325 (387) T protein:vir:94 260 --GADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD---------TPAEKVFGKPVVFTDAAV--- 325 (387) T ss_pred --ccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc---------cCCccccccceEEecCCC--- Confidence 123467889999999999999999999999987766666666777662 335689999999998764 Q ss_pred cCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 398 ASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 398 ~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+++|+|+.|++. ..++.+..+.++..+++.|+++.|+|+++++|+||++++.+|| T Consensus 326 ----~~~~GDf~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:94 326 ----KPIVGDFNYFGIN-YDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred ----ceeeechhhhhhh-hhhhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 3577888876554 3455666666677899999999999999999999999999998 No 60 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=2.4e-55 Score=320.01 Aligned_cols=299 Identities=14% Similarity=0.079 Sum_probs=242.0 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCccccccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDA 222 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e 222 (458) +. .. ..++....++.. +|.++|++++++|++.+++.++|+++++++|++++..+||+..+.+.++|++| T Consensus 1 m~-----~~-----~~~a~~~~~t~~-~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E 69 (330) T protein:vir:77 1 MA-----GS-----TVPSTQVALTGD-FSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVSASWTGE 69 (330) T ss_pred Cc-----cc-----ccchhhccccCC-CcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceeEecC Confidence 10 01 111222233334 44466667889999999999999999999999999999999999999999988 Q ss_pred ccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCc-cccccc Q lcl|NC_010583. 223 SKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQ-PKGLLK 301 (458) Q Consensus 223 ~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~-p~Gi~~ 301 (458) ++. +++++++|++|++.++|++++++||+|+|+|+.++++++|.++|++++++++|.+||+|+|+++ |.||++ T Consensus 70 g~~------~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~ 143 (330) T protein:vir:77 70 AER------KPITKGSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLA 143 (330) T ss_pred CCc------cccccceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccc Confidence 754 4567899999999999999999999999999999999999999999999999999999999865 578887 Q ss_pred cccccccc-eeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccc-cccccC Q lcl|NC_010583. 302 LAADDGAK-VVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAV-KLQGQV 379 (458) Q Consensus 302 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~-~~~~~~ 379 (458) ........ ..............+.++.+++..+...+..++.|+||+.++..|++++|.+|+|+|+...... +....+ T Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~ 223 (330) T protein:vir:77 144 ETTKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIRE 223 (330) T ss_pred cccccceeecccccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCC Confidence 65432221 1222333444556677888888888888889999999999999999999999999997655433 333456 Q ss_pred CeeecccceecccccccccC-CceEEEEEeceEEEEecceeEEeecc--------------------cccCCceEEEEEE Q lcl|NC_010583. 380 GRIYGLPVVVSEYFPAKAAS-AEFAVIVYKDNFVMPRQRAVTVERER--------------------QAGKQRDAYYVTQ 438 (458) Q Consensus 380 ~~l~G~pv~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~~i~~~~--------------------~~~~~~~~~~~~~ 438 (458) ++|+|+||++++++|++.++ ...+++++++.|.++++.++++..++ .|.+|++.||++. T Consensus 224 ~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~ 303 (330) T protein:vir:77 224 GRILGRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEA 303 (330) T ss_pred ceecceeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEE Confidence 79999999999999976543 45567789999999999988776432 1678999999999 Q ss_pred eeccEEecccceEEEEeecC Q lcl|NC_010583. 439 RVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 439 r~d~~~~~~~afv~l~~aaa 458 (458) |+|+++.+|+||++++.+++ T Consensus 304 r~d~~v~~~~a~~~i~~~~~ 323 (330) T protein:vir:77 304 EFAFMVNDKDAFVKLTDQVA 323 (330) T ss_pred EeccEEecccceEEEEeccC Confidence 99999999999999999999 No 61 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=3.1e-55 Score=319.40 Aligned_cols=305 Identities=13% Similarity=0.089 Sum_probs=235.5 Q ss_pred hhccchhHHHHH-HHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccc Q lcl|NC_010583. 143 MMEKDVFETEHG-KAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVD 221 (458) Q Consensus 143 ~~~~~~~~~~~~-~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~ 221 (458) +.-..--..++. ....++.+ .++.++|+ ++|++++++|++.+++.++|+++++++|++++..++|+.++++.++|++ T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~-~~~~~~g~-~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~ 78 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQ-TGDSMFEG-YLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGDVSASWIG 78 (326) T ss_pred CCCCccchhhhcCcchhhhee-ccccCCcc-eechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCCcceEEec Confidence 000000001111 11112222 23333344 6899999999999999999999999999999999999999999999998 Q ss_pred cccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccc Q lcl|NC_010583. 222 ASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLK 301 (458) Q Consensus 222 e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~ 301 (458) |++ .+++++++|+++++.++|++++++||+|+++||.+++++||.++|++++++++|.++|+|+|+++|.||++ T Consensus 79 Eg~------~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~ 152 (326) T protein:vir:42 79 EGD------MKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQ 152 (326) T ss_pred CCc------cccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccc Confidence 875 44567899999999999999999999999999999999999999999999999999999999999999997 Q ss_pred cccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccccccccccccc-ccccccCC Q lcl|NC_010583. 302 LAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDA-VKLQGQVG 380 (458) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~-~~~~~~~~ 380 (458) .........................+......+...+..++.|+||+.++..|++++|++|+|+|+..... .+.....+ T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~ 232 (326) T protein:vir:42 153 TTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLG 232 (326) T ss_pred cccccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCc Confidence 66543333222222222222222234455556667788889999999999999999999999999765433 22333456 Q ss_pred eeecccceecccccccccCCceEEEEEeceEEEEecceeEEeecc--c--------------ccCCceEEEEEEeeccEE Q lcl|NC_010583. 381 RIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERER--Q--------------AGKQRDAYYVTQRVNLQR 444 (458) Q Consensus 381 ~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~--~--------------~~~~~~~~~~~~r~d~~~ 444 (458) +++|+||++++++|+ +....++++++.+.++.+.++.+..++ + |.+|++.||++.|+|+++ T Consensus 233 ~l~G~pv~~~~~~~~---~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v 309 (326) T protein:vir:42 233 RIVARPTILSDHVAS---GTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHC 309 (326) T ss_pred eeeeeeEEEcCCCCC---CceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEE Confidence 899999999999985 445567789998889999888776432 1 668999999999999999 Q ss_pred ecccceEEEEeecC Q lcl|NC_010583. 445 YFENGVVSGAYAAA 458 (458) Q Consensus 445 ~~~~afv~l~~aaa 458 (458) .+|+||++|+.+++ T Consensus 310 ~~~~a~~~l~~~~~ 323 (326) T protein:vir:42 310 NDKDAFVKLTNVDA 323 (326) T ss_pred ecccceEEEeeccc Confidence 99999999999998 No 62 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=7.8e-55 Score=317.24 Aligned_cols=340 Identities=14% Similarity=0.112 Sum_probs=237.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHH-------HHHHhhh Q lcl|NC_010583. 88 VEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEH-------GKAHIKA 160 (458) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~-------~~~~~~a 160 (458) +....+... ++. +.+ .....+.. ..+..+....+........++....... ......+ T Consensus 1 ~a~~~a~~~--~~~--------~~~---~~~~~~~~--~~~~kg~~~~~~~~a~a~~~g~~~~a~~~a~~~~~~~~~~~a 65 (366) T protein:vir:57 1 MAAAVAVPV--KAH--------SVA---PGIIIKEE--LQQYKGAGMTRMVMSIAAGKGNLADAAKFAATELGDTGLSMA 65 (366) T ss_pred Ccccccccc--ccc--------ccc---cccccccc--cccccchhHHHHHHHHHhcccchhHHHHHHHHhhcchhhhhh Confidence 000000000 000 000 00000000 0000000000000000000010000000 0001112 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhccchhhh-cceeeeccCceEEEEecCCCcccccccccccccccccccccccc Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGAL-FDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQL 239 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f 239 (458) .+++++.||++||+++.++|++.+++.++++++ ++++|+.++..++|+.++++.++|++|++. +++++++| T Consensus 66 --~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~~~a~wv~E~~~------~~~s~~~f 137 (366) T protein:vir:57 66 --ISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGGATAGYVGEGKD------VVATGATF 137 (366) T ss_pred --ccccccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCCcceeeeccCcc------ccccccce Confidence 233455689999999999999999999999998 789999999999999999999999998754 45678999 Q ss_pred eeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-Cccccccccccccccceee-ccccc Q lcl|NC_010583. 240 TEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGT-GQPKGLLKLAADDGAKVVT-EAKAD 317 (458) Q Consensus 240 ~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~-~~p~Gi~~~~~~~~~~~~~-~~~~~ 317 (458) ++|++.++|++++++||+|+|+||.++++++|+++|++++++++|.+||+|+|+ ++|+||++.+......... ....+ T Consensus 138 ~~i~~~~~k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~ 217 (366) T protein:vir:57 138 DDVKLSAKTMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAIN 217 (366) T ss_pred eEEEEeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccc Confidence 999999999999999999999999999999999999999999999999999997 5899999877654433322 22223 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccccccc- Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAK- 396 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~- 396 (458) ........+++.........++.++.|+||+.++..|++++|++|+|+|+.. ..++|+|+||++++++|+. T Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~~~~--------~~g~l~G~Pvv~s~~ip~~~ 289 (366) T protein:vir:57 218 LTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRDGNGNKVYPEM--------SQGILKGYPIQRTSAIPANL 289 (366) T ss_pred hhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhccCCceeccCC--------CCCeecceeeEEcccccccc Confidence 3333334445555556666677889999999999999999999999998421 2368999999999999963 Q ss_pred --ccCCceEEEEEeceEEEEecceeEEeeccc-------------ccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 397 --AASAEFAVIVYKDNFVMPRQRAVTVERERQ-------------AGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 397 --~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~-------------~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..+...+++++++.|.|+++.+++++.+++ |.+|++.||++.|+|+++.||+||++++-..= T Consensus 290 ~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 290 GDDGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred ccCCCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 234456778999999999999999876543 45899999999999999999999999875555 No 63 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=5.5e-54 Score=312.57 Aligned_cols=422 Identities=15% Similarity=0.148 Sum_probs=238.3 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAA----QKAAEAKRLREEQEEKELARMNDLVSKAVGE-------DRKRLEEALDL 69 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~----~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e-------~~~~~~~~~~~ 69 (458) +.+....++...... ...+..... ....+..+.+.+.+.+..++++.+..+...+ +.+++++...+ T Consensus 165 ~~~~~~~~~~~~~~~---~~~~~~~~~e~~~~~~~e~i~~l~~~ra~~~~~~~~l~~~a~~~g~~l~aee~~~~d~l~ae 241 (645) T protein:vir:93 165 NRKPVVKIASSAGAA---AQSTTVFHKEKTIMNIGEQIKSFENKRAALAASLEEVMTKAAEEGRTLDVEEEEHYDNTAAE 241 (645) T ss_pred hhcchhhhhhhhcch---hhccccccccccccchhhhhhhhhHHHHHHHHHhhhhhhhHhhhccccCHHHHHHHHHHHHH Confidence 111111111111000 000000000 0111223445555666666666655544332 22333333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH----HHHHHHhhhhhhhhh---hhhcchhhhhhHHHHHHHHH Q lcl|NC_010583. 70 VKNLDEKSKKSAELFAQTVEKQQETIVGL-QDEIKSLL----AAREGRSFVGDSVAK---ALYGTQDAFEDEVEKLVLLS 141 (458) Q Consensus 70 i~~~~e~~~~~~e~~~~~~~~~~~~~~~~-~~~~~~~~----~~~e~~~~~~~~~~~---~~~~~~~~~~~~~~~~a~~~ 141 (458) ++++.+.+.+..+.......... ..... ........ ...+.....+..+.+ ........ .......+ .. T Consensus 242 i~~l~~~i~r~e~~e~~~a~~a~-pv~~~~~~~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~-~~~a~e~a-~~ 318 (645) T protein:vir:93 242 IRQVDAHLKRLRELEAGKAATAQ-PVKQAGNGNVAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGV-RSEALEVA-RR 318 (645) T ss_pred HHHHHHHHHHHHHHHHHHHhccc-ccccccccccccccccccccchhhhhhhhhHHHHHHHHHhcccc-hhHHHHHH-Hh Confidence 33333333222111000000000 00000 00000000 000000000000000 00000000 00000000 00 Q ss_pred hhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcce-eeec---cCceEEEEecCCCcc Q lcl|NC_010583. 142 YMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDE-LPMS---SKILTMLVEPEAGRA 217 (458) Q Consensus 142 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~-~~~~---~~~~~~p~~~~~~~a 217 (458) ...++.......... ..+....++..+|++++|+++..+||+.+++.+++++++.. ++.. .+..++|+.++++.+ T Consensus 319 ~~~~~~~~~~~~~~a-~~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a 397 (645) T protein:vir:93 319 QYPDDSRLHHVLKSA-VGAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAA 397 (645) T ss_pred hcccchhhhhhhhhh-hhccccccccccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcce Confidence 000000000001111 11223344556789999999999999999999999999754 3322 346799999999999 Q ss_pred cccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC--- Q lcl|NC_010583. 218 TWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTG--- 294 (458) Q Consensus 218 ~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~--- 294 (458) +|++|++ .+++++++|++|+++++|++++++||+|||+||.+++++||+.+|++++++++|.+||+|+|++ T Consensus 398 ~wv~Eg~------~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~ 471 (645) T protein:vir:93 398 GWVGEGK------TKPLTKFDFESITFSHAKVSAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVAD 471 (645) T ss_pred EEeccCc------cccccccceeEEEEeeEEEEEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCC Confidence 9999975 4556789999999999999999999999999999999999999999999999999999998764 Q ss_pred -ccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhh--hcccceeEechhHHHHHHhhhccccccccccccc Q lcl|NC_010583. 295 -QPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRH--GLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGND 371 (458) Q Consensus 295 -~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~ 371 (458) .|.|++...... ..... ...++..++..+... ...+++|+|||.++..|.+++|++|++++.. . T Consensus 472 ~~p~gi~~~~~~~------~~~~~-----~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~~~-~- 538 (645) T protein:vir:93 472 VSPASITHDVKGT------ASSGN-----PDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNALGQKEYPD-M- 538 (645) T ss_pred ccccceecccccc------ccccc-----hHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccCCceeecC-C- Confidence 588876532211 11111 112334444333322 3345789999999999999999999998732 1 Q ss_pred cccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeecc------------------------cc Q lcl|NC_010583. 372 AVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERER------------------------QA 427 (458) Q Consensus 372 ~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~------------------------~~ 427 (458) ....++|+|+||++++++|+ .+++++++.+.++.+.++.+..+. .| T Consensus 539 ----~~~~~tL~G~PV~~s~~vp~------~~~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf 608 (645) T protein:vir:93 539 ----TLLGGSFQGLPVIVSQYVGD------QLVLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMF 608 (645) T ss_pred ----CCCCceeeceeeEEeccCCc------ceeEeccccEEEEEecceEEEeecceeEEEeecccccccccccccchhHh Confidence 12236899999999999985 245677777778877777665421 16 Q ss_pred cCCceEEEEEEeeccEEecccceEEEEee---cC Q lcl|NC_010583. 428 GKQRDAYYVTQRVNLQRYFENGVVSGAYA---AA 458 (458) Q Consensus 428 ~~~~~~~~~~~r~d~~~~~~~afv~l~~a---aa 458 (458) .+|+++||++.|+|+++++|+||++++-+ +| T Consensus 609 ~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~~ 642 (645) T protein:vir:93 609 QTGSVAIRAERWINWRRRRTAAVAVITGVNYGSA 642 (645) T ss_pred hcCceEEEEEEEEcceeeCccceEEEecccCCcc Confidence 78999999999999999999999998732 22 No 64 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=9.2e-54 Score=311.35 Aligned_cols=388 Identities=10% Similarity=0.062 Sum_probs=229.4 Q ss_pred HHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 8 LKEELGLGDLA--KSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFA 85 (458) Q Consensus 8 ~~~~~~~~~~~--~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~ 85 (458) .++-...+++. .++++|... .+.+.++.++.++..++++.+..... ...+.+.+++...+.++...+... T Consensus 1 ~~~~~~~~~~~~g~~mk~l~el---~~~~~e~~~~~~~~~~el~~~~~~~~-----~~~ee~~~~~~~~~~l~~~~~~l~ 72 (402) T protein:vir:93 1 MRNFKNDNELLGGNEMPTLYEL---KQSLGMIGQQLKNKNDELSQKATDPN-----IDMEDIKQLETEKAGLQQRFNIVE 72 (402) T ss_pred CcchhhhhhcCCCCCChHHHHH---HHHHHHHHHHHHHHHHHHHHHHhccC-----cCHHHHHHHHHHHHHHHHHHHHHH Confidence 22222222222 222322211 12222222222221111111111000 000111111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhc----cchhHHHHHHHHhhhh Q lcl|NC_010583. 86 QTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMME----KDVFETEHGKAHIKAV 161 (458) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~----~~~~~~~~~~~~~~a~ 161 (458) ++++.. +.+.+....... .... ... ..+....++..+++. ................ T Consensus 73 ~~~~~~-------e~~~~~~~~~~~----------~~~~-~~~--~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a 132 (402) T protein:vir:93 73 RQVQDI-------EEKEKAKVKDKG----------EAYQ-SLS--DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHA 132 (402) T ss_pred HHHHHH-------HHHHHhhhhhcc----------ccCC-CCc--hhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhh Confidence 111111 111000000000 0000 000 000000111111111 1111111111222233 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEec-CCCcccccccccccccccccccccccce Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEP-EAGRATWVDASKFGTDETVGDEVKGQLT 240 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~e~~~~~e~~~~~~~~~~f~ 240 (458) ...++.++||++||+++.++|++.++.+++|+++|+++++++ ..+|+.. +.+++.|++|++. .++++|+|+ T Consensus 133 ~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~Eg~~------~~~~~~~f~ 204 (402) T protein:vir:93 133 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVET------AKELKAKGD 204 (402) T ss_pred hccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCC--ceeeeeeccCCcccccccccc------ccccccccc Confidence 345566778999999999999999999999999999988764 4567654 4577889988754 455689999 Q ss_pred eeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHH-HHhccCCCCccccccccccccccceeeccccchh Q lcl|NC_010583. 241 EISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE-AFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGS 319 (458) Q Consensus 241 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~-~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 319 (458) +|++.+++++++++||+|+|+||.+++++||.++|+++++++++. .|.+|+|+++|.|++....... ++ T Consensus 205 ~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~---~~------- 274 (402) T protein:vir:93 205 TVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKE---VE------- 274 (402) T ss_pred eeeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccccc---cc------- Confidence 999999999999999999999999999999999999999999765 5677889999999986543221 11 Q ss_pred hHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccC Q lcl|NC_010583. 320 VLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAAS 399 (458) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~ 399 (458) ....+++++++++++.+.|+.++.|+||+.++..+..+++..|++++. +.|.+|+|+||++++.++ T Consensus 275 ~~~~~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~~~---------~~~~~llG~PV~~t~~~~----- 340 (402) T protein:vir:93 275 GADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD---------TPAEKVFGKPVVFTDAAV----- 340 (402) T ss_pred ccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc---------cCCccccccceEEecCCC----- Confidence 112356788999999999999999999999987665555555666652 345689999999998764 Q ss_pred CceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 400 AEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 400 ~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+++|||+.|++.. ..+.+..+.+..++++.|++..|+|+++++|+||+.|+++++ T Consensus 341 --~i~~GDf~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~~ 396 (402) T protein:vir:93 341 --KPIVGDFNYFGINY-DGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 396 (402) T ss_pred --ceeeechhhhhhhh-hhhhhhhhhcccCCceEEEEEEEeCcEEechhheEEEEeecC Confidence 35678887654432 223333334455799999999999999999999999999888 No 65 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=5.4e-55 Score=318.12 Aligned_cols=284 Identities=12% Similarity=-0.014 Sum_probs=227.5 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCccccccccccccccccccccccccee Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTE 241 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~ 241 (458) +..++++.||+++|++++.+||+.+++.++|+++++++|++++..+||+..+++.++|++|++ .+++++++|++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~------~~~~s~~~f~~ 74 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGE------VKPSASVDVSA 74 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeCCc------cccccccceee Confidence 445566779999999999999999999999999999999999999999999999999999875 44567899999 Q ss_pred eeeehhheeeeehhhHHHHhccHHH----HHHHHHHHHHHHHHHHHHHHHhccCCC--C-ccccccccccccccceeecc Q lcl|NC_010583. 242 ISFKTYKLAAKSFITDETEEDAIFS----LLPLLRKRLIEAHAVSIEEAFMSGNGT--G-QPKGLLKLAADDGAKVVTEA 314 (458) Q Consensus 242 v~~~~~k~~~~~~is~ell~ds~~~----~~~~i~~~la~~~~~~~d~~~l~G~g~--~-~p~Gi~~~~~~~~~~~~~~~ 314 (458) |++.++|++++++||+|+++++..+ |+++|.++|++++++++|.++|+|+|. + .+.|+.+..... ....... T Consensus 75 v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~-~~~~~~~ 153 (315) T protein:vir:80 75 FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKT-KNIVDAT 153 (315) T ss_pred eEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccc-cceeecc Confidence 9999999999999999999988765 789999999999999999999999864 3 244544433221 1111111 Q ss_pred ccchhhHHHHHHHHHHHhhhhh-hhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccc Q lcl|NC_010583. 315 KADGSVLVTAKTISKLRRKLGR-HGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYF 393 (458) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~ 393 (458) + ..+.++.+++..+.. .+..+.+|+||+.++..|+++++.+|++++..........+.+++|+|+||+++++| T Consensus 154 ~------~~~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~ 227 (315) T protein:vir:80 154 D------SATADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTV 227 (315) T ss_pred c------cchHHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceeeEecCcC Confidence 1 123445555555433 345566899999999999999998888766544444445566789999999999999 Q ss_pred ccccc----CCceEEEEEeceEEEEecceeEEeeccc----------ccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 394 PAKAA----SAEFAVIVYKDNFVMPRQRAVTVERERQ----------AGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 394 ~~~~~----~~~~~~~~~~~~~~i~~~~~~~i~~~~~----------~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |.... ....++++||+++.++.+.+++++.+++ |.+|++.||++.|+|+++++|+||++|+.++| T Consensus 228 ~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a 306 (315) T protein:vir:80 228 SGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) T ss_pred CcccccccccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccC Confidence 86432 2345677899999888888888765443 77999999999999999999999999999998 No 66 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=6.9e-55 Score=317.53 Aligned_cols=282 Identities=15% Similarity=0.137 Sum_probs=237.7 Q ss_pred HHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccc Q lcl|NC_010583. 156 AHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEV 235 (458) Q Consensus 156 ~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~ 235 (458) ....+ ....+.+.|+.+||++++++|++.+++.++|+++++++|++++..++|+.. .+.++|++|++. ++++ T Consensus 1 ~g~~a-~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~------~~~~ 72 (299) T protein:vir:41 1 MGFNP-DTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMS-GVGAFWVDEAER------IQTS 72 (299) T ss_pred CCcCC-CcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEc-CCceeeeecCcc------cccc Confidence 11112 223445566779999999999999999999999999999999999999876 477899988754 4566 Q ss_pred cccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccc Q lcl|NC_010583. 236 KGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAK 315 (458) Q Consensus 236 ~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 315 (458) +++|++|++.+++++++++||+|+++||.++++++|.+.|++++++++|.++|+|+|+++|.||++.+....... T Consensus 73 ~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~----- 147 (299) T protein:vir:41 73 KPTFTKAKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLV----- 147 (299) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceee----- Confidence 899999999999999999999999999999999999999999999999999999999999999998654332111 Q ss_pred cchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccc Q lcl|NC_010583. 316 ADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPA 395 (458) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~ 395 (458) ......++++.+++..+...++.++.|+||+.++..|++++|.+|+|++++... +..++|+|+||++++++|. T Consensus 148 --~~~~~~~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~-----~~~~~l~G~PV~~~~~~~~ 220 (299) T protein:vir:41 148 --EETANKYDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATS-----NGVDDVLGLPIAYTPKYTF 220 (299) T ss_pred --ccccccHHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcC-----CCCceecceeeEEecccCC Confidence 122345678888999999999999999999999999999999999999865432 2235899999999999996 Q ss_pred cccCCceEEEEEeceEEEEecceeEEeecc----------------cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 396 KAASAEFAVIVYKDNFVMPRQRAVTVERER----------------QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 396 ~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~----------------~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) + ++...+++++++.+.++.+.++++..++ .|.+|++.||++.|+|+++.+|+||++++.+|| T Consensus 221 ~-~~~~~~~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa 298 (299) T protein:vir:41 221 G-DKDISELVGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAG 298 (299) T ss_pred C-CCceEEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 4 3455677889999999999888876533 257899999999999999999999999999999 No 67 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=1.1e-52 Score=305.53 Aligned_cols=373 Identities=12% Similarity=0.080 Sum_probs=236.7 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 3 IDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAE 82 (458) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e 82 (458) |++.++++++. ++.++++....+ ...+ .....+.+. .+...+.+...++...++..+.+.+. T Consensus 1 m~~~e~~~~~~--~~~~~l~~~~~~-----~~~e----~~~~~e~~~---~~~~~~~~~~~~e~~~~~~~l~~~~~---- 62 (379) T protein:vir:10 1 MEALEIKVALE--AIKGQVDSKSSA-----QALE----VKGLIEALE---AKMTSEKDLAVNELKSDMAALQAHAD---- 62 (379) T ss_pred CCHHHHHHHHH--HHHHHHHHHHHH-----HHHH----HHHHHHHHH---hHhhHHHHHHHHHHHHHHHHHHHHHH---- Confidence 66666555544 333333211100 0000 000000000 00111111111111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhh Q lcl|NC_010583. 83 LFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVN 162 (458) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~ 162 (458) .+............. .............+....+... ........ T Consensus 63 --------------~~e~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~ 107 (379) T protein:vir:10 63 --------------KLDVKLKEKAKSEDK--------------SDSLVKSITENFNDIKEVRNGK-------SIQVKAVG 107 (379) T ss_pred --------------HHHHHHHhccccccc--------------chhHHHHHHHHHHhHHHHHhhh-------hhhhhhhc Confidence 000000000000000 0000000000000111111100 00011111 Q ss_pred cccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCC--Ccccccccccccccccccccccccce Q lcl|NC_010583. 163 GSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEA--GRATWVDASKFGTDETVGDEVKGQLT 240 (458) Q Consensus 163 ~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~--~~a~~v~e~~~~~e~~~~~~~~~~f~ 240 (458) ..++++.++.+||+.+...|++.++..++|+++|++++++++.+.||+.++. +.+.|++|++ .+++++++|+ T Consensus 108 ~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~------~~~~~~~~f~ 181 (379) T protein:vir:10 108 DMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRENGAGEGAIGAQVEGA------TKGQKDYDIS 181 (379) T ss_pred ccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEeecCCCcccccccCCc------ccccccccee Confidence 2233344455789999999999999999999999999999999999998754 3456676654 5566789999 Q ss_pred eeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhh Q lcl|NC_010583. 241 EISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSV 320 (458) Q Consensus 241 ~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~ 320 (458) +|++.++|++++++||+++|+|++ ++.+||.++|++++++++|.+|+.|+|++.+.+..... + T Consensus 182 ~i~~~~~k~~~~~~iS~ell~D~~-~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~~~~-----------~----- 244 (379) T protein:vir:10 182 MIDVNTDFIAGFTRYSKKMANNLP-FLTSFIPNALRRDYAKAENAAFNAVLAANATASTEIIT-----------N----- 244 (379) T ss_pred eeEeeeeeEEeeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHhccccccccccccccc-----------C----- Confidence 999999999999999999999986 69999999999999999999999998876544432211 1 Q ss_pred HHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCC Q lcl|NC_010583. 321 LVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASA 400 (458) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~ 400 (458) ...++++.++++.+...++.++.|+|||.++..|++++|++|+|+++++... ..+.+.+|||+||++++.||.+ T Consensus 245 ~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~--~~~~~~~l~G~pvv~s~~~~ag---- 318 (379) T protein:vir:10 245 KNKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVT--QDNGVLRINGIPLFRATWLAAN---- 318 (379) T ss_pred cccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccC--CCCCcceecceeeEecCCCCCC---- Confidence 1123456777777888888999999999999999999999999999765432 2344568999999999999852 Q ss_pred ceEEEEEeceEEEEecceeEEee--c--ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 401 EFAVIVYKDNFVMPRQRAVTVER--E--RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 401 ~~~~~~~~~~~~i~~~~~~~i~~--~--~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+++++++.+.+..+.++++.. + .+|.+|++.||++.|+|+.|++|+|||++++++= T Consensus 319 -~~~~gdf~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 319 -KYYVGDWTRVTKVTTEGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred -ceEEeecccEEEEEEeceEEEEeecccccccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 4678899988888777766653 3 3689999999999999999999999999999999 No 68 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=8.6e-55 Score=317.00 Aligned_cols=286 Identities=11% Similarity=0.036 Sum_probs=238.2 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCccccccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDA 222 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e 222 (458) +.. ........++++.||.+||+++.++|++.+++.++|+++++++|++++..+||+..+.+.+.|++| T Consensus 1 ma~-----------~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E 69 (304) T protein:vir:10 1 MAT-----------PTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSE 69 (304) T ss_pred Ccc-----------cccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeec Confidence 111 111112244556778999999999999999999999999999999999999999999999999988 Q ss_pred ccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccc Q lcl|NC_010583. 223 SKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKL 302 (458) Q Consensus 223 ~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~ 302 (458) ++. +++++++|++|++.++|++++++||+|+++||.+++++||.++|++++++++|.++|+|+|+++|.|+... T Consensus 70 ~~~------~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~ 143 (304) T protein:vir:10 70 TER------IQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGK 143 (304) T ss_pred Ccc------cccccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccc Confidence 764 44568999999999999999999999999999999999999999999999999999999999988877654 Q ss_pred ccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCee Q lcl|NC_010583. 303 AADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRI 382 (458) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l 382 (458) ......... ..........+.++.++...+...+..++.|+||+.++..|++++|++|+|+++.. +++| T Consensus 144 ~~~~~~~~~--~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~---------~~~l 212 (304) T protein:vir:10 144 PLVEGAEEK--GNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN---------GNEI 212 (304) T ss_pred ccccccccc--ccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC---------Cccc Confidence 332222111 11112334567888999999999999999999999999999999999999998432 3689 Q ss_pred ecccceecccccccccCCceEEEEEeceEEEEecceeEEeecc------------------cccCCceEEEEEEeeccEE Q lcl|NC_010583. 383 YGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERER------------------QAGKQRDAYYVTQRVNLQR 444 (458) Q Consensus 383 ~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~------------------~~~~~~~~~~~~~r~d~~~ 444 (458) +|+||++++++|... +...+++++++++.++.+.++++..+. +|.+|++.||++.|+|+++ T Consensus 213 ~G~PV~~~~~~~~~~-~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v 291 (304) T protein:vir:10 213 MGLPLSYTGADVYDK-KKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMN 291 (304) T ss_pred cceeeEEecccccCC-CCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEe Confidence 999999999999643 345677889999999999988876532 3789999999999999999 Q ss_pred ecccceEEEEeec Q lcl|NC_010583. 445 YFENGVVSGAYAA 457 (458) Q Consensus 445 ~~~~afv~l~~aa 457 (458) ++|+||++||.|- T Consensus 292 ~~~~a~~~l~~a~ 304 (304) T protein:vir:10 292 VKPEAFATLKPTE 304 (304) T ss_pred ecccceEEEEecC Confidence 9999999999999 No 69 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=8.6e-55 Score=317.00 Aligned_cols=286 Identities=11% Similarity=0.036 Sum_probs=238.2 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCccccccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDA 222 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e 222 (458) +.. ........++++.||.+||+++.++|++.+++.++|+++++++|++++..+||+..+.+.+.|++| T Consensus 1 ma~-----------~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E 69 (304) T protein:vir:94 1 MAT-----------PTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSE 69 (304) T ss_pred Ccc-----------cccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeec Confidence 111 111112244556778999999999999999999999999999999999999999999999999988 Q ss_pred ccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccc Q lcl|NC_010583. 223 SKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKL 302 (458) Q Consensus 223 ~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~ 302 (458) ++. +++++++|++|++.++|++++++||+|+++||.+++++||.++|++++++++|.++|+|+|+++|.|+... T Consensus 70 ~~~------~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~ 143 (304) T protein:vir:94 70 TER------IQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGK 143 (304) T ss_pred Ccc------cccccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccc Confidence 764 44568999999999999999999999999999999999999999999999999999999999988877654 Q ss_pred ccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCee Q lcl|NC_010583. 303 AADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRI 382 (458) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l 382 (458) ......... ..........+.++.++...+...+..++.|+||+.++..|++++|++|+|+++.. +++| T Consensus 144 ~~~~~~~~~--~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~---------~~~l 212 (304) T protein:vir:94 144 PLVEGAEEK--GNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN---------GNEI 212 (304) T ss_pred ccccccccc--ccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC---------Cccc Confidence 332222111 11112334567888999999999999999999999999999999999999998432 3689 Q ss_pred ecccceecccccccccCCceEEEEEeceEEEEecceeEEeecc------------------cccCCceEEEEEEeeccEE Q lcl|NC_010583. 383 YGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERER------------------QAGKQRDAYYVTQRVNLQR 444 (458) Q Consensus 383 ~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~------------------~~~~~~~~~~~~~r~d~~~ 444 (458) +|+||++++++|... +...+++++++++.++.+.++++..+. +|.+|++.||++.|+|+++ T Consensus 213 ~G~PV~~~~~~~~~~-~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v 291 (304) T protein:vir:94 213 MGLPLSYTGADVYDK-KKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMN 291 (304) T ss_pred cceeeEEecccccCC-CCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEe Confidence 999999999999643 345677889999999999988876532 3789999999999999999 Q ss_pred ecccceEEEEeec Q lcl|NC_010583. 445 YFENGVVSGAYAA 457 (458) Q Consensus 445 ~~~~afv~l~~aa 457 (458) ++|+||++||.|- T Consensus 292 ~~~~a~~~l~~a~ 304 (304) T protein:vir:94 292 VKPEAFATLKPTE 304 (304) T ss_pred ecccceEEEEecC Confidence 9999999999999 No 70 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=2.4e-54 Score=314.50 Aligned_cols=308 Identities=11% Similarity=-0.000 Sum_probs=239.6 Q ss_pred HHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCc Q lcl|NC_010583. 137 LVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGR 216 (458) Q Consensus 137 ~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~ 216 (458) .+..+.+ +.............+.++.++|+++.++|++.+++.++|+++++++|++++..++|+.++.+. T Consensus 1 ~a~l~el----------~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~ 70 (333) T protein:vir:78 1 MATLNEL----------LPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPE 70 (333) T ss_pred CchhHHh----------hhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCce Confidence 1111111 111000111122233445589999999999999999999999999999999999999999999 Q ss_pred cccccccc--ccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC Q lcl|NC_010583. 217 ATWVDASK--FGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTG 294 (458) Q Consensus 217 a~~v~e~~--~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~ 294 (458) ++|++|+. +..|++.+++++++|++|++.++|++++++||+|+++|+.+++++||+++|++++++++|.++|+|+|++ T Consensus 71 a~~v~eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~ 150 (333) T protein:vir:78 71 VGQVGVGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPL 150 (333) T ss_pred eEeecCcccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCC Confidence 99999874 4567788899999999999999999999999999999999999999999999999999999999999987 Q ss_pred cccccccccccccccee-eccccchhhHHHHHHHHHHHhhhhhh-hcccceeEechhHHHHHH---hhhccccccccccc Q lcl|NC_010583. 295 QPKGLLKLAADDGAKVV-TEAKADGSVLVTAKTISKLRRKLGRH-GLKLSKLVLIVSMDAYYD---LLEDEEWQDVAQVG 369 (458) Q Consensus 295 ~p~Gi~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~---~~~d~~~~~~~~~~ 369 (458) +|.++............ ...........++.++.+++..+... +.....|+|||.++..|. .++|.+|+|+++.. T Consensus 151 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~ 230 (333) T protein:vir:78 151 TGSALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRI 230 (333) T ss_pred CCcccccccccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCc Confidence 65544433222111111 11222333445677788888777655 445567999999987765 47899999998543 Q ss_pred cccccccccCCeeecccceeccccccc----ccCCceEEEEEeceEEEEecceeEEeeccc-------------ccCCce Q lcl|NC_010583. 370 NDAVKLQGQVGRIYGLPVVVSEYFPAK----AASAEFAVIVYKDNFVMPRQRAVTVERERQ-------------AGKQRD 432 (458) Q Consensus 370 ~~~~~~~~~~~~l~G~pv~~~~~~~~~----~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~-------------~~~~~~ 432 (458) ...+.+++|+|+||++++++|.. ......+++++++.|.++++.++++..++| |.+|++ T Consensus 231 ----~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v 306 (333) T protein:vir:78 231 ----NLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQI 306 (333) T ss_pred ----cccCCCceeeceeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcCcE Confidence 33356789999999999999853 233456788999999999999999987654 678999 Q ss_pred EEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 433 AYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 433 ~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .||++.|+|+++.+|+||++++.++| T Consensus 307 ~~r~~~r~d~~v~~~~a~~~l~~~~a 332 (333) T protein:vir:78 307 AILIEVTFGWLLGDKQAFVKFVDDEQ 332 (333) T ss_pred EEEEEEEEccEEecccceEEEeccCC Confidence 99999999999999999999999999 No 71 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=3e-54 Score=314.00 Aligned_cols=348 Identities=16% Similarity=0.057 Sum_probs=227.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhh Q lcl|NC_010583. 51 LVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAF 130 (458) Q Consensus 51 ~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 130 (458) +.-+..++.++..++.+..+++... ... ..+..+. ....+.++.... ...+ T Consensus 1 m~~kl~~~~~~~~~~~~~~~~~~~~---~~~--~~~~~~~---~~~~~~~~~~~~-~~~e-------------------- 51 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEP---QER--QNELYGD---MINQLFEETKLQ-AKAE-------------------- 51 (381) T ss_pred CchhHHHHHHHHHHHHHHHHHhhhH---HHH--HHHHHHH---HHHhhhhhHHHH-HHHH-------------------- Confidence 3333333333322333333322110 000 0000111 111111110000 0000 Q ss_pred hhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEE Q lcl|NC_010583. 131 EDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLV 210 (458) Q Consensus 131 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~ 210 (458) .+.++............+.. ...+++ .++.++||++||+++.++|++.++..++|+++|++++++ +..++|+ T Consensus 52 ----~~~~~~~~~~~~~l~~~e~~--~~~~~~-~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~-~~~~i~~ 123 (381) T protein:vir:10 52 ----AERVSSLPKSAQTLSANQRN--FFMDIN-KSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLK 123 (381) T ss_pred ----HHHHHHhcccccccCHHHHH--HHHHHh-hcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecC-cceEEEe Confidence 01111110000111111111 112222 345567899999999999999999999999999999986 5678999 Q ss_pred ecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010583. 211 EPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSG 290 (458) Q Consensus 211 ~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G 290 (458) .++++.+.|++|.+.. ..+++|+|++|++.+||++++++||++||+|+.+++++||+++|+++++++++.+|++| T Consensus 124 ~~~~~~a~W~~e~~~~-----~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~G 198 (381) T protein:vir:10 124 SETSGVAVWGKIYGEI-----KGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKG 198 (381) T ss_pred ecCCcceEEeeccccc-----ccccCccceeEeecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEec Confidence 9999999998886532 34568999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCccccccccccccccce-eeccc-------cchhhHHHHHHHHHHHhhhh-------hhhcccceeEechhHHHHHH Q lcl|NC_010583. 291 NGTGQPKGLLKLAADDGAKV-VTEAK-------ADGSVLVTAKTISKLRRKLG-------RHGLKLSKLVLIVSMDAYYD 355 (458) Q Consensus 291 ~g~~~p~Gi~~~~~~~~~~~-~~~~~-------~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~l~ 355 (458) ||+++|.||++........+ ..... ........+..+..++..+. ..|..+..|+||+.++..+. T Consensus 199 dG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~ 278 (381) T protein:vir:10 199 TGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQ 278 (381) T ss_pred ccCCCceeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhc Confidence 99999999997543221111 00000 00111111222222222221 23556778999999988877 Q ss_pred hhh---ccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeec--ccccCC Q lcl|NC_010583. 356 LLE---DEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERE--RQAGKQ 430 (458) Q Consensus 356 ~~~---d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~--~~~~~~ 430 (458) .++ +.+|+|++.. -+|+||+++++||.+ .+++++|+.|.|+++.++++.++ .+|.+| T Consensus 279 ~~~~~~~~~G~~v~~l-------------p~g~~vv~~~~~p~~-----~i~fGDfs~Y~i~~r~~~~i~~~~~~~~~~d 340 (381) T protein:vir:10 279 AQYTHLNANGVYVTAL-------------PFNLNVIESTVQEAG-----KVLTYVKGLYDGYLAGGINVQKFKETLALDD 340 (381) T ss_pred cccccCCCCCceeecC-------------CCCceeEEcCCCCcC-----cEEEEEcccEEEEEecccEEEeechhhhhcC Confidence 544 5666665421 147889999999852 36789999999999999998774 578999 Q ss_pred ceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 431 RDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 431 ~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ++.|++..|+||++++|+||++++++.+ T Consensus 341 ~~~f~a~~r~dG~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:10 341 MDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred ceEEEEEEEEcCEEecCCcEEEEEEeec Confidence 9999999999999999999999999987 No 72 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=5.1e-53 Score=307.26 Aligned_cols=425 Identities=14% Similarity=0.124 Sum_probs=240.4 Q ss_pred CcchHHHHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLA----KSLEGLT-AAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDE 75 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~----~~~~~l~-~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e 75 (458) +.|.....+......... .+..... .............+++.++..+..++.+.... +...++.+..-+.+++ T Consensus 183 ~~~~~~~~~~~~~~~~~~~~r~~~~~a~~~~~~~~~a~~~~~~~~E~~r~~eI~~l~~~~~~--~~~~~~ai~~g~sld~ 260 (632) T protein:vir:96 183 AEMPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQ--RSLAQEAIQKGHTVDQ 260 (632) T ss_pred ccccchhhhhhccccccccccchhhcccccchhhhhhhhhhhhhhhHHHHHHHHHHHHHhhh--hhhHHHHHhccccHHH Confidence 112111111000000000 0000000 00000000111111111111111111111110 1111111111111111 Q ss_pred HHHHHHHHHHH----HHHHHHH----HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcc---hhhhhhHHHHHHHHHhhh Q lcl|NC_010583. 76 KSKKSAELFAQ----TVEKQQE----TIVGLQDEIKSLLAAREGRSFVGDSVAKALYGT---QDAFEDEVEKLVLLSYMM 144 (458) Q Consensus 76 ~~~~~~e~~~~----~~~~~~~----~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~---~~~~~~~~~~~a~~~~~~ 144 (458) ...+..+.... ....... ................+.+........+..... ......+..........+ T Consensus 261 ~ra~~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~ 340 (632) T protein:vir:96 261 FRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGK 340 (632) T ss_pred HHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhh Confidence 00000000000 0000000 000000000000000000000000000000000 000000111111111111 Q ss_pred ccchhHHHHHHHHhhhhhcccccccCccccchhH-HHHHHHHHHhccchhhh-cceeeeccCceEEEEecCCCccccccc Q lcl|NC_010583. 145 EKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIF-STRIIRDLQKELVVGAL-FDELPMSSKILTMLVEPEAGRATWVDA 222 (458) Q Consensus 145 ~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~-~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~p~~~~~~~a~~v~e 222 (458) ..............++.. .++.+.||++||+++ ...||+.+++.++++++ ++++|+.++.+++|+.++++.++|++| T Consensus 341 ~arg~~~~~~~l~~ra~~-~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g~~~ip~~~~~~~a~wv~E 419 (632) T protein:vir:96 341 EARGFYMPHEVLVQRQLE-KKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGE 419 (632) T ss_pred hhhhhhhhHHHHHHhhhh-cccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCcceEEEEEeCCceeEeecC Confidence 111111111222233333 344556788888775 67899999999999998 688999999999999999999999998 Q ss_pred ccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-Cccccccc Q lcl|NC_010583. 223 SKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGT-GQPKGLLK 301 (458) Q Consensus 223 ~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~-~~p~Gi~~ 301 (458) ++. +++++++|++|++.+++++++++||+++|.||.++++++|+++|++++++++|.++|+|+|+ ++|.||++ T Consensus 420 ~~~------~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~ 493 (632) T protein:vir:96 420 DED------VQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLN 493 (632) T ss_pred Ccc------ccccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeee Confidence 764 45568999999999999999999999999999999999999999999999999999999996 68999998 Q ss_pred cccccccceeeccccchhhHHHHHHHHHHHhhhhhhh--cccceeEechhHHHHHHh--hhccccccccccccccccccc Q lcl|NC_010583. 302 LAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHG--LKLSKLVLIVSMDAYYDL--LEDEEWQDVAQVGNDAVKLQG 377 (458) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~l~~--~~d~~~~~~~~~~~~~~~~~~ 377 (458) .+....... .. ...++.++.++...+...+ ..++.|+||+.++..+.+ ++|.+|+|+|+ T Consensus 494 ~~~~~~~~~---~~----~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~---------- 556 (632) T protein:vir:96 494 MTGVPALTY---PA----GGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ---------- 556 (632) T ss_pred cccccceec---cc----ccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeec---------- Confidence 765432211 11 1133455566665655544 346789999998877765 77999999874 Q ss_pred cCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeeccc--ccCCceEEEEEEeeccEEecccceEEEEe Q lcl|NC_010583. 378 QVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERERQ--AGKQRDAYYVTQRVNLQRYFENGVVSGAY 455 (458) Q Consensus 378 ~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~--~~~~~~~~~~~~r~d~~~~~~~afv~l~~ 455 (458) +++|+|+||++++++|.. .+++++|+.|.++++.++++..++| +.+|++.|+++.|+|+++++|++|++++. T Consensus 557 -~~~l~G~pv~~s~~ip~~-----~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~ 630 (632) T protein:vir:96 557 -NNEVNGYRAEASNQIPAD-----TWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKK 630 (632) T ss_pred -CCeecccceEeccccccC-----cEEEeecceEEEEEecceEEEEccccccccCceEEEEEeecCceeechhhhhheee Confidence 257999999999999853 4678999999999999999998875 57999999999999999999999999999 Q ss_pred ec Q lcl|NC_010583. 456 AA 457 (458) Q Consensus 456 aa 457 (458) +| T Consensus 631 ~A 632 (632) T protein:vir:96 631 GA 632 (632) T ss_pred cC Confidence 99 No 73 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=4.9e-54 Score=312.85 Aligned_cols=282 Identities=10% Similarity=-0.006 Sum_probs=227.0 Q ss_pred ccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccceeee Q lcl|NC_010583. 164 SSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEIS 243 (458) Q Consensus 164 ~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~ 243 (458) -.+.+.||++||+++.++|++.+++.++|+++|+++|++++..++|+.++++.++|++|++. +++++++|++++ T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~------~~~~~~~f~~v~ 74 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQ------KSESTATFAPVT 74 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCceeEEeecCcc------cccccceeeEEE Confidence 23445578999999999999999999999999999999999999999999999999998754 456689999999 Q ss_pred eehhheeeeehhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC---ccccccccccccccceeeccccc Q lcl|NC_010583. 244 FKTYKLAAKSFITDETEE---DAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTG---QPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 244 ~~~~k~~~~~~is~ell~---ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~---~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) +.++|++++++||+|+|+ |+.++|+++|.+++++++++++|.++++|++++ .|.|+.+................ T Consensus 75 l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~ 154 (311) T protein:vir:81 75 AIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTS 154 (311) T ss_pred EeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeeccccc Confidence 999999999999999996 556789999999999999999999999997543 35677765433222222221111 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccccccc- Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAK- 396 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~- 396 (458) ...+.++.+++..+......+..|+||+.++..|++++|.+|+|+|+... ..+.+++|+|+||++++.+|.. T Consensus 155 ---~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~----~~~~~~tl~G~Pv~~~~~i~~~~ 227 (311) T protein:vir:81 155 ---ATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELG----FGTDVASFAGLNAAVSDTVRGGP 227 (311) T ss_pred ---chHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCcc----ccCCCceecceeEEecccccccc Confidence 12233444455555444556667999999999999999999999996533 3356789999999999999842 Q ss_pred ------------ccCCceEEEEEeceEEEEecceeEEeecc---------cccCCceEEEEEEeeccEEecccceEEEEe Q lcl|NC_010583. 397 ------------AASAEFAVIVYKDNFVMPRQRAVTVERER---------QAGKQRDAYYVTQRVNLQRYFENGVVSGAY 455 (458) Q Consensus 397 ------------~~~~~~~~~~~~~~~~i~~~~~~~i~~~~---------~~~~~~~~~~~~~r~d~~~~~~~afv~l~~ 455 (458) ..+...+++++|++|.++.+.++++...+ +|.+|++.||++.|+|++|++|+||++++. T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~ 307 (311) T protein:vir:81 228 EAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRD 307 (311) T ss_pred cccccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEe Confidence 22344567899999999999888876543 378999999999999999999999999999 Q ss_pred ecC Q lcl|NC_010583. 456 AAA 458 (458) Q Consensus 456 aaa 458 (458) +.- T Consensus 308 a~~ 310 (311) T protein:vir:81 308 ADE 310 (311) T ss_pred ecc Confidence 988 No 74 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=7.3e-53 Score=306.40 Aligned_cols=359 Identities=16% Similarity=0.150 Sum_probs=227.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcch Q lcl|NC_010583. 48 MNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQ 127 (458) Q Consensus 48 ~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~ 127 (458) |..+... .+..+.+++...++....+..... +++...+.+.+..+...+..... .+.+. T Consensus 1 mt~~~~~--~e~~~~~~e~~~~~~~~~~~~~~~----e~~~~~~~~~~~~~~~~~~~~~~-~e~~~-------------- 59 (395) T protein:vir:95 1 MADMKQN--NVKLKNYHEHKKQFANLVQNGASD----EEQSKAFGAMFDALSNDLQEEIT-AEINN-------------- 59 (395) T ss_pred ChhHHHH--HHHHHHHHHHHHHHHHHHhhhhhH----HHHHHHHHHHHHHHHHHHHHHHH-HHHHH-------------- Confidence 3222111 111111222222222221111111 11111111112222111111100 00000 Q ss_pred hhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceE Q lcl|NC_010583. 128 DAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILT 207 (458) Q Consensus 128 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 207 (458) ..+..+....+.......+.+. ...++ ..++.++||++||+++.++|++.++..++|+++|+++|+++ ... T Consensus 60 ------~~~~~~~~~~r~~~~l~~ee~~-~~~~~-~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~-~~~ 130 (395) T protein:vir:95 60 ------RVVDNGILAKRSQDPLTSEERK-FFNDI-NYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGI-KTR 130 (395) T ss_pred ------HHHHHHHHhhcCccccchHHHH-HHHHH-hhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceE Confidence 0000000000111111111111 11122 23456678999999999999999999999999999999865 678 Q ss_pred EEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 208 MLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAF 287 (458) Q Consensus 208 ~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~ 287 (458) +|+..+.+.+.|+.|... ..++++|+|++|++.+|+++++++||++||+|+.+++++||+++|++++++++|.+| T Consensus 131 i~~~~~~~~a~w~~e~~~-----~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~ 205 (395) T protein:vir:95 131 VIKADPAGQAVWGKVFGE-----IKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAI 205 (395) T ss_pred EEEecCCcceEEeecccc-----cCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhhe Confidence 999999999999877543 234678999999999999999999999999999999999999999999999999999 Q ss_pred hccCCCC--ccccccccccccccceeecccc----chhhHHHHHHHHHHHhhh-------hhhhcccceeEechhHHHHH Q lcl|NC_010583. 288 MSGNGTG--QPKGLLKLAADDGAKVVTEAKA----DGSVLVTAKTISKLRRKL-------GRHGLKLSKLVLIVSMDAYY 354 (458) Q Consensus 288 l~G~g~~--~p~Gi~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~l 354 (458) |+|+|++ +|.||++............... .......+..+..++..+ ...+..+..|+||+.++. T Consensus 206 i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~-- 283 (395) T protein:vir:95 206 INGGGAAKTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSW-- 283 (395) T ss_pred eeccCCCCcCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhh-- Confidence 9999986 6999998765433322211111 111112222232222222 123455678999998754 Q ss_pred HhhhccccccccccccccccccccCCeee--cccceecccccccccCCceEEEEEeceEEEEecceeEEeec--ccccCC Q lcl|NC_010583. 355 DLLEDEEWQDVAQVGNDAVKLQGQVGRIY--GLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERE--RQAGKQ 430 (458) Q Consensus 355 ~~~~d~~~~~~~~~~~~~~~~~~~~~~l~--G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~--~~~~~~ 430 (458) +..|+|+|++. .|.+.+++ |+||++++.||.+ .+++++|+.|.|+++.++++.++ .+|.+| T Consensus 284 ----~~~g~~~~~~~------~G~~~~~lg~g~~v~~~~~~p~~-----~i~fgdfs~y~i~~r~~~~i~~~~~~~~~~d 348 (395) T protein:vir:95 284 ----DVQARYTYLTA------NGGFVTVLPYNVTIITSEFVPEG-----KLVAFVTDRYNAVRGGGLTVKKFDQTLALED 348 (395) T ss_pred ----hcCCcceeccC------CCcceeccCCcceEEEcCCCCCC-----cEEEEecccEEEEEecceEEEeccchhhhCC Confidence 45677877542 34556675 5567889999852 36789999999999999888664 568999 Q ss_pred ceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 431 RDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 431 ~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ++.||+..|+|+++++++||++|+++.+ T Consensus 349 ~~~f~~~~r~dg~~~~~~A~~~l~i~~~ 376 (395) T protein:vir:95 349 AVLFTAKTFAYGQPDDNKASAVYDLKVA 376 (395) T ss_pred cEEEEEEEEECCEEeccccEEEEEeecc Confidence 9999999999999999999999999877 No 75 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=2e-53 Score=309.46 Aligned_cols=352 Identities=17% Similarity=0.087 Sum_probs=228.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcch Q lcl|NC_010583. 48 MNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQ 127 (458) Q Consensus 48 ~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~ 127 (458) |.-..++ ..+.+++..+.++.+++... . .+..+...+....+.+++..... .+.+.... T Consensus 1 M~i~~~~-~~~~~e~~~~l~~~~~~~~~----~----e~~~~~~~~~~~~~~~~~~~~~~-~e~~~~~~----------- 59 (377) T protein:vir:96 1 MAINLKE-LPKYREAVAELSAKISAGAT----P----EEQEKLFEAAFTTMGDEILAKNE-EEMERMFD----------- 59 (377) T ss_pred CCccHHH-HHHHHHHHHHHHHHHhhccc----H----HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH----------- Confidence 3332211 11222222222222222110 0 11111222222222222221111 11111110 Q ss_pred hhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceE Q lcl|NC_010583. 128 DAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILT 207 (458) Q Consensus 128 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 207 (458) ...........++. ....+...++.++||++||+++.++|++.+...++++++|+++|++ +..+ T Consensus 60 -------------~~~~~~~lt~ee~~--~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~~~ 123 (377) T protein:vir:96 60 -------------LRDKNRELTAEEIK--FFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LRLK 123 (377) T ss_pred -------------hccCCcccCHHHHH--HHHHHHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecC-CceE Confidence 00000011111111 1112234566778999999999999999999999999999999976 5678 Q ss_pred EEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 208 MLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAF 287 (458) Q Consensus 208 ~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~ 287 (458) +|+.++.+.++|++|++..+ ++++|+|+++++.+|+++++++||++||+||.+++++||+++|+++++++++.+| T Consensus 124 i~~~~~~~~a~wv~e~~~~~-----~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~ 198 (377) T protein:vir:96 124 ALTAETSGTAVWGDIFGEIK-----GQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAI 198 (377) T ss_pred EEEecCCcceeEeecccccc-----cccCccceeEeeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhce Confidence 99999999999999875433 3568999999999999999999999999999999999999999999999999999 Q ss_pred hccCCCCccccccccccccccceeecc----------ccchhhHHHHHHHHHHHhhhhhhh-----------cccceeEe Q lcl|NC_010583. 288 MSGNGTGQPKGLLKLAADDGAKVVTEA----------KADGSVLVTAKTISKLRRKLGRHG-----------LKLSKLVL 346 (458) Q Consensus 288 l~G~g~~~p~Gi~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~ 346 (458) ++|+|+++|.||++............. ............+.++.+.+...+ ..+..|+| T Consensus 199 i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~m 278 (377) T protein:vir:96 199 VKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLL 278 (377) T ss_pred EeccCCCcceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEE Confidence 999999999999986533221111100 000001112233333333333322 24567999 Q ss_pred chhHHHHHHhhhccccccccccccccccccccCCeeecccc--eecccccccccCCceEEEEEeceEEEEecceeEEeec Q lcl|NC_010583. 347 IVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPV--VVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERE 424 (458) Q Consensus 347 ~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv--~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~ 424 (458) |+.++..+ .+++.++. ..|.+.+++|+|+ +.++.+|.+ .+++++|+.|.|+++.++++..+ T Consensus 279 n~~t~~~~------~~~~~~~~------~~G~~~~~l~~p~~v~~s~~~p~~-----~i~fgdf~~Y~i~~r~~~~i~~~ 341 (377) T protein:vir:96 279 NPEDRWTL------EAKFTSRN------QFGEYVTVLPHGITILESLAVETG-----KAIAFVANRYDAFMATASTIEEY 341 (377) T ss_pred chhhHHhc------cccccccC------CCCCceeccCCCceEEecCCCCcc-----cEEEEEcCcEEEEEecccEEEee Confidence 99876543 34444433 2355678888885 467777752 36789999999999999999874 Q ss_pred --ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 425 --RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 425 --~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+|.+|++.||+..|+||++++|+||++++++.- T Consensus 342 ~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 342 DQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred hhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 5688999999999999999999999999999999 No 76 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=2.4e-53 Score=309.09 Aligned_cols=366 Identities=13% Similarity=0.055 Sum_probs=224.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcch Q lcl|NC_010583. 48 MNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQ 127 (458) Q Consensus 48 ~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~ 127 (458) |...+++ ..+.++++.+++ .+..+... ...+..+........+.+++.... ..+. T Consensus 1 M~~kl~~----~~~~~~e~~~~l---~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---------------- 55 (383) T protein:vir:78 1 MTIKLKN----NLANYEEKRTAF---VNAVKNED-TQEIQNKAYVEMVDAMAADIMEQA-KKEA---------------- 55 (383) T ss_pred CchhHHH----HHHHHHHHHHHH---HHHHhccC-hHHHHHHHHHHHHHHHHHHHHHHH-HHHH---------------- Confidence 2222111 111111111111 11111100 000011111111111111111000 0000 Q ss_pred hhhhhHHHHHHHHHhhhccc-hhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce Q lcl|NC_010583. 128 DAFEDEVEKLVLLSYMMEKD-VFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL 206 (458) Q Consensus 128 ~~~~~~~~~~a~~~~~~~~~-~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 206 (458) ...+..+....+... ....+++ ...++ ..+++++||++||+++.++|++.++..++|+++|+++|+++ .. T Consensus 56 -----~~~~~~~~~~~~g~~~lt~~e~~--~~~~~-~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~-~~ 126 (383) T protein:vir:78 56 -----RQEADAYISASRTDKNITNEEIK--FFNDI-NKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGL-RT 126 (383) T ss_pred -----HHHHHHHHHhcCChhhhhHHHHH--HHHHH-hccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCC-ce Confidence 000011111001001 1111111 11122 24566788999999999999999999999999999999865 57 Q ss_pred EEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 207 TMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEA 286 (458) Q Consensus 207 ~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~ 286 (458) ++|+..+.+.++|++|++.. ..+++++|+++++.+|+++++++||++||+||++++++||+++|++++++++|.+ T Consensus 127 ~i~~~~~~~~a~w~~e~~~~-----~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a 201 (383) T protein:vir:78 127 KFLKSETSGVAVWGKIFGEI-----KGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESA 201 (383) T ss_pred EEEEEcCCcceEEeeccccc-----ccccCcceeeEeecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhh Confidence 99999999999999987533 2456899999999999999999999999999999999999999999999999999 Q ss_pred HhccCCCCcccccccccccccccee-eccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhh---cccc Q lcl|NC_010583. 287 FMSGNGTGQPKGLLKLAADDGAKVV-TEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLE---DEEW 362 (458) Q Consensus 287 ~l~G~g~~~p~Gi~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~---d~~~ 362 (458) |++|+|+++|.||++.....+..+. ...........+..++..+...+. .++.+..|+||..++..+.+++ +..+ T Consensus 202 ~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 280 (383) T protein:vir:78 202 YIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELT-DVYKYHSVKENGHPLNVAGKVTLLVNPTD 280 (383) T ss_pred eEeccCCCCceeeeeccCCcccccccccccccccchhhhhhhHHHHHHHH-HHHhccchhcccchhhhcCceEEEEcCcc Confidence 9999999999999986543322111 111111122223333333333333 3444455566665555555443 1111 Q ss_pred ccccccccccccccccCCeeeccc--ceecccccccccCCceEEEEEeceEEEEecceeEEeec--ccccCCceEEEEEE Q lcl|NC_010583. 363 QDVAQVGNDAVKLQGQVGRIYGLP--VVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERE--RQAGKQRDAYYVTQ 438 (458) Q Consensus 363 ~~~~~~~~~~~~~~~~~~~l~G~p--v~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~--~~~~~~~~~~~~~~ 438 (458) .+.+++........|.+.+++|+| |+.++++|.. .+++++|+.|.|+++.++++.++ .+|.+|++.||+.. T Consensus 281 ~~~~~~~~~~~~~~G~~~t~l~~~~~iv~s~~~p~~-----~iifgdfs~Y~i~~r~~~~i~~~~~~~f~~d~~~f~~~~ 355 (383) T protein:vir:78 281 AWDVKKQYTSLNANGVYVTALPFNLNIIESLFVPEK-----KAISYVAERYDALIGGPLDIGTYDQTLAIEDLNLYAAKQ 355 (383) T ss_pred hhhhccchhccCCCCceeeecCCCceEEecCCCCcc-----cEEEeeccceEEEecccceEEecchhhhhcCceEEEEEE Confidence 122222222223345666788777 4567778742 36789999999999999999874 56899999999999 Q ss_pred eeccEEecccceEEEEeecC Q lcl|NC_010583. 439 RVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 439 r~d~~~~~~~afv~l~~aaa 458 (458) |+|+++++|+||++++++-+ T Consensus 356 r~dG~~~~~~A~~vl~~~~~ 375 (383) T protein:vir:78 356 FAYGKAKDDKAAAVWTLNIN 375 (383) T ss_pred EEcCEEecCCeEEEEEEEec Confidence 99999999999999888888 No 77 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=3.5e-53 Score=308.20 Aligned_cols=347 Identities=15% Similarity=0.049 Sum_probs=229.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhh Q lcl|NC_010583. 51 LVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAF 130 (458) Q Consensus 51 ~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 130 (458) +.-+..++..+..++.+..+++.. .. . +..+...+.+..+.++.... ... T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~---~~-~----~~~~~~~~~~~~~~~~~~~~-~~~--------------------- 50 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGE---PQ-E----RQNELYGDMINQLFEETKLQ-AKA--------------------- 50 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhh---hh-H----HHHHHHHHHHHhhhhhHHHH-HHH--------------------- Confidence 222333222222222222222111 00 0 00000000111111111000 000 Q ss_pred hhHHHHHHHHHhhhccch-hHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEE Q lcl|NC_010583. 131 EDEVEKLVLLSYMMEKDV-FETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTML 209 (458) Q Consensus 131 ~~~~~~~a~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p 209 (458) +.+.++.. .+.+.. ...++.. ..++ ..++.++||++||+++.++|++.++..++|+++|+++++++ ..++| T Consensus 51 ---e~~~~~~~-~~~~~~lt~~e~~~--~~~~-~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~i~ 122 (381) T protein:vir:10 51 ---EAERVSSL-PKSAQSLSANQRSF--FMDI-NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFL 122 (381) T ss_pred ---HHHHHHHh-ccCcccccHHHHHH--HHHH-hcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCc-ceEEE Confidence 01111111 111111 1111111 1122 23456678999999999999999999999999999999765 57899 Q ss_pred EecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_010583. 210 VEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMS 289 (458) Q Consensus 210 ~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~ 289 (458) +..+.+.++|++|++..+ .+++++|++|++.+|+++++++||++||+|++++|++||+++|+++++++++.+|++ T Consensus 123 ~~~~~~~a~w~~e~~~~~-----~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~ 197 (381) T protein:vir:10 123 KSETSGVAVWGKIYGEIK-----GQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK 197 (381) T ss_pred EecCCcceeeeccccccc-----ccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEe Confidence 999999999999875432 356799999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCccccccccccccccce--------eeccccchhhHHHHHHHHHHHhhhhh-------hhcccceeEechhHHHHH Q lcl|NC_010583. 290 GNGTGQPKGLLKLAADDGAKV--------VTEAKADGSVLVTAKTISKLRRKLGR-------HGLKLSKLVLIVSMDAYY 354 (458) Q Consensus 290 G~g~~~p~Gi~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~l 354 (458) |+|+++|.||++........+ .............+..+.+++..+.. .|..+..|+||+.++..+ T Consensus 198 G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l 277 (381) T protein:vir:10 198 GTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEV 277 (381) T ss_pred ccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhh Confidence 999999999998654221110 01111111222334445555555542 356677899999998888 Q ss_pred Hhhhc---cccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeec--ccccC Q lcl|NC_010583. 355 DLLED---EEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERE--RQAGK 429 (458) Q Consensus 355 ~~~~d---~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~--~~~~~ 429 (458) ..+++ .+|+|.+.. -+|++|+.++.||. ..+++++|+.|.|+++.++++..+ .+|.+ T Consensus 278 ~~~~~~~~~~G~~v~~l-------------~~g~~vv~s~~~p~-----~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~ 339 (381) T protein:vir:10 278 QAQYTHLNANGVYVTAL-------------PFNLNVIESTVQEA-----GKVLTYVKGLYDGYLAGGINVQKFKETLALD 339 (381) T ss_pred ccccccCCCCCceeecC-------------CCCceEEecCCCCc-----CcEEEEecccEEEEEecccEEEeechhHhhc Confidence 76554 445444311 13667889999985 236789999999999999998775 56899 Q ss_pred CceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 430 QRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 430 ~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |++.||+..|+||++++++||++++++.+ T Consensus 340 d~~~f~a~~r~dg~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:10 340 DMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred CCeEEEEEEEEcCEEecCceEEEEEEEec Confidence 99999999999999999999999888887 No 78 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=3.5e-53 Score=308.20 Aligned_cols=347 Identities=15% Similarity=0.049 Sum_probs=229.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhh Q lcl|NC_010583. 51 LVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAF 130 (458) Q Consensus 51 ~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 130 (458) +.-+..++..+..++.+..+++.. .. . +..+...+.+..+.++.... ... T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~---~~-~----~~~~~~~~~~~~~~~~~~~~-~~~--------------------- 50 (381) T protein:vir:95 1 MTINLSETFANAKNEFINAVNNGE---PQ-E----RQNELYGDMINQLFEETKLQ-AKA--------------------- 50 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhh---hh-H----HHHHHHHHHHHhhhhhHHHH-HHH--------------------- Confidence 222333222222222222222111 00 0 00000000111111111000 000 Q ss_pred hhHHHHHHHHHhhhccch-hHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEE Q lcl|NC_010583. 131 EDEVEKLVLLSYMMEKDV-FETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTML 209 (458) Q Consensus 131 ~~~~~~~a~~~~~~~~~~-~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p 209 (458) +.+.++.. .+.+.. ...++.. ..++ ..++.++||++||+++.++|++.++..++|+++|+++++++ ..++| T Consensus 51 ---e~~~~~~~-~~~~~~lt~~e~~~--~~~~-~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~i~ 122 (381) T protein:vir:95 51 ---EAERVSSL-PKSAQSLSANQRSF--FMDI-NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFL 122 (381) T ss_pred ---HHHHHHHh-ccCcccccHHHHHH--HHHH-hcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCc-ceEEE Confidence 01111111 111111 1111111 1122 23456678999999999999999999999999999999765 57899 Q ss_pred EecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_010583. 210 VEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMS 289 (458) Q Consensus 210 ~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~ 289 (458) +..+.+.++|++|++..+ .+++++|++|++.+|+++++++||++||+|++++|++||+++|+++++++++.+|++ T Consensus 123 ~~~~~~~a~w~~e~~~~~-----~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~ 197 (381) T protein:vir:95 123 KSETSGVAVWGKIYGEIK-----GQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK 197 (381) T ss_pred EecCCcceeeeccccccc-----ccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEe Confidence 999999999999875432 356799999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCccccccccccccccce--------eeccccchhhHHHHHHHHHHHhhhhh-------hhcccceeEechhHHHHH Q lcl|NC_010583. 290 GNGTGQPKGLLKLAADDGAKV--------VTEAKADGSVLVTAKTISKLRRKLGR-------HGLKLSKLVLIVSMDAYY 354 (458) Q Consensus 290 G~g~~~p~Gi~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~l 354 (458) |+|+++|.||++........+ .............+..+.+++..+.. .|..+..|+||+.++..+ T Consensus 198 G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l 277 (381) T protein:vir:95 198 GTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEV 277 (381) T ss_pred ccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhh Confidence 999999999998654221110 01111111222334445555555542 356677899999998888 Q ss_pred Hhhhc---cccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeec--ccccC Q lcl|NC_010583. 355 DLLED---EEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERE--RQAGK 429 (458) Q Consensus 355 ~~~~d---~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~--~~~~~ 429 (458) ..+++ .+|+|.+.. -+|++|+.++.||. ..+++++|+.|.|+++.++++..+ .+|.+ T Consensus 278 ~~~~~~~~~~G~~v~~l-------------~~g~~vv~s~~~p~-----~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~ 339 (381) T protein:vir:95 278 QAQYTHLNANGVYVTAL-------------PFNLNVIESTVQEA-----GKVLTYVKGLYDGYLAGGINVQKFKETLALD 339 (381) T ss_pred ccccccCCCCCceeecC-------------CCCceEEecCCCCc-----CcEEEEecccEEEEEecccEEEeechhHhhc Confidence 76554 445444311 13667889999985 236789999999999999998775 56899 Q ss_pred CceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 430 QRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 430 ~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |++.||+..|+||++++++||++++++.+ T Consensus 340 d~~~f~a~~r~dg~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:95 340 DMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred CCeEEEEEEEEcCEEecCceEEEEEEEec Confidence 99999999999999999999999888887 No 79 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=7.6e-53 Score=306.33 Aligned_cols=344 Identities=10% Similarity=0.040 Sum_probs=223.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhh Q lcl|NC_010583. 45 LARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALY 124 (458) Q Consensus 45 ~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 124 (458) +++.+++. ++.+.+++..+.+++. +++++.+.......... ......... T Consensus 1 ~eei~~l~------------~~~~~l~~~~~~l~~~-------~d~~e~e~~~~~~~~~~----------~~~~~~~~~- 50 (352) T protein:vir:78 1 MEDIKQLE------------TEKAGLQQRFNIVERQ-------VQDIEEKEKAKVKDKGE----------AYQSLNDNE- 50 (352) T ss_pred ChhHHHHH------------HHHHHHHHHHHHHHHH-------HHHHHHHHHHHhhhccc----------cccccchhh- Confidence 11111111 1111111111111111 11111000000000000 000000000 Q ss_pred cchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccC Q lcl|NC_010583. 125 GTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSK 204 (458) Q Consensus 125 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 204 (458) .....+....+.....+...+ ..............++.++||++||+++.++|++.++.+++|+++++++++++ T Consensus 51 ~~~~~~~~~~r~~~~~~~~~~-----~~~~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~- 124 (352) T protein:vir:78 51 KLVKAKAEFYRHAILPNEFEK-----PSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG- 124 (352) T ss_pred hHHHHHHHHHHHHhhhhHHHH-----HHhhHHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC- Confidence 000000000000000000000 00111112222345567788999999999999999999999999999988764 Q ss_pred ceEEEEec-CCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 205 ILTMLVEP-EAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSI 283 (458) Q Consensus 205 ~~~~p~~~-~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~ 283 (458) ..+|+.. +.+.+.|++|++. .++++++|++|++.+++++++++||+++|+||.++|++||.++|++++++++ T Consensus 125 -~~~p~~~~~~~~a~~v~E~~~------~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e 197 (352) T protein:vir:78 125 -LEIPRVSYTLDDDDFITDVET------AKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKE 197 (352) T ss_pred -ceEEEEecCCCcccccccccc------cccccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHH Confidence 4566654 4578999988764 4456899999999999999999999999999999999999999999999986 Q ss_pred HH-HHhccCCCCccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccc Q lcl|NC_010583. 284 EE-AFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEW 362 (458) Q Consensus 284 d~-~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~ 362 (458) +. .|.+|+|+++|.|++....... ++ ....++++.++.+.+.+.|+.+++|+||+.++..+.++++..| T Consensus 198 ~~~~~~~g~g~~~~~g~l~~~~~~~---~t-------~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~ 267 (352) T protein:vir:78 198 RKDALAVSPKSGLEHMSFYNGSVKE---VE-------GANMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGT 267 (352) T ss_pred HHhhhhcCCCCcccccceecccccc---cc-------ccchHHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhccC Confidence 55 5668889999999986543221 11 1123678888999999999999999999999999888888888 Q ss_pred ccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeecc Q lcl|NC_010583. 363 QDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNL 442 (458) Q Consensus 363 ~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~ 442 (458) ++++. +.+.+|+|+||++++.++ .+++|+|+.|++. +.++.+.....+.++++.|++..|+|+ T Consensus 268 ~~~~~---------~~~~~llG~PV~~~~~~~-------~~~~Gdf~~~~~~-~~~~~~~~~~~~~~g~~~f~~~~r~Dg 330 (352) T protein:vir:78 268 TNFFD---------TPAEKVFGKPVVFTDAAV-------KPIVGDFNYFGIN-YDGTTYDTDKDVKKGEYLFVLTAWYDQ 330 (352) T ss_pred Ccccc---------cCCccccccceEEecCCC-------ceeEeehhhhhhh-hhhheeeeeccccCCeeEEEEEeeeCc Confidence 88873 335689999999988654 3567888877654 334555554455689999999999999 Q ss_pred EEecccceEEEEeecC Q lcl|NC_010583. 443 QRYFENGVVSGAYAAA 458 (458) Q Consensus 443 ~~~~~~afv~l~~aaa 458 (458) ++++|+||+.++++|| T Consensus 331 ~~~~~eA~~~l~~~a~ 346 (352) T protein:vir:78 331 QRTLDSAFRIAKAKES 346 (352) T ss_pred eeechhheEEEEeecc Confidence 9999999999999999 No 80 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=2.6e-53 Score=308.85 Aligned_cols=306 Identities=12% Similarity=-0.003 Sum_probs=238.7 Q ss_pred HHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCc Q lcl|NC_010583. 137 LVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGR 216 (458) Q Consensus 137 ~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~ 216 (458) .+..+.+ +.............+.++.+||+++.++|++.+++.++|+++|+++|++++..++|+....+. T Consensus 1 ~~~~~e~----------~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~ 70 (338) T protein:vir:78 1 MATLNEL----------APNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPE 70 (338) T ss_pred CcchHHh----------hhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcc Confidence 1111111 111110111122233456699999999999999999999999999999999999999999999 Q ss_pred cccccc--ccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC Q lcl|NC_010583. 217 ATWVDA--SKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTG 294 (458) Q Consensus 217 a~~v~e--~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~ 294 (458) ++|+++ ..+.+|++.+++++++|++|++.++|++++++||+|+|+|+.++++++|.++|++++++++|.+||+|+|++ T Consensus 71 a~~v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~ 150 (338) T protein:vir:78 71 VGQVGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPL 150 (338) T ss_pred ceeecccccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Confidence 999864 457788889999999999999999999999999999999999999999999999999999999999999975 Q ss_pred ---ccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhh-hcccceeEechhHHHHHH---hhhccccccccc Q lcl|NC_010583. 295 ---QPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRH-GLKLSKLVLIVSMDAYYD---LLEDEEWQDVAQ 367 (458) Q Consensus 295 ---~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~---~~~d~~~~~~~~ 367 (458) +|.||.+.......+.... ........+..+.++...+... .....+|+||+.++..|. +++|.+|+|+++ T Consensus 151 ~~~~~~gi~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~ 228 (338) T protein:vir:78 151 TGSALQGIDTNNVIVNTTNVDY--LQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPT 228 (338) T ss_pred cccccccccccccccccccccc--ccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeec Confidence 4677776554433222211 1222334556666666665443 445667999999987764 578999999985 Q ss_pred cccccccccccCCeeecccceeccccccc----ccCCceEEEEEeceEEEEecceeEEeeccc----------------c Q lcl|NC_010583. 368 VGNDAVKLQGQVGRIYGLPVVVSEYFPAK----AASAEFAVIVYKDNFVMPRQRAVTVERERQ----------------A 427 (458) Q Consensus 368 ~~~~~~~~~~~~~~l~G~pv~~~~~~~~~----~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~----------------~ 427 (458) .. ...+.+++|+|+||++++++|+. ......+++++++.|.++++.++++..+++ | T Consensus 229 ~~----~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 304 (338) T protein:vir:78 229 RI----NLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMW 304 (338) T ss_pred cc----ccCCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhh Confidence 43 33456789999999999999853 233456778999999999999999876543 6 Q ss_pred cCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 428 GKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 428 ~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+|++.||++.|+|+++.+|+||++++.+++ T Consensus 305 ~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 335 (338) T protein:vir:78 305 QTNQIAILIEVTFGWLLGDKQAFVKFVDDED 335 (338) T ss_pred hcCcEEEEEEEEeccEeecccceEEEecccC Confidence 7899999999999999999999999999999 No 81 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=2.3e-53 Score=309.19 Aligned_cols=296 Identities=15% Similarity=0.126 Sum_probs=239.4 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCccccccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDA 222 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e 222 (458) ++++.....+... . ..++++.++.+||+++.++|++.+++.++|+++++++|+.++..+||+.++.+.++|++| T Consensus 1 ~~~~~~~~~e~~~-----~-~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E 74 (318) T protein:vir:24 1 MAAGTAFAVDHAQ-----I-AQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVSAQWIGE 74 (318) T ss_pred CCCCCCCCHHHHH-----h-hcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceEEecC Confidence 4444333332221 1 223334456678999999999999999999999999999999999999999999999998 Q ss_pred ccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccc Q lcl|NC_010583. 223 SKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKL 302 (458) Q Consensus 223 ~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~ 302 (458) ++. +++++++|++|++.++|++++++||+|+|+||.++++++|.+.|++++++++|.++|+|+|+++|.|++.. T Consensus 75 g~~------~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~ 148 (318) T protein:vir:24 75 GDM------KPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQT 148 (318) T ss_pred Ccc------ccccccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccc Confidence 764 44568999999999999999999999999999999999999999999999999999999999999999876 Q ss_pred ccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccc-cccccCCe Q lcl|NC_010583. 303 AADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAV-KLQGQVGR 381 (458) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~-~~~~~~~~ 381 (458) .......... .........+.++...+...++.+..|+||+.++..|++++|++|+|+++...... +.....++ T Consensus 149 ~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~ 223 (318) T protein:vir:24 149 TKAISIADTT-----GATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGR 223 (318) T ss_pred cccccccccc-----cccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCce Confidence 5432222111 11222334556677778888899999999999999999999999999987644332 22233467 Q ss_pred eecccceecccccccccCCceEEEEEeceEEEEecceeEEeeccc----------------ccCCceEEEEEEeeccEEe Q lcl|NC_010583. 382 IYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERERQ----------------AGKQRDAYYVTQRVNLQRY 445 (458) Q Consensus 382 l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~----------------~~~~~~~~~~~~r~d~~~~ 445 (458) ++|+||++++.+|. +...+++++++.+.++++.++.+..+++ |.+|++.||++.|+|+++. T Consensus 224 i~g~pv~~~~~~~~---~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 300 (318) T protein:vir:24 224 IVARPTILSDHVVE---GTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCN 300 (318) T ss_pred EEEEeeEEeCCCCC---CccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEe Confidence 99999999998875 4556678899999999999888765332 7789999999999999999 Q ss_pred cccceEEEEeecC Q lcl|NC_010583. 446 FENGVVSGAYAAA 458 (458) Q Consensus 446 ~~~afv~l~~aaa 458 (458) +|+||++++.++| T Consensus 301 ~~~a~~~i~~~~a 313 (318) T protein:vir:24 301 DAEAFVALTNVVS 313 (318) T ss_pred cccceEEEEeecc Confidence 9999999999999 No 82 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=6.3e-53 Score=306.78 Aligned_cols=300 Identities=14% Similarity=0.115 Sum_probs=239.2 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCccccccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDA 222 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e 222 (458) +..+.....+... ...+++ +.++.+||++++++|++.+++.++|+++++++|+.++..++|+..+.+.+.|++| T Consensus 1 ~~~~~~~~~~~~~-----~~~t~~-~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E 74 (320) T protein:vir:10 1 MAAGTAFQVDHAQ-----IAQTGD-TMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGE 74 (320) T ss_pred CCCCccCCHHHHH-----hhcccc-ccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecC Confidence 3333332222211 122333 3344478999999999999999999999999999999999999999999999998 Q ss_pred ccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccc Q lcl|NC_010583. 223 SKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKL 302 (458) Q Consensus 223 ~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~ 302 (458) ++. +++++++|+++++.++|++++++||+|+|+|+.++++++|.++|++++++++|+++|+|+|+++|.++... T Consensus 75 ~~~------~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~ 148 (320) T protein:vir:10 75 GDM------KPITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQT 148 (320) T ss_pred Ccc------ccccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccc Confidence 754 45678999999999999999999999999999999999999999999999999999999999999888766 Q ss_pred ccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccc-cccccccCCe Q lcl|NC_010583. 303 AADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGND-AVKLQGQVGR 381 (458) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~-~~~~~~~~~~ 381 (458) ............+.+ ........+.++...+...+..++.|+||+.++..|++++|++|+|+++.... +.+.....++ T Consensus 149 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~ 227 (320) T protein:vir:10 149 TKSVSLADPGGATAS-DLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGR 227 (320) T ss_pred cccccceeccccccc-ccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCce Confidence 544333222222222 12222234667777888888999999999999999999999999999975433 2233344578 Q ss_pred eecccceecccccccccCCceEEEEEeceEEEEecceeEEeecc----------------cccCCceEEEEEEeeccEEe Q lcl|NC_010583. 382 IYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERER----------------QAGKQRDAYYVTQRVNLQRY 445 (458) Q Consensus 382 l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~----------------~~~~~~~~~~~~~r~d~~~~ 445 (458) ++|+||++++.+|. +....++++++.+.++.+.++++..++ .|.+|++.||++.|+|+++. T Consensus 228 i~g~pv~~~~~~~~---~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~ 304 (320) T protein:vir:10 228 IVSRPTILSDHVAD---GTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNN 304 (320) T ss_pred eeeeeeEecCCCCC---CceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEe Confidence 99999999999875 344567889999999999988876542 26789999999999999999 Q ss_pred cccceEEEEeecC Q lcl|NC_010583. 446 FENGVVSGAYAAA 458 (458) Q Consensus 446 ~~~afv~l~~aaa 458 (458) +|+||++++.++| T Consensus 305 ~~~a~~~l~~~~a 317 (320) T protein:vir:10 305 DKDAFVKLTNVVT 317 (320) T ss_pred cccceEEEEeccC Confidence 9999999999999 No 83 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=6.2e-53 Score=306.81 Aligned_cols=298 Identities=13% Similarity=0.076 Sum_probs=237.8 Q ss_pred hcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeecc Q lcl|NC_010583. 124 YGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSS 203 (458) Q Consensus 124 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 203 (458) .. +..........|....+++.. .++ ........|+.+||+++.++|++.+++.++|+++++++|+++ T Consensus 1 ~~--~~~~~~~~~~~f~~~~~~~~~---------~~a-~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~ 68 (324) T protein:vir:97 1 ME--QTQKLKLNLQHFASNNVKPQV---------FNP-DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEG 68 (324) T ss_pred Cc--cchhHHHHHHHHHHhhhhhhh---------hcc-ccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccC Confidence 00 001111111223222222211 111 223344557889999999999999999999999999999999 Q ss_pred CceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 204 KILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSI 283 (458) Q Consensus 204 ~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~ 283 (458) +..++|+.++.+.+.|++|++. +++++++|+.|+++++|++++++||+|+|+|+.++++++|.++|++++++++ T Consensus 69 ~~~~ip~~~~~~~a~~v~Eg~~------~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~ 142 (324) T protein:vir:97 69 TEKKFTFWADKPGAYWVGEGQK------IETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred CceEEEEEecCcceeEeccCcc------ccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999998764 4567899999999999999999999999999999999999999999999999 Q ss_pred HHHHhccCCCC-ccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccc Q lcl|NC_010583. 284 EEAFMSGNGTG-QPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEW 362 (458) Q Consensus 284 d~~~l~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~ 362 (458) |.++|+|+|++ .|.||++........ .....+++++.++..++...++.++.|+||+.++..|++++|++| T Consensus 143 d~a~l~G~g~~~~~~gi~~~~~~~~~~--------~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g 214 (324) T protein:vir:97 143 DEAGILNQGNNPFGKSIAQSIEKTNKV--------IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPET 214 (324) T ss_pred HHHhhccCCCCccCcccccccccccee--------ccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCC Confidence 99999999975 678888654332211 122345677888999999999999999999999999999999999 Q ss_pred ccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeecc----------------c Q lcl|NC_010583. 363 QDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERER----------------Q 426 (458) Q Consensus 363 ~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~----------------~ 426 (458) +++++. +.+++|+|+||++++..+. +...+++++++.+.++++.++++..++ + T Consensus 215 ~~~~~~--------~~~~tl~G~PV~~~~~~~~---~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:97 215 KERIYD--------RNSDTLDGLPVVNLKSSNL---KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred ceeecC--------CCCccccceeeEeecCCCC---CcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhh Confidence 998742 2346899999998876543 455678899999999999998887643 2 Q ss_pred ccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 427 AGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 427 ~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |.+|++.||++.|+|+++.+|+||++++.+.+ T Consensus 284 f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) T protein:vir:97 284 FEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred hhcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 77899999999999999999999999999988 No 84 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=4.3e-53 Score=307.69 Aligned_cols=283 Identities=11% Similarity=0.027 Sum_probs=225.8 Q ss_pred cccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccceee Q lcl|NC_010583. 163 GSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEI 242 (458) Q Consensus 163 ~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v 242 (458) -+.++..+|.+||++++.+|++.+++.++++++++++|+.++..++|+.++++.++|++|++ .+++++++|++| T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~------~~~~s~~~f~~v 74 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDIDIVAENG------KKTHGGVSLDPV 74 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEeeCCc------ccccccccceee Confidence 12233445678999999999999999999999999999999999999999999999999975 455678999999 Q ss_pred eeehhheeeeehhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHHhccCCC--Cccccccccccccccceeeccccc Q lcl|NC_010583. 243 SFKTYKLAAKSFITDETEE---DAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGT--GQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 243 ~~~~~k~~~~~~is~ell~---ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~--~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) ++++||++++++||+|++. |+.++++++|.++|++++++++|.++|+|+++ +.+.++.......+...... . T Consensus 75 ~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~---~ 151 (300) T protein:vir:95 75 TIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTV---P 151 (300) T ss_pred EeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceee---c Confidence 9999999999999999994 66789999999999999999999999999643 33333333222211111111 1 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKA 397 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 397 (458) ......+.++.++...+...++.+++|+||+.++..|++++|.+|+|+|+... ..+.+++|+|+||++++++|... T Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~----~~~~~~~l~G~Pv~~s~~v~~~~ 227 (300) T protein:vir:95 152 FKDTNPDESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELA----WGGVPDAINGLAVDKNRTVSYSQ 227 (300) T ss_pred ccccchHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCcc----ccCCCceecceeeEEecCCCCCC Confidence 12234456777888888877888889999999999999999999999985332 33567899999999999999755 Q ss_pred cCC-ceEEEEEece-EEEEecceeEEeecc----------cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 398 ASA-EFAVIVYKDN-FVMPRQRAVTVERER----------QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 398 ~~~-~~~~~~~~~~-~~i~~~~~~~i~~~~----------~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+. ..+++++|+. +.++.+.++++..++ +|.+|++.||++.|+|+++.+|+||+++|.++- T Consensus 228 ~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 228 TDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred CCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 443 3455678875 446677777776543 488999999999999999999999999999988 No 85 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=5.7e-53 Score=307.02 Aligned_cols=281 Identities=14% Similarity=0.064 Sum_probs=227.2 Q ss_pred ccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccceeeeee Q lcl|NC_010583. 166 SVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFK 245 (458) Q Consensus 166 ~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~ 245 (458) -...||+++|++++++|++.+++.++|+++|+++|++++..++|+.++.+.++|++|++ .+++++++|++|++. T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~------~~~~~~~~f~~v~l~ 74 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESG------KKTHGGVTLAPQTMV 74 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEecCCc------cccccccceeEEEEe Confidence 23446789999999999999999999999999999999999999999999999999875 445668999999999 Q ss_pred hhheeeeehhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHHhccCC--CCccccccccccccccceeeccccchhh Q lcl|NC_010583. 246 TYKLAAKSFITDETEE---DAIFSLLPLLRKRLIEAHAVSIEEAFMSGNG--TGQPKGLLKLAADDGAKVVTEAKADGSV 320 (458) Q Consensus 246 ~~k~~~~~~is~ell~---ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g--~~~p~Gi~~~~~~~~~~~~~~~~~~~~~ 320 (458) ++|++++++||+|+|+ |+.++|+++|.++|++++++++|.++++|++ ++.+.++.......+.+. ......... T Consensus 75 ~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~-~~~~~~~~~ 153 (298) T protein:vir:16 75 PIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVT-QKVEAPRGI 153 (298) T ss_pred eeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccc-ccccccccc Confidence 9999999999999996 5568899999999999999999999999964 455555544333222211 112222223 Q ss_pred HHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccccccccc-C Q lcl|NC_010583. 321 LVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAA-S 399 (458) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~-~ 399 (458) ...+.++.+++..+...+..+.+|+||+.++..|++++|.+|+|+|+.. +..+.+++|+|+||++++.+|.... . T Consensus 154 ~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~----~~~~~~~~l~G~PV~~~~~v~~~~~~~ 229 (298) T protein:vir:16 154 ADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPEL----KWGATPDTINGLPVDVNKTVSDMSLTQ 229 (298) T ss_pred ccHHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCc----ccCCCCceecceeeEEecccccccCCC Confidence 3345678888888888888888999999999999999999999998643 3345678999999999999996543 3 Q ss_pred CceEEEEEece-EEEEecceeEEeecc----------cccCCceEEEEEEeeccEEecccceEEEEeec Q lcl|NC_010583. 400 AEFAVIVYKDN-FVMPRQRAVTVERER----------QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAA 457 (458) Q Consensus 400 ~~~~~~~~~~~-~~i~~~~~~~i~~~~----------~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aa 457 (458) ...+++++|++ +.++.+.++++...+ +|.+|++.||++.|+|+++++|+||++++.+. T Consensus 230 ~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 230 RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 44566788875 456667766665432 47889999999999999999999999999999 No 86 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=5.8e-53 Score=306.98 Aligned_cols=281 Identities=10% Similarity=0.028 Sum_probs=226.0 Q ss_pred ccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccceeee Q lcl|NC_010583. 164 SSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEIS 243 (458) Q Consensus 164 ~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~ 243 (458) -++.+.||++||++++.+|++.+++.++|+++|+++|++++..++|+.++++.+.|++|++. +++++++|++++ T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~------~~~s~~~f~~v~ 74 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGK------KTHGGLSLEPVT 74 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEeecCcc------ccccccceeeEE Confidence 22344578999999999999999999999999999999999999999999999999998754 456789999999 Q ss_pred eehhheeeeehhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcc-----ccccccccccccceeeccc Q lcl|NC_010583. 244 FKTYKLAAKSFITDETEE---DAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQP-----KGLLKLAADDGAKVVTEAK 315 (458) Q Consensus 244 ~~~~k~~~~~~is~ell~---ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p-----~Gi~~~~~~~~~~~~~~~~ 315 (458) +.+||+++++++|+|+|. |+.++|.++|.++|++++++++|.++++|++++.. .|+....... +... T Consensus 75 l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~-----~~~~ 149 (303) T protein:vir:97 75 IVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKV-----TQVV 149 (303) T ss_pred eeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCcccccccccccccccc-----cccc Confidence 999999999999999994 66789999999999999999999999999764322 2222111111 1111 Q ss_pred cchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccc Q lcl|NC_010583. 316 ADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPA 395 (458) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~ 395 (458) ..+.....+.++.+++..+...+..+..|+||+.++..|++++|.+|+|++.+... ..+.+++|+|+||++++++|. T Consensus 150 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~---~~~~~~~l~G~Pv~~s~~v~~ 226 (303) T protein:vir:97 150 KFTESEDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELA---WGANPDSINGLKSSVNTTVGA 226 (303) T ss_pred ccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCcc---CCCCCceecceeeEEecccCC Confidence 11122345678888888888888888899999999999999999999999865432 234457899999999999996 Q ss_pred cc---cCCceEEEEEe-ceEEEEecceeEEeecc----------cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 396 KA---ASAEFAVIVYK-DNFVMPRQRAVTVERER----------QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 396 ~~---~~~~~~~~~~~-~~~~i~~~~~~~i~~~~----------~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .. .+...+++++| +.|.++.+.+++++..+ +|.+|++.||++.|+|+++++|+||+++|.+-= T Consensus 227 ~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 227 GADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred ccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 43 23445667777 46778888888776543 378999999999999999999999999998877 No 87 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=8.7e-53 Score=306.01 Aligned_cols=283 Identities=14% Similarity=0.107 Sum_probs=230.6 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCccccccccccccccccccccccccee Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTE 241 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~ 241 (458) ....+++.||.+||++++++|++.+++.++|+++++++|+.++..++|+.++++.+.|++|++..+++ .++.++++|++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~-~~~~s~~~f~~ 79 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKG-VKPTSKVTWAN 79 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcceEEeecccccccc-cccccccceee Confidence 44566677889999999999999999999999999999999999999999999999999999877764 46678899999 Q ss_pred eeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccc---cccccccccccceeeccccch Q lcl|NC_010583. 242 ISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPK---GLLKLAADDGAKVVTEAKADG 318 (458) Q Consensus 242 v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~---Gi~~~~~~~~~~~~~~~~~~~ 318 (458) |++.+||++++++||+|+++||.++++++|+++|++++++++|.+||+|+|++.+. ++.+........ ........ T Consensus 80 i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 158 (305) T protein:vir:25 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQA-VEVVGGVA 158 (305) T ss_pred EEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCcccccccccccccccc-ccccccch Confidence 99999999999999999999999999999999999999999999999999976433 333332222111 12222222 Q ss_pred hhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccccccccc Q lcl|NC_010583. 319 SVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAA 398 (458) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~ 398 (458) ........+..+...+.........|+||+.++..|++++|++|+|+|++ ++|+|+||++++++|.. . T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~-----------~~l~G~Pv~~~~~~~~~-~ 226 (305) T protein:vir:25 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-----------DSFAGFRTFFNRNGAWD-A 226 (305) T ss_pred hhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecC-----------CcccccceEEcCccCCC-C Confidence 22233444555555555555566679999999999999999999999842 47999999999998854 3 Q ss_pred CCceEEEEEeceEEEEecceeEEeeccc------------ccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 399 SAEFAVIVYKDNFVMPRQRAVTVERERQ------------AGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 399 ~~~~~~~~~~~~~~i~~~~~~~i~~~~~------------~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +...+++++++.|.++.+.++++..+++ |.+|++.+|++.|+|+.+.+|+||++++..-. T Consensus 227 ~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~ 298 (305) T protein:vir:25 227 DAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) T ss_pred CccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccc Confidence 4556778999999999999998876543 67899999999999999999999999888643 No 88 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=2.6e-52 Score=303.37 Aligned_cols=298 Identities=13% Similarity=0.053 Sum_probs=235.0 Q ss_pred hcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeecc Q lcl|NC_010583. 124 YGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSS 203 (458) Q Consensus 124 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 203 (458) ... ........+ .|.....+. ...++. .......++.+||+++.++|++.++..++|+++++++|+++ T Consensus 1 ~~~-~~~~~~~~~-~f~~~~~~~---------~~~~a~-~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~ 68 (324) T protein:vir:93 1 MEQ-TQKLKLNLQ-HFASNNVKP---------QVFNPD-NVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG 68 (324) T ss_pred Cch-hHHHHHHHH-HHHHhhhhh---------hhcccc-cccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccC Confidence 000 000001111 122221111 111122 22333445668999999999999999999999999999999 Q ss_pred CceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 204 KILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSI 283 (458) Q Consensus 204 ~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~ 283 (458) +..+||+.++.+.++|++|++. +++++++|++|++.++|++++++||+|+++||.++++++|.++|++++++++ T Consensus 69 ~~~~ip~~~~~~~a~~v~Eg~~------~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~ 142 (324) T protein:vir:93 69 TEKKFTFWADKPGAYWVGEGQK------IETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred CceEEEEEecCcceeeecCCcc------ccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999988754 4456789999999999999999999999999999999999999999999999 Q ss_pred HHHHhccCCCC-ccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccc Q lcl|NC_010583. 284 EEAFMSGNGTG-QPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEW 362 (458) Q Consensus 284 d~~~l~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~ 362 (458) |.++|+|+|++ .|.|+++........ .....+++++.+++..+...+..+..|+||+.++..|++++|++| T Consensus 143 d~a~l~G~g~~~~~~~~~~~~~~~~~~--------~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G 214 (324) T protein:vir:93 143 DEAGILNQGNNPFGKSIAQSIEKTNKV--------IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPET 214 (324) T ss_pred HHHHhcCCCCCCcCcccccccccccee--------ccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCC Confidence 99999999875 678887755332211 112345678888899999999999999999999999999999999 Q ss_pred ccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeeccc---------------- Q lcl|NC_010583. 363 QDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERERQ---------------- 426 (458) Q Consensus 363 ~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~---------------- 426 (458) +++++. +.+++|+|+||++++..+ .+...+++++++.+.++.+.++++..+++ T Consensus 215 ~~~~~~--------~~~~~l~G~PVv~~~~~~---~~~~~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:93 215 KERIYD--------RNSDSLDGLPVVNLKSSN---LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred CeeecC--------CCCCcccceeeEeecCCC---CCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhh Confidence 998743 235689999999876544 34556788999999999999988876543 Q ss_pred ccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 427 AGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 427 ~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |.+|++.||++.|+|+++.+|+||++++.|.+ T Consensus 284 f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:93 284 FEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred hhcCcEEEEEEEEeccEEecccceEEEecccc Confidence 67899999999999999999999999999888 No 89 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=2.5e-52 Score=303.52 Aligned_cols=298 Identities=13% Similarity=0.064 Sum_probs=236.6 Q ss_pred hcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeecc Q lcl|NC_010583. 124 YGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSS 203 (458) Q Consensus 124 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 203 (458) .... +. .......|.....+. ...++ ....+.+.++.+||+++.+.|++.+++.++|+++++++|+++ T Consensus 1 ~~~~-~~-~~~~~~~~~~~~~~~---------~~~~a-~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~ 68 (324) T protein:vir:78 1 MEQT-QK-LKLNLQHFASNNVKP---------QVFNP-DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEG 68 (324) T ss_pred CCcc-hh-hhHHHHHHHHHhhhh---------hhhcc-ccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccC Confidence 0000 00 111111222222111 11111 223445667889999999999999999999999999999999 Q ss_pred CceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 204 KILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSI 283 (458) Q Consensus 204 ~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~ 283 (458) +..+||+..+.+.++|++|++. +++++++|++|++.++|++++++||+|+|+|+.++++++|.++|++++++++ T Consensus 69 ~~~~~p~~~~~~~a~~v~Eg~~------~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~ 142 (324) T protein:vir:78 69 TEKKFTFWADKPGAYWVGEGQK------IETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred CceEEEEEecCcceeEecCCcc------ccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999988754 4567899999999999999999999999999999999999999999999999 Q ss_pred HHHHhccCCCC-ccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccc Q lcl|NC_010583. 284 EEAFMSGNGTG-QPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEW 362 (458) Q Consensus 284 d~~~l~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~ 362 (458) |.++|+|+|++ .|.||.+........ .....+++++.++...+...+..+++|+||+.++..|++++|++| T Consensus 143 d~a~l~G~g~~~~~~gi~~~~~~~~~~--------~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G 214 (324) T protein:vir:78 143 DEAGILNQGNNPFGKSIAQSIEKTNKV--------IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPET 214 (324) T ss_pred HHHHhccCCCCCcCcccccccccccee--------ccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCC Confidence 99999999875 578887654332211 112345778888889999999999999999999999999999999 Q ss_pred ccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeecc----------------c Q lcl|NC_010583. 363 QDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERER----------------Q 426 (458) Q Consensus 363 ~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~----------------~ 426 (458) +++++. +.+++|+|+||++++.++ .+...+++++++.+.++.+.++++..++ + T Consensus 215 ~~~~~~--------~~~~~l~G~PV~~~~~~~---~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:78 215 KERIYD--------RNSDSLDGLPVVNLKSSN---LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred CeeecC--------CCCCcccceeeEeeCCCC---CCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhh Confidence 998743 334689999999876554 3455678899999999999988886543 2 Q ss_pred ccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 427 AGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 427 ~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |.+|++.||++.|+|+++.+|+||++++.+.+ T Consensus 284 f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:78 284 FEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred hhcCcEEEEEEEEEccEEecccceEEEecccc Confidence 77899999999999999999999999998877 No 90 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=2.5e-52 Score=303.52 Aligned_cols=298 Identities=13% Similarity=0.064 Sum_probs=236.6 Q ss_pred hcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeecc Q lcl|NC_010583. 124 YGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSS 203 (458) Q Consensus 124 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 203 (458) .... +. .......|.....+. ...++ ....+.+.++.+||+++.+.|++.+++.++|+++++++|+++ T Consensus 1 ~~~~-~~-~~~~~~~~~~~~~~~---------~~~~a-~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~ 68 (324) T protein:vir:96 1 MEQT-QK-LKLNLQHFASNNVKP---------QVFNP-DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEG 68 (324) T ss_pred CCcc-hh-hhHHHHHHHHHhhhh---------hhhcc-ccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccC Confidence 0000 00 111111222222111 11111 223445667889999999999999999999999999999999 Q ss_pred CceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 204 KILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSI 283 (458) Q Consensus 204 ~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~ 283 (458) +..+||+..+.+.++|++|++. +++++++|++|++.++|++++++||+|+|+|+.++++++|.++|++++++++ T Consensus 69 ~~~~~p~~~~~~~a~~v~Eg~~------~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~ 142 (324) T protein:vir:96 69 TEKKFTFWADKPGAYWVGEGQK------IETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred CceEEEEEecCcceeEecCCcc------ccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999988754 4567899999999999999999999999999999999999999999999999 Q ss_pred HHHHhccCCCC-ccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccc Q lcl|NC_010583. 284 EEAFMSGNGTG-QPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEW 362 (458) Q Consensus 284 d~~~l~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~ 362 (458) |.++|+|+|++ .|.||.+........ .....+++++.++...+...+..+++|+||+.++..|++++|++| T Consensus 143 d~a~l~G~g~~~~~~gi~~~~~~~~~~--------~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G 214 (324) T protein:vir:96 143 DEAGILNQGNNPFGKSIAQSIEKTNKV--------IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPET 214 (324) T ss_pred HHHHhccCCCCCcCcccccccccccee--------ccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCC Confidence 99999999875 578887654332211 112345778888889999999999999999999999999999999 Q ss_pred ccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeecc----------------c Q lcl|NC_010583. 363 QDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERER----------------Q 426 (458) Q Consensus 363 ~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~----------------~ 426 (458) +++++. +.+++|+|+||++++.++ .+...+++++++.+.++.+.++++..++ + T Consensus 215 ~~~~~~--------~~~~~l~G~PV~~~~~~~---~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:96 215 KERIYD--------RNSDSLDGLPVVNLKSSN---LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred CeeecC--------CCCCcccceeeEeeCCCC---CCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhh Confidence 998743 334689999999876554 3455678899999999999988886543 2 Q ss_pred ccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 427 AGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 427 ~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |.+|++.||++.|+|+++.+|+||++++.+.+ T Consensus 284 f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:96 284 FEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred hhcCcEEEEEEEEEccEEecccceEEEecccc Confidence 77899999999999999999999999998877 No 91 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=6.9e-52 Score=301.08 Aligned_cols=298 Identities=14% Similarity=0.083 Sum_probs=236.9 Q ss_pred hcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeecc Q lcl|NC_010583. 124 YGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSS 203 (458) Q Consensus 124 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 203 (458) ....++. ......|...+.++... ++. .......++.+||++++++|++.+++.++|+++++++|+++ T Consensus 1 ~~~~~~~--~~~~~~f~~~~~~~~~~---------~a~-~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~ 68 (324) T protein:vir:10 1 MEQTQKL--KLNLQHFASNNVKPQVF---------NPD-NVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG 68 (324) T ss_pred CCCchHH--HHHHHHHHHHhhcccee---------ccc-ceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccC Confidence 0000111 11122233433332211 111 12233344568999999999999999999999999999999 Q ss_pred CceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 204 KILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSI 283 (458) Q Consensus 204 ~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~ 283 (458) +...||+..+.+.+.|++|++. +++++++|+++++.++|++++++||+|+++|+.+++++||.++|++++++++ T Consensus 69 ~~~~~p~~~~~~~a~~v~Eg~~------~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~ 142 (324) T protein:vir:10 69 TEKKFTFWADKPGAYWVGEGQK------IETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred CceEEEEEeCCcceeEeccCcc------ccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999998764 4566899999999999999999999999999999999999999999999999 Q ss_pred HHHHhccCCCC-ccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccc Q lcl|NC_010583. 284 EEAFMSGNGTG-QPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEW 362 (458) Q Consensus 284 d~~~l~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~ 362 (458) |.++|+|+|++ .|.|+++........ .....+++++.++...+...++.++.|+||+.++..|++++|++| T Consensus 143 d~a~l~G~g~~~~~~~i~~~~~~~~~~--------~~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g 214 (324) T protein:vir:10 143 DEAGILNQGNNPFGKSIAQSIEKTNKV--------IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPET 214 (324) T ss_pred HHHhhhcCCCCccCcccccccccccee--------ccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCC Confidence 99999999975 688888654332211 112345678888999999999999999999999999999999999 Q ss_pred ccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeeccc---------------- Q lcl|NC_010583. 363 QDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERERQ---------------- 426 (458) Q Consensus 363 ~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~---------------- 426 (458) +++++. +.+++|+|+||++++.++ .+...+++++++.+.++.+.++++..+++ T Consensus 215 ~~~~~~--------~~~~~l~G~PV~~~~~~~---~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:10 215 KERIYD--------RNSDTLDGLPVVNLKSSN---LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred ceeecC--------CCCccccceeEEeecCCC---CCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhh Confidence 998743 334689999999886654 34556788999999999999888865432 Q ss_pred ccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 427 AGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 427 ~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |.+|++.||++.|+|+++.+|+||++++.+++ T Consensus 284 ~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:10 284 FEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred hhcCcEEEEEEEEEccEEecccceEEEEeccC Confidence 67899999999999999999999999999999 No 92 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=9e-52 Score=300.43 Aligned_cols=297 Identities=14% Similarity=0.081 Sum_probs=236.1 Q ss_pred hhhhh-hHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCc Q lcl|NC_010583. 127 QDAFE-DEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKI 205 (458) Q Consensus 127 ~~~~~-~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 205 (458) +++.. .......|.....++... ++.. ......++.+||+++++.|++.+++.++|+++++++|++++. T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~---------~a~~-~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 70 (324) T protein:vir:99 1 MEQTQKLKLNLQHFASNNVKPQVF---------NPDN-VMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTE 70 (324) T ss_pred CCCchHhhHHHHHHHHHhhhhhhc---------cccc-eeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCc Confidence 11111 011122233333222111 1222 223334456899999999999999999999999999999999 Q ss_pred eEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 206 LTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE 285 (458) Q Consensus 206 ~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~ 285 (458) .+||+..+.+.+.|++|++. +++++++|+++++.++|++++++||+|+++|+.+++++||.++|++++++++|. T Consensus 71 ~~~p~~~~~~~a~~v~Eg~~------~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~ 144 (324) T protein:vir:99 71 KKFTFWADKPGAYWVGEGQK------IETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDE 144 (324) T ss_pred eEEEEEecCcceeEeccCcc------ccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999988754 456788999999999999999999999999999999999999999999999999 Q ss_pred HHhccCCCC-ccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccccc Q lcl|NC_010583. 286 AFMSGNGTG-QPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQD 364 (458) Q Consensus 286 ~~l~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~ 364 (458) ++|+|+|++ .|.|+++....... ......++.++.++...+.+.++.++.|+||+.++..|++++|++|++ T Consensus 145 ~~l~G~g~~~~~~~~~~~~~~~~~--------~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~ 216 (324) T protein:vir:99 145 AGILNQGNNPFGKSIAQSIEKTNK--------VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKE 216 (324) T ss_pred HhhhcCCCCccCccccccccccce--------eccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCce Confidence 999999975 67888765433221 111234567888899999999999999999999999999999999999 Q ss_pred ccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeeccc----------------cc Q lcl|NC_010583. 365 VAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERERQ----------------AG 428 (458) Q Consensus 365 ~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~----------------~~ 428 (458) ++.. +.+++|+|+||++++.++. +...+++++++.+.++++.++++..+++ |. T Consensus 217 ~~~~--------~~~~~l~G~PVv~~~~~~~---~~~~~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~ 285 (324) T protein:vir:99 217 RIYD--------RNSDTLDGLPVVNLKSSNL---KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFE 285 (324) T ss_pred eecC--------CCCccccceeEEeecCCCC---CcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhh Confidence 8742 2346899999998876653 4556788999999999999988876432 67 Q ss_pred CCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 429 KQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 429 ~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +|++.||++.|+|+++.+|+||++++.+.+ T Consensus 286 ~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~ 315 (324) T protein:vir:99 286 QDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred cCcEEEEEEEEEccEEecccceEEEEeccC Confidence 899999999999999999999999999988 No 93 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=6.6e-52 Score=301.17 Aligned_cols=289 Identities=13% Similarity=0.105 Sum_probs=228.0 Q ss_pred chhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCccccccccccc Q lcl|NC_010583. 147 DVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFG 226 (458) Q Consensus 147 ~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~ 226 (458) .....+. ..+...++..+|++ +|+++..+|++.+++.++|++++++++++++..+||+....+.++|++|++. T Consensus 1 ~g~~~e~-----~~~~~~~t~~~~g~-l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~- 73 (397) T protein:vir:23 1 MGFSADH-----SQIAQTKDTMFTGY-LDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWIGEGDM- 73 (397) T ss_pred CCcCHHH-----HHHhhccCCCCccc-cchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEecCCcc- Confidence 1111111 11122344444555 5556788999999999999999999999999999999999999999998754 Q ss_pred ccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccc Q lcl|NC_010583. 227 TDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADD 306 (458) Q Consensus 227 ~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~ 306 (458) +++++++|++|++.+||++++++||+|+|+|+.++++++|+++|++++++++|.++|+|+|++++.+.+..... T Consensus 74 -----~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~- 147 (397) T protein:vir:23 74 -----KPITKGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSN- 147 (397) T ss_pred -----ccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCccccccccccc- Confidence 45678999999999999999999999999999999999999999999999999999999998765443322221 Q ss_pred ccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccc-cccccCCeeecc Q lcl|NC_010583. 307 GAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAV-KLQGQVGRIYGL 385 (458) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~-~~~~~~~~l~G~ 385 (458) ...... ....+.++.++...+...+..++.|+||+.++..|++++|.+|+|+|+...... +..+.+++|+|+ T Consensus 148 -~~~~~~------~~~~~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~ 220 (397) T protein:vir:23 148 -KTQSIS------PNAYQGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGR 220 (397) T ss_pred -ceeeec------ccchhHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeee Confidence 111111 112234455666677788889999999999999999999999999997654433 233455789999 Q ss_pred cceecccccccccCCceEEEEEeceEEEEecceeEEeeccc----------------ccCCceEEEEEEeeccEEecccc Q lcl|NC_010583. 386 PVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERERQ----------------AGKQRDAYYVTQRVNLQRYFENG 449 (458) Q Consensus 386 pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~----------------~~~~~~~~~~~~r~d~~~~~~~a 449 (458) ||++++++|. +....++++++.+.++.+.++.+..++. |.+|++.||++.|+|+++++|+| T Consensus 221 Pv~~s~~~~~---g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a 297 (397) T protein:vir:23 221 PTILSDHVAE---GDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNA 297 (397) T ss_pred eEEEeCCCCC---CceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccc Confidence 9999999985 3445677899998899999888765432 67899999999999999999999 Q ss_pred eEEEEeecC Q lcl|NC_010583. 450 VVSGAYAAA 458 (458) Q Consensus 450 fv~l~~aaa 458 (458) |++++..+. T Consensus 298 ~~~~~~~~~ 306 (397) T protein:vir:23 298 FVKLTFDPV 306 (397) T ss_pred eEEEeeccc Confidence 999999777 No 94 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=1.5e-51 Score=299.19 Aligned_cols=278 Identities=13% Similarity=0.066 Sum_probs=223.8 Q ss_pred ccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccceeeeee Q lcl|NC_010583. 166 SVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFK 245 (458) Q Consensus 166 ~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~ 245 (458) =+..||++||+++.++|++.+++.++|+++++++|++++..++|+.++++.++|++|++ .+++++++|++|++. T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~------~~~~~~~~f~~v~l~ 74 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESG------KKTHGGVTLAPQTMV 74 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCc------cccccccceeEEEEe Confidence 12246789999999999999999999999999999999999999999999999999875 445678999999999 Q ss_pred hhheeeeehhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHHhccCC--CCcc---ccccccccccccceeeccccc Q lcl|NC_010583. 246 TYKLAAKSFITDETEE---DAIFSLLPLLRKRLIEAHAVSIEEAFMSGNG--TGQP---KGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 246 ~~k~~~~~~is~ell~---ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g--~~~p---~Gi~~~~~~~~~~~~~~~~~~ 317 (458) ++|++++++||+|+|+ ++..+|+++|.++|++++++++|.++++|++ ++.+ .|+....... ....... T Consensus 75 ~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~----~~~~~~~ 150 (298) T protein:vir:94 75 PIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKV----TQKVEAP 150 (298) T ss_pred eeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccccc----ccccccc Confidence 9999999999999996 4457899999999999999999999999953 3322 2222211111 1111122 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKA 397 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 397 (458) ......+.++.+++..+...+..+.+|+||+.++..|++++|.+|+|+|+... ..+.+++|+|+||++++.+|... T Consensus 151 ~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~----~~~~~~tl~G~PV~~~~~v~~~~ 226 (298) T protein:vir:94 151 RGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELK----WGATPDTINGLPVDVNKTVSDMS 226 (298) T ss_pred cccccHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcc----cCCCCceecceeeEEeccccccc Confidence 22334556788888888888888899999999999999999999999986533 34567899999999999999643 Q ss_pred -cCCceEEEEEece-EEEEecceeEEeecc----------cccCCceEEEEEEeeccEEecccceEEEEeec Q lcl|NC_010583. 398 -ASAEFAVIVYKDN-FVMPRQRAVTVERER----------QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAA 457 (458) Q Consensus 398 -~~~~~~~~~~~~~-~~i~~~~~~~i~~~~----------~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aa 457 (458) .....+++++++. +.++.+.++++..++ +|.+|++.||++.|+|+++.+|+||++++.+. T Consensus 227 ~~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 227 LTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred CCCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 3345567788875 456677777765533 47899999999999999999999999999999 No 95 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=1.3e-51 Score=299.65 Aligned_cols=271 Identities=12% Similarity=0.009 Sum_probs=225.1 Q ss_pred HhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce--EEEEec-CCCcccccccccccccccccc Q lcl|NC_010583. 157 HIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL--TMLVEP-EAGRATWVDASKFGTDETVGD 233 (458) Q Consensus 157 ~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~--~~p~~~-~~~~a~~v~e~~~~~e~~~~~ 233 (458) ..++ ...++.++||++||+++.++|++.++++++|+++++++|+++... .+|... ..+.+.|++|++..++ T Consensus 1 ~l~~-~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~----- 74 (293) T protein:vir:48 1 MLDS-KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIAD----- 74 (293) T ss_pred Ccee-ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCccccc----- Confidence 1222 234566678999999999999999999999999999999876554 455554 4577899999865543 Q ss_pred cccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeec Q lcl|NC_010583. 234 EVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTE 313 (458) Q Consensus 234 ~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~ 313 (458) .++++|++|++.++|++++++||+|+++|+.+++++||.++|++++++++|.+|++|+|++.+. T Consensus 75 ~~~~~~~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~---------------- 138 (293) T protein:vir:48 75 IDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTK---------------- 138 (293) T ss_pred ccccceeEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccccccc---------------- Confidence 3568999999999999999999999999999999999999999999999999999998764321 Q ss_pred cccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccc- Q lcl|NC_010583. 314 AKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEY- 392 (458) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~- 392 (458) ....+++++.++..++.+.++.++.|+||+.++..|++++|.+|+|+++... ..+.+++|+|+||+++++ T Consensus 139 -----~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~----~~~~~~~l~G~Pv~~~~~~ 209 (293) T protein:vir:48 139 -----PTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDV----KSPTGYSIAGFAVKEISDR 209 (293) T ss_pred -----ccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCc----CCCCCceecceeeEEeccc Confidence 1123466788888999999999999999999999999999999999997643 335567999999998654 Q ss_pred -ccccccCCceEEEEEece-EEEEecceeEEeecc----cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 393 -FPAKAASAEFAVIVYKDN-FVMPRQRAVTVERER----QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 393 -~~~~~~~~~~~~~~~~~~-~~i~~~~~~~i~~~~----~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +|....+...+++++++. |+++++.++++..++ +|.+|++.||++.|+|+++++|+||++++++++ T Consensus 210 ~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 281 (293) T protein:vir:48 210 WLPNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAI 281 (293) T ss_pred ccCCccCCceEEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeecc Confidence 455556666777888874 789999998887653 688999999999999999999999999999887 No 96 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=3e-51 Score=297.60 Aligned_cols=298 Identities=13% Similarity=0.073 Sum_probs=234.0 Q ss_pred hcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeecc Q lcl|NC_010583. 124 YGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSS 203 (458) Q Consensus 124 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 203 (458) ....++.. .....|.+.+.++.. .++. .......++.+||++++++|++.+++.++|+++++++|+++ T Consensus 1 ~~~~~~~~--~~~~~f~~~~~~~~~---------~~a~-~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~ 68 (324) T protein:vir:96 1 MEQTQKLK--LNLQHFASNNVKPQV---------FNPD-NVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG 68 (324) T ss_pred CCcchhhh--HHHHHHHHhhhhhhh---------cccc-cccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccC Confidence 00111111 111223333332211 1111 12233345668999999999999999999999999999999 Q ss_pred CceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 204 KILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSI 283 (458) Q Consensus 204 ~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~ 283 (458) +..+||+..+.+.+.|++|++. +++++++|++|++.++|++++++||+|+|+|+.++++++|.++|++++++++ T Consensus 69 ~~~~~p~~~~~~~a~~v~Eg~~------~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~ 142 (324) T protein:vir:96 69 TEKKFTFWADKPGAYWVGEGQK------IETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred CceEEEEEecCcceeeecCCcc------ccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999988754 4567899999999999999999999999999999999999999999999999 Q ss_pred HHHHhccCCCC-ccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccc Q lcl|NC_010583. 284 EEAFMSGNGTG-QPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEW 362 (458) Q Consensus 284 d~~~l~G~g~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~ 362 (458) |.++|+|+|++ .|.|+.+....... ......+++++.++...+...+..+..|+||+.++..|++++|++| T Consensus 143 d~~~l~G~g~~~~~~~~~~~~~~~~~--------~~~~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G 214 (324) T protein:vir:96 143 DEAGILNQGNNPFGKSIAQSIKKTNK--------VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPET 214 (324) T ss_pred HHHhhhcCCCCCcCccccccccccce--------ecccccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCC Confidence 99999999875 57777764332211 1112345677888888888888899999999999999999999999 Q ss_pred ccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeecc----------------c Q lcl|NC_010583. 363 QDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERER----------------Q 426 (458) Q Consensus 363 ~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~----------------~ 426 (458) +++++. +.+++|+|+||+++...+ .+...+++++++.+.++.+.++++..++ + T Consensus 215 ~~~~~~--------~~~~~l~G~PV~~~~~~~---~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:96 215 KERIYD--------RNSDSLDGLPVVNLKSSN---LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred CeeecC--------CCCCcccceeeEeecCCC---CCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhh Confidence 998742 235689999999876544 3455678899999999999998887643 3 Q ss_pred ccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 427 AGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 427 ~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |.+|++.||++.|+|+++.+|+||++++.|.. T Consensus 284 ~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:96 284 FEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred hhcCcEEEEEEEEeccEEecccceEEEecccc Confidence 67899999999999999999999999998888 No 97 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=1.9e-51 Score=298.68 Aligned_cols=285 Identities=13% Similarity=0.018 Sum_probs=212.8 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCccccccccccccccccccccccccee Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTE 241 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~ 241 (458) +.+ .++.||++||++++++|++.+++.++|+++++++|++++..+||+.++.+.++|++|++. +++++++|++ T Consensus 1 Mat-~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~------~~~~~~~f~~ 73 (311) T protein:vir:99 1 MAT-FGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQ------KSSTTGEFDF 73 (311) T ss_pred Cce-ecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCcc------cccccceeeE Confidence 223 335677899999999999999999999999999999999999999999999999998764 4566899999 Q ss_pred eeeehhheeeeehhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccch Q lcl|NC_010583. 242 ISFKTYKLAAKSFITDETEE---DAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADG 318 (458) Q Consensus 242 v~~~~~k~~~~~~is~ell~---ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~ 318 (458) +++.++|++++++||+|+|+ |+.++|+++|+++|++++++++|.++|+|+|++++.++................... T Consensus 74 v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~ 153 (311) T protein:vir:99 74 VTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTAD 153 (311) T ss_pred EEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeecccc Confidence 99999999999999999995 667899999999999999999999999999987655544322211111111111111 Q ss_pred hhHHHHHHHHHHHhhhhhh--hcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccccccc Q lcl|NC_010583. 319 SVLVTAKTISKLRRKLGRH--GLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAK 396 (458) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~ 396 (458) .......++.+++..+... ......|+||+.++..|++++|.+|+|+|+.... .+.+++|+|+||++++.+|.. T Consensus 154 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~----~~~~~~l~G~Pv~~s~~i~~~ 229 (311) T protein:vir:99 154 TIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGL----GIGVSSFEGIDASVSDTVNGG 229 (311) T ss_pred ccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCccc----CCCCceecceeeEeecccccc Confidence 1111122333333332222 1233459999999999999999999999864333 345679999999999988732 Q ss_pred -----------ccCCceEEEEEece-EEEEecceeEEeecc---------cccCCceEEEEEEeeccEEecccceEEEEe Q lcl|NC_010583. 397 -----------AASAEFAVIVYKDN-FVMPRQRAVTVERER---------QAGKQRDAYYVTQRVNLQRYFENGVVSGAY 455 (458) Q Consensus 397 -----------~~~~~~~~~~~~~~-~~i~~~~~~~i~~~~---------~~~~~~~~~~~~~r~d~~~~~~~afv~l~~ 455 (458) .++....++++++. +.++.+.++++...+ +|.+|++.||++.|+|+++.+| +|++++. T Consensus 230 ~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~ 308 (311) T protein:vir:99 230 DEADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIEN 308 (311) T ss_pred cccccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecCh-hHeeeec Confidence 12333455677764 456666666665433 3789999999999999999996 6888888 Q ss_pred ecC Q lcl|NC_010583. 456 AAA 458 (458) Q Consensus 456 aaa 458 (458) ++| T Consensus 309 ~~A 311 (311) T protein:vir:99 309 AVA 311 (311) T ss_pred ccC Confidence 888 No 98 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=7.1e-51 Score=295.53 Aligned_cols=279 Identities=15% Similarity=0.088 Sum_probs=230.3 Q ss_pred hccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccC-ceEEEEecCCCccccccc Q lcl|NC_010583. 144 MEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSK-ILTMLVEPEAGRATWVDA 222 (458) Q Consensus 144 ~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~e 222 (458) +. ....++.+. .+++.++.+||++++++|++.+++.++|+++|+++|+++. ...+|+..+++.++|++| T Consensus 1 m~---------~~~~~~~~~-~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 70 (297) T protein:vir:95 1 MT---------VQTFNPENV-LVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNE 70 (297) T ss_pred CC---------ccccccccc-cccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeec Confidence 10 011122222 2344566789999999999999999999999999998765 467888888899999998 Q ss_pred ccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccc Q lcl|NC_010583. 223 SKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKL 302 (458) Q Consensus 223 ~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~ 302 (458) ++. +++++++|++|++.++|++++++||+|+++|+.++++++|.++|++++++++|.++|+|+|+++|.||++. T Consensus 71 g~~------~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~ 144 (297) T protein:vir:95 71 TEK------IKTDKPEVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKA 144 (297) T ss_pred Ccc------ccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccc Confidence 764 44567899999999999999999999999999999999999999999999999999999999999999976 Q ss_pred ccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCee Q lcl|NC_010583. 303 AADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRI 382 (458) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l 382 (458) ...... ......+++++.++..++...+..++.|+||+.++..|++++|.+|+|+++. .+++| T Consensus 145 ~~~~~~--------~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~---------~~~~l 207 (297) T protein:vir:95 145 AKDANK--------VIGGPINYDNILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDK---------AANTI 207 (297) T ss_pred ccccce--------ecccccCHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecC---------CCCcc Confidence 543221 1122345678888999999999999999999999999999999999998742 23679 Q ss_pred ecccceecccccccccCCceEEEEEeceEEEEecceeEEeeccc----------------ccCCceEEEEEEeeccEEec Q lcl|NC_010583. 383 YGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERERQ----------------AGKQRDAYYVTQRVNLQRYF 446 (458) Q Consensus 383 ~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~----------------~~~~~~~~~~~~r~d~~~~~ 446 (458) +|+||+.+...+ .+...+++++++.+.++.+.++++..+++ |.+|++.||++.|+|+++++ T Consensus 208 ~G~Pv~~~~~~~---~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~ 284 (297) T protein:vir:95 208 DGITTVDLKSAR---FEKGDLLAGDFDNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITK 284 (297) T ss_pred cceeeEeecCCC---CCCceEEEEecccEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeec Confidence 999999876544 34556788999999999999888765332 67899999999999999999 Q ss_pred ccceEEEEeecC Q lcl|NC_010583. 447 ENGVVSGAYAAA 458 (458) Q Consensus 447 ~~afv~l~~aaa 458 (458) |+||++||.|+= T Consensus 285 ~~a~~~l~~at~ 296 (297) T protein:vir:95 285 TDAFAKLTPAER 296 (297) T ss_pred ccceEEEeecCC Confidence 999999998888 No 99 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=1.1e-42 Score=250.62 Aligned_cols=295 Identities=13% Similarity=0.081 Sum_probs=229.9 Q ss_pred hHHHHHHH-HhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceee-eccCceEEEEecCC-Ccccccccccc Q lcl|NC_010583. 149 FETEHGKA-HIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELP-MSSKILTMLVEPEA-GRATWVDASKF 225 (458) Q Consensus 149 ~~~~~~~~-~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~p~~~~~-~~a~~v~e~~~ 225 (458) .+..++.. ..+.+ +.+..+||+++|+++. ++++.+++.+++++++++++ +.+....+|....+ ....|..++ T Consensus 1 ~~~~~~~~~~~k~i--t~~d~~gG~L~P~~~~-~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~-- 75 (314) T protein:vir:41 1 MDFLNKPFQITPKI--DVPDLGKGILAVQRFG-EFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTS-- 75 (314) T ss_pred CchhhhHHHhhccc--ccccCCCceeChHHHH-HHHHHHHhccchhhheeeecccCccceeecccccCcccccccccc-- Confidence 00011111 11222 2344568999999874 79999999999999999986 46777888876533 222333222 Q ss_pred cccccccccccccceeeeeehhheeeeehhhHHHHhccHH--HHHHHHHHHHHHHHHHHHHHHHhccCCC--------Cc Q lcl|NC_010583. 226 GTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIF--SLLPLLRKRLIEAHAVSIEEAFMSGNGT--------GQ 295 (458) Q Consensus 226 ~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~--~~~~~i~~~la~~~~~~~d~~~l~G~g~--------~~ 295 (458) .+.+..++++++|+++++.+|++...++||+++|+|+.. +|+++|...|++++++.++..+++|+|+ ++ T Consensus 76 -~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~ 154 (314) T protein:vir:41 76 -GTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRI 154 (314) T ss_pred -cCCccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhc Confidence 233455678999999999999999999999999999965 8999999999999999999999999985 37 Q ss_pred cccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcc---cceeEechhHHHHHHhhhcccccccccccccc Q lcl|NC_010583. 296 PKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLK---LSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDA 372 (458) Q Consensus 296 p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~ 372 (458) |.||++.+... .+ ... ..+.....+.+.++++++++.|+. +.+|+||+.+...++++.+..++++++.... T Consensus 155 p~G~l~~a~~~--~~-~~~--~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~- 228 (314) T protein:vir:41 155 NDGWMKLAGNQ--YT-DAE--PEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALI- 228 (314) T ss_pred chhhhhhcccc--ee-ecC--ccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhh- Confidence 89999865322 11 111 112234456678899999999975 5579999999999999999999998876544 Q ss_pred ccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEE Q lcl|NC_010583. 373 VKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVS 452 (458) Q Consensus 373 ~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~ 452 (458) .+.+.+|+|+||+.+++||...++...+++++++.|.++.+.++++..+.++.++++.|.+..|+|+.+.+++|.|+ T Consensus 229 ---~~~~~~l~G~PV~~~~~~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~ 305 (314) T protein:vir:41 229 ---GATGLQYDGIPIQYVPALDALGDDKARALLTVPTNLVYGFWRNIRIEPKRDAAMRRTEYIASLRADCNYEDENAAVA 305 (314) T ss_pred ---CCCCceecceeeEecccccccCCCCceEEEechhheEEEeeceeEEeecccCcCCeEEEEEEEEeceEEEEcCcEEE Confidence 35577899999999999999888999999999999999999999999999999999999999999999998877655 Q ss_pred --EEeecC Q lcl|NC_010583. 453 --GAYAAA 458 (458) Q Consensus 453 --l~~aaa 458 (458) ++.+.| T Consensus 306 ~~~~~~~~ 313 (314) T protein:vir:41 306 AVIDMSSG 313 (314) T ss_pred EEeeccCC Confidence 445555 No 100 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=3.6e-42 Score=247.78 Aligned_cols=300 Identities=13% Similarity=0.065 Sum_probs=224.0 Q ss_pred HHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceee-eccCceEEEEecCC- Q lcl|NC_010583. 137 LVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELP-MSSKILTMLVEPEA- 214 (458) Q Consensus 137 ~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~p~~~~~- 214 (458) .--...++.+..... .++++ .+..+||+++|+.+. ++|+.+.+.++++++|++++ +++....++....+ T Consensus 1 ~~~~~~~~~~~~~~~------~k~~t--~~d~~Gg~l~P~~~~-~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~ 71 (315) T protein:vir:41 1 MLTIEDIRGGKPFEI------VPKID--VPDLGRGVLSVDRFG-EFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVL 71 (315) T ss_pred CcccchhhcCChhhh------hhhcC--CcCCCCceechHHHH-HHHHHHHhhhhhhhhceeeeccccccccccccccCc Confidence 001122332222211 12222 334578999998865 59999999999999999865 44433333332211 Q ss_pred CcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_010583. 215 GRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAI--FSLLPLLRKRLIEAHAVSIEEAFMSGNG 292 (458) Q Consensus 215 ~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~la~~~~~~~d~~~l~G~g 292 (458) ....+ ..+..+.+..++++|+|+++++.++++.+.+.||+++|+|+. ++|+++|...+++++++.++..+++|+| T Consensus 72 ~~~~g---~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg 148 (315) T protein:vir:41 72 DVGPG---RDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDT 148 (315) T ss_pred ccccc---cccccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 11111 112233345667889999999999999999999999999986 4999999999999999999999999998 Q ss_pred C------CccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcc---cceeEechhHHHHHHhhhccccc Q lcl|NC_010583. 293 T------GQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLK---LSKLVLIVSMDAYYDLLEDEEWQ 363 (458) Q Consensus 293 ~------~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~l~~~~d~~~~ 363 (458) + ++|.||++.+..... ....+. .+.....+.+.++++++++.|+. +++|+||+.++..++++++.+|+ T Consensus 149 ~s~~p~~~~~~G~l~~a~~~~~--~~~~~~-~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~ 225 (315) T protein:vir:41 149 SSSDPLLRMSDGWLKLASEKLT--ESDVDP-EAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRET 225 (315) T ss_pred cCcCccccccccceeccccccc--cccccc-ccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCC Confidence 5 467899986543221 111111 12223456788999999999974 56899999999999999999999 Q ss_pred cccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccE Q lcl|NC_010583. 364 DVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQ 443 (458) Q Consensus 364 ~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~ 443 (458) |+|++... .+.+.+|+|+||..++.||+...+...++++++++|.++.+.++++.++.++.++.+.|....|+|+. T Consensus 226 ~lw~~~~~----~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a~~~~~~~~~~~r~d~~ 301 (315) T protein:vir:41 226 GLGDQALT----GANSILYDGRPVQYVPALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYDAEMRLTKYVASLRTDNH 301 (315) T ss_pred ccccchhh----cCCCceecccceEecccccccCCCCccEEEecccceEEEeccccEEEeeecCCCCceEEEEEEEecee Confidence 99976544 35678999999999999999888888899999999999999999999999999999999999999999 Q ss_pred EecccceE--EEEe Q lcl|NC_010583. 444 RYFENGVV--SGAY 455 (458) Q Consensus 444 ~~~~~afv--~l~~ 455 (458) +.++++.| .+|+ T Consensus 302 ~~~~~~~a~~~~~v 315 (315) T protein:vir:41 302 YEDEEGAVSATITV 315 (315) T ss_pred EEeccceeEeeeeC Confidence 88887744 4555 No 101 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=7.1e-37 Score=218.76 Aligned_cols=389 Identities=14% Similarity=0.086 Sum_probs=208.0 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) .|+-...-..-...++...... . +..+.....++.. +..+...+...+++++.+... T Consensus 120 v~~pa~~~a~I~~vke~~~~e~--~---~~~~~~a~~ee~~----------------e~~~k~~el~a~l~~~~~~~~-- 176 (517) T protein:vir:97 120 TPNPSNKNAVVTYFREEKKKEE--N---KMTFDQNLMQELL----------------DAKKLAADLNAKLKERENGGD-- 176 (517) T ss_pred cchhhhhhhhhhhhhhhhhhhh--h---hhhhhhhhhhhhh----------------hhhhhHHHHHHHHHHHHHHHH-- Confidence 3332222111111111100000 0 0000000000000 000111111111111111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhh Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKA 160 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a 160 (458) +...+....+..++...+...... ............+..........+......+.. .... T Consensus 177 -~~~~e~~~~l~a~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~ 237 (517) T protein:vir:97 177 -NAALKTVSELAANLMKQRESEKIL----------GVEALKVTPEATEFLKTREAEVAYMSASLTKDP--------KAAW 237 (517) T ss_pred -HHHHhhhhhhhhhHHHHHHhhhhc----------ccccccccchhhHHHHHHHHHHHHHHhcccccc--------ccee Confidence 000111111111111100000000 000000000000000000000000000000000 0000 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccce Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLT 240 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~ 240 (458) .........+++.+|..+...+...+...+++.+++++.+.. ...+|.......+.|+.|| +.+++++++|+ T Consensus 238 ~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~--~~~~~~~~~~~~a~~~~eG------~~kp~s~~tf~ 309 (517) T protein:vir:97 238 TAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP--TLVVGGDNALTQGTGHTTG------TDKTESNITLQ 309 (517) T ss_pred eeecccccccccccchHHHHHHHHhhhhhccceeeeeecccc--ceeeecccccceeeeeecC------Cccccccccee Confidence 111223445789999999999999999999888887765443 3456666666666677665 45667889999 Q ss_pred eeeeehhheeeeehhhHHHHhccHHH----HHHHHHHHHHHHHHHHHHHHHhccCCCC-ccccccccccccccceeeccc Q lcl|NC_010583. 241 EISFKTYKLAAKSFITDETEEDAIFS----LLPLLRKRLIEAHAVSIEEAFMSGNGTG-QPKGLLKLAADDGAKVVTEAK 315 (458) Q Consensus 241 ~v~~~~~k~~~~~~is~ell~ds~~~----~~~~i~~~la~~~~~~~d~~~l~G~g~~-~p~Gi~~~~~~~~~~~~~~~~ 315 (458) .+++.++++++++++|+++|+|+.++ |++||.++|++.++++++.+||+|+|++ .+.|++..+....... ... T Consensus 310 ~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~--~~~ 387 (517) T protein:vir:97 310 TRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATN--VTG 387 (517) T ss_pred eEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCccccccccccccccccc--ccc Confidence 99999999999999999999998877 9999999999999999999999999986 4568876543221111 111 Q ss_pred cchhhHHHHHHHHHHH-hhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccccc Q lcl|NC_010583. 316 ADGSVLVTAKTISKLR-RKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFP 394 (458) Q Consensus 316 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~ 394 (458) .+ ...+++..+ .++. ...++.|+||+.+|..|+++||++|+|+|+..... +.+.+++|. +..+| T Consensus 388 ~~-----~~~d~i~~l~~a~~--~a~~a~~vmn~~t~~~I~klKD~~G~Yl~~~~~~~----~~~~~l~G~----~~~~~ 452 (517) T protein:vir:97 388 TT-----NIQELLEKLSVATP--KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSN----QTIATHFGF----NRLVQ 452 (517) T ss_pred cc-----hHHHHHHHHHHHhh--hccCCEEEECHHHHHHHHHhhcCCCCeeccCcCCc----ccccccCCc----ccccc Confidence 11 111222211 1111 12467899999999999999999999999664433 344567773 22333 Q ss_pred ccccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEE--eecC Q lcl|NC_010583. 395 AKAASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGA--YAAA 458 (458) Q Consensus 395 ~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~--~aaa 458 (458) ....+. ..+++.+.|.++++.++....+.+..+|++.|+...|+++.++.|++|+++. +.+| T Consensus 453 ~~~~~~--~~~~~~~~y~i~~~~g~~~~~~fd~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~~ 516 (517) T protein:vir:97 453 SVAVDE--KTAVSLSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVA 516 (517) T ss_pred ccccCc--eeEeeccccEEEeecceeeeeeeecccCceeEeeeeeeccccccccceEEEEEcCCCC Confidence 322222 3455778899999998888777667789999999999999999999988654 4555 No 102 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=1e-33 Score=201.48 Aligned_cols=295 Identities=11% Similarity=0.064 Sum_probs=209.4 Q ss_pred HHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecC Q lcl|NC_010583. 134 VEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPE 213 (458) Q Consensus 134 ~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~ 213 (458) ..++.|.+++.. ... ....+.+...+|++||+.+..+|++.+...++++++++++++.+....+|.... T Consensus 1 ~~~k~~~~~l~~----------~~~-~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~ 69 (321) T protein:vir:31 1 MASRTINNDLSR----------ITE-KNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNI 69 (321) T ss_pred CchHHHHHHHHH----------HHH-hccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeecc Confidence 222223332211 111 111223345567788889999999999999999999999999999999998877 Q ss_pred CCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_010583. 214 AGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAI--FSLLPLLRKRLIEAHAVSIEEAFMSGN 291 (458) Q Consensus 214 ~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~la~~~~~~~d~~~l~G~ 291 (458) +....|+++.. ....+.++|+|+++++.++++.+.++||+++|+|+. ++|+++|.+.++++++..++..+++|+ T Consensus 70 ~~~~~~~~~e~----~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd 145 (321) T protein:vir:31 70 GERHRRPQDEG----EWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGD 145 (321) T ss_pred CCccccccccc----ccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeecc Confidence 76677765322 123345689999999999999999999999999974 589999999999999999999999999 Q ss_pred CCCcc------ccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcc--cceeEechhHHHHHH-hhhcccc Q lcl|NC_010583. 292 GTGQP------KGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLK--LSKLVLIVSMDAYYD-LLEDEEW 362 (458) Q Consensus 292 g~~~p------~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~-~~~d~~~ 362 (458) |++.| .||++.+........ .+ ....+.+.+.++...+++.|+. +.+|+||+.++..+. .+++.+ T Consensus 146 ~~~~~~~~~~n~G~l~~a~~~~~~~~--~~---~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~- 219 (321) T protein:vir:31 146 EDAEDSFENQNDGFITVAEGDVETID--AA---DDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRD- 219 (321) T ss_pred ccCCCcccccchhhhhhhcccccccc--cc---ccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCC- Confidence 87554 688875543322111 11 1223345677888899998874 458999999987655 456644 Q ss_pred ccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeeccc---ccCCceEEE--EE Q lcl|NC_010583. 363 QDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERERQ---AGKQRDAYY--VT 437 (458) Q Consensus 363 ~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~---~~~~~~~~~--~~ 437 (458) .++++... ..+.+.+|+|+||+.+++||. ..+++.+++++.++.+.++++.+... .......++ .. T Consensus 220 ~~~~~~~l----~~~~~~tl~G~pvv~~~~mP~-----~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 290 (321) T protein:vir:31 220 TPLGDNVI----MGEADVNPFSFPIIGSGLWPD-----DKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMR 290 (321) T ss_pred Cccccchh----hccccccccceeEEEcCCCCC-----CcEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeee Confidence 46654432 234566899999999999995 34677888888877777777655332 222233444 44 Q ss_pred EeeccEEecccceEEEE-eecC Q lcl|NC_010583. 438 QRVNLQRYFENGVVSGA-YAAA 458 (458) Q Consensus 438 ~r~d~~~~~~~afv~l~-~aaa 458 (458) .++|+.+-++++++.++ +.-. T Consensus 291 ~~~~~~ve~~~a~a~~~~i~~~ 312 (321) T protein:vir:31 291 GDDDFAIENTEAVVLAEGLGDP 312 (321) T ss_pred eecceeEeccccEEEEecCCcc Confidence 46888999999998877 2333 No 103 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=99.96 E-value=4.9e-32 Score=192.23 Aligned_cols=365 Identities=11% Similarity=0.014 Sum_probs=180.9 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) .++-...--+-...++.....+ ......+..+...+ ..+.+....+..+.+.++ .+. T Consensus 107 v~~pa~~~a~v~~vks~~~~~e-------~~~~~~e~~e~~~e------------~~e~~~~~~el~akl~el----~k~ 163 (480) T protein:vir:40 107 TPLPSNKGAKVTKVREENKGEQ-------EQMGANETQEIMKQ------------AIEAGVKVRELEAKVEEL----NKE 163 (480) T ss_pred eecccchhhhhhhhhhhhhhhh-------hhhhhHHHHHHHHh------------hhhhhhhhhhHHHHHHHH----HhH Confidence 2222222111111111000000 00000000000000 000000000000000000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhh Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKA 160 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a 160 (458) .+... ..+...... . ...... ..+. .....+...+........ ...+ T Consensus 164 ~ee~k---~~~~~~~~~-~-~~~~~~-~~e~----------------------r~~~~~~~~~~e~~~~~~-----~~~~ 210 (480) T protein:vir:40 164 REELK---KEREASIPS-E-KPEDAE-RKFM----------------------RELGSKMAEMPEQGFLRE-----FANG 210 (480) T ss_pred HHHHh---hhhhhhccc-c-chhhhh-hHHH----------------------HHHHHHhccchhhhhhhh-----hhhh Confidence 00000 000000000 0 000000 0000 000000000000000000 0001 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccce Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLT 240 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~ 240 (458) . ..+...+++.++| .+.+.+........++...++.. ..+.....|++|+...+.. ....++. T Consensus 211 ~-~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~-----------~~g~~~~~~~~e~~~~~~~----~~~~~~~ 273 (480) T protein:vir:40 211 A-DLNVVNSLGSITS-KYARKSGIYDGAMKARFQGLTLA-----------EDGVDDTFISGTFKAGTDK----NKSQTAT 273 (480) T ss_pred c-ccccccccccccc-chhhheeechhhhhhhhhcceee-----------eccccceeeeeeeeccccc----ccccccc Confidence 1 1122233444444 44443333333333333322211 1233446677766433321 1122344 Q ss_pred eeeee---hhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccC--CCCccccccccccccccceeeccc Q lcl|NC_010583. 241 EISFK---TYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGN--GTGQPKGLLKLAADDGAKVVTEAK 315 (458) Q Consensus 241 ~v~~~---~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~--g~~~p~Gi~~~~~~~~~~~~~~~~ 315 (458) +.++. .+++.+...+|.++|+|+. +|++||.++|++.++.+++.+||+|+ |++.+.||.+...... . T Consensus 274 ~~~~~~~~v~~l~~~~k~t~~lLDDa~-~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~~~~~~------~- 345 (480) T protein:vir:40 274 KRSLRPQMAEAYLQMDKATVRGVNDSG-ALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGLKTATDGWT------K- 345 (480) T ss_pred cchhhHHHHHHHHHhHHHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceeeccccc------c- Confidence 44444 5788888899999999987 79999999999999999999999995 4556777754322110 0 Q ss_pred cchhhHHHHHHHHHHHhhhhhhhcccc-eeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccccc Q lcl|NC_010583. 316 ADGSVLVTAKTISKLRRKLGRHGLKLS-KLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFP 394 (458) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~ 394 (458) .....+.+..+.+++...|+.++ .|+||+.+|+.++++||.+|+|+|++.. ..+.+.+|||+||++++.+. T Consensus 346 ----~~~~~d~id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD~~G~Yi~q~~~----~~~~~~~llG~pvv~~~~~~ 417 (480) T protein:vir:40 346 ----QIEYTDLFEGITDAVAECSISDAITIVMSPQTFAELRKAKGTDGHSRFNELA----TKEQIAQSFGAVNLETRVWM 417 (480) T ss_pred ----cchhHHHHHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhcCCCCeeccCcc----cccCcceecccceeeeeccc Confidence 11112345568888888888877 6999999999999999999999997643 34667899999988764432 Q ss_pred ccccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 395 AKAASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 395 ~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .. ....+.++...+.++++ +++...+..+..++..|..+.|+++.+..|++|+.++..++ T Consensus 418 ~~---~~~~~~~~~~~~~~~d~-~~~~~~~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~ 477 (480) T protein:vir:40 418 PK---DEVAVYNHDEYVLIGDL-NVENYNDFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGS 477 (480) T ss_pred cC---CcceeeeCCccEEEEec-ccceecccccccchhhhhhhhhhceeeEccccEEEEEeccC Confidence 21 12223333345566664 56665555667899999999999999999999999999999 No 104 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.96 E-value=8.8e-32 Score=190.85 Aligned_cols=265 Identities=17% Similarity=0.146 Sum_probs=201.9 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceee----eccCceEEEEecCCCcccccccccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELP----MSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) +..+++..+..++|+.++..|++.+...+.+.+++.+.. ..+...++|++...+.+.|++||. ..+.+++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~------~i~~~~~ 74 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGE------AIPMTQL 74 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCC------ccccccc Confidence 333344566789999999999999998888888876532 234458899998888899998875 3456688 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) +|+++++.+++++..+++|+++..++..++.+++.+++++++++.+|..++..-.. +.. ... T Consensus 75 ~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~---------a~~-----~~~---- 136 (272) T protein:vir:30 75 GFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSK---------STQ-----TVE---- 136 (272) T ss_pred ccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcc---------ccc-----ccc---- Confidence 99999999999999999999999999999999999999999999999999853110 000 000 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKA 397 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 397 (458) ...+++.+.++...+...+..+..|+|||.++..|.+....+.......+. .....|..++++|+||++++++|.. T Consensus 137 --~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~-~~~~~g~ig~i~G~~Vi~s~~~p~~- 212 (272) T protein:vir:30 137 --ATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGA-NRVVSGVYGEVLGVQIVRSRKCPKG- 212 (272) T ss_pred --cccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccc-cccccccchhhcCeeEEEcCCCCcc- Confidence 112344566666677777777788999999999887654332221111111 2334456679999999999999842 Q ss_pred cCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 398 ASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 398 ~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..+++....+.+..+.+++++.+....++...+++..|+++++.+|++||++|+++| T Consensus 213 ----t~~~~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a 269 (272) T protein:vir:30 213 ----TAYMVRKGALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDA 269 (272) T ss_pred ----eEEEEcCCeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEeccc Confidence 234444557777888888888888888899999999999999999999999999999 No 105 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.96 E-value=8.8e-32 Score=190.85 Aligned_cols=265 Identities=17% Similarity=0.146 Sum_probs=201.9 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceee----eccCceEEEEecCCCcccccccccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELP----MSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) +..+++..+..++|+.++..|++.+...+.+.+++.+.. ..+...++|++...+.+.|++||. ..+.+++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~------~i~~~~~ 74 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGE------AIPMTQL 74 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCC------ccccccc Confidence 333344566789999999999999998888888876532 234458899998888899998875 3456688 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) +|+++++.+++++..+++|+++..++..++.+++.+++++++++.+|..++..-.. +.. ... T Consensus 75 ~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~---------a~~-----~~~---- 136 (272) T protein:vir:98 75 GFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSK---------STQ-----TVE---- 136 (272) T ss_pred ccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcc---------ccc-----ccc---- Confidence 99999999999999999999999999999999999999999999999999853110 000 000 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKA 397 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 397 (458) ...+++.+.++...+...+..+..|+|||.++..|.+....+.......+. .....|..++++|+||++++++|.. T Consensus 137 --~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~-~~~~~g~ig~i~G~~Vi~s~~~p~~- 212 (272) T protein:vir:98 137 --ATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGA-NRVVSGVYGEVLGVQIVRSRKCPKG- 212 (272) T ss_pred --cccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccc-cccccccchhhcCeeEEEcCCCCcc- Confidence 112344566666677777777788999999999887654332221111111 2334456679999999999999842 Q ss_pred cCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 398 ASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 398 ~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..+++....+.+..+.+++++.+....++...+++..|+++++.+|++||++|+++| T Consensus 213 ----t~~~~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a 269 (272) T protein:vir:98 213 ----TAYMVRKGALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDA 269 (272) T ss_pred ----eEEEEcCCeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEeccc Confidence 234444557777888888888888888899999999999999999999999999999 No 106 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.85 E-value=4.3e-23 Score=143.20 Aligned_cols=265 Identities=15% Similarity=0.080 Sum_probs=191.4 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceeee----ccCceEEEEecCCCcccccccccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPM----SSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) +....+..+..++|+.+.+-+.+.+.....+.+++++... .+...++|++...+.+.++.|+. ..+.++. T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~------~i~~~~i 74 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGE------KIPTDIL 74 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCC------ccccccc Confidence 3344556667899999999999888888888888765432 13367899987666666766654 4456678 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) ++++.++..++.+..+.++++...++..++.+.+.+++++++++.+|..++..-.+... .+.+ T Consensus 75 t~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~-------------~~~~---- 137 (274) T protein:vir:93 75 ETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-------------TVNA---- 137 (274) T ss_pred ccceeEEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------------cccc---- Confidence 89999999999988899999999998889999999999999999999999854211100 0000 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccccccccccc-ccccccccCCeeecccceeccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGN-DAVKLQGQVGRIYGLPVVVSEYFPAK 396 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~-~~~~~~~~~~~l~G~pv~~~~~~~~~ 396 (458) ....++.+.++...+.........++|||..+..|.+ +...++.-.... ......|..++++|+||++++.+|.. T Consensus 138 --~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k--~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~ 213 (274) T protein:vir:93 138 --DITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRG--DASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG 213 (274) T ss_pred --cccCHHHHHHHHHHhhhccCCccEEEeCHHHHHHHHh--hhhhcccccccccccceeecccceecCeeEEEcCCCCcc Confidence 1123445556666666655567788999999988864 322222211111 12234556688999999999999842 Q ss_pred ccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 397 AASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 397 ~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..+++....+.++....+.++.+....+....+++..++++++++|+++++++++++ T Consensus 214 -----t~~l~~~gai~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~ 270 (274) T protein:vir:93 214 -----TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred -----eEEEEeCCeEEEEecCCcccccccchhhcccEEEEEEEEEEEEEcCCceEEEeeCcc Confidence 234444456666667777888777777888899999999999999999999999999 No 107 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.85 E-value=3.5e-23 Score=143.66 Aligned_cols=267 Identities=15% Similarity=0.086 Sum_probs=190.2 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceeeec----cCceEEEEecCCCcccccccccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMS----SKILTMLVEPEAGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~----~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) +..+.+.....++|+.+.+-+.+.+.....+.+++.+.+.. +...++|.+.....+.++.|+ ...+..+. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg------~~i~~~~l 74 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEG------GEISLDKI 74 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCC------CccChhhc Confidence 33334555678999999998888888888888887664432 345789998765556566655 44556677 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) ++++.++..++.+..+.++++...++..++.+.+.++++.++++.+|+.++..- ..+.. . T Consensus 75 t~~~~~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l---------~~~~~------~----- 134 (272) T protein:vir:36 75 GTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAA---------KTTSQ------T----- 134 (272) T ss_pred CCcceeEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHh---------ccccc------c----- Confidence 889999999999888999999988888899999999999999999999887531 00000 0 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKA 397 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 397 (458) .....+++.+.++...+.........++|||..+..|.+.... ..............|..++++|+||++++.+|... T Consensus 135 ~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~--~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~ 212 (272) T protein:vir:36 135 VSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANA--KNIGSEVGANALINGTYADVLGAQIVRSKKLAEGS 212 (272) T ss_pred ccccccHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHhccccc--ccccccccccceeeeccceecCeeEEEeCCCCCCc Confidence 0111234456666666666666667789999998888654332 22211222222344556789999999999999643 Q ss_pred cCCceEEEEEe-ceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 398 ASAEFAVIVYK-DNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 398 ~~~~~~~~~~~-~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) . ....+.++ ..+.++...++.++.+++..+....+++..++++++.+|+++|++|++.- T Consensus 213 ~--~~~~~~~~~gA~~~~~~~~~~vE~~R~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 213 A--LMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred e--eEEEEEecccceeeeecCCcccccccchhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 2 22222333 34555666777888877777788899999999999999999999999999 No 108 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.82 E-value=1.3e-21 Score=135.11 Aligned_cols=265 Identities=15% Similarity=0.097 Sum_probs=187.8 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceeee----ccCceEEEEecCCCcccccccccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPM----SSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) +....+..+..++|+.+++-+.+.+.....+.++++..+. .+...++|++...+.+..+.++ +..+..+. T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g------~~i~~~~i 74 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEG------EKIPVDQI 74 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCC------CcCchhhc Confidence 2333345567899999999999988888777777755332 2446789988654454444444 44556677 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) +++..++..++.+..+.++++....+..++.+.+.++++.++++.+|..++..- +.++.. +. T Consensus 75 t~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l---------~~a~~~----~~----- 136 (274) T protein:vir:96 75 GTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEAL---------KGATLT----VE----- 136 (274) T ss_pred ccceeEEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHH---------hcCCCC----cC----- Confidence 888888898888888899999988888899999999999999999999888531 101000 00 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccc-cccccccccCCeeecccceeccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVG-NDAVKLQGQVGRIYGLPVVVSEYFPAK 396 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~-~~~~~~~~~~~~l~G~pv~~~~~~~~~ 396 (458) ....+++.+.++...+......+..++|||..+..|.++... ++..... ..+....|..++++|++|++++.+|.. T Consensus 137 -~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~--~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~~ 213 (274) T protein:vir:96 137 -ADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASD--NFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKG 213 (274) T ss_pred -cccccHHHHHHHHHHhcccCCCceEEEeCHHHHHHHHhcccc--cccccccccccceeecccceecCeeEEEcCCCCcc Confidence 111235556666666665555667789999999988775321 1111111 112334566789999999999999853 Q ss_pred ccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 397 AASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 397 ~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..+++....+.++...++.++.+....+....+++..+++.++++|+++|+++.++| T Consensus 214 -----t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:96 214 -----EALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) T ss_pred -----eEEEEeCcceeeeecCCcccccccchhhcccEEEEeeEEEEEEEcCccEEEEEcCcc Confidence 223333445666667777787777777788899999999999999999999999999 No 109 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.81 E-value=2.2e-21 Score=133.83 Aligned_cols=271 Identities=14% Similarity=0.086 Sum_probs=183.9 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceeee----ccCceEEEEecCCCcccccccccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPM----SSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) +...++..+..++|+.|++-+.+.+.....+.+++..... .+...++|++...+.+.++.++. ..+..+. T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~------~i~~~~l 74 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGA------AIDYSAL 74 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCC------cCccccc Confidence 2233445567899999999999988888777777755332 23457899887555555555543 4455677 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccC-CCCccccccccccccccceeecccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGN-GTGQPKGLLKLAADDGAKVVTEAKA 316 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~-g~~~p~Gi~~~~~~~~~~~~~~~~~ 316 (458) ++++.++..++.+..+.++++...++..++.+.+.++++.++++.+|..++..- |+. ... ....+. T Consensus 75 t~~~~~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~----------~~~---~~~~t~ 141 (278) T protein:vir:80 75 ETESVKHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTT----------LEV---KGAINI 141 (278) T ss_pred ccceeeEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccc----------ccc---cccccc Confidence 889999998888888899999999988899999999999999999999888641 211 000 000001 Q ss_pred chhhHHHHHHHHHHHhhhhhhhcc-cceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccc Q lcl|NC_010583. 317 DGSVLVTAKTISKLRRKLGRHGLK-LSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPA 395 (458) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~ 395 (458) . .....+..+.++...+.....+ ...++|||..+..|.+....+...-... ..+....|..++++|++|++++.+|. T Consensus 142 ~-~~~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~-g~~~~~~G~ig~~~G~~Vi~s~~~p~ 219 (278) T protein:vir:80 142 G-LIDKIENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQL-GDDLLVKGAFGELLGWEIVRTKKLAD 219 (278) T ss_pred c-hhhhHHHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccc-cccceeeccceeecceeEEEcCCCCc Confidence 1 1112233444444454443333 2346789999888875432221111111 11233456678999999999999985 Q ss_pred cccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 396 KAASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 396 ~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) . ..++.....+.++....+.++.++...+....+++..+++.++++|+++|+++..|. T Consensus 220 ~-----t~~l~~~gAi~~~~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~ 277 (278) T protein:vir:80 220 G-----NALAVKAGALKTFLKRNLLAESGRDMDHKLTKFNADQHYAVALVDETKAVKVVPVAG 277 (278) T ss_pred c-----eEEEEeccceeeeecCCcccccccchhhccceeeeeeEEEEEEEcCcceEEEeeccC Confidence 2 223333345556666777788777777788899999999999999999999999999 No 110 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.79 E-value=8.1e-21 Score=130.74 Aligned_cols=267 Identities=15% Similarity=0.100 Sum_probs=188.5 Q ss_pred hhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeee----ccCceEEEEecCCCccccccccccccccccccc Q lcl|NC_010583. 159 KAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPM----SSKILTMLVEPEAGRATWVDASKFGTDETVGDE 234 (458) Q Consensus 159 ~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~ 234 (458) .+. .+.+.....++|+.+..-+.+.+.....+.+++.+-+. .+...++|++...+.+.++.|+ +..+. T Consensus 1 ~~~--~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g------~~i~~ 72 (275) T protein:vir:96 1 MAL--ENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEG------EEIPI 72 (275) T ss_pred CCC--cccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCC------CCcch Confidence 122 22344556899999999999999888888888866443 2345789988765555555554 34556 Q ss_pred ccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeecc Q lcl|NC_010583. 235 VKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEA 314 (458) Q Consensus 235 ~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~ 314 (458) .+.++++.+...++.+..+.++++....+..|+...+.++++.++++.+|..++.--++ +.. ... T Consensus 73 ~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~---------a~~----~~~-- 137 (275) T protein:vir:96 73 DLIETKKRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQG---------ATL----KVE-- 137 (275) T ss_pred hhcccceeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhc---------ccc----ccc-- Confidence 67788888899999988899999998888778999999999999999999998842111 100 000 Q ss_pred ccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccccc Q lcl|NC_010583. 315 KADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFP 394 (458) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~ 394 (458) ....+++.+.++...+.........++|||..+..|.++...+...-...+ ......|.-++++|++|++++.+| T Consensus 138 ----~~~~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g-~~~~~~G~ig~~~G~~Vi~s~~~p 212 (275) T protein:vir:96 138 ----ADITKLAGLQTAIDKFNDEDLEPMVLFVNPLDAGKLRASATDNFTRATLLG-DNVIVKGAFGEALGAIIVRSNKIK 212 (275) T ss_pred ----ccccCHHHHHHHHHHhccccCCccEEEeCHHHHHHHHhccccccccccccc-ccceeccccceecCeeEEEeCCCC Confidence 111335556666666655555666789999999988775422211111111 122345566889999999999998 Q ss_pred ccccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 395 AKAASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 395 ~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .. ..++.....+.++....+.++.++...+....+++..+++.++++|+++|++++..| T Consensus 213 ~~-----t~~i~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 271 (275) T protein:vir:96 213 EG-----EAILAKRGAVKLITKRDFFLETERHASHKSTALFSDKHYVAYLYDESKVVKITKSAS 271 (275) T ss_pred cc-----eEEEEeccceeeeecCCcccccccchhhcCcEEEEeEEEEEEEEcCccEEEEEeccc Confidence 53 123333345556667778888888887888899999999999999999999999999 No 111 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.79 E-value=1.4e-20 Score=129.43 Aligned_cols=360 Identities=11% Similarity=0.015 Sum_probs=206.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhh Q lcl|NC_010583. 52 VSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFE 131 (458) Q Consensus 52 ~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 131 (458) .+. . +.++++.- .. +.. ..+.++++...+......|. ..+.-+-.. T Consensus 1 ~~~----~-------~~~~~~~~--~~---~~~-------~~e~k~lr~~me~~et~~e~-----------~~~~~~~~~ 46 (393) T protein:vir:79 1 MEN----W-------LKQLKESG--FT---ETQ-------VQEQKSLRTRMERGETLAEA-----------DANKLALNE 46 (393) T ss_pred Cch----H-------HHHHHhcc--Cc---hhH-------HHHHHHHHHHhhhhhhhhhh-----------hhhhhhcch Confidence 111 1 11111000 00 000 00111111110000000000 000000000 Q ss_pred hHH-HHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEE Q lcl|NC_010583. 132 DEV-EKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLV 210 (458) Q Consensus 132 ~~~-~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~ 210 (458) .+. --..|.+-+. ++.-..+. ..+. .-++..+..+||..+++.|.+...+.....++...+....|...+-. T Consensus 47 ~e~el~E~f~Kmm~-G~~p~~eV---~~~e---~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~ 119 (393) T protein:vir:79 47 EETQILESFAKMME-GETPTNEV---NLRE---FMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFP 119 (393) T ss_pred hHHHHHHHHHHHhc-CCCchhhe---ehhh---hhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceecc Confidence 000 0122333222 32222221 1111 12344567899999999999999999888888888777666655555 Q ss_pred ecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010583. 211 EPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSG 290 (458) Q Consensus 211 ~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G 290 (458) ..+--.++-++||++.++. .-+..||+.|++..+|++..+.+|.|++.||.+++.+++...+.+++++..+..++++ T Consensus 120 ~~g~~Ra~~IgEGgE~~~~---sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~ 196 (393) T protein:vir:79 120 SIGIMRAYDVAEGQEIPED---SIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQ 196 (393) T ss_pred chheeeecccccccccccc---chhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhh Confidence 5556677788888766643 4567899999999999999999999999999999999999999999999999999999 Q ss_pred CCCC-c--cccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccc Q lcl|NC_010583. 291 NGTG-Q--PKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQ 367 (458) Q Consensus 291 ~g~~-~--p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~ 367 (458) .-++ + ..++.+.. ..-+++..-.....+++...+++++..++.+....+..++|||-.|..+.+-.--.+.+... T Consensus 197 fk~~ghtvfDa~st~t--~ahptGr~~~~~qNGTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na 274 (393) T protein:vir:79 197 FRSHGHTVFDNYSTNK--LAHTTGLDKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANP 274 (393) T ss_pred hhcccceeeeccccCc--cceeecCCccccccccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeecc Confidence 7543 2 34544321 22223333334567778889999999999999999999999999999887642211111111 Q ss_pred cccccccccccCCeeec-----------ccceecccccccccCCce-EEEEEeceEEEE-ecceeEEeecccccCCceEE Q lcl|NC_010583. 368 VGNDAVKLQGQVGRIYG-----------LPVVVSEYFPAKAASAEF-AVIVYKDNFVMP-RQRAVTVERERQAGKQRDAY 434 (458) Q Consensus 368 ~~~~~~~~~~~~~~l~G-----------~pv~~~~~~~~~~~~~~~-~~~~~~~~~~i~-~~~~~~i~~~~~~~~~~~~~ 434 (458) .+ +.++......+.+| +.|++|+.+|-......+ .+..+.+...+. ..-+++++.-+.-..|...+ T Consensus 275 ~g-N~~~~~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~i 353 (393) T protein:vir:79 275 YG-NYPAKGAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEKARGLQNI 353 (393) T ss_pred cc-ccCccccchhhhhchhhhccccccceeEEEecccccccccceeeEEEeecCCceEEEEecCcceeccccccccceee Confidence 11 11111112223333 678899998854443332 222333322111 11244444333345788999 Q ss_pred EEEEeeccEEecc-cceEE---EEeecC Q lcl|NC_010583. 435 YVTQRVNLQRYFE-NGVVS---GAYAAA 458 (458) Q Consensus 435 ~~~~r~d~~~~~~-~afv~---l~~aaa 458 (458) ....|+|+.|++. +|+++ ++++-+ T Consensus 354 Kl~ERYG~gvLn~gkaiavakNI~~~k~ 381 (393) T protein:vir:79 354 KMIERYGIGILNEGKAIAVAKNISMDKS 381 (393) T ss_pred eeeeeeceeeeeCCceEEEEecceeecc Confidence 9999999999998 55443 334333 No 112 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.78 E-value=3.1e-20 Score=127.55 Aligned_cols=265 Identities=16% Similarity=0.081 Sum_probs=186.2 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceeee----ccCceEEEEecCCCcccccccccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPM----SSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) +....+..+..++|+.+.+-+.+.+.......+++.+-+. .+...++|++...+.+..+.|+ +..+..+. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g------~~i~~~~l 74 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG------EKIPTDIL 74 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC------Cccccccc Confidence 3334455667899999999998888887777777765432 2456789987654555444444 44556677 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) +++..++..++.+.-..++++....+..++.+.+.++++.++++.+|..++.- +..+.. .+.+ T Consensus 75 t~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~---------l~~a~~----~~~~---- 137 (274) T protein:vir:97 75 ETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEA---------LMGAKL----TVNA---- 137 (274) T ss_pred ccceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHH---------HhccCc----cccc---- Confidence 88889999999887889999998888888999999999999999999998853 111100 0001 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccccccccccc-ccccccccCCeeecccceeccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGN-DAVKLQGQVGRIYGLPVVVSEYFPAK 396 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~-~~~~~~~~~~~l~G~pv~~~~~~~~~ 396 (458) ....++.+.++...+..........+|||..+..|.+ +...+++-.... ......|..++++|++|++++.+|.. T Consensus 138 --~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k--~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~ 213 (274) T protein:vir:97 138 --DITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRG--DASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG 213 (274) T ss_pred --cccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHh--hhhhhccccCcccccceeccccceecCeeEEEcCCCCcc Confidence 1123455666666666555566678899999988864 322222211111 11234556688999999999999842 Q ss_pred ccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 397 AASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 397 ~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..+++....+.++....+.++.++...+....+++..++++++++|..+|+++++.| T Consensus 214 -----t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:97 214 -----TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred -----eEEEEeCcceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCcc Confidence 233333445566667777888777777778889999999999999999999999999 No 113 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.78 E-value=3.1e-20 Score=127.55 Aligned_cols=265 Identities=16% Similarity=0.081 Sum_probs=186.2 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceeee----ccCceEEEEecCCCcccccccccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPM----SSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) +....+..+..++|+.+.+-+.+.+.......+++.+-+. .+...++|++...+.+..+.|+ +..+..+. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g------~~i~~~~l 74 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG------EKIPTDIL 74 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC------Cccccccc Confidence 3334455667899999999998888887777777765432 2456789987654555444444 44556677 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) +++..++..++.+.-..++++....+..++.+.+.++++.++++.+|..++.- +..+.. .+.+ T Consensus 75 t~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~---------l~~a~~----~~~~---- 137 (274) T protein:vir:94 75 ETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEA---------LMGAKL----TVNA---- 137 (274) T ss_pred ccceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHH---------HhccCc----cccc---- Confidence 88889999999887889999998888888999999999999999999998853 111100 0001 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccccccccccc-ccccccccCCeeecccceeccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGN-DAVKLQGQVGRIYGLPVVVSEYFPAK 396 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~-~~~~~~~~~~~l~G~pv~~~~~~~~~ 396 (458) ....++.+.++...+..........+|||..+..|.+ +...+++-.... ......|..++++|++|++++.+|.. T Consensus 138 --~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k--~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~ 213 (274) T protein:vir:94 138 --DITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRG--DASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG 213 (274) T ss_pred --cccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHh--hhhhhccccCcccccceeccccceecCeeEEEcCCCCcc Confidence 1123455666666666555566678899999988864 322222211111 11234556688999999999999842 Q ss_pred ccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 397 AASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 397 ~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..+++....+.++....+.++.++...+....+++..++++++++|..+|+++++.| T Consensus 214 -----t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:94 214 -----TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred -----eEEEEeCcceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCcc Confidence 233333445566667777888777777778889999999999999999999999999 No 114 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.77 E-value=2.3e-20 Score=128.27 Aligned_cols=266 Identities=16% Similarity=0.125 Sum_probs=190.8 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceeee----ccCceEEEEecCCCcccccccccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPM----SSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) +....+.....++|+.+.+-+.+.+.....+.+++.+... .+...++|.+.....+.++.|+ +..+..+. T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg------~~i~~~~l 74 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEG------QKIPVDKI 74 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCC------CccCcccc Confidence 2233445567899999999999999888888888865442 3556789988766566555555 44556678 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) ++++.+...++.+..+.++++....+..|..+.+.++++.++++.+|..++.- +.... . ... T Consensus 75 t~~~~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~---------l~~~~---~-~~~----- 136 (276) T protein:vir:10 75 ETNRREAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEA---------LRGTK---L-TVS----- 136 (276) T ss_pred ccceeeEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHH---------Hhccc---c-ccc----- Confidence 88999999999999999999999998889999999999999999999988741 11100 0 000 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKA 397 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 397 (458) ...++++.+.++...+..........+|||..+..|.++...........+ ......|.-++++|++|++++.+|.. T Consensus 137 -~~~~t~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g-~~~~~~G~ig~~~G~~Vi~s~~~p~~- 213 (276) T protein:vir:10 137 -ADIGTLAGLEAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDNFTRATELG-DNIIVKGAFGEALGAVIVRSKKLDEG- 213 (276) T ss_pred -ccccCHHHHHHHHHHhccccCcccEEEEcHHHHHHHHHhcccccccccccc-ccceeccccceecceeEEEcCCCCcc- Confidence 111234556666666665555667789999999999875443322211111 12234566688999999999999852 Q ss_pred cCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 398 ASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 398 ~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..++.....+.++...++.++.++...+....+++...++.++.+|..++++++++- T Consensus 214 ----t~~l~~~gAi~~~~~~~~~vE~dRd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (276) T protein:vir:10 214 ----EAILAKRGAVKLITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKVTKGAG 270 (276) T ss_pred ----eEEEEeccceeeeecCCceeecccchhhcccEEEEeeEEEEEEEcCcceEEEecCCc Confidence 223333335556667788888888888888999999999999999999999998765 No 115 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.76 E-value=4.4e-20 Score=126.73 Aligned_cols=309 Identities=10% Similarity=0.087 Sum_probs=198.1 Q ss_pred HHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEec Q lcl|NC_010583. 133 EVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEP 212 (458) Q Consensus 133 ~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~ 212 (458) +.+ .+.-..+.....-.......++. +.+...++.+.|..+...||+.+.+.+.|+++..+.++.++.+.|++.. T Consensus 1 ~~~----~~~~~~~~~~~~~~~~~p~l~m~-alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~ 75 (330) T protein:vir:94 1 MVR----ICTPPLRGRWRTLTHQFPELKMP-TVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNREN 75 (330) T ss_pred Cce----ecCCccccceeehhccccccchh-hhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceeeeee Confidence 000 00000000000000000111222 2223334667899999999999999999999999888889999999999 Q ss_pred CCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHH--hccHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010583. 213 EAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETE--EDAIFSLLPLLRKRLIEAHAVSIEEAFMSG 290 (458) Q Consensus 213 ~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell--~ds~~~~~~~i~~~la~~~~~~~d~~~l~G 290 (458) .-+.+.|...+...+ ++...||.+++...+.+++.+.|+.++. ..++.+...+-.....+++....+..+||| T Consensus 76 ~lp~a~~r~~n~~~~-----~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linG 150 (330) T protein:vir:94 76 VLGDVQFLAVGGTIT-----AKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITG 150 (330) T ss_pred cCCcceeeecccccc-----ccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 989999877553222 1234589999999999999999999995 446788999999999999999999999999 Q ss_pred CCC-CccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccc Q lcl|NC_010583. 291 NGT-GQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVG 369 (458) Q Consensus 291 ~g~-~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~ 369 (458) +++ +++.||++....... +.+...+.... .+++..++..+......+..++||......++.+.+..+++..... T Consensus 151 Ds~~~~F~GL~~~~~~~q~-i~tg~~gg~~T---~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~ 226 (330) T protein:vir:94 151 DGTGNSFQGMMGLVAASQT-ISAGANGGTLT---FELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEV 226 (330) T ss_pred CCCCccccchhhcCCcccE-EecCCCCCCCC---HHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCc Confidence 976 567899876643221 11222222222 2333333333333334567889999999999999887776654221 Q ss_pred cccccccccC-Ceeecccceecccccccc-----cCCceE-EEEEece----EEEEe----cceeEEeeccc--ccCCce Q lcl|NC_010583. 370 NDAVKLQGQV-GRIYGLPVVVSEYFPAKA-----ASAEFA-VIVYKDN----FVMPR----QRAVTVERERQ--AGKQRD 432 (458) Q Consensus 370 ~~~~~~~~~~-~~l~G~pv~~~~~~~~~~-----~~~~~~-~~~~~~~----~~i~~----~~~~~i~~~~~--~~~~~~ 432 (458) .. ...|.+ .++.|+||+.++.+|... ++...+ ++-+++. ..++- ..++.+ ++.. ..++.. T Consensus 227 ~~--~~~G~~v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsV-r~~G~~~~k~v~ 303 (330) T protein:vir:94 227 MT--LPSGRQIPTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRV-QNVGAKENADET 303 (330) T ss_pred cc--ccCCCEEeeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCccee-eeCCCcccccee Confidence 11 112322 357799999999998632 222222 2334432 22332 234544 2222 346778 Q ss_pred EEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 433 AYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 433 ~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .++++.+++..+.+|+|+++|+--.= T Consensus 304 ~~~v~~y~~~av~~~~a~~~L~~V~~ 329 (330) T protein:vir:94 304 ITRVKMYCGFANFSQLGLAAIKGLIP 329 (330) T ss_pred eEEEEEeeeeEEechhheeeeccccC Confidence 89999999999999999998775555 No 116 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.73 E-value=4.3e-19 Score=121.28 Aligned_cols=265 Identities=16% Similarity=0.069 Sum_probs=182.4 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceee---e-ccCceEEEEecCCCcccccccccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELP---M-SSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~---~-~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) +....+.-...++|+.+.+-+.+.+.....+.+++.+-. . .+...++|.+...+.+..+.++ +..+..+. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g------~~i~~~~l 74 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG------EKIPTDIL 74 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC------Cccchhhc Confidence 233344556789999999988888877777777765532 2 3456788887655455444444 44556677 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) +.++.++..++.+..+.++++....+..++.+.+.++++.++++.+|..++.--.+ +.. ... T Consensus 75 t~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~---------a~~----~~~----- 136 (274) T protein:vir:12 75 ETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMG---------AKL----TVN----- 136 (274) T ss_pred ccceeeEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhc---------ccc----ccc----- Confidence 88888888888888899999888887778899999999999999999988853111 000 000 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccccccccc-ccccccccccCCeeecccceeccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQV-GNDAVKLQGQVGRIYGLPVVVSEYFPAK 396 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~-~~~~~~~~~~~~~l~G~pv~~~~~~~~~ 396 (458) ....+++.+.++...+......+...+|||..+..|.+. ...++.-.. ........|.-++++|++|++++.+|.. T Consensus 137 -~~a~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~--~~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~ 213 (274) T protein:vir:12 137 -ADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGD--ASTNFTRATELGDDIIVKGAFGEALGAIIVRSNKLEAG 213 (274) T ss_pred -ccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhh--hhhhccccccccccceecccceeecCeeEEEeCCCCcc Confidence 111234556666666665555666788999998887653 211222111 1112334556688999999999999853 Q ss_pred ccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 397 AASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 397 ~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) . .+++-...+.++....+.++.++...+....+++..++++++++|+.+|+++++.+ T Consensus 214 t-----~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:12 214 T-----AILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred e-----EEEEeccceeeeecCCceeccccchhhcccEEEeeeEEEEEEEcCCceEEEEcCCc Confidence 1 22322334555566777888877777788899999999999999999999999998 No 117 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.70 E-value=2.2e-18 Score=117.43 Aligned_cols=265 Identities=15% Similarity=0.089 Sum_probs=181.4 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceeee----ccCceEEEEecCCCcccccccccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPM----SSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) +....+.-...++|+.+++-+.+.+.....+.+++.+-+. .+...++|++...+.+..+.++ +..+..+. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g------~~i~~~~l 74 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEG------EKIPTDIL 74 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCC------Cccchhhc Confidence 2233445567899999999898888888777777654332 3457789988755555444444 34455677 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) +.+..++..++.+-.+.++++....+..++.+.+.++++.++++.+|..++.--.+. . ..... T Consensus 75 t~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a---------~----~~~~~---- 137 (274) T protein:vir:96 75 ETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSA---------K----LTVEA---- 137 (274) T ss_pred ccceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc---------c----ccccc---- Confidence 788888888888878899999888887789999999999999999999887421110 0 00000 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccc-cccccccccCCeeecccceeccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVG-NDAVKLQGQVGRIYGLPVVVSEYFPAK 396 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~-~~~~~~~~~~~~l~G~pv~~~~~~~~~ 396 (458) ....++.+.++...+......+...+|||..+..|.+. ...++.-... .......|..++++|++|++++.+|.. T Consensus 138 --~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~ 213 (274) T protein:vir:96 138 --DITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGD--ATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAG 213 (274) T ss_pred --cccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhh--ccccccccccccccceeccccceecCeEEEEeCCCCCc Confidence 11234455556666655444566778999999888653 2222221111 112334566788999999999999842 Q ss_pred ccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 397 AASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 397 ~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..++.-...+.++....+.++.++...+....+++..++++++++|+.+|++++..- T Consensus 214 -----t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~ 270 (274) T protein:vir:96 214 -----TAILAKKGAVKLITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred -----eEEEEeccceeeeecCCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEEcCCc Confidence 222322334555666777888888777888899999999999999999999995554 No 118 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.70 E-value=2.2e-18 Score=117.43 Aligned_cols=265 Identities=15% Similarity=0.089 Sum_probs=181.4 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhcceeee----ccCceEEEEecCCCcccccccccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPM----SSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) +....+.-...++|+.+++-+.+.+.....+.+++.+-+. .+...++|++...+.+..+.++ +..+..+. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g------~~i~~~~l 74 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEG------EKIPTDIL 74 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCC------Cccchhhc Confidence 2233445567899999999898888888777777654332 3457789988755555444444 34455677 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) +.+..++..++.+-.+.++++....+..++.+.+.++++.++++.+|..++.--.+. . ..... T Consensus 75 t~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a---------~----~~~~~---- 137 (274) T protein:vir:95 75 ETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSA---------K----LTVEA---- 137 (274) T ss_pred ccceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc---------c----ccccc---- Confidence 788888888888878899999888887789999999999999999999887421110 0 00000 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccc-cccccccccCCeeecccceeccccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVG-NDAVKLQGQVGRIYGLPVVVSEYFPAK 396 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~-~~~~~~~~~~~~l~G~pv~~~~~~~~~ 396 (458) ....++.+.++...+......+...+|||..+..|.+. ...++.-... .......|..++++|++|++++.+|.. T Consensus 138 --~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~ 213 (274) T protein:vir:95 138 --DITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGD--ATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAG 213 (274) T ss_pred --cccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhh--ccccccccccccccceeccccceecCeEEEEeCCCCCc Confidence 11234455556666655444566778999999888653 2222221111 112334566788999999999999842 Q ss_pred ccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 397 AASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 397 ~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..++.-...+.++....+.++.++...+....+++..++++++++|+.+|++++..- T Consensus 214 -----t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~ 270 (274) T protein:vir:95 214 -----TAILAKKGAVKLITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred -----eEEEEeccceeeeecCCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEEcCCc Confidence 222322334555666777888888777888899999999999999999999995554 No 119 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.69 E-value=2.4e-18 Score=117.22 Aligned_cols=261 Identities=10% Similarity=0.007 Sum_probs=181.8 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhccchhhhcceeee----ccCceEEEEecCCCccccccccccccccccccccc Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPM----SSKILTMLVEPEAGRATWVDASKFGTDETVGDEVK 236 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~ 236 (458) +. .+.-...++|+.+.+-|.+.+.....+.+++.+-+. .+...++|.+...+.+ ....+++..+..+ T Consensus 1 Ma---~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igda------e~~~eg~~i~~~~ 71 (270) T protein:vir:95 1 MT---QTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAA------EDLQEGVAMDTTQ 71 (270) T ss_pred CC---ceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCcc------ccccCCCccchhh Confidence 11 122234689999999888888888778888765333 3456788887654443 3444555666777 Q ss_pred ccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeecccc Q lcl|NC_010583. 237 GQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKA 316 (458) Q Consensus 237 ~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~ 316 (458) .++++.....++.+--+.++++....+..|....+.++++..+++.+|+.++.- ++.+... T Consensus 72 lt~~~~~a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~---------l~~a~~~---------- 132 (270) T protein:vir:95 72 MSMTTTKVTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAE---------LNKSKQT---------- 132 (270) T ss_pred cccchheeeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHH---------hcccccc---------- Confidence 888888899999998899999988776567888899999999999999988731 1111000 Q ss_pred chhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccccccc Q lcl|NC_010583. 317 DGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAK 396 (458) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~ 396 (458) ....++...+.+....+..........+|||.++..|++.....+ .....+....|.-++++|++|++++.+|.. T Consensus 133 -~~~~~t~~~~~dA~~~lgd~~~~~~~i~vhs~~~~~Lrk~~~~~~----~~~~~~~~~~G~ig~~~G~~Viv~s~~~~~ 207 (270) T protein:vir:95 133 -ATVSADATGILDAIEVFNSENDEDYVLYVNPKDYNKLVKSLFKVG----GNVQDRAISKGDLVEIVGVSDIVKSKRVSE 207 (270) T ss_pred -cccccCHHHHHHHHHHhccccCCCcEEEEcHHHHHHHHhhhcccc----cccccchhcccccceecceeEEEeCCCCCc Confidence 011123445566666666666667788999999998876432111 111222334466788999999998887742 Q ss_pred ccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 397 AASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 397 ~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ...++.-...+.++...++.++.++...+....+.+..++++++.+|..+|++|++-| T Consensus 208 ----~~~~l~~~gAi~~~~~~~~~vEtdRd~~~~~d~i~~~~~y~v~~~~~skvv~~t~~~a 265 (270) T protein:vir:95 208 ----NTAFLQRYGAMEIVNKKKPEAYTDFDILKRTHLLSTNYHYSVNLKDETGVVKVTFKPS 265 (270) T ss_pred ----eeEEEEeccceeeeecCCceeeeccchhhcccEEEeeeEEEEEEEccceEEEEEecCC Confidence 2233333345566667778888888777888899999999999999999999999888 No 120 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.58 E-value=5.4e-16 Score=104.28 Aligned_cols=289 Identities=12% Similarity=0.118 Sum_probs=182.7 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccce Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLT 240 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~ 240 (458) +. +-+....+.+.+..+...||+.+...+.|+++..+.++.++.+.|.+...-+.+.+.+.+... .....+.+..+|+ T Consensus 1 mp-altLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~-~~~g~~~~~~t~~ 78 (310) T protein:vir:97 1 MA-SVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTF-SGAGAGKAAATFT 78 (310) T ss_pred Cc-ccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccc-cCCCccccccccc Confidence 11 111111234667888899999999999999999998888888899888776655543322111 1223456788999 Q ss_pred eeeeehhheeeeehhhHHHHhc--c-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcc-ccccccccccccceeecccc Q lcl|NC_010583. 241 EISFKTYKLAAKSFITDETEED--A-IFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQP-KGLLKLAADDGAKVVTEAKA 316 (458) Q Consensus 241 ~v~~~~~k~~~~~~is~ell~d--s-~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p-~Gi~~~~~~~~~~~~~~~~~ 316 (458) +++...+.+++.+.|.+.+.+- + +.+...+=.....+++....+..+|||+.++.+ .|+++...... .+.+...+ T Consensus 79 ~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q-~i~~~~~g 157 (310) T protein:vir:97 79 KVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQ-KATTGATG 157 (310) T ss_pred eeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccc-eeecCCCC Confidence 9999999999999999876552 3 455555556667899999999999999986554 59988764322 12222222 Q ss_pred chhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhc-cccccccccccccccccccC-Ceeecccceeccccc Q lcl|NC_010583. 317 DGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLED-EEWQDVAQVGNDAVKLQGQV-GRIYGLPVVVSEYFP 394 (458) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d-~~~~~~~~~~~~~~~~~~~~-~~l~G~pv~~~~~~~ 394 (458) ... +.+++..++..+....+.+..++|||.++..++.+.. ..++.+++..... .|.+ .++.|+||+.++.+| T Consensus 158 g~~---t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~---~G~~v~~~~GiPi~~~d~ip 231 (310) T protein:vir:97 158 SAI---SFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELP---SGAEVPAYSGTPIFRNDYIP 231 (310) T ss_pred CCC---CHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccC---CCCEEeeeCCeEEEEeCccC Confidence 222 2334444444433333456779999998777776543 3344444332221 1222 468899999999998 Q ss_pred ccc-----cCCceE-EEEEece----EEEE----ecceeEEeecc-cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 395 AKA-----ASAEFA-VIVYKDN----FVMP----RQRAVTVERER-QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 395 ~~~-----~~~~~~-~~~~~~~----~~i~----~~~~~~i~~~~-~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ... .+...+ ++-++.. -.++ ...++.+..=- --.+....++++.+++.++.+|+|+++|.--.= T Consensus 232 ~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 232 TNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESEDSDEHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred CCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcccCCcceeEEEEEeeeEEEecccceeeeccccC Confidence 642 222223 3334432 1122 12334332211 124677888999999999999999988765444 No 121 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.55 E-value=1.7e-16 Score=107.05 Aligned_cols=231 Identities=13% Similarity=0.048 Sum_probs=166.4 Q ss_pred cceeeeccCceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHH Q lcl|NC_010583. 196 FDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRL 275 (458) Q Consensus 196 ~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l 275 (458) -+-+. .+...++|. |++......++...+....++++.+.+.++.+--++|+++....+..|......+++ T Consensus 1 ~~~~~-~Gdtit~P~--------~iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~ 71 (231) T protein:vir:73 1 ENGIN-LANLCEYPN--------DIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQL 71 (231) T ss_pred Ccccc-CCceEEecc--------cccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHH Confidence 11111 223456663 344555666777777888899999999999999999999998877778899999999 Q ss_pred HHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHH Q lcl|NC_010583. 276 IEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYD 355 (458) Q Consensus 276 a~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 355 (458) +.+|++++|..++.-- ..+. . + .+..+++..+.+....+......+...+|||..+..|+ T Consensus 72 ~~~iA~kvD~di~~~~---------~~a~---l---~-----~~~~~t~d~i~~A~~~fgde~~~~~vivv~p~~~~~Lr 131 (231) T protein:vir:73 72 GLSLANKVDDDLLKAA---------KTTS---Q---T-----VSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIR 131 (231) T ss_pred HHHHHHhhhHHHHHhh---------cccc---c---c-----ccccccHHHHHHHHHHhccccccceEEEEcchHHHhhh Confidence 9999999999988411 0000 0 0 01123456666666666666666777899999998888 Q ss_pred hhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEEeecccccCCceEEE Q lcl|NC_010583. 356 LLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYY 435 (458) Q Consensus 356 ~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~ 435 (458) +..+.... ...........|.-+++.|+||++|+.+|.+..- ...++.-...+.+....++.++.++........++ T Consensus 132 k~~~~~~~--~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~-~~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~ 208 (231) T protein:vir:73 132 KDANAKNI--GSEVGANALINGTYADVLGAQIVRSKKLAEGSAL-MFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVIT 208 (231) T ss_pred hccchhhh--hhhhccceeeecccceEcceEEEEcCCCCCCcee-eeeEEeeccceeeeecccceeeccccccccccEEE Confidence 75544332 1122233445667789999999999999864321 11122234567778888888988888888899999 Q ss_pred EEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 436 VTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 436 ~~~r~d~~~~~~~afv~l~~aaa 458 (458) +...++.++.+|..+|+++++.- T Consensus 209 ~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 209 ADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred EeEEEEEEEEcCccEEEEEeecC Confidence 99999999999999999999999 No 122 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.50 E-value=3.5e-15 Score=99.83 Aligned_cols=266 Identities=13% Similarity=0.044 Sum_probs=151.3 Q ss_pred ccCccccchhHHHHHHHHHHhccchhhhccee----eeccCceEEEEecCCCcccccccccccccccccccccccceeee Q lcl|NC_010583. 168 SMSSEAYETIFSTRIIRDLQKELVVGALFDEL----PMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEIS 243 (458) Q Consensus 168 ~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~ 243 (458) .+-...+|+.++..+++.++..+.+..+++.- ...+...++|+......+.+..++..+ ...+.+.+.++ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~------~~~~~~~~~~~ 74 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQT------SADAISDTGVD 74 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCcc------CccccccceEE Confidence 22234679999999999999988888876431 223456788876654444444333222 22233445555 Q ss_pred eehhhe-eeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHH Q lcl|NC_010583. 244 FKTYKL-AAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLV 322 (458) Q Consensus 244 ~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 322 (458) ++..+. ..-+.|++.-...+..++.+ +.++++++++.++|..++. .+..+.... .... ...... T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~---------~~~~a~~~~---~~~~--~~~~~~ 139 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIAD---------MLVDNGTAL---TGSA--PTDADD 139 (273) T ss_pred EEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHH---------HHhcccccc---cccc--ccchhH Confidence 554432 22234565434445567887 5567889999999987763 111110000 0000 111122 Q ss_pred HHHHHHHHHhhhhhhhcc--cceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCC Q lcl|NC_010583. 323 TAKTISKLRRKLGRHGLK--LSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASA 400 (458) Q Consensus 323 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~ 400 (458) .+..+.++...+.....+ +-.++++|..+..|.+..+...... ..+.......|..++|.|.+|+.++.+|.... T Consensus 140 ~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~-~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~-- 216 (273) T protein:vir:10 140 AFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSAD-TSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-- 216 (273) T ss_pred HHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhh-ccccccceeeeeeeEEeceEEEEecccccCCc-- Confidence 345566666666665543 3345778888887765322111111 11112223456678999999999999996432 Q ss_pred ceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 401 EFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 401 ~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ...+.+..+.+....+. ..++......+-...+++...+|+++++|++++.++.++| T Consensus 217 ~~~~~~~~~A~~~a~q~-~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 217 EQFVAFHPSAAAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred cEEEEEeccceeeeeee-ehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 12222233343333322 2333322223335677888999999999999999999999 No 123 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.50 E-value=3.5e-15 Score=99.83 Aligned_cols=266 Identities=13% Similarity=0.044 Sum_probs=151.3 Q ss_pred ccCccccchhHHHHHHHHHHhccchhhhccee----eeccCceEEEEecCCCcccccccccccccccccccccccceeee Q lcl|NC_010583. 168 SMSSEAYETIFSTRIIRDLQKELVVGALFDEL----PMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEIS 243 (458) Q Consensus 168 ~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~ 243 (458) .+-...+|+.++..+++.++..+.+..+++.- ...+...++|+......+.+..++..+ ...+.+.+.++ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~------~~~~~~~~~~~ 74 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQT------SADAISDTGVD 74 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCcc------CccccccceEE Confidence 22234679999999999999988888876431 223456788876654444444333222 22233445555 Q ss_pred eehhhe-eeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHH Q lcl|NC_010583. 244 FKTYKL-AAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLV 322 (458) Q Consensus 244 ~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 322 (458) ++..+. ..-+.|++.-...+..++.+ +.++++++++.++|..++. .+..+.... .... ...... T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~---------~~~~a~~~~---~~~~--~~~~~~ 139 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIAD---------MLVDNGTAL---TGSA--PTDADD 139 (273) T ss_pred EEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHH---------HHhcccccc---cccc--ccchhH Confidence 554432 22234565434445567887 5567889999999987763 111110000 0000 111122 Q ss_pred HHHHHHHHHhhhhhhhcc--cceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCC Q lcl|NC_010583. 323 TAKTISKLRRKLGRHGLK--LSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASA 400 (458) Q Consensus 323 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~ 400 (458) .+..+.++...+.....+ +-.++++|..+..|.+..+...... ..+.......|..++|.|.+|+.++.+|.... T Consensus 140 ~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~-~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~-- 216 (273) T protein:vir:10 140 AFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSAD-TSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-- 216 (273) T ss_pred HHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhh-ccccccceeeeeeeEEeceEEEEecccccCCc-- Confidence 345566666666665543 3345778888887765322111111 11112223456678999999999999996432 Q ss_pred ceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 401 EFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 401 ~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ...+.+..+.+....+. ..++......+-...+++...+|+++++|++++.++.++| T Consensus 217 ~~~~~~~~~A~~~a~q~-~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 217 EQFVAFHPSAAAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred cEEEEEeccceeeeeee-ehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 12222233343333322 2333322223335677888999999999999999999999 No 124 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.50 E-value=2.6e-15 Score=100.56 Aligned_cols=266 Identities=12% Similarity=0.045 Sum_probs=152.2 Q ss_pred ccCccccchhHHHHHHHHHHhccchhhhcce----eeeccCceEEEEecCCCcccccccccccccccccccccccceeee Q lcl|NC_010583. 168 SMSSEAYETIFSTRIIRDLQKELVVGALFDE----LPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEIS 243 (458) Q Consensus 168 ~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~----~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~ 243 (458) .+-..++|+.++..+++.++..+.+.++++. ....+...++|+......+.+..++.. ....+.+.+.++ T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~------~~~~~~~~~~~~ 74 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQ------TSADAISDTGVD 74 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCc------cCccccccceEE Confidence 1222368999999999999999888888643 222345688888665444444444432 223344556666 Q ss_pred eehhhe-eeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHH Q lcl|NC_010583. 244 FKTYKL-AAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLV 322 (458) Q Consensus 244 ~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 322 (458) ++..+. ..-+.|++.-...+..++.+ +.++++.+++.++|..++. .+..+..... ..... .... T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~vD~~i~~---------~~~~a~~~~~-~~~~~----~~~~ 139 (273) T protein:vir:79 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIAD---------MLVDNGTALT-GSAPS----DADD 139 (273) T ss_pred EEEeeecccceeeccHHHHhhcccHHH-HHHHHHHHHHHHHHHHHHH---------HHhhcccccc-ccccc----chhh Confidence 666543 33345665434445567887 4567889999999987652 1111100000 00000 1112 Q ss_pred HHHHHHHHHhhhhhhhcc--cceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCC Q lcl|NC_010583. 323 TAKTISKLRRKLGRHGLK--LSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASA 400 (458) Q Consensus 323 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~ 400 (458) .+..+.++...+.....+ +-.++++|..+..|.+..+....... .+.......|..++|+|.+|+.++.+|.... T Consensus 140 ~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~-~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~-- 216 (273) T protein:vir:79 140 AFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADT-SGDAAGLRAGTIGNLLGARIVESNNLRDTDD-- 216 (273) T ss_pred HHHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhh-cccccceeeeEeeEEeceEEEecccccccCc-- Confidence 334566666666655442 23457788877766543221111111 1111223456778999999999999996432 Q ss_pred ceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 401 EFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 401 ~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ...+.+..+.+....+. ..++......+-...+++...+|+++++|++++.++.++| T Consensus 217 ~~~~a~~~~A~~~a~~~-~~~e~~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 217 EQFVAFHPSAAAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred eEEEEEeccceeeeeeh-hhhhcccCcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 12222223333333322 2333333333445678889999999999999999999999 No 125 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.41 E-value=7.2e-14 Score=92.65 Aligned_cols=386 Identities=13% Similarity=0.063 Sum_probs=183.5 Q ss_pred HHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 12 LGLGDLAKSLEGLTAAQKA---AEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTV 88 (458) Q Consensus 12 ~~~~~~~~~~~~l~~~~~~---~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~ 88 (458) .+ + .+.+ +.+... .....+++|...+......-+.-+-.-|.-...++. +.-+ T Consensus 1 ~~---~-s~~~--~~k~~~~ek~~~~~~~~e~~~~lks~~~g~~~~~~~~~~~k~~el------------------~kT~ 56 (400) T protein:vir:93 1 MR---I-SKRN--MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQEL------------------EKTL 56 (400) T ss_pred Cc---c-cccc--cccchHHHHHHHHhhhhhhhhhhhhhhhccchhhhhhhchhHHHH------------------HHHH Confidence 00 0 0000 000000 000001111111100000000000000000000111 1111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhH----HHHHHHHhhhhhcc Q lcl|NC_010583. 89 EKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFE----TEHGKAHIKAVNGS 164 (458) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~----~~~~~~~~~a~~~~ 164 (458) .++..++...+.+..... ...+......+-......-..|.+-+....... .|......++++ T Consensus 57 Sel~~ei~k~e~eln~~~-----------E~~Kgk~~mtefLkT~~A~~~fa~~l~~nsg~sd~knaW~A~l~E~gvt-- 123 (400) T protein:vir:93 57 SENSIEIIKIENELNAQE-----------EKPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVT-- 123 (400) T ss_pred HHhHHHHHHHhhhhhhhh-----------hhcccchhHHHhhhhHHHHHHHHHHHHhhcCCcchhhhhhhhhhhcccc-- Confidence 122222211111111000 000000000011111111122222222222111 222122222222 Q ss_pred cccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccceeeee Q lcl|NC_010583. 165 SSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISF 244 (458) Q Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~ 244 (458) .......+|.-+...|-+.++.+.++.+..++...+ .+.+..+..++. ++.--..++.++++..+|..-++ T Consensus 124 --~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p----~l~V~~~~dt~~---qa~gHk~G~~K~eq~~tl~~rtL 194 (400) T protein:vir:93 124 --ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG----ALLVSRSFDSAN---EAQVHKDGQTKTEQAATLTIDTL 194 (400) T ss_pred --cCCchhhcchHHHHHHHHhhhccCCcccceeeecCC----ceeeecchhhhc---ccceeccCCcccceeeeeeeecc Confidence 122234789999999999999999999877665542 223332222221 22223456778888889999999 Q ss_pred ehhheeeeehhhHHHHhcc--HHHHHHHHHHHHHHHHHH-HHHHHHhccCCCCccccccccccccc---cceeeccccch Q lcl|NC_010583. 245 KTYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAV-SIEEAFMSGNGTGQPKGLLKLAADDG---AKVVTEAKADG 318 (458) Q Consensus 245 ~~~k~~~~~~is~ell~ds--~~~~~~~i~~~la~~~~~-~~d~~~l~G~g~~~p~Gi~~~~~~~~---~~~~~~~~~~~ 318 (458) .|.-+..+..+.+-..++. .-.|.+||.++|...+.. ..+.+++-|+|++...++-..+.... ....+..+... T Consensus 195 ~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~dt~kt~~a~~~ 274 (400) T protein:vir:93 195 EPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKT 274 (400) T ss_pred CHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHHHHHhhhheeecccccccCCCcchhhhhhhhhhhhhhhhcCCc Confidence 9998877777744444432 234799999999999996 57999999999886655533322111 11111122222 Q ss_pred hhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccce-ecccccccc Q lcl|NC_010583. 319 SVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVV-VSEYFPAKA 397 (458) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~-~~~~~~~~~ 397 (458) ........+ .....+......-.++.|..++.|+.++|++|...|..+...... .+-+|..=+ +..-+|. T Consensus 275 ~~qdl~E~~---~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~~~a~f~~~n~d~~I----A~~fGv~~Lv~~Tr~~~-- 345 (400) T protein:vir:93 275 PFADAIEEA---VDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEI----ASEVGVDEIIVYTGSKA-- 345 (400) T ss_pred cHHHHHHHH---HhhhhhccCCceeEEeccchHHHHHHhcCCcceeeeeeccccchh----hhhcccceeeeeccCCC-- Confidence 222222222 223334444555678999999999999999999888655443221 123343222 2222221 Q ss_pred cCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEee Q lcl|NC_010583. 398 ASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYA 456 (458) Q Consensus 398 ~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~a 456 (458) ....+++ +.+...+..++.-..+.....|+-.+..+..++|.+.-|++-++++++ T Consensus 346 --~kp~V~V--Dek~~i~~~~~~t~~sf~~~tNs~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 346 --LKPTVLV--DQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred --CCceeee--ehhhhccccCceeccceeeeeccceEEeeeeeccceecccceeeEeeC Confidence 1222333 333333556666555555567777788899999999999999999999 No 126 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.28 E-value=1.9e-12 Score=84.89 Aligned_cols=302 Identities=10% Similarity=0.063 Sum_probs=172.6 Q ss_pred HHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecC Q lcl|NC_010583. 134 VEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPE 213 (458) Q Consensus 134 ~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~ 213 (458) ..-.+....+++....... ... ...++. |+.+++......+++.+...+++++.++++++.+....++.... T Consensus 1 ~~~~~~~~~~~n~~~~~i~-----k~~-it~~~l--~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~kig~ 72 (360) T protein:vir:99 1 MSSNSTIDSVRNQNMNSLS-----QKD-IGLAEL--DGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQFGV 72 (360) T ss_pred CcchhHHHHHhhhHHHHHH-----hhh-cccccc--CceeecHHHHHHHHHHHhhccchhhhcceeeccccccccccccc Confidence 1111122233332221111 111 112222 34455667778899999999999999999999998888776654 Q ss_pred CCcccc-cccccccccccccccccccceeeee-ehhheeeeehhhHHHHhccH----HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 214 AGRATW-VDASKFGTDETVGDEVKGQLTEISF-KTYKLAAKSFITDETEEDAI----FSLLPLLRKRLIEAHAVSIEEAF 287 (458) Q Consensus 214 ~~~a~~-v~e~~~~~e~~~~~~~~~~f~~v~~-~~~k~~~~~~is~ell~ds~----~~~~~~i~~~la~~~~~~~d~~~ 287 (458) +..-.- ..|... .....+++...|.+ ..+++-....+..+.+.+.. ..+++.|.+.|++++++-++.-. T Consensus 73 G~r~~r~~~e~~~-----~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~ 147 (360) T protein:vir:99 73 PRLSGHTRDEEGS-----RTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMG 147 (360) T ss_pred ceeeccccccCCC-----CCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHH Confidence 422110 112111 11112344445555 33455566677777777642 35779999999999999999999 Q ss_pred hccCCCC--------------ccccccccccccccce-------e---ec----c------ccch-----hhHHHHHHHH Q lcl|NC_010583. 288 MSGNGTG--------------QPKGLLKLAADDGAKV-------V---TE----A------KADG-----SVLVTAKTIS 328 (458) Q Consensus 288 l~G~g~~--------------~p~Gi~~~~~~~~~~~-------~---~~----~------~~~~-----~~~~~~~~~~ 328 (458) ++|+... ...||++.+......+ . .. . ..++ ........+. T Consensus 148 ~~g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~ 227 (360) T protein:vir:99 148 IRAGASSGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFN 227 (360) T ss_pred hhccchhcccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHH Confidence 9997431 2578887763221000 0 00 0 0000 0011223456 Q ss_pred HHHhhhhhhhccc----ceeEechhHHH-HHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceE Q lcl|NC_010583. 329 KLRRKLGRHGLKL----SKLVLIVSMDA-YYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFA 403 (458) Q Consensus 329 ~~~~~~~~~~~~~----~~~~~~~~~~~-~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~ 403 (458) +++..++..|+.+ ..|+|++.... +.+.|.+-.. ++. ... ...+..-...|+||+..+.+|. ..+ T Consensus 228 ~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~t-~LG---d~~-l~g~~~~~~~Gipi~~v~~~pd-----~~~ 297 (360) T protein:vir:99 228 ETIQTLDSRYRESDAYSPVLMTSPNQVQSYTMSLTERED-PLG---SAV-IFGDSDITPFSYDLVGVNGFPD-----EYM 297 (360) T ss_pred HHHHhcchhhhcCcccceEEEccCchHHHHHHHHhccCc-ccc---hhh-eecccccccceeeeEEcCCCCC-----Cce Confidence 7788888888753 37999998754 4444544332 111 000 1111223467999999999985 246 Q ss_pred EEEEeceEEEEecceeEEee----cccccCCc-eEEEEEEeeccEEecccceEEEEe---ecC Q lcl|NC_010583. 404 VIVYKDNFVMPRQRAVTVER----ERQAGKQR-DAYYVTQRVNLQRYFENGVVSGAY---AAA 458 (458) Q Consensus 404 ~~~~~~~~~i~~~~~~~i~~----~~~~~~~~-~~~~~~~r~d~~~~~~~afv~l~~---aaa 458 (458) ++....++.++-+.++++.. +.+..+.. +.+.....+|+..-+++|.|.++- +.| T Consensus 298 mlT~p~NLi~g~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 298 MFTDPNNLAFGLYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred EEeccCceeEEeeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCCCCC Confidence 67788888888888888753 22333332 333345568888889999888653 222 No 127 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.23 E-value=3e-13 Score=89.20 Aligned_cols=299 Identities=10% Similarity=0.024 Sum_probs=151.8 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccC---ccccchhHHHHHHHHHHhccchhhhcceeeec-cCceEEEEecCCCccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMS---SEAYETIFSTRIIRDLQKELVVGALFDELPMS-SKILTMLVEPEAGRAT 218 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g---~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~a~ 218 (458) +..-.. ......+ ......++ .+.| +.|..+|...+...+.++.+.++.++. +....+|+.... ++. T Consensus 1 ~a~~~~----~~~~~~~---~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~~-~~~ 71 (347) T protein:vir:88 1 MANATG----GQQIGAN---QGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRT-KGY 71 (347) T ss_pred CCCccc----chhhhcc---CCCCccccchHHHHH-HHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecce-eee Confidence 110000 0000000 01111111 1234 788899998888889889988776644 455666655433 333 Q ss_pred ccccccccccccccccccccceeeeeehhhee-eeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhc----cCC- Q lcl|NC_010583. 219 WVDASKFGTDETVGDEVKGQLTEISFKTYKLA-AKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMS----GNG- 292 (458) Q Consensus 219 ~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~-~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~----G~g- 292 (458) ....+.... .+..++..+++++...++- .-..|.+-=.-++.+|+.+.+..++++++++..|+.++. +.. T Consensus 72 ~~~~g~~l~----~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~ 147 (347) T protein:vir:88 72 YLAPGENLD----DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNL 147 (347) T ss_pred eeccccCCC----CCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 333332211 1112345666655555441 112333332233457899999999999999999998762 211 Q ss_pred ----CCccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhccc-ce-eEechhHHHHHHhhhc-cccccc Q lcl|NC_010583. 293 ----TGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKL-SK-LVLIVSMDAYYDLLED-EEWQDV 365 (458) Q Consensus 293 ----~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~l~~~~d-~~~~~~ 365 (458) .+.+.|+-+.......+..............++.+..+...+.....+. .. .++.|..+..|..... ....+. T Consensus 148 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~ 227 (347) T protein:vir:88 148 PAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYA 227 (347) T ss_pred ccccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhc Confidence 1123343322211111111112222333344566777777777666543 23 3556776665543221 111111 Q ss_pred cccccccccccccCCeeecccceecccccccccCCceEEEE--------------------Ee----------ceEEEEe Q lcl|NC_010583. 366 AQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIV--------------------YK----------DNFVMPR 415 (458) Q Consensus 366 ~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~--------------------~~----------~~~~i~~ 415 (458) .......|..+.++|.+|+.++++|....+......+ ++ +.+.... T Consensus 228 ----~~~~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~ 303 (347) T protein:vir:88 228 ----ALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVK 303 (347) T ss_pred ----cccchhcceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhhee Confidence 1123345666889999999999999543322111100 01 1111222 Q ss_pred cceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 416 QRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 416 ~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) -.+++++....-.+-...+++..-+|.++++|++.+.+++.+| T Consensus 304 ~~d~~~e~~r~~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a 346 (347) T protein:vir:88 304 LKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) T ss_pred cccceeeeeechhhHHHHhhhhhhhcCceeccceEEEEEeCCC Confidence 2332333222222333466888899999999999888887777 No 128 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.20 E-value=4.2e-12 Score=82.93 Aligned_cols=397 Identities=14% Similarity=0.082 Sum_probs=180.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 14 LGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQE 93 (458) Q Consensus 14 ~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~ 93 (458) |..--.....+. ++. .+.+.+++-.... .=+.........|++..+-..+.+...+..+...|. ..+....+. T Consensus 1 ~~n~t~a~d~~~--RR~---~~~L~~~EvSvv~-~PAY~nA~vt~vRe~e~~~~~e~~~~~e~~en~~e~-~~~~~~~~~ 73 (410) T protein:vir:83 1 MGNATTASDEYI--RRL---ENELREKESLVRG-IYDRANASNRDVNEEEGQMVAECRGRMEQIKNQMEQ-AQEVNRIAF 73 (410) T ss_pred CCCcccchhhHH--HHH---HHHhhhhheeeec-cccccccccccchhhhccccccccCcccchhhhhHH-HHHHHHHHH Confidence 110000000000 000 0111110000000 000000000001111000001111111111111111 111222222 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhh---hccchhHHHHHHHHhhhhhcccccccC Q lcl|NC_010583. 94 TIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYM---MEKDVFETEHGKAHIKAVNGSSSVSMS 170 (458) Q Consensus 94 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~---~~~~~~~~~~~~~~~~a~~~~~~~~~g 170 (458) +..++...+...... .- ..+.. ..-.+.+. ..|.+.+ ++++...........++.. ..++..- T Consensus 74 E~Rs~~~~i~~~~~~-------~r--~~p~~-~~veyRSa---GE~lkal~~~~~Gd~~A~~~~e~~r~a~~-~~~Tgd~ 139 (410) T protein:vir:83 74 ETRSKGQAVDAAISA-------MR--GSPVG-TEVEYRSA---GEYMLDMWNSAQGNASAADRLEVYARAAD-HQKTGDL 139 (410) T ss_pred HHHHHHHHHHhhhcc-------Cc--CCCCC-CCcccccH---HHHHHHHhccCCchHHHHHHHHHHHHhhc-cCccccc Confidence 222222222111000 00 00000 00011111 1123333 1222221111111122222 2222222 Q ss_pred ccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCccc-ccccccccccccccccccccceeeeeehhhe Q lcl|NC_010583. 171 SEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRAT-WVDASKFGTDETVGDEVKGQLTEISFKTYKL 249 (458) Q Consensus 171 ~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~-~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~ 249 (458) ..+||+.+....|+.+....+|..+...+|..+..+.||+....++.+ ...++...-|+...+..+.+|+.-+...+.+ T Consensus 140 ~~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~t~tA~ikTy 219 (410) T protein:vir:83 140 QGVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVIDRLTVNAKTL 219 (410) T ss_pred ccccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeecccccccccccccccccccccccccceeeeeccceeehh Confidence 446777788889999999999999877799999999998887665432 2223444456667777888999999999999 Q ss_pred eeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHH---HHhccCCCCccccccccccccccceeeccccchhhHHHHHH Q lcl|NC_010583. 250 AAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE---AFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKT 326 (458) Q Consensus 250 ~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~---~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (458) +++..+|++.++.|++.+.+...+-|..+.+.+-+. ++|+++-++ .... ... +...++.. T Consensus 220 GGyt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~--------~~a~----~~~-----Tad~~~~~ 282 (410) T protein:vir:83 220 GGYVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTG--------AVGY----GNA-----TADNVASA 282 (410) T ss_pred cCcccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh--------hhhh----hhc-----cHHHHHHH Confidence 999999999999999999999999997777777665 344432111 0000 000 11111222 Q ss_pred HHHHHhhhhhh--hcccceeEechhHHHHHHhh-hcccccccccccccc-ccccccCCeeecccceecccccccccCCce Q lcl|NC_010583. 327 ISKLRRKLGRH--GLKLSKLVLIVSMDAYYDLL-EDEEWQDVAQVGNDA-VKLQGQVGRIYGLPVVVSEYFPAKAASAEF 402 (458) Q Consensus 327 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~~-~d~~~~~~~~~~~~~-~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~ 402 (458) +.+....+... ......+...|.++..+..+ ++-++......++.. ..+.+..|.++|.||.+.+..+++ . T Consensus 283 i~da~~~v~da~~~~~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~Ag-----T 357 (410) T protein:vir:83 283 IWQAAGAVYTAVKGMGRLVIAIAPDVLGDFGPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSG-----D 357 (410) T ss_pred HHHHHHHHhhhhccceeeeEEechhhhhhccceeeccCCCCcccccccccccccchhhhhcccceEEecCCCcC-----e Confidence 22222222222 22333456677776544332 222221111111111 112355688999999999887753 3 Q ss_pred EEEEEeceEEEEecce--eEEeecccccCCceEEEEEEeeccEEecccceEEEEee Q lcl|NC_010583. 403 AVIVYKDNFVMPRQRA--VTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYA 456 (458) Q Consensus 403 ~~~~~~~~~~i~~~~~--~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~a 456 (458) .++.+...+..+.... +++..+.- .+-+..|. .++...+..|.+++-+.-+ T Consensus 358 A~f~~~~Ai~~~eS~~gp~qL~d~~i-~nLt~~yS--gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 358 AYLFSTAAIECFEQRVGTLQVVEPSV-FGLQVAYA--GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred eeEeccceeeeeecCCceeEeeCCch-hhhhhhhe--eeeeeccccccceeeeccC Confidence 4455566676776654 66644322 23333444 4567788889998877666 No 129 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.19 E-value=7.9e-13 Score=86.94 Aligned_cols=297 Identities=12% Similarity=0.039 Sum_probs=147.5 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccC----ccccchhHHHHHHHHHHhccchhhhcceeeecc-CceEEEEecCCCcc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMS----SEAYETIFSTRIIRDLQKELVVGALFDELPMSS-KILTMLVEPEAGRA 217 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g----~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~a 217 (458) +..-.. .+..+..+...+. ...| +.|.++++......+.++.+.++.++.+ ....+|+. +..++ T Consensus 1 m~~~~~---------~~~~t~~g~~~~~~d~~al~i-k~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~i-G~~tv 69 (347) T protein:vir:94 1 MANVPG---------QKIGTDQGKGKSSSDALALFL-KVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVM-GRTSG 69 (347) T ss_pred CCCCCc---------cccccccccCCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccccccccceEEEecc-cceee Confidence 000000 0000000000011 1233 6888999998888888888887766554 45566666 33344 Q ss_pred cccccccccccccccccccccceeeeee--hhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhc----c- Q lcl|NC_010583. 218 TWVDASKFGTDETVGDEVKGQLTEISFK--TYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMS----G- 290 (458) Q Consensus 218 ~~v~e~~~~~e~~~~~~~~~~f~~v~~~--~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~----G- 290 (458) .....|...... . ...+-+++++. ..++..+ .|.+-=--++..++.+.+.++.++++++..|+.++. . T Consensus 70 ~~~t~G~~l~~~--~--~~~~~~e~~itID~~~~~~~-~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~a 144 (347) T protein:vir:94 70 VYLAPGERLSDK--R--KGIKHTEKVITIDGLLTADV-MIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILC 144 (347) T ss_pred eeecCCCCcCCC--C--CCCCcceEEEEecchhhhhH-HhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 333333322111 0 11233443333 3333221 122111112457899999999999999999997752 1 Q ss_pred C--CC--CccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccc-e-eEechhHHHHHHhhhcccccc Q lcl|NC_010583. 291 N--GT--GQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLS-K-LVLIVSMDAYYDLLEDEEWQD 364 (458) Q Consensus 291 ~--g~--~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~l~~~~d~~~~~ 364 (458) . ++ ..+.|+-........+.....+........+..+.++...+.....+.. . .+++|..+..|. .+..-.. T Consensus 145 a~~~~~~~~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll--~~~~~~~ 222 (347) T protein:vir:94 145 NLPAASNENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAIL--AALMPNA 222 (347) T ss_pred ccccccccccCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHh--ccchhhh Confidence 1 11 1223322111111111111111111223334556666666766655432 3 355777666553 3322111 Q ss_pred ccccccccccccccCCeeecccceecccccccccCC-----c--------e------------------EEEEEeceEEE Q lcl|NC_010583. 365 VAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASA-----E--------F------------------AVIVYKDNFVM 413 (458) Q Consensus 365 ~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~-----~--------~------------------~~~~~~~~~~i 413 (458) .. ....+....|..++++|.+|+.|+.+|....+. . . .+++..+.+.. T Consensus 223 ~~-~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~ 301 (347) T protein:vir:94 223 AN-YAALIDPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGT 301 (347) T ss_pred hh-ccccccccccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhh Confidence 11 111233455667899999999999999522111 0 0 01111122223 Q ss_pred EecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 414 PRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 414 ~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +...+++++....-.+-...+++..-+|.++++|++.+.+++++| T Consensus 302 v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A 346 (347) T protein:vir:94 302 VKLRDLALERDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFSPA 346 (347) T ss_pred hhcccccccchhchhhHHHHhhhhhhhcCcccccceeEEEEecCC Confidence 333444444333223334567888899999999999999999999 No 130 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.17 E-value=2.2e-11 Score=79.06 Aligned_cols=297 Identities=10% Similarity=-0.011 Sum_probs=147.7 Q ss_pred HHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccc-hhHHHHHHHHHHhccchhhhcceeeecc-CceEEEEecCC Q lcl|NC_010583. 137 LVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYE-TIFSTRIIRDLQKELVVGALFDELPMSS-KILTMLVEPEA 214 (458) Q Consensus 137 ~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip-~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~ 214 (458) ....+.+.+ .....++.-..+. +.|.++|...+...+.++.+..+..+.+ ...++|+.. . T Consensus 1 ms~~n~~t~-----------------~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG-~ 62 (364) T protein:vir:10 1 MSNPNVLTQ-----------------PAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIG-E 62 (364) T ss_pred CCCcccccc-----------------cccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeee-e Confidence 000000000 0001111122333 7888999999988999999887766554 456777663 3 Q ss_pred Ccccccccccccccccccccccccceeeeeehhheee-eehhhHHHHhccHHH-HHHHHHHHHHHHHHHHHHHHHhc--- Q lcl|NC_010583. 215 GRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAA-KSFITDETEEDAIFS-LLPLLRKRLIEAHAVSIEEAFMS--- 289 (458) Q Consensus 215 ~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~-~~~is~ell~ds~~~-~~~~i~~~la~~~~~~~d~~~l~--- 289 (458) .++.+..-|+.. + -..+..++.++....+-. -..|.+----++.++ +.+.+..++++++++..|+.++. T Consensus 63 ~~~~~~~~G~~l-d-----~~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~ 136 (364) T protein:vir:10 63 TELQVLSPGKSP-D-----ASPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLV 136 (364) T ss_pred eEEeeeccCccc-C-----CCCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333333333321 2 233445565555554321 112221111134466 78999999999999999998752 Q ss_pred cCC-CC-cc---ccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccce--eEechhHHHHHHhhhcccc Q lcl|NC_010583. 290 GNG-TG-QP---KGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSK--LVLIVSMDAYYDLLEDEEW 362 (458) Q Consensus 290 G~g-~~-~p---~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~l~~~~d~~~ 362 (458) --+ ++ .| .++............+..............+.++...+.....+... .++.|..+..|.+-.+--. T Consensus 137 ~aa~a~~~~~~~~~~~~~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn 216 (364) T protein:vir:10 137 LGGISNTEAIRKNPRVAGHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVD 216 (364) T ss_pred hhhhhcccccccCCcccCCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCcccc Confidence 011 10 00 11111110000001111222223334444455566666666554443 4556766655543211000 Q ss_pred ccccccccccccccccCCeeecccceeccccccccc--------------------------C--CceEEEEEeceEEEE Q lcl|NC_010583. 363 QDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAA--------------------------S--AEFAVIVYKDNFVMP 414 (458) Q Consensus 363 ~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~--------------------------~--~~~~~~~~~~~~~i~ 414 (458) +- +.....+....|....+.|.||+.|+.+|.... + ....+++-.+.+... T Consensus 217 ~d-~~~~~~~~~~~G~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv 295 (364) T protein:vir:10 217 KS-YTIAASDNTVDGFVLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVG 295 (364) T ss_pred cc-ccccCCCccccceeEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEE Confidence 00 111122334556667899999999999995211 0 011112222334444 Q ss_pred ecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 415 RQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 415 ~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ...+++.+....-.+-...+.+..-+|.++++|++.+.++.+++ T Consensus 296 ~~~~~t~e~~~~~~~~~~~ida~~a~G~g~lRPeaa~~i~~~~~ 339 (364) T protein:vir:10 296 RTISITGDIFYEKKEKTWYIDTFLAEGAIPDRWEAVAVVTAADT 339 (364) T ss_pred EEecceeeeeeccceeeeeeeeehcccCcccCccceEEEEecCC Confidence 44555554322222333344455568999999999999999998 No 131 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.14 E-value=4e-12 Score=83.07 Aligned_cols=295 Identities=11% Similarity=0.015 Sum_probs=149.6 Q ss_pred hhccchhHHHHHHHHhhhhhc-ccccccCccccc-hhHHHHHHHHHHhccchhhhcceeeec-cCceEEEEecCCCcccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNG-SSSVSMSSEAYE-TIFSTRIIRDLQKELVVGALFDELPMS-SKILTMLVEPEAGRATW 219 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~-~~~~~~g~~~ip-~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~ 219 (458) +.. .. ...... .-..+++...++ +.|..+|...+...+.++++.++..+. +....+|+. +..++.+ T Consensus 1 m~~------~~----~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~i-G~~~~~~ 69 (334) T protein:vir:80 1 MTY------PA----ANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRV-GASTIAG 69 (334) T ss_pred CCC------Cc----CCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeee-cceeeee Confidence 000 00 000000 011122223444 889999999998999999998877766 455667755 4455544 Q ss_pred cccccccccccccccccccceeeeeehhhee-eeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHh----ccCCCC Q lcl|NC_010583. 220 VDASKFGTDETVGDEVKGQLTEISFKTYKLA-AKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFM----SGNGTG 294 (458) Q Consensus 220 v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~-~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l----~G~g~~ 294 (458) ..-+... .....+.+++++....+- .-..|.+-=--++.+|+.+.+.+++++++++..|+.++ .|.... T Consensus 70 ~~~g~~l------~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~ 143 (334) T protein:vir:80 70 RKAGEEL------VVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFL 143 (334) T ss_pred ecCCCCC------CCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Confidence 4444332 223344455555554421 11222222222356789999999999999999999765 232222 Q ss_pred cc--------ccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcc-----cceeEechhHHHHHHhhhccc Q lcl|NC_010583. 295 QP--------KGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLK-----LSKLVLIVSMDAYYDLLEDEE 361 (458) Q Consensus 295 ~p--------~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~l~~~~d~~ 361 (458) .| .|+.......+. ..............+..+...+.....+ .-..+++|..+..|..-..-- T Consensus 144 ~~~~~~~~~~~G~~~~~~~~g~----~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~ 219 (334) T protein:vir:80 144 APAHLKPAFHDGILLPSTISGL----AADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLM 219 (334) T ss_pred ccccccccccCCcceeeccccc----ccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccc Confidence 11 133322222111 1111122222233444455555554444 233466887777665432111 Q ss_pred cccccccccccccccccCCeeecccceecccccccccC-----Cc-----------eEEEEEeceEEEEecceeEEeecc Q lcl|NC_010583. 362 WQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAAS-----AE-----------FAVIVYKDNFVMPRQRAVTVERER 425 (458) Q Consensus 362 ~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~-----~~-----------~~~~~~~~~~~i~~~~~~~i~~~~ 425 (458) .+-+...+.......+..++++|.||+.|+.+|..... .. ..++...+.+......+++.+... T Consensus 220 n~d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~ 299 (334) T protein:vir:80 220 NVEFGAKEGGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWE 299 (334) T ss_pred cceeccccccccccceeEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeee Confidence 11000001111223445678999999999999954211 00 111222334444455555433321 Q ss_pred cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 426 QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 426 ~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .-.+-...+.+..-+|.++++|++.+.+++..- T Consensus 300 ~~~~~~d~i~~~~a~G~g~lRPeaa~vv~~~~~ 332 (334) T protein:vir:80 300 EKKDFGHYLDTFQSYNIGQRRPDAVAVHDITVT 332 (334) T ss_pred chhhHHHHHHHHHHcCCceeccceEEEEEEeee Confidence 111222233445567999999999999888887 No 132 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.14 E-value=1.8e-12 Score=85.04 Aligned_cols=288 Identities=10% Similarity=-0.012 Sum_probs=149.2 Q ss_pred HHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceee---eccCceEEEEecC Q lcl|NC_010583. 137 LVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELP---MSSKILTMLVEPE 213 (458) Q Consensus 137 ~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~~~~p~~~~ 213 (458) .+|.+.+-.. +. +++.-...||+.++..|++.+...+...++++..+ ..+...++|+.. T Consensus 1 ~~~~~~~~~~-------------~~----~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g- 62 (341) T protein:vir:94 1 MALGNTITGP-------------SI----NTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS- 62 (341) T ss_pred Ccchhhhccc-------------cc----cchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC- Confidence 1111111000 00 11112236799999999999998888888765433 234567788754 Q ss_pred CCcccccccccccccccccccccccceeeeeehhhe-eeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccC- Q lcl|NC_010583. 214 AGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKL-AAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGN- 291 (458) Q Consensus 214 ~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~- 291 (458) .+.+....++... +-.+.+-..++++..+. ..-+.|++.-..++..++...+.++.++++++.+|..++.-- T Consensus 63 ~~~~~d~~~~~~i------~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a 136 (341) T protein:vir:94 63 ELGVEDKATDVPV------GVQPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRA 136 (341) T ss_pred cceeeeecCCCcc------ccccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 4444333333322 22233334455554332 333566665555567899999999999999999999877421 Q ss_pred -CCCccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhccc--ceeEechhHHHHHHhhhcccccccccc Q lcl|NC_010583. 292 -GTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKL--SKLVLIVSMDAYYDLLEDEEWQDVAQV 368 (458) Q Consensus 292 -g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~~d~~~~~~~~~ 368 (458) +++.+.+... .. .............++.++.+...+.....+. -..+++|..+..|.+...-..... T Consensus 137 ~~~~~~~~~~~----~~---~~~~~t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~--- 206 (341) T protein:vir:94 137 AVQNTASQNVF----SS---SNGAITGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDF--- 206 (341) T ss_pred hccccccCccc----cC---ccccccCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhc--- Confidence 1122111100 00 0000011112234455666666666654432 234668888877754221111111 Q ss_pred ccccccccccCCeeecccceecccccccccCCceEEE----------------------EEeceE--EEEeccee----- Q lcl|NC_010583. 369 GNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVI----------------------VYKDNF--VMPRQRAV----- 419 (458) Q Consensus 369 ~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~----------------------~~~~~~--~i~~~~~~----- 419 (458) ........|..++|+|.+|+.|+.+|........... +|++.+ .++-+..+ T Consensus 207 ~g~~~l~~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~ 286 (341) T protein:vir:94 207 INNAPIAQGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVM 286 (341) T ss_pred cccchhheeeeeeEeceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceee Confidence 1122345666789999999999999964321110000 011100 00000000 Q ss_pred --------------EEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 420 --------------TVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 420 --------------~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ....+....+....+++..=+|.++++|++.|.++.+++ T Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~~~ 339 (341) T protein:vir:94 287 CHMDWAAAVVSKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTTGD 339 (341) T ss_pred ecchhhhccccccccccccchhhhhhhhhhhhhhhcccccCcceeEEEecCcC Confidence 000111111223445566678999999999999999999 No 133 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.13 E-value=2.5e-12 Score=84.18 Aligned_cols=299 Identities=12% Similarity=0.061 Sum_probs=152.2 Q ss_pred HHhhhccchhHHHHHHHHhhhhhcccccccCc---cccchhHHHHHHHHHHhccchhhhcceeeec-cCceEEEEecCCC Q lcl|NC_010583. 140 LSYMMEKDVFETEHGKAHIKAVNGSSSVSMSS---EAYETIFSTRIIRDLQKELVVGALFDELPMS-SKILTMLVEPEAG 215 (458) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~---~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~ 215 (458) ......+.... . .......+|. +.| +.|..+|...+...+.++.+.++..+. +....+|+.. .. T Consensus 1 ma~~~~~~~~~-------t---~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG-~~ 68 (347) T protein:vir:94 1 MANMNGGQQMG-------K---DQGKGMSAGDKLALFL-KVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLG-RT 68 (347) T ss_pred CCccccccccc-------c---ccccCCcccchHHHHH-HHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeecc-ce Confidence 00000000000 0 0000111111 344 789999999999999999998775544 4556666543 33 Q ss_pred cccccccccccccccccccccccceeeeeehhhe--eeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhc---- Q lcl|NC_010583. 216 RATWVDASKFGTDETVGDEVKGQLTEISFKTYKL--AAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMS---- 289 (458) Q Consensus 216 ~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~--~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~---- 289 (458) .+.....|..... +..+++.+++++....+ .. ..|.+-=--++.+++.+.+..++++++++..|+.++. T Consensus 69 ~~~~~~~G~~l~~----~~~~~~~~e~~ltID~~~y~~-~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~ 143 (347) T protein:vir:94 69 KAAYLQPGENLDD----KRKDMKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAK 143 (347) T ss_pred eEeeeecCcCCCC----CcCCccccceEEEEcchhhhh-hhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4444444432211 11245666655554443 22 1222222223567899999999999999999998762 Q ss_pred cCC----CC-cccccccccccccc-ceeeccccchhhHHHHHHHHHHHhhhhhhhcccc-eeEe-chhHHHHHHhhhccc Q lcl|NC_010583. 290 GNG----TG-QPKGLLKLAADDGA-KVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLS-KLVL-IVSMDAYYDLLEDEE 361 (458) Q Consensus 290 G~g----~~-~p~Gi~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~l~~~~d~~ 361 (458) +.. +. .+.|.......... ...............+..+.++...+.....+.. .|++ .|..+..|.+..+.. T Consensus 144 ~a~~~~~~~~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~ 223 (347) T protein:vir:94 144 LCNLPTANNENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPN 223 (347) T ss_pred hhccccccccccccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhccc Confidence 211 11 12221111000000 0000111112233445667777777777666533 3433 577665554432222 Q ss_pred cccccccccccccccccCCeeecccceecccccccccCCce------------------------------EEEEEeceE Q lcl|NC_010583. 362 WQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEF------------------------------AVIVYKDNF 411 (458) Q Consensus 362 ~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~------------------------------~~~~~~~~~ 411 (458) .... ........|..+++.|.+|+.|+++|....+... .+++..+.+ T Consensus 224 ~~~~---~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~ 300 (347) T protein:vir:94 224 AANY---QALIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAV 300 (347) T ss_pred cccc---ccccccccceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhh Confidence 1111 1112234466788999999999999853211100 011112222 Q ss_pred EEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 412 VMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 412 ~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..+.-.+++++...........+.+..-+|.++++|++.+.+++..| T Consensus 301 ~tv~~~~~~~e~~~~~~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 301 GTVKLKDMALERARRANFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hhhhhcccceeeeechhhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 23333444444332233334466777789999999999998888888 No 134 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.12 E-value=1e-11 Score=80.90 Aligned_cols=299 Identities=9% Similarity=-0.018 Sum_probs=151.7 Q ss_pred HHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeecc-CceEEEEecCCC Q lcl|NC_010583. 137 LVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSS-KILTMLVEPEAG 215 (458) Q Consensus 137 ~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~ 215 (458) ....+. ..+.....+ .+.-.+.| ++|..+|...+...+.++.+.++.++.+ ....+|+. +.. T Consensus 1 ms~~~~--------------~tr~~~~~s-~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~ 63 (335) T protein:vir:63 1 MSFLND--------------LTRPNYAGK-NADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNV 63 (335) T ss_pred CCCccc--------------chhhhcccc-cchhheeh-hhhhhhHHHHHHhhhhhccccceeeeccceeEEEeee-eee Confidence 000000 000000111 11112444 7899999999999999999987766654 45566666 444 Q ss_pred cccccccccccccccccccccccceeeeeehhhee-eeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHh----cc Q lcl|NC_010583. 216 RATWVDASKFGTDETVGDEVKGQLTEISFKTYKLA-AKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFM----SG 290 (458) Q Consensus 216 ~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~-~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l----~G 290 (458) .+.+..-|... ..+.+..++.++....+- +-..|.+----++.+|+.+.+..++++++++..|+.++ .+ T Consensus 64 ~~~~~~pG~~l------~~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~a 137 (335) T protein:vir:63 64 EAKGRRAGEEL------ERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKA 137 (335) T ss_pred eeecccCCcCc------CCCCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 44444434322 222344455555555432 11222222222356899999999999999999999764 44 Q ss_pred CCCCccccccccc--cccccceeeccccchhhHHHHHHHHHHHhhhhhhhccc-----ceeEechhHHHHHHhhhccccc Q lcl|NC_010583. 291 NGTGQPKGLLKLA--ADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKL-----SKLVLIVSMDAYYDLLEDEEWQ 363 (458) Q Consensus 291 ~g~~~p~Gi~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~l~~~~d~~~~ 363 (458) .....|.++-... ........++.+...........+..+...+.....+. -..+++|..+..|.....--.+ T Consensus 138 a~~~a~~~~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~ 217 (335) T protein:vir:63 138 AAMDAPVDLEDAFSPGVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNV 217 (335) T ss_pred ccccCccccCCCcCCCcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcccccccc Confidence 3322221111100 00000011111111122233334555555666555442 3456787777666543221111 Q ss_pred cccccccccccccccCCeeecccceecccccccccCCc----------------eEEEEEeceEEEEecceeEEeecccc Q lcl|NC_010583. 364 DVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAE----------------FAVIVYKDNFVMPRQRAVTVERERQA 427 (458) Q Consensus 364 ~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~----------------~~~~~~~~~~~i~~~~~~~i~~~~~~ 427 (458) .+.......+...+....++|.||+.++.+|....... ..+++-.+.+..+.-.+++.+..... T Consensus 218 ~~~~s~~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~ 297 (335) T protein:vir:63 218 EYQATGATNDYVKSRVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDN 297 (335) T ss_pred ccccccccccccCceeEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeecc Confidence 11001111234556667899999999999995432111 11111123344444444443322111 Q ss_pred cCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 428 GKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 428 ~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+-...+.+..-+|.++.+|++.+.++++.. T Consensus 298 ~~~~~~i~~~~a~G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:63 298 EKFSWVLDTFQMYNIGARRPDTAGAIELKGI 328 (335) T ss_pred chhhHHhHHHHHcCCcccccceEEEEEEcCC Confidence 2222344555568999999999999998877 No 135 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.11 E-value=1.1e-11 Score=80.66 Aligned_cols=293 Identities=10% Similarity=0.013 Sum_probs=152.1 Q ss_pred HHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeecc-CceEEEEecCCC Q lcl|NC_010583. 137 LVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSS-KILTMLVEPEAG 215 (458) Q Consensus 137 ~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~ 215 (458) ....+. ..+...+.+ .+.-.+.| +.|..+|...+...+.++.+.++..+.+ ....+|+. +.. T Consensus 1 ms~~~~--------------~t~~~~~~s-~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~ 63 (335) T protein:vir:78 1 MSFLND--------------LTRPNYAGK-NADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNV 63 (335) T ss_pred CCcccc--------------ccccccccc-cchhhhhh-hhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-eee Confidence 000000 000000111 11113445 7899999999999999999987766554 45567755 444 Q ss_pred cccccccccccccccccccccccceeeeeehhhee-eeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHh----cc Q lcl|NC_010583. 216 RATWVDASKFGTDETVGDEVKGQLTEISFKTYKLA-AKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFM----SG 290 (458) Q Consensus 216 ~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~-~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l----~G 290 (458) .+.+..-|... ..+.+..++.++....+- .-..|.+-=--++.+|+.+.+..++++++++..|+.++ .+ T Consensus 64 ~~~~~~pG~~l------~~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~a 137 (335) T protein:vir:78 64 EAKGRRAGEEL------ERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKA 137 (335) T ss_pred eecccccCccc------CCCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 44444434322 222344455555555432 11122222222356899999999999999999999765 33 Q ss_pred CCCCccc--------cccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcc-----cceeEechhHHHHHHhh Q lcl|NC_010583. 291 NGTGQPK--------GLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLK-----LSKLVLIVSMDAYYDLL 357 (458) Q Consensus 291 ~g~~~p~--------Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~l~~~ 357 (458) .....|. |+..... .++.+...+.......+..+...+.....+ .-..+++|..+..|... T Consensus 138 a~~~a~~~~~~~~~~G~~~~~~------~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~ 211 (335) T protein:vir:78 138 AAMDAPVDLEDAFSPGVLEKLD------LTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEH 211 (335) T ss_pred cccccccccCCCcCCCcceeee------eccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcc Confidence 3322221 2221111 111122222333334444444445544443 23567888888777653 Q ss_pred hccccccccccccccccccccCCeeecccceecccccccccCCc----------------eEEEEEeceEEEEecceeEE Q lcl|NC_010583. 358 EDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAE----------------FAVIVYKDNFVMPRQRAVTV 421 (458) Q Consensus 358 ~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~----------------~~~~~~~~~~~i~~~~~~~i 421 (458) ..--.+.+.......+...+....++|.||+.++.+|....... ..+++-.+.+..+.-.++.. T Consensus 212 ~~l~n~~~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~ 291 (335) T protein:vir:78 212 DKLMSVEYQATGATNDYVKSRVAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQA 291 (335) T ss_pred cccccccccccccccccccceeEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEeccc Confidence 22111111111111234556677899999999999995421111 11122223344444444433 Q ss_pred eecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 422 ERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 422 ~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +....-..-...+.+..-+|.++++|++.+.+++... T Consensus 292 e~~~~~~~~~~~i~~~~a~G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:78 292 KLWEDHDQFSWVLDTFQMYNIGARRPDTAGAIELKGI 328 (335) T ss_pred ceeeccchhhHhhhHHHHcCCcccCcceEEEEEecCC Confidence 3221112222344555568999999999999999888 No 136 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.10 E-value=1e-11 Score=80.81 Aligned_cols=298 Identities=12% Similarity=0.061 Sum_probs=154.9 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccC-----ccccchhHHHHHHHHHHhccchhhhcceeeecc-CceEEEEecCCCc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMS-----SEAYETIFSTRIIRDLQKELVVGALFDELPMSS-KILTMLVEPEAGR 216 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g-----~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~ 216 (458) +..-.. .....+.+....+| .+.| +.|..+|...+...+.++.+.++..+.+ ....+|+. +..+ T Consensus 1 ~~~~~~--------~~~~~~~~~~~~~~~~~~~al~l-e~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G~~~ 70 (345) T protein:vir:22 1 MASMTG--------GQQMGTNQGKGVVAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQ 70 (345) T ss_pred Cccccc--------chhcccccccccccCCchhHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-cceE Confidence 000000 00000011111111 1233 7889999999999999999988777664 45567765 4444 Q ss_pred cccccccccccccccccccccccee--eeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhc----c Q lcl|NC_010583. 217 ATWVDASKFGTDETVGDEVKGQLTE--ISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMS----G 290 (458) Q Consensus 217 a~~v~e~~~~~e~~~~~~~~~~f~~--v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~----G 290 (458) +.....|..... +..++..++ ++++..++..+ .|.+-=--++.+++.+.+.+++++++++..|+.++. + T Consensus 71 ~~~~~~G~~l~~----~~~~~~~~e~~ltID~~~y~~~-~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~ 145 (345) T protein:vir:22 71 AAYLAPGENLDD----KRKDIKHTEKVITIDGLLTADV-LIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGL 145 (345) T ss_pred EEeeecCCCCCC----CCCCcccceEEEEecchhhhhh-hHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 444444433211 112345566 34444443322 222221223567899999999999999999998872 1 Q ss_pred CC-----CCcccccccccccccccee-eccccchhhHHHHHHHHHHHhhhhhhhcccc-ee-EechhHHHHHHhhhcccc Q lcl|NC_010583. 291 NG-----TGQPKGLLKLAADDGAKVV-TEAKADGSVLVTAKTISKLRRKLGRHGLKLS-KL-VLIVSMDAYYDLLEDEEW 362 (458) Q Consensus 291 ~g-----~~~p~Gi~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~l~~~~d~~~ 362 (458) .. ++.|.|+-........... ............+..+..+...+.....+.. .| +++|..+..|..-..... T Consensus 146 a~~~~~~~~~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~ 225 (345) T protein:vir:22 146 CNVESKYNENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNA 225 (345) T ss_pred hcccccccccccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccc Confidence 11 1223332211110000000 0111112233455666667667766655533 34 568877776643322221 Q ss_pred ccccccccccccccccCCeeecccceecccccccccCC---------------------------ceEEEEEeceEEEEe Q lcl|NC_010583. 363 QDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASA---------------------------EFAVIVYKDNFVMPR 415 (458) Q Consensus 363 ~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~---------------------------~~~~~~~~~~~~i~~ 415 (458) ... ........|..++++|.+|+.|+.+|....+. ...+++..+.+..+. T Consensus 226 ~~~---~~~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~ 302 (345) T protein:vir:22 226 ANY---AALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVK 302 (345) T ss_pred ccc---ccccccccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeee Confidence 111 12223345667789999999999988432110 011122223444445 Q ss_pred cceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 416 QRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 416 ~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..+++++....-.+-...+++..-+|.++++|++.+.++++-- T Consensus 303 ~~~~~~e~~r~~~~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 303 LRDLALERARRANFQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred eecceeeeeechhHHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 5554444322222223456777789999999999998877777 No 137 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.10 E-value=3.2e-12 Score=83.62 Aligned_cols=296 Identities=11% Similarity=0.046 Sum_probs=149.3 Q ss_pred HHHHHhhhccchhHHHHHHHHhhhhhcccccccC-ccccchhHHHHHHHHHHhccchhhhcceeeec-cCceEEEEecCC Q lcl|NC_010583. 137 LVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMS-SEAYETIFSTRIIRDLQKELVVGALFDELPMS-SKILTMLVEPEA 214 (458) Q Consensus 137 ~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g-~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~ 214 (458) ..+.+.+-.-... +..-+..+...- .+.| +.+..+|...+...+.++.+.++.... +....+|+.. . T Consensus 1 ~~~~~~~~~~~~~---------~~~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig-~ 69 (332) T protein:vir:78 1 MTTLSNFSLPNQA---------NGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTG-K 69 (332) T ss_pred CcccccccCCccc---------cCCccccccccchhhhh-hhhhhhHHHHHHHHhhhhhccccccccccceEEEEecc-c Confidence 1111111110000 000001111010 2444 788999999999999998888765544 4556666664 3 Q ss_pred Cccccccccccccccccccccccccee--eeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhc--- Q lcl|NC_010583. 215 GRATWVDASKFGTDETVGDEVKGQLTE--ISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMS--- 289 (458) Q Consensus 215 ~~a~~v~e~~~~~e~~~~~~~~~~f~~--v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~--- 289 (458) .+......+..... ..+++-++ ++++..++..+ .|.+-=-.++..++.+.+.++.++++++..|+.++. T Consensus 70 ~~~~~~~~g~~l~~-----~~~~~~~~~~l~ID~~ky~~~-~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~ 143 (332) T protein:vir:78 70 LSAGYHTPGTPIVG-----DAGIKANEKTLVMDDLLVSSQ-FVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLA 143 (332) T ss_pred eeEeeecCCCCCCC-----CCCCCCceEEEEEehhhhhHH-HHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33333222321111 11223333 34444344332 222211123567899999999999999999987762 Q ss_pred -cCCCCcc-ccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccc-eeEe-chhHHHHHHhhhccccccc Q lcl|NC_010583. 290 -GNGTGQP-KGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLS-KLVL-IVSMDAYYDLLEDEEWQDV 365 (458) Q Consensus 290 -G~g~~~p-~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~l~~~~d~~~~~~ 365 (458) +..++-| .|.... ..+..+.+...+....+..++++...+.....+.. .|++ .|..+..|.+..|. +.. T Consensus 144 ~aa~~~~~~~~~~g~-----~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~--~~~ 216 (332) T protein:vir:78 144 KASAEASPVTGEPGG-----FHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDT--NIL 216 (332) T ss_pred hhhcccCcccccccc-----cccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCc--eee Confidence 2212111 111110 11111111222334556677788888887776544 4444 67776666543332 111 Q ss_pred cc--ccccccccccc-CCeeecccceecccccccccCCc-------------------eEEEEEeceEEEEecceeEEee Q lcl|NC_010583. 366 AQ--VGNDAVKLQGQ-VGRIYGLPVVVSEYFPAKAASAE-------------------FAVIVYKDNFVMPRQRAVTVER 423 (458) Q Consensus 366 ~~--~~~~~~~~~~~-~~~l~G~pv~~~~~~~~~~~~~~-------------------~~~~~~~~~~~i~~~~~~~i~~ 423 (458) .. .+..+....+. .+++.|.+|+.|+.+|....... ..+++..+.+......++++.. T Consensus 217 n~~~~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~ 296 (332) T protein:vir:78 217 NREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQT 296 (332) T ss_pred eeeccccccceecceeeeEEeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhh Confidence 10 11112222222 46799999999999995321111 1112222334444444444432 Q ss_pred ---cccccCCceEEEEEEeeccEEecccceEEEEee Q lcl|NC_010583. 424 ---ERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYA 456 (458) Q Consensus 424 ---~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~a 456 (458) +....+-...+++...+|.++++|++.+.++.| T Consensus 297 t~~~~~~~~~~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 297 TSGDFNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred hhcccchhhhHhhhhhhhhhcCceecccceEEEeeC Confidence 111222234667777899999999999999988 No 138 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.08 E-value=1.2e-11 Score=80.49 Aligned_cols=302 Identities=11% Similarity=0.018 Sum_probs=150.8 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeec-cCceEEEEecCCCcccccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMS-SKILTMLVEPEAGRATWVD 221 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~ 221 (458) +.....- .....+...+......=...| +.|..+|...+...+.++.+.++-... +....+|+.... ++.... T Consensus 1 ~~~~~~~----~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~~-t~~~~~ 74 (347) T protein:vir:33 1 MANIQGG----QQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRT-KAAYLK 74 (347) T ss_pred CCCCccC----cccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccccccccceeEeeeccce-eeeeec Confidence 1100000 000000000000000001345 788999988899999999998765544 445566655433 333333 Q ss_pred cccccccccccccccccceeeeee--hhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHh-----ccCCCC Q lcl|NC_010583. 222 ASKFGTDETVGDEVKGQLTEISFK--TYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFM-----SGNGTG 294 (458) Q Consensus 222 e~~~~~e~~~~~~~~~~f~~v~~~--~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l-----~G~g~~ 294 (458) .+..... +..+++.+++++. ..++... .|.+-=--++..++.+.+..+.++++++..|+.++ .+.... T Consensus 75 ~g~~l~~----~~~~~~~~e~~ltiD~~~y~~~-~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~ 149 (347) T protein:vir:33 75 PGENLDD----KRKDIKHTEKVIHIDGLLTADV-LIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPD 149 (347) T ss_pred CCCCCCC----CCCCCccceEEEEechhhhhhH-HHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Confidence 3322111 1122344554443 3333221 22222222355789999999999999999999886 222221 Q ss_pred cccccc---cccccc---ccceeeccccchhhHHHHHHHHHHHhhhhhhhccc-cee-EechhHHHHHHhhhcccccccc Q lcl|NC_010583. 295 QPKGLL---KLAADD---GAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKL-SKL-VLIVSMDAYYDLLEDEEWQDVA 366 (458) Q Consensus 295 ~p~Gi~---~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~l~~~~d~~~~~~~ 366 (458) .+.+.. ...... ...+.............+..++++...+.....+. ..| ++.|..+..|........+.. T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~- 228 (347) T protein:vir:33 150 GSNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANY- 228 (347) T ss_pred ccccccccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccccccccc- Confidence 111100 000000 00011111111223344566777777777766542 333 557777666653222121111 Q ss_pred ccccccccccccCCeeecccceecccccccccCCc---------eE------------------EEEEeceEEEEeccee Q lcl|NC_010583. 367 QVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAE---------FA------------------VIVYKDNFVMPRQRAV 419 (458) Q Consensus 367 ~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~---------~~------------------~~~~~~~~~i~~~~~~ 419 (458) ........|..++++|.+|+.|+.+|....... .. +++..+.+......++ T Consensus 229 --~~~~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~ 306 (347) T protein:vir:33 229 --QALLDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDL 306 (347) T ss_pred --ccccccccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeece Confidence 112344566678899999999999986322110 00 1111122333444445 Q ss_pred EEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 420 TVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 420 ~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +++....-.+....+++...+|.++++|++.+.+++.-= T Consensus 307 ~~e~~r~~~~~~d~i~~~~~~G~~vlrP~~av~i~~~~~ 345 (347) T protein:vir:33 307 ALERARRANYQADQIIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred eeeeccchhhhhHhhhhhhhcCCceecccceEEEecCCC Confidence 555443333444566788888999999999988877665 No 139 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.06 E-value=9.2e-11 Score=75.60 Aligned_cols=303 Identities=14% Similarity=0.093 Sum_probs=154.1 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccC------ccccchhHHHHHHHHHHhccchhhhcceeeecc-CceEEEEecCCC Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMS------SEAYETIFSTRIIRDLQKELVVGALFDELPMSS-KILTMLVEPEAG 215 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g------~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~ 215 (458) +..-.. . ...+.+..+....| .+.| +.|..+|...+...+.++.+.++..+.+ ...++|+. +.. T Consensus 1 ~~~~~~------~-~~~~~n~~t~~~~~~~~~~~al~l-e~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~i-G~~ 71 (375) T protein:vir:10 1 MANANQ------V-ALGRSNLSTGTGYGGATDKYALYL-KLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYT-GRM 71 (375) T ss_pred Cccccc------c-ccCccccCCccccccccchHHHHH-HHHhHHHHHHHHHHHhhhccccccccccCceEEEEee-eee Confidence 000000 0 00001111111111 1233 7888999999999999999988766664 45556666 333 Q ss_pred ccccccccccccccccccccccccee--eeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhc---- Q lcl|NC_010583. 216 RATWVDASKFGTDETVGDEVKGQLTE--ISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMS---- 289 (458) Q Consensus 216 ~a~~v~e~~~~~e~~~~~~~~~~f~~--v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~---- 289 (458) ++....-|...... +..++...+ +++...++..+ .|.+-=--++..++.+.+.+++++++++..|+.++. T Consensus 72 t~~~~t~G~~i~~~---~~~d~~~te~~l~ID~~~y~~~-~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~k 147 (375) T protein:vir:10 72 TSSFHTPGTPILGN---ADKAPPVAEKTIVMDDLLISSA-FVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITR 147 (375) T ss_pred EEeeecCCcCcCCc---cccCCCCCceEEEecchhhhhh-hHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44333333221111 111222222 44444444322 222211223567899999999999999999997762 Q ss_pred cCCCCccccccccccccccce----eeccccchhhHHHHHHHHHHHhhhhhhhccc-cee-EechhHHHHHHhhhccccc Q lcl|NC_010583. 290 GNGTGQPKGLLKLAADDGAKV----VTEAKADGSVLVTAKTISKLRRKLGRHGLKL-SKL-VLIVSMDAYYDLLEDEEWQ 363 (458) Q Consensus 290 G~g~~~p~Gi~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~l~~~~d~~~~ 363 (458) |..+..|.+.-......+..+ ........+....+..+.++...+.....+. ..| +++|..+..|...+|.+.- T Consensus 148 aa~~~~p~~~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~ 227 (375) T protein:vir:10 148 GARSASPVSATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGL 227 (375) T ss_pred hhhhccccccccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccce Confidence 222222211110000000000 1111222234455667777777777766653 334 5577777666544443211 Q ss_pred cccccccccccccccCCeeecccceecccccccccCC------------------------------------------- Q lcl|NC_010583. 364 DVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASA------------------------------------------- 400 (458) Q Consensus 364 ~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~------------------------------------------- 400 (458) ........+....+..+++.|.+|+.|+.+|...... T Consensus 228 ~n~d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~ 307 (375) T protein:vir:10 228 VNRDVQGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELG 307 (375) T ss_pred eeecccccceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeecccccccccccccc Confidence 0001111222233445689999999999999532110 Q ss_pred --ceEEEEEeceEEEEecceeEEeec--c-cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 401 --EFAVIVYKDNFVMPRQRAVTVERE--R-QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 401 --~~~~~~~~~~~~i~~~~~~~i~~~--~-~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ...+++..+......-.+++++.. + ...+....+.+..=+|..+.+|++.|.|+..+. T Consensus 308 ~~~~~~~~~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~ 370 (375) T protein:vir:10 308 AKSCGLIFQKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGAT 370 (375) T ss_pred CceEEEEEchhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCcC Confidence 001111223344444455555432 1 234555667788889999999999999988866 No 140 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.06 E-value=9.7e-12 Score=80.95 Aligned_cols=278 Identities=10% Similarity=-0.013 Sum_probs=151.7 Q ss_pred hhhhhc-ccccccCcc------ccchhHHHHHHHHHHhccchhhhcceeee-ccCceEEEEecCCCcccccccccccccc Q lcl|NC_010583. 158 IKAVNG-SSSVSMSSE------AYETIFSTRIIRDLQKELVVGALFDELPM-SSKILTMLVEPEAGRATWVDASKFGTDE 229 (458) Q Consensus 158 ~~a~~~-~~~~~~g~~------~ip~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~~~p~~~~~~~a~~v~e~~~~~e~ 229 (458) ..+-+. .+...++.+ --|+.+.+.|.+.+.+.-+.-.+-+.+.. .++.+.+-..... +..+....++|+ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~---~~~~d~e~VaEg 77 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPS---FLEDDVADVAEF 77 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccc---cccCcHhhccCc Confidence 000000 111111222 12555556677777666555555544432 3444444332222 112334445666 Q ss_pred cccccccccceeeee-ehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccC---CCC---cccccccc Q lcl|NC_010583. 230 TVGDEVKGQLTEISF-KTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGN---GTG---QPKGLLKL 302 (458) Q Consensus 230 ~~~~~~~~~f~~v~~-~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~---g~~---~p~Gi~~~ 302 (458) +.++.+.+.++...+ ..+|.+..++||+|++..+..+..+....+++.+|.+..|+.++.-= ++. .+.++.+. T Consensus 78 gEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~~ 157 (318) T protein:vir:10 78 GEIPVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDNG 157 (318) T ss_pred ccccccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCCc Confidence 677777888877766 44699999999999999999999999999999999999999777421 110 00111100 Q ss_pred ccccccceeeccccchhhHHHHHHHHH-----------HHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccc Q lcl|NC_010583. 303 AADDGAKVVTEAKADGSVLVTAKTISK-----------LRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGND 371 (458) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~ 371 (458) ... ..+...+....+ ..... .-...+...+|||..+..|.+..+- +.++..... T Consensus 158 ~~~------------~~d~~~A~e~v~~a~~~~~~a~~~~~~~-~~GY~pdtIVlhP~~~~~l~~n~~~--~~~y~~~a~ 222 (318) T protein:vir:10 158 GKV------------RTDIAIAIEQISTAAPTAYPAGVGSSDE-YFGFIPDTIVMHYALLPILMDNENF--MKVYERNAN 222 (318) T ss_pred ccc------------cccchhhhhhhhhhhhhhhhhhhhhhhh-ccCccceeeEECHHHHHHHhcchhh--hhhhhccch Confidence 000 001010100000 00011 1123445689999999998654332 222211110 Q ss_pred ----ccc-ccccCCeeecccceecccccccccCCceEEEEEeceE-EEEecceeEEe--e----cccc-cCCceEEEEEE Q lcl|NC_010583. 372 ----AVK-LQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNF-VMPRQRAVTVE--R----ERQA-GKQRDAYYVTQ 438 (458) Q Consensus 372 ----~~~-~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~-~i~~~~~~~i~--~----~~~~-~~~~~~~~~~~ 438 (458) ... ....+++++|+.|+.+..+|.+. .++.....+ .+.+...++.. + +++- .+....+++.. T Consensus 223 ~~~~~~~~tg~~~g~~lGl~vi~s~~~p~~~-----alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~ 297 (318) T protein:vir:10 223 YVSTAPDWTGNFPGSVMGLNVIRSRTFPIDR-----VLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASH 297 (318) T ss_pred hhhhcccccccccceeeceEEeecCccCCCe-----eEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehhe Confidence 011 11225789999999999999632 233333322 23344444432 1 1121 24456777888 Q ss_pred eeccEEecccceEEEEeecC Q lcl|NC_010583. 439 RVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 439 r~d~~~~~~~afv~l~~aaa 458 (458) +-...|.+|+|++++|-=.+ T Consensus 298 ~~~~~V~~PkA~~~itgi~~ 317 (318) T protein:vir:10 298 KRALAVDQPKAALWLTGIVT 317 (318) T ss_pred eeeeeeeCcceeEEEeeccC Confidence 88899999999999998888 No 141 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.04 E-value=5.2e-11 Score=76.95 Aligned_cols=302 Identities=11% Similarity=0.032 Sum_probs=147.5 Q ss_pred HHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeec-cCceEEEEecCCCcc Q lcl|NC_010583. 139 LLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMS-SKILTMLVEPEAGRA 217 (458) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~a 217 (458) ..+ ...+... ..+...+......-...| +.|..+|...+...+.++.+.++-+.. +....+|+... .++ T Consensus 1 ma~-~~~~~~~-------~t~~~~~~~~~~~~a~~i-e~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~-~t~ 70 (347) T protein:vir:15 1 MAN-IQGGQQI-------GTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGR-TKA 70 (347) T ss_pred CCc-cccCCcc-------ccccccCCCcchHHHHHH-HHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccc-eee Confidence 000 0000000 000000000000001122 677888888888889889988765544 44556666544 333 Q ss_pred cccccccccccccccccccccceeeeee--hhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhcc----- Q lcl|NC_010583. 218 TWVDASKFGTDETVGDEVKGQLTEISFK--TYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSG----- 290 (458) Q Consensus 218 ~~v~e~~~~~e~~~~~~~~~~f~~v~~~--~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G----- 290 (458) .....+..... +..+++.+++++. ..++..+ .|.+-=--++..++.+.+.++.++++++..|+.++.= T Consensus 71 ~~~~~g~~l~~----~~~~~~~~e~~ltID~~~~~~~-~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~ 145 (347) T protein:vir:15 71 AYLKPGENLDD----KRKDIKHTEKVIHIDGLLTADV-LIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLV 145 (347) T ss_pred eeeccCCCCCC----CCCCCccceEEEEechhhhhhH-HhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 33333322111 1122345554443 3333321 2222222235678999999999999999999988721 Q ss_pred ---CCCCcccccccccccccccee---eccccchhhHHHHHHHHHHHhhhhhhhcc-cceeE-echhHHHHHHhhhcccc Q lcl|NC_010583. 291 ---NGTGQPKGLLKLAADDGAKVV---TEAKADGSVLVTAKTISKLRRKLGRHGLK-LSKLV-LIVSMDAYYDLLEDEEW 362 (458) Q Consensus 291 ---~g~~~p~Gi~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~~l~~~~d~~~ 362 (458) ..+..+.+............. ............++.+.++...+.....+ ...|+ +.|..+..|..-.+... T Consensus 146 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~ 225 (347) T protein:vir:15 146 NLPDASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNA 225 (347) T ss_pred hccccccccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhccccccc Confidence 011111111100000000001 11111111223344555555556555543 23454 46777766654322221 Q ss_pred ccccccccccccccccCCeeecccceecccccccccCC---------ce------------------EEEEEeceEEEEe Q lcl|NC_010583. 363 QDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASA---------EF------------------AVIVYKDNFVMPR 415 (458) Q Consensus 363 ~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~---------~~------------------~~~~~~~~~~i~~ 415 (458) ... ........|..++++|.+|+.|+.+|...... .. .+++..+.+.... T Consensus 226 ~d~---~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~ 302 (347) T protein:vir:15 226 ANY---QALIDHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVK 302 (347) T ss_pred ccc---cccccccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeE Confidence 111 12233456677889999999999999532211 00 1111122333444 Q ss_pred cceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 416 QRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 416 ~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+.++++.+..-.+....+++...+|.++++|++.+.+++.-= T Consensus 303 ~~~~~~e~~~~~~~~~d~i~~~~~~G~~vlrP~~av~~~~~~~ 345 (347) T protein:vir:15 303 LKDLALERARRANYQADQIIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred eeceeeeecccchhhhhhhehhhhcCCceeccccEEEEecCCC Confidence 4555555544434445666888889999999999888877655 No 142 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.00 E-value=2.6e-11 Score=78.58 Aligned_cols=303 Identities=13% Similarity=0.040 Sum_probs=147.5 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeecc-CceEEEEecCCCcccccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSS-KILTMLVEPEAGRATWVD 221 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~ 221 (458) +..-.. .............+....=.+.| +.|..+|...+...+.++.+.++..+.+ ....+|+. +...+.... T Consensus 1 ma~~~~---~~~~n~~~~~~~~~~~~~~al~i-e~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~i-G~~~~~~~~ 75 (344) T protein:vir:10 1 MANMTG---GQQLGTNQGKDVMAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLA 75 (344) T ss_pred Cccccc---cccCCcccCCccCCccchhHHHH-HHHHHHHHHHHHHHhhhcccceeeeecccceEEEEee-ceeEEEeee Confidence 000000 00000000000000000011234 7889999999999999999988777664 45567766 333444444 Q ss_pred cccccccccccccccccceeeeeehhh--eeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCC-- Q lcl|NC_010583. 222 ASKFGTDETVGDEVKGQLTEISFKTYK--LAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMS----GNGT-- 293 (458) Q Consensus 222 e~~~~~e~~~~~~~~~~f~~v~~~~~k--~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~----G~g~-- 293 (458) .|...... ..++.-+++++...+ +..+ .|.+-=--++.+++.+.+..++++++++..|+.++. +... T Consensus 76 ~G~~l~~t----~~~~~~~e~~l~ID~~~y~~~-~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~ 150 (344) T protein:vir:10 76 PGENLDDI----RKDIKHTEKVITIDGLLTADV-LIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVES 150 (344) T ss_pred cCCCCCCC----CCCcccceEEEEEcchhhhhh-hhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Confidence 34332211 113344554444333 3221 222211223567899999999999999999987752 2211 Q ss_pred ---Cccccccccccccccc-eeeccccchhhHHHHHHHHHHHhhhhhhhcccc-ee-EechhHHHHHHhhhccccccccc Q lcl|NC_010583. 294 ---GQPKGLLKLAADDGAK-VVTEAKADGSVLVTAKTISKLRRKLGRHGLKLS-KL-VLIVSMDAYYDLLEDEEWQDVAQ 367 (458) Q Consensus 294 ---~~p~Gi~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~l~~~~d~~~~~~~~ 367 (458) ..|.|.-+........ ..............++.+.++...+.....+.. .| +++|..+..|..-..-.... T Consensus 151 ~~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~--- 227 (344) T protein:vir:10 151 QYNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAAN--- 227 (344) T ss_pred ccccccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccc--- Confidence 1222221111110000 011112222223445566666777776665433 44 45887777664322211111 Q ss_pred cccccccccccCCeeecccceecccccccccCCceE--------------------------EEEEeceEEEEecceeEE Q lcl|NC_010583. 368 VGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFA--------------------------VIVYKDNFVMPRQRAVTV 421 (458) Q Consensus 368 ~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~--------------------------~~~~~~~~~i~~~~~~~i 421 (458) .+.......|..++++|.||+.|+.+|......... +++..+.+......++++ T Consensus 228 ~~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~ 307 (344) T protein:vir:10 228 YAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLAL 307 (344) T ss_pred cccccceeeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhcccee Confidence 112233455667789999999999998532111100 011111222333344444 Q ss_pred eecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 422 ERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 422 ~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +....-.+-...+++..-+|.++++|++.+.++++.= T Consensus 308 e~~r~~~~~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 308 ERARRANFQADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred ecccchhHHHHHHHHHhhcccceecccceEEEEeecC Confidence 3322112222355677789999999998855555444 No 143 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.97 E-value=1.7e-10 Score=74.19 Aligned_cols=272 Identities=14% Similarity=0.049 Sum_probs=147.9 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhh---------hcceee--eccCceEEEEecCC-Ccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGA---------LFDELP--MSSKILTMLVEPEA-GRATWVDASKFGTDE 229 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~---------~~~~~~--~~~~~~~~p~~~~~-~~a~~v~e~~~~~e~ 229 (458) +. ++.....++|+.|..-+.......+.+.+ +..... .++...++|.+..- ..+.-+.++ T Consensus 1 MA--~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~------ 72 (324) T protein:vir:59 1 MA--YTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDT------ 72 (324) T ss_pred CC--ceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCC------ Confidence 11 22334578898887766555544444422 222222 23446678877542 333323333 Q ss_pred cccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccc Q lcl|NC_010583. 230 TVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAK 309 (458) Q Consensus 230 ~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~ 309 (458) +..+....+.++-....+..+.-..++++...-+.-+....|.++++....+..+..+|.- ..|++........+ T Consensus 73 ~~i~~~~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~-----l~g~~~~~~~~~~~ 147 (324) T protein:vir:59 73 DDLVPQKINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAE-----LAGVFSNDDMKDNK 147 (324) T ss_pred cccchhhcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhhhccccccce Confidence 3333444454554445555555556777655555567888899999999999999887742 11222111111100 Q ss_pred eeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeeccccee Q lcl|NC_010583. 310 VVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVV 389 (458) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~ 389 (458) ... ..+....++...+.+....+.........|+||+.++..|.++.--+. +. . ......-++++|++|++ T Consensus 148 -~dv-sa~~~~~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~--~~-~----s~~~~~i~~~~G~~Viv 218 (324) T protein:vir:59 148 -LDI-SGTADGIYSAETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLIEF--VK-D----SQSGIRFPTYMNKRVIV 218 (324) T ss_pred -eee-eccccceecHHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhhhh--cc-c----cccCceeeeecccEEEE Confidence 011 111122345566777777777777778899999999999886542221 11 0 11122346799999999 Q ss_pred cccccccc-c--CCceEEEEEece-EEEEe-cceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 390 SEYFPAKA-A--SAEFAVIVYKDN-FVMPR-QRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 390 ~~~~~~~~-~--~~~~~~~~~~~~-~~i~~-~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ++.||... + ......+.++.. +.+.. ...+.++.++....+...+....++ ++||..|..-..+.+ T Consensus 219 dD~~p~~~~~~~~~~y~s~l~~~GAi~~~~~~~~v~vE~dRd~~~g~~~l~~r~~~---~~~p~G~s~~~~~~~ 289 (324) T protein:vir:59 219 DDSMPVETLEDGTKVFTSYLFGAGALGYAEGQPEVPTETARNALGSQDILINRKHF---VLHPRGVKFTENAMA 289 (324) T ss_pred eCCCCccccCCCCceEEEEEEecCeEEEeecCCCcceecccCccccceEEEEeeEE---EeEeeeEEecccccC Confidence 99999532 2 222333444443 33333 3446666666666677777766665 355555533222211 No 144 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=98.90 E-value=4e-10 Score=72.10 Aligned_cols=293 Identities=11% Similarity=0.002 Sum_probs=144.1 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceee--e-ccCceEEEEecCCCcccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELP--M-SSKILTMLVEPEAGRATW 219 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~--~-~~~~~~~p~~~~~~~a~~ 219 (458) +..-+.. +.-. ..+-.++.....+|+.++..+++.+.+.+.+..+++... . .+...++|+.. .+.+.. T Consensus 1 ~~~~~~~----~~~~----~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g-~~~a~d 71 (381) T protein:vir:80 1 MATIQGT----GGYK----GSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS-RAAVYD 71 (381) T ss_pred Cceeccc----cccc----CcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC-cceeee Confidence 1100000 0000 001111122347799999999999999888888765432 2 23456777754 445554 Q ss_pred cccccccccccccccccccceeeeeehhhee-eeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccC--CCCc- Q lcl|NC_010583. 220 VDASKFGTDETVGDEVKGQLTEISFKTYKLA-AKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGN--GTGQ- 295 (458) Q Consensus 220 v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~-~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~--g~~~- 295 (458) ..++... +-.+.+.+.++++..+.. .-++|++.-...+..++.+.+.+.++.++++..|+.++.-- .... T Consensus 72 ~~~g~~i------~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~ 145 (381) T protein:vir:80 72 KQPQTPV------NLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFP 145 (381) T ss_pred ecCCCcc------cccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 5444322 223444555555554432 23567776555667899999999999999999999887421 1111 Q ss_pred -cccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccc--eeEechhHHHHHHhhhcccccccccccccc Q lcl|NC_010583. 296 -PKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLS--KLVLIVSMDAYYDLLEDEEWQDVAQVGNDA 372 (458) Q Consensus 296 -p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~ 372 (458) +............. ............++..++++...+.....+.. ..+++|..+..|.+...-... ...... T Consensus 146 ~~~~~t~~~~i~~~~-~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~a---d~~~~~ 221 (381) T protein:vir:80 146 SQRIYSYDTTLGDGT-VNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISV---DFSQVK 221 (381) T ss_pred ccccccccccccccc-cccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhh---hhccch Confidence 11111001111111 11111122334566777888878777665432 467788888777543211111 111223 Q ss_pred ccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecceeEE-eecccccCCceEEEEEEeeccEEecc-cce Q lcl|NC_010583. 373 VKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTV-ERERQAGKQRDAYYVTQRVNLQRYFE-NGV 450 (458) Q Consensus 373 ~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i-~~~~~~~~~~~~~~~~~r~d~~~~~~-~af 450 (458) ....|..++|+|.+|+.|+.+|...........+-.... . ..+.- .+...+..+...++....+|.++... ..+ T Consensus 222 ~l~~G~Ig~i~G~~Vv~Sn~lp~~~~t~~~~~agap~~~--~--~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~ 297 (381) T protein:vir:80 222 PVTSGVVGTILGMEVIVTTQIGINSLTGYVNGQGAPTQP--T--PGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGL 297 (381) T ss_pred hhhceeeeEEcceEEEeecccccccccceeeeccccccc--c--ccccccccccccccceeeeeeeeeeceeeeeeeccc Confidence 345666789999999999999964332211111100000 0 00000 01111223334444444555544221 222 Q ss_pred EEEEeecC Q lcl|NC_010583. 451 VSGAYAAA 458 (458) Q Consensus 451 v~l~~aaa 458 (458) -..+.+.+ T Consensus 298 ~~~~g~~~ 305 (381) T protein:vir:80 298 PVFSGAGA 305 (381) T ss_pred eeeeccee Confidence 21111111 No 145 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.87 E-value=1.8e-10 Score=74.00 Aligned_cols=236 Identities=12% Similarity=0.092 Sum_probs=149.9 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeec-cCceEEEEecCCCcccccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMS-SKILTMLVEPEAGRATWVD 221 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~ 221 (458) +..-. ...-+-......+-|......||+.+...++|+....++... +....+.+.++-|.++|.. T Consensus 1 m~~~~-------------~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~fR~ 67 (328) T protein:vir:95 1 MAVKG-------------LTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSATWRL 67 (328) T ss_pred CCccc-------------cccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCceeee Confidence 00000 000000010122446667778999999999999999998885 3457788999989888865 Q ss_pred cccccccccccccccccceeeeeehhheeeeehhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccc Q lcl|NC_010583. 222 ASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAI--FSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGL 299 (458) Q Consensus 222 e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi 299 (458) =+ ...+.+++++.+++...+-+++.+.|.+.+.+... .++...-.....+++.......||+|+.+..|.++ T Consensus 68 lN------~g~~~s~~tt~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F 141 (328) T protein:vir:95 68 LN------YGVQPSKSTTVQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQF 141 (328) T ss_pred cC------CccCcccceeEEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhh Confidence 43 45567788999999999999999999999888753 33444455568899999999999999877666555 Q ss_pred ccccc-----------------ccccc--ee------------------------------------------------- Q lcl|NC_010583. 300 LKLAA-----------------DDGAK--VV------------------------------------------------- 311 (458) Q Consensus 300 ~~~~~-----------------~~~~~--~~------------------------------------------------- 311 (458) ..... ..+.. +. T Consensus 142 ~GL~~R~~~~s~~~a~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w 221 (328) T protein:vir:95 142 MGLSSRYSSLSAGNAQNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKW 221 (328) T ss_pred cchhhhcCccccccccceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEe Confidence 32211 00000 00 Q ss_pred --------------------eccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhh-cccccccccccc Q lcl|NC_010583. 312 --------------------TEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLE-DEEWQDVAQVGN 370 (458) Q Consensus 312 --------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-d~~~~~~~~~~~ 370 (458) .....+.......+.++.....++........|+||...+..|++.. +...-.+-.... T Consensus 222 ~~Gl~i~d~r~vvrI~NId~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~ 301 (328) T protein:vir:95 222 DNGLALRDWRYVVRIANIDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKET 301 (328) T ss_pred eeeeEEcCcccEEEEecCcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeecc Confidence 00001112334455667777777777788899999999999998753 332222221222 Q ss_pred ccccccccCCeeecccceecccccccccCCceEE Q lcl|NC_010583. 371 DAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAV 404 (458) Q Consensus 371 ~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~ 404 (458) .+. .+-.++|+||..++.+....+- ++ T Consensus 302 ~g~----~~t~~~gipir~~dai~~tE~~---vv 328 (328) T protein:vir:95 302 EGE----WWTSFRGVPIRETDALLETEAR---VV 328 (328) T ss_pred CCc----ceeEECCeEEEEEeeeecCccc---cC Confidence 111 2346889999998887643221 11 No 146 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.82 E-value=1.4e-09 Score=69.16 Aligned_cols=274 Identities=10% Similarity=0.007 Sum_probs=141.8 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhh---------hcceeeeccCceEEEEecCC-Ccccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGA---------LFDELPMSSKILTMLVEPEA-GRATWVDASKFGTDETV 231 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~---------~~~~~~~~~~~~~~p~~~~~-~~a~~v~e~~~~~e~~~ 231 (458) +. ++.....++|+.|..=+.+.....+.+.+ +......++...++|.+..- ..+.-+.++. . T Consensus 1 MA--~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~------~ 72 (351) T protein:vir:15 1 MA--ETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSD------D 72 (351) T ss_pred CC--ceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCc------c Confidence 11 22334578898886655454444343322 22222234667788887642 3333333333 3 Q ss_pred cccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccc-ce Q lcl|NC_010583. 232 GDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGA-KV 310 (458) Q Consensus 232 ~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~-~~ 310 (458) .+..+.+-++-....+..+--..++++...-+.-+....|.++++....+..+..+|.- ..|++........ +. T Consensus 73 i~~~kitt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~-----l~gv~~~~~~~~~~~~ 147 (351) T protein:vir:15 73 IDVNNLTSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSV-----LKGVMGVTKIANSKVY 147 (351) T ss_pred cchheecccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHhhchhhccccee Confidence 33333333333334444444456666654445557788899999999999999887742 1122111110000 00 Q ss_pred eeccccchhhHHHHHHHHHHHhhhhhhhc-ccceeEechhHHHHHHhhhccccccccccccccccccccCCeeeccccee Q lcl|NC_010583. 311 VTEAKADGSVLVTAKTISKLRRKLGRHGL-KLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVV 389 (458) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~ 389 (458) -..........+.+..+.+....+..... ....|+||+.++..|.++.--+. + +.. .....-++++|++|++ T Consensus 148 d~t~~~~~~~~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~~--~-~~s----~~~~~i~t~~G~~Viv 220 (351) T protein:vir:15 148 DQTKVSPSEPMFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIET--I-QPQ----NGATPFEAYNGLRIVL 220 (351) T ss_pred ccccccccccccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhhh--c-ccc----ccCcccceecceEEEE Confidence 00111122333555667777777655433 35889999999988886542111 1 110 1122347899999999 Q ss_pred cccccccccC---CceEEEEEeceEE-EEec-ceeEEeecccccCCceEEEEEEeeccEEecccceEEEEe--ec---C Q lcl|NC_010583. 390 SEYFPAKAAS---AEFAVIVYKDNFV-MPRQ-RAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAY--AA---A 458 (458) Q Consensus 390 ~~~~~~~~~~---~~~~~~~~~~~~~-i~~~-~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~--aa---a 458 (458) ++.+|....+ ..+..+.++.... +... ..+.+.+++....+...+....+ .++||..|..-+. .+ + T Consensus 221 dD~~p~~~~~~~~~~ytsyl~~~GAi~~~~~~~~ve~~rd~~~~~g~d~l~~r~~---~~~hp~G~s~~~~~~~~~~~s 296 (351) T protein:vir:15 221 DDDIEIDLTDKTKPVSTSYIFAPGAVRYSTNMRSTETKYDPLINGGQDVIVQKRV---GTIHVAGTSIKASFSPSKASF 296 (351) T ss_pred cCCCccccCCCCCceeEEEEEecceeeeecCCcCcceeecccCCCCceEEEEeee---eeeeeeeeeecccccccCcCC Confidence 9999964322 2233444544332 2222 23556666666556655554333 3467766654211 11 1 No 147 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.77 E-value=5.4e-10 Score=71.39 Aligned_cols=253 Identities=12% Similarity=0.043 Sum_probs=127.4 Q ss_pred hcceeeeccCceEEEEecCCCccccccccccccccccccccccccee--eeeehhheeeeehhhHHHHhccHHHHHHHHH Q lcl|NC_010583. 195 LFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTE--ISFKTYKLAAKSFITDETEEDAIFSLLPLLR 272 (458) Q Consensus 195 ~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~--v~~~~~k~~~~~~is~ell~ds~~~~~~~i~ 272 (458) +.+.+. ++...++|+... ..+....-|...... ..++.-++ +++...++..+ .|.+-=--++.+++.+.+. T Consensus 1 ~vr~i~-~g~s~~~~~iG~-~~~~~~~~G~~l~~~----~~~~~~~e~~itID~~l~~~~-~VdDiD~~qa~~Dlr~e~s 73 (324) T protein:vir:99 1 MTRTIT-SGKSAQFPVMGR-TKARYLKQGQSLDDG----REDIKHTEKVITIDGLLTTDV-LIYDIEDAMNHYDVRSEYS 73 (324) T ss_pred Ceeeee-cCceEEEeeeee-eEeccccCCCCcCCC----cCCcCcccEEEEecchhhhhh-hhhhHHHHhcCccchhHHH Confidence 444443 355677777733 333333333322110 01122233 34444444332 1222112235678999999 Q ss_pred HHHHHHHHHHHHHHHhc----cC--C---CCccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccc- Q lcl|NC_010583. 273 KRLIEAHAVSIEEAFMS----GN--G---TGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLS- 342 (458) Q Consensus 273 ~~la~~~~~~~d~~~l~----G~--g---~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 342 (458) +++++++++..|+.++. +. . ...|.+.......... ..............++.+.++...+.....+.. T Consensus 74 ~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~-~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~g 152 (324) T protein:vir:99 74 TQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKI-TGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGD 152 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecc-cccccccccCHHHHHHHHHHHHHHHhhcCCCCCC Confidence 99999999999987751 11 1 1111111110000000 001111122233445666666777776665432 Q ss_pred ee-EechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccC---------------------- Q lcl|NC_010583. 343 KL-VLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAAS---------------------- 399 (458) Q Consensus 343 ~~-~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~---------------------- 399 (458) .| +++|..+..|..-+...... ....+....|..++++|.+|+.|+++|..... T Consensus 153 R~~vv~P~~y~~Ll~~~~~~~~~---~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~ 229 (324) T protein:vir:99 153 RTFYTDPDTYSAILAALMPNAAN---YAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTT 229 (324) T ss_pred CEEEeChHHHHHHhhcccccccc---cccccceecceEEEEeceEEEecCCccccccccccccccccccccccccccccc Confidence 34 55777766553222211111 11223445566788999999999999963211 Q ss_pred --------CceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 400 --------AEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 400 --------~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ....+++..+.+..+....++++....-.+....+++..-+|.++++|++.+.+++.+. T Consensus 230 ~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~ 296 (324) T protein:vir:99 230 GKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDG 296 (324) T ss_pred cccccccCceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccC Confidence 11112222333444444555444332223334556777788999999999988877666 No 148 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.76 E-value=2.2e-09 Score=68.09 Aligned_cols=275 Identities=9% Similarity=0.055 Sum_probs=141.8 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhh---------hcceeeeccCceEEEEecCC-Ccccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGA---------LFDELPMSSKILTMLVEPEA-GRATWVDASKFGTDETV 231 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~---------~~~~~~~~~~~~~~p~~~~~-~~a~~v~e~~~~~e~~~ 231 (458) +....+.....++|+.|..-+.......+.+.+ +...+..++...++|.+..- ..+.-+.++. +. T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~-----~~ 75 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGD-----KA 75 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCc-----cc Confidence 222334445678999887655555544443322 22223345667788887632 3332233332 12 Q ss_pred cccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccc--- Q lcl|NC_010583. 232 GDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGA--- 308 (458) Q Consensus 232 ~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~--- 308 (458) .+....+-++-....++.+.-..++++..--+.-+....+.++++....+..+..++.- ..|+++....... T Consensus 76 i~~~ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~-----l~gvf~~~~~~~~~~~ 150 (330) T protein:vir:10 76 LETGKITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIAT-----LNGIFATGTAGEKGAL 150 (330) T ss_pred cchhhcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHH-----HHhhhhhhhcccchhh Confidence 22233333333334444444455666654445567788899999988888888776631 1122221100000 Q ss_pred -ceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccc Q lcl|NC_010583. 309 -KVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPV 387 (458) Q Consensus 309 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv 387 (458) .............+....+.+....+.........|+||+.++..|.+..--+. . +. ....+.-++++|++| T Consensus 151 ~~~~~~~~~~~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~--~-~~----s~~~~~i~~~~G~~V 223 (330) T protein:vir:10 151 EETHVSDQSKASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLIQY--I-QP----TTATINIPTYLGYRV 223 (330) T ss_pred hhhheecccccccccCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhhhh--h-cc----cccCcccccccceEE Confidence 000001111222344556666666776666678899999999988886432111 1 11 111233478999999 Q ss_pred eecccccccccCCceEEEEEece-EEEEec---ceeEEeecccccCCceEEEEEEeeccEEecccceEEEEee--cC Q lcl|NC_010583. 388 VVSEYFPAKAASAEFAVIVYKDN-FVMPRQ---RAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYA--AA 458 (458) Q Consensus 388 ~~~~~~~~~~~~~~~~~~~~~~~-~~i~~~---~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~a--aa 458 (458) ++++.+|.... ....+.++.. +.+.+. ..+.++.++....+...+....++ ++||..|..-... .+ T Consensus 224 ivdD~~p~~~~--~yt~yl~~~GAi~~~~~~~~~~v~~EtdRd~~~g~~~l~~r~~~---~~hp~G~s~~~~~~~~~ 295 (330) T protein:vir:10 224 IIDDGIAPTGD--IYTSYLFRTGSIGLNTGNPSGLTTFETSREAAKGNDMIYTRRAL---VMHPYGVKWTGAEVDAG 295 (330) T ss_pred EEeCCCCCCCC--ceeEEEEecCceeeecccCCccccccccCCccccceEEEEeeEE---EeeeeeeeecccccccC Confidence 99999985332 2333334432 333332 224555555555666666555554 4556665433211 11 No 149 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.71 E-value=4.7e-09 Score=66.25 Aligned_cols=292 Identities=10% Similarity=0.061 Sum_probs=140.9 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccchh----HHHHHHHHHH-hccchhhhcceeeeccCceEEEEecCCCcc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETI----FSTRIIRDLQ-KELVVGALFDELPMSSKILTMLVEPEAGRA 217 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~----~~~~ii~~~~-~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a 217 (458) +. -.+..+....- +..||+. |..++.-... ..+.|+..++......+... +.......+ T Consensus 1 ~~-------------~~~~~~~~~~M--s~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~-~~~~~~~~~ 64 (322) T protein:vir:10 1 MK-------------LNAIMSMLPLI--AGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHN-WETLASMDP 64 (322) T ss_pred Cc-------------ccceeeeeeee--echhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccc-eeecccccc Confidence 00 00000000000 1123444 4444444433 34556655543322223211 111111112 Q ss_pred cccccccccc---ccc-ccccccccce--eeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_010583. 218 TWVDASKFGT---DET-VGDEVKGQLT--EISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGN 291 (458) Q Consensus 218 ~~v~e~~~~~---e~~-~~~~~~~~f~--~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~ 291 (458) .-++.+.... +.+ ..|.....++ .+.+..+ ....+|.+.-+-....|..+...+..+.+++++.|..|+.+- T Consensus 65 ~~~~~~~~~~~~~d~~~dtp~~~~~~~~r~~~~~d~--~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~ 142 (322) T protein:vir:10 65 DAVKRKRSRQQSADGTYPTPVNNKPFAKRRTNVDTY--DTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGA 142 (322) T ss_pred cccccccccccccCcccCCCccccccceEEEeeccc--ccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhh Confidence 1122111110 110 0111112233 3444444 334567766666666789999999999999999999888642 Q ss_pred -CCCccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhccc--ceeEe-chhHHHHHHhhhccccccccc Q lcl|NC_010583. 292 -GTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKL--SKLVL-IVSMDAYYDLLEDEEWQDVAQ 367 (458) Q Consensus 292 -g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~-~~~~~~~l~~~~d~~~~~~~~ 367 (458) |.....+.-+......... ........++..++.+...+.....+. ..|++ .|..+..|..........+ T Consensus 143 ~g~a~~~~~gt~v~~~ss~~----i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~-- 216 (322) T protein:vir:10 143 WKPASIKGTGQPVEFLATQE----IGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADY-- 216 (322) T ss_pred hccccccccccccccCCCcc----cccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhc-- Confidence 2211100000000000000 011112344556777777777766663 24544 6666555443222111111 Q ss_pred cccccccccccCCeeecccceecccccccc-------------cCCceEEEEEeceEEEEecceeEEeeccccc-CCceE Q lcl|NC_010583. 368 VGNDAVKLQGQVGRIYGLPVVVSEYFPAKA-------------ASAEFAVIVYKDNFVMPRQRAVTVERERQAG-KQRDA 433 (458) Q Consensus 368 ~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~-------------~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~-~~~~~ 433 (458) .+.......|..++++|..++.++.+|..+ .+....+.+..+.+.+..+.++....+..-. .+... T Consensus 217 ~~~~~l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~~~a~~ 296 (322) T protein:vir:10 217 TSAMDLQSKGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSASFAWR 296 (322) T ss_pred ccchhhhhcCeeeeeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCcchhhh Confidence 111122234667899999999999998311 1112222333456666666555554332222 33456 Q ss_pred EEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 434 YYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 434 ~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +++..-+|.++++|+.+|.+.+.-| T Consensus 297 I~~~~~~Ga~ri~~~gVv~i~~~e~ 321 (322) T protein:vir:10 297 IYSAFTADCVRVEDEHIFKLRLKNS 321 (322) T ss_pred hhhhhhhCceEeccCcEEEEEEecc Confidence 6778889999999999999999999 No 150 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.70 E-value=9.7e-10 Score=69.99 Aligned_cols=236 Identities=12% Similarity=0.044 Sum_probs=142.7 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCc-eEEEEecCCCcccccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKI-LTMLVEPEAGRATWVD 221 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~ 221 (458) +..-. ...-+-......+-|......|++.+...++|++...++...++. ....+.++-|.++|-. T Consensus 1 m~~~~-------------~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR~ 67 (330) T protein:vir:10 1 MATLS-------------TNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRK 67 (330) T ss_pred CCcCC-------------CCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhhh Confidence 00000 000000000122345556668999999999999988887543322 2344556667766644 Q ss_pred cccccccccccccccccceeeeeehhheeeeehhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccc Q lcl|NC_010583. 222 ASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAI--FSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGL 299 (458) Q Consensus 222 e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi 299 (458) -+...+.+..++.+++.+.+-+++.+.|.+.+.+... -++...-.....+++.+.+...||+|+.+..|.++ T Consensus 68 ------lN~g~~~s~~tt~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F 141 (330) T protein:vir:10 68 ------LYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEF 141 (330) T ss_pred ------cCCccccccceEEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhc Confidence 3445667889999999999999999999999887643 33555566778999999999999999876666555 Q ss_pred ccccccc-----------------ccc--ee--------------------------------eccc------------- Q lcl|NC_010583. 300 LKLAADD-----------------GAK--VV--------------------------------TEAK------------- 315 (458) Q Consensus 300 ~~~~~~~-----------------~~~--~~--------------------------------~~~~------------- 315 (458) .....-. +.. +. .... T Consensus 142 ~GL~kR~~~~ta~~~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~ 221 (330) T protein:vir:10 142 TGLSPRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHY 221 (330) T ss_pred cchhhhcCCCCCCchhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeee Confidence 5321100 000 00 0000 Q ss_pred --------------------------cchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhh-hcccccccccc Q lcl|NC_010583. 316 --------------------------ADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLL-EDEEWQDVAQV 368 (458) Q Consensus 316 --------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~d~~~~~~~~~ 368 (458) .+......++.++.+...++...+...+|+||...+..|++. .+...-.+-.. T Consensus 222 ~w~~Gl~i~d~r~vvRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~ 301 (330) T protein:vir:10 222 KWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWE 301 (330) T ss_pred eeeeeeEEeCcccEEEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeee Confidence 000112344566667778887888889999999999999985 33332222111 Q ss_pred ccccccccccCCeeecccceecccccccccCCceEE Q lcl|NC_010583. 369 GNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAV 404 (458) Q Consensus 369 ~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~ 404 (458) ...+. ..-.++|+||..++.+....+-- + T Consensus 302 ~~~g~----~~t~~~gipir~~Dail~tE~~v---v 330 (330) T protein:vir:10 302 TVSGE----RVMTFDGIPVQRTDALLNTESRV---V 330 (330) T ss_pred ecCCe----eeEEECCeEEEEEeeeecCcccc---C Confidence 11111 12458899999988876432211 1 No 151 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.69 E-value=4e-09 Score=66.60 Aligned_cols=279 Identities=13% Similarity=0.019 Sum_probs=154.9 Q ss_pred hhcccccccCccccc--hhHHHHHHHHHHhccchhhhccee---eeccCceEEEEecCCCcccccccccccccccccccc Q lcl|NC_010583. 161 VNGSSSVSMSSEAYE--TIFSTRIIRDLQKELVVGALFDEL---PMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEV 235 (458) Q Consensus 161 ~~~~~~~~~g~~~ip--~~~~~~ii~~~~~~~~l~~~~~~~---~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~ 235 (458) +......++|.+++. +.+.+.|++...+....+++..+. +-......+.+.+..+.+.|++..+ ...+.. T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~-----~dip~v 75 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYT-----DDLPLV 75 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCc-----ccccee Confidence 111111122333332 234456666666655555554432 2222344555555545555554322 234556 Q ss_pred cccceeeeeehhheeeeehhhHHHHhcc---HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceee Q lcl|NC_010583. 236 KGQLTEISFKTYKLAAKSFITDETEEDA---IFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVT 312 (458) Q Consensus 236 ~~~f~~v~~~~~k~~~~~~is~ell~ds---~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~ 312 (458) +..+.......+.++..+.++.+=|+.+ ..++..--....+.++.+.+|+.+++|+..-...|+++.......+. T Consensus 76 ~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~-- 153 (296) T protein:vir:10 76 DALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVS-- 153 (296) T ss_pred eccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccc-- Confidence 6677777878888888888887766554 35688888888999999999999999987777889998765432221 Q ss_pred ccccchhhHHHHHHHHHHHhhhhh---hhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeeccccee Q lcl|NC_010583. 313 EAKADGSVLVTAKTISKLRRKLGR---HGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVV 389 (458) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~ 389 (458) ..+... ....+.++..++..+.. ....+..++++|..+.+|...-+..+...... . .....+.++.+.|... T Consensus 154 ~~~W~~-~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~-i---k~~~~~l~i~~~~~l~ 228 (296) T protein:vir:10 154 GGSWSQ-PTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGEF-F---RQNNSGVTVEFVQYLN 228 (296) T ss_pred cCCccC-HHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHHH-H---HHhcCCceEEEeeeec Confidence 111111 12445666666655443 34556678889998888865444333222211 1 1111223444444432 Q ss_pred cccccccccCCceEEEEE--eceEEEEecceeEEeecccccCCceEEEEEEeec-cEEecccceEEE---Eee Q lcl|NC_010583. 390 SEYFPAKAASAEFAVIVY--KDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVN-LQRYFENGVVSG---AYA 456 (458) Q Consensus 390 ~~~~~~~~~~~~~~~~~~--~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d-~~~~~~~afv~l---~~a 456 (458) . +++.+....++.. .+.+.+...+.++...- ....-...++...|++ ..+.+|.||+++ |+| T Consensus 229 ~----a~~~g~~~~v~~~~~~~~~~~~v~~~~~~~~~-e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 229 D----YNGTGTSAAIAYEKDPNNMAIEIPEATNALPA-QPKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred c----CCCCcceEEEEEEcCCceEEEEcCcceeeecc-cccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 2 1222233333332 22333444444444321 1223445667788885 799999999997 888 No 152 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.67 E-value=2.5e-09 Score=67.79 Aligned_cols=284 Identities=12% Similarity=0.058 Sum_probs=144.8 Q ss_pred hhcccccccC-ccccchhHHHHHHHHHHhccchhhhcceeeec-cCceEEEEecCCCccccccccccccccccccccccc Q lcl|NC_010583. 161 VNGSSSVSMS-SEAYETIFSTRIIRDLQKELVVGALFDELPMS-SKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQ 238 (458) Q Consensus 161 ~~~~~~~~~g-~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~ 238 (458) +.....++.+ .+++|+.++..|...+.+......++++.... +..++||.... ++..-..++.... ..+.+.+ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~-~tV~dY~~~~~i~----~d~ltt~ 75 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGT-PVVRSRPEQGDFT----FDNLDTG 75 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccccc-cccccccCCCCcc----cccCCCc Confidence 1112222222 35669999999988777777666666654433 44455554433 3322111221111 1111111 Q ss_pred ceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhc--cCCCCccccccccccccc--cceeecc Q lcl|NC_010583. 239 LTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMS--GNGTGQPKGLLKLAADDG--AKVVTEA 314 (458) Q Consensus 239 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~--G~g~~~p~Gi~~~~~~~~--~~~~~~~ 314 (458) =-.+.++..|+-++. |+++.. +...+|.+...++.+++++...|+.+.. -+|..+-.++-+.....+ ...+... T Consensus 76 ~~~l~IDq~KYfaf~-VdDD~~-Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~g 153 (322) T protein:vir:31 76 EISIILRDEVYAGNA-ISKKLR-QDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTG 153 (322) T ss_pred eEEEEEehhhhhccc-cchhHH-HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccC Confidence 124556666676654 788554 4567999999999999999999887632 112111000000000000 1111111 Q ss_pred ccchhhHHHHHHHHHHHhhhhhhhcc-cceeEe-chhHHHHHHh-------hhccccccccccccccccccccCCeeecc Q lcl|NC_010583. 315 KADGSVLVTAKTISKLRRKLGRHGLK-LSKLVL-IVSMDAYYDL-------LEDEEWQDVAQVGNDAVKLQGQVGRIYGL 385 (458) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~l~~-------~~d~~~~~~~~~~~~~~~~~~~~~~l~G~ 385 (458) + .....++.++++..++.....+ ...|++ .|.....|.. ++|.....+...+... .....++++|. T Consensus 154 t---~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~--g~~~Vg~~~GF 228 (322) T protein:vir:31 154 T---DQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAP--DMQFVRSVYGI 228 (322) T ss_pred C---CchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchh--hHHHHHHHhce Confidence 1 1223456777777777776655 356766 5777666633 3332211121221111 11125789999 Q ss_pred cceeccccccc----ccCCc--eEEEEEeceEE----------EEecceeEE---eecccccCCceEEEEEEeeccEEec Q lcl|NC_010583. 386 PVVVSEYFPAK----AASAE--FAVIVYKDNFV----------MPRQRAVTV---ERERQAGKQRDAYYVTQRVNLQRYF 446 (458) Q Consensus 386 pv~~~~~~~~~----~~~~~--~~~~~~~~~~~----------i~~~~~~~i---~~~~~~~~~~~~~~~~~r~d~~~~~ 446 (458) .|++|+.++.. .+|.. ...-+..+.+. +.-+..|.- .++++ +.-..+|+.+|+|.++.+ T Consensus 229 ~V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~--~~~d~~~~~~~~g~g~~r 306 (322) T protein:vir:31 229 DLFVSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDY--NDDLNTATTARWGNGLVR 306 (322) T ss_pred eeeeeccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCcc--ccccceeeeeeecceeec Confidence 99999988631 11111 11111111111 111222211 11222 334567899999999999 Q ss_pred ccceEEEEeecC Q lcl|NC_010583. 447 ENGVVSGAYAAA 458 (458) Q Consensus 447 ~~afv~l~~aaa 458 (458) |+..+.|.-.++ T Consensus 307 ~e~l~~~~a~~~ 318 (322) T protein:vir:31 307 DENLVCVLANAD 318 (322) T ss_pred ccceEEEEeccc Confidence 999999888877 No 153 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=98.62 E-value=8.1e-09 Score=64.96 Aligned_cols=237 Identities=9% Similarity=0.054 Sum_probs=141.5 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccch-hHHHHHHHHHHhccchhhhcceeeeccCc-eEEEEecCCCccccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYET-IFSTRIIRDLQKELVVGALFDELPMSSKI-LTMLVEPEAGRATWV 220 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v 220 (458) +..-. ...-+-......+-|. .+...|++.+...++|+....++...++. ..+.+.++-|.++|. T Consensus 1 m~~~~-------------~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR 67 (331) T protein:vir:10 1 MPTLS-------------TTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWR 67 (331) T ss_pred CCccc-------------cCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhh Confidence 11000 0000000001111132 24457999999999999999988765433 456777777887775 Q ss_pred ccccccccccccccccccceeeeeehhheeeeehhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccc Q lcl|NC_010583. 221 DASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAI--FSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKG 298 (458) Q Consensus 221 ~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~G 298 (458) .= +...+.+.+++.+++...+-+++.+.|.+.+.+... -++...-.....+++...+...||+|+.+..|.+ T Consensus 68 ~l------N~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~ 141 (331) T protein:vir:10 68 KL------NYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEK 141 (331) T ss_pred cc------CCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhh Confidence 43 345667889999999999999999999999888743 3345556666889999999999999986655555 Q ss_pred cccccc-----------------ccccc--ee------------------------------------------------ Q lcl|NC_010583. 299 LLKLAA-----------------DDGAK--VV------------------------------------------------ 311 (458) Q Consensus 299 i~~~~~-----------------~~~~~--~~------------------------------------------------ 311 (458) +..... ..+.. +. T Consensus 142 F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~ 221 (331) T protein:vir:10 142 FMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYK 221 (331) T ss_pred hccchhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEE Confidence 532111 00000 00 Q ss_pred ----------------------eccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhh-hcccccccccc Q lcl|NC_010583. 312 ----------------------TEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLL-EDEEWQDVAQV 368 (458) Q Consensus 312 ----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~d~~~~~~~~~ 368 (458) .......+.....+.++.+...++........|+||...+..|++. .+...-..... T Consensus 222 w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~ 301 (331) T protein:vir:10 222 WDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTM 301 (331) T ss_pred eeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeee Confidence 0000011112335556677777777777788999999999999875 33221111111 Q ss_pred ccccccccccCCeeecccceecccccccccCCceEE Q lcl|NC_010583. 369 GNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAV 404 (458) Q Consensus 369 ~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~ 404 (458) ..... -..-.++|+||..++.+....+- ++ T Consensus 302 ~~~~g---~~~t~~~gipir~~dai~~tE~~---Vv 331 (331) T protein:vir:10 302 EEIAG---KKVVAFDGIPCRRTDALLLTEAR---VV 331 (331) T ss_pred eecCC---cceeEECCeeEEEeeeeecCccc---cC Confidence 11110 11235889999988887643221 11 No 154 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=98.62 E-value=8.1e-09 Score=64.96 Aligned_cols=237 Identities=9% Similarity=0.054 Sum_probs=141.5 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccch-hHHHHHHHHHHhccchhhhcceeeeccCc-eEEEEecCCCccccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYET-IFSTRIIRDLQKELVVGALFDELPMSSKI-LTMLVEPEAGRATWV 220 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v 220 (458) +..-. ...-+-......+-|. .+...|++.+...++|+....++...++. ..+.+.++-|.++|. T Consensus 1 m~~~~-------------~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR 67 (331) T protein:vir:98 1 MPTLS-------------TTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWR 67 (331) T ss_pred CCccc-------------cCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhh Confidence 11000 0000000001111132 24457999999999999999988765433 456777777887775 Q ss_pred ccccccccccccccccccceeeeeehhheeeeehhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccc Q lcl|NC_010583. 221 DASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAI--FSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKG 298 (458) Q Consensus 221 ~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~G 298 (458) .= +...+.+.+++.+++...+-+++.+.|.+.+.+... -++...-.....+++...+...||+|+.+..|.+ T Consensus 68 ~l------N~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~ 141 (331) T protein:vir:98 68 KL------NYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEK 141 (331) T ss_pred cc------CCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhh Confidence 43 345667889999999999999999999999888743 3345556666889999999999999986655555 Q ss_pred cccccc-----------------ccccc--ee------------------------------------------------ Q lcl|NC_010583. 299 LLKLAA-----------------DDGAK--VV------------------------------------------------ 311 (458) Q Consensus 299 i~~~~~-----------------~~~~~--~~------------------------------------------------ 311 (458) +..... ..+.. +. T Consensus 142 F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~ 221 (331) T protein:vir:98 142 FMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYK 221 (331) T ss_pred hccchhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEE Confidence 532111 00000 00 Q ss_pred ----------------------eccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhh-hcccccccccc Q lcl|NC_010583. 312 ----------------------TEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLL-EDEEWQDVAQV 368 (458) Q Consensus 312 ----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~d~~~~~~~~~ 368 (458) .......+.....+.++.+...++........|+||...+..|++. .+...-..... T Consensus 222 w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~ 301 (331) T protein:vir:98 222 WDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTM 301 (331) T ss_pred eeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeee Confidence 0000011112335556677777777777788999999999999875 33221111111 Q ss_pred ccccccccccCCeeecccceecccccccccCCceEE Q lcl|NC_010583. 369 GNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAV 404 (458) Q Consensus 369 ~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~ 404 (458) ..... -..-.++|+||..++.+....+- ++ T Consensus 302 ~~~~g---~~~t~~~gipir~~dai~~tE~~---Vv 331 (331) T protein:vir:98 302 EEIAG---KKVVAFDGIPCRRTDALLLTEAR---VV 331 (331) T ss_pred eecCC---cceeEECCeeEEEeeeeecCccc---cC Confidence 11110 11235889999988887643221 11 No 155 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=98.62 E-value=8.1e-09 Score=64.96 Aligned_cols=237 Identities=9% Similarity=0.054 Sum_probs=141.5 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccch-hHHHHHHHHHHhccchhhhcceeeeccCc-eEEEEecCCCccccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYET-IFSTRIIRDLQKELVVGALFDELPMSSKI-LTMLVEPEAGRATWV 220 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v 220 (458) +..-. ...-+-......+-|. .+...|++.+...++|+....++...++. ..+.+.++-|.++|. T Consensus 1 m~~~~-------------~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR 67 (331) T protein:vir:10 1 MPTLS-------------TTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWR 67 (331) T ss_pred CCccc-------------cCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhh Confidence 11000 0000000001111132 24457999999999999999988765433 456777777887775 Q ss_pred ccccccccccccccccccceeeeeehhheeeeehhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccc Q lcl|NC_010583. 221 DASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAI--FSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKG 298 (458) Q Consensus 221 ~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~G 298 (458) .= +...+.+.+++.+++...+-+++.+.|.+.+.+... -++...-.....+++...+...||+|+.+..|.+ T Consensus 68 ~l------N~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~ 141 (331) T protein:vir:10 68 KL------NYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEK 141 (331) T ss_pred cc------CCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhh Confidence 43 345667889999999999999999999999888743 3345556666889999999999999986655555 Q ss_pred cccccc-----------------ccccc--ee------------------------------------------------ Q lcl|NC_010583. 299 LLKLAA-----------------DDGAK--VV------------------------------------------------ 311 (458) Q Consensus 299 i~~~~~-----------------~~~~~--~~------------------------------------------------ 311 (458) +..... ..+.. +. T Consensus 142 F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~ 221 (331) T protein:vir:10 142 FMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYK 221 (331) T ss_pred hccchhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEE Confidence 532111 00000 00 Q ss_pred ----------------------eccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhh-hcccccccccc Q lcl|NC_010583. 312 ----------------------TEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLL-EDEEWQDVAQV 368 (458) Q Consensus 312 ----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~d~~~~~~~~~ 368 (458) .......+.....+.++.+...++........|+||...+..|++. .+...-..... T Consensus 222 w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~ 301 (331) T protein:vir:10 222 WDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTM 301 (331) T ss_pred eeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeee Confidence 0000011112335556677777777777788999999999999875 33221111111 Q ss_pred ccccccccccCCeeecccceecccccccccCCceEE Q lcl|NC_010583. 369 GNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAV 404 (458) Q Consensus 369 ~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~ 404 (458) ..... -..-.++|+||..++.+....+- ++ T Consensus 302 ~~~~g---~~~t~~~gipir~~dai~~tE~~---Vv 331 (331) T protein:vir:10 302 EEIAG---KKVVAFDGIPCRRTDALLLTEAR---VV 331 (331) T ss_pred eecCC---cceeEECCeeEEEeeeeecCccc---cC Confidence 11110 11235889999988887643221 11 No 156 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.59 E-value=4.2e-09 Score=66.49 Aligned_cols=295 Identities=12% Similarity=0.030 Sum_probs=137.1 Q ss_pred HHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccc-hhHHHHHHHHHHhccchhhhcceeeecc-CceEEEEecCC Q lcl|NC_010583. 137 LVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYE-TIFSTRIIRDLQKELVVGALFDELPMSS-KILTMLVEPEA 214 (458) Q Consensus 137 ~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip-~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~ 214 (458) ....+.+.+ .....++.-..+. +.|.++|...+...+.++.+.++..+.+ ...++|+. +. T Consensus 1 Ms~~n~~t~-----------------~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~i-G~ 62 (402) T protein:vir:97 1 MSTPNTLTN-----------------VAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GE 62 (402) T ss_pred CCCcccccc-----------------cccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEE-ee Confidence 000000000 0000111112333 7888999999988999999887766554 45667766 33 Q ss_pred Ccccccccccccccccccccccccceeeeeehhhee-eeehhhHHHHhccHHH-HHHHHHHHHHHHHHHHHHHHHhc--- Q lcl|NC_010583. 215 GRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLA-AKSFITDETEEDAIFS-LLPLLRKRLIEAHAVSIEEAFMS--- 289 (458) Q Consensus 215 ~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~-~~~~is~ell~ds~~~-~~~~i~~~la~~~~~~~d~~~l~--- 289 (458) .++.+..-|+. .++ ..+..++..+....+- .-..|.+----++.++ +.+.+..++++++++..|+.++. T Consensus 63 ~~a~y~~~G~~-ldg-----~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~ 136 (402) T protein:vir:97 63 TELQVLAPGQS-PNA-----TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQML 136 (402) T ss_pred eEEeeeccccc-cCC-----CCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33343333332 122 2344455544444431 1111211111134466 78999999999999999997752 Q ss_pred cCC---C----CccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccce--eEechhHHHHHHhhhcc Q lcl|NC_010583. 290 GNG---T----GQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSK--LVLIVSMDAYYDLLEDE 360 (458) Q Consensus 290 G~g---~----~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~l~~~~d~ 360 (458) -.+ + ..|.+.-... ....+.+..............+..+...+.....+... .++.|..+..|.+-.+- T Consensus 137 ~aa~a~t~~~~~~~~~~~~g~--s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl 214 (402) T protein:vir:97 137 LGGIANTKAERNKPRVKGHGF--SINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRI 214 (402) T ss_pred HhhccccccccccCccccccc--ccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccc Confidence 111 1 1111111100 01111111111222333344455555555555544443 45567666655442111 Q ss_pred ccccccccccccccccccCCeeecccceeccccccccc------------CCceEEEEE----------eceEEEEecce Q lcl|NC_010583. 361 EWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAA------------SAEFAVIVY----------KDNFVMPRQRA 418 (458) Q Consensus 361 ~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~------------~~~~~~~~~----------~~~~~i~~~~~ 418 (458) -.+. +.....+....|....+.|.||+.|+++|..+. |..+.+-++ .+.+....-.. T Consensus 215 ~n~d-~~~~~~g~~~~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~ 293 (402) T protein:vir:97 215 VDKT-YTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIE 293 (402) T ss_pred cchh-hccccCCccccceeEEEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeec Confidence 1110 111122334556667899999999999996421 110111111 11222222222 Q ss_pred eEEeecccccCCceEEEEEEeeccEEecccceEEEEeec------C Q lcl|NC_010583. 419 VTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAA------A 458 (458) Q Consensus 419 ~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aa------a 458 (458) ++.+....-.+-...+.++.-+|..+.+|++..++++.- + T Consensus 294 vT~~~~~d~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~t~~~~ 339 (402) T protein:vir:97 294 VTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDA 339 (402) T ss_pred cccchhhchhHHHHHHHHHHHhCCcccCccceEEEEEecccccccC Confidence 222221111111222344456788999999888774432 2 No 157 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.56 E-value=1e-08 Score=64.33 Aligned_cols=290 Identities=11% Similarity=0.045 Sum_probs=138.9 Q ss_pred HHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccC-ceEEEEecCCC Q lcl|NC_010583. 137 LVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSK-ILTMLVEPEAG 215 (458) Q Consensus 137 ~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~p~~~~~~ 215 (458) ....+.+.+ . ..+++...-. +.=+.|.++|...+...+.++.+..+.++.++ ...+|+. +.. T Consensus 1 Ms~~n~~t~--------------p-~~~gsg~~~a-L~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G~s 63 (400) T protein:vir:10 1 MSTPNNLTN--------------V-AVSASGEVDS-LLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GET 63 (400) T ss_pred CCCCccccc--------------c-ccccccchhh-hHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eee Confidence 000000000 0 0000001111 22367888899999999999999888776654 5566665 444 Q ss_pred cccccccccccccccccccccccceeeeeehhhee-eeehhhHHHHhccHHH-HHHHHHHHHHHHHHHHHHHHHhc---- Q lcl|NC_010583. 216 RATWVDASKFGTDETVGDEVKGQLTEISFKTYKLA-AKSFITDETEEDAIFS-LLPLLRKRLIEAHAVSIEEAFMS---- 289 (458) Q Consensus 216 ~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~-~~~~is~ell~ds~~~-~~~~i~~~la~~~~~~~d~~~l~---- 289 (458) .+.+..-|+.. + .+.+..++..++...+= +-..|.+----++.++ +.+.+..++++++++..|+.++. T Consensus 64 ~a~y~~pG~~l-d-----g~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~ 137 (400) T protein:vir:10 64 ELQVLAPGQSP-A-----ATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLL 137 (400) T ss_pred EEeeecCCCCc-C-----CCCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55555444432 2 22344555555544431 1222222112234567 89999999999999999997752 Q ss_pred cC--CCC----ccccccccccccccceeecccc--chhhHHHHHHHHHHHhhhhhhhcccce--eEechhHHHHHHhhhc Q lcl|NC_010583. 290 GN--GTG----QPKGLLKLAADDGAKVVTEAKA--DGSVLVTAKTISKLRRKLGRHGLKLSK--LVLIVSMDAYYDLLED 359 (458) Q Consensus 290 G~--g~~----~p~Gi~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~l~~~~d 359 (458) +. -+. .|.|+..... ..+.+... ..........+..+...+.....+... .++.|..+..| .+ T Consensus 138 a~~a~t~~~~~~~~g~~~g~s----~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~L---l~ 210 (400) T protein:vir:10 138 GGIANTQAKRTNPRVKGHGFS----VNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVL---RD 210 (400) T ss_pred hcccccccccccCCccccccc----eeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHH---Hh Confidence 21 011 2222221111 11111111 111122223344444445444444333 34444444433 22 Q ss_pred cccccccccc----cccccccccCCeeecccceeccccccccc------------CCceEE----------EEEeceEEE Q lcl|NC_010583. 360 EEWQDVAQVG----NDAVKLQGQVGRIYGLPVVVSEYFPAKAA------------SAEFAV----------IVYKDNFVM 413 (458) Q Consensus 360 ~~~~~~~~~~----~~~~~~~~~~~~l~G~pv~~~~~~~~~~~------------~~~~~~----------~~~~~~~~i 413 (458) .+ .++... ..+....+....+.|.||+.|+.+|.... +..+.+ ++-.+.+.. T Consensus 211 ~d--kLvnrdf~~s~~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~t 288 (400) T protein:vir:10 211 AD--RIVDKSYTISQSGATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLV 288 (400) T ss_pred CC--cccchhccccCCCccccceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEE Confidence 22 111111 12333445556799999999999986321 111111 111122333 Q ss_pred EecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 414 PRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 414 ~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..-.+++.+....-.+-...+.++.-+|..+.+|++.++++++-- T Consensus 289 vk~~~lt~~~~~d~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~ 333 (400) T protein:vir:10 289 GRSIDVIGDIFYEKKEKTYYIDTFMSEGAIPDRWEAVSVVTTKRQ 333 (400) T ss_pred EEeeccccccccchhhHHHHHHHHHHhCCcccchhheEEEEecCC Confidence 333333322211122233344555678999999999999988765 No 158 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.53 E-value=4.7e-08 Score=60.74 Aligned_cols=299 Identities=10% Similarity=-0.012 Sum_probs=155.8 Q ss_pred hhccchhHHHHHHHHhhh----hhcccccccCccccc---hhHHHHHHHHHHhccchhhhccee---eeccCceEEEEec Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKA----VNGSSSVSMSSEAYE---TIFSTRIIRDLQKELVVGALFDEL---PMSSKILTMLVEP 212 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a----~~~~~~~~~g~~~ip---~~~~~~ii~~~~~~~~l~~~~~~~---~~~~~~~~~p~~~ 212 (458) +.+....+.+........ +......+. |+... +.+.+.|++...+....+++..+. +.......+.+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~da~~~~-g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~ 79 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYLIQAGVKQDAAATM-GIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFD 79 (319) T ss_pred CCCcchhHHhhHHHHHHHhhccchhhhhhhh-hhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeec Confidence 111111111111111110 111111122 22322 234456777776666666665443 2222334455555 Q ss_pred CCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc---HHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_010583. 213 EAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA---IFSLLPLLRKRLIEAHAVSIEEAFMS 289 (458) Q Consensus 213 ~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~~~~~~i~~~la~~~~~~~d~~~l~ 289 (458) ..+.+.|++.+. ...+..+..+.......+.++..+.+|..=|..+ ..++..--....+.++.+.+|+-+|+ T Consensus 80 ~~G~a~~~~d~~-----~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~ 154 (319) T protein:vir:10 80 KVGTAQIIADYT-----DDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFK 154 (319) T ss_pred cccceeeecCcc-----ccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEe Confidence 445555554332 2345556667777777777888888877655544 45688888889999999999999999 Q ss_pred cCCCCccccccccccccccceeecccc-chhhHHHHHHHHHHHhhhhh---hhcccceeEechhHHHHHHhhhccccccc Q lcl|NC_010583. 290 GNGTGQPKGLLKLAADDGAKVVTEAKA-DGSVLVTAKTISKLRRKLGR---HGLKLSKLVLIVSMDAYYDLLEDEEWQDV 365 (458) Q Consensus 290 G~g~~~p~Gi~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~l~~~~d~~~~~~ 365 (458) |+......|+++..+....+....... +.+....++++..++..+.. ....+...+++|+.+.+|.......|... T Consensus 155 G~~~~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~t~ 234 (319) T protein:vir:10 155 GSAPHKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMPETTMSY 234 (319) T ss_pred ecccccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccCCCCeeH Confidence 998778899998765443322222211 12334555667666666542 34466678999999988875444333322 Q ss_pred cccccccccccccCCeeecccceecccccccccCCceEEEEE--eceEEEEecceeEEeecccccCCceEEEEEEeec-c Q lcl|NC_010583. 366 AQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVY--KDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVN-L 442 (458) Q Consensus 366 ~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~--~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d-~ 442 (458) ...-. ....+.+|.+.|.... .++.+....++.. .+.+.+...+.++...-. ...-...+....|++ . T Consensus 235 l~~lk----~~~~~l~I~~~pel~~----ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~e-~~~l~~~~~~~~r~~Gv 305 (319) T protein:vir:10 235 LDYFK----SQNSGIEIDSIAELED----IDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPAQ-PKDLHFKVPCTSKCTGL 305 (319) T ss_pred HHHHH----HhcCCceEEEeeeecc----cCCCcceEEEEEecCCceEEEecCcceeeeeee-ecCceEEEeeeeeeEEE Confidence 22111 1112233444444332 1222233233322 223333333444433211 112223444566665 5 Q ss_pred EEecccceEEEEee Q lcl|NC_010583. 443 QRYFENGVVSGAYA 456 (458) Q Consensus 443 ~~~~~~afv~l~~a 456 (458) .+.+|.||++++-= T Consensus 306 ~i~~P~ai~~~dGI 319 (319) T protein:vir:10 306 TIYRPMTIVLITGV 319 (319) T ss_pred EEEccceeEeeecC Confidence 88999999987766 No 159 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.49 E-value=5.2e-08 Score=60.51 Aligned_cols=277 Identities=14% Similarity=0.050 Sum_probs=151.1 Q ss_pred ccccccCccccc--hhHHHHHHHHHHhccchhhhcce---eeeccCceEEEEecCCCccccccccccccccccccccccc Q lcl|NC_010583. 164 SSSVSMSSEAYE--TIFSTRIIRDLQKELVVGALFDE---LPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQ 238 (458) Q Consensus 164 ~~~~~~g~~~ip--~~~~~~ii~~~~~~~~l~~~~~~---~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~ 238 (458) -.+.+.|.+++. +.+.+.|++.+.+....+++..+ ++.......+...+....+.|++.++ ...+..+.. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~-----~dip~~~~~ 75 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGA-----DDLPLVDVD 75 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcc-----ccccccccc Confidence 111122222222 23445677777777777766543 33333344555554444455544332 234555666 Q ss_pred ceeeeeehhheeeeehhhHHHHhcc---HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccc Q lcl|NC_010583. 239 LTEISFKTYKLAAKSFITDETEEDA---IFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAK 315 (458) Q Consensus 239 f~~v~~~~~k~~~~~~is~ell~ds---~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~ 315 (458) ++........++.-+.++..=|+.+ ..++..--....+.++++.+|+.+|+|+..-...|+++..+.....++.... T Consensus 76 ~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~ 155 (301) T protein:vir:80 76 MVRKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGV 155 (301) T ss_pred ceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCccc Confidence 7777777777887777777766554 4668888889999999999999999999877789999876543332221111 Q ss_pred ---c---chhhHHHHHHHHHHHhhhhh---hhcccceeEechhHHHHHHhhh--ccccccccccccccccccccCCeeec Q lcl|NC_010583. 316 ---A---DGSVLVTAKTISKLRRKLGR---HGLKLSKLVLIVSMDAYYDLLE--DEEWQDVAQVGNDAVKLQGQVGRIYG 384 (458) Q Consensus 316 ---~---~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~l~~~~--d~~~~~~~~~~~~~~~~~~~~~~l~G 384 (458) . ..+....+.++..++.++.. ....+...+++|+.+..|..-. +..+..+.+.-.. .....+|.. T Consensus 156 ~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~----~~~~~~I~~ 231 (301) T protein:vir:80 156 GNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQD----NAWFSAIVR 231 (301) T ss_pred ccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHH----HcCcceEEE Confidence 1 11233445666666666533 3335567889999999887432 3334333221110 111123444 Q ss_pred ccceecccccccccCCceEEEEE---eceEEEEecceeEEeecccccCC-ceEEEEEEee-ccEEecccceEEEEee Q lcl|NC_010583. 385 LPVVVSEYFPAKAASAEFAVIVY---KDNFVMPRQRAVTVERERQAGKQ-RDAYYVTQRV-NLQRYFENGVVSGAYA 456 (458) Q Consensus 385 ~pv~~~~~~~~~~~~~~~~~~~~---~~~~~i~~~~~~~i~~~~~~~~~-~~~~~~~~r~-d~~~~~~~afv~l~~a 456 (458) .|-... .+.++....+.+ .+.+.+...+.++...-. .++ ...+-.+.|+ |..+.+|.||++++-= T Consensus 232 ~p~L~~-----~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~e--~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 232 VPDLAG-----MGTAGSDSFAVIHDSNETAELIIPMDITRHPEE--YSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred cceecc-----CCCCcccEEEEEecCCcEEEEEecCceeeecce--ecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 444332 222222222222 223334434444432211 122 2233445666 5689999999997766 No 160 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=98.42 E-value=1.5e-08 Score=63.49 Aligned_cols=235 Identities=11% Similarity=0.069 Sum_probs=133.4 Q ss_pred hhccchhHHHHHHHHhhhhhcccc-cccCccccchhHHHHHHHHHHhccchhhhcceeeeccCc-eEEEEecCCCccccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSS-VSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKI-LTMLVEPEAGRATWV 220 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~-~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v 220 (458) +..-. +..-+ ......+-|......||+.+...++|++...+....++. ....+.++-|.++|- T Consensus 1 m~~~~--------------~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~fR 66 (335) T protein:vir:73 1 MALIG--------------QTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPVWR 66 (335) T ss_pred CCcCC--------------CCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCchhh Confidence 10000 00000 000111234445566999999999999988887543322 234455666776664 Q ss_pred ccccccccccccccccccceeeeeehhheeeeehhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccc Q lcl|NC_010583. 221 DASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAI--FSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKG 298 (458) Q Consensus 221 ~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~G 298 (458) . -+...+.+..++.+++.+.+-+++.+.|.+.+.+... -++...-.....+++...+...||+|+.+..|.+ T Consensus 67 ~------lN~g~~~s~~tt~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~ 140 (335) T protein:vir:73 67 R------YNQGVQPTKTQTVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEA 140 (335) T ss_pred h------cCCccccccceEEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhh Confidence 4 3455667889999999999999999999998777653 3356666666899999999999999987766665 Q ss_pred cccccccc--------------------ccc--ee--------------------------------------------- Q lcl|NC_010583. 299 LLKLAADD--------------------GAK--VV--------------------------------------------- 311 (458) Q Consensus 299 i~~~~~~~--------------------~~~--~~--------------------------------------------- 311 (458) +.....-. +.. +. T Consensus 141 FdGL~kR~~~~st~~a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~ 220 (335) T protein:vir:73 141 FMGLAPRFNTLSTSKAASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRD 220 (335) T ss_pred ccchhhhhcCccccccCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEe Confidence 55321100 000 00 Q ss_pred ----------------------eccc---cchhhHHHHHHHHHHHh--hhhhhhcccceeEechhHHHHHHhhh-ccccc Q lcl|NC_010583. 312 ----------------------TEAK---ADGSVLVTAKTISKLRR--KLGRHGLKLSKLVLIVSMDAYYDLLE-DEEWQ 363 (458) Q Consensus 312 ----------------------~~~~---~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~l~~~~-d~~~~ 363 (458) ..+. .........+.++..+. .++........|+||...+..|++.. +.... T Consensus 221 ~~~w~~Gl~i~d~r~vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~ 300 (335) T protein:vir:73 221 EFKWDIGLSVRDWRSISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNV 300 (335) T ss_pred eeeeeeeeEEeCcccEEEEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCce Confidence 0000 00011122223333332 33443445578999999999999753 33222 Q ss_pred cccccccccccccccC-CeeecccceecccccccccCCceEEEE Q lcl|NC_010583. 364 DVAQVGNDAVKLQGQV-GRIYGLPVVVSEYFPAKAASAEFAVIV 406 (458) Q Consensus 364 ~~~~~~~~~~~~~~~~-~~l~G~pv~~~~~~~~~~~~~~~~~~~ 406 (458) .+-.... .+.. -.++|+||..++.+....+ .+.. T Consensus 301 ~l~~~~~-----~g~~~t~~~gipir~~Dail~tE~----~v~~ 335 (335) T protein:vir:73 301 NLTIEEY-----GGKKIVSFLGIPIRRVDAILNTES----AVTA 335 (335) T ss_pred eeeeecc-----CCceeEEECCeEEEEEeeeecCcc----cccC Confidence 2211111 1112 3478999998888764322 1111 No 161 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.33 E-value=7.5e-08 Score=59.65 Aligned_cols=296 Identities=11% Similarity=0.022 Sum_probs=136.7 Q ss_pred HHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccC-ceEEEEecCCC Q lcl|NC_010583. 137 LVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSK-ILTMLVEPEAG 215 (458) Q Consensus 137 ~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~p~~~~~~ 215 (458) ....+...+. ..+++... -.+.=+.|.++|...+...+.++.+..+.++.++ ...+|+. +.. T Consensus 1 Ms~~n~~t~~---------------~~~~sg~~-~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G~s 63 (401) T protein:vir:70 1 MSTPNNLTNV---------------AVSASGEV-DSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GET 63 (401) T ss_pred CCCCcccccc---------------ccccccch-hHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eee Confidence 0000000000 00000000 1122367888888899999999999887776654 5566666 444 Q ss_pred cccccccccccccccccccccccceeeeeehhhee-eeehhhHHHHhccHHH-HHHHHHHHHHHHHHHHHHHHHhc---- Q lcl|NC_010583. 216 RATWVDASKFGTDETVGDEVKGQLTEISFKTYKLA-AKSFITDETEEDAIFS-LLPLLRKRLIEAHAVSIEEAFMS---- 289 (458) Q Consensus 216 ~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~-~~~~is~ell~ds~~~-~~~~i~~~la~~~~~~~d~~~l~---- 289 (458) .+.+..-|+.. .-+.+..+++.+....+- .-..|.+----++.++ +.+.+..++++++++..|+.++. T Consensus 64 ~~~~~~pG~~l------d~~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~ 137 (401) T protein:vir:70 64 ELQVLAPGQSP------AATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMML 137 (401) T ss_pred EeeeecCCCCc------CCCCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44444434321 223445555555544431 1112221111234566 78999999999999999986631 Q ss_pred -cC----C-CCccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhccccee--EechhHHHHHHhhhccc Q lcl|NC_010583. 290 -GN----G-TGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKL--VLIVSMDAYYDLLEDEE 361 (458) Q Consensus 290 -G~----g-~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~l~~~~d~~ 361 (458) |= + +..|.|.-....... .................+.++...+.....+...+ ++.|..+..|.. .|.. T Consensus 138 aa~ana~~~~~~p~~~~~G~~i~v--~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~-~d~L 214 (401) T protein:vir:70 138 GGIANTQAKRTNPRVKGHGFSINV--EVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRD-ADRI 214 (401) T ss_pred hccccccccccCCCcCCCceEEec--cccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHh-cCcc Confidence 10 1 112322111100000 00111111122223444555666666655554433 333444433322 1111 Q ss_pred cccccccccccccccccCCeeecccceecccccccccC------------CceEEEEE----------eceEEEEeccee Q lcl|NC_010583. 362 WQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAAS------------AEFAVIVY----------KDNFVMPRQRAV 419 (458) Q Consensus 362 ~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~------------~~~~~~~~----------~~~~~i~~~~~~ 419 (458) =.--+.....+....|....+.|.||+.++.+|..+.. ..+.+-++ .+.+....-.++ T Consensus 215 ~nrd~~~s~~g~~~~G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~l 294 (401) T protein:vir:70 215 VDKTYTISQSGATIQGFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDV 294 (401) T ss_pred cchhhccccCCccccceEEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeecc Confidence 00011111123345555677999999999999963211 11111111 122222222333 Q ss_pred EEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 420 TVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 420 ~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +-+......+-...+.++.-+|..+.+|++.++++.+-- T Consensus 295 t~~~~~d~r~~~~~id~~~a~g~g~~RPeaa~vv~~k~~ 333 (401) T protein:vir:70 295 TGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRN 333 (401) T ss_pred ccchhhhhhhhHHHHHHHHHhCCcccchhheEEEeecCc Confidence 222211112222333455568899999999888765544 No 162 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.30 E-value=2.7e-07 Score=56.63 Aligned_cols=296 Identities=14% Similarity=0.027 Sum_probs=150.4 Q ss_pred chhhhhhH-HHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccc--hhHHHHHHHHHHhccchhhhcceee-- Q lcl|NC_010583. 126 TQDAFEDE-VEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYE--TIFSTRIIRDLQKELVVGALFDELP-- 200 (458) Q Consensus 126 ~~~~~~~~-~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip--~~~~~~ii~~~~~~~~l~~~~~~~~-- 200 (458) ..-++..+ ....... .........++|.+++. +.+.+.|++...+....+++..+.. T Consensus 1 ~~~~~~~~~~~~~~~~------------------~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~ 62 (314) T protein:vir:10 1 MAIKFDAEQAKITTHL------------------EQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEI 62 (314) T ss_pred CccchHHHHHHHHHHH------------------HhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCC Confidence 00000000 0000000 00001111222334443 2344556666555555444443321 Q ss_pred -eccCceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc---HHHHHHHHHHHHH Q lcl|NC_010583. 201 -MSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA---IFSLLPLLRKRLI 276 (458) Q Consensus 201 -~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~~~~~~i~~~la 276 (458) -......+...+..+.+.|++..+ ...+..+..++......+.++..+.+|..=|..+ ..++..--....+ T Consensus 63 ~~~~et~~~~~~e~~G~a~~~~d~~-----~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~ 137 (314) T protein:vir:10 63 PGHAKYFEYPEFDGVGIAQIIADYS-----DDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAF 137 (314) T ss_pred CCceeEEEeeeeccccceeeeCCcc-----cccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHH Confidence 112234555554444455544321 2345666677777778888888888876655544 4568888888999 Q ss_pred HHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhh---hhcccceeEechhHHHH Q lcl|NC_010583. 277 EAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGR---HGLKLSKLVLIVSMDAY 353 (458) Q Consensus 277 ~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~ 353 (458) .++.+.+|+.+++|+......|+++..+....+. ..+.. +....++++..++..+.. ....+...++.|..+.+ T Consensus 138 ~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~--~~~Wa-T~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~ 214 (314) T protein:vir:10 138 EAHDNLLDKLVWSGSAPHGIVSVFDQPNINNVVA--TPNWS-VPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRV 214 (314) T ss_pred HHHHHhhceEEEeecccccceeEeecCCCccccC--CCCcc-cHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHh Confidence 9999999999999987777889998765433221 11222 233456777777777654 33455568889888877 Q ss_pred HHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEe--ceEEEEecceeEEeecccccCCc Q lcl|NC_010583. 354 YDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYK--DNFVMPRQRAVTVERERQAGKQR 431 (458) Q Consensus 354 l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~--~~~~i~~~~~~~i~~~~~~~~~~ 431 (458) |...-+..+..+...-. .+..+-+|.+.|-..+ .++.+....++... +.+.+...+.++...-.+ ..-. T Consensus 215 L~~~~~~~~~tvl~~l~----~n~~~l~I~~~~el~~----ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~e~-~~~~ 285 (314) T protein:vir:10 215 MQGLVPQTNLSYGELFT----RNNPGLTIRFLQFLDN----YDGAGGKAALAFEKSPLNMSIEIPEVTNVLPAQP-KDLH 285 (314) T ss_pred hcccccCCCccHHHHHH----HhCCCcEEEEcccccc----cCCCcceEEEEEecCCcEEEEecCccceeeccee-cCce Confidence 75443333332221111 1112233444444332 12222222222211 122232333333322111 1222 Q ss_pred eEEEEEEee-ccEEecccceEE---EEee Q lcl|NC_010583. 432 DAYYVTQRV-NLQRYFENGVVS---GAYA 456 (458) Q Consensus 432 ~~~~~~~r~-d~~~~~~~afv~---l~~a 456 (458) ..+....|+ |..+.+|.||++ +++| T Consensus 286 ~~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 286 FRYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred EEEcceeeeEEEEEECcceeEeeeeeecC Confidence 334456676 468999999995 6777 No 163 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=98.27 E-value=3.4e-07 Score=56.05 Aligned_cols=310 Identities=12% Similarity=0.012 Sum_probs=153.0 Q ss_pred hhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhh-hhhcccccccCccccc--hhHHHHHHHHHHhcc Q lcl|NC_010583. 114 FVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIK-AVNGSSSVSMSSEAYE--TIFSTRIIRDLQKEL 190 (458) Q Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~-a~~~~~~~~~g~~~ip--~~~~~~ii~~~~~~~ 190 (458) -.+.. ..+.+...+. .....+.... ..........+.+++. +.+.+.|++...+.. T Consensus 1 ~~~~~--------------------~~~~~~~d~~-~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l 59 (329) T protein:vir:79 1 MRGNI--------------------MSKEMKYDEF-EANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAG 59 (329) T ss_pred Cccch--------------------hhhhhccchh-hhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhccc Confidence 00000 0011111100 0000000000 1111111112233332 234566777777766 Q ss_pred chhhhcce---eeeccCceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc---H Q lcl|NC_010583. 191 VVGALFDE---LPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA---I 264 (458) Q Consensus 191 ~l~~~~~~---~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~ 264 (458) ..+++..+ .+-......+.+.+..+.+.|++-.+ +..+..+..+.......+.++..+.++..=|..+ . T Consensus 60 ~~~~~i~i~~~~~~~~~~~t~~~~~~~G~a~~~~d~~-----~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g 134 (329) T protein:vir:79 60 SALRVFPVTSELSDTDKTFEYQTFDKVGHAKIIADYT-----DDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTG 134 (329) T ss_pred chhhhcccccCCCCceeEEEeeeeecceeeeeecCcc-----cccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhC Confidence 66666543 22233344555555555555554321 2344555566666666677777777776655543 4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc---hhhHHHHHHHHHHHhhhhhh---h Q lcl|NC_010583. 265 FSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD---GSVLVTAKTISKLRRKLGRH---G 338 (458) Q Consensus 265 ~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~---~ 338 (458) .++..--....+.++.+.+|+-+|+|++..+..|+++.........+...... .+....++++..++..+... . T Consensus 135 ~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~ 214 (329) T protein:vir:79 135 KSLSTRKANAAQNAHDQLVNHLVFKGSKPHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQ 214 (329) T ss_pred CChHHHHHHHHHHHHHHhhccEEEeecccccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCce Confidence 56888888899999999999999999987778899987655432222211111 13344556677766666542 2 Q ss_pred cccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEe--ceEEEEec Q lcl|NC_010583. 339 LKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYK--DNFVMPRQ 416 (458) Q Consensus 339 ~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~--~~~~i~~~ 416 (458) ..+...+++|..+.+|.......|......-. ....+-+|.+.|-..+ ++..+.+..++... +.+.+... T Consensus 215 ~~p~~L~Lpp~~~~~L~~~~~~~~~tvl~~lk----~~~~~l~I~~~~el~~----ag~~g~~~~v~y~~~~~~~~~~vp 286 (329) T protein:vir:79 215 HRANMILIPPSMRKVLMVRMPETTMSYLDYFK----QQNGGITIESISELED----IDGAGTKAALVYEKDPMNMSIEIP 286 (329) T ss_pred ecccEEEecHHHHHHhhcccCCCCccHHHHHH----HhCCCcEEEEcccccc----cCCCCceEEEEEecCCceEEEecC Confidence 34567888999888886544333433322111 1112223444444322 12223333333222 22333333 Q ss_pred ceeEEeecccccCCceEEEEEEeec-cEEecccceEEEEeecC Q lcl|NC_010583. 417 RAVTVERERQAGKQRDAYYVTQRVN-LQRYFENGVVSGAYAAA 458 (458) Q Consensus 417 ~~~~i~~~~~~~~~~~~~~~~~r~d-~~~~~~~afv~l~~aaa 458 (458) +.++...-.+ ..-...+....|++ ..+.+|.||+.++-=.- T Consensus 287 ~~~~~l~~q~-~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~ 328 (329) T protein:vir:79 287 EAFNMLTAQP-KDLHFKVPCTSKCTGLTIYRPLTLVLIKGLVV 328 (329) T ss_pred cceeeeecee-cCceEEEceeeeEEEEEEECcceeeeeeeeee Confidence 4444332111 11223344556664 58889999887543333 No 164 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=98.23 E-value=4.8e-07 Score=55.24 Aligned_cols=266 Identities=17% Similarity=0.117 Sum_probs=117.0 Q ss_pred ccCccccchhHHHHHHHHHHhccchhhhccee---ee---ccCceEEEEecCCCccccccccccccccccccccccccee Q lcl|NC_010583. 168 SMSSEAYETIFSTRIIRDLQKELVVGALFDEL---PM---SSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTE 241 (458) Q Consensus 168 ~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~---~~---~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~ 241 (458) .+...++|+.++..+++.++....+.++++.- .. .+...++|+.... ...+..... ...+....-.+.+-+. T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~-~~~~~~~~~-~~~~~~~~~~~~~~~~ 78 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPS-RGHTRKLRG-AGAERNLTVSDFTEDS 78 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccc-cceeeeccc-cccCCcccccccccce Confidence 33345899999999999999999888877431 11 1334677764432 222211100 0111112222333344 Q ss_pred eeeeh--hheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchh Q lcl|NC_010583. 242 ISFKT--YKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGS 319 (458) Q Consensus 242 v~~~~--~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 319 (458) +++.. ++..+ +.|+++-......++...+.+...++++.++|..++.- +..+......... ... T Consensus 79 ~~~~id~~k~~~-~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~---------~~~a~~~~~~~~~----~~~ 144 (392) T protein:vir:99 79 FPVTLTDVAYHL-GVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDM---------IVGAPYEAAGAVH----EVA 144 (392) T ss_pred EEEEEeeeeecc-eeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH---------Hhccccccccccc----ccC Confidence 44444 33332 45666655556678888888889999999999887731 1111100000001 111 Q ss_pred hHHHHHHHHHHHhhhhhhhccccee-EechhHHHHHHhhhccccccccccccc--cccccccCCeeecccceeccccccc Q lcl|NC_010583. 320 VLVTAKTISKLRRKLGRHGLKLSKL-VLIVSMDAYYDLLEDEEWQDVAQVGND--AVKLQGQVGRIYGLPVVVSEYFPAK 396 (458) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~d~~~~~~~~~~~~--~~~~~~~~~~l~G~pv~~~~~~~~~ 396 (458) ....+..++++...|.....+..+| ++.|..+..|. ++..-......+.. .....|..+++.|.+|+.+..+|.. T Consensus 145 ~~~~~~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~--~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~ 222 (392) T protein:vir:99 145 PDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQIL--NDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHG 222 (392) T ss_pred hhhhHHHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHh--cccceeecccccchhhhhhhcceeeeeeeeEEEeecccccc Confidence 2233456666666666655544445 44666555554 33211100001111 1123466688999999999988753 Q ss_pred ccCCceEEEEEeceEEEEecce-----------------e--EEee--cccccCCceEEEEEEeeccEEec---ccceEE Q lcl|NC_010583. 397 AASAEFAVIVYKDNFVMPRQRA-----------------V--TVER--ERQAGKQRDAYYVTQRVNLQRYF---ENGVVS 452 (458) Q Consensus 397 ~~~~~~~~~~~~~~~~i~~~~~-----------------~--~i~~--~~~~~~~~~~~~~~~r~d~~~~~---~~afv~ 452 (458) .. +.+..+.+.+..... + +... +.....+...+.. ..+...+. ..+|.. T Consensus 223 t~-----~a~~~~a~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~--~~g~~~v~~~~~~~~~~ 295 (392) T protein:vir:99 223 DA-----YLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDT--YFGLKVVEDPNGVGFVR 295 (392) T ss_pred cc-----eeeeccccccccccccccccccceeEEecccceecceeecccceeeccccccce--eEEEEEEeeccccceee Confidence 21 111111111111100 0 0000 0001111110000 00000000 001100 Q ss_pred ---EE------------eecC Q lcl|NC_010583. 453 ---GA------------YAAA 458 (458) Q Consensus 453 ---l~------------~aaa 458 (458) ++ .+.. T Consensus 296 ~~~~~~~~~~v~v~~v~~~~~ 316 (392) T protein:vir:99 296 ARKIHLIPGSIEVAPEAGANA 316 (392) T ss_pred eeeeeeecceeeeeeeecccc Confidence 00 0000 No 165 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=98.02 E-value=3.1e-06 Score=50.78 Aligned_cols=279 Identities=11% Similarity=0.043 Sum_probs=122.4 Q ss_pred hhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcce-------- Q lcl|NC_010583. 127 QDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDE-------- 198 (458) Q Consensus 127 ~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~-------- 198 (458) +.. ..+. +.-...++|+.|..=+.+.....+.|.+-+=+ T Consensus 1 M~~----------------------------~~~~-----T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~ 47 (367) T protein:vir:80 1 MPD----------------------------FNNQ-----VRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQ 47 (367) T ss_pred Ccc----------------------------hhhh-----hhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHH Confidence 000 0000 00112355555544333333333332221111 Q ss_pred -eeeccCceEEEEecCC-Cc-ccccccccccccccccccccccceeeeeehhheee--eehhhHHHHhccHHHHHHHHHH Q lcl|NC_010583. 199 -LPMSSKILTMLVEPEA-GR-ATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAA--KSFITDETEEDAIFSLLPLLRK 273 (458) Q Consensus 199 -~~~~~~~~~~p~~~~~-~~-a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~--~~~is~ell~ds~~~~~~~i~~ 273 (458) ...++...++|.+..- .. ..+. +.....+.+ +..-.+..++-...+.-.+ ...++..+- .-+....|.+ T Consensus 48 ~~~~gG~~v~iPf~~~L~g~~~n~~-~d~~~~~~t--~~kittg~~~a~v~~r~kaw~~~Dla~~ls---G~dpm~~Ia~ 121 (367) T protein:vir:80 48 FLSAPGRLINIPFWRDLDSLEPNYG-SDNPNVEAP--IDGLGSGEMKTTKTWLNKAYGAMDLTAELA---GSNPMTRIRN 121 (367) T ss_pred HhhcCCCEEEeeeeccCCCCccccC-CCCCccccc--ccccccchheeeeehhcccchhhhHHHHhh---CchHHHHHHH Confidence 1234556677776432 11 1111 110000000 0111112222111222222 234454443 2356777888 Q ss_pred HHHHHHHHHHHHHHhc---c----CCCCc-------------cccccccccccccceeeccccchhhHHHHHHHHHHHhh Q lcl|NC_010583. 274 RLIEAHAVSIEEAFMS---G----NGTGQ-------------PKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRK 333 (458) Q Consensus 274 ~la~~~~~~~d~~~l~---G----~g~~~-------------p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (458) +++.--.+...+.+|. | ++++. +.+..+.. ..-.++.+.+....+....+.+.... T Consensus 122 qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~~~~~a~~~~~~~~~----~~Dis~~t~~~~~~~s~~~~~~A~~~ 197 (367) T protein:vir:80 122 RFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRVPAEVLGTAGDM----VIDISGQTNPADAVFNREAFVDAAFT 197 (367) T ss_pred HHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhhhccccccccccCce----eeeeeccCCCccceecHHHHHHHHHH Confidence 8876655555554442 1 11110 01111100 00011122223344566677777777 Q ss_pred hhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccc--cCCceEEEEEeceE Q lcl|NC_010583. 334 LGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKA--ASAEFAVIVYKDNF 411 (458) Q Consensus 334 ~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~--~~~~~~~~~~~~~~ 411 (458) +.........++||+.++..|.+++- -.++- . ......-++++|++|++++.||... +.+.+..+.|+... T Consensus 198 lGD~~~~l~~i~mHS~V~~~L~~~~l--i~~i~-~----sd~~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GA 270 (367) T protein:vir:80 198 MGDHVGSIAAIAVHSMVYKRMTNNDE--IEFIP-D----SKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA 270 (367) T ss_pred hccccccccEEEEchHHHHHHHhccc--ccccc-C----CCCccccceecceeEEEeCCCcccccCCCceEEEEEEecce Confidence 77777778899999999999887642 11111 1 1112345789999999999999643 34444445556543 Q ss_pred EE-Eec---ceeEEeeccccc--CCceEEEEEEeeccEEecccceEEEEeec-------------------C Q lcl|NC_010583. 412 VM-PRQ---RAVTVERERQAG--KQRDAYYVTQRVNLQRYFENGVVSGAYAA-------------------A 458 (458) Q Consensus 412 ~i-~~~---~~~~i~~~~~~~--~~~~~~~~~~r~d~~~~~~~afv~l~~aa-------------------a 458 (458) .- .+. ..+++.+|+... .++..+.-..| .++||..|.....+- + T Consensus 271 i~~~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~---~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt 339 (367) T protein:vir:80 271 FGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) T ss_pred eeecccCCccceecccchhhhcCCceEEEEeeee---EEeecceeeecccccccccccccccccccccCCCC Confidence 22 222 223445555542 23333333323 678888775533211 1 No 166 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=97.92 E-value=1.3e-06 Score=52.91 Aligned_cols=380 Identities=12% Similarity=0.077 Sum_probs=134.3 Q ss_pred HHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 12 LGLGDLAKSLEGLTAAQKAA---EAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTV 88 (458) Q Consensus 12 ~~~~~~~~~~~~l~~~~~~~---~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~ 88 (458) .+ + .+.+ +.+.... ....+++|...+......-+ |. +...+.+-.. +..+.-+ T Consensus 1 mr---i-S~~~--~~K~~l~EK~~~~a~~~E~~~~LKS~~~G~------ev-----------knaiedl~K~-~EL~~Tl 56 (400) T protein:vir:93 1 MR---I-SKRN--MNKPDLIEKQNRLAELKENNVSLKSQISGF------EV-----------KNAIEDLPKV-QELEKTL 56 (400) T ss_pred Cc---c-cccc--cccchHHHHHHHHhhhhhhhhhhhhhhhcc------hh-----------hhhhhhchhH-HHHHHhH Confidence 00 0 0000 0000000 00001111111100000000 00 0000000000 0000111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccc Q lcl|NC_010583. 89 EKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVS 168 (458) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~ 168 (458) .+..-++...+++....+....++. ...+-......-..|..-+......+..+.+-.++-+-.+.+.+ T Consensus 57 S~~~iEI~~~en~LNa~~E~~KGK~-----------kMt~~i~sq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiT 125 (400) T protein:vir:93 57 SENSIEIIKIENELNAQEEKPKGKD-----------KMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTIT 125 (400) T ss_pred hhcchhhhhhhhhhhhhhhhhhhhH-----------HHHHHHhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCccee Confidence 1111111111111111100000000 00011111122223334444444433222222222222333344 Q ss_pred cCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccceeeeeehhh Q lcl|NC_010583. 169 MSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYK 248 (458) Q Consensus 169 ~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k 248 (458) .-...+|..+.-.|-..+..+.++.+...+...+. +.+..+-.++. +....-.+..+++...+|..-++.+. T Consensus 126 D~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~----~~V~~s~~s~~---~Aq~HkdGqTK~eqa~~~~~~Tl~~~- 197 (400) T protein:vir:93 126 DTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA----LLVSRSFDSAN---EAQVHKDGQTKTEQAATLTIDTLEPV- 197 (400) T ss_pred ccchhccHHHHHHHHHhhhccCcceeeeeeccchh----hhHHhhhhhhh---hhhhhccCCccccceeeeeeechhHH- Confidence 55678899998889999999999988655543332 22222222221 23323344455555555554444443 Q ss_pred eeeeehhh-HHHHhccHH---HHHHHHHHHHHHHHH-HHHHHHHhccCCCCccccccccccccccceeeccccchhhHHH Q lcl|NC_010583. 249 LAAKSFIT-DETEEDAIF---SLLPLLRKRLIEAHA-VSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVT 323 (458) Q Consensus 249 ~~~~~~is-~ell~ds~~---~~~~~i~~~la~~~~-~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (458) .++...| -++.++... .+.+||..+|+.++. +..|.+++-|+|++....+-..+......-.++-...+-.... T Consensus 198 -~VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~~~Ttkaksagktpf 276 (400) T protein:vir:93 198 -MVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPF 276 (400) T ss_pred -HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCch Confidence 3333333 234444333 359999999999998 9999999999999865544443322211111100000000111 Q ss_pred HHHHHHHHhhhhhhhcccceeE-echhH-HHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCc Q lcl|NC_010583. 324 AKTISKLRRKLGRHGLKLSKLV-LIVSM-DAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAE 401 (458) Q Consensus 324 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~ 401 (458) .+.+.....-+.+... -.++ ..... .+.|+.++.+...--. ++-.-...+.+ ..|.. T Consensus 277 adaieeavdfvrptag--rrylivktedrkalldelrqatanahv--------------riknddaeias-----evgvd 335 (400) T protein:vir:93 277 ADAIEEAVDFVRPTAG--RRYLIVKTEDRKALLDELRQATANAHV--------------RIKNDDAEIAS-----EVGVD 335 (400) T ss_pred hHHHHHHHhhhccCCC--ceEEEEeccchHHHHHHHHhhccccce--------------Eeecchhhhhh-----hcCcc Confidence 1222222222222211 1222 22223 3455656554322111 01100000000 01111 Q ss_pred eEEEEEeceE---EEEecceeEEe------eccc-ccCCceEEEEEEeeccEEecccceEEEEee Q lcl|NC_010583. 402 FAVIVYKDNF---VMPRQRAVTVE------RERQ-AGKQRDAYYVTQRVNLQRYFENGVVSGAYA 456 (458) Q Consensus 402 ~~~~~~~~~~---~i~~~~~~~i~------~~~~-~~~~~~~~~~~~r~d~~~~~~~afv~l~~a 456 (458) .+++.-++.. .+...+...|+ .|.| ...|.-.+.++..-.|.|---+|-++++++ T Consensus 336 eiivytgskalkptvlvdqkyhidmqdltkvdafewktnsnmilvetltsghvetynagavitvs 400 (400) T protein:vir:93 336 EIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred eeeeeeccccccceeeeccccccchhhhhhhhhheeccCCceEEEeecccCcceeeccceeEeeC Confidence 1222222210 11111111111 1111 223333334444445555555565666666 No 167 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=97.87 E-value=2.4e-06 Score=51.43 Aligned_cols=261 Identities=12% Similarity=-0.004 Sum_probs=128.6 Q ss_pred hhcccccccCccccchh--HHHHHHHHHHhccchhhhcceeeeccC-ceEEEEecCCCcccccccccccccccccccccc Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETI--FSTRIIRDLQKELVVGALFDELPMSSK-ILTMLVEPEAGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~--~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) +.....+....+.-|+. +.+++-..+.....++...+.+|+..| ..++|.+.....+.-|+||..+| -+.. T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Ip------lskv 74 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIP------LSKV 74 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCcccc------hhhh Confidence 11111111111111222 223333333333334444477787754 66889888777777777776554 4444 Q ss_pred cce---eeeeehhheeeeehhhHHHHhccH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeec Q lcl|NC_010583. 238 QLT---EISFKTYKLAAKSFITDETEEDAI-FSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTE 313 (458) Q Consensus 238 ~f~---~v~~~~~k~~~~~~is~ell~ds~-~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~ 313 (458) +.. ..+++.+|++.- +|.|.++.|. -+-...-.++|..+++.+++..++.--.+ + .....+ T Consensus 75 t~~~~~t~t~kikK~rK~--tTdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lkt---------a----t~t~tg 139 (295) T protein:vir:99 75 TRTKDKDYTVKWFKKRRA--TTAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKT---------K----PTKVKG 139 (295) T ss_pred eeeeeeeeEEEeeeeccc--ccHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhcc---------C----ceeeeh Confidence 443 356666776654 4999986543 34677788999999999999999863111 0 001111 Q ss_pred cccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeeccc-ceeccc Q lcl|NC_010583. 314 AKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLP-VVVSEY 392 (458) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~p-v~~~~~ 392 (458) + .....+..+.+.+......+..+.+.++||.....+++-..-+++--...+.+ .- --++|.. |++|.. T Consensus 140 ---~-~lq~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~-----~L-~nfLG~q~II~S~k 209 (295) T protein:vir:99 140 ---V-GLQKALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMT-----LL-KNFLGMQNVIVMPS 209 (295) T ss_pred ---h-hHHHHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhhhhhhh-----hh-hhhhccceEEEccc Confidence 1 11122223333333333445556788899998887765444333221001111 10 1288987 889999 Q ss_pred cccccc---CCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEe-------------ecc---EEecccceEEE Q lcl|NC_010583. 393 FPAKAA---SAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQR-------------VNL---QRYFENGVVSG 453 (458) Q Consensus 393 ~~~~~~---~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r-------------~d~---~~~~~~afv~l 453 (458) +|.+.. -...+.+.+.+ ...+++. .-..+..|.+.+.+..+ +.+ -+-.++++++. T Consensus 210 v~~G~~~aT~~~Ni~~ay~~----~~~g~l~--~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~ 283 (295) T protein:vir:99 210 VPEGKIYSTAVENLVFASLN----VKGGDLG--GLFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEA 283 (295) T ss_pred CCCceEEEeeccceEEEEec----CCchhhh--hhhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEE Confidence 985421 11112221110 1111111 00011112222222111 122 23455789999 Q ss_pred EeecC Q lcl|NC_010583. 454 AYAAA 458 (458) Q Consensus 454 ~~aaa 458 (458) +..++ T Consensus 284 tI~~~ 288 (295) T protein:vir:99 284 TIEAA 288 (295) T ss_pred EEecC Confidence 99777 No 168 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=97.81 E-value=2.9e-06 Score=50.96 Aligned_cols=276 Identities=11% Similarity=-0.003 Sum_probs=136.9 Q ss_pred cccCccccchh--HHHHHHHHHHhccchhhhcc---eeeeccCceEEEEecCCCccc--ccccccccccccccccccccc Q lcl|NC_010583. 167 VSMSSEAYETI--FSTRIIRDLQKELVVGALFD---ELPMSSKILTMLVEPEAGRAT--WVDASKFGTDETVGDEVKGQL 239 (458) Q Consensus 167 ~~~g~~~ip~~--~~~~ii~~~~~~~~l~~~~~---~~~~~~~~~~~p~~~~~~~a~--~v~e~~~~~e~~~~~~~~~~f 239 (458) .++..+++.+. +.+.|.+...+....+++.. ..+.......+...+..+.+. |++-.+ ...+..+..+ T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a-----~dip~vd~~~ 75 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGT-----STLDQVEVGF 75 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcC-----Cccceeeccc Confidence 22223333311 12233333333333333322 222222344555554444444 654332 3456667777 Q ss_pred eeeeeehhheeeeehhhHHHHhccH---HHHHHHHHHHHHHHHHHHHHHHHhccCCC-Cccccccccccccccceeec-c Q lcl|NC_010583. 240 TEISFKTYKLAAKSFITDETEEDAI---FSLLPLLRKRLIEAHAVSIEEAFMSGNGT-GQPKGLLKLAADDGAKVVTE-A 314 (458) Q Consensus 240 ~~v~~~~~k~~~~~~is~ell~ds~---~~~~~~i~~~la~~~~~~~d~~~l~G~g~-~~p~Gi~~~~~~~~~~~~~~-~ 314 (458) +.-....+.++.-+.+|.+=|..+. .++.+-=.....+++...+|+..++|+-. ....|+++.........+.+ . T Consensus 76 ~~~~~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a 155 (304) T protein:vir:52 76 TPTRSYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQ 155 (304) T ss_pred ceeEEEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCcc Confidence 7777777788877777765555432 45677666777789999999999999743 34789998766543322211 1 Q ss_pred cc---chhhHHHHHHHHHHHhhhhh---hhcccceeEechhHHHHHHhhh-ccccccccccccccccccccCCeeecccc Q lcl|NC_010583. 315 KA---DGSVLVTAKTISKLRRKLGR---HGLKLSKLVLIVSMDAYYDLLE-DEEWQDVAQVGNDAVKLQGQVGRIYGLPV 387 (458) Q Consensus 315 ~~---~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~l~~~~-d~~~~~~~~~~~~~~~~~~~~~~l~G~pv 387 (458) .. +.+......++..++.++.. ....+..+++.|+.+.+|.... ...+..+++.-....+ -..|.|+ T Consensus 156 ~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~------~~~g~~l 229 (304) T protein:vir:52 156 NTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLS------AAAGRQV 229 (304) T ss_pred CCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhcc------cccCCcc Confidence 11 11333455566666555533 2234556788888888886532 2222212111100000 0124443 Q ss_pred eec----ccccccccCCceEEEEEec--eEEEEecceeEEeecccccCCce--EEEEEEeecc-EEecccceEEEEe Q lcl|NC_010583. 388 VVS----EYFPAKAASAEFAVIVYKD--NFVMPRQRAVTVERERQAGKQRD--AYYVTQRVNL-QRYFENGVVSGAY 455 (458) Q Consensus 388 ~~~----~~~~~~~~~~~~~~~~~~~--~~~i~~~~~~~i~~~~~~~~~~~--~~~~~~r~d~-~~~~~~afv~l~~ 455 (458) .+- .....+.++++..+++..+ .+.+...+.++...- ..++.. .+=.+.|+++ .+..|.+|+++.. T Consensus 230 ~I~~v~~~~~~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~~--q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 230 AIKALPSNYGTRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLDA--QPKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred eEEEecccccccCCCCceEEEEEecChhheEEecCccccccch--hhcCCceEEecceeeeeeEEEEccceeeeecC Confidence 321 1122233444444444332 222222222222221 123432 3336777765 8899999999999 No 169 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=97.76 E-value=4e-06 Score=50.16 Aligned_cols=287 Identities=9% Similarity=-0.002 Sum_probs=139.0 Q ss_pred hhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCc-ccccccccccccccc-ccccc Q lcl|NC_010583. 159 KAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGR-ATWVDASKFGTDETV-GDEVK 236 (458) Q Consensus 159 ~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~-a~~v~e~~~~~e~~~-~~~~~ 236 (458) .+....+-.+-.....-..+.+.|...-....|+..+......++....+....-... .....||...+.... ..... T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~~ 80 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFTTML 80 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccCCEEe Confidence 1222111111111233345667677777778888887655555554444443333222 222235543322221 11111 Q ss_pred ccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHH---HHHHHHHHHHHHhccCC-----C----Ccccccccccc Q lcl|NC_010583. 237 GQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRL---IEAHAVSIEEAFMSGNG-----T----GQPKGLLKLAA 304 (458) Q Consensus 237 ~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l---a~~~~~~~d~~~l~G~g-----~----~~p~Gi~~~~~ 304 (458) ..+.+|- ...+.||..+..-+.....+.+..++ ...+.+.+|..+|+|.- + .+..||+.... T Consensus 81 ~N~tQIf------~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~ 154 (317) T protein:vir:88 81 NNYCQIS------DETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYK 154 (317) T ss_pred ccEEEEE------EeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhc Confidence 2222222 23344555444433333333333333 45678899999999851 1 24567776543 Q ss_pred ccccceee------c----cccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccc--c Q lcl|NC_010583. 305 DDGAKVVT------E----AKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGND--A 372 (458) Q Consensus 305 ~~~~~~~~------~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~--~ 372 (458) ..+..... . .+.......+.+++.+++..+=.....+..+++++.....+.++....+..+...... . T Consensus 155 t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~ 234 (317) T protein:vir:88 155 TNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDASDNRI 234 (317) T ss_pred cCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEcccCeEE Confidence 32211000 0 0111111234444555554444455566667889988888888743232222111000 0 Q ss_pred ccccccCCeeec-ccceecccccccccCCceEEEEEeceEEEEecceeEEeeccccc-CCceEEEEEEeeccEEecccce Q lcl|NC_010583. 373 VKLQGQVGRIYG-LPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERERQAG-KQRDAYYVTQRVNLQRYFENGV 450 (458) Q Consensus 373 ~~~~~~~~~l~G-~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~-~~~~~~~~~~r~d~~~~~~~af 450 (458) ........+=+| ++++.+.++|+ ..+++.+.+.+.+..-..+. .++.+. -+......+..+++++.+|+|. T Consensus 235 g~~v~~~~tdfG~v~ii~~r~lp~-----~~~~~~D~~~~~l~~Lr~~~--~e~laKtGd~~k~~i~~E~tLe~~N~~a~ 307 (317) T protein:vir:88 235 AQTVDVYESDFGKYTIRANRWFHE-----NTLFVFDPKMHSLCYLRPFF--QHELAKTGDSEKRQLLVEYTFRVNNEKSG 307 (317) T ss_pred EEEEEEEEeCCeEEEEEeCCCCCC-----CeEEEEcccccceeecccce--eeccCCCcccceeEEEEEEEEEEcCccce Confidence 000000011123 46677777773 34566666654443323222 233332 3556777888999999999999 Q ss_pred EEEEeecC Q lcl|NC_010583. 451 VSGAYAAA 458 (458) Q Consensus 451 v~l~~aaa 458 (458) +++.--+| T Consensus 308 a~i~~l~~ 315 (317) T protein:vir:88 308 ALIRDVVA 315 (317) T ss_pred eEEEEecc Confidence 99888888 No 170 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=97.54 E-value=2.5e-05 Score=45.86 Aligned_cols=270 Identities=13% Similarity=0.017 Sum_probs=123.8 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceE--EEEecCCCccccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILT--MLVEPEAGRATWV 220 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~--~p~~~~~~~a~~v 220 (458) +....-.+++ ..+.+.+. +...--.|.+++-..+....-++...+..|+..|..- ||.+.....+.-| T Consensus 1 ~~~~~~~~e~-------nlt~~~dl---~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y~gda~dV 70 (296) T protein:vir:98 1 MVTSRTYPEE-------NLIKSTDL---KYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNV 70 (296) T ss_pred CCCccccCcC-------CCcchhhh---hhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceeeeeccccc Confidence 0000000000 00000000 1122223444444444444444445577888877543 4556766777777 Q ss_pred ccccccccccccccccccce---eeeeehhheeeeehhhHHHHhccH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcc Q lcl|NC_010583. 221 DASKFGTDETVGDEVKGQLT---EISFKTYKLAAKSFITDETEEDAI-FSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQP 296 (458) Q Consensus 221 ~e~~~~~e~~~~~~~~~~f~---~v~~~~~k~~~~~~is~ell~ds~-~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p 296 (458) +||+.+| -+..+.. ..+++.+|++.-+ |.|.++.|. -+-...-.++|..+++.+++..++.- T Consensus 71 aEGe~Ip------lskvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~------ 136 (296) T protein:vir:98 71 PEGEVIP------LSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTA------ 136 (296) T ss_pred cCCcccc------hhhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHH------ Confidence 7776554 4444443 3566667776554 999986543 34677788999999999999999852 Q ss_pred ccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccccccccccccccccc Q lcl|NC_010583. 297 KGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQ 376 (458) Q Consensus 297 ~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~ 376 (458) ++.+. . + .. ...+..-...+..+-++...+........+.++||.....+.+ +++- ..+..+ + T Consensus 137 ---LktaT--~-t-~~-~t~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg--~a~i--t~qt~f-G---- 199 (296) T protein:vir:98 137 ---LKTGT--G-T-QD-ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIA--KAGI--TTQTAF-G---- 199 (296) T ss_pred ---Hhccc--c-e-ee-echhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhc--CCcc--chhhee-c---- Confidence 11000 0 0 00 0111111111122222223333322345677889988766542 3211 011111 1 Q ss_pred ccCCe-eecccceecccccccccCCceEEEEE-eceEEEEe-cceeEEeecccccCCceEEEEEEe-------------e Q lcl|NC_010583. 377 GQVGR-IYGLPVVVSEYFPAKAASAEFAVIVY-KDNFVMPR-QRAVTVERERQAGKQRDAYYVTQR-------------V 440 (458) Q Consensus 377 ~~~~~-l~G~pv~~~~~~~~~~~~~~~~~~~~-~~~~~i~~-~~~~~i~~~~~~~~~~~~~~~~~r-------------~ 440 (458) +.... ++|.-|+.|..+|.+.. +....+ -..+.+.- .+++.-... +..|.+.+.+..+ + T Consensus 200 ~tyl~nfLG~~II~S~kV~~G~~---~~T~~~Ni~~ay~~~~~~~l~~~f~--~~~d~tglIGv~h~~~~~~~t~eT~~~ 274 (296) T protein:vir:98 200 LTYLVDFTGTVIISTNDVTKGEI---WATVPENIIFAYINPNNSELAKEFN--LYGDPTGYIGMNHFQENTTLTIQTLLV 274 (296) T ss_pred hhhhhhccccEEEEcCcCCCceE---EEeeecceEEEeecccccchhhhhc--cccccccceEEEeccccceeeehhHhH Confidence 11111 78888999999985321 111111 00111111 111111101 1112222221111 1 Q ss_pred cc---EEecccceEEEEeecC Q lcl|NC_010583. 441 NL---QRYFENGVVSGAYAAA 458 (458) Q Consensus 441 d~---~~~~~~afv~l~~aaa 458 (458) .+ -+-.++++++.+..+| T Consensus 275 ~~~~lfpE~~dgiv~~tI~~~ 295 (296) T protein:vir:98 275 SGMLMYPERIDGIVKVTLTPG 295 (296) T ss_pred hHHHhcccccceEEEEEecCC Confidence 22 2345578999999999 No 171 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=97.51 E-value=5.3e-05 Score=44.04 Aligned_cols=259 Identities=10% Similarity=-0.047 Sum_probs=124.2 Q ss_pred cccccCccccchhHHHHHHHHHHhccchhhhcceee-----eccCceEEEEecCCCcccccccccccccccccccccccc Q lcl|NC_010583. 165 SSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELP-----MSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQL 239 (458) Q Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~-----~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f 239 (458) -.+-.+.++-|+.++.++++.+++..++.++++.-. -.+...++|+...... ..+... .-.+.+= T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~v----~dg~~~------~~~~~te 70 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVKS----ASGRTL------VKQPMVD 70 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCceee----cccCCc------ccccccc Confidence 111223456799999999999999999888875421 1234667776432211 112111 1112222 Q ss_pred ee--eeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 240 TE--ISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 240 ~~--v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) +. ++++.+|... +.|+.+=+..+..++...+.+...++++..+|..++. ++..+.....+. +. T Consensus 71 ~~v~l~id~~k~~~-~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~---------l~~~a~~~~gt~--gt--- 135 (418) T protein:vir:10 71 QTIPFKIAYQEHVG-LEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLAL---------TLKKAFHSSGTP--GV--- 135 (418) T ss_pred ceEEEEEecccccc-eeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHH---------HHhhcccccccC--Cc--- Confidence 33 4444444433 4555555555667888888888999999999998773 121111111110 01 Q ss_pred hhhHHHHHHHHHHHhhhhhhhccc-c-ee-EechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccccc Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKL-S-KL-VLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFP 394 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~-~-~~-~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~ 394 (458) ....+.++.++...+.....+. . .| +++|..+..|. ++.... ..+.+.......|..+++.|..|+.++.+| T Consensus 136 --~~~~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~--~~~~~~-~~~~~~~~~lr~G~IG~i~GF~V~~S~nip 210 (418) T protein:vir:10 136 --RPGAFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLS--DEVTKL-FKESMVEQAYKMGYRGNVAAYEVYESQNLP 210 (418) T ss_pred --CcchHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHh--hhcccc-ccccccchhhheeeeeeeeceEEEEecCCC Confidence 1123556677777777766653 3 55 46776665543 332221 112222223456777899999999999999 Q ss_pred ccccCC--c-eEEEEEe-ceEEEEeccee-----EEeecccccCCceEEEEEE---eecc-EEecccceEEEEeecC Q lcl|NC_010583. 395 AKAASA--E-FAVIVYK-DNFVMPRQRAV-----TVERERQAGKQRDAYYVTQ---RVNL-QRYFENGVVSGAYAAA 458 (458) Q Consensus 395 ~~~~~~--~-~~~~~~~-~~~~i~~~~~~-----~i~~~~~~~~~~~~~~~~~---r~d~-~~~~~~afv~l~~aaa 458 (458) ...++. . ..+.+-. ....+....+. .+...++| .|-+.. .+.. ..-++.-|++...+.+ T Consensus 211 ~~tag~~~~t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~-----ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~ 282 (418) T protein:vir:10 211 KHTVGDHGGTPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVI-----TFGGVFGVNPQNYETTGLLQEFVVLEDVDT 282 (418) T ss_pred cccccccccceeeecccccceeEEEeecceeeccceeeccEE-----EECceeecccccccccccceEEEEEeeccc Confidence 644432 1 1111111 11111100000 01111111 110000 0000 0012233433322110 No 172 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=97.50 E-value=9e-06 Score=48.24 Aligned_cols=375 Identities=13% Similarity=0.093 Sum_probs=134.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 12 LGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDL-VSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEK 90 (458) Q Consensus 12 ~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~ 90 (458) .+.-.+.++...| .++.|..-....+.... +..+. |.-...++....+..+ T Consensus 1 mnkpdliekqnrl----------aelkennvslksqisgfevknai-edl~K~~ELe~TlSe~----------------- 52 (393) T protein:vir:16 1 MNKPDLIEKQNRL----------AELKENNVSLKSQISGFEVKNAI-EDLPKVQELEKTLSEN----------------- 52 (393) T ss_pred CCCcchhhhhhhh----------hhhhhcccchhhhccchhhhhhh-hhchhHHHHHHhHhhc----------------- Confidence 1111111111111 11111111000000000 00000 0001111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccC Q lcl|NC_010583. 91 QQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMS 170 (458) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g 170 (458) .-++...+++....+....++. ...+-......-..|..-+......+..+.+-.++-+-.+.+.+.- T Consensus 53 -~iEI~k~en~LN~~eE~~KGK~-----------kMt~~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~ 120 (393) T protein:vir:16 53 -SIEIIKIENELNAQEEKPKGKD-----------KMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDT 120 (393) T ss_pred -chhhhhhhhhhhhhhhcchhhH-----------HHHHHHhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCcceecc Confidence 1111111111111000000000 0000111111222233333333333322222222222223334445 Q ss_pred ccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccceeeeeehhhee Q lcl|NC_010583. 171 SEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLA 250 (458) Q Consensus 171 ~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~ 250 (458) ...+|..+.-.|-..+..+.++.+...+...+. +.+..+-.++. +....-.+..+++...+|..-++.+. + T Consensus 121 ~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~----~~V~~s~~s~~---eAq~HkdGqTK~eqa~~~~~~Tl~~~--~ 191 (393) T protein:vir:16 121 TFQLPRKLVESINTALLNTNPVFKVFHVTNVGA----LLVSRSFDSAN---EAQVHKDGQTKTEQAATLTIDTLEPV--M 191 (393) T ss_pred chhccHHHHHHHHHhhhccCcceeeeeeccchh----hhHHhhhhhhh---hhhhhccCCccccceeeeeeechhHH--H Confidence 678899998889999999999988655543332 22222222221 22223344445444455544444443 4 Q ss_pred eeehhh-HHHHhccHH---HHHHHHHHHHHHHHH-HHHHHHHhccCCCCccccccccccccccceeeccccchhhHHHHH Q lcl|NC_010583. 251 AKSFIT-DETEEDAIF---SLLPLLRKRLIEAHA-VSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAK 325 (458) Q Consensus 251 ~~~~is-~ell~ds~~---~~~~~i~~~la~~~~-~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (458) ++...| -++.++... .+..||..+|+.++. +..|.+++-|+|++....+-..+......-.++-...+-.....+ T Consensus 192 VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagktpfad 271 (393) T protein:vir:16 192 VYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFAD 271 (393) T ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhH Confidence 444333 234444333 359999999999998 999999999999986544443332221111110000000011112 Q ss_pred HHHHHHhhhhhhhcccceeE-echhH-HHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceE Q lcl|NC_010583. 326 TISKLRRKLGRHGLKLSKLV-LIVSM-DAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFA 403 (458) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~ 403 (458) .+.....-+.+... -.++ ..... .+.|+.++.+...-.. ++-.-...+.+ ..|...+ T Consensus 272 aieeavdfvrptag--rrylivktedrkalldelrqatananv--------------riknddteias-----evgvdei 330 (393) T protein:vir:16 272 AIEEAVDFVRPTAG--RRYLIVKTEDRKALLDELRQATANANV--------------RIKNDDTEIAS-----EVGVDEI 330 (393) T ss_pred HHHHHHhhhccCCC--ceEEEEeccchHHHHHHHHhhhccCce--------------eeeccchhhhh-----hcCccee Confidence 22222222222211 1222 22233 3455556544321111 11110000000 0111112 Q ss_pred EEEEeceE---EEEecceeEEe------eccc-ccCCceEEEEEEeeccEEecccceEEEEee Q lcl|NC_010583. 404 VIVYKDNF---VMPRQRAVTVE------RERQ-AGKQRDAYYVTQRVNLQRYFENGVVSGAYA 456 (458) Q Consensus 404 ~~~~~~~~---~i~~~~~~~i~------~~~~-~~~~~~~~~~~~r~d~~~~~~~afv~l~~a 456 (458) ++.-++.. .+...+...|+ .|.| ...|.-.+.++..-.|.|---+|-++++++ T Consensus 331 ivytgskalkptvlvdqkyhidmqdltkvdafewktnsnmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 331 IVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred eeeeccccccceeeeccccccchhhhhhhhhheeccCCceEEEeecccCcceeeccceeEeeC Confidence 22222210 11111111111 1111 223333334444445555555566666666 No 173 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=97.48 E-value=5.8e-05 Score=43.81 Aligned_cols=270 Identities=11% Similarity=0.030 Sum_probs=115.7 Q ss_pred hhhhcccccccCccccch--hHHHHHHHHHHhccchhhhcce---------eeeccCceEEEEecC-CCc---ccccccc Q lcl|NC_010583. 159 KAVNGSSSVSMSSEAYET--IFSTRIIRDLQKELVVGALFDE---------LPMSSKILTMLVEPE-AGR---ATWVDAS 223 (458) Q Consensus 159 ~a~~~~~~~~~g~~~ip~--~~~~~ii~~~~~~~~l~~~~~~---------~~~~~~~~~~p~~~~-~~~---a~~v~e~ 223 (458) .+ .+......+|+ .|.+=+.+...+.+.+.+-+=+ ...++...++|.+.. ... .+|.. . T Consensus 1 Ma-----~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~d-t 74 (349) T protein:vir:94 1 MA-----ITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSND-V 74 (349) T ss_pred CC-----ceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCC-C Confidence 11 11222346666 3444333333333333332111 223455667776643 121 12211 0 Q ss_pred cccccccccccccccceeeeeehhheeee--ehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccc Q lcl|NC_010583. 224 KFGTDETVGDEVKGQLTEISFKTYKLAAK--SFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLK 301 (458) Q Consensus 224 ~~~~e~~~~~~~~~~f~~v~~~~~k~~~~--~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~ 301 (458) .. ....+..-.++.++-...+.-.++ -.++..+-- -+..+.|.++++.-..+...+.+|. ..+|+++ T Consensus 75 ~~---~~~t~~kit~~~~~a~~~~r~kaw~~~Dla~~lsG---~dpm~~Ia~~va~yW~r~~q~~Lia-----~L~Gvf~ 143 (349) T protein:vir:94 75 YQ---DIATPRAIQTGEMMARVAYLNEGFGQADLTVELTS---QNPLQSVASRLDNFWQRQAQRRLIA-----TALGLYN 143 (349) T ss_pred cc---cccccccccccceeeeeeeeccccchhHHHHHhhC---chHHHHHHHHHHHHHhhHHHHHHHH-----HHHhhhc Confidence 00 000001111222222222222222 244555432 2567778888887777776665553 1123333 Q ss_pred ccccccc------c-eeeccccchhhHHHHHHHHHHHhhhhhh-----hcccceeEechhHHHHHHhhhccccccccccc Q lcl|NC_010583. 302 LAADDGA------K-VVTEAKADGSVLVTAKTISKLRRKLGRH-----GLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVG 369 (458) Q Consensus 302 ~~~~~~~------~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~ 369 (458) .....+. . +....+.+ .+....+++....+... ......++||+.++..|.+++-=. ++ + T Consensus 144 ~~~~~~~~~~~~~~~~~d~~~~a---~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~--~i-~-- 215 (349) T protein:vir:94 144 DNVSATDAYHEQNDMVVDVSATS---GFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID--FI-R-- 215 (349) T ss_pred ccccccccccccCceeEEecccC---CCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhh--hc-c-- Confidence 2111000 0 00000111 11222333333332222 234567899999999987654311 11 1 Q ss_pred cccccccccCCeeecccceeccccccccc--CCceEEEEEeceE-EEEecc---eeEEeeccccc--CCceEEEEEEeec Q lcl|NC_010583. 370 NDAVKLQGQVGRIYGLPVVVSEYFPAKAA--SAEFAVIVYKDNF-VMPRQR---AVTVERERQAG--KQRDAYYVTQRVN 441 (458) Q Consensus 370 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~~--~~~~~~~~~~~~~-~i~~~~---~~~i~~~~~~~--~~~~~~~~~~r~d 441 (458) .......-++++|++|++++.||.... .+.+..+.|+... ...+.. .+++.+++... .++..+....+ T Consensus 216 --~s~~~~~i~ty~G~~VivDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~-- 291 (349) T protein:vir:94 216 --DAENNTMFATYQGYRVIVDDSMTVVGQDTSRKFISIIFGQGAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKT-- 291 (349) T ss_pred --CcccCcccceecCcEEEEeCCCccccCCCCceEEEEEeecceEEeecCCCCcceeeecccccCCcceeEEEEEeeE-- Confidence 111222346899999999999996433 2334445566533 333332 24445555432 34455544444 Q ss_pred cEEecccceEEEEeec----------C Q lcl|NC_010583. 442 LQRYFENGVVSGAYAA----------A 458 (458) Q Consensus 442 ~~~~~~~afv~l~~aa----------a 458 (458) .++||..|..-.... + T Consensus 292 -~~~hp~G~s~~~a~v~~~~~~~~~~s 317 (349) T protein:vir:94 292 -WLLHPFGYSFTSAVITGNGTETIARS 317 (349) T ss_pred -EEeeeeeeeecccccCCCccccccCC Confidence 366777765443211 1 No 174 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=97.33 E-value=1.1e-05 Score=47.67 Aligned_cols=298 Identities=10% Similarity=0.050 Sum_probs=126.6 Q ss_pred chhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCc Q lcl|NC_010583. 126 TQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKI 205 (458) Q Consensus 126 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 205 (458) ...-......-..|..-+......+..+.+-.++-+-.+.+.+.-...+|..+.-.|-..+..+.++.+...+...+.-. T Consensus 1 mtn~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~ 80 (318) T protein:vir:86 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 80 (318) T ss_pred CcchhhhhHHHHHHHHHHhccCCchhhhhhhhhhhhhcCceeeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhh Confidence 01111112222334444444444432222222222223333445567889999888999999999998865554433222 Q ss_pred eEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhh-HHHHhccHH---HHHHHHHHHHHHHHH- Q lcl|NC_010583. 206 LTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFIT-DETEEDAIF---SLLPLLRKRLIEAHA- 280 (458) Q Consensus 206 ~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is-~ell~ds~~---~~~~~i~~~la~~~~- 280 (458) .+...++. .|....-.+..+++...+|..-++.+. .++...| -++.++... .+.+||..+|+.++. T Consensus 81 V~~s~~s~-------AeAq~HkdGqTK~eqa~~~~~~Tl~~~--~VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vn 151 (318) T protein:vir:86 81 VSRSFDSS-------AEAQVHKDGQTKTEQAATLTIDTLEPV--MVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVN 151 (318) T ss_pred hhhhhhhh-------hhhhhhccCCccccceeeeeeechhHH--HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH Confidence 22111111 223333344455555555554444443 3333333 234444333 359999999999998 Q ss_pred HHHHHHHhccCCCCcccccccccccccccee---eccccchhhHHHHHHHHHHHhhhhhhhccccee-EechhHH-HHHH Q lcl|NC_010583. 281 VSIEEAFMSGNGTGQPKGLLKLAADDGAKVV---TEAKADGSVLVTAKTISKLRRKLGRHGLKLSKL-VLIVSMD-AYYD 355 (458) Q Consensus 281 ~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-~~l~ 355 (458) +..|.+++-|+|++....+-..+......-. ....+.. .....+.....-+.+.. +-.+ +.....+ +.|+ T Consensus 152 k~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagtt---pfanaieeavdfvrpta--grrylivkaedrkalld 226 (318) T protein:vir:86 152 KIVDLALVEGDGSNGFKSIDKEADVKKIKKITTKAKSAGTT---PFANAIEEAVDFVRPTA--GRRYLIVKAEDRKALLD 226 (318) T ss_pred HHHHhhheeecCCCCccchhhHHHHHHHHHHhhhhhccCCC---chhhHHHHHHhhhccCC--CceEEEEeecchHHHHH Confidence 9999999999998865444433322211100 0111111 11122222222222221 1122 2333333 4555 Q ss_pred hhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceE---EEEecceeEEe------eccc Q lcl|NC_010583. 356 LLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNF---VMPRQRAVTVE------RERQ 426 (458) Q Consensus 356 ~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~---~i~~~~~~~i~------~~~~ 426 (458) .++.+...--. ++-.-...+.+ ..|...++..-++.. .+...+...|+ .|.+ T Consensus 227 elrqatanahv--------------riknddteias-----evgvdeiivytgskalkptvlvdqkyhidmqdltkvdaf 287 (318) T protein:vir:86 227 ELRQATANAHV--------------RIKNDDTEIAS-----EVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAF 287 (318) T ss_pred HHHhhccccee--------------EEeccchhhhh-----hcCcceeeeeeccccccceeeeccceecchhhhhhhhcc Confidence 56544321111 01100000000 001111222222210 11111111111 1111 Q ss_pred -ccCCceEEEEEEeeccEEecccceEEEEee Q lcl|NC_010583. 427 -AGKQRDAYYVTQRVNLQRYFENGVVSGAYA 456 (458) Q Consensus 427 -~~~~~~~~~~~~r~d~~~~~~~afv~l~~a 456 (458) ...|.-.+.++..-.+.+---+|-++++++ T Consensus 288 ewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:86 288 EWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred eeccCCceEEEeecccCcceeecCceeEEeC Confidence 223333334444444555545555666666 No 175 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=97.20 E-value=0.00013 Score=41.81 Aligned_cols=276 Identities=9% Similarity=0.016 Sum_probs=107.3 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhhhc-------ceeeeccCceEEEEecCCCcccccccccccccccccc- Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALF-------DELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGD- 233 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~-------~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~- 233 (458) +.-++.- +-.+.+....++.+.+...+.+.+ ...+..+.....|.+..-..... |.....+.+..+ T Consensus 1 m~lsD~~----vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~--~~~~~~~~~~vt~ 74 (325) T protein:vir:95 1 MALSDLA----VYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLV--RRRNAYGSGTVAE 74 (325) T ss_pred Cchhhhh----hhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeecccccccccccc--ccccCCCCceecc Confidence 0000000 001111222222222221111111 11223344445665542111000 000000011111 Q ss_pred cccccceeeeeehhheeeeehhhHHHH---hccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccce Q lcl|NC_010583. 234 EVKGQLTEISFKTYKLAAKSFITDETE---EDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKV 310 (458) Q Consensus 234 ~~~~~f~~v~~~~~k~~~~~~is~ell---~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~ 310 (458) ..-.++.++......-.+++....+.+ .+..-.+...|.+.+++...+.+-+.++.+-. +.++... ..... T Consensus 75 ~kitt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~-----~a~~~~~-~~v~d 148 (325) T protein:vir:95 75 KVLKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVY-----SALSQVS-DVVYD 148 (325) T ss_pred ceeccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----Hhhcccc-cceee Confidence 111122333222222112111111111 12233344555555555544444333432211 0111000 00000 Q ss_pred eeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceec Q lcl|NC_010583. 311 VTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVS 390 (458) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~ 390 (458) ...........++...+.+....+.........|+||+.++..|.++.-.+...++... +.. ..++.+|++|+++ T Consensus 149 is~~~~~~~~~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~----g~~-~i~t~~G~~VIVd 223 (325) T protein:vir:95 149 ATANTDAADKLPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYG----TVN-VVRDPFGKLLVMT 223 (325) T ss_pred eecccCcccccccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhccccccccccC----Ccc-cccccCCcEEEEe Confidence 11111122233455677777777877777888999999999998875443332222111 111 1246889999999 Q ss_pred ccccccccCC--ceEEEEEece-EEEEecceeEEeeccccc--CCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 391 EYFPAKAASA--EFAVIVYKDN-FVMPRQRAVTVERERQAG--KQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 391 ~~~~~~~~~~--~~~~~~~~~~-~~i~~~~~~~i~~~~~~~--~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +.+|..+.+. ....++++.. +.+............... .....++... --++||..|.. +.+.. T Consensus 224 D~~p~~~~g~~~~ytty~lg~GAi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---tf~lhp~G~sw-~~s~~ 292 (325) T protein:vir:95 224 DSPNLFAAGTPNVYHILGLVPGGVLIGQNNDFDANEETKNGDENIIRTYQAEW---SYNIGVKGFAW-DKANG 292 (325) T ss_pred CCCCCCCccCceeEEEEEEecCeEEecCCCCccccccccCcccceeeeeeeee---eEEeecceeee-ecccc Confidence 9999755443 3333455543 334443333322211111 2222333211 14678888765 33322 No 176 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=97.15 E-value=8.1e-05 Score=43.03 Aligned_cols=315 Identities=10% Similarity=-0.023 Sum_probs=140.7 Q ss_pred hhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhh---hcccccccCccccc----hhHHHHHHHHHHhccchhh Q lcl|NC_010583. 122 ALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAV---NGSSSVSMSSEAYE----TIFSTRIIRDLQKELVVGA 194 (458) Q Consensus 122 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~---~~~~~~~~g~~~ip----~~~~~~ii~~~~~~~~l~~ 194 (458) .....+...-.+.++..+.-.................++. ...++. ....|| +.+.+.|++...+....+. T Consensus 1 ~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~--~~~~i~a~~~~~i~~~vy~~~~~~~~~~~ 78 (339) T protein:vir:94 1 MSINNDRTDIKQLEKVGIIFDGYSPKSISSEVSAYAMDAVNLTPTLQTT--ANAGIPAWMTTFVDRRVIDIQLAPMAAAK 78 (339) T ss_pred CceechHHHHHHHHhhceeeccchhhhcchhhHhhhccccccccccccc--cccchhhhhhhhhchhheeecccccchhh Confidence 0000000000000000000000000000000000011111 111111 223344 3333556666666666666 Q ss_pred hcceeeecc---CceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhc--cHHHHHH Q lcl|NC_010583. 195 LFDELPMSS---KILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEED--AIFSLLP 269 (458) Q Consensus 195 ~~~~~~~~~---~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~d--s~~~~~~ 269 (458) +..+.+... ....|++.+..+.+.+.+.++..| ....+.+|...++....++-.+. ..|+-.- ...++.+ T Consensus 79 l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~P----l~~~~v~~~~~~v~~~~~g~~y~-~~E~~~A~~~g~~l~~ 153 (339) T protein:vir:94 79 IFPEVKKGDWTTTYGVFIIAEPVGQVATYSDWSANG----MSKANVNFESRQNYRYQTWTEYG-DLEMATYGEAGIDYVA 153 (339) T ss_pred hcccccCCCCcccEEEEeeeecccceEEcccccCCC----cccccceeeEEeEEEEEEEEeec-HHHHHHHHhhCCChHH Confidence 665554442 356777777777776765544322 22334556666555554443332 2222221 2367888 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhc------ccce Q lcl|NC_010583. 270 LLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGL------KLSK 343 (458) Q Consensus 270 ~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~ 343 (458) --.....+++.+.+|+-.++|+...+..|+++.........+...=...+....++++..++..+..... .+.. T Consensus 154 ~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~ 233 (339) T protein:vir:94 154 RQEISASLVMAKFANSSYLLGVAGIANYGLMNDPSLPAPVAATVNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMV 233 (339) T ss_pred HHHHHHHHHHHHhhceEEeeeecccceEEEEeCCCccccccCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcE Confidence 8888899999999999999998777789999875443222111111122344455667766666644322 1235 Q ss_pred eEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEe-----ceEEEEecce Q lcl|NC_010583. 344 LVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYK-----DNFVMPRQRA 418 (458) Q Consensus 344 ~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~-----~~~~i~~~~~ 418 (458) .++.++.+.+|..- +..+..+...-... ..++.|.....+-+ +++....+... +...+...+. T Consensus 234 L~LP~~~~~~L~~~-n~~~~Tvl~~lk~n---------~pnl~i~~~~el~~--a~g~~~~~~~~~~~~~~~~~~~~p~~ 301 (339) T protein:vir:94 234 MALAPSALNNVNRT-NNFGLSAGAKIAQT---------YPNIQFVAVPEFDT--ASGRLVQLWVPEVNGQPTGEVAFAEK 301 (339) T ss_pred EEecHHHHHhcccC-CcCCccHHHHHHHh---------cCCcEEEEcccccc--CCCceEEEEEEeccCCcceEEEcchh Confidence 77788877777642 22222222111000 11222333222222 22222222111 1111221222 Q ss_pred eEEeecccccCCceEEEEEEee-ccEEecccceEEEEee Q lcl|NC_010583. 419 VTVERERQAGKQRDAYYVTQRV-NLQRYFENGVVSGAYA 456 (458) Q Consensus 419 ~~i~~~~~~~~~~~~~~~~~r~-d~~~~~~~afv~l~~a 456 (458) ++...- ....-...+-...|. |+.+..|.||+.++-= T Consensus 302 ~~~lpv-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 302 LRSHSI-ERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred hhcccc-EEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 221110 112233445566674 5588999999987666 No 177 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=97.14 E-value=0.00016 Score=41.46 Aligned_cols=270 Identities=11% Similarity=0.034 Sum_probs=115.5 Q ss_pred hhhhcccccccCccccch--hHHHHHHHHHHhccchhhhc---------ceeeeccCceEEEEecC-CCc---ccccccc Q lcl|NC_010583. 159 KAVNGSSSVSMSSEAYET--IFSTRIIRDLQKELVVGALF---------DELPMSSKILTMLVEPE-AGR---ATWVDAS 223 (458) Q Consensus 159 ~a~~~~~~~~~g~~~ip~--~~~~~ii~~~~~~~~l~~~~---------~~~~~~~~~~~~p~~~~-~~~---a~~v~e~ 223 (458) .+ .+......+|+ .|.+=+.+...+.+.+.+-+ .....++...++|.+.. ... .+|.. + T Consensus 1 Ma-----~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D-~ 74 (349) T protein:vir:78 1 MA-----ITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSND-V 74 (349) T ss_pred CC-----ceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCC-C Confidence 11 11222346676 34443333333333333211 11223455677787643 111 11211 0 Q ss_pred cccccccccccccccceeeeeehhheeee--ehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccc Q lcl|NC_010583. 224 KFGTDETVGDEVKGQLTEISFKTYKLAAK--SFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLK 301 (458) Q Consensus 224 ~~~~e~~~~~~~~~~f~~v~~~~~k~~~~--~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~ 301 (458) .. ....+..-.++.++-...+.-.++ -.++..+-- -+..+.|.++++.-..+...+.+|. ..+|++. T Consensus 75 ~~---~~~t~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG---~dpm~~Ia~~va~yW~r~~q~~Lia-----~L~Gvf~ 143 (349) T protein:vir:78 75 YQ---DIATPRAIQTGEMMARVAYLNEGFGQADLTVELTS---QNPLQSVASRLDNFWQRQAQRRLIA-----TALGLYN 143 (349) T ss_pred cc---cccccccccccceeeeeeeeccccchhHHHHHhhC---chHHHHHHHHHHHHHhhHHHHHHHH-----HHHHhhc Confidence 00 000011112233333222222232 244544432 2567778888887766666655542 1122332 Q ss_pred ccccccc------c-eeeccccchhhHHHHHHHHHHHhhhhhh-----hcccceeEechhHHHHHHhhhccccccccccc Q lcl|NC_010583. 302 LAADDGA------K-VVTEAKADGSVLVTAKTISKLRRKLGRH-----GLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVG 369 (458) Q Consensus 302 ~~~~~~~------~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~ 369 (458) .....+. . +....+.+ .++...+++....+... ......++||+.++..|.++.-= .++ + T Consensus 144 ~~~~a~~~~~~~~~~t~d~s~~a---~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li--~~i-~-- 215 (349) T protein:vir:78 144 DNVSATDAYHEQNDMVVDVSATL---GFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLI--DFI-R-- 215 (349) T ss_pred ccccccchhhhcccceeeecccc---CCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhh--hhc-c-- Confidence 1110000 0 00000000 11222333333222222 23456789999999988765421 111 1 Q ss_pred cccccccccCCeeecccceecccccccccC--CceEEEEEeceEE-EEecc---eeEEeeccccc--CCceEEEEEEeec Q lcl|NC_010583. 370 NDAVKLQGQVGRIYGLPVVVSEYFPAKAAS--AEFAVIVYKDNFV-MPRQR---AVTVERERQAG--KQRDAYYVTQRVN 441 (458) Q Consensus 370 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~--~~~~~~~~~~~~~-i~~~~---~~~i~~~~~~~--~~~~~~~~~~r~d 441 (458) .......-++++|++|++++.||..+.+ ..+..+.|+.... ..+.. .+.+.+++... .++..+....++ T Consensus 216 --~s~~~~~i~ty~G~~VivDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~- 292 (349) T protein:vir:78 216 --DAENNTMFATYQGYRVIVDDSMTVVGQGAQRKFISIIFGQGAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKTW- 292 (349) T ss_pred --CcccCcccceecCeEEEEeCCCccccCCCCceEEEEEeecceEEEccCCCccceeeecccccCCcceeEEEEEeeEE- Confidence 1112233468999999999999965433 3334455665433 33322 24444555432 345555544443 Q ss_pred cEEecccceEEEEeec----------C Q lcl|NC_010583. 442 LQRYFENGVVSGAYAA----------A 458 (458) Q Consensus 442 ~~~~~~~afv~l~~aa----------a 458 (458) ++||..|..-..+. + T Consensus 293 --~~hp~G~s~~~a~v~~~~~~~~~~s 317 (349) T protein:vir:78 293 --LLHPFGYRFTSAVITGNGTETIARS 317 (349) T ss_pred --EeeeeeeeeccccccCCccccccCC Confidence 66676665433211 1 No 178 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=97.08 E-value=2.8e-05 Score=45.57 Aligned_cols=188 Identities=10% Similarity=0.037 Sum_probs=91.5 Q ss_pred ceeeeeehhheeeeehhhHHHHh-----ccHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCCccccccccccccccc Q lcl|NC_010583. 239 LTEISFKTYKLAAKSFITDETEE-----DAIFSLLPLLRKRLIEAHAVSIEEAFMS----GNGTGQPKGLLKLAADDGAK 309 (458) Q Consensus 239 f~~v~~~~~k~~~~~~is~ell~-----ds~~~~~~~i~~~la~~~~~~~d~~~l~----G~g~~~p~Gi~~~~~~~~~~ 309 (458) .|. .-+|.-++. ++..++.+...++++++++...|+.++. +..+..|.+- ...+.. T Consensus 1 iD~-----------lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~----~~~g~~ 65 (221) T protein:vir:17 1 MDD-----------LLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTG----QDGGFS 65 (221) T ss_pred CCc-----------chhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccc----cccCcc Confidence 110 123444443 3668899999999999999999998863 3222222100 000111 Q ss_pred eeeccccchhhHHHHHHHHHHHhhhhhhhccc-ceeEe-chhHHHHHHhhhccc-cccccccccccccccc-cCCeeecc Q lcl|NC_010583. 310 VVTEAKADGSVLVTAKTISKLRRKLGRHGLKL-SKLVL-IVSMDAYYDLLEDEE-WQDVAQVGNDAVKLQG-QVGRIYGL 385 (458) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~l~~~~d~~-~~~~~~~~~~~~~~~~-~~~~l~G~ 385 (458) .....+........++.+.++...+.....+. ..|++ .|..+-.|-+..|.. ....+. +..+....+ ..+.+.|. T Consensus 66 ~~~~a~~t~~~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~-~s~g~~~~g~~i~~v~G~ 144 (221) T protein:vir:17 66 VNIGAGNTNNAQAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIG-NTQGDMNTGKGLYVNAGI 144 (221) T ss_pred eeccccccCCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecc-cccccccccceeeeecCc Confidence 11111122223344567777777777776653 44555 676655444322211 111111 111112222 34679999 Q ss_pred cceecccccccccCCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEE-eeccEEecccceEEEEeecC Q lcl|NC_010583. 386 PVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQ-RVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 386 pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~-r~d~~~~~~~afv~l~~aaa 458 (458) +|+.|+++|...... ... +...+.. . ..+...||+.. -.-+.++||+|...+|+-+- T Consensus 145 ~V~~SnnlP~~~gt~-~~~--~ag~~~~------~-------~~~~~~yr~~fs~~~glv~~~~Avgtvkl~~~ 202 (221) T protein:vir:17 145 RIYKSNVLASLYGTN-LVT--DPGDATT------S-------GENNGSYRPAITDRAGLVFHKEAADTVEVLLP 202 (221) T ss_pred EEEEeccCCcccccc-ccc--CCccccc------c-------ccccccccccccceEEEEEcchheeeeeeecC Confidence 999999999743322 111 1111100 0 00111111111 12257788888877777665 No 179 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=97.05 E-value=6.7e-05 Score=43.48 Aligned_cols=265 Identities=14% Similarity=0.033 Sum_probs=119.8 Q ss_pred hhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce----EEEEecCCCccc Q lcl|NC_010583. 143 MMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL----TMLVEPEAGRAT 218 (458) Q Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~----~~p~~~~~~~a~ 218 (458) +. .+ ........=+...--+|.+++-..+....-++...+..|+..|.. ++|.......++ T Consensus 1 M~------~e---------~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~ 65 (303) T protein:vir:10 1 MS------AE---------NNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNG 65 (303) T ss_pred CC------CC---------cCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccc Confidence 00 00 000000000112223444545444444444444557777776532 344444555566 Q ss_pred ccccccccccccccccccccc---eeeeeehhheeeeehhhHHHHhccH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCC Q lcl|NC_010583. 219 WVDASKFGTDETVGDEVKGQL---TEISFKTYKLAAKSFITDETEEDAI-FSLLPLLRKRLIEAHAVSIEEAFMSGNGTG 294 (458) Q Consensus 219 ~v~e~~~~~e~~~~~~~~~~f---~~v~~~~~k~~~~~~is~ell~ds~-~~~~~~i~~~la~~~~~~~d~~~l~G~g~~ 294 (458) -|+||..+| -+..+. ...+++.+|++--+ |.|.++.+. -+-...-.++|..+++.+++..|+.- T Consensus 66 dVaEGe~Ip------lskvt~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~---- 133 (303) T protein:vir:10 66 DVAEGDVIP------LTKVTREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFET---- 133 (303) T ss_pred cccCCcccc------hhhheeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHH---- Confidence 666665554 444443 34677778877644 999986443 34667788889999999999988852 Q ss_pred cccccccccccc-ccceeeccccchhhHHHHHHHHHHHhh---hhhhhcccceeEechhHHHHHHhhhcccccccccccc Q lcl|NC_010583. 295 QPKGLLKLAADD-GAKVVTEAKADGSVLVTAKTISKLRRK---LGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGN 370 (458) Q Consensus 295 ~p~Gi~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~ 370 (458) ++.+... ..+..+. ..... ....+...... ..... ...+.++||.....+.+-..-..+ ... T Consensus 134 -----lktaT~t~~~t~~t~--~s~~g--lq~Al~~~~~kl~~~~ed~-~~~V~FvNP~Daa~yl~~A~i~~~----~t~ 199 (303) T protein:vir:10 134 -----LKSAIENGKRTNKTK--LSAEN--LQGALSKGRANLSVLLDDE-ITPIAFVNPNDTAEYLANGFINST----GAQ 199 (303) T ss_pred -----Hhhccccccccccee--ecHHH--HHHHHHhhhhhcccccccc-ccEEEEEchHHHHHHhhcCCcchh----hhh Confidence 1111000 0000000 00001 11111111111 11222 345788999988766532211111 000 Q ss_pred ccccccccCCeeecccceeccccccccc---CCceEEEEEeceEEEEecceeEEeecccccCCceEEEEEEe-------- Q lcl|NC_010583. 371 DAVKLQGQVGRIYGLPVVVSEYFPAKAA---SAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQR-------- 439 (458) Q Consensus 371 ~~~~~~~~~~~l~G~pv~~~~~~~~~~~---~~~~~~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r-------- 439 (458) .+.. .- --++|..|++|..+|.+.. -...+.+++.+ ..+++. .--.+..|.+.+.+..+ T Consensus 200 fG~n--~L-~nfLG~~II~S~kv~~G~~~~T~~~Ni~~ay~~-----~~g~l~--~~f~~t~D~tglIGv~h~~~~~~~t 269 (303) T protein:vir:10 200 FGVN--LL-TPYVGVKIVEFADVPQGEVWMTVAENLNVAYAN-----PRGELS--RAFAFATDATGFVGVLHDIQPQRLT 269 (303) T ss_pred hhhh--hh-hhhhcceEEEeccCCCceEEEeeccceEEEEec-----Cchhhh--hhhhhccccccceEEEeccccceee Confidence 0111 11 1288999999999986421 11112222211 112111 00001111222211111 Q ss_pred -----ecc---EEecccceEEEEeecC Q lcl|NC_010583. 440 -----VNL---QRYFENGVVSGAYAAA 458 (458) Q Consensus 440 -----~d~---~~~~~~afv~l~~aaa 458 (458) +.+ -+-.++++++.+..++ T Consensus 270 ~eT~~~~~~~lfpE~~dgiv~~ti~~~ 296 (303) T protein:vir:10 270 SDTIYASAISMFPENIDAVIKVTIKKD 296 (303) T ss_pred ehhHhHhHHHhcccccceEEEEEEecc Confidence 122 2344578999999888 No 180 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=96.80 E-value=9.4e-05 Score=42.66 Aligned_cols=312 Identities=10% Similarity=-0.007 Sum_probs=134.8 Q ss_pred hcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcc-cccccCccccchhHHH----HHHHHHHhccchhhhcce Q lcl|NC_010583. 124 YGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGS-SSVSMSSEAYETIFST----RIIRDLQKELVVGALFDE 198 (458) Q Consensus 124 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~-~~~~~g~~~ip~~~~~----~ii~~~~~~~~l~~~~~~ 198 (458) .+......+.+. ..+ ..-........+.......+.... .-.+.+...||..+.+ .+++.+.+.....++..+ T Consensus 1 ~~~~~~~~~l~~-~gi-~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~pv 78 (336) T protein:vir:36 1 MRDAQRIQNLAR-AGV-ILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGE 78 (336) T ss_pred CchHHHHHHHhh-cCe-eecchhhhhhhHHHHhhhhhhhccCccccCCCcchHHHHHHhhccceEeeecchhhhhhhccc Confidence 000000000000 000 000000000001111111111111 1111233456655443 233333333333333333 Q ss_pred eeecc---CceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhh-HHHHhc--cHHHHHHHHH Q lcl|NC_010583. 199 LPMSS---KILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFIT-DETEED--AIFSLLPLLR 272 (458) Q Consensus 199 ~~~~~---~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is-~ell~d--s~~~~~~~i~ 272 (458) ...+. ....+++.+..+.+.+.+ +.+..+.++..-...+-..+.++..+.++ .|+..- ...++.+.-+ T Consensus 79 ~t~g~W~~~~~~~~~~e~~G~a~~yg------d~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka 152 (336) T protein:vir:36 79 SKKGDWTTLVAAFITAEPTTKVATYG------DYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELN 152 (336) T ss_pred cccCCccceeEEEeeeeceeeEEEee------ccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHH Confidence 22221 133445544444444433 33444556655555566777888888888 444332 2356778888 Q ss_pred HHHHHHHHHHHHHHHhccCCCCccccccccccccccceeecc-ccchhhHHHHHHHHHHHhhhhhhhc------ccceeE Q lcl|NC_010583. 273 KRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEA-KADGSVLVTAKTISKLRRKLGRHGL------KLSKLV 345 (458) Q Consensus 273 ~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~ 345 (458) ...++++.+.+++-.++|++..+..|+++........+..+. ....+....++++..++..+..... .+...+ T Consensus 153 ~aA~~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~ 232 (336) T protein:vir:36 153 YSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMG 232 (336) T ss_pred HHHHHHHHHhhCcEEEEeccccceEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEE Confidence 888999999999999999988888899986544322222211 1222334566777777777665332 245667 Q ss_pred echhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEec-----eEEEEecceeE Q lcl|NC_010583. 346 LIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKD-----NFVMPRQRAVT 420 (458) Q Consensus 346 ~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~-----~~~i~~~~~~~ 420 (458) +.+..+.+|..- +..|..+.+.-... .++-++...| .+ .++++....+...+ -..+.....++ T Consensus 233 LP~~~~~~Ls~~-n~~g~Tvl~~lk~n----~Pnl~i~t~p-----El--~~a~g~~~~l~~~~~~~~~t~~~~~p~~~~ 300 (336) T protein:vir:36 233 LPPTAMSDLSKT-NQYGLAAAAKLKDI----FPKLEFVTIP-----EY--DTASGRLVQLWAPRVEGKDTATCGFTEKMR 300 (336) T ss_pred echHHHHhccCC-CccCccHHHHHHHh----cCccEEEEcc-----cc--ccCCCceEEEEEEecCCCcceeeecchhhh Confidence 777766666432 22222111110000 0011222222 22 12222222221110 01111111111 Q ss_pred EeecccccCCceEEEEEEeec-cEEecccceEEEEee Q lcl|NC_010583. 421 VERERQAGKQRDAYYVTQRVN-LQRYFENGVVSGAYA 456 (458) Q Consensus 421 i~~~~~~~~~~~~~~~~~r~d-~~~~~~~afv~l~~a 456 (458) ...- ....-...+-...|.+ +.+.+|.||+.++-= T Consensus 301 ~l~v-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 301 AHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ccce-eecCceeEeccccceeeeeeeccchheeeecC Confidence 1000 0112224445556654 478889999987665 No 181 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=96.74 E-value=0.00037 Score=39.41 Aligned_cols=264 Identities=13% Similarity=0.037 Sum_probs=96.4 Q ss_pred hhhhcccccccCccccchhHHHHHHHHHHhccchhhhcc---e----eeeccCceEEEEec-CCCccccccccccccccc Q lcl|NC_010583. 159 KAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFD---E----LPMSSKILTMLVEP-EAGRATWVDASKFGTDET 230 (458) Q Consensus 159 ~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~---~----~~~~~~~~~~p~~~-~~~~a~~v~e~~~~~e~~ 230 (458) .+.+..++. .+--+.+..-.++.+.+...+.+.+. . .|..+.....+... ++. +..-.....+ T Consensus 1 ~~~t~~sdl----~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~----~~~rnv~~~~- 71 (315) T protein:vir:96 1 MATTVNSDL----VIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGA----IADRDVNSTA- 71 (315) T ss_pred Cceeeecce----eeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccc----hhhcccCCCc- Confidence 111111110 11122233334444443333222211 0 11111111111111 000 0000000000 Q ss_pred cccccc-ccceeeeeehhhee-eeehh--hHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccc Q lcl|NC_010583. 231 VGDEVK-GQLTEISFKTYKLA-AKSFI--TDETEE---DAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLA 303 (458) Q Consensus 231 ~~~~~~-~~f~~v~~~~~k~~-~~~~i--s~ell~---ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~ 303 (458) ...... .+...|.. |++ ..-|+ +...+. +.+..+..-|...+..+....+-...+.|. .+.++.. T Consensus 72 ~~t~~kit~~~dvaV---k~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~-----~aai~~~ 143 (315) T protein:vir:96 72 TVAGTKIAADEMVSV---KVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNAL-----QGAIGSN 143 (315) T ss_pred cccceecccccceeE---EEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhh-----hhhhccc Confidence 000000 01111111 122 22233 222222 334444444444444444433333333221 0011000 Q ss_pred cccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeee Q lcl|NC_010583. 304 ADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIY 383 (458) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~ 383 (458) . ..+ .......+....+.+....+.........|+||..++..|.+ + .....++.. .........++ .+ T Consensus 144 t---~~~----~~~~~a~~~~~~l~dA~~klGD~~~~l~~~vMHS~v~~~L~~-q-~L~~~~~~~-~~~~~~~~~~~-~l 212 (315) T protein:vir:96 144 A---GMN----VSGELATEGKKVLTKGLRTMGDKASSIAIWVMDSTSYFDIVD-E-AIDNKLYEE-AGVVVYGGTPG-TL 212 (315) T ss_pred c---ccc----ccccccccCHHHHHHHHHHhcccccCeeEEEEchHHHHHHHH-h-hhhhhcccc-cceeEecCcCc-cc Confidence 0 000 011222345566777778887778888899999999988876 2 122222211 11111122244 44 Q ss_pred cccceecccccccccCCceEEEEEeceE-EEEecceeEEeecccccCCceEEEEEEeecc-EEecccceEEEEeecC Q lcl|NC_010583. 384 GLPVVVSEYFPAKAASAEFAVIVYKDNF-VMPRQRAVTVERERQAGKQRDAYYVTQRVNL-QRYFENGVVSGAYAAA 458 (458) Q Consensus 384 G~pv~~~~~~~~~~~~~~~~~~~~~~~~-~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~-~~~~~~afv~l~~aaa 458 (458) |+||++++.||.. ..++++... .+.....+. .-++-..+.-.+....|..+ -.++|..|..-+.+.. T Consensus 213 GkrViVdD~~P~~------~~~gl~~GAi~~~~~~~~~--~~~~~~~g~e~l~~~~r~e~tf~l~p~G~sw~~~~~~ 281 (315) T protein:vir:96 213 GKPVLVTDQCPAT------KIFGLVAGAVMITESQAPG--MRSYQIDDQENLAIGFRAEGTANVEVLGYKWKTKTNV 281 (315) T ss_pred ccEEEEECCCCcc------eeeeeecceeeecCCCccc--cccccCCCcceeEEEEeeeeEeeeeeeeEEeecCCCc Confidence 9999999999952 234444432 233222221 11111222223333334333 4677777755322111 No 182 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=96.70 E-value=0.00013 Score=41.89 Aligned_cols=312 Identities=10% Similarity=-0.007 Sum_probs=135.7 Q ss_pred hcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcc-cccccCccccchhHH----HHHHHHHHhccchhhhcce Q lcl|NC_010583. 124 YGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGS-SSVSMSSEAYETIFS----TRIIRDLQKELVVGALFDE 198 (458) Q Consensus 124 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~-~~~~~g~~~ip~~~~----~~ii~~~~~~~~l~~~~~~ 198 (458) .+......+... ..+ ..-........+.......+.... .-.+.+...||..+. +.+++.+.+.....++..+ T Consensus 1 ~~~~~~~~~l~~-~gi-~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~i~~~l~~~i~p~~~~~~~~p~~a~~l~pv 78 (336) T protein:vir:10 1 MRDAQRIQNLAR-AGV-ILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGE 78 (336) T ss_pred CchHHHHHHHhh-cCe-eecchhhhhhhhHHHhhhhhhhccCccccCCCchhHHHHHhhcccceeeehhhhhhhhhhccc Confidence 000000000000 000 000000000000000001111111 111122345664433 2333444443334444333 Q ss_pred eeecc---CceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc---HHHHHHHHH Q lcl|NC_010583. 199 LPMSS---KILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA---IFSLLPLLR 272 (458) Q Consensus 199 ~~~~~---~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~~~~~~i~ 272 (458) ...+. ....+++.+..+.+.+.+ +.+..+.++..-...+-..+.++..+.++..=+.-+ .+++.+.-+ T Consensus 79 ~t~g~W~~~~~~~~~~e~~G~a~~yg------d~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka 152 (336) T protein:vir:10 79 SKKGDWTTLVAAFITAEPTTKVATYG------DYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELN 152 (336) T ss_pred cccCCccceeEEEeeeeceeeEEEee------ccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHH Confidence 22221 233445544444444433 334445566555555667788888888884433332 366888888 Q ss_pred HHHHHHHHHHHHHHHhccCCCCccccccccccccccceeecc-ccchhhHHHHHHHHHHHhhhhhhhc------ccceeE Q lcl|NC_010583. 273 KRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEA-KADGSVLVTAKTISKLRRKLGRHGL------KLSKLV 345 (458) Q Consensus 273 ~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~ 345 (458) ...++++.+.+++-.++|++..+..|+++........++.+. ....+....++++..++..+..... .+...+ T Consensus 153 ~aA~~ale~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~ 232 (336) T protein:vir:10 153 YSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMG 232 (336) T ss_pred HHHHHHHHHhhCcEEEEeccccceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEE Confidence 888999999999999999988888899987554322222211 1222334566777777777765332 245667 Q ss_pred echhHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEec-----eEEEEecceeE Q lcl|NC_010583. 346 LIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKD-----NFVMPRQRAVT 420 (458) Q Consensus 346 ~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~-----~~~i~~~~~~~ 420 (458) +.+..+.+|..- +..|..+.+.-... +-++.+...+.+ .++++....+...+ -..+.....++ T Consensus 233 LP~~~~~~Ls~~-n~~g~Tvl~~lk~n---------~Pnl~i~t~pEl--~~a~G~~~~l~~~~~~~~~t~~~~~p~~~~ 300 (336) T protein:vir:10 233 LPPTAMSDLSKT-NQYGLAAAAKLKDI---------FPKLEFVTIPEY--DTASGRLVQLWAPRVEGKDTATCGFTEKMR 300 (336) T ss_pred ecHHHHHhccCC-CccCccHHHHHHHh---------cCccEEEEcccc--ccCCCceEEEEEEecCCCcceeeecchhhh Confidence 777766666432 22222111110000 001122222222 12222222221110 01111111111 Q ss_pred EeecccccCCceEEEEEEeec-cEEecccceEEEEee Q lcl|NC_010583. 421 VERERQAGKQRDAYYVTQRVN-LQRYFENGVVSGAYA 456 (458) Q Consensus 421 i~~~~~~~~~~~~~~~~~r~d-~~~~~~~afv~l~~a 456 (458) ...- ....-...+-...|.+ +.+.+|.||+.++-= T Consensus 301 ~l~v-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 301 AHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ccce-eecCceeEeccccceeeeeeeccchheeeecC Confidence 1000 0112224445556654 478889999987665 No 183 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=96.54 E-value=0.00053 Score=38.54 Aligned_cols=265 Identities=10% Similarity=-0.068 Sum_probs=107.0 Q ss_pred ccCcc--ccchhHHHHHHHHHHhccchhhhcce-ee----e--ccCceEEEEecCCCccccccccccccccccccccccc Q lcl|NC_010583. 168 SMSSE--AYETIFSTRIIRDLQKELVVGALFDE-LP----M--SSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQ 238 (458) Q Consensus 168 ~~g~~--~ip~~~~~~ii~~~~~~~~l~~~~~~-~~----~--~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~ 238 (458) .+..+ ++|+.++.++++.+++..++.++++. .+ . .+...++|+........ . -+.... .....+.+ T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d-~-~~~~~t---~~~~~~l~ 75 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSER-T-MDGDIT---GKSKNSLI 75 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeec-c-cCcccC---cccccccc Confidence 22334 78999999999999999999888765 22 1 13444555433221110 0 000000 00001111 Q ss_pred c--eeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeecccc Q lcl|NC_010583. 239 L--TEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKA 316 (458) Q Consensus 239 f--~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~ 316 (458) = -.++++.+|...+ .++.+=+..+..+++++++.. .++++..+|..+...-....+ ...++..+ .. T Consensus 76 e~~v~l~id~~k~~a~-~v~d~E~~l~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~--------~~vgt~~t--~~ 143 (423) T protein:vir:10 76 SAKATGEVGNYITVAV-EYRQIEEALKLNQLDQILVPI-NERMVTDLETELALFMMKHGA--------LSLGSPNT--PI 143 (423) T ss_pred cceEEEEecceeeeee-eeChHHHhcChhHHHHHHHHH-HHHHHHHHHHHHHHHhhhccc--------cccccccc--cc Confidence 1 2345555554433 454443345667787766555 689999999988632111101 00000001 00 Q ss_pred chhhHHHHHHHHHHHhhhhhhhccc-ce-eEechhHHHHHHhhhccccccccccccccccccc-cCCeeecccceecccc Q lcl|NC_010583. 317 DGSVLVTAKTISKLRRKLGRHGLKL-SK-LVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQG-QVGRIYGLPVVVSEYF 393 (458) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~-~~~~l~G~pv~~~~~~ 393 (458) ..+.++.++...+.....+. .. .++.|..+..|.. +-................+ ..+++.|..|+.|+.+ T Consensus 144 -----~a~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~--~~~~~~~~~~~~~~alr~~~i~G~~~GFdi~~Sn~v 216 (423) T protein:vir:10 144 -----KKWSDVAQTASFLKDLGINSGENYAVMDPWAAQRLAD--AQSGLHVSEQLVRTAWENAQISGNFGGIRALMSNGL 216 (423) T ss_pred -----ccHHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhh--hhhhhccccccchHHHHhcccceeecceEEEEecCC Confidence 11334454544554444332 23 3567766655532 1000111111111112223 3478999999999999 Q ss_pred cccccCCce---EEEEE--eceEEEEe--cceeE-----------EeecccccCCceEEEEEEeeccEEe------cccc Q lcl|NC_010583. 394 PAKAASAEF---AVIVY--KDNFVMPR--QRAVT-----------VERERQAGKQRDAYYVTQRVNLQRY------FENG 449 (458) Q Consensus 394 ~~~~~~~~~---~~~~~--~~~~~i~~--~~~~~-----------i~~~~~~~~~~~~~~~~~r~d~~~~------~~~a 449 (458) |...++..- .+-++ .....+.+ ....+ +..-+.+.-.-+ .+..++...++ ++.- T Consensus 217 p~~T~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv--~~v~~~tk~~l~~~~~~~~~~ 294 (423) T protein:vir:10 217 ASRTQGAFGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDT--HWLNQQSKQTLYNGASALSFT 294 (423) T ss_pred cccccccccceeeeeeeeEEEecccccccccccceeeccceeceeEEecceEeecce--eeecccccceeecccCCcceE Confidence 954332211 10010 00000100 00000 111111100000 00001111110 1111 Q ss_pred eEEEE-------------ee----------------cC Q lcl|NC_010583. 450 VVSGA-------------YA----------------AA 458 (458) Q Consensus 450 fv~l~-------------~a----------------aa 458 (458) |++.- .. |+ T Consensus 295 ~~V~~~~~~~a~~~~tv~i~p~~~~~~~~~~~~~V~a~ 332 (423) T protein:vir:10 295 ATVMEDANAHSSGDVTVKISGVPIFDAGYPQYNAVDRL 332 (423) T ss_pred EEEEecccccccCceEEEeccccccccCcccccceecc Confidence 22211 10 00 No 184 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=96.50 E-value=0.00056 Score=38.39 Aligned_cols=266 Identities=10% Similarity=-0.018 Sum_probs=108.5 Q ss_pred ccCcc--ccchhHHHHHHHHHHhccchhhhcce-ee----e--ccCceEEEEecCCCccccccccccccccccccccccc Q lcl|NC_010583. 168 SMSSE--AYETIFSTRIIRDLQKELVVGALFDE-LP----M--SSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQ 238 (458) Q Consensus 168 ~~g~~--~ip~~~~~~ii~~~~~~~~l~~~~~~-~~----~--~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~ 238 (458) .+..+ .+|+.++..+++.+++..++.++++. .+ . .+...+|++........+..- ....+.-.+.+ T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~-----~~~~~~~~dl~ 75 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTG-----DISGQNKNNLI 75 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCc-----cccccccCccc Confidence 11222 36999999999999999998888755 21 1 234455555432221111100 00011111222 Q ss_pred ce--eeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhcc-CCCCccccccccccccccceeeccc Q lcl|NC_010583. 239 LT--EISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSG-NGTGQPKGLLKLAADDGAKVVTEAK 315 (458) Q Consensus 239 f~--~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G-~g~~~p~Gi~~~~~~~~~~~~~~~~ 315 (458) -+ .++++.+|..++--=+.|+ .....++++++... .++++..+|..++.- .+. .....++ .+.. T Consensus 76 e~~v~l~id~~k~va~~v~d~E~-~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~---------~~~~~gt--~~t~ 142 (423) T protein:vir:10 76 SGKATGRVGNYITVAVEYQQLEE-AIKLNQLEEILAPV-RQRIVTDLETELAHFMMNN---------GALSLGS--PNTP 142 (423) T ss_pred cceeEEEeeceeeeeeeechHHH-hcChhhHHHHHHHH-HHHHHHHHHHHHHHHHhhc---------ccccccc--CCcc Confidence 22 3555666554443324444 34556787766655 588999999988742 111 0000000 0000 Q ss_pred cchhhHHHHHHHHHHHhhhhhhhccc-cee-EechhHHHHHHhhhcccccccccccccccccccc-CCeeecccceeccc Q lcl|NC_010583. 316 ADGSVLVTAKTISKLRRKLGRHGLKL-SKL-VLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQ-VGRIYGLPVVVSEY 392 (458) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~-~~~l~G~pv~~~~~ 392 (458) . ..+.++.++...+.....+. ..| +++|..+..|.+- + ......+.........+. .+++.|..|+.|+. T Consensus 143 ~-----~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~-~-~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snn 215 (423) T protein:vir:10 143 I-----TKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADA-Q-TGLHASDQLVRTAWENAQIPTNFGGIRALMSNG 215 (423) T ss_pred c-----chHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhcc-c-cceecccccchhhhhhccceeeecceEEEEeCC Confidence 0 12345555555555544432 234 5566665555421 1 100000111111122232 37899999999999 Q ss_pred ccccccCCc--eEEEEEece---EEEEecceeEE--ee---c--cccc-CCceEEEEEE---eeccEEe------cccce Q lcl|NC_010583. 393 FPAKAASAE--FAVIVYKDN---FVMPRQRAVTV--ER---E--RQAG-KQRDAYYVTQ---RVNLQRY------FENGV 450 (458) Q Consensus 393 ~~~~~~~~~--~~~~~~~~~---~~i~~~~~~~i--~~---~--~~~~-~~~~~~~~~~---r~d~~~~------~~~af 450 (458) +|...++.. ......+.. -...+.....+ .. . .... -|.+.|-+.. +....++ .+.-| T Consensus 216 ip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~ 295 (423) T protein:vir:10 216 LASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTA 295 (423) T ss_pred CccccccccccceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEE Confidence 996433321 111111100 00011111111 00 0 0000 0111111100 0000000 11112 Q ss_pred EEEE---------------------------------eecC Q lcl|NC_010583. 451 VSGA---------------------------------YAAA 458 (458) Q Consensus 451 v~l~---------------------------------~aaa 458 (458) ++.. ++++ T Consensus 296 ~v~a~~~~~~~g~~tv~i~p~~i~~~~~~~~~~v~a~~a~~ 336 (423) T protein:vir:10 296 TVTADANSDSGGDVTVTLSGVPIYDTTNPQYNSVSRQVEAG 336 (423) T ss_pred EEEeeeeeccCCceeeeccCccccccCCcccccccccccCC Confidence 2211 1111 No 185 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=96.49 E-value=0.00022 Score=40.68 Aligned_cols=312 Identities=11% Similarity=0.003 Sum_probs=139.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhccc-ccccC Q lcl|NC_010583. 92 QETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSS-SVSMS 170 (458) Q Consensus 92 ~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~g 170 (458) .+... .+..+ ++ .+-.+. +. ......+.......+..... -.+.+ T Consensus 1 ~~~~~----~~~~l----~~---~gi~~~----~~-------------------~~~~~~~~~~~a~da~d~~~~~~t~~ 46 (336) T protein:vir:78 1 MRDAQ----RIQNL----AR---AGVILP----RS-------------------VKNVSTPLAEYAMDAADLSPHLSSTG 46 (336) T ss_pred CchHH----HHHHH----hc---cCeecc----hh-------------------hhhhhHHHHHHHHhhhhhccccccCC Confidence 00000 00000 00 000000 00 00000000000111111111 11222 Q ss_pred ccccchhHHH----HHHHHHHhccchhhhcceeeecc---CceEEEEecCCCcccccccccccccccccccccccceeee Q lcl|NC_010583. 171 SEAYETIFST----RIIRDLQKELVVGALFDELPMSS---KILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEIS 243 (458) Q Consensus 171 ~~~ip~~~~~----~ii~~~~~~~~l~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~ 243 (458) ...||..+.. ++++.+.......++..+...+. ....|++.+..+.+.+.+ +.+..+..+...+..+ T Consensus 47 ~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~~G~a~~yg------d~~D~P~vd~~~~~~~ 120 (336) T protein:vir:78 47 SSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYG------DYSSDGDSGTNINYPQ 120 (336) T ss_pred CcchHHHHHHhcccceeeehhhhhhhhhhcccccCCCccccEEEEeeeecceeeEEee------cccCCCeeecceeeEE Confidence 3345654433 33334444333444433322221 234555555444444443 3345566677777777 Q ss_pred eehhheeeeehhhHHHHhcc---HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccc-cchh Q lcl|NC_010583. 244 FKTYKLAAKSFITDETEEDA---IFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAK-ADGS 319 (458) Q Consensus 244 ~~~~k~~~~~~is~ell~ds---~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~-~~~~ 319 (458) -+.+.++..+.++..=+..+ ..++.+.-+...++++.+.++.-.++|++..+..|+++........++.... ...+ T Consensus 121 ~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T 200 (336) T protein:vir:78 121 RQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPA 200 (336) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeccccceEEEEeCCCCCcccccCcCcccccC Confidence 78888888888885544443 3668888888889999999999999999888889999976544322222211 1233 Q ss_pred hHHHHHHHHHHHhhhhhhhc------ccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccc Q lcl|NC_010583. 320 VLVTAKTISKLRRKLGRHGL------KLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYF 393 (458) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~ 393 (458) ....++++..++..+..... .+...++.+..+.+|..- +..|..+.+- +.. + .-++.|...+.+ T Consensus 201 ~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~~-n~~g~tv~~~-lk~---n-----~Pnl~i~t~pel 270 (336) T protein:vir:78 201 VEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT-NQYGLSAAAK-LKE---I-----FPKLEFVTIPEY 270 (336) T ss_pred HHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCC-CccCccHHHH-HHH---h-----cCccEEEEcccc Confidence 44566777777776654332 123567777777777532 2222211110 000 0 001223222222 Q ss_pred cccccCCceEEEEEec-----eEEEEecceeEEeecccccCCceEEEEEEeec-cEEecccceEEEEee Q lcl|NC_010583. 394 PAKAASAEFAVIVYKD-----NFVMPRQRAVTVERERQAGKQRDAYYVTQRVN-LQRYFENGVVSGAYA 456 (458) Q Consensus 394 ~~~~~~~~~~~~~~~~-----~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d-~~~~~~~afv~l~~a 456 (458) .++++....+...+ ...+.-...++... ..........-...|.+ ..+..|.||+.++-= T Consensus 271 --~~Agg~~~~~~~~~~~~~~t~~~~~p~~f~~lp-vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 271 --DTASGRLVQLWAPRVEGKDTATCGFTEKMRAHS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred --cccCcceEEEEEeeccCCcceeeecchhhhccc-eeecCceeEeccccceeeeeeeccchheeeccC Confidence 12233222221111 01111111111100 00112333445555654 478889999886655 No 186 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=96.32 E-value=0.00029 Score=39.96 Aligned_cols=311 Identities=10% Similarity=-0.014 Sum_probs=135.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhccc-ccccC Q lcl|NC_010583. 92 QETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSS-SVSMS 170 (458) Q Consensus 92 ~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~g 170 (458) .+... .+..+ ++ .+-.+. +. ......+.......+..... -.+.+ T Consensus 1 ~~~~~----~~~~l----~~---~gi~~~----~~-------------------~~~~~~~~~~~a~da~d~~~~~~t~~ 46 (336) T protein:vir:10 1 MRDAQ----RIQNL----AR---AGVILP----RS-------------------VKNVSTPLAEYAMDAADLSPHLSSTG 46 (336) T ss_pred CchHH----HHHHH----hc---cCeecc----hh-------------------hhhhhHHHHHHHHhhhhhccccccCC Confidence 00000 00000 00 000000 00 00000000000111111111 11222 Q ss_pred ccccchhHHHHHHH--HHHhccchhhhcceeeecc------CceEEEEecCCCcccccccccccccccccccccccceee Q lcl|NC_010583. 171 SEAYETIFSTRIIR--DLQKELVVGALFDELPMSS------KILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEI 242 (458) Q Consensus 171 ~~~ip~~~~~~ii~--~~~~~~~l~~~~~~~~~~~------~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v 242 (458) ...||..+.. +++ .++-..+-++....+|+.+ ....+++.+..+.+.+.+ +.+..+..+.....- T Consensus 47 ~~g~~~~l~~-~i~p~~~~~~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~~G~a~~yg------d~~d~P~~d~~~~~~ 119 (336) T protein:vir:10 47 SSGIPNYLTT-YVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYG------DYSSDGDSGTNINYP 119 (336) T ss_pred CcchHHHHHh-hcCcceeeeeechhchhhhcccccCCCcceeeEEEEeeeeeeeEEEcc------ccCCCcceeeeeeee Confidence 3345654443 432 2333333344444444432 223334444333333322 333455566555666 Q ss_pred eeehhheeeeehhhHHHHhcc---HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccc-cch Q lcl|NC_010583. 243 SFKTYKLAAKSFITDETEEDA---IFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAK-ADG 318 (458) Q Consensus 243 ~~~~~k~~~~~~is~ell~ds---~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~-~~~ 318 (458) +-+.+.++..+.++..=+..+ ..++.+.-+...++++.+.++.-.++|++..+..|+++........++.+.. ... T Consensus 120 ~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~ 199 (336) T protein:vir:10 120 QRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSP 199 (336) T ss_pred eeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeecccceEEEeecCCCCcccccCcCccccc Confidence 667778888888886544433 3568888888889999999999999999888889999976544322222211 123 Q ss_pred hhHHHHHHHHHHHhhhhhhhc------ccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccc Q lcl|NC_010583. 319 SVLVTAKTISKLRRKLGRHGL------KLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEY 392 (458) Q Consensus 319 ~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~ 392 (458) +....++++..++..+..... .+...++.+..+.+|.. .+..|..+.+- +.. + .-++.|...+. T Consensus 200 T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~~-lk~---n-----~Pnl~i~t~pe 269 (336) T protein:vir:10 200 AVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKE---I-----FPKLEFVTIPE 269 (336) T ss_pred CHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHHH-HHH---h-----CCccEEEEccc Confidence 345566777777776654332 12356677777777753 22222211110 000 0 00122332222 Q ss_pred ccccccCCceEEEEEec-----eEEEEecceeEEeecccccCCceEEEEEEeec-cEEecccceEEEEee Q lcl|NC_010583. 393 FPAKAASAEFAVIVYKD-----NFVMPRQRAVTVERERQAGKQRDAYYVTQRVN-LQRYFENGVVSGAYA 456 (458) Q Consensus 393 ~~~~~~~~~~~~~~~~~-----~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d-~~~~~~~afv~l~~a 456 (458) + .++++....+...+ ...+.-...++... ..........-...|.+ ..+.+|.||++++-= T Consensus 270 l--~~Agg~~~~~~~~~~~~~~t~~~~~P~~f~~lp-vq~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 270 Y--DTASGRLVQLWAPRVEGKDTATCGFTEKMRAHS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred c--cccCCceEEEEEecccCCcceeeecChhhhccc-eeecCceeEeccccceeeeeeeccchheeeccC Confidence 2 12233322222111 01111111111100 00112233445555654 477889999886655 No 187 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=96.29 E-value=0.00078 Score=37.63 Aligned_cols=266 Identities=9% Similarity=-0.041 Sum_probs=108.3 Q ss_pred ccCcc--ccchhHHHHHHHHHHhccchhhhcceee-----e--ccCceEEEEecCCCccccccccccccccccccccccc Q lcl|NC_010583. 168 SMSSE--AYETIFSTRIIRDLQKELVVGALFDELP-----M--SSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQ 238 (458) Q Consensus 168 ~~g~~--~ip~~~~~~ii~~~~~~~~l~~~~~~~~-----~--~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~ 238 (458) .+..+ .+|+.++...++.+++..++.++++.-. . .+...+|++........+- +. .+......+.+ T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~--~~---~~~~~~~~~l~ 75 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTP--TG---DISGQNKNNLI 75 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeeccc--Cc---ccCCcccCccc Confidence 11121 3799999999999999999888875422 1 1335556643221111110 00 00001111111 Q ss_pred c--eeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhcc-CCCCccccccccccccccceeeccc Q lcl|NC_010583. 239 L--TEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSG-NGTGQPKGLLKLAADDGAKVVTEAK 315 (458) Q Consensus 239 f--~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G-~g~~~p~Gi~~~~~~~~~~~~~~~~ 315 (458) = ..++++.+|..++--=+.|. .....++++++... .++++..+|..++.- .+. +....++..+. T Consensus 76 e~~v~l~id~~k~va~~v~d~E~-~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~---------a~~~~gt~~t~-- 142 (423) T protein:vir:17 76 SGKATGRVGNYITVAVEYQQLEE-AIKLNQLEEILAPV-RQRIVTDLETELAHFMMNN---------GALSLGSPNTP-- 142 (423) T ss_pred cceeEEEeeceeeeeeeecHHHH-hcChhHHHHHHHHH-HHHHHHHHHHHHHHHHhhc---------cccccccCCcc-- Confidence 1 24566666655544334444 44556787766665 588999999988742 110 00000000110 Q ss_pred cchhhHHHHHHHHHHHhhhhhhhccc-cee-EechhHHHHHHhhhcccccccccccccccccccc-CCeeecccceeccc Q lcl|NC_010583. 316 ADGSVLVTAKTISKLRRKLGRHGLKL-SKL-VLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQ-VGRIYGLPVVVSEY 392 (458) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~-~~~l~G~pv~~~~~ 392 (458) . ..+.++.++...|.....+. ..| +++|..+..|.+ +.......+.........+. .+++.|..|+.|+. T Consensus 143 ~-----~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~--~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snn 215 (423) T protein:vir:17 143 I-----TKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLAD--AQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNG 215 (423) T ss_pred c-----ccHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhc--cccceecccccchHHHhhccceeeecceEEEEeCC Confidence 0 12445555555665544442 244 556766555542 11111110111111122232 37899999999999 Q ss_pred ccccccCCce--EEEEEece---EEEEec------ceeEEeecc-cc-cCCceEEEEEE---eeccE------Eecccce Q lcl|NC_010583. 393 FPAKAASAEF--AVIVYKDN---FVMPRQ------RAVTVERER-QA-GKQRDAYYVTQ---RVNLQ------RYFENGV 450 (458) Q Consensus 393 ~~~~~~~~~~--~~~~~~~~---~~i~~~------~~~~i~~~~-~~-~~~~~~~~~~~---r~d~~------~~~~~af 450 (458) +|...++..- ........ ....+. .......+. .. .-+.+.|-+.. +.... ..++.-| T Consensus 216 ip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~ 295 (423) T protein:vir:17 216 LASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTA 295 (423) T ss_pred CccccccceeceeeecccccccccccccccceeeeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEE Confidence 9965433311 11111100 000000 000000000 00 01111111100 00000 0011122 Q ss_pred EEE---------------------------------EeecC Q lcl|NC_010583. 451 VSG---------------------------------AYAAA 458 (458) Q Consensus 451 v~l---------------------------------~~aaa 458 (458) ++. .+|++ T Consensus 296 ~v~~~~~~~a~~~~tv~i~p~~i~~~~~~~~~~v~a~~a~~ 336 (423) T protein:vir:17 296 TVTADANSDSSGDVTVTLSGVPIYDTTNPQYNSVSRQVAAG 336 (423) T ss_pred EEEecccccccCceEEEecCccccccCCcccccceecccCC Confidence 211 11111 No 188 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=96.12 E-value=0.00098 Score=37.07 Aligned_cols=266 Identities=11% Similarity=-0.002 Sum_probs=111.0 Q ss_pred ccCcc--ccchhHHHHHHHHHHhccchhhhcce-eee----c--cCceEEEEecCCCccccccccccccccccccccccc Q lcl|NC_010583. 168 SMSSE--AYETIFSTRIIRDLQKELVVGALFDE-LPM----S--SKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQ 238 (458) Q Consensus 168 ~~g~~--~ip~~~~~~ii~~~~~~~~l~~~~~~-~~~----~--~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~ 238 (458) .+..+ .||+.++...++.+++..++.++++. .+. . +...+|++........+.. ..+....-.+.+ T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~-----~~~~~~~~~~~~ 75 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTET-----GDITGKDKNGLF 75 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccC-----cCCCCccccccc Confidence 11111 46999999999999999999888755 221 1 3344566543321111100 000111111222 Q ss_pred cee--eeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeecccc Q lcl|NC_010583. 239 LTE--ISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKA 316 (458) Q Consensus 239 f~~--v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~ 316 (458) -.+ ++++.+|... +.++.+=+..+..++++++...+ ++++..+|..++..= ...+....++..+ T Consensus 76 e~~v~l~id~~k~~a-~~v~d~e~~l~i~~~~~~l~~a~-~ala~~vd~~l~~~l--------~~~a~~~vgt~~t---- 141 (423) T protein:vir:35 76 SAKATGKVGKYITVA-VEWTQIEEALKLNQLDQILSPIH-ERMVTDLETELAHFM--------MNNGALSLGSPNT---- 141 (423) T ss_pred cceeeEEeccceecc-ceeCHHHHHhhHHHHHHHHHHHH-HHHHHHHHHHHHHHH--------hhccccccccccC---- Confidence 223 4444444433 34555544445667887777664 778888998887420 0001000000000 Q ss_pred chhhHHHHHHHHHHHhhhhhhhccc-cee-EechhHHHHHHhhhcccccccc-cccccccccccc-CCeeecccceeccc Q lcl|NC_010583. 317 DGSVLVTAKTISKLRRKLGRHGLKL-SKL-VLIVSMDAYYDLLEDEEWQDVA-QVGNDAVKLQGQ-VGRIYGLPVVVSEY 392 (458) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~l~~~~d~~~~~~~-~~~~~~~~~~~~-~~~l~G~pv~~~~~ 392 (458) ....+..+.++...+.....+. ..| +++|..+..|.+ + +.+... +.........+. .|++.|..|+.|+. T Consensus 142 ---~~~~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~--~-~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snn 215 (423) T protein:vir:35 142 ---AIKKWADVAQTASFIKDIGIKTGENYAIMDPWSAQRLAD--A-QSGLHAADQLVRTAWENAQISGNFGGIRALMSNG 215 (423) T ss_pred ---CcchHHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhc--c-ccceeccccchhHHHhhccceeeecceEEEEcCC Confidence 0122455666666666555443 355 567776666532 1 111110 111111122332 37899999999999 Q ss_pred ccccccCCce--EEEEEec---eEEEEecceeEE-------eeccc-ccCCce-------------------------EE Q lcl|NC_010583. 393 FPAKAASAEF--AVIVYKD---NFVMPRQRAVTV-------ERERQ-AGKQRD-------------------------AY 434 (458) Q Consensus 393 ~~~~~~~~~~--~~~~~~~---~~~i~~~~~~~i-------~~~~~-~~~~~~-------------------------~~ 434 (458) +|...++... ....-.. ...+.+...-++ ..... ...|.+ .| T Consensus 216 vp~~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t~~~~~~~~t~~~~~~ 295 (423) T protein:vir:35 216 LASRKQGDFDGAITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGFLKAGDQLKFTSTHWLNQQSKQTLYNGSTAMSFTA 295 (423) T ss_pred CccccccccccceeeccccccccccccccccceeeeeeeeeccCCcEEecceEEeeeeeeccccccceeecccCCceeEE Confidence 9964433211 1110000 011111110000 00000 011111 11 Q ss_pred EEE------------EeeccEEecccce-----EEEEeecC Q lcl|NC_010583. 435 YVT------------QRVNLQRYFENGV-----VSGAYAAA 458 (458) Q Consensus 435 ~~~------------~r~d~~~~~~~af-----v~l~~aaa 458 (458) ++. ..++.+++.|.++ |..+++++ T Consensus 296 ~V~~~~~~~a~g~~~v~i~p~~~~~~~~~~~~~v~a~~a~~ 336 (423) T protein:vir:35 296 TVLEETNSTASGDVTVKLSGVPIYDEKNSQYNAVDAKVKAG 336 (423) T ss_pred EEeccccccccCceeEEccccccccCCCcccccccccccCC Confidence 111 0011111222111 11111111 No 189 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=96.06 E-value=0.00075 Score=37.72 Aligned_cols=293 Identities=14% Similarity=0.112 Sum_probs=131.5 Q ss_pred HHHhhhccchhHHHHHHHHhhhhhcccccccCccccch-hHHHHHHHHHHhccchhhhcceeeecc--Cce-EEEEecCC Q lcl|NC_010583. 139 LLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYET-IFSTRIIRDLQKELVVGALFDELPMSS--KIL-TMLVEPEA 214 (458) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~-~~~~~ii~~~~~~~~l~~~~~~~~~~~--~~~-~~p~~~~~ 214 (458) ..++ .....+.. +++.++-|.-+-+ .+..+.+...++...+.+++.+.|+.. |.. ++-+...- T Consensus 1 ~~~~------------~a~~~~~~-~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl 67 (401) T protein:vir:95 1 MLNY------------NAPTDGQK-SSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPL 67 (401) T ss_pred CCcc------------CCCccccc-ccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccc Confidence 0000 01111111 2222222223333 333455555555678888998877663 222 11111111 Q ss_pred Cccc-cccccccccccc-----------------------------ccccccccceeeeeehhheeeeehhhHHHHhc-c Q lcl|NC_010583. 215 GRAT-WVDASKFGTDET-----------------------------VGDEVKGQLTEISFKTYKLAAKSFITDETEED-A 263 (458) Q Consensus 215 ~~a~-~v~e~~~~~e~~-----------------------------~~~~~~~~f~~v~~~~~k~~~~~~is~ell~d-s 263 (458) +.+. -..||... .++ .......+-..+..+.++++.++++|+++... + T Consensus 68 ~~~~~pl~eGv~a-~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~ 146 (401) T protein:vir:95 68 LDDRNINDQGIDA-SGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDS 146 (401) T ss_pred cccccchhcCCCc-ccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhc Confidence 2211 12333211 111 01111223344666788999999999997764 3 Q ss_pred HHHHHHHH-HHHHHHHHHH---HHHHHHhccCCC----CccccccccccccccceeeccccchhhHHHHHHHHHHHhhhh Q lcl|NC_010583. 264 IFSLLPLL-RKRLIEAHAV---SIEEAFMSGNGT----GQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLG 335 (458) Q Consensus 264 ~~~~~~~i-~~~la~~~~~---~~d~~~l~G~g~----~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (458) +..+..-| .+.|.-+..+ .+-..+|++-++ +.-...-+.. .........++.++..+...|. T Consensus 147 D~~l~~h~s~ell~g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~----------~~~~~~t~vt~~~l~rl~~~L~ 216 (401) T protein:vir:95 147 DDGLMEHLSRELMNGATQITEAVLQKDLLAAAGTVLYAGAATSDATIT----------GEGSTPSVVSYKNLMRLDQILT 216 (401) T ss_pred chHHHHHHHHHHhhhhhhhHHHHHHHHHHhhcCeeecCCccceeeecc----------ccccccceechhHHHHHHHHHH Confidence 45566544 3334434333 334466654332 1111111100 0111112223333333333333 Q ss_pred h------------------hhcccc-eeEechhHHHHHHhhhc----cccccccccccccccccccCCeeecccceeccc Q lcl|NC_010583. 336 R------------------HGLKLS-KLVLIVSMDAYYDLLED----EEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEY 392 (458) Q Consensus 336 ~------------------~~~~~~-~~~~~~~~~~~l~~~~d----~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~ 392 (458) . .....+ .-+||+.....|..++| +.+.+............+..|.+-+..+++++. T Consensus 217 ~nRapk~t~~i~~s~~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~ 296 (401) T protein:vir:95 217 ENRTPTQTTIITGSRMIDTKVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPE 296 (401) T ss_pred hcccccchhhhhhhhccCccccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEeccc Confidence 1 011112 13668877666665554 555555555555555667778888999998887 Q ss_pred cc--------cccc-----------CC---ceEEEEEec-eEEEEe--cce----eEEee-----------cccccCCce Q lcl|NC_010583. 393 FP--------AKAA-----------SA---EFAVIVYKD-NFVMPR--QRA----VTVER-----------ERQAGKQRD 432 (458) Q Consensus 393 ~~--------~~~~-----------~~---~~~~~~~~~-~~~i~~--~~~----~~i~~-----------~~~~~~~~~ 432 (458) +- ..++ ++ .+..+..++ .|.... ..+ +++.+ |++ |+. T Consensus 297 ~~~w~~ag~~a~~~~~~y~~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPl---gQ~ 373 (401) T protein:vir:95 297 MLHWAGAGAQATGANPGYRTSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPY---GET 373 (401) T ss_pred ceeecCCcccccccccccccccccCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcc---cce Confidence 43 1111 00 122233333 222211 111 12221 233 344 Q ss_pred EEEEEE-eeccEEecccceEEEEeecC Q lcl|NC_010583. 433 AYYVTQ-RVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 433 ~~~~~~-r~d~~~~~~~afv~l~~aaa 458 (458) .+.+.- ++++.+.+++=++.++.++= T Consensus 374 g~vgwK~~~a~~vL~~e~m~~ies~a~ 400 (401) T protein:vir:95 374 GFSSIKWYYGILVKRPERLALIKTVAP 400 (401) T ss_pred ehhhhhhhhhhheeccceeEEEEeecC Confidence 444333 45678888888888877666 No 190 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=95.23 E-value=0.0021 Score=35.24 Aligned_cols=333 Identities=13% Similarity=-0.014 Sum_probs=131.9 Q ss_pred HHHHHHHHhhhhhhhhhhh-hcchhhhhhHHHHHHHHHh---hhccchhHHHHHHHHhhhhhcc-------cccccCccc Q lcl|NC_010583. 105 LLAAREGRSFVGDSVAKAL-YGTQDAFEDEVEKLVLLSY---MMEKDVFETEHGKAHIKAVNGS-------SSVSMSSEA 173 (458) Q Consensus 105 ~~~~~e~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~~~---~~~~~~~~~~~~~~~~~a~~~~-------~~~~~g~~~ 173 (458) .+......+.......+.. ....+..... ...+.++ +................+.... .....+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~l~~~gi~~~~~~~~~~~~~~~amd~~~~~~~~~~~~~l~~~~~~g 78 (379) T protein:vir:10 1 MPQISKIHSSLNARQMTQMVMDSADVTLDN--LKHLESYGIHLNGRKNKLFELMQFAMDSNDIGPIPTPLSPLSPVSIPG 78 (379) T ss_pred CCCcceeeeecCccccchhhhccccccHHH--HHHHHhcCccccchhhhhhhhhhhhhccccccccccccCccccccccc Confidence 0000000000000000000 0000000000 0111111 0000000000000000010000 111112223 Q ss_pred cchhH---HHHHHHHHHhccchhhhcceeeecc---CceEEEEecCCCcccccccccccccccccccccccceeeeeehh Q lcl|NC_010583. 174 YETIF---STRIIRDLQKELVVGALFDELPMSS---KILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTY 247 (458) Q Consensus 174 ip~~~---~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~ 247 (458) +|+.+ .+.+++.+-.-..+.++..+...+. ....+++.+..+.+.+.+-+ +..+..+...+..+-..+ T Consensus 79 ~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~------~d~pl~d~~~~~~~r~v~ 152 (379) T protein:vir:10 79 LIQFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTDG------GNMALMSWTPTFETRTVV 152 (379) T ss_pred hHHHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeEEeccc------cCCCeeeeeeeeeeeeeE Confidence 44432 2455555544444444443332221 23444554444444444332 333344444444444556 Q ss_pred heeeeehhhHHHHhcc---HHHHHHHHHHHHHHHHHHHHHHHHhccCC--CCccccccccccccccceeecc---cc--- Q lcl|NC_010583. 248 KLAAKSFITDETEEDA---IFSLLPLLRKRLIEAHAVSIEEAFMSGNG--TGQPKGLLKLAADDGAKVVTEA---KA--- 316 (458) Q Consensus 248 k~~~~~~is~ell~ds---~~~~~~~i~~~la~~~~~~~d~~~l~G~g--~~~p~Gi~~~~~~~~~~~~~~~---~~--- 316 (458) .++..+.++..=+..+ ..++.+.-+....+++.+.+|+-.|+|.+ .....|+++........++++. .. T Consensus 153 ~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l~a~~t~atg~~~~t~Wa 232 (379) T protein:vir:10 153 RFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPNLPAYVAVPNGAGGSPLWA 232 (379) T ss_pred EEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcceEEEEeCCCCcccccccCCcccccccc Confidence 6676677765433332 36788888999999999999999999953 3456799987655433222211 11 Q ss_pred chhhHHHHHHHHHHHhhhhhhhcc-------cceeEechhHHHHHHhhhccccccccccccccccccccCCeeeccccee Q lcl|NC_010583. 317 DGSVLVTAKTISKLRRKLGRHGLK-------LSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVV 389 (458) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~ 389 (458) ..+....++++..++..+...... +...++.+..+.+|..- +..|..+... +. .+ ..++.|.. T Consensus 233 ~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~-n~~g~Tvl~~-lk---~n-----~Pnl~i~t 302 (379) T protein:vir:10 233 QKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP-TELGYSVAQY-MR---ES-----YPNVTFVS 302 (379) T ss_pred cCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc-cccCccHHHH-HH---Hh-----cCCcEEEE Confidence 113334556666666655432211 12466677777777532 2112111110 00 00 11222333 Q ss_pred cccccccccCCceEEEEEeceEE---EEeccee-EEeecccc------cCCceEEEEEEee-ccEEecccceEEEEee Q lcl|NC_010583. 390 SEYFPAKAASAEFAVIVYKDNFV---MPRQRAV-TVERERQA------GKQRDAYYVTQRV-NLQRYFENGVVSGAYA 456 (458) Q Consensus 390 ~~~~~~~~~~~~~~~~~~~~~~~---i~~~~~~-~i~~~~~~------~~~~~~~~~~~r~-d~~~~~~~afv~l~~a 456 (458) .+.+-..+.++ ..++.+.+... ..+...+ .....++. ..-....-...|. |+.+..|.||+.+.-+ T Consensus 303 ~pEL~~aggg~-~~~~~~~~~~~~~~t~~~~~~~~~~p~k~~~l~ve~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 303 APELNDANGGS-SAIYYYADAVENNGTDDGRTWLQVVPTKMFTLGVEKKIKGYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred cccccccCCCc-cEEEEEeeccCCCccCCcceEEEecchhhhhccceecCceeEeccccceeeeeeecchhhheecCC Confidence 33332222222 22222222110 0000001 11111111 1122333444555 5588899999998888 No 191 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=94.97 E-value=0.002 Score=35.43 Aligned_cols=303 Identities=11% Similarity=0.039 Sum_probs=121.9 Q ss_pred chhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCc Q lcl|NC_010583. 126 TQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKI 205 (458) Q Consensus 126 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 205 (458) ...-.++...-..|..-+.+....++-..+-..+-.-.+.+.+...+-+|..+...|-..+-...|+.+...+..++.-- T Consensus 1 mtnfiesqnavteffdvlkknsgkseiknawnaklaengvtitdttfqlprklvesintallntnpvfkvfhvtnvgall 80 (318) T protein:vir:94 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 80 (318) T ss_pred CccchhhhhhHHHHHHHHhcccChhhhhhhhhhhhhhCCceeecchhhhHHHHHHhhhhhhccCCcceeeeeehhhhhee Confidence 00000111111122222222222221111111111222333444456788888888877888888888776665544322 Q ss_pred eEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHH--HHhccHHHHHHHHHHHHHHHHHHH- Q lcl|NC_010583. 206 LTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDE--TEEDAIFSLLPLLRKRLIEAHAVS- 282 (458) Q Consensus 206 ~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~e--ll~ds~~~~~~~i~~~la~~~~~~- 282 (458) . .+.-... .|.+...+++.+++...++.--++.|--+.....+... -|++|--.+..+|..++..++..+ T Consensus 81 v--srsfdss-----neaqvhkdgqtkteqaatltidtlepvmvyklqslaervkrlqmsyselynlivaeltqaivnki 153 (318) T protein:vir:94 81 V--SRSFDSS-----NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKI 153 (318) T ss_pred e--ecccccc-----chhhhhcccccccccceeeeecccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHhhh Confidence 2 1211111 23333444455555555555445555443333333222 244454457788888887777665 Q ss_pred HHHHHhccCCCCccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeE-echhH-HHHHHhhhcc Q lcl|NC_010583. 283 IEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLV-LIVSM-DAYYDLLEDE 360 (458) Q Consensus 283 ~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~l~~~~d~ 360 (458) .|.+++-|+|++..+.|-..+......-.+.-...+-.....+.+.....-+.+.. +-.++ ..... .+.|+.++.+ T Consensus 154 vdlalvegdgtngfksidkeadvkkikkittkaksagktpfadaieeavdfvrpta--grrylivktedrkalldelrqa 231 (318) T protein:vir:94 154 VDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA--GRRYLIVKTEDRKALLDELRQA 231 (318) T ss_pred hheeeeecCCcchhhhhchhhhHHHHHHhhhhhhhcCCCchhHHHHHHHhhhccCC--CceEEEEeccchHHHHHHHHhh Confidence 46788999999877777665533322111110000001111222222222222221 11222 23333 3455556544 Q ss_pred ccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceE---EEEecceeEEe------eccc-ccCC Q lcl|NC_010583. 361 EWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNF---VMPRQRAVTVE------RERQ-AGKQ 430 (458) Q Consensus 361 ~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~---~i~~~~~~~i~------~~~~-~~~~ 430 (458) ...-.. ++-.-...+.+ ..|.+.++..-++.. .+...+...|+ .|.| ...| T Consensus 232 tananv--------------riknddteias-----evgvdeiivytgskavkptvlvdqkyhidmqdltkvdafewktn 292 (318) T protein:vir:94 232 TANANV--------------RIKNDDTEIAS-----EVGVDEIIVYTGSKAVKPTVLVDQKYHIDMQDLTKVDAFEWKTN 292 (318) T ss_pred hcccce--------------EEeccchhhhh-----hcCcceeEEeeccccccceeEeccceecchhhhhhhhceeeccC Confidence 321111 11110000000 011111222222210 11111111111 1111 2233 Q ss_pred ceEEEEEEeeccEEecccceEEEEee Q lcl|NC_010583. 431 RDAYYVTQRVNLQRYFENGVVSGAYA 456 (458) Q Consensus 431 ~~~~~~~~r~d~~~~~~~afv~l~~a 456 (458) .-.+.++..-.+.+---+|-++++++ T Consensus 293 snmilvetltsghvetynagavitvs 318 (318) T protein:vir:94 293 SNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred CceEEEEecccCcceeecCceeEEeC Confidence 33334444444555555565666666 No 192 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=93.69 E-value=0.0067 Score=32.51 Aligned_cols=286 Identities=10% Similarity=-0.022 Sum_probs=122.8 Q ss_pred hhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhh--hc Q lcl|NC_010583. 119 VAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGA--LF 196 (458) Q Consensus 119 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~--~~ 196 (458) ..+..++..-... ....+|. ..+++.+.+..-+.++..+-+ +.....+.. .+ T Consensus 1 ~~~~~~~~~~~~~--~~~~~~~-----------------------~~~~~~nt~~l~~k~~~~LD~-~~~~~~~s~~~~~ 54 (319) T protein:vir:94 1 MNKTIKNATGMLK--LNLQHFA-----------------------NKSVEPGQTLLKNKHVGILER-VTAVNAYSTPALI 54 (319) T ss_pred CCcccccccceeE--eehhhhh-----------------------ccCCCcchHHHHHHHHHHHHH-HHHHhhhhhhccc Confidence 0000000000000 0000000 111222233444445443333 222222221 12 Q ss_pred c--eeeeccCceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHH--HHHHHH Q lcl|NC_010583. 197 D--ELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFS--LLPLLR 272 (458) Q Consensus 197 ~--~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~--~~~~i~ 272 (458) + .....+..++||......-..+..-+. -.....+.++...+++..+.-.+.- ..-=..++... +...+. T Consensus 55 N~~~e~~gg~tVkIp~i~~~gl~DY~R~~g-----~~~g~vt~~~~t~tidqdR~~~F~V-D~~D~~Etn~~l~a~~i~~ 128 (319) T protein:vir:94 55 SNDAIFMEGRSFTVMKGDTTELKDYKRNAT-----NEFDHPKIEETTYFLDQEKYWGRFV-DALDRKDTEGNIDINYVVA 128 (319) T ss_pred CcceEeccCcEEEEeeecccccccccCCCC-----cccCCcccceeEEEeeccccccccc-chhhHhhhhchhhHHHHHH Confidence 2 344567788888877643332221111 1112234455556666665544331 11112223222 234455 Q ss_pred HHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhccccee-EechhHH Q lcl|NC_010583. 273 KRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKL-VLIVSMD 351 (458) Q Consensus 273 ~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 351 (458) +.+...++-.+|...+..- ...+... .+. ..+....+..+.++...+.....+...| ++.|..+ T Consensus 129 ~~~~~~v~PEiDay~~skl--------a~~a~~~----~~~---~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~ 193 (319) T protein:vir:94 129 RQGAEVVAPYLDNLRFATL--------ARNKAKH----LTV---GTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFY 193 (319) T ss_pred HHHHHHhhhhhhHHHHHHH--------Hhhcccc----ccc---ccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHH Confidence 5566666667776544321 0000000 000 0112234556666666666654443444 5577766 Q ss_pred HHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEec-ceeEEeecccccCC Q lcl|NC_010583. 352 AYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQ-RAVTVERERQAGKQ 430 (458) Q Consensus 352 ~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-~~~~i~~~~~~~~~ 430 (458) ..|..-....... .........+..++|.|.||+.++... ..+...+++..+....... ..+++.... .... T Consensus 194 ~~L~~~~~f~~~~---~~~~~~~~~g~Vg~idG~~Vi~vps~~---~k~in~i~~h~~A~~~~~k~~~~~~~~p~-~~~~ 266 (319) T protein:vir:94 194 KGIKKFVIALPQG---DTRQQVLGKGVQGELDGFVIVKVPTKL---LQGLQAIAVVGEVLASPIQADLAKTNSNI-PGMF 266 (319) T ss_pred HHHHhhhhhhccc---cccccceeeeeceeecCeEEEEecccc---cccceEEEEcCCeeeeeeeeeeeeccCCC-cccc Confidence 6664432211111 111123346677889999999764422 1222334444443332222 234443311 1223 Q ss_pred ceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 431 RDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 431 ~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ...++....+|..|.+|+...+...+.+ T Consensus 267 a~~v~gr~y~d~~V~~~k~~~Iy~~~~~ 294 (319) T protein:vir:94 267 GTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) T ss_pred ceeeeeeeeeeeEEeccccceEEEeecC Confidence 3677888889999999987666554444 No 193 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=93.69 E-value=0.0067 Score=32.51 Aligned_cols=286 Identities=10% Similarity=-0.022 Sum_probs=122.8 Q ss_pred hhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhh--hc Q lcl|NC_010583. 119 VAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGA--LF 196 (458) Q Consensus 119 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~--~~ 196 (458) ..+..++..-... ....+|. ..+++.+.+..-+.++..+-+ +.....+.. .+ T Consensus 1 ~~~~~~~~~~~~~--~~~~~~~-----------------------~~~~~~nt~~l~~k~~~~LD~-~~~~~~~s~~~~~ 54 (319) T protein:vir:97 1 MNKTIKNATGMLK--LNLQHFA-----------------------NKSVEPGQTLLKNKHVGILER-VTAVNAYSTPALI 54 (319) T ss_pred CCcccccccceeE--eehhhhh-----------------------ccCCCcchHHHHHHHHHHHHH-HHHHhhhhhhccc Confidence 0000000000000 0000000 111222233444445443333 222222221 12 Q ss_pred c--eeeeccCceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHHH--HHHHHH Q lcl|NC_010583. 197 D--ELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFS--LLPLLR 272 (458) Q Consensus 197 ~--~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~--~~~~i~ 272 (458) + .....+..++||......-..+..-+. -.....+.++...+++..+.-.+.- ..-=..++... +...+. T Consensus 55 N~~~e~~gg~tVkIp~i~~~gl~DY~R~~g-----~~~g~vt~~~~t~tidqdR~~~F~V-D~~D~~Etn~~l~a~~i~~ 128 (319) T protein:vir:97 55 SNDAIFMEGRSFTVMKGDTTELKDYKRNAT-----NEFDHPKIEETTYFLDQEKYWGRFV-DALDRKDTEGNIDINYVVA 128 (319) T ss_pred CcceEeccCcEEEEeeecccccccccCCCC-----cccCCcccceeEEEeeccccccccc-chhhHhhhhchhhHHHHHH Confidence 2 344567788888877643332221111 1112234455556666665544331 11112223222 234455 Q ss_pred HHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhccccee-EechhHH Q lcl|NC_010583. 273 KRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKL-VLIVSMD 351 (458) Q Consensus 273 ~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 351 (458) +.+...++-.+|...+..- ...+... .+. ..+....+..+.++...+.....+...| ++.|..+ T Consensus 129 ~~~~~~v~PEiDay~~skl--------a~~a~~~----~~~---~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~ 193 (319) T protein:vir:97 129 RQGAEVVAPYLDNLRFATL--------ARNKAKH----LTV---GTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFY 193 (319) T ss_pred HHHHHHhhhhhhHHHHHHH--------Hhhcccc----ccc---ccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHH Confidence 5566666667776544321 0000000 000 0112234556666666666654443444 5577766 Q ss_pred HHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEec-ceeEEeecccccCC Q lcl|NC_010583. 352 AYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQ-RAVTVERERQAGKQ 430 (458) Q Consensus 352 ~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-~~~~i~~~~~~~~~ 430 (458) ..|..-....... .........+..++|.|.||+.++... ..+...+++..+....... ..+++.... .... T Consensus 194 ~~L~~~~~f~~~~---~~~~~~~~~g~Vg~idG~~Vi~vps~~---~k~in~i~~h~~A~~~~~k~~~~~~~~p~-~~~~ 266 (319) T protein:vir:97 194 KGIKKFVIALPQG---DTRQQVLGKGVQGELDGFVIVKVPTKL---LQGLQAIAVVGEVLASPIQADLAKTNSNI-PGMF 266 (319) T ss_pred HHHHhhhhhhccc---cccccceeeeeceeecCeEEEEecccc---cccceEEEEcCCeeeeeeeeeeeeccCCC-cccc Confidence 6664432211111 111123346677889999999764422 1222334444443332222 234443311 1223 Q ss_pred ceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 431 RDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 431 ~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ...++....+|..|.+|+...+...+.+ T Consensus 267 a~~v~gr~y~d~~V~~~k~~~Iy~~~~~ 294 (319) T protein:vir:97 267 GTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) T ss_pred ceeeeeeeeeeeEEeccccceEEEeecC Confidence 3677888889999999987666554444 No 194 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=91.13 E-value=0.017 Score=30.24 Aligned_cols=294 Identities=13% Similarity=0.048 Sum_probs=138.9 Q ss_pred hhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce Q lcl|NC_010583. 127 QDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL 206 (458) Q Consensus 127 ~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 206 (458) +.+......+..|..++..- . ......+.+. .+.|-+.+...+.+.+.+.+-+++..+++++.--.. T Consensus 1 m~~~m~~~tr~~~~~y~~~~---------A---~~ngv~~~~~-~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~G 67 (341) T protein:vir:27 1 MSQILTQSAREYMDNFAQQL---------A---KSYGVSNVAE-LFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEG 67 (341) T ss_pred CcccccHHHHHHHHHHHHHH---------H---HHcCcccccc-eEeecHHHHHHHHHHHHhhHHhhhcCccccccceee Confidence 11112222233333332210 0 0111112222 333444666778999999999999999999875443 Q ss_pred -EEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccH-----HHHHHHHHHHHHHHHH Q lcl|NC_010583. 207 -TMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAI-----FSLLPLLRKRLIEAHA 280 (458) Q Consensus 207 -~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~-----~~~~~~i~~~la~~~~ 280 (458) .+....+++-++-+.-+ .. ..++..+.-.+...+.---+.|+.+.|+... ++|...+++.+.+.++ T Consensus 68 e~v~lg~~g~iagrtdt~-------R~-~r~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~A 139 (341) T protein:vir:27 68 QVVDVGVSGLYTGRKAGG-------RF-TKQVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFA 139 (341) T ss_pred eEeecccccceeeccCCC-------ce-ecccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHh Confidence 33344444444433211 11 1112344444455544445677888887543 7799999999999988 Q ss_pred HHHHHHHhccCC----C---Cc------cccccccccccc-------cceeeccccchhhHHHHHHHHHHHhh-hhhhhc Q lcl|NC_010583. 281 VSIEEAFMSGNG----T---GQ------PKGLLKLAADDG-------AKVVTEAKADGSVLVTAKTISKLRRK-LGRHGL 339 (458) Q Consensus 281 ~~~d~~~l~G~g----~---~~------p~Gi~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 339 (458) .-.-.--++|+. | .. .+|++...-... ..+..+.++++...+ ..+.++... +++.++ T Consensus 140 LD~i~IGfnGts~A~~Td~~anPllqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~~gdy~nLD--AlV~D~~~~lI~~~~~ 217 (341) T protein:vir:27 140 LDIMRIGWNGVSAEADTDPSANPLGQDVNEGWIAFVKNRKASQVVDVDVYFDETNGDYRTLD--AMASDIINNQIHPMFR 217 (341) T ss_pred hhhhhhcccceeeccCCChhhcccccccchhHHHHHHhhcccceeccceeeccCCCccccHH--HHHHHHHhcccChHHh Confidence 877777778864 1 11 356665432111 111112222221111 112233332 355555 Q ss_pred ccc--eeEechhHHH--HHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEe Q lcl|NC_010583. 340 KLS--KLVLIVSMDA--YYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPR 415 (458) Q Consensus 340 ~~~--~~~~~~~~~~--~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 415 (458) ... +.++-..... ++..+ +....|....- .. ....++-|+|.+..+++|.. -+++--++.+.|.. T Consensus 218 ~d~dLVvivG~dLla~k~~~l~-n~~~~ptE~~A--a~---~i~k~iGGlpa~~~PffP~~-----~~lVT~L~NLsIY~ 286 (341) T protein:vir:27 218 NDPRLTVFVGSGLIGAAQAKLY-DKADKPSEQIA--AQ---KLDKTIAGRPAYVPPFLPDN-----AMVVTIPENLQVLT 286 (341) T ss_pred cCCCEEEEEchhhhhhhhhhhh-ccCCCCHHHHH--HH---HHHHhhCCCeEEEccccCCC-----ceEEeeccceEEEE Confidence 433 4444443332 22222 22222222111 11 11247889999999999963 23333344433332 Q ss_pred cce-e--EEeec----ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 416 QRA-V--TVERE----RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 416 ~~~-~--~i~~~----~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..+ . .+... .+-..+. +|+++. + .....-.|..+|+++| T Consensus 287 Q~gs~RR~~~d~p~r~rie~yes-~YvVEd-y--g~~~~~~~~~vkl~~~ 332 (341) T protein:vir:27 287 QHGTAQRKAKHESDRKRSKTHTG-AWKVTQ-W--VCWKRSPLTTQKKSTS 332 (341) T ss_pred ecCcEEEEEEeccccccccchhh-hheeeh-h--hhhhhccccccccCcc Confidence 222 1 12111 1111112 344433 2 3344455777888888 No 195 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=91.11 E-value=0.017 Score=30.23 Aligned_cols=420 Identities=11% Similarity=0.105 Sum_probs=140.6 Q ss_pred CcchHHHHHH------------------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_010583. 1 MTIDINKLKE------------------------------------ELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEK- 43 (458) Q Consensus 1 ~~~~~~~~~~------------------------------------~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~- 43 (458) --...+++++ -..++.+.++++.+....+... .+....+... T Consensus 136 tG~~~e~i~~~m~~etwlta~EA~e~Gf~D~i~~~~~~~a~~~~~~~~~~~~~p~~~~~~~~~~~~~~-~~v~d~EPa~~ 214 (652) T protein:vir:79 136 TGKTTDEIAAMLADETWMSGAECLAQGFADQVTPAVKAMACIQSKRTEEFKKMPDSIRNMITPPRNSA-PRVQDDEPAAS 214 (652) T ss_pred hCCCHHHHHHHHhhhcCCCHHHHHhcCCcccccchhhhhhhhhhhhhhhhhhhHHHHHHHhccccccc-ccccccccccc Confidence 0000000000 0112222233322211100000 0000000000 Q ss_pred -----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 44 -----------ELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS-AELFAQTVEKQQETIVGLQDEIKSLLAAREG 111 (458) Q Consensus 44 -----------~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~-~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 111 (458) .....++ ....+.| ++...|+.+-...... .+...+.+....-.++..+.. ......+ T Consensus 215 ~~pvqAaAP~~De~airA---q~~aeer----aRi~~I~~l~a~Fggr~~~l~~~~l~d~~~s~e~ar~~---il~~l~~ 284 (652) T protein:vir:79 215 RTPVQAAAPVVDENSIRA---QVLAEQK----ARVNGINDLFAMFGGRYQTLQAQCLADPECSLEQAREK---LLNEMGR 284 (652) T ss_pred cccccccCCcCchhHHHH---HHHHHHH----HHHHHHHHHHHhhccccchHHHHHhhccCCCHHHHHHH---HHHHHHh Confidence 0000000 0011111 1122222221111100 000000000000000000000 0000000 Q ss_pred Hhh-----------------hhhhh-----hhhhh-c--chhhhhh----HHHHHHHHHhhhc-cchhHHHHHHHHhhhh Q lcl|NC_010583. 112 RSF-----------------VGDSV-----AKALY-G--TQDAFED----EVEKLVLLSYMME-KDVFETEHGKAHIKAV 161 (458) Q Consensus 112 ~~~-----------------~~~~~-----~~~~~-~--~~~~~~~----~~~~~a~~~~~~~-~~~~~~~~~~~~~~a~ 161 (458) ... ..... .+... . ....+.. +..+..+.....+ ..... ......+. T Consensus 285 ~~~p~~~~~~~~~~~~~g~~~~d~~~~aL~~R~g~~~~~~~~~~~g~~L~elAr~~L~~~G~~~~~~~~---~~~v~~A~ 361 (652) T protein:vir:79 285 ESTPSNKNTPAHIYAGNGNFVGDGIRQALMARAGFEKTERDNVYNGMTLREYARMSLTERGIGVSSYNP---MQMVGAAF 361 (652) T ss_pred hcCCCCCCcceeEeeccchhhHHHHHHHHHhhcCCcccccCccccCccHHHHHHHHHHhhccCCCCCCH---HHHHHHHh Confidence 000 00000 00000 0 0000000 0011111000000 00000 01111111 Q ss_pred hcccccccCccccchhHHHHHHHHH----Hh-ccchhhhcceeeeccCce-EEEEecCCCcccccccccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDL----QK-ELVVGALFDELPMSSKIL-TMLVEPEAGRATWVDASKFGTDETVGDEV 235 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~----~~-~~~l~~~~~~~~~~~~~~-~~p~~~~~~~a~~v~e~~~~~e~~~~~~~ 235 (458) + -++. --|..+.+-+-..+ .. .....++++..+++.-.. ......+.+.---|.|++ .++-. T Consensus 362 ~-hsTs-----DFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~g------Eyk~~ 429 (652) T protein:vir:79 362 T-HSTS-----DFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGA------EYKYV 429 (652) T ss_pred h-cCcc-----hHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccccCCCC------cccee Confidence 1 0111 12333332222221 11 223555565543332211 111122223333344443 33222 Q ss_pred cccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHH---hccCCC-C-ccccccccccccccce Q lcl|NC_010583. 236 KGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAF---MSGNGT-G-QPKGLLKLAADDGAKV 310 (458) Q Consensus 236 ~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~---l~G~g~-~-~p~Gi~~~~~~~~~~~ 310 (458) ...=...++...++|.++.||++++-.-+.++.+-|-..++++.++.++..+ |.++.. . .-+.++.-+.-.+. T Consensus 430 t~~e~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl-- 507 (652) T protein:vir:79 430 TTGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANV-- 507 (652) T ss_pred eecCccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceeecccccccc-- Confidence 2222345678889999999999998877889999999999999999998744 444432 1 12234411111111 Q ss_pred eeccccchhhHHHHHHHHHHHhhhh-hhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecc-cce Q lcl|NC_010583. 311 VTEAKADGSVLVTAKTISKLRRKLG-RHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGL-PVV 388 (458) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~-pv~ 388 (458) .+....+.................. .-...+.-|+..+........+-.+..-+. .....+....+.|. .|+ T Consensus 508 ~~~aa~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~v~~------a~~~~~~~Np~~~~~~~i 581 (652) T protein:vir:79 508 LESAAMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSVKG------ADINAGIINPVKDFATVI 581 (652) T ss_pred cccccCCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCCCcc------cccccccccccccccccc Confidence 1111111111111111111111111 112344456666655444333322111100 00111112224443 444 Q ss_pred ecccccccccCCceEEEEEe--ceE---EEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEe Q lcl|NC_010583. 389 VSEYFPAKAASAEFAVIVYK--DNF---VMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAY 455 (458) Q Consensus 389 ~~~~~~~~~~~~~~~~~~~~--~~~---~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~ 455 (458) +.+.+... +...+.+.... +.+ ++....+-.|+....|..+.+.|++...||++++|--++++.+- T Consensus 582 ~eprL~~~-s~~~wylaa~~~~dtiev~yL~G~~~P~ie~~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 582 AEPRLDDN-SQTTFYLAASKGSDTIEVAYLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred cccccCCC-CcccEEEecCCCCCeEEEEEecCCCCCeeeecCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 45444321 11222222111 111 12333445666667789999999999999999999999887776 No 196 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=87.72 E-value=0.037 Score=28.44 Aligned_cols=294 Identities=16% Similarity=0.098 Sum_probs=133.3 Q ss_pred hhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce-EEE Q lcl|NC_010583. 131 EDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-TML 209 (458) Q Consensus 131 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~p 209 (458) .....+..|.+++..- .. .......+. .+.|-+.+...+.+.+.+.+-+++..+++++.--.. ++. T Consensus 1 M~~~tr~~~~~y~~~~---------A~---~ngv~~~~~-~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~ 67 (337) T protein:vir:79 1 MRKETRQAYEKYAAQI---------AK---LNDTGDVSK-KFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLG 67 (337) T ss_pred CChHHHHHHHHHHHHH---------HH---hcChhhhcc-eeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEe Confidence 1112222333332210 00 011111111 233334566778889999999999999999875433 333 Q ss_pred EecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 210 VEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEAF 287 (458) Q Consensus 210 ~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~~~~~i~~~la~~~~~~~d~~~ 287 (458) ...+++-++-+.-+. ....+..-..++.-.+..++.---+.|+.+.|+.. ..+|...+++.+.+.++.-.=.-- T Consensus 68 lg~~g~iagrt~t~~----~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IG 143 (337) T protein:vir:79 68 LSVSGPIASRTDTTK----AARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIG 143 (337) T ss_pred eccCcceeeeecCCC----CccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhc Confidence 444444443322211 00111111223444444444444567888888874 368999999999999888777777 Q ss_pred hccCC----C---Cc------cccccccccc------------cccceeeccccchhhHHHHHHHHHHHhh-hhhhhccc Q lcl|NC_010583. 288 MSGNG----T---GQ------PKGLLKLAAD------------DGAKVVTEAKADGSVLVTAKTISKLRRK-LGRHGLKL 341 (458) Q Consensus 288 l~G~g----~---~~------p~Gi~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 341 (458) ++|+. | .. .+|++...-. .+..+..+.++++...+ ..+.++... +++.++.. T Consensus 144 fnG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLD--alV~D~~~~lI~~~~~~d 221 (337) T protein:vir:79 144 WNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLD--ALVMDIVSSMIDPWFQED 221 (337) T ss_pred ccceeeccCCChhhCcCccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHH--HHHHHHHhccCChHHhcC Confidence 78864 1 11 3456543211 11111122222222211 112333433 45555543 Q ss_pred --ceeEechhHHH--HHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecc Q lcl|NC_010583. 342 --SKLVLIVSMDA--YYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQR 417 (458) Q Consensus 342 --~~~~~~~~~~~--~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 417 (458) .+.+|-..... ++..+. ....|....-... .....++-|+|.+..+++|.. -+++--++++.|.-.. T Consensus 222 ~~LVvivG~dLladk~~~l~n-~~~~ptE~~Aa~~---i~s~k~iGGlpa~~~PffP~~-----~~lVT~L~NLsIY~Q~ 292 (337) T protein:vir:79 222 TGLVAICGRELLHDKYFPIVN-ATQAPTERLAADL---IVSQKRIGNLPAVRVPFFPKR-----ALMVTKLSNLSIYYQE 292 (337) T ss_pred CCEEEEEchhhhhHHhhHHhc-cCCCcHHHHHHHH---HHHhhhhCCceeEEccccCCC-----ceEEeechhcEEEEec Confidence 34444444332 222222 1222222111100 111246889999999999962 2333344444443222 Q ss_pred e-e--EEeec-------ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 418 A-V--TVERE-------RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 418 ~-~--~i~~~-------~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) + . .+... .|-..| ..|.++..-.+..++ -++++.| T Consensus 293 gs~RR~~~d~p~r~rie~y~s~N-e~YvVEd~~~~a~ie-----nI~~~~a 337 (337) T protein:vir:79 293 GARRRTLKEVPERDRIENYESSN-DAYVVEDFGCGCVAE-----NIELAAA 337 (337) T ss_pred CcEEEEEEEccccccccchhhcc-ceeeeeccccEEEEe-----ceeecCC Confidence 2 1 12111 111122 233333332222222 3455555 No 197 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=85.80 E-value=0.05 Score=27.71 Aligned_cols=282 Identities=13% Similarity=0.101 Sum_probs=133.8 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhccchhhhcc-eeeecc-CceEEEEecCCCccccccccccccccccccccccc Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFD-ELPMSS-KILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQ 238 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~-~~~~~~-~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~ 238 (458) +. -++ .--.+++.++++..|...+.+...=-..++ +.-.++ ..+.||.. +.++. ...+|.+.-.-.... T Consensus 1 ~~-~TS-NT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~ti-Gs~~~------~~~~E~~~~~~~~i~ 71 (313) T protein:vir:95 1 MQ-LTS-NTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTI-GSVTL------QEAEEDTPLIYNPIE 71 (313) T ss_pred Cc-ccc-cchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEeccc-Cceee------eccccCCCeeecccc Confidence 11 111 112345566677766655555433223333 333333 34444433 22221 122333333334445 Q ss_pred ceeeeeehhheeeee-hhhHHHHhccHHHHHH---HHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeecc Q lcl|NC_010583. 239 LTEISFKTYKLAAKS-FITDETEEDAIFSLLP---LLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEA 314 (458) Q Consensus 239 f~~v~~~~~k~~~~~-~is~ell~ds~~~~~~---~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~ 314 (458) -++|++....+++-. .||+.|-+|+. ++.. .+.-+-+++|....+..++. +|+.+..|--.-....+...... T Consensus 72 TGEIt~~i~~Y~G~A~~vt~~LR~D~~-~I~~~~A~~~AE~~RAI~E~~~TD~L~-~G~~~FA~~~~P~~vNG~PH~~V- 148 (313) T protein:vir:95 72 TGEITFQITEYKGDAWYVTDDLREDGT-DIDRLMAERAAESTRAIQETFETDFLK-TGAEYFAANPGPHNVNGFPHVIV- 148 (313) T ss_pred cceEEEEEEeecCChhhhhhhhhhcch-hHHHHhhhcchhhHHHHHHHHhhHHHh-hchhhhccCCCCcccccccceEE- Confidence 577888888887765 89999999984 4444 44444556666666666663 23211111000011111111111 Q ss_pred ccchhhHHHHHHHHHHHhhhhhhhc--ccceeEechhHHHHHHhhhc------cccccccccccccccccccCCeeeccc Q lcl|NC_010583. 315 KADGSVLVTAKTISKLRRKLGRHGL--KLSKLVLIVSMDAYYDLLED------EEWQDVAQVGNDAVKLQGQVGRIYGLP 386 (458) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~~d------~~~~~~~~~~~~~~~~~~~~~~l~G~p 386 (458) ...+.......+++.+.+....... .+..++..|.....|..+.. ..++.+...+...... -.-.++|.. T Consensus 149 ~~~T~~~~~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~--Fi~~~YG~D 226 (313) T protein:vir:95 149 SAETNGVFALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQR--FIMNLYGWD 226 (313) T ss_pred eccCCceehhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhH--HHHHHhhhh Confidence 1111222334456666555554433 34467788888888887653 3355555444332111 112377888 Q ss_pred ceeccccccc--------ccCCc-eEEEEEec---eEEEEecceeEEe---ecccccCCceEEEEEEeeccEEecccceE Q lcl|NC_010583. 387 VVVSEYFPAK--------AASAE-FAVIVYKD---NFVMPRQRAVTVE---RERQAGKQRDAYYVTQRVNLQRYFENGVV 451 (458) Q Consensus 387 v~~~~~~~~~--------~~~~~-~~~~~~~~---~~~i~~~~~~~i~---~~~~~~~~~~~~~~~~r~d~~~~~~~afv 451 (458) +.+|+-+... ++|.. .++++-.+ .-.++-|..+... .++|-..+.+.. ..|+|..+++-+..+ T Consensus 227 i~~SN~L~~AN~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~~--~~R~G~Gi~R~~~L~ 304 (313) T protein:vir:95 227 ILTSNRLHVANYNDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPKSEGERNKDRARDEHVV--RCRYGFGIQRLDTLG 304 (313) T ss_pred hhhhhhhhhccccccccccCceeeeeeeeeecccccceeeeecccccccccccccccccccee--eeeecccceeeccee Confidence 8887755421 11111 22222211 1234445544322 344444445544 468887777666654 Q ss_pred -EEEeecC Q lcl|NC_010583. 452 -SGAYAAA 458 (458) Q Consensus 452 -~l~~aaa 458 (458) +++-|.| T Consensus 305 ~~~~~A~~ 312 (313) T protein:vir:95 305 LLATSATA 312 (313) T ss_pred EEEecccc Confidence 5666666 No 198 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=85.78 E-value=0.05 Score=27.70 Aligned_cols=269 Identities=10% Similarity=0.048 Sum_probs=115.5 Q ss_pred ccCccccchhHHHHHHHHHHhccchhhhcc------eeeeccCceEEEEecCCCccccccccccccccccccccccccee Q lcl|NC_010583. 168 SMSSEAYETIFSTRIIRDLQKELVVGALFD------ELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTE 241 (458) Q Consensus 168 ~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~------~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~ 241 (458) .+ .+-.++.++..+.+.+...+....+.. +...++...+||......-..+..-+ .+......+.++.. T Consensus 1 MA-~~n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~----~g~~~g~~~~~~~t 75 (299) T protein:vir:79 1 MA-ALNYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDT----IAVAQRNYDNAWEP 75 (299) T ss_pred Cc-cchhHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCC----CcccccccCcceeE Confidence 11 112346777777777777766555432 22234557889987654333222111 01111123456667 Q ss_pred eeeehhheeeee--hhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccc Q lcl|NC_010583. 242 ISFKTYKLAAKS--FITDETEEDAI--FSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKAD 317 (458) Q Consensus 242 v~~~~~k~~~~~--~is~ell~ds~--~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) .+++..+.-.+. ++. ...+. ..+...+.+...+.++-.+|...+.. |...+...+.. ..... T Consensus 76 ~~ldqdr~~~f~vD~~D---vdet~~~~~~a~v~~~~~~~~v~pEiDay~~sk--------l~~~a~~~g~~-~~~~~-- 141 (299) T protein:vir:79 76 KVLTNQRKWSTLVHPAD---INQTNYVASIGNITKVYNEEQKFPEMDAYCISK--------IYADWTALGNT-ADTTV-- 141 (299) T ss_pred EEeeccccceeccchhh---HHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHH--------HHHhhhhcCCc-ccccc-- Confidence 777777765443 111 11111 12334444445555566666655532 11111100000 01111 Q ss_pred hhhHHHHHHHHHHHhhhhhhhcccc-e-eEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccc--c Q lcl|NC_010583. 318 GSVLVTAKTISKLRRKLGRHGLKLS-K-LVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEY--F 393 (458) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~--~ 393 (458) .+....+..+.++...+.....+.. . .++.|..+..|.....-. +. ...........+..++|.|.||+.++. + T Consensus 142 ~T~~n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~-k~-~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~ 219 (299) T protein:vir:79 142 LTTTNVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQ-RT-VNIKDAGTSLNRQTTDIDTVKIIKVPSNLM 219 (299) T ss_pred cCHHHHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhh-cc-cccccccceeeeeeeeecceEEEEechhhc Confidence 1122345667777777777655443 3 355777666665432111 11 111222234556678899999986433 3 Q ss_pred cc------c----ccCC-ceEEEEEeceE-EEEecceeEEeecccc-cCCceEEEEEEeeccEEeccc--c-eEEEEeec Q lcl|NC_010583. 394 PA------K----AASA-EFAVIVYKDNF-VMPRQRAVTVERERQA-GKQRDAYYVTQRVNLQRYFEN--G-VVSGAYAA 457 (458) Q Consensus 394 ~~------~----~~~~-~~~~~~~~~~~-~i~~~~~~~i~~~~~~-~~~~~~~~~~~r~d~~~~~~~--a-fv~l~~aa 457 (458) ++ + ..+. ...++...+.. .+..-..+++. +|.. .++--.+.-..+.|.-|.+.+ + ++..+-|. T Consensus 220 ~t~~~~~~G~~~~~~ak~in~ii~~~~a~~~~~K~~~~~~~-~P~~~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a~ 298 (299) T protein:vir:79 220 KTAYDFTTGWKVGAGAKQIFMSLVHPSAIITPVSYQFSKLD-EPTAVTEGKYFYFEESFEDVFILNKKADAIQFVVEGAG 298 (299) T ss_pred CccceeccCccccCcccccceEEEcCCeeeeeEeeeeEEee-cCCCCCccceeeeeeeeeeeeeeccccCeEEEEeeecC Confidence 32 0 0111 11222222221 12222233332 2221 122112222223455555542 2 23344444 Q ss_pred C Q lcl|NC_010583. 458 A 458 (458) Q Consensus 458 a 458 (458) | T Consensus 299 ~ 299 (299) T protein:vir:79 299 A 299 (299) T ss_pred C Confidence 4 No 199 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=85.04 E-value=0.056 Score=27.45 Aligned_cols=294 Identities=16% Similarity=0.100 Sum_probs=133.4 Q ss_pred hhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce-EEE Q lcl|NC_010583. 131 EDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-TML 209 (458) Q Consensus 131 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~p 209 (458) .....+..|.+++..- .. .....+.+ ..+.|-+.+...+.+.+.+.+-+++..+++++.--.. ++. T Consensus 1 M~~~tr~~~~~y~~~~---------A~---~ngv~~~~-~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~ 67 (337) T protein:vir:10 1 MRKETRQAYEKYAAQI---------AK---LNDTGDVS-KKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLG 67 (337) T ss_pred CChHHHHHHHHHHHHH---------HH---hcChhhhc-ceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEe Confidence 1112222333332210 00 01111112 2333444566778889999999999999999875433 333 Q ss_pred EecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 210 VEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEAF 287 (458) Q Consensus 210 ~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~~~~~i~~~la~~~~~~~d~~~ 287 (458) ...+++-++-..-+. ....+..-..++.-.+..++.---+.|+.+.|+.. ..+|...+++.+.+.++.-.=.-- T Consensus 68 lg~~g~iagrt~t~~----~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IG 143 (337) T protein:vir:10 68 LSVSGPIASRTDTTK----AARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIG 143 (337) T ss_pred eccCcceeeeecCCC----CccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhc Confidence 444444443322211 00111111223444444444444567888888874 368999999999999888777777 Q ss_pred hccCC----C---Cc------cccccccccc------------cccceeeccccchhhHHHHHHHHHHHhh-hhhhhccc Q lcl|NC_010583. 288 MSGNG----T---GQ------PKGLLKLAAD------------DGAKVVTEAKADGSVLVTAKTISKLRRK-LGRHGLKL 341 (458) Q Consensus 288 l~G~g----~---~~------p~Gi~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 341 (458) ++|+. | .. .+|++...-. .+..+..+.++++...+ ....++... +++.++.. T Consensus 144 fnG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLD--alV~D~~~~lI~~~~~~d 221 (337) T protein:vir:10 144 WNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLD--ALVMDIVSSMIDPWFQED 221 (337) T ss_pred ccceeeccCCChhhCcCccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHH--HHHHHHHhccCChHHhcC Confidence 78864 1 11 3456543211 11111122222222211 112333433 45555543 Q ss_pred --ceeEechhHHH--HHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecc Q lcl|NC_010583. 342 --SKLVLIVSMDA--YYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQR 417 (458) Q Consensus 342 --~~~~~~~~~~~--~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 417 (458) .+.+|-..... ++..+. ....|....-... .....++-|+|.+..+++|.. -+++--++++.|.-.. T Consensus 222 ~~LVvivG~dLladk~~~l~n-~~~~ptE~~Aa~~---i~s~k~iGGlpa~~~PffP~~-----~~lVT~L~NLsIY~Q~ 292 (337) T protein:vir:10 222 TGLVVICGRELLHDKYFPIVN-ATQAPTERLAADL---IVSQKRIGNLPAVRVPFFPKR-----ALMVTKLSNLSIYYQE 292 (337) T ss_pred CCEEEEEchhhhhHHhhHHhc-cCCCcHHHHHHHH---HHHhhhhCCceeEEccccCCC-----ceEEeechhcEEEEec Confidence 34444444332 222222 1222222111100 111246889999999999962 2333344444443222 Q ss_pred e-e--EEeec-------ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 418 A-V--TVERE-------RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 418 ~-~--~i~~~-------~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) + . .+... .|-..| ..|.++..-.+..++ -++++.| T Consensus 293 gs~RR~~~d~p~r~rie~y~s~N-e~YvVEd~~~~a~ie-----nI~~~~a 337 (337) T protein:vir:10 293 GARRRTLKEVPERDRIENYESSN-DAYVVEDFGCGCVAE-----NIELAAA 337 (337) T ss_pred CcEEEEEEEccccccccchhhcc-ceeeeeccccEEEEe-----ceeecCC Confidence 2 1 12111 111122 233333332222222 3455555 No 200 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=83.92 E-value=0.065 Score=27.11 Aligned_cols=376 Identities=11% Similarity=-0.040 Sum_probs=92.7 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAA-QKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKK 79 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~ 79 (458) |+++ +++..+... .+..+...++ ++..++.+.+.+...++.+++... . +.+++..+..+...+.... T Consensus 5 ~~le--e~~a~l~~~--~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~-------~-~~~~~~~~~~~~~~~~~~~ 72 (419) T protein:vir:94 5 PTLE--EQRAALLAR--LDDTSLTTEQVQEIVAEARGLADALQAESDRAAAR-------A-ALLRTAPPAPKGPADGGTP 72 (419) T ss_pred HHHH--HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------H-HHHHHHHHHHHHHhhhhcc Confidence 5543 233322211 1111111111 111111111111111111111111 1 1111111111111111000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhh Q lcl|NC_010583. 80 SAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIK 159 (458) Q Consensus 80 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 159 (458) ................... +...... ....+. . .... . + ......... ..... T Consensus 73 ~~~~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~---------~---~~~~---~-~-~~~~~~~~~--------~~~~~ 125 (419) T protein:vir:94 73 LTPAEAGTFRSLAQRFADS-DGLREYR-ARDKRG---------Q---FQVE---M-R-DIDPNRLLS--------RDAPA 125 (419) T ss_pred ccccccccccchhhhhhhH-HHHHHHH-Hhhhhh---------h---hhHH---H-H-HHHHHHhhc--------ccccc Confidence 0000000000000000000 0000000 000000 0 0000 0 0 000000000 00001 Q ss_pred hhhcccccccCccccchhHHHHHHH--HHHhccchhhh-cc--eeeeccCceEEEEecCCCccccccccccccccccccc Q lcl|NC_010583. 160 AVNGSSSVSMSSEAYETIFSTRIIR--DLQKELVVGAL-FD--ELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDE 234 (458) Q Consensus 160 a~~~~~~~~~g~~~ip~~~~~~ii~--~~~~~~~l~~~-~~--~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~ 234 (458) ..............++..+...+.. .++........ .. .+|..++ ..+|+....+.+.|++|++..++.. ..- T Consensus 126 ~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~v~Eg~~~~~~~-~~~ 203 (419) T protein:vir:94 126 GTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTS-GTAGAGSTWNKAAVVPEGTAKPQST-LSF 203 (419) T ss_pred ccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceeeeeecc-ccccccccCcccceecCCccccccc-cce Confidence 1111222222333444443322111 12222222211 11 1333332 3567788888999999999888754 455 Q ss_pred ccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhccC--CCCccccccccccccccceee Q lcl|NC_010583. 235 VKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGN--GTGQPKGLLKLAADDGAKVVT 312 (458) Q Consensus 235 ~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~--g~~~p~Gi~~~~~~~~~~~~~ 312 (458) ...+|..-++...-.-..--+. .... -.--|...|.+.++.++..++=..==.|. |--...|+.+..........+ T Consensus 204 ~~i~~~~~k~~~~~~is~ell~-d~~~-l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t 281 (419) T protein:vir:94 204 DTITTTLKTVAHWLPITRQAAD-DNSQ-LMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPAT 281 (419) T ss_pred eeEEeeeeeEEEeehhhHHHHH-hHHH-HHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccccccc Confidence 6667666555543211111111 1222 22347888888888888887642100010 100011111111100000000 Q ss_pred ccccch---------------------hhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccc Q lcl|NC_010583. 313 EAKADG---------------------SVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGND 371 (458) Q Consensus 313 ~~~~~~---------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~ 371 (458) . .... -...++..+..+.... +..++..+....- .-..-.|.|+.....- T Consensus 282 ~-~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~------~~~~~~~~~~~~~--~~~~l~G~pV~~~~~~ 352 (419) T protein:vir:94 282 D-EPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPG------SGVFRVIANVQGE--ATPRIWGLNVVSTVAI 352 (419) T ss_pred c-chhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcC------CCceeecCCcccC--CCccccceeeEEcCCC Confidence 0 0000 0011122222221111 1112221110000 0000013333221110 Q ss_pred cccccccCCe-eecccceecccccccccCCceEEEEEeceEEEE---------ecceeEEeecccccCCceEEEEEEeec Q lcl|NC_010583. 372 AVKLQGQVGR-IYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMP---------RQRAVTVERERQAGKQRDAYYVTQRVN 441 (458) Q Consensus 372 ~~~~~~~~~~-l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~---------~~~~~~i~~~~~~~~~~~~~~~~~r~d 441 (458) +.+. ++|-.- ....+++...+.+. ....+.+....++.-..+.=.++.++. T Consensus 353 ------~~~~~~~gd~~-------------~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~ 413 (419) T protein:vir:94 353 ------AQGTALVGGFR-------------QGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVT 413 (419) T ss_pred ------CCccEEEeecc-------------ceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccccEEEEE Confidence 0111 222100 00011111111110 000011111111110000001111222 Q ss_pred cEEecc Q lcl|NC_010583. 442 LQRYFE 447 (458) Q Consensus 442 ~~~~~~ 447 (458) +..... T Consensus 414 ~~aa~~ 419 (419) T protein:vir:94 414 FAAATT 419 (419) T ss_pred eccCCC Confidence 222222 No 201 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=83.76 E-value=0.066 Score=27.06 Aligned_cols=294 Identities=15% Similarity=0.090 Sum_probs=131.8 Q ss_pred hhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce-EEE Q lcl|NC_010583. 131 EDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-TML 209 (458) Q Consensus 131 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~p 209 (458) .....+..|.+++..- ... ....+. ...+.|-+.+...+...+.+.+-+++..+++++.--.. ++. T Consensus 1 M~~~tr~~~~~y~~~~---------A~~---ngv~~~-~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~ 67 (337) T protein:vir:78 1 MRKETRQAYEKYAAQI---------AKL---NDTGDV-SKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLG 67 (337) T ss_pred CChHHHHHHHHHHHHH---------HHh---cChhhh-cceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEe Confidence 1112222333332210 000 011111 22344555667778889999999999999998875433 333 Q ss_pred EecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 210 VEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEAF 287 (458) Q Consensus 210 ~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~~~~~i~~~la~~~~~~~d~~~ 287 (458) ...+++-++-..-+. ....+.+-..++.-.+...+.---+.|+.+.|+.. ..+|...+++.+.+.++.-.=.-- T Consensus 68 lg~~g~iagrtdt~~----~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IG 143 (337) T protein:vir:78 68 LSVSGPIASRTDTTK----AARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIG 143 (337) T ss_pred cccCcceeeeecCCC----cccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceec Confidence 344444443322111 00111111223333444444444467888888864 367999999999988887766666 Q ss_pred hccCCC----C---c------ccccccccc------------ccccceeeccccchhhHHHHHHHHHHHhh-hhhhhccc Q lcl|NC_010583. 288 MSGNGT----G---Q------PKGLLKLAA------------DDGAKVVTEAKADGSVLVTAKTISKLRRK-LGRHGLKL 341 (458) Q Consensus 288 l~G~g~----~---~------p~Gi~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 341 (458) +||+.. + . .+|++...- ..+..+..+.++++...+ ....++... +++.++.. T Consensus 144 fNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~gdy~NLD--alV~d~~~~lI~~~~~~d 221 (337) T protein:vir:78 144 WNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIGKAGDYENLD--ALVMDIVSSMIDPWFQED 221 (337) T ss_pred ccceeeccCCChhhCcCccccchHHHHHHHhcchhhhhccccccCCceeecCCCCcccHH--HHHHHHHhccCChHHhcC Confidence 777631 1 1 346553221 111111122332322211 122344443 45655543 Q ss_pred --ceeEechhHHH--HHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecc Q lcl|NC_010583. 342 --SKLVLIVSMDA--YYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQR 417 (458) Q Consensus 342 --~~~~~~~~~~~--~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 417 (458) .+.+|-..... ++..+.. ...|........ .....++-|+|.+..+++|.. -+++--++++.|.... T Consensus 222 ~dLVvivG~dLladk~~~l~n~-~~~ptE~~Aa~~---i~s~k~iGGl~a~~~PfFP~~-----~ilVT~L~NLsIY~Q~ 292 (337) T protein:vir:78 222 TGLVVICGRELLHDKYFPIVNA-TQAPTERLAADL---IVSQKRIGNLPAVRVPFFPKR-----ALMVTKLSNLSIYYQE 292 (337) T ss_pred CCEEEEEchhhhHHHHHHHHhc-CCCcHHHHHHHH---HHHhhhhcCcceEEccccCCC-----ceEEeechhcEEEEec Confidence 34445444332 2222221 223322111100 111246789999999999952 2333344444333222 Q ss_pred -eeE--Eeec-------ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 418 -AVT--VERE-------RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 418 -~~~--i~~~-------~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ..+ +... .|-..| ..|.++..-.+..++ -++++.| T Consensus 293 gs~RR~~~d~p~r~rie~y~s~N-e~YvVEd~~~~a~iE-----nI~~~~a 337 (337) T protein:vir:78 293 GARRRTLKEVPERDRIENYESSN-DAYVVEDFGCGCVAE-----NIELAAA 337 (337) T ss_pred CcEEEEEEeccccccccchhhcc-ceeeeeccccEEEEe-----ceeecCC Confidence 121 2111 111122 233333332222222 3455555 No 202 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=83.05 E-value=0.072 Score=26.86 Aligned_cols=300 Identities=13% Similarity=0.137 Sum_probs=135.2 Q ss_pred hhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce-EEE Q lcl|NC_010583. 131 EDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-TML 209 (458) Q Consensus 131 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~p 209 (458) .....+..|..++..- .....+.. ... ...+.|-+.+...+.+.+.+.+-+++..+++++.--.. ++. T Consensus 1 M~~~tr~~~~~y~~~~---------A~~ngv~~-~~~-~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~ 69 (355) T protein:vir:18 1 MRQETRFKFNAYLTQL---------AKLNGISV-DDV-SKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIG 69 (355) T ss_pred CChHHHHHHHHHHHHH---------HHHhCCCh-hHc-cceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEe Confidence 1112223333332210 00001100 011 12344444566778889999999999999999875433 334 Q ss_pred EecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 210 VEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEAF 287 (458) Q Consensus 210 ~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~~~~~i~~~la~~~~~~~d~~~ 287 (458) ...+++-++-+.-.. ..+. .+.....++.-.+..++.---+.|+.+.|+.. ..+|...+++.+.+.++.-.=.-- T Consensus 70 lgv~g~iagrtdT~~-~~~R--~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IG 146 (355) T protein:vir:18 70 VGVTGTIASTTDTSG-DKER--QTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAG 146 (355) T ss_pred eccCcceeeccccCC-CCCc--ccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhc Confidence 444455444322110 0010 11112223444444555555567888888874 367999999999999888777777 Q ss_pred hccCC----CC---c------ccccccccccc------------cc-----ceeeccccchhhHHHHHHHHHHHhh-hhh Q lcl|NC_010583. 288 MSGNG----TG---Q------PKGLLKLAADD------------GA-----KVVTEAKADGSVLVTAKTISKLRRK-LGR 336 (458) Q Consensus 288 l~G~g----~~---~------p~Gi~~~~~~~------------~~-----~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 336 (458) +||+. |+ . .+|++...-.. +. .+..+.++++...+ ..+.++... +++ T Consensus 147 fNG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLD--AlV~d~~~~lI~~ 224 (355) T protein:vir:18 147 FNGTTRADTSDRVKNPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLD--ALVMDGTNTLIDE 224 (355) T ss_pred ccceeeeccCChhhCcCccccchhHHHHHHhcchhhhhccccccccccccceeeecCCCCcccHH--HHHHHHHhccCCh Confidence 78864 11 1 35666332111 11 11112222222111 122234433 455 Q ss_pred hhccc--ceeEechhHHH--HHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEE Q lcl|NC_010583. 337 HGLKL--SKLVLIVSMDA--YYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFV 412 (458) Q Consensus 337 ~~~~~--~~~~~~~~~~~--~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~ 412 (458) .++.. .+.+|...... ++..+. ..+.|........ .....++-|+|.+..+++|.. -+++--++++. T Consensus 225 ~~~~d~dLVvivG~dLla~k~~~l~n-~~~~ptE~~Aa~~---i~s~k~iGGlpa~~~PffP~~-----~~lVT~L~NLs 295 (355) T protein:vir:18 225 IYQDDPKLVAIVGRKLLADKYFPLVN-KQQENTESLAADI---IISQKRIGNLPAVRVPYFPAN-----AVFVTTLENLS 295 (355) T ss_pred HHhcCCCEEEEEchhhhHHHHhHHhh-ccCChHHHHHHHH---HHHHHhhCCceeEEccccCCC-----ceEEeeccccE Confidence 55543 34455444332 222222 2233332211110 011246889999999999962 23333344443 Q ss_pred EEecce-e--EEeec-------ccccCCceEEEEEEeeccEEecccceEEEEeec--C Q lcl|NC_010583. 413 MPRQRA-V--TVERE-------RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAA--A 458 (458) Q Consensus 413 i~~~~~-~--~i~~~-------~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aa--a 458 (458) |.-..+ . .+... .|-..| ..|.++..--+..++ .+.....++ + T Consensus 296 IY~Q~gs~RR~~~d~p~r~rie~y~s~N-e~YvVEd~~~~a~ie--ni~~~~~~~~~~ 350 (355) T protein:vir:18 296 IYFMDESHRRSIDENPKKDRVENYESMN-IDYVVEAYAAGCLLE--NITLGDFTAPAA 350 (355) T ss_pred EEEecCcEEEEEEeccccccccchhhhc-ceeeeeccccEEEEe--eeeecCCCCccc Confidence 332221 1 12111 122223 344444333333333 333333221 1 No 203 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=82.75 E-value=0.074 Score=26.78 Aligned_cols=298 Identities=10% Similarity=-0.023 Sum_probs=125.1 Q ss_pred hhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhh Q lcl|NC_010583. 115 VGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGA 194 (458) Q Consensus 115 ~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~ 194 (458) ....+.... +....+-+.+- +... ..... + ...+..-+....-+.+...+-+.+...+.... T Consensus 1 ~~~~~~~~~-----~~~~~~~~~~~------~~~~-~~~~~-~-----~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~ 62 (329) T protein:vir:10 1 MDGIFITGV-----KTMNKEIKNAT------GKLK-LNLQH-F-----ANKSVEPGDTLLKNKHVGILEKVTAANSYSAP 62 (329) T ss_pred CCceEEech-----hhhhhhhhccc------ceeE-Eehhh-h-----cCCccCCchhHHHHHHHHHHHHHHHhhceeee Confidence 000000000 00000000000 0000 00000 0 01112222334445555555444433322111 Q ss_pred -hcc--eeeeccCceEEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccHH--HHHH Q lcl|NC_010583. 195 -LFD--ELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIF--SLLP 269 (458) Q Consensus 195 -~~~--~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~--~~~~ 269 (458) +++ .....+...+||......-..+..-+.. .....+.++...+++..+.-.+.- ..-=..++.. .+.. T Consensus 63 ~~~N~~~e~~~g~tVkIp~i~~~gl~DY~R~~g~-----~~g~vt~~~~t~tidqdR~~~F~V-D~~D~dEtn~~l~a~~ 136 (329) T protein:vir:10 63 AVISNDAIFMQGRSFTVIKGDVTELKDYKRNATN-----EFDHPQIQETTYFLDQEKYWGRFV-DALDRRDTEGNIDINY 136 (329) T ss_pred eecccceeeccCcEEEEeeecccccccccCCCCc-----cccccccceeEEEeecccceeeec-chhhHhhhhhhhhHHH Confidence 122 3445677889998865433332211111 112234456666666666554431 1111222222 2344 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhccccee-Eech Q lcl|NC_010583. 270 LLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKL-VLIV 348 (458) Q Consensus 270 ~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 348 (458) .+.+.+...++-.+|...+.- |...+.. ...... .....+..+.++...+.....+...| ++.| T Consensus 137 i~~~~~~~~v~pEiDay~~sk--------la~~a~~---~~~~~~----t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP 201 (329) T protein:vir:10 137 VVAKQASEVVAPYLDNLRFAT--------LARNKAK---HLTVGS----GADAQYDAVLDVSVELDEIGAGASRILFVTP 201 (329) T ss_pred HHHHHHHHHhhhHHHHHHHHH--------HHhhccc---cccccc----CHHHHHHHHHHHHHHHHhcCCCCCcEEEeCH Confidence 455666777777777655421 0000000 000111 12233555666666666543333334 5577 Q ss_pred hHHHHHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEec-ceeEEeecccc Q lcl|NC_010583. 349 SMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQ-RAVTVERERQA 427 (458) Q Consensus 349 ~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-~~~~i~~~~~~ 427 (458) ..+..|......... ..........+..++|.|.||+.++.... .+...+++..+....... ..+++.... . T Consensus 202 ~~~~~Lk~~~~f~~~---~~~~~~~~~~g~Vg~idG~~Ii~vps~~~---k~in~ii~~~~A~~~~~K~~~~~~~~p~-~ 274 (329) T protein:vir:10 202 KFYKGIKKFVIELPQ---GDNRQQVLGKGVQGELDGFTIVKVPSKML---QGVEAMAVIGEVMASPIQANEAKLNSNV-P 274 (329) T ss_pred HHHHHHHhhhhhhcc---ccccccceeeeeeeeecCeEEEEecCCcc---cceeEEEEcCCceeeeeeeeeeeeeCCC-C Confidence 766666542111111 11122233566678899999997754321 222333444443332222 244544321 1 Q ss_pred cCCceEEEEEEeeccEEecccceEEEEeec--C Q lcl|NC_010583. 428 GKQRDAYYVTQRVNLQRYFENGVVSGAYAA--A 458 (458) Q Consensus 428 ~~~~~~~~~~~r~d~~~~~~~afv~l~~aa--a 458 (458) ..+...|+....+|+.|.+|++..+..... . T Consensus 275 ~~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a~ 307 (329) T protein:vir:10 275 GMFGTLAEQMLYTGAFVPEHLQKYIFTIGGKEV 307 (329) T ss_pred ccchheeeeeeeeeeEEEccccCEEEEecccCc Confidence 223467788888999999998644333222 1 No 204 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=82.16 E-value=0.079 Score=26.62 Aligned_cols=302 Identities=11% Similarity=0.064 Sum_probs=132.8 Q ss_pred hhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEE Q lcl|NC_010583. 131 EDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLV 210 (458) Q Consensus 131 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~ 210 (458) .....+..|..++.. ......+.......+..+.|.+.+...+.+.+.+.+-+++..+++++..-...+.. T Consensus 1 M~~~tr~~~~~y~~~---------~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~ 71 (343) T protein:vir:98 1 MNKTAQELFYSLIGD---------AAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDL 71 (343) T ss_pred CChHHHHHHHHHHHH---------HHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEE Confidence 111222223332211 00011111111111223556666777788899999999999999998643333322 Q ss_pred e-cCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc--HHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 211 E-PEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA--IFS-LLPLLRKRLIEAHAVSIEEA 286 (458) Q Consensus 211 ~-~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~-~~~~i~~~la~~~~~~~d~~ 286 (458) . .++..++-... .+...... ..+.-.+...+.---+.|+.+.|+.. ..| |...+++.+.+.++.-.=.- T Consensus 72 ~~~sg~~t~r~~t-----~~~~~~~~--~~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~I 144 (343) T protein:vir:98 72 RSNRKRHYGAHDR-----RTPIQQRW--TRQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKI 144 (343) T ss_pred eecCccccCcccc-----CCCccccc--cCCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhcccee Confidence 2 22222211110 00000000 01111233333333457788888764 255 88888888888887766666 Q ss_pred HhccCC----CCc------cccccccc------------cccccceeeccccchhhHHHHHHHHHHHhhhhhhhccc--c Q lcl|NC_010583. 287 FMSGNG----TGQ------PKGLLKLA------------ADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKL--S 342 (458) Q Consensus 287 ~l~G~g----~~~------p~Gi~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 342 (458) -+||+. |.. .+|++... ...+.....+.++++...+. ...++...+++.++.. . T Consensus 145 GfNGts~A~~T~nPllqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLDa--lV~D~~~~I~~~~~~d~dL 222 (343) T protein:vir:98 145 GFYGTSVGTDTSDPNLADVNKGWIQFVRENKATQILTQGATSGEIRLFGEGADYVNLDE--LAYDLKQGLDARHRDAGDL 222 (343) T ss_pred cccceeeccCCCCcchhhcchHHHHHHHhcchhhhhccceeccceeEecCCCCcccHHH--HHHHHHhcCchHHhcCCCE Confidence 677763 122 24555322 11111111122222222111 1123344566666654 3 Q ss_pred eeEechhHHHHH-Hhhhccccc-cccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecc-ee Q lcl|NC_010583. 343 KLVLIVSMDAYY-DLLEDEEWQ-DVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQR-AV 419 (458) Q Consensus 343 ~~~~~~~~~~~l-~~~~d~~~~-~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~ 419 (458) +.++.......- ..+-...++ |....- .. ......++-|+|.+..+++|.. -+++--++++.|.-.. .. T Consensus 223 VvivG~dLla~~~~~l~n~~~~~ptEk~A--a~-~~~~~k~iGGl~a~~~PfFP~~-----~llVT~L~NLsIY~Q~gs~ 294 (343) T protein:vir:98 223 VFLVGADLVAKEASLVYKGNGLIATEKAA--LN-THDLMKSFGGMPAMIVPNMPPR-----AAIVTSLSNLSIYTQEGSM 294 (343) T ss_pred EEEEchhhhhhhhhhhhhhcCCChHHHHH--HH-HHHHHHhhCCCeeEEccccCCC-----ceEEeeccccEEEEecCcE Confidence 444444433211 111111221 211110 00 0011246779999999999962 2333344444433222 12 Q ss_pred --EEeecc------cccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 420 --TVERER------QAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 420 --~i~~~~------~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+...+ .+.+-...|.++..--+..++.-.|...+-+.+ T Consensus 295 RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~~~g~ 341 (343) T protein:vir:98 295 RRGMKDDDDKKAVRDSYYRNEAYAVEDCGKFMAVDFTKVKLSSGKGT 341 (343) T ss_pred EEEEEeccccccccchhhhcceeeeeccccEEEeeeeeeeecCCCCC Confidence 121111 112223355555555555666555665555555 No 205 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=82.10 E-value=0.08 Score=26.60 Aligned_cols=302 Identities=14% Similarity=0.129 Sum_probs=131.5 Q ss_pred hhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce-EEE Q lcl|NC_010583. 131 EDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-TML 209 (458) Q Consensus 131 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~p 209 (458) .....+..|..++.. ......+.. .+. ...+.|-..+...+...+.+.+-+++..+++++.--.. ++. T Consensus 1 M~~~tr~~~~~y~~~---------~A~~ngv~~-~d~-~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~ 69 (357) T protein:vir:20 1 MRQETRFKFNAYLSR---------VAELNGIDA-GDV-SKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIG 69 (357) T ss_pred CChHHHHHHHHHHHH---------HHHHhCCCh-HHh-cceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEe Confidence 111222333333221 000001110 011 12344555666778889999999999999998875433 333 Q ss_pred EecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 210 VEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEAF 287 (458) Q Consensus 210 ~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~~~~~i~~~la~~~~~~~d~~~ 287 (458) ...+++-++-+.-.. ..+. .+..-..++.-.+...+.---+.|+.+.|+.. ..+|...+++.+.+.++.-.=.-- T Consensus 70 lg~~g~iagrtdT~~-~~~R--~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IG 146 (357) T protein:vir:20 70 IGVTGSIASTTDTAG-GTER--QPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAG 146 (357) T ss_pred cccCccccccccCCC-CCCc--ccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceec Confidence 444445443321110 0000 01111123333444444444467888888864 367899999999888877766666 Q ss_pred hccCC----CC---c------cccccccccc------------ccc-----ceeeccccchhhHHHHHHHHHHHhh-hhh Q lcl|NC_010583. 288 MSGNG----TG---Q------PKGLLKLAAD------------DGA-----KVVTEAKADGSVLVTAKTISKLRRK-LGR 336 (458) Q Consensus 288 l~G~g----~~---~------p~Gi~~~~~~------------~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 336 (458) +||+. |+ . .+|++...-. .++ .+..+.++++...+ ....++... +++ T Consensus 147 fNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLD--alV~D~~~~lI~~ 224 (357) T protein:vir:20 147 FNGVKRAETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLD--ALVMDATNNLIEP 224 (357) T ss_pred ccceeeeccCChhhCcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHH--HHHHHHHhccCCh Confidence 77763 11 1 3566632211 111 11122222222211 112234433 456 Q ss_pred hhccc--ceeEechhHHH--HHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEE Q lcl|NC_010583. 337 HGLKL--SKLVLIVSMDA--YYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFV 412 (458) Q Consensus 337 ~~~~~--~~~~~~~~~~~--~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~ 412 (458) .++.. .+.+|-..... ++..+. ..+.|........ .....++-|+|.+..+++|.. -+++--++++. T Consensus 225 ~~~~d~dLVvivG~dLla~k~~~l~n-~~~~ptE~~Aa~~---i~s~k~iGGl~a~~~PfFP~~-----~ilVT~L~NLs 295 (357) T protein:vir:20 225 WYQEDPDLVVIVGRQLLADKYFPIVN-KEQDNSEMLAADV---IISQKRIGNLPAVRVPYFPAD-----AMLITKLENLS 295 (357) T ss_pred HHhcCCCEEEEEchhhhhhhhhhHhh-ccCChHHHHHHHH---HHHhhhhCCceeEEccccCCC-----ceEEeeccccE Confidence 55543 34444444332 222222 2223332211111 111246789999999999952 23333344433 Q ss_pred EEecc-ee--EEeec-------ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 413 MPRQR-AV--TVERE-------RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 413 i~~~~-~~--~i~~~-------~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |.... .. .+... .|-..| ..|.++..-.+..++.-.|+..+..+. T Consensus 296 IY~Q~gs~RR~~~d~p~r~riE~y~s~N-e~YvVEd~~~~a~iE~i~~~~~~~p~~ 350 (357) T protein:vir:20 296 IYYMDDSHRRVIEENPKLDRVENYESMN-IDYVVEDYAAGCLVEKIKVGDFSTPAK 350 (357) T ss_pred EEEecCcEEEEEEeccccccccchhhhc-ceeeeeccccEEEeeeeeeccccCCcc Confidence 33222 11 12111 122222 344444333333333211211111111 No 206 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=81.74 E-value=0.083 Score=26.51 Aligned_cols=302 Identities=14% Similarity=0.132 Sum_probs=132.0 Q ss_pred hhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce-EEE Q lcl|NC_010583. 131 EDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-TML 209 (458) Q Consensus 131 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~p 209 (458) .....+..|..++.. ......+.. .+. ...+.|-..+...+...+...+-+++..+++++.--.. ++. T Consensus 1 M~~~tr~~~~~y~~~---------~A~~ngv~~-~d~-~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~ 69 (357) T protein:vir:56 1 MRQETRFKFNAYLSR---------VAELNGIDA-GDV-SKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIG 69 (357) T ss_pred CChHHHHHHHHHHHH---------HHHHhCCCh-HHh-cceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEe Confidence 111222333333221 000001110 011 12344555666778889999999999999998875433 333 Q ss_pred EecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 210 VEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEAF 287 (458) Q Consensus 210 ~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~~~~~i~~~la~~~~~~~d~~~ 287 (458) ...+++-++-+.-.. ..+. .+..-..++.-.+...+.---+.|+.+.|+.. ..+|...+++.+.+.++.-.=.-- T Consensus 70 lg~~g~iagrtdT~~-~~~R--~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IG 146 (357) T protein:vir:56 70 IGVTGSIASTTDTAG-GTER--QPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAG 146 (357) T ss_pred cccCccccccccCCC-CCCc--ccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceec Confidence 444454443321110 0000 01111223333444444444467888888864 367899999999888877766666 Q ss_pred hccCC----CC-------c--ccccccccccc------------ccc-----eeeccccchhhHHHHHHHHHHHhh-hhh Q lcl|NC_010583. 288 MSGNG----TG-------Q--PKGLLKLAADD------------GAK-----VVTEAKADGSVLVTAKTISKLRRK-LGR 336 (458) Q Consensus 288 l~G~g----~~-------~--p~Gi~~~~~~~------------~~~-----~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 336 (458) +||+. |+ + .+|++...-.. ++. +..+.++++...+ ..+.++... +++ T Consensus 147 fNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLD--alV~D~~~~lI~~ 224 (357) T protein:vir:56 147 FNGVKRAETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLD--ALVMDATNNLIEP 224 (357) T ss_pred ccceeeeccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHH--HHHHHHHhccCCh Confidence 77753 11 1 35666322110 111 1122222222211 112234433 456 Q ss_pred hhccc--ceeEechhHHH--HHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEE Q lcl|NC_010583. 337 HGLKL--SKLVLIVSMDA--YYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFV 412 (458) Q Consensus 337 ~~~~~--~~~~~~~~~~~--~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~ 412 (458) .++.. .+.+|-..... ++..+. ..+.|........ .....++-|+|.+..+++|.. -+++--++++. T Consensus 225 ~~~~d~dLVvivG~dLla~k~~~l~n-~~~~pTE~~Aa~~---i~s~k~iGGl~a~~~PfFP~~-----~llVT~L~NLs 295 (357) T protein:vir:56 225 WYQEDPDLVVIVGRQLLADKYFPIVN-KEQDNSEMLAADV---IISQKRIGNLPAVRVPYFPAD-----AMLITKLENLS 295 (357) T ss_pred HHhcCCCEEEEEchhhhhhhhhhHhh-ccCChHHHHHHHH---HHHhhhhCCceeEEccccCCC-----ceEEeeccccE Confidence 55543 34444444332 222222 2223332211111 111246789999999999952 23333344433 Q ss_pred EEecc-ee--EEeec-------ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 413 MPRQR-AV--TVERE-------RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 413 i~~~~-~~--~i~~~-------~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |.... .. .+... .|-..| ..|.++..--+..++.-.|+.....+. T Consensus 296 IY~Q~gs~RR~~~d~p~r~riE~y~s~N-e~YvVEd~~~~a~iE~i~i~~~~~~~~ 350 (357) T protein:vir:56 296 IYYMDDSHRRVIEENPKLDRVENYESMN-IDYVVEDYAAGCLVEKIKVGDFSTPAK 350 (357) T ss_pred EEEecCcEEEEEEeccccccccchhhhc-ceeeeeccccEEEeeeeeeccCCCCcc Confidence 33222 11 12111 122222 344444333333333222221111111 No 207 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=81.49 E-value=0.085 Score=26.45 Aligned_cols=300 Identities=13% Similarity=0.130 Sum_probs=133.5 Q ss_pred hhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce-EEE Q lcl|NC_010583. 131 EDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-TML 209 (458) Q Consensus 131 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~p 209 (458) .....+..|..++.. ......+. .... ...+.|-+.+...+.+.+.+.+-+++..+++++.--.. ++. T Consensus 1 M~~~tr~~~~~y~~~---------~A~~ngv~-~~~~-~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~ 69 (355) T protein:vir:98 1 MRPETRFKFNAYLTR---------VAELNNIS-TDDV-SKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIG 69 (355) T ss_pred CChHHHHHHHHHHHH---------HHHHhCCC-hhHc-cceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEee Confidence 111222233333221 00000110 0011 12344444566778889999999999999999875433 334 Q ss_pred EecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 210 VEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEAF 287 (458) Q Consensus 210 ~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~~~~~i~~~la~~~~~~~d~~~ 287 (458) ...+++-++-+.-.. ..+ -.+.....++.-.+..++.---+.|+.+.|+.. ..+|...+++.+.+.++.-.=.-- T Consensus 70 lgv~g~iagrtdT~~-~~~--R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IG 146 (355) T protein:vir:98 70 VGVTGTIASTTDTSG-DKE--RQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAG 146 (355) T ss_pred eccCccccccccCCC-CCC--cccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhc Confidence 444455444321110 000 011112223334444555445567888888874 368999999999999888777777 Q ss_pred hccCC----CC---c------cccccccccc------------ccc-----ceeeccccchhhHHHHHHHHHHHhh-hhh Q lcl|NC_010583. 288 MSGNG----TG---Q------PKGLLKLAAD------------DGA-----KVVTEAKADGSVLVTAKTISKLRRK-LGR 336 (458) Q Consensus 288 l~G~g----~~---~------p~Gi~~~~~~------------~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 336 (458) +||+. |+ . .+|++...-. .+. .+..+.++++...+ ..+.++... +++ T Consensus 147 fNG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLD--AlV~D~~~~lI~~ 224 (355) T protein:vir:98 147 FNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENID--ALVMDATNNLIDE 224 (355) T ss_pred ccceeeeccCChhhCcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHH--HHHHHHHhccCCh Confidence 78864 11 1 3566632211 111 01112222222111 112234433 355 Q ss_pred hhccc--ceeEechhHHH--HHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEE Q lcl|NC_010583. 337 HGLKL--SKLVLIVSMDA--YYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFV 412 (458) Q Consensus 337 ~~~~~--~~~~~~~~~~~--~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~ 412 (458) .++.. .+.+|...... ++..+. ....|....-... .....++-|+|.+..+++|.. -+++--++++. T Consensus 225 ~~~~d~dLVvivG~dLla~k~~~l~n-~~~~ptE~~Aa~~---i~s~k~iGGlpa~~~PffP~~-----~~lVT~L~NLs 295 (355) T protein:vir:98 225 VYQDDPNLVAIVGRKLLADKYFPLVN-KQQENSESLAADI---IISQKRIGNLPAVRVPYFPAN-----AVLVTTLENLS 295 (355) T ss_pred HHhcCCCEEEEEchhhhHHHhhhHhh-ccCCcHHHHHHHH---HHHhhhhCCceeEEccccCCC-----ceEEeeccccE Confidence 55543 34455444332 222222 2223322111100 011246889999999999962 23333344443 Q ss_pred EEecce-e--EEeec-------ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 413 MPRQRA-V--TVERE-------RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 413 i~~~~~-~--~i~~~-------~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |.-..+ . .+... .|-..| ..|.++..--+..++ .+......+. T Consensus 296 IY~Q~gs~RR~~~d~p~r~rie~y~s~N-e~YvVEd~~~~a~ie--nI~~~~~~~~ 348 (355) T protein:vir:98 296 IYFMDESHRRSIDENPKKDRVENYESMN-IDYVVEVYAAGCLLE--NITLGDFTAP 348 (355) T ss_pred EEEecCcEEEEEEeccccccccchhhhc-ceeeeeccccEEEee--ceeeeCCCCC Confidence 332221 1 12111 122222 344444333333333 2322222111 No 208 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=80.80 E-value=0.092 Score=26.28 Aligned_cols=294 Identities=13% Similarity=0.095 Sum_probs=122.4 Q ss_pred HHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce-EEEEec Q lcl|NC_010583. 134 VEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-TMLVEP 212 (458) Q Consensus 134 ~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~p~~~ 212 (458) ..|..|..++.. ......+.......+..+.|...+...+.+.+.+.+-+++..+++++..-.. ++.... T Consensus 1 mtr~~~~~y~~~---------~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~ 71 (336) T protein:vir:37 1 MNKQAYYALAAA---------LAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGAT 71 (336) T ss_pred CcHHHHHHHHHH---------HHHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeecc Confidence 112222222211 0111111111111123456666777888899999999999999999875433 344444 Q ss_pred CCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccH--HHHH-HHHHHHHHHHHHHHHHHHHhc Q lcl|NC_010583. 213 EAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAI--FSLL-PLLRKRLIEAHAVSIEEAFMS 289 (458) Q Consensus 213 ~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~-~~i~~~la~~~~~~~d~~~l~ 289 (458) +++.++-..-+. .......+.-.+..++.---+.|+.+.|+... +++. ..+..-+.+.++.-.=.--+| T Consensus 72 ~g~iagrtdt~r--------~r~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~r~iALD~i~IGfn 143 (336) T protein:vir:37 72 EKGVTGRKQTGR--------NLATLDHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQNQVALDILQIGWN 143 (336) T ss_pred CcccccccCCCC--------CccccCCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHHhcchhhhccc Confidence 455443322111 11112233334444444445678888888753 3433 333333444444444444556 Q ss_pred cCC----CCc------cccccccccc------------cccce-eeccccchhhHHHHHHHHHHHhhhhhhhcccc--ee Q lcl|NC_010583. 290 GNG----TGQ------PKGLLKLAAD------------DGAKV-VTEAKADGSVLVTAKTISKLRRKLGRHGLKLS--KL 344 (458) Q Consensus 290 G~g----~~~------p~Gi~~~~~~------------~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 344 (458) |+. |+. .+|++...-. .++.+ ..+.++++...+ ....++...+++.++... +. T Consensus 144 G~s~A~~TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLD--alV~D~~~~I~~~~~~d~dLVv 221 (336) T protein:vir:37 144 GQSVATNTTKTDLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLD--DLAFDLKQGLDFRHQNRNDLVF 221 (336) T ss_pred ceeeccCCCCccccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHH--HHHHHHHhccchHHhcCCCeEE Confidence 653 222 3455532211 11111 112223222211 113344455677666533 44 Q ss_pred EechhHHHH-HHhhhccc-cccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecce-e-- Q lcl|NC_010583. 345 VLIVSMDAY-YDLLEDEE-WQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRA-V-- 419 (458) Q Consensus 345 ~~~~~~~~~-l~~~~d~~-~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~-- 419 (458) +|....... ...+-... ..|....- .. ......++-|+|.+..+++|.. -+++--++++.|.-..+ . T Consensus 222 ivG~dLla~~~~~l~~~~~~~PtE~~A--a~-~~~~~k~iGGlpa~~~PffP~~-----~~lVT~L~NLsIY~Q~gs~RR 293 (336) T protein:vir:37 222 LVGADLVSKETKLIQQKHGLTPTEKAA--LG-SHNLMGSFGGMNAITPPNFPAR-----AAAVTTLKNLSVYTEAESVRR 293 (336) T ss_pred EEchhhhhhhhhhhhhhcCCCHHHHHH--HH-HHHHHHhhCCceEEEccccCCC-----ceEEeeccccEEEEecCcEEE Confidence 454433221 11111111 12221110 10 0112246889999999999962 23333344444332222 1 Q ss_pred EEeec-------ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 420 TVERE-------RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 420 ~i~~~-------~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+... .|-..| ..|.++..--+..++.-.|.. .+= T Consensus 294 ~~~d~p~r~rie~y~s~N-e~YvVEd~~~~a~iE~i~v~~---~~e 335 (336) T protein:vir:37 294 SLRNDEDKKGLVTSYYRQ-EGYVVEDLGLMTAIDHTKVKL---NGE 335 (336) T ss_pred EEEEccccccccchhhhc-ceeeeeccccEEEeeeeeeec---ccc Confidence 12111 111122 233333332223333222211 111 No 209 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=80.11 E-value=0.098 Score=26.11 Aligned_cols=340 Identities=11% Similarity=-0.020 Sum_probs=129.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhh--hhhhhhhhhcchhhhhhHHHHHHHHHhhhccc-hhHH---HHHHHHhhhh Q lcl|NC_010583. 88 VEKQQETIVGLQDEIKSLLAAREGRSFV--GDSVAKALYGTQDAFEDEVEKLVLLSYMMEKD-VFET---EHGKAHIKAV 161 (458) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~e~~~~~--~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~-~~~~---~~~~~~~~a~ 161 (458) ..+..+-... ...|... .....++..+.+.....+.++..+. +.+.. .... ........+. T Consensus 1 ~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~--~~~~~~~~~~~~~~~~~~~~~a~ 67 (388) T protein:vir:99 1 MKQLSKVHQS-----------LAGRSVRAFDMANGKADYRLTDMAVRELKKFGLV--FDHATVKRQIELLHEGGVATQAF 67 (388) T ss_pred CCCccceeee-----------cCCcccchhhhhcCCcceeeechhhHhhhhccee--ccCccchhhhhhhhhhhhhhccc Confidence 0000000000 0000000 0000000000000000001110000 00000 0000 0000001111 Q ss_pred h--cccccccCccccchhHHHH----HHHHHHhccchhhhcceeeecc---CceEEEEecCCCccccccccccccccccc Q lcl|NC_010583. 162 N--GSSSVSMSSEAYETIFSTR----IIRDLQKELVVGALFDELPMSS---KILTMLVEPEAGRATWVDASKFGTDETVG 232 (458) Q Consensus 162 ~--~~~~~~~g~~~ip~~~~~~----ii~~~~~~~~l~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~e~~~~~e~~~~ 232 (458) . -.+..+.++.-||-.+..- |++.+.......++..+...+. ....+++.+..+.+.+.+-+ +.. T Consensus 68 da~~~~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~~G~A~~ygd~------~D~ 141 (388) T protein:vir:99 68 DSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDL------TNI 141 (388) T ss_pred CcccccccccCcccHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeeeecceeEEEeecc------cCC Confidence 1 1123344455567666553 3333333333333333322221 24455555544444444333 233 Q ss_pred ccccccceeeeeehhheeeeehhhHHHHhc---cHHHHHHHHHHHHHHHHHHHHHHHHhccC-CC--Ccccccccccccc Q lcl|NC_010583. 233 DEVKGQLTEISFKTYKLAAKSFITDETEED---AIFSLLPLLRKRLIEAHAVSIEEAFMSGN-GT--GQPKGLLKLAADD 306 (458) Q Consensus 233 ~~~~~~f~~v~~~~~k~~~~~~is~ell~d---s~~~~~~~i~~~la~~~~~~~d~~~l~G~-g~--~~p~Gi~~~~~~~ 306 (458) +..+......+-..+.++..+.++.+=+.. ...++.+.-+....+++.+.+|+-.|+|. |. .+..|+++..... T Consensus 142 Pl~d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~ 221 (388) T protein:vir:99 142 PLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLL 221 (388) T ss_pred CceeccceeeeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCcc Confidence 333433333444445555556666543333 24678888888899999999999999995 43 3578999875543 Q ss_pred ccceeeccc-----cchhhHHHHHHHHHHHhhhhhhhc---c-c---ceeEechhHHHHHHhhhcccccccccccccccc Q lcl|NC_010583. 307 GAKVVTEAK-----ADGSVLVTAKTISKLRRKLGRHGL---K-L---SKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVK 374 (458) Q Consensus 307 ~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~---~-~---~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~ 374 (458) ....+++.. ...+....++++..++..+..... . . ...++.+..+.+|..- +..|..+.+. +.. T Consensus 222 a~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tvl~~-lk~-- 297 (388) T protein:vir:99 222 PAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDW-LKQ-- 297 (388) T ss_pred cccccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhcccc-CcCCccHHHH-HHH-- Confidence 322222111 112344556677777766644322 1 1 1355666666666422 2112111110 000 Q ss_pred ccccCCeeecccceeccccc-ccccCCceEEEEEeceEE---EE--ecce-eEE-eecccc------cCCceEEEEEEee Q lcl|NC_010583. 375 LQGQVGRIYGLPVVVSEYFP-AKAASAEFAVIVYKDNFV---MP--RQRA-VTV-ERERQA------GKQRDAYYVTQRV 440 (458) Q Consensus 375 ~~~~~~~l~G~pv~~~~~~~-~~~~~~~~~~~~~~~~~~---i~--~~~~-~~i-~~~~~~------~~~~~~~~~~~r~ 440 (458) ...++.+.....+- +...++..++..+...+. +. ++.. ... ...++. ..-....-...|. T Consensus 298 ------n~Pnl~i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~~~~~~rt 371 (388) T protein:vir:99 298 ------TYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNAT 371 (388) T ss_pred ------hcCCcEEEEecccccccccCCceeEEEEecccccccccCccCcceeEEecccccccccceecCceeEeccccce Confidence 01122222222221 111223333222221110 00 0000 000 011111 1112222333444 Q ss_pred -ccEEecccceEEEEee Q lcl|NC_010583. 441 -NLQRYFENGVVSGAYA 456 (458) Q Consensus 441 -d~~~~~~~afv~l~~a 456 (458) |+.+..|.||+.++-= T Consensus 372 ~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 372 AGVMLKRPWAVVRLIGL 388 (388) T ss_pred eeeEEeccchhheeccC Confidence 5578889999886655 No 210 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=79.63 E-value=0.1 Score=26.01 Aligned_cols=302 Identities=14% Similarity=0.134 Sum_probs=132.5 Q ss_pred hhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce-EEE Q lcl|NC_010583. 131 EDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-TML 209 (458) Q Consensus 131 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~p 209 (458) .....+..|..++.. ......+.. .+. ...+.|-..+...+...+.+.+-+++..+++++.--.. ++. T Consensus 1 M~~~tr~~~~~y~~~---------~A~~ngv~~-~d~-~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~ 69 (357) T protein:vir:60 1 MRQETRFKFNAYLSR---------VAELNGIDA-GDV-SKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIG 69 (357) T ss_pred CChHHHHHHHHHHHH---------HHHHhCCCh-HHh-cceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEe Confidence 111222333333221 000001110 011 12344555666778889999999999999998875433 333 Q ss_pred EecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 210 VEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEAF 287 (458) Q Consensus 210 ~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~~~~~i~~~la~~~~~~~d~~~ 287 (458) ...+++-++-+.-.. ..+. .+..-..++.-.+...+.---+.|+.+.|+.. ..+|...+++.+.+.++.-.=.-- T Consensus 70 lg~~g~iagrtdT~~-~~~R--~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IG 146 (357) T protein:vir:60 70 IGVTGSIASTTDTAG-GTER--QPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAG 146 (357) T ss_pred cccCcccccccccCC-CCCc--ccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceec Confidence 444445443321110 0000 01111223334444444444567888888864 367999999999888877766666 Q ss_pred hccCC----CC-------c--ccccccccccc------------ccc-----eeeccccchhhHHHHHHHHHHHhh-hhh Q lcl|NC_010583. 288 MSGNG----TG-------Q--PKGLLKLAADD------------GAK-----VVTEAKADGSVLVTAKTISKLRRK-LGR 336 (458) Q Consensus 288 l~G~g----~~-------~--p~Gi~~~~~~~------------~~~-----~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 336 (458) +||+. |+ + .+|++...-.. ++. +..+.++++...+ ..+.++... +++ T Consensus 147 fNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLD--alV~D~~~~lI~~ 224 (357) T protein:vir:60 147 FNGVRRAETSDRSSNQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLD--ALVMDATNNLIEP 224 (357) T ss_pred ccceeeeccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHH--HHHHHHHhccCCh Confidence 77753 11 1 35666322110 111 1122222222211 112234433 456 Q ss_pred hhccc--ceeEechhHHH--HHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEE Q lcl|NC_010583. 337 HGLKL--SKLVLIVSMDA--YYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFV 412 (458) Q Consensus 337 ~~~~~--~~~~~~~~~~~--~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~ 412 (458) .++.. .+.+|-..... ++..+. ..+.|........ .....++-|+|.+..+++|.. -+++--++++. T Consensus 225 ~~~~d~dLVvivG~dLla~k~~~l~n-~~~~pTE~~Aa~~---i~s~k~iGGl~a~~~PfFP~~-----~llVT~L~NLs 295 (357) T protein:vir:60 225 WYQEDPDLVVIVGRQLLADKYFPIVN-REQDNSEMLAADV---IISQKRIGNLPAVRVPYFPAD-----AMLITKLENLS 295 (357) T ss_pred HHhcCCCEEEEEchhhhhHHhhhHhh-cCCChHHHHHHHH---HHHhhhhcCcceEEccccCCC-----ceEEeeccccE Confidence 55543 34444444332 232222 2223322111110 111246789999999999952 23333344433 Q ss_pred EEecc-ee--EEeec-------ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 413 MPRQR-AV--TVERE-------RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 413 i~~~~-~~--~i~~~-------~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) |.... .. .+... .|-..| ..|.++..--+..++.-.|+..+..+. T Consensus 296 IY~Q~gs~RR~~~d~p~r~riE~y~s~N-e~YvVEd~~~~a~iE~i~~~~~~~pa~ 350 (357) T protein:vir:60 296 IYYMDDSHRRVIEENPKLDRVENYESMN-IDYVVEDYAAGCLVEKIKVGDFSTPAK 350 (357) T ss_pred EEEecCcEEEEEEeccccccccchhhhc-ceeeeeccccEEEeeeeeeccCccccc Confidence 33222 11 12111 122222 344444333333333222222111111 No 211 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=79.36 E-value=0.11 Score=25.94 Aligned_cols=294 Identities=11% Similarity=0.092 Sum_probs=129.0 Q ss_pred hhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce-EEE Q lcl|NC_010583. 131 EDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-TML 209 (458) Q Consensus 131 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~p 209 (458) .....+..|..++.. ......+.......+-.+.|-+.+...+...+...+-+++..+++++.--.. ++. T Consensus 1 M~~~tr~~~~~y~~~---------~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~ 71 (342) T protein:vir:10 1 MKDLTLEKYNAYLAR---------QAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLG 71 (342) T ss_pred CChHHHHHHHHHHHH---------HHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEe Confidence 111122222222211 0000011100001111344555667778889999999999999999875433 344 Q ss_pred EecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 210 VEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEAF 287 (458) Q Consensus 210 ~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~~~~~i~~~la~~~~~~~d~~~ 287 (458) ...+++-++-+.-... .+ -.+.+-..++.-.+...+.---+.|+.+.|+.. ..+|...+++.+.+.++.-.=.-- T Consensus 72 lg~~g~iagrtdT~~~-~~--R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IG 148 (342) T protein:vir:10 72 LDSAHTVASTTDTSGD-GE--RKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIG 148 (342) T ss_pred cccCcccccccccCCC-CC--cccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceec Confidence 4444554443211100 00 011111233444444444444567888888864 367999999999988887766666 Q ss_pred hccCCC----C---c------ccccccccccc-----------ccceeeccccchhhHHHHHHHHHHHhh-hhhhhccc- Q lcl|NC_010583. 288 MSGNGT----G---Q------PKGLLKLAADD-----------GAKVVTEAKADGSVLVTAKTISKLRRK-LGRHGLKL- 341 (458) Q Consensus 288 l~G~g~----~---~------p~Gi~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~- 341 (458) +||+.. + . .+|++...-.. ...+..+.++++...+ ....++... +++.++.. T Consensus 149 fNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~~~~~i~iG~~gdy~NLD--alV~D~~~~lI~~~~~~d~ 226 (342) T protein:vir:10 149 FNGTSRAATSDRNSNPLLQDVAKGWLQKMREDAKERVMNGESTDNQVLVGKGQEYANLD--ALVMDATEELIDEWHRDDT 226 (342) T ss_pred ccceeeccCCChhhCcCccccchHHHHHHHhhhhhhhcccceeccceeecCCCCcccHH--HHHHHHHhccCChHHhcCC Confidence 777631 1 1 35666432111 0111112222222211 112234433 45655543 Q ss_pred -ceeEechhHHH--HHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecc- Q lcl|NC_010583. 342 -SKLVLIVSMDA--YYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQR- 417 (458) Q Consensus 342 -~~~~~~~~~~~--~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~- 417 (458) .+.+|-..... ++..+.. ...|....-... .....++-|+|.+..+++|.. -+++--++++.|.... T Consensus 227 dLVvivG~dLladk~~~l~n~-~~~ptE~~Aa~~---i~s~k~iGGl~a~~~PfFP~~-----~ilVT~L~NLsIY~Q~g 297 (342) T protein:vir:10 227 DLVVITGRKLLADKYFPIVNQ-QNAPTEELAADI---VISQKRIGGLKAVRVPFFPAN-----AILITKLENLAIYVQEG 297 (342) T ss_pred CEEEEEchhhhHHHHHHHHhc-CCChHHHHHHHH---HHhhhhhcCceeEEccccCCC-----ceEEeeccccEEEEecC Confidence 34445444332 2222221 222322111100 111246889999999999962 2333334443333221 Q ss_pred ee--EEeec-------ccccCCceEEEEEEee------ccEEeccc Q lcl|NC_010583. 418 AV--TVERE-------RQAGKQRDAYYVTQRV------NLQRYFEN 448 (458) Q Consensus 418 ~~--~i~~~-------~~~~~~~~~~~~~~r~------d~~~~~~~ 448 (458) +. .+... .|-..| ..|.++..- +..+.+|+ T Consensus 298 s~RR~~~d~p~r~rie~y~s~N-e~YvVEd~~~~a~iE~i~i~~~~ 342 (342) T protein:vir:10 298 TTRKHIENVPKKDRIETYESEN-IDYVVEDYGCAALIENITLKDKE 342 (342) T ss_pred cEEEEEEeccccccccchhhhc-cceeeeccccEEEeecceecCCC Confidence 11 12111 111122 222222221 22333444 No 212 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=78.06 E-value=0.12 Score=25.67 Aligned_cols=295 Identities=13% Similarity=0.052 Sum_probs=118.3 Q ss_pred cchhhhhhHHHHHHHHHhhhccchhHHHHHHHHh-hhhhcccccccCccccchhHHHHHHHHHHhc--cchhhhcceeee Q lcl|NC_010583. 125 GTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHI-KAVNGSSSVSMSSEAYETIFSTRIIRDLQKE--LVVGALFDELPM 201 (458) Q Consensus 125 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~-~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~--~~l~~~~~~~~~ 201 (458) -..++...+... ...+.+. +....... .....-.+-.+|+.+--+.+.++|..+.... -.+.+-....|. T Consensus 1 ~~~~~~~~~~~~-~~~~~~~------e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a 73 (463) T protein:vir:99 1 MTIEKNLSDVQQ-KYADQFQ------EDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPA 73 (463) T ss_pred CCcccccchHHH-HHHhhhh------HHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchh Confidence 000011111100 0011110 01101100 0000111122233333334444443332221 122333334444 Q ss_pred ccCceEEEEecCC---CcccccccccccccccccccccccceeeeeehhheeeeehhhHH-HHhccHHHHHHHHHHHHHH Q lcl|NC_010583. 202 SSKILTMLVEPEA---GRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDE-TEEDAIFSLLPLLRKRLIE 277 (458) Q Consensus 202 ~~~~~~~p~~~~~---~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~e-ll~ds~~~~~~~i~~~la~ 277 (458) .+-...|-..... ..+.+++|+ ..++.++|++.......+-++....+|.- -|.++..+....+.++--- T Consensus 74 ~STV~~y~~~~~~G~~g~~~f~~E~------g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~ 147 (463) T protein:vir:99 74 QSTVVKYDQYLRHGNVGHSRFVKEI------GVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIA 147 (463) T ss_pred hhhhhhheeeeccCccccccccccc------cccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHH Confidence 4444444433322 334444554 56677899999999999888877766654 3455667888888899899 Q ss_pred HHHHHHHHHHhccCCC--C-------ccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEech Q lcl|NC_010583. 278 AHAVSIEEAFMSGNGT--G-------QPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIV 348 (458) Q Consensus 278 ~~~~~~d~~~l~G~g~--~-------~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 348 (458) .++..++.++|+|+.. . +..||.+.....+. -...+... ....+-.+......+|..+...+|+. T Consensus 148 ~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~env---iDarG~~L---s~~~ln~Aa~~i~~~fGt~TD~~lp~ 221 (463) T protein:vir:99 148 VVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNV---INAKGNQL---TEKHLNEAAVRIGKGFGTATDAYMPI 221 (463) T ss_pred HHHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCe---eecCCCcc---cHHHHhhhhhhhhcccCChhheecch Confidence 9999999999999752 1 34566554432221 11111111 11223334444456777888888988 Q ss_pred hHHHHHHhhhccccccccccccccccccccCCeeecccce--ecccccccccCCceEEEEEeceEEEEecceeEEeec-- Q lcl|NC_010583. 349 SMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVV--VSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERE-- 424 (458) Q Consensus 349 ~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~--~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~-- 424 (458) .+.+.|..---..-|.+.+. ..+ ....|+||- ++.. +. +... ...+.+... .+..+ T Consensus 222 ~vka~f~~~~l~~qrv~~~~-N~~-------~~~~G~~v~~f~s~~-------G~---I~L~-~s~~m~~~~-il~~~~~ 281 (463) T protein:vir:99 222 GVHADFVNSILGRQMQLMQD-NSG-------NVNTGYSVNGFYSSR-------GF---IKLH-GSTVMENEL-ILDESLQ 281 (463) T ss_pred HHHHHHHHHhcCceEEEEcC-CCC-------ceeeeeeccceeeee-------ee---eeeC-CceecCCcc-cccchhh Confidence 88887764322222222211 111 124455542 1100 00 0000 000000000 00000 Q ss_pred --cc-ccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 425 --RQ-AGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 425 --~~-~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +. +.--.+..-+..--.+...+++...-+.+++. T Consensus 282 ~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~~~Y~vv 318 (463) T protein:vir:99 282 PLPNAPQPAKVTATVETKQKGAFENEEDRAGLSYKVV 318 (463) T ss_pred cCCCCccCceeEEEEeeccCCCCCCcccccceEEEEE Confidence 00 00011111111100111111111111111111 No 213 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=78.06 E-value=0.12 Score=25.67 Aligned_cols=295 Identities=13% Similarity=0.052 Sum_probs=118.3 Q ss_pred cchhhhhhHHHHHHHHHhhhccchhHHHHHHHHh-hhhhcccccccCccccchhHHHHHHHHHHhc--cchhhhcceeee Q lcl|NC_010583. 125 GTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHI-KAVNGSSSVSMSSEAYETIFSTRIIRDLQKE--LVVGALFDELPM 201 (458) Q Consensus 125 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~-~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~--~~l~~~~~~~~~ 201 (458) -..++...+... ...+.+. +....... .....-.+-.+|+.+--+.+.++|..+.... -.+.+-....|. T Consensus 1 ~~~~~~~~~~~~-~~~~~~~------e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a 73 (463) T protein:vir:95 1 MTIEKNLSDVQQ-KYADQFQ------EDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPA 73 (463) T ss_pred CCcccccchHHH-HHHhhhh------HHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchh Confidence 000011111100 0011110 01101100 0000111122233333334444443332221 122333334444 Q ss_pred ccCceEEEEecCC---CcccccccccccccccccccccccceeeeeehhheeeeehhhHH-HHhccHHHHHHHHHHHHHH Q lcl|NC_010583. 202 SSKILTMLVEPEA---GRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDE-TEEDAIFSLLPLLRKRLIE 277 (458) Q Consensus 202 ~~~~~~~p~~~~~---~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~e-ll~ds~~~~~~~i~~~la~ 277 (458) .+-...|-..... ..+.+++|+ ..++.++|++.......+-++....+|.- -|.++..+....+.++--- T Consensus 74 ~STV~~y~~~~~~G~~g~~~f~~E~------g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~ 147 (463) T protein:vir:95 74 QSTVVKYDQYLRHGNVGHSRFVKEI------GVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIA 147 (463) T ss_pred hhhhhhheeeeccCccccccccccc------cccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHH Confidence 4444444433322 334444554 56677899999999999888877766654 3455667888888899899 Q ss_pred HHHHHHHHHHhccCCC--C-------ccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEech Q lcl|NC_010583. 278 AHAVSIEEAFMSGNGT--G-------QPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIV 348 (458) Q Consensus 278 ~~~~~~d~~~l~G~g~--~-------~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 348 (458) .++..++.++|+|+.. . +..||.+.....+. -...+... ....+-.+......+|..+...+|+. T Consensus 148 ~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~env---iDarG~~L---s~~~ln~Aa~~i~~~fGt~TD~~lp~ 221 (463) T protein:vir:95 148 VVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNV---INAKGNQL---TEKHLNEAAVRIGKGFGTATDAYMPI 221 (463) T ss_pred HHHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCe---eecCCCcc---cHHHHhhhhhhhhcccCChhheecch Confidence 9999999999999752 1 34566554432221 11111111 11223334444456777888888988 Q ss_pred hHHHHHHhhhccccccccccccccccccccCCeeecccce--ecccccccccCCceEEEEEeceEEEEecceeEEeec-- Q lcl|NC_010583. 349 SMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVV--VSEYFPAKAASAEFAVIVYKDNFVMPRQRAVTVERE-- 424 (458) Q Consensus 349 ~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~--~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~i~~~-- 424 (458) .+.+.|..---..-|.+.+. ..+ ....|+||- ++.. +. +... ...+.+... .+..+ T Consensus 222 ~vka~f~~~~l~~qrv~~~~-N~~-------~~~~G~~v~~f~s~~-------G~---I~L~-~s~~m~~~~-il~~~~~ 281 (463) T protein:vir:95 222 GVHADFVNSILGRQMQLMQD-NSG-------NVNTGYSVNGFYSSR-------GF---IKLH-GSTVMENEL-ILDESLQ 281 (463) T ss_pred HHHHHHHHHhcCceEEEEcC-CCC-------ceeeeeeccceeeee-------ee---eeeC-CceecCCcc-cccchhh Confidence 88887764322222222211 111 124455542 1100 00 0000 000000000 00000 Q ss_pred --cc-ccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 425 --RQ-AGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 425 --~~-~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) +. +.--.+..-+..--.+...+++...-+.+++. T Consensus 282 ~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~~~Y~vv 318 (463) T protein:vir:95 282 PLPNAPQPAKVTATVETKQKGAFENEEDRAGLSYKVV 318 (463) T ss_pred cCCCCccCceeEEEEeeccCCCCCCcccccceEEEEE Confidence 00 00011111111100111111111111111111 No 214 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=77.63 E-value=0.12 Score=25.58 Aligned_cols=333 Identities=10% Similarity=-0.004 Sum_probs=124.9 Q ss_pred HHHHHHHHhhhhhhhhhhhhc---chhhhhhHHHHHHHHHhhhccc---hhHHHHHH---HHhhhhhc--ccccccCccc Q lcl|NC_010583. 105 LLAAREGRSFVGDSVAKALYG---TQDAFEDEVEKLVLLSYMMEKD---VFETEHGK---AHIKAVNG--SSSVSMSSEA 173 (458) Q Consensus 105 ~~~~~e~~~~~~~~~~~~~~~---~~~~~~~~~~~~a~~~~~~~~~---~~~~~~~~---~~~~a~~~--~~~~~~g~~~ 173 (458) .+......+.......+.... ........+. +.-.+.... ......+. ....++.. .+..+.++.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~---~gi~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~g 77 (382) T protein:vir:96 1 MSHISKTHSRLAGRHAKPFDLKNVTHEAVAALGR---IGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIP 77 (382) T ss_pred CCCcceeeeecCCccccchhhhcccHHHHHHHhc---cccccCcccchhHhhhhhhhhhhhhhcccccccCCccccCCcc Confidence 000000000000000000000 0000000000 000000000 00000000 00011111 1223334455 Q ss_pred cchhHHH----HHHHHHHhccchhhhcceeeecc---CceEEEEecCCCcccccccccccccccccccccccceeeeeeh Q lcl|NC_010583. 174 YETIFST----RIIRDLQKELVVGALFDELPMSS---KILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKT 246 (458) Q Consensus 174 ip~~~~~----~ii~~~~~~~~l~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~ 246 (458) ||-.+.. .|++.+.......++..+...+. ....|++.+..+.+.+.+-+...| ....+.+|.+.++ T Consensus 78 ~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~P----l~d~~~~~~~r~v-- 151 (382) T protein:vir:96 78 TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIP----LTSWNANFERRTI-- 151 (382) T ss_pred HHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecccceEEeecccCCC----ccccccceeEEEE-- Confidence 6766654 44555554444444443322221 344666666555555554333222 1223444555543 Q ss_pred hheeeeehhh-HHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHHhccC--CCC-cccccccccccccccee-eccccchh Q lcl|NC_010583. 247 YKLAAKSFIT-DETEEDA--IFSLLPLLRKRLIEAHAVSIEEAFMSGN--GTG-QPKGLLKLAADDGAKVV-TEAKADGS 319 (458) Q Consensus 247 ~k~~~~~~is-~ell~ds--~~~~~~~i~~~la~~~~~~~d~~~l~G~--g~~-~p~Gi~~~~~~~~~~~~-~~~~~~~~ 319 (458) +.++....++ .|+.+-+ ..++.+--+....+++.+.+|+-.|+|+ |.+ ...|+++........++ ...-...+ T Consensus 152 ~~~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~~~Wa~kT 231 (382) T protein:vir:96 152 VRGELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPSQGWATAD 231 (382) T ss_pred EEEEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCCCCccccc Confidence 4444445554 4444432 4567777788888999999999999995 333 45799987653322111 11112233 Q ss_pred hHHHHHHHHHHHhhhhhhhc----c---cceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccc Q lcl|NC_010583. 320 VLVTAKTISKLRRKLGRHGL----K---LSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEY 392 (458) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~----~---~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~ 392 (458) ....++++..++..+..... . +...++.+..+.+|..- +..|..+.+. +. .+ ..++.|...+. T Consensus 232 ~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~-n~~g~Tvl~~-lk---~n-----~Pnl~i~t~pe 301 (382) T protein:vir:96 232 WAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVT-TPYGISVSDW-IE---QT-----YPKMRIVSAPE 301 (382) T ss_pred HHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhcccc-CccCccHHHH-HH---Hh-----cCCcEEEEccc Confidence 44455666666666643321 1 12355666666666431 1111111110 00 00 11222222222 Q ss_pred ccc---cccCCceEEEEEeceEEEEecceeEEe-------ecccc-----cCCc-eEEEEEEe-eccEEecccceEEEEe Q lcl|NC_010583. 393 FPA---KAASAEFAVIVYKDNFVMPRQRAVTVE-------RERQA-----GKQR-DAYYVTQR-VNLQRYFENGVVSGAY 455 (458) Q Consensus 393 ~~~---~~~~~~~~~~~~~~~~~i~~~~~~~i~-------~~~~~-----~~~~-~~~~~~~r-~d~~~~~~~afv~l~~ 455 (458) +-. .+.++......+.+.+........+.. .-.+. .+.. ...-...| .|+.+..|.||+.++- T Consensus 302 L~~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~~G 381 (382) T protein:vir:96 302 LSGVQMQGKTPEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLG 381 (382) T ss_pred cccccCCCccceeEEEEecchhhhhcccccccCcceeccccceeeeccceeecceeEeccccceeeeEEEcchhhhhccC Confidence 211 111222222222221100000000000 00000 0000 11111122 4668888999887665 Q ss_pred e Q lcl|NC_010583. 456 A 456 (458) Q Consensus 456 a 456 (458) = T Consensus 382 I 382 (382) T protein:vir:96 382 I 382 (382) T ss_pred C Confidence 5 No 215 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=77.14 E-value=0.13 Score=25.48 Aligned_cols=425 Identities=12% Similarity=0.043 Sum_probs=135.9 Q ss_pred CcchHHHHHHHHHH-----------HHHHHHHHHHHHH------------HHHHHHHHHHHHHHHH-------------- Q lcl|NC_010583. 1 MTIDINKLKEELGL-----------GDLAKSLEGLTAA------------QKAAEAKRLREEQEEK-------------- 43 (458) Q Consensus 1 ~~~~~~~~~~~~~~-----------~~~~~~~~~l~~~------------~~~~~~~~~~~e~~~~-------------- 43 (458) .-.+.+++.+-++. ..+.+++..-... ....+..+.+...+.. T Consensus 168 tG~~~e~i~~~m~~etwlta~EAve~Gf~Dei~e~~~~~a~~~~~~~~~~~~~p~~l~~~~~~~~~~p~~~~~~PaPTPa 247 (693) T protein:vir:95 168 TGKSADDIKALLKEETWMNGREAVAAGFADQLTEPLQAAAHLSSKRMQEFAHMPEALKTLLAPRAQTPAAPANTPAPTPA 247 (693) T ss_pred hCCCHHHHHHHHhhhcCCCHHHHHhccchhhhhhhhHHHHhhHHHHHHHhhchHHHHHHHHhhhcccccccccCcccCcc Confidence 11112222211110 0011111000000 0000000000000000 Q ss_pred ------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Q lcl|NC_010583. 44 ------------ELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAR-- 109 (458) Q Consensus 44 ------------~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 109 (458) .....++.......++++.+......+.....+ ...+.+....-.+...++.+-...... T Consensus 248 aaaPaaP~aaap~~adirA~~~aae~~r~aaI~a~fa~f~~~~a~------l~a~~l~d~~~s~d~ar~~lL~~l~~~~~ 321 (693) T protein:vir:95 248 SAAPAAPVAAAPTEADIRARILAEESGRRSAITAAFGAFSTGHAE------LLATCLNDMNITVDQAREKLLAAIGADTQ 321 (693) T ss_pred CCCCCCCccCCCCcchhhHHHHHHHHHHHHHHHHHHHhccCChHH------HHHHHHhhcCCCHHHHHHHHHHHHhhccC Confidence 000000100011111111111111111100000 000000000000001111100000000 Q ss_pred ---------------------HHHhhhhhhhhhhhhcchhhhhh----HHHHHHHHHhhhccchh-HHHHHHHHhhhhhc Q lcl|NC_010583. 110 ---------------------EGRSFVGDSVAKALYGTQDAFED----EVEKLVLLSYMMEKDVF-ETEHGKAHIKAVNG 163 (458) Q Consensus 110 ---------------------e~~~~~~~~~~~~~~~~~~~~~~----~~~~~a~~~~~~~~~~~-~~~~~~~~~~a~~~ 163 (458) ..+....-............+.. +..|..+. +++... .........++.. T Consensus 322 p~~~~~~~~~~~~~~g~~~~d~~~~al~~R~g~~~~~~~n~~~g~~L~elAr~~L~---~rg~~~~~~~~~~~~~~a~~- 397 (693) T protein:vir:95 322 PAAALSAGAHIHAGNGNLVGDSVRASVLARIGRGERQADNAYNGMTLRELARASLV---DRGIGVASLNAPQMVGLAFT- 397 (693) T ss_pred CCCCcCcCccccCCchhHHHHHHHHHHHHhcCcccccCCccccCCcHHHHHHHHHH---hcCCccCCCCHHHHHHHHHh- Confidence 00000000000000000000000 00010100 000000 0000011111111 Q ss_pred ccccccCccccchhHHHHHHHHHHh-----ccchhhhcceeeeccCceEEE-EecCCCcccccccccccccccccccccc Q lcl|NC_010583. 164 SSSVSMSSEAYETIFSTRIIRDLQK-----ELVVGALFDELPMSSKILTML-VEPEAGRATWVDASKFGTDETVGDEVKG 237 (458) Q Consensus 164 ~~~~~~g~~~ip~~~~~~ii~~~~~-----~~~l~~~~~~~~~~~~~~~~p-~~~~~~~a~~v~e~~~~~e~~~~~~~~~ 237 (458) . +++ --|-.+.+-+-..++. ......++.....+.-...-- .....+.---|.|+ +.++-... T Consensus 398 h-tTS----DFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~------gEyk~~t~ 466 (693) T protein:vir:95 398 H-TSS----DFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGEFSSLRQVREG------AEYKYVTL 466 (693) T ss_pred c-Ccc----hhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCCCCChhhcCCC------Cceeeeec Confidence 0 111 1232222222121111 222344444322222111111 11111222223333 33221111 Q ss_pred cceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHh---ccCCC-Cccccccccccccccceeec Q lcl|NC_010583. 238 QLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFM---SGNGT-GQPKGLLKLAADDGAKVVTE 313 (458) Q Consensus 238 ~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l---~G~g~-~~p~Gi~~~~~~~~~~~~~~ 313 (458) .=..-++...+++.++.||++++-+-+.++.+-|-..++++.++.++..++ .++.. ..-+.++...-. + ..++ T Consensus 467 ~e~~e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFhadH~-N--l~tg 543 (693) T protein:vir:95 467 GERGEQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFHADHS-N--LLTG 543 (693) T ss_pred CCccceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceeecccc-c--cccc Confidence 112235678889999999999998888899999999999999999987444 33321 001223322111 1 0111 Q ss_pred cccchhhHHHHHHHHHHHhh--h------hhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecc Q lcl|NC_010583. 314 AKADGSVLVTAKTISKLRRK--L------GRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGL 385 (458) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~--~------~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~ 385 (458) ................+... . ..-...+.-|+..+........+-.+...+. .....+....+.|+ T Consensus 544 a~sals~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~~------a~~~~~~~NP~~~~ 617 (693) T protein:vir:95 544 AASALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVPG------ADVNSGIVNPIRAF 617 (693) T ss_pred cccccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhccccccc------cccccccccchhcc Confidence 11010111111111111110 0 0112344456665555544444332221111 00111111224443 Q ss_pred -cceecccccccccCCceEEEEEec--eE---EEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEee Q lcl|NC_010583. 386 -PVVVSEYFPAKAASAEFAVIVYKD--NF---VMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYA 456 (458) Q Consensus 386 -pv~~~~~~~~~~~~~~~~~~~~~~--~~---~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~a 456 (458) .|+..+.+.+. .+..|.++.+.. .+ ++....+-.|.....|..+.+.|++...+|++++|--++++-.-| T Consensus 618 ~~vi~~prL~~~-s~~~Wyl~a~~~~dtie~~yL~G~~~P~ie~~~gf~~dG~~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 618 AQVIGEPRLDDA-SATAWYMAAKKGSDTIEVAYLDGVDTPYLEQQEGFTVDGVASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred ccccccceecCC-CCCceEEecCCCCCeEEEEEecCCCCCeEeecCCCCcceEEEEEEEeccCceeeccccccCCCC Confidence 44444444321 223344433321 11 123334456677777999999999999999999998888765555 No 216 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=75.94 E-value=0.14 Score=25.25 Aligned_cols=295 Identities=16% Similarity=0.128 Sum_probs=132.6 Q ss_pred hhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce-EEE Q lcl|NC_010583. 131 EDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-TML 209 (458) Q Consensus 131 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~p 209 (458) .....+..|..++.. ... ..+..+ ....+.|-+.+...+...+.+.+-+++..+++++.--.. ++. T Consensus 1 M~~~tr~~~~~y~~~---------~A~---~ngv~~-~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~ 67 (339) T protein:vir:79 1 MRNDTRRLFAAYKAA---------IAK---LNGVER-VDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIG 67 (339) T ss_pred CChHHHHHHHHHHHH---------HHH---HhCccc-ccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEe Confidence 111222233333221 000 011111 122345555667778889999999999999998875433 334 Q ss_pred EecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 210 VEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEAF 287 (458) Q Consensus 210 ~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~~~~~i~~~la~~~~~~~d~~~ 287 (458) ...+++-++-+.-.. .+ -.+..-..++.-.+...+.---+.|+.+.|+.. ..+|...+++.+.+.++.-.=.-- T Consensus 68 lg~~g~iagrtdt~~--~~--R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IG 143 (339) T protein:vir:79 68 LGVSGPVASTTDTTQ--QD--RETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIG 143 (339) T ss_pred eccCcceeecccCCC--CC--cccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceec Confidence 444455443321110 00 111111233334444444444567888888864 367999999999988887766666 Q ss_pred hccCC----CC---c------ccccccccc------------ccccceee-ccccchhhHHHHHHHHHHHh-hhhhhhcc Q lcl|NC_010583. 288 MSGNG----TG---Q------PKGLLKLAA------------DDGAKVVT-EAKADGSVLVTAKTISKLRR-KLGRHGLK 340 (458) Q Consensus 288 l~G~g----~~---~------p~Gi~~~~~------------~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 340 (458) +||+. |+ . .+|++...- ..+..+.. +.++++...+ ....++.. .+++.++. T Consensus 144 fNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLD--alV~d~~~~lId~~~~~ 221 (339) T protein:vir:79 144 FNGVSRAATSDRVANPMLQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGAGADYGNLD--ALVYDITNHLVEPWYAE 221 (339) T ss_pred ccceeeecCCChhhCcCccccchhHHHHHHhhhhhhhhccceeccceeEeccCCCCcccHH--HHHHHHHhccCChHHhc Confidence 77753 11 1 356653221 11111111 2222222111 12234443 34565654 Q ss_pred c--ceeEechhHHH--HHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEec Q lcl|NC_010583. 341 L--SKLVLIVSMDA--YYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQ 416 (458) Q Consensus 341 ~--~~~~~~~~~~~--~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 416 (458) . .+.+|-..... ++..+. ....|........ .....++-|+|.+..+++|.. -+++--++++.|... T Consensus 222 d~dLVvivG~dLla~k~~~l~n-~~~~ptE~~Aa~~---i~s~k~iGGl~a~~~PfFP~~-----~llVT~L~NLsIY~Q 292 (339) T protein:vir:79 222 DPDLVVVCGRNLLSDKYFPLVN-RDRDPVQQIAADL---IISQKRIGNLPAIRVPYFPAN-----GLLVTRLDNLSIYYQ 292 (339) T ss_pred CCCEEEEEchhhhhhHhhhHhh-cCCChHHHHHHHH---HHHhhhhCCceeEEccccCCC-----ceEEeechhcEEEEe Confidence 4 34444444332 222222 2223322111110 111246789999999999952 233334444433322 Q ss_pred c-ee--EEeec-------ccccCCceEEEEEEeeccEEecccceEEEEeec Q lcl|NC_010583. 417 R-AV--TVERE-------RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAA 457 (458) Q Consensus 417 ~-~~--~i~~~-------~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aa 457 (458) . .. .+... .|-..| ..|.++..-.+..++ .+ .+..+| T Consensus 293 ~gs~RR~~~d~p~r~rie~y~s~N-e~YvVEd~~~~a~iE--ni-~~~~aa 339 (339) T protein:vir:79 293 EGGRRRTILDNAKRDRIENYESSN-DAYVIEDLACAAMAE--NI-ALAAAA 339 (339) T ss_pred cCcEEEEEEeccccccccchhhcc-ceeeeeccccEEEee--ee-ecccCC Confidence 2 11 12111 121222 244444333333333 12 222222 No 217 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=75.82 E-value=0.14 Score=25.22 Aligned_cols=294 Identities=14% Similarity=0.097 Sum_probs=124.1 Q ss_pred HHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce-EEEEec Q lcl|NC_010583. 134 VEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-TMLVEP 212 (458) Q Consensus 134 ~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~p~~~ 212 (458) ..|..|..++.. ......+.......+..+.|.+.+...+.+.+.+.+-+++..+++++..-.. ++.... T Consensus 1 mtr~~~~~y~~~---------~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~ 71 (336) T protein:vir:37 1 MNKQAYYALAAA---------LAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGAT 71 (336) T ss_pred CcHHHHHHHHHH---------HHHHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeecc Confidence 122222222211 0111111111111122456666777889999999999999999999875433 334444 Q ss_pred CCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccH--HHHH-HHHHHHHHHHHHHHHHHHHhc Q lcl|NC_010583. 213 EAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAI--FSLL-PLLRKRLIEAHAVSIEEAFMS 289 (458) Q Consensus 213 ~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~~~-~~i~~~la~~~~~~~d~~~l~ 289 (458) +++.++-..-+ -. ..+...+.-.+..++.---+.|+.+.|+... +++. ..+..-+.+.++.-.=.--+| T Consensus 72 ~g~iagrtdt~-------R~-~~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~df~~~~~~~~~~r~iALD~i~IGfn 143 (336) T protein:vir:37 72 EKGVTGRKQTG-------RN-LANLDHTQNGFELAETDSGIIVPWALFDSFAIFKDRLVELYSEYFQNQVALDILQIGWN 143 (336) T ss_pred CcccccccCCC-------cc-ccccCcCCcccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHHhhchhhhccc Confidence 44444322211 11 1112344444444444445678888888753 3433 333333455555444445556 Q ss_pred cCC----CCc------cccccccccc------------cccce-eeccccchhhHHHHHHHHHHHhhhhhhhcccc--ee Q lcl|NC_010583. 290 GNG----TGQ------PKGLLKLAAD------------DGAKV-VTEAKADGSVLVTAKTISKLRRKLGRHGLKLS--KL 344 (458) Q Consensus 290 G~g----~~~------p~Gi~~~~~~------------~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 344 (458) |+. |.. .+|++...-. .++.+ ..+.++++...+ ....++...+++.++... +. T Consensus 144 G~s~A~~TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLD--alV~D~~~~I~~~~~~d~dLVv 221 (336) T protein:vir:37 144 GQSVADNTTKADLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLD--DLAFDLKQGLDFRHQNRNDLVF 221 (336) T ss_pred ceeeccCCCCCcccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHH--HHHHHHHhcCchHHhcCCCeEE Confidence 653 222 3455532211 11111 112223222211 113344455667666533 44 Q ss_pred EechhHHHH-HHhhhccc-cccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEecce-e-- Q lcl|NC_010583. 345 VLIVSMDAY-YDLLEDEE-WQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRA-V-- 419 (458) Q Consensus 345 ~~~~~~~~~-l~~~~d~~-~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~-- 419 (458) +|....... ...+-... ..|....- .. ......++-|+|.+..+++|.. -+++--++++.|.-..+ . T Consensus 222 ivG~dLla~~~~~l~~~~~~~PtE~~A--a~-~~~~~k~iGGlpa~~~PffP~~-----~~lVT~L~NLsIY~Q~gs~RR 293 (336) T protein:vir:37 222 LVGADLVSKETKLIQQKHGLTPTEKAA--LG-SHNLMGSFGGMNAITPPNFPAR-----AAAVTTLKNLSVYTEAESVRR 293 (336) T ss_pred EEchhhhhhhhhhhhhhcCCCHHHHHH--HH-HHHHHHhhCCceeEEccccCCC-----ceEEeechhcEEEEecCcEEE Confidence 454433221 11121211 12221110 10 0111246889999999999962 23333344443332221 1 Q ss_pred EEeec-------ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 420 TVERE-------RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 420 ~i~~~-------~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .+... .|-..| ..|.++..-.+..++.-.| ++.+= T Consensus 294 ~~~d~p~r~rie~y~s~N-e~YvVEd~~~~a~iE~i~v---~~~~e 335 (336) T protein:vir:37 294 SLRNDEDKKGLVTSYYRQ-EGYVVEDLGLMTAIDHTKV---KLNGE 335 (336) T ss_pred EEEEccccccccchhhhc-ceeeeeccccEEEeeeeee---eecCc Confidence 12111 111122 2333333322233332222 22111 No 218 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=75.72 E-value=0.14 Score=25.21 Aligned_cols=306 Identities=13% Similarity=0.096 Sum_probs=122.9 Q ss_pred hhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhcc--chhh Q lcl|NC_010583. 117 DSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKEL--VVGA 194 (458) Q Consensus 117 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~--~l~~ 194 (458) ..........+....+.. ..++.+.+..+ ....-.+-.+++.+--+.+.++|..+..... .+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~-~e~~~KS~~tg-------------~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~ 66 (462) T protein:vir:96 1 MHKDTNLTAEQNKYADKF-QEEVMKSYQTG-------------YGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYR 66 (462) T ss_pred Cccccccchhhhhhhchh-hHHHHHHHhcC-------------CCcCCccccccchhhhhhhhhhhheeeecccchhhhh Confidence 000000000000000000 01111111100 0000111112233333444444433322222 2233 Q ss_pred hcceeeeccCceEEEEecCC---CcccccccccccccccccccccccceeeeeehhheeeeehhhHHH-HhccHHHHHHH Q lcl|NC_010583. 195 LFDELPMSSKILTMLVEPEA---GRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDET-EEDAIFSLLPL 270 (458) Q Consensus 195 ~~~~~~~~~~~~~~p~~~~~---~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~el-l~ds~~~~~~~ 270 (458) -....|..+-...|-..... ..+.+++|+ ..++.++|++...+...+-++..-.+|... |..+..+.... T Consensus 67 ~i~k~~a~sTv~~y~~~~~~G~~g~~~f~~E~------g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~ 140 (462) T protein:vir:96 67 EISRRPAQSTVQKYDVYLRHGNVGHSRFVREV------GVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQI 140 (462) T ss_pred hcCCchhhhhhhhheeeeccCccccccccccc------cccccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHH Confidence 33344444444444443332 334445554 566788999999999999888776666553 34456778888 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCC---------ccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhccc Q lcl|NC_010583. 271 LRKRLIEAHAVSIEEAFMSGNGTG---------QPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKL 341 (458) Q Consensus 271 i~~~la~~~~~~~d~~~l~G~g~~---------~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 341 (458) ..++--..++..++.+.|+|+..= +..||.+.....+ + -...+.. .+...+-........++..+ T Consensus 141 ~~~dai~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~~lI~~~N--V-iDarG~~---Ls~~~ln~aa~~i~~~fGt~ 214 (462) T protein:vir:96 141 LTEDAIAVVAKTIEWASFYGDASLTADPTGQGLEFDGLAKLIDKDN--V-IDAKGES---LTETLLNRSAVLIGKSFGTA 214 (462) T ss_pred HHHHHHHHHHHHHHHHHhhhhcccCCCccccccchhhhhhhcCCCc--e-eecCCCC---ccHHHHhhhhhhcccccCCh Confidence 888888999999999999998531 3456655443222 1 1222221 11223333334445677788 Q ss_pred ceeEechhHHHHHHhhhcccccccccc-------ccccccc-------cccCCeeecccceecccc---cccccC-CceE Q lcl|NC_010583. 342 SKLVLIVSMDAYYDLLEDEEWQDVAQV-------GNDAVKL-------QGQVGRIYGLPVVVSEYF---PAKAAS-AEFA 403 (458) Q Consensus 342 ~~~~~~~~~~~~l~~~~d~~~~~~~~~-------~~~~~~~-------~~~~~~l~G~pv~~~~~~---~~~~~~-~~~~ 403 (458) ...+|+..+.+.|..---..-|.+.+. +...... ...+.++++.|-++.... |...+- +... T Consensus 215 TD~~~p~~v~a~f~~~~l~~qrv~~~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~i~~~~~~~~p~ap~~~~vsa 294 (462) T protein:vir:96 215 TDAYMPIGVHADFVNSVLGRQMQLMQDNSGNVNAGYNVQGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPATVKA 294 (462) T ss_pred hheecchHHHHHHHHhhcCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCcccccccccccCCCCCCCCceeE Confidence 888899888887773221111211111 1111000 001122223333322111 111000 0000 Q ss_pred EEEEeceEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 404 VIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 404 ~~~~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ....+..-.+++.. ......|++...-+..=-.|..+|-.+.+.. T Consensus 295 Tv~t~~~g~f~~~~----------d~~~y~Y~V~avs~dgeS~PS~~VtaTva~~ 339 (462) T protein:vir:96 295 TVETGKKGLFTDEH----------DRAELTYKVVVNSDDAQSAPSEAVTATVNNA 339 (462) T ss_pred EEEeCCCCCCCCcc----------CceeEEEEEEEECCCCccccceeeEeeeecc Confidence 00000000000000 0122222222221111112333333333322 No 219 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=75.41 E-value=0.15 Score=25.15 Aligned_cols=294 Identities=15% Similarity=0.113 Sum_probs=133.3 Q ss_pred hhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce-EEE Q lcl|NC_010583. 131 EDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-TML 209 (458) Q Consensus 131 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~p 209 (458) .....+..|..++.. ... .....+ ....+.|.+.+...+.+.+...+-+++..+++++..-.. ++. T Consensus 1 M~~~tr~~~~~y~~~---------~A~---~ngv~~-~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~ 67 (338) T protein:vir:11 1 MRNETRKQFDAYLAQ---------LAK---LNGVNS-AVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIG 67 (338) T ss_pred CCHHHHHHHHHHHHH---------HHH---HhCCCc-ccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEee Confidence 111222223332211 000 111111 223455556677788999999999999999999885443 344 Q ss_pred EecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 210 VEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEAF 287 (458) Q Consensus 210 ~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~~~~~i~~~la~~~~~~~d~~~ 287 (458) ...+++-++-+.-.... .-.+..-..++.-.+..++.---+.|+.+.|+.. ..+|...+++.+.+.++.-.=.-- T Consensus 68 lg~~g~iagrtdT~~~~---~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IG 144 (338) T protein:vir:11 68 IGVSGTIASRTDTTGDG---VRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIG 144 (338) T ss_pred eccCccccccccCCCCC---ccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhc Confidence 44445544332211000 0001111123334444554445567888888874 368999999999999888777777 Q ss_pred hccCC----C---Cc------cccccccccc-----------cccc--eeeccccchhhHHHHHHHHHHHh-hhhhhhcc Q lcl|NC_010583. 288 MSGNG----T---GQ------PKGLLKLAAD-----------DGAK--VVTEAKADGSVLVTAKTISKLRR-KLGRHGLK 340 (458) Q Consensus 288 l~G~g----~---~~------p~Gi~~~~~~-----------~~~~--~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 340 (458) ++|+. | .. .+|++...-. .... +..+.++++...+ ..+.++.. .+++.++. T Consensus 145 fnG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLD--alV~d~~~~lI~~~~~~ 222 (338) T protein:vir:11 145 FNGTSAAATTNRAANPLLQDVNIGWFQQYRNNAPARVLKEGKTTGKVVVGNGADADYKNLD--ALVFDVVSSLIDPWHRR 222 (338) T ss_pred ccceeeccCCChhhCcCccccchhHHHHHHhhhhhhhhhcccccceeeecCCCCCccccHH--HHHHHHHhccCChHHhc Confidence 78864 1 11 3566532211 0011 1112112221111 11223443 33555554 Q ss_pred cc--eeEechhHHH--HHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEeceEEEEec Q lcl|NC_010583. 341 LS--KLVLIVSMDA--YYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQ 416 (458) Q Consensus 341 ~~--~~~~~~~~~~--~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 416 (458) .. +.+|...... ++..+. ....|........ .....++-|+|.+..+++|.. -+++--++++.|.-. T Consensus 223 d~dLVvivG~dLladk~~~l~n-~~~~ptE~~Aa~~---~~s~k~iGGlpa~~~PffP~~-----~~lVT~L~NLsIY~Q 293 (338) T protein:vir:11 223 DPGLVVILGRELVHDKYFPMVN-KDQPATEKIATDL---ILSQKRMGGLPPVEVPYVPEK-----GLMVTTLKNLSLYWQ 293 (338) T ss_pred CCCEEEEEchhhhHHHHhHHHh-cCCChHHHHHHHH---HHHhhhhCCceeEEccccCCC-----ceEEeeccccEEEEe Confidence 33 4455544332 222222 2222222111100 011246889999999999962 233333444444322 Q ss_pred ce-e--EEeec-------ccccCCceEEEEEEeeccEEecccceEE Q lcl|NC_010583. 417 RA-V--TVERE-------RQAGKQRDAYYVTQRVNLQRYFENGVVS 452 (458) Q Consensus 417 ~~-~--~i~~~-------~~~~~~~~~~~~~~r~d~~~~~~~afv~ 452 (458) .+ . .+... .|-..| ..|.++..-.+..++.-.|+. T Consensus 294 ~gs~RR~~~d~p~r~rie~y~s~N-e~YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 294 IGGRRRYLKEVPEKNRIENYESSN-DAYVVEDYGLGCLVENIEVAE 338 (338) T ss_pred cCcEEEEEEeccccccccchhhhc-cceeeeccccEEEeecceecC Confidence 22 1 12111 121222 244433333333333222222 No 220 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=73.40 E-value=0.17 Score=24.79 Aligned_cols=275 Identities=10% Similarity=-0.047 Sum_probs=109.2 Q ss_pred cccCccccchhHHHHHHHHHH-hccchhhhcceeeeccCceEEEEecCCCcccccccccccccccccccccccceeeeee Q lcl|NC_010583. 167 VSMSSEAYETIFSTRIIRDLQ-KELVVGALFDELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFK 245 (458) Q Consensus 167 ~~~g~~~ip~~~~~~ii~~~~-~~~~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~ 245 (458) -+ .+.-++......+---.+ +..+-..++..+|+.....+|++......+ -+. .......+.....++.....++. T Consensus 1 ~~-~~~~~~dp~LT~~A~gy~n~~~Ia~~l~P~vpV~~~~~~~~~f~~~e~F-~~~-~t~r~~~~~~~~v~~~~~~~~~~ 77 (309) T protein:vir:99 1 MS-NAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGF-TVP-ETLVGRKSKPNEVEFSATDETGS 77 (309) T ss_pred CC-CCCcCcCHhHHHHHhhccChhhhhhhcCCccccCccccceeeechhhcc-ccc-chhhccCCCcceEeecccCceee Confidence 11 112223333333321111 222223456778888777888887542211 111 11122223333344444444555 Q ss_pred hhheeeeehhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHHhc--cCCCCccccccccccccccceeeccccchhhH Q lcl|NC_010583. 246 TYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEAFMS--GNGTGQPKGLLKLAADDGAKVVTEAKADGSVL 321 (458) Q Consensus 246 ~~k~~~~~~is~ell~ds--~~~~~~~i~~~la~~~~~~~d~~~l~--G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 321 (458) ...-+-..+|..+-..++ .++.++.-.+.+...|.+..+..+-. =+.++-|.|..-... ++ ...++.+.+.. T Consensus 78 ~~~~~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~~~~k~~Ls---gt-~~wsd~~SDPi 153 (309) T protein:vir:99 78 TEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLS---GA-DQWSDPTSNPL 153 (309) T ss_pred ecccceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhcCCCceEEec---Cc-cccCCCCCCcH Confidence 555555567777766654 36777777777777776666553322 122222222111000 00 01111222222 Q ss_pred HHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhh---hcc-ccccccccccccccccccCCeeecc-cceecccccc- Q lcl|NC_010583. 322 VTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLL---EDE-EWQDVAQVGNDAVKLQGQVGRIYGL-PVVVSEYFPA- 395 (458) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~---~d~-~~~~~~~~~~~~~~~~~~~~~l~G~-pv~~~~~~~~- 395 (458) . ++.....++ .+.+...+|...+|.+|... ... .++... .+....-.-..++|. .|++....-+ T Consensus 154 ~---~i~~~~~~~---g~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~----~g~it~~~la~l~~ve~V~vg~a~~n~ 223 (309) T protein:vir:99 154 P---VITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGD----EGMVPMAFLQELLELDAIYIGEARLNI 223 (309) T ss_pred H---HHHHHHHhh---CCCcceEEechHHHHHHhhCHHHHHHhcCCCcc----ccccCHHHHHHHhCcceEEeecceeec Confidence 2 222222222 35666778888888776531 111 111000 000000001123343 2333221110 Q ss_pred ----------cccCCceEEEEEec------------eEEEEecceeEEeecccccCCceEEEEEEeeccEEecccceEEE Q lcl|NC_010583. 396 ----------KAASAEFAVIVYKD------------NFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGVVSG 453 (458) Q Consensus 396 ----------~~~~~~~~~~~~~~------------~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~afv~l 453 (458) .-++....++.... .|....+..=.+..+.+-..+...+|+...+.-.++-+++=..+ T Consensus 224 a~~g~~~~~~~iwg~~~~L~y~~~~~~~~~~ps~G~t~~~~~r~~g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~G~li 303 (309) T protein:vir:99 224 ARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) T ss_pred cccccccccccccCCcEEEEEcCCCCCCcccccccceeecccccCCceeeeeeccCCceEEEEeccccchhcchhcchhh Confidence 00111111111110 01111111111111222334455677777776666666664455 Q ss_pred EeecC Q lcl|NC_010583. 454 AYAAA 458 (458) Q Consensus 454 ~~aaa 458 (458) +-+.| T Consensus 304 ~~~va 308 (309) T protein:vir:99 304 ENAVA 308 (309) T ss_pred hhccc Confidence 54444 No 221 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=68.25 E-value=0.24 Score=23.98 Aligned_cols=268 Identities=9% Similarity=0.037 Sum_probs=119.9 Q ss_pred hhcccccccCccccchhHHHHHHHHHHhc-cchhhhcceeeeccCceEEEEecCCCc-cccccccccccccccccccccc Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTRIIRDLQKE-LVVGALFDELPMSSKILTMLVEPEAGR-ATWVDASKFGTDETVGDEVKGQ 238 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~~~~p~~~~~~~-a~~v~e~~~~~e~~~~~~~~~~ 238 (458) +..+... -.++-..+...+....... ....++|+.+|......++.....-|. --|++|-. -.... T Consensus 1 m~it~~~---l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~~Ge~~---------~~~l~ 68 (302) T protein:vir:10 1 MLINKQS---LNAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRWIGAKV---------VKNLK 68 (302) T ss_pred CcccHHH---HHHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCcccccccee---------ecccc Confidence 0000000 0001111112222222222 224555666653333334443333333 23433322 11222 Q ss_pred ceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCC----Cc-------cccccccc Q lcl|NC_010583. 239 LTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMS----GNGT----GQ-------PKGLLKLA 303 (458) Q Consensus 239 f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~~~l~----G~g~----~~-------p~Gi~~~~ 303 (458) =...++..++++..+.||++.+.+-..++..-+...++++.++.++..++. |.++ ++ |.|--... T Consensus 69 ~~~~~i~~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~ 148 (302) T protein:vir:10 69 AYKYVVENEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVS 148 (302) T ss_pred ccceeEEeecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccc Confidence 334567888999999999999999888999999999999999999886553 2111 11 11111000 Q ss_pred cccccceeecc-ccchhhHHHHHHHHHHHhhhhh-----hhcccceeEechhHHHHHHh-hhcccccccccccccccccc Q lcl|NC_010583. 304 ADDGAKVVTEA-KADGSVLVTAKTISKLRRKLGR-----HGLKLSKLVLIVSMDAYYDL-LEDEEWQDVAQVGNDAVKLQ 376 (458) Q Consensus 304 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~l~~-~~d~~~~~~~~~~~~~~~~~ 376 (458) +....... .........+...+.++..... -...+.-++..|........ +.+. ++-. + T Consensus 149 ---N~g~~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~--~~~~-----g---- 214 (302) T protein:vir:10 149 ---NKGTAPLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNP--KLAD-----N---- 214 (302) T ss_pred ---cccchhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhcc--ccCC-----C---- Confidence 00000000 0001111122222222222211 11233344555554443333 3322 1110 0 Q ss_pred ccCCeeec-ccceecccccccccCCceEEEEEece---EEEEecceeEEeecccccCCceEEEEEEeeccEEecccce-- Q lcl|NC_010583. 377 GQVGRIYG-LPVVVSEYFPAKAASAEFAVIVYKDN---FVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFENGV-- 450 (458) Q Consensus 377 ~~~~~l~G-~pv~~~~~~~~~~~~~~~~~~~~~~~---~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~af-- 450 (458) ....+.| ..+++++.+. ++..+.++.+... +.+..+.+-.+.....+..+.+.++....+|+..+-.-+| T Consensus 215 -~~Np~~g~~~~vv~p~L~---s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~ 290 (302) T protein:vir:10 215 -TPNPYVGTAELVVDGRIE---SDTAWFLLDTTKPVKPFIFQPRKQPEFVSQVNLDSDDVFNLRKLKFGAEARAAAGYGF 290 (302) T ss_pred -CcceeccceEEEEeeccC---CCCceEEEecCCccceEEEcCccccEEEeccCCCCCceEEEEEEEEeeeeeeecchhh Confidence 0111223 3455555543 3345666655543 2233444556666666778888888888887644443333 Q ss_pred ----EEEEeecC Q lcl|NC_010583. 451 ----VSGAYAAA 458 (458) Q Consensus 451 ----v~l~~aaa 458 (458) ..-+-++| T Consensus 291 wq~a~~s~g~~~ 302 (302) T protein:vir:10 291 WQLAYGSTGTGA 302 (302) T ss_pred hhhhhccCccCC Confidence 23333344 No 222 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=68.21 E-value=0.24 Score=23.98 Aligned_cols=359 Identities=14% Similarity=0.082 Sum_probs=119.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcch Q lcl|NC_010583. 48 MNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQ 127 (458) Q Consensus 48 ~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~ 127 (458) |.- +..+..+++..-.++. +. +.+..+-+..-..++ ++.+.+... .. T Consensus 1 ~~~---~~~~~l~~kw~p~l~~-~~----~~~i~~~~~~~~a~~------~enq~~~~~-------------------~~ 47 (521) T protein:vir:10 1 MTI---KTKAELLNKWKPLLEG-EG----LPEIANSKQAIIAKI------FENQEKDFQ-------------------TA 47 (521) T ss_pred CCc---chhHHHHHhhhhhhcc-CC----CCccccchhhhhhhh------hhhhhhhhh-------------------hc Confidence 100 0000011111111110 00 000000000000000 000000000 00 Q ss_pred hhhhhHHHHHHHHHhhhccchhHH-HHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce Q lcl|NC_010583. 128 DAFEDEVEKLVLLSYMMEKDVFET-EHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL 206 (458) Q Consensus 128 ~~~~~~~~~~a~~~~~~~~~~~~~-~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 206 (458) ....++.-..+|..++.+...... +.........+.++... ..=|. +. .+++..-+..+..+++-|-||+++.. T Consensus 48 ~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~i~es~~t~~v~---~~~P~-Li-~lvRra~p~LIa~DIwGVQPMTgPTG 122 (521) T protein:vir:10 48 PEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIAAGQTSGAVT---QIGPA-VM-GMVRRAIPNLIAFDICGVQPMNSPTG 122 (521) T ss_pred cccchhHHHHHHhhhhhhhcccCccccccccccccccccccc---cCCch-hh-hHHHHHHhhhhhhhceeeccCCchhh Confidence 011111112223333222100000 00000000001111111 11121 11 13344445556666777777665422 Q ss_pred -------EEEEecCC------------Cccccccccc------------------------------------------- Q lcl|NC_010583. 207 -------TMLVEPEA------------GRATWVDASK------------------------------------------- 224 (458) Q Consensus 207 -------~~p~~~~~------------~~a~~v~e~~------------------------------------------- 224 (458) .|+-.... +++.|-+.+. T Consensus 123 LIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~ 202 (521) T protein:vir:10 123 QVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTA 202 (521) T ss_pred hheeeeeeccCCccccccccccchhccccccccccccccccccccccccccccccccccccccccceecccccccCCCcc Confidence 12111100 0111100000 Q ss_pred --------------------------c-----------cccccccccccccceeeeeehhheeeeehhhHHHHhc----c Q lcl|NC_010583. 225 --------------------------F-----------GTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEED----A 263 (458) Q Consensus 225 --------------------------~-----------~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~d----s 263 (458) . ...+...++-..+++.+++.++..+-...+|-||.+| . T Consensus 203 ~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVH 282 (521) T protein:vir:10 203 DDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVH 282 (521) T ss_pred cccccccccccccccccceeecccccchhhHhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhc Confidence 0 0001112334455666777777777788999999998 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccC-CCC--ccccccccccccccceee--ccc---cchhhHHHHHHH----HHHH Q lcl|NC_010583. 264 IFSLLPLLRKRLIEAHAVSIEEAFMSGN-GTG--QPKGLLKLAADDGAKVVT--EAK---ADGSVLVTAKTI----SKLR 331 (458) Q Consensus 264 ~~~~~~~i~~~la~~~~~~~d~~~l~G~-g~~--~p~Gi~~~~~~~~~~~~~--~~~---~~~~~~~~~~~~----~~~~ 331 (458) ..|.+++|.+-|+..|...|++.||.-= -+. ...|+....+...++.-. ..+ .-+.. ..+..| .... T Consensus 283 GLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~-e~~k~L~~~i~~~a 361 (521) T protein:vir:10 283 GMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAG-ESFKALLFQIDKEA 361 (521) T ss_pred CCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHH-HHHHHHHHHHHHHH Confidence 4789999999999999999999998320 010 112222110000010000 000 11111 111111 1222 Q ss_pred hhhhh-hhcccce-eEechhHHHHHHhhh--cccccccccccccccc-ccccCCeeec-ccceecccccccccCCceEEE Q lcl|NC_010583. 332 RKLGR-HGLKLSK-LVLIVSMDAYYDLLE--DEEWQDVAQVGNDAVK-LQGQVGRIYG-LPVVVSEYFPAKAASAEFAVI 405 (458) Q Consensus 332 ~~~~~-~~~~~~~-~~~~~~~~~~l~~~~--d~~~~~~~~~~~~~~~-~~~~~~~l~G-~pv~~~~~~~~~~~~~~~~~~ 405 (458) ..+.. ..+..+. .++++.....|.... ++.--.....+...+. ..-..|.|.| ++|++..+.|. +.+++ T Consensus 362 n~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~v 436 (521) T protein:vir:10 362 VEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQ-----DYFTV 436 (521) T ss_pred HHHHHhcccccceEEEEchHHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCc-----ceEEE Confidence 22222 2223344 466777777776421 1110000011111111 0011245554 78888877652 23333 Q ss_pred EEe-ceEEEEecceeEEeecccc----------cCCceEEEEEEeeccEEecccceEEEE-eecC Q lcl|NC_010583. 406 VYK-DNFVMPRQRAVTVERERQA----------GKQRDAYYVTQRVNLQRYFENGVVSGA-YAAA 458 (458) Q Consensus 406 ~~~-~~~~i~~~~~~~i~~~~~~----------~~~~~~~~~~~r~d~~~~~~~afv~l~-~aaa 458 (458) ++. +.. ..-.+.+.||. .+-+-.+-...|+++. .+| |+.-. -+-+ T Consensus 437 G~KG~~~-----~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~-~NP--~~~~~~~~~~ 493 (521) T protein:vir:10 437 GYKGPNE-----MDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG-INP--FAESAAQAPA 493 (521) T ss_pred EEeCCcc-----cccceeeccccccccccccCCccccceeeeeeeecee-ecC--cccccCCccc Confidence 322 110 00112222222 1222333334455443 334 32211 1100 No 223 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=65.99 E-value=0.27 Score=23.66 Aligned_cols=417 Identities=10% Similarity=-0.015 Sum_probs=80.2 Q ss_pred CcchHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSL---EGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKS 77 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~---~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~ 77 (458) |.+.. +++.....++.++. +.+...++. ....++.+...+...+.+..++...++.+. ++ ++.+...+.. T Consensus 12 ~~~~e--~a~~~~~~~~~~k~~e~~~~~ke~~~-~~l~~~~e~~~k~~~E~~~~le~~~ee~k~-l~---ee~~~~~~~~ 84 (458) T protein:vir:10 12 LGLGD--LAKSLEGLTAAQKAQEAERMRKEQEE-KELARMNDLVSKAVGEDRKRLEEALELVKS-LD---EKSKKSNELF 84 (458) T ss_pred hchhh--HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH---HHHHHHHHHH Confidence 33332 22322322222222 211111111 111122222222222222222222222111 11 1111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhc--chhhhhhHHHHHHHHHhhhccchhHHHHHH Q lcl|NC_010583. 78 KKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYG--TQDAFEDEVEKLVLLSYMMEKDVFETEHGK 155 (458) Q Consensus 78 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~--~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~ 155 (458) ....+...+...+..++.....+....................+.... ..........+... ....... ..... T Consensus 85 a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~--~~~~~~~--~~~~a 160 (458) T protein:vir:10 85 AQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGV--FETEHGQ--RHLKA 160 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhcc--chhhhhh--hhhhh Confidence 111111111111111111111110000000000000000000000000 00000000000000 0000000 00000 Q ss_pred HHhhhhhcccccccCcc----ccchhHHHHHHHHHHhccchhhhcceeeeccCce-EEEEecCCCccccccccccccccc Q lcl|NC_010583. 156 AHIKAVNGSSSVSMSSE----AYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-TMLVEPEAGRATWVDASKFGTDET 230 (458) Q Consensus 156 ~~~~a~~~~~~~~~g~~----~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~p~~~~~~~a~~v~e~~~~~e~~ 230 (458) .........++...... ++.......++..+-...++..-...+|..++.. -.++..+.. ..+... ++.. T Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~e~~~----~~~~~~-~~~~ 235 (458) T protein:vir:10 161 VNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPDAGKATWVAASTY----GTDTTT-GEEV 235 (458) T ss_pred hhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecCCcceeecccccc----cccccc-cccc Confidence 00000001111111111 2211111122222222222322222333333221 122222211 111111 1111 Q ss_pred ccccccccceeeeeehhheee--eehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHH--------HHhccCCCCcccccc Q lcl|NC_010583. 231 VGDEVKGQLTEISFKTYKLAA--KSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE--------AFMSGNGTGQPKGLL 300 (458) Q Consensus 231 ~~~~~~~~f~~v~~~~~k~~~--~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~--------~~l~G~g~~~p~Gi~ 300 (458) ...-...++...++...-.-. .+.-|.--+. . -+...|...|+.++..++=. .|++..+.... T Consensus 236 ~~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~--~-~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~---- 308 (458) T protein:vir:10 236 KGALKEIHFSTYKLAAKSFITDETEEDAIFSLL--P-LLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSA---- 308 (458) T ss_pred cccceeeEeeeeeEEeeehhhHHHHhcchHHHH--H-HHHHHHHHHHHHHHHHHhhcCCCCCccceeeeccccccc---- Confidence 112223333333333221111 1122221122 1 26666666666666666642 23332221100 Q ss_pred ccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcccccccccccccc---ccccc Q lcl|NC_010583. 301 KLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDA---VKLQG 377 (458) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~---~~~~~ 377 (458) ......+........+.+... .+..+............+.-+........+....+.+++.++..... ....| T Consensus 309 --~~~~~~~~~~~~~~~~~~i~~--~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G 384 (458) T protein:vir:10 309 --KVVTEAKADGSVLVTAKTISK--LRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYG 384 (458) T ss_pred --ceeecccccccccccHHHHHH--HHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecc Confidence 000000001111111111111 11111111111111111111111222222222333222221111100 00111 Q ss_pred cCCeee-cccceecccc-cccccCCceEEEEEeceEEEEe-----cceeEEeecccccCCceEEEEEEeeccEEecccc Q lcl|NC_010583. 378 QVGRIY-GLPVVVSEYF-PAKAASAEFAVIVYKDNFVMPR-----QRAVTVERERQAGKQRDAYYVTQRVNLQRYFENG 449 (458) Q Consensus 378 ~~~~l~-G~pv~~~~~~-~~~~~~~~~~~~~~~~~~~i~~-----~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~~~a 449 (458) .|..+. .+|..-.+.. .-+.. .....+++...+.+-. ...+.+..+. .-|...++....+ +..-+.+ T Consensus 385 ~pv~~~~~~p~~~~~~~~~~~~f-~~~~~~~~~~~~~v~~d~~~~~~~~~~~~~~--r~~~~v~~~~a~v--~~~~aa~ 458 (458) T protein:vir:10 385 LPVVVSEYFPAKANSAEFAVIVY-KDNFVMPRQRAVTVERERQAGKQRDAYYVTQ--RVNLQRYFANGVV--SGTYAAS 458 (458) T ss_pred eeeEEccccccccCCcceEEEEe-cccEEEEEeeceEEEeecccCCCceEEEEEE--EecceEecccceE--EEeeccC Confidence 111110 1121100000 00111 1112233333222210 0001111100 0111111111111 1111111 No 224 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=65.34 E-value=0.29 Score=23.57 Aligned_cols=357 Identities=12% Similarity=0.070 Sum_probs=116.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhcc Q lcl|NC_010583. 67 LDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEK 146 (458) Q Consensus 67 ~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~ 146 (458) +..-+.+.+.=....|..+ .. .++.... ++.-.....+...+... ......++.--.+|..++.+- T Consensus 1 ~~~~~~l~~kw~p~l~~~~-~~-----------~~i~~~~-~~~~~a~llenq~~~~~-~~~~~~~~~~~~~~~~~l~ea 66 (524) T protein:vir:98 1 MSKKNELMEKWNDLLESQE-GL-----------PDIATKS-KKQLVAAILEAQEKDAE-TDPVYRDEKIVESFGGFLAEA 66 (524) T ss_pred CcchHHHHHHhHHHhcCCc-Cc-----------chhcchh-hHHHHHHHHhhHHHHHh-cCccccchHHHHhhhcccccc Confidence 0000011111000000000 00 0000000 00000000000000000 000111111122232222221 Q ss_pred chhHH-HHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCc-------eEEEEecCC-C-- Q lcl|NC_010583. 147 DVFET-EHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKI-------LTMLVEPEA-G-- 215 (458) Q Consensus 147 ~~~~~-~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-------~~~p~~~~~-~-- 215 (458) +.... ..........+.++... ..=|. +. .+++..-+..+..+++-|-||+++. .+|+-.... . T Consensus 67 ~~~~~~~~~~~~i~~s~~t~~v~---~~~P~-Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~~~~gte 141 (524) T protein:vir:98 67 EIAGDHNYDQTNIASGKSSGAIT---NIGPA-VI-GMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTP 141 (524) T ss_pred ccccccccccccccccccccccc---cccch-hh-hHHHHHHHhhhhhhhheeccCCchhhhhhhhheeecCCCCCcccc Confidence 10000 00000000000111111 11121 11 1333344455555666666665532 122211100 0 Q ss_pred -------------cccccc------------------------------------------------------------- Q lcl|NC_010583. 216 -------------RATWVD------------------------------------------------------------- 221 (458) Q Consensus 216 -------------~a~~v~------------------------------------------------------------- 221 (458) ++.|-+ T Consensus 142 A~~nEAf~~~ye~dt~fSG~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~ 221 (524) T protein:vir:98 142 ADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENE 221 (524) T ss_pred cccccccccccccccccCCccccccccccccccccccccccccccccccceeccccccCccccccccccccccccccccc Confidence 000000 Q ss_pred --------cccc-----------cccccccccccccceeeeeehhheeeeehhhHHHHhc----cHHHHHHHHHHHHHHH Q lcl|NC_010583. 222 --------ASKF-----------GTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEED----AIFSLLPLLRKRLIEA 278 (458) Q Consensus 222 --------e~~~-----------~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~d----s~~~~~~~i~~~la~~ 278 (458) .+-. ...+...++-..+++.+++.++..+-...+|-||.+| ...|.+++|.+-|+.. T Consensus 222 ~g~~~~~~~GmsTA~aEaL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTE 301 (524) T protein:vir:98 222 KGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATE 301 (524) T ss_pred ccceeecccccchhhhhhhccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHH Confidence 0000 0001123334455666777777777788899999998 2478899999999999 Q ss_pred HHHHHHHHHhccC-CC--Cccccccccccccccceeec-----cccchhhHHHHHHHH----HHHhhhhh-hhcc-ccee Q lcl|NC_010583. 279 HAVSIEEAFMSGN-GT--GQPKGLLKLAADDGAKVVTE-----AKADGSVLVTAKTIS----KLRRKLGR-HGLK-LSKL 344 (458) Q Consensus 279 ~~~~~d~~~l~G~-g~--~~p~Gi~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~----~~~~~~~~-~~~~-~~~~ 344 (458) |...|++.||.-= -+ -...|+........+..-.. .++-+.. ..+..|. ...+.+.. ..+. ..-. T Consensus 302 ImlEINReii~~i~~~a~~~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~-e~~~~L~~~i~~~an~I~~~T~rg~~n~~ 380 (524) T protein:vir:98 302 IMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAG-ESYKALLIQIDKEANEIARQTGRGAGNFI 380 (524) T ss_pred HHHHhhHHHHHHHhhhheeceeecccccccccceeeccccccccccchhH-HHHHHHHHHHHHHHHHHHHhhccccccEE Confidence 9999999998320 01 11223221111100100000 0111111 1222222 22222222 2222 3345 Q ss_pred EechhHHHHHHhhh----cccccccccccccccc-ccccCCeee-cccceecccccccccCCceEEEEEe-ceEEEEecc Q lcl|NC_010583. 345 VLIVSMDAYYDLLE----DEEWQDVAQVGNDAVK-LQGQVGRIY-GLPVVVSEYFPAKAASAEFAVIVYK-DNFVMPRQR 417 (458) Q Consensus 345 ~~~~~~~~~l~~~~----d~~~~~~~~~~~~~~~-~~~~~~~l~-G~pv~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~~ 417 (458) ++++....+|..+. ++.+ ..+.....+. ..-..|.|. |++|++..+.|. +.+++++. +.- . T Consensus 381 i~S~~Va~~L~~~~~g~~~~s~--~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~KG~~~-----~ 448 (524) T protein:vir:98 381 IASRNVVSALARIDSGITPASQ--GLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQ-----DYFTVGFKGDNE-----M 448 (524) T ss_pred EEchHHHHHHhhhhcccccccc--hhhcccccCCccceEEEEecCceEEEecCCCCc-----ceEEEEeeCCcc-----c Confidence 67777777776421 1111 1111111010 001124444 478888877652 23333322 110 0 Q ss_pred eeEEeecccc----------cCCceEEEEEEeeccEEecccceEEEE-eecC Q lcl|NC_010583. 418 AVTVERERQA----------GKQRDAYYVTQRVNLQRYFENGVVSGA-YAAA 458 (458) Q Consensus 418 ~~~i~~~~~~----------~~~~~~~~~~~r~d~~~~~~~afv~l~-~aaa 458 (458) .-.+.+.||. .+-+-.+-...|+++. .+| |+... -+-+ T Consensus 449 ~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~-~NP--~~~~~~~~~~ 497 (524) T protein:vir:98 449 DAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG-INP--FANSRSQAPA 497 (524) T ss_pred ccceeeccccccccccccCCccccceeeeeeeecee-ecC--cccccCCccc Confidence 0112222222 1222222333454432 233 32211 1111 No 225 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=60.25 E-value=0.38 Score=22.91 Aligned_cols=359 Identities=11% Similarity=0.047 Sum_probs=117.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhh Q lcl|NC_010583. 51 LVSKAVGEDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAF 130 (458) Q Consensus 51 ~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 130 (458) +. .++.+++..-.++.- . +-+..+..++ ..... +-+...+..+. ...... T Consensus 1 ~~---~~~l~~kw~p~l~~~-~----~~~i~~~~~~---~~~a~---l~enq~~~~~~----------------~~~~~~ 50 (534) T protein:vir:10 1 MS---KKSLLKKWQPLVESE-G----MPAIASMKRK---DIVAR---IFENQDEDIAH----------------NEGGVY 50 (534) T ss_pred Cc---hhHHHHHhHHhhcCC-c----cccccchhhh---hhhhh---hhhhHHHHHhh----------------hccccc Confidence 00 000111110000000 0 0000000000 00000 00000000000 000000 Q ss_pred hhHHHHHHHHH---hhhccchhHHHHHHHH-hhhh--hcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccC Q lcl|NC_010583. 131 EDEVEKLVLLS---YMMEKDVFETEHGKAH-IKAV--NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSK 204 (458) Q Consensus 131 ~~~~~~~a~~~---~~~~~~~~~~~~~~~~-~~a~--~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 204 (458) .+...-.+|.. .+.+....+.+..... .++- ..++++.+-...-|. +. .+++..-+..+..+++-|-||+++ T Consensus 51 ~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~ia~s~~s~~v~~~~P~-Li-~lvRra~p~LIa~DIwGVQPMTgP 128 (534) T protein:vir:10 51 TDQVVVNSMVDVKGRIEEARLAEANIGGDHGYDATKIASGETSGSITNVGPA-VM-GLVRRAIPQLIAFDICGVQPMTSS 128 (534) T ss_pred chhhhhhhhhccccchhhccccccccccccccccccccccccccccccccch-hh-hHHHHHHHhhhhhhhheeccCCch Confidence 11111111111 1111111111100000 0000 001111110111121 11 133444455556666777676654 Q ss_pred ceE-------EEEecCC------------Ccccccccccc---------------------------------------- Q lcl|NC_010583. 205 ILT-------MLVEPEA------------GRATWVDASKF---------------------------------------- 225 (458) Q Consensus 205 ~~~-------~p~~~~~------------~~a~~v~e~~~---------------------------------------- 225 (458) ..- |--.... +++.|-+.+.. T Consensus 129 TGLIFAMRsrY~n~~~~~s~~EAf~ne~~adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~ 208 (534) T protein:vir:10 129 TGQVFTLRAIYGGNSQDANAREAFHPTYGPDADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKD 208 (534) T ss_pred hhhheeeeeeecCCCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Confidence 221 1100000 11111100000 Q ss_pred ---------------------------------------------cccccccccccccceeeeeehhheeeeehhhHHHH Q lcl|NC_010583. 226 ---------------------------------------------GTDETVGDEVKGQLTEISFKTYKLAAKSFITDETE 260 (458) Q Consensus 226 ---------------------------------------------~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell 260 (458) ...+...++-..+++.+++.++.-+-...+|-||. T Consensus 209 ~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELA 288 (534) T protein:vir:10 209 YAVDALPADQTEAGLAYKWLLANGYAVETSSAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMA 288 (534) T ss_pred cccccccCCccccccccccccccccceecccccchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHH Confidence 00001123344556777777777777889999999 Q ss_pred hc----cHHHHHHHHHHHHHHHHHHHHHHHHhccC--CC--Cc---------cccccccccccccceeeccccchhhHHH Q lcl|NC_010583. 261 ED----AIFSLLPLLRKRLIEAHAVSIEEAFMSGN--GT--GQ---------PKGLLKLAADDGAKVVTEAKADGSVLVT 323 (458) Q Consensus 261 ~d----s~~~~~~~i~~~la~~~~~~~d~~~l~G~--g~--~~---------p~Gi~~~~~~~~~~~~~~~~~~~~~~~~ 323 (458) +| ...|.+++|.+-|+..|...|++.||.-= -+ ++ -.|++........ ...-+.. .. T Consensus 289 QDLKAIHGLDAEtELsNILSTEImlEINReii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~-----~~~~~~~-e~ 362 (534) T protein:vir:10 289 QDLRAVHGLDADSELSSILANEIMHEINREMVLWINATAKVGKTGWTNMHGGKAGVFDFQDTKDI-----RGARWAG-ES 362 (534) T ss_pred HHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhheeecccccccccccceeeeeccccc-----cchhHHH-HH Confidence 98 24778999999999999999999888430 00 11 1233322111100 0011111 12 Q ss_pred HHHHHHHHhhhhh-----hhc-ccceeEechhHHHHHHhhh--ccccccccccccc-cccccccCCeeec-ccceecccc Q lcl|NC_010583. 324 AKTISKLRRKLGR-----HGL-KLSKLVLIVSMDAYYDLLE--DEEWQDVAQVGND-AVKLQGQVGRIYG-LPVVVSEYF 393 (458) Q Consensus 324 ~~~~~~~~~~~~~-----~~~-~~~~~~~~~~~~~~l~~~~--d~~~~~~~~~~~~-~~~~~~~~~~l~G-~pv~~~~~~ 393 (458) +..|.--+..... ..+ ...-.++++.....|.... ++..-........ ........|+|.| ++|++..+. T Consensus 363 ~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~ 442 (534) T protein:vir:10 363 YKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALGHTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVYIDQYA 442 (534) T ss_pred HHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHhhccchhccccccccccccccCCCceEEEEecCceEEEecCCC Confidence 2222222222211 112 2333567777777775421 1100000000000 0111111345554 899988776 Q ss_pred cccccCCceEEEEEe-ceEEEEecceeEEeecccc----------cCCceEEEEEEeeccEEecccc-------eEEEEe Q lcl|NC_010583. 394 PAKAASAEFAVIVYK-DNFVMPRQRAVTVERERQA----------GKQRDAYYVTQRVNLQRYFENG-------VVSGAY 455 (458) Q Consensus 394 ~~~~~~~~~~~~~~~-~~~~i~~~~~~~i~~~~~~----------~~~~~~~~~~~r~d~~~~~~~a-------fv~l~~ 455 (458) |. +.+++++. +.. ..-.+.+.||. .+-+-.+-...|+++.+ +|=+ +.++.- T Consensus 443 ~~-----dy~~vG~KG~~~-----~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~~i~~ 511 (534) T protein:vir:10 443 VE-----DYFTVGYKGASE-----MDAGLYYCPYVALTPLRGTDPKNFQPVLGFKTRYGVKL-HPMADATQNKGFAKISN 511 (534) T ss_pred Cc-----ceEEEEEeCCcc-----cccceeeccccccccccccCCccccceeeeeeeeceee-cCcccccCCcccccccc Confidence 63 23333322 110 00112222322 12233333344555432 3311 111111 Q ss_pred ec---C Q lcl|NC_010583. 456 AA---A 458 (458) Q Consensus 456 aa---a 458 (458) .. + T Consensus 512 g~~~~~ 517 (534) T protein:vir:10 512 GMPQHT 517 (534) T ss_pred CCcchh Confidence 00 0 No 226 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=59.93 E-value=0.38 Score=22.87 Aligned_cols=357 Identities=13% Similarity=0.072 Sum_probs=121.1 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccch Q lcl|NC_010583. 70 VKNLDEKSKKSAELFA-QTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDV 148 (458) Q Consensus 70 i~~~~e~~~~~~e~~~-~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~ 148 (458) ++...+..+......+ +...+ ++... ++.-.....+...+... .+....++....+|..++.+-.. T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~-----------i~~~~-~~~~~a~llenq~~~~~-~~~~~~~~~~~~~~~~~l~ea~~ 67 (528) T protein:vir:80 1 MKTTKELMEKWSPLLENEKLPE-----------IATAS-KQKLVAKILESQEADFA-VDPIYKDEKVVEAFGGFIAEAEV 67 (528) T ss_pred CcchHHHHHhhhHhhcCCccch-----------hcchh-hhhhhhhhhhhhhHHhh-ccccccchHHHHhhhhhcccccc Confidence 1111111111111100 00000 00000 00000000000000000 00012222223333333322110 Q ss_pred hHHH-HHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce-------EEEEecCC------ Q lcl|NC_010583. 149 FETE-HGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-------TMLVEPEA------ 214 (458) Q Consensus 149 ~~~~-~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-------~~p~~~~~------ 214 (458) ...- .........+.++.... .=|. +. .++++.-+..+..+++-|-||+++.. +|+..... T Consensus 68 ~~~~~~~~~~i~es~~t~~v~~---~~P~-Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea 142 (528) T protein:vir:80 68 AGDHGYDASQIAAGQTTGAITN---VGPA-VI-GMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQAKEA 142 (528) T ss_pred ccccCCcccccccccccccccc---CCch-hh-hHHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCccccccccc Confidence 0000 00000000111111111 1121 11 13444445666666777777765411 11100000 Q ss_pred ------Ccc----------------------------------------------------------------------- Q lcl|NC_010583. 215 ------GRA----------------------------------------------------------------------- 217 (458) Q Consensus 215 ------~~a----------------------------------------------------------------------- 217 (458) +.+ T Consensus 143 ~~~~~~~da~fS~~~t~~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~ 222 (528) T protein:vir:80 143 FHPMYAPDAFHSSLAAKGAAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMK 222 (528) T ss_pred cccccccccccccccccccccccccccccccccccccccccceeccccccccccccccccccccCccccCCccccccccc Confidence 000 Q ss_pred -------ccccccccc--c---------cccccccccccceeeeeehhheeeeehhhHHHHhc----cHHHHHHHHHHHH Q lcl|NC_010583. 218 -------TWVDASKFG--T---------DETVGDEVKGQLTEISFKTYKLAAKSFITDETEED----AIFSLLPLLRKRL 275 (458) Q Consensus 218 -------~~v~e~~~~--~---------e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~d----s~~~~~~~i~~~l 275 (458) +-++.+-.. + .+...++-..+++.++++++..+-...+|-||.+| ...|.+++|.+-| T Consensus 223 ~~~~~~~~~~~~Gm~Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNIL 302 (528) T protein:vir:80 223 LMEEGKLAEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAIL 302 (528) T ss_pred ccccccccccccccchhhhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHH Confidence 000000000 0 01113344455667777777777788999999998 3478999999999 Q ss_pred HHHHHHHHHHHHhcc---CCCCcccccc----ccccccccceeeccc-cchhhHHHHHHHH----HHHhhhhh-hhccc- Q lcl|NC_010583. 276 IEAHAVSIEEAFMSG---NGTGQPKGLL----KLAADDGAKVVTEAK-ADGSVLVTAKTIS----KLRRKLGR-HGLKL- 341 (458) Q Consensus 276 a~~~~~~~d~~~l~G---~g~~~p~Gi~----~~~~~~~~~~~~~~~-~~~~~~~~~~~~~----~~~~~~~~-~~~~~- 341 (458) +..|...|++.||.- ...-.-+|+. +.+............ .-+.. ..+..|. .....+.. ..+.. T Consensus 303 StEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~-e~~k~L~~~i~~~an~I~~~T~~~~g 381 (528) T protein:vir:80 303 ANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAG-ESFKSLIYQIDKEAAEIARQTGRGAG 381 (528) T ss_pred HHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeeccccccccccchhH-HHHHHHHHHHHHHHHHHHHhhccccc Confidence 999999999999631 1110011111 111111111110000 11111 1122222 22222222 12223 Q ss_pred ceeEechhHHHHHHhhh--cccccccccccccccccc-ccCCeeec-ccceecccccccccCCceEEEEEe-ceEEEEec Q lcl|NC_010583. 342 SKLVLIVSMDAYYDLLE--DEEWQDVAQVGNDAVKLQ-GQVGRIYG-LPVVVSEYFPAKAASAEFAVIVYK-DNFVMPRQ 416 (458) Q Consensus 342 ~~~~~~~~~~~~l~~~~--d~~~~~~~~~~~~~~~~~-~~~~~l~G-~pv~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~ 416 (458) .-.++++.....|.... ++...+..+.....+... -..|.|.| ++|++..+.|. +.+++++. +.- T Consensus 382 n~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~KG~~~----- 451 (528) T protein:vir:80 382 NFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQ-----DYFTVGYKGDNE----- 451 (528) T ss_pred cEEEEchHHHHHHhhccccccccccccccccccCCCCceEEEEecCceEEEecCCCCc-----ceEEEEEeCCcc----- Confidence 34566777777776432 222112111111111111 11345554 78988877652 23333321 110 Q ss_pred ceeEEeecccc----------cCCceEEEEEEeeccEEecccceEEEEeec--C Q lcl|NC_010583. 417 RAVTVERERQA----------GKQRDAYYVTQRVNLQRYFENGVVSGAYAA--A 458 (458) Q Consensus 417 ~~~~i~~~~~~----------~~~~~~~~~~~r~d~~~~~~~afv~l~~aa--a 458 (458) ..-.+.+.||. .+-+-.+-...|+++. ++| |+....-+ + T Consensus 452 ~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~-~NP--~~~~~~~~~~~ 502 (528) T protein:vir:80 452 MDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIG-INP--FADSKSQAPSA 502 (528) T ss_pred cccceeecccccceeeEeeCCccccceeeeeeeecee-ecC--cccccCCcccc Confidence 00011222221 1222222233454432 233 33211110 1 No 227 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=56.07 E-value=0.46 Score=22.41 Aligned_cols=407 Identities=10% Similarity=-0.010 Sum_probs=88.3 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) |+..+.+|+++.. ++.++.+.+.++.+.+.......+..+....++.++ .. .-+.+++..++++++.+...+. T Consensus 8 m~~~i~eL~e~r~--~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el----~~-ei~~le~~~~~~~~~~~~~~~~ 80 (477) T protein:vir:84 8 LRALRAAAVEAVA--TLKAERQAIADGAKAEERAALSADETAEFRAKSASI----KA-ELDKVEDLDEQIRELESEIERS 80 (477) T ss_pred HHHHHHHHHHHHH--HHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHH----HH-HHHHHHHHHHHHHHHHHHHHHh Confidence 8888888888775 556666666555443322211111111111111111 11 1111122222222222222211 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhh Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQD-EIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIK 159 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~-~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 159 (458) .................... ...........+........+......................+............ . T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 158 (477) T protein:vir:84 81 GKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDL--D 158 (477) T ss_pred hcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccc--c Confidence 11111100000000000000 00000000011110000000000000000000000000000000000000111111 1 Q ss_pred hhhcccccccCccccchhHHHHH-----HHHHHhccchhhhc-c-eeee-ccCc-eEEEEecCCCccccccccccccccc Q lcl|NC_010583. 160 AVNGSSSVSMSSEAYETIFSTRI-----IRDLQKELVVGALF-D-ELPM-SSKI-LTMLVEPEAGRATWVDASKFGTDET 230 (458) Q Consensus 160 a~~~~~~~~~g~~~ip~~~~~~i-----i~~~~~~~~l~~~~-~-~~~~-~~~~-~~~p~~~~~~~a~~v~e~~~~~e~~ 230 (458) .....++.....-.++..+...+ +..+-...++.... + .+|. .++. .-+++..+... .+. ..++. T Consensus 159 ~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~----~~~-~~~~s- 232 (477) T protein:vir:84 159 RNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAAL----TAP-SAHEV- 232 (477) T ss_pred ccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCccc----ccc-ccccc- Confidence 11111111112222333333222 21111222222221 1 2343 2222 22233322211 111 11111 Q ss_pred ccccccccceeeeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHH---------HHhccCCCCccccccc Q lcl|NC_010583. 231 VGDEVKGQLTEISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE---------AFMSGNGTGQPKGLLK 301 (458) Q Consensus 231 ~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~---------~~l~G~g~~~p~Gi~~ 301 (458) .......++...++.....-..--+.+....--. -+...|...++.++..++=. .|++..|.+... T Consensus 233 ~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~-~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~---- 307 (477) T protein:vir:84 233 DLTDGFVQANVKTIAGQQGIAIQLLDQAAVSVDE-FVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVT---- 307 (477) T ss_pred ccceeeEEEeeeeEEeeeHHHHHHHhccchhHHH-HHHHHHHHHHHHHHHHHHhccCCCCCccceeeecccccccc---- Confidence 1112223333333322221111111221111111 25556666666666555432 233333322110 Q ss_pred cccccccceeeccccchh----------------------hHHHHHHHHHHHhhhhhhhcccceeEechhHHH--HHHhh Q lcl|NC_010583. 302 LAADDGAKVVTEAKADGS----------------------VLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDA--YYDLL 357 (458) Q Consensus 302 ~~~~~~~~~~~~~~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~l~~~ 357 (458) .. ....+......... ...++..+..+.. .+..+++.|..-. .+..+ T Consensus 308 ~~--~~~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd-------~~G~~l~~~~~~~~~~~~~~ 378 (477) T protein:vir:84 308 AT--SAGSALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFA-------GDDRPLIVPSGPGFNNLGVL 378 (477) T ss_pred cc--ccccchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhc-------cCCCeeeecCcccccccccc Confidence 00 00000000000000 0011111111111 1112222221100 00000 Q ss_pred -h--------ccccccccccccccccccccCCeeec---ccceecccccccccCCceEEEEEeceEEEEecce-----eE Q lcl|NC_010583. 358 -E--------DEEWQDVAQVGNDAVKLQGQVGRIYG---LPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQRA-----VT 420 (458) Q Consensus 358 -~--------d~~~~~~~~~~~~~~~~~~~~~~l~G---~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-----~~ 420 (458) . .=.|.|...... .|. +.-.| ..+++- .. . ..+++.. .+.+..... .. T Consensus 379 ~~~~~~~~~~~l~G~pVv~s~~--~p~----~~~~~~d~~~i~~g------d~-~-~~~i~~~-~~~~~~~~~~~~~~~~ 443 (477) T protein:vir:84 379 TEVASQRVVGQMHGLPVVTDPT--LPT----TLGTGTDQDVIHVL------RA-S-DLALFES-SVRMRALQETRAENLS 443 (477) T ss_pred cccccccccchhcccceEecCc--ccc----cccccCCcceEEEE------Ee-c-eEEEEee-ceeEEeccccccccce Confidence 0 001222211100 000 00000 011111 11 1 1112111 111110000 01 Q ss_pred EeecccccCCceEEE---EEEeeccEEecccceE Q lcl|NC_010583. 421 VERERQAGKQRDAYY---VTQRVNLQRYFENGVV 451 (458) Q Consensus 421 i~~~~~~~~~~~~~~---~~~r~d~~~~~~~afv 451 (458) +....+.-.+...+| ++..+-+.-.-.-.|+ T Consensus 444 ~~~~v~~~~~~~~~r~~~afv~~t~~~~~~~~~~ 477 (477) T protein:vir:84 444 VLLQVYGYLAFTAARFPQSVVEIGGTALTAPTFA 477 (477) T ss_pred eeeeehhhhhhhhhccccceEEeecccccccccC Confidence 100001001111222 2222222222222233 No 228 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=55.40 E-value=0.48 Score=22.33 Aligned_cols=122 Identities=8% Similarity=0.039 Sum_probs=9.4 Q ss_pred Ccc-----------hHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH---HHHHH Q lcl|NC_010583. 1 MTI-----------DINKLKEELGLG--DLAKSLEGLTAAQKAAEAKRLREEQEE-KELARMNDLVSKAVGE---DRKRL 63 (458) Q Consensus 1 ~~~-----------~~~~~~~~~~~~--~~~~~~~~l~~~~~~~~~~~~~~e~~~-~~~~~~~~~~~~~~~e---~~~~~ 63 (458) |-+ .++..+.+.... +...+...+.. +++..+.+....+. ++....+.+......+ .+.+. T Consensus 567 ~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~--q~e~~k~q~e~~~~q~e~q~~q~E~q~~q~e~e~~~~~~ 644 (705) T protein:vir:88 567 AGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKA--QADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELKKQEA 644 (705) T ss_pred hhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 110 011111111000 00000000000 00000000000000 0000000000000000 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHH Q lcl|NC_010583. 64 EEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEK 136 (458) Q Consensus 64 ~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 136 (458) .....+......+.....+....+....+.+ ..+...........+.+ . ....+.....+| T Consensus 645 ~~~~~e~~~~~a~~~~~~~~~e~e~~~~e~e-~~~e~~q~~~~~~~~~~----------~-~~~~k~~~~~rr 705 (705) T protein:vir:88 645 VLQQREMALKEAELQLERDRFTWERARNEAE-YHLEATQARAAYIGDGK----------V-PETKKPTKAVRR 705 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHh----------H-HHHHHHHHHhcC Confidence 0000000000000000000000000000000 00000000000000000 0 000111111111 No 229 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=50.98 E-value=0.59 Score=21.82 Aligned_cols=266 Identities=12% Similarity=0.061 Sum_probs=106.7 Q ss_pred hhhhcccccccCccccchhHHHHHHHHHHhccchhhhc--ceeeeccCceEEEEecCCCccccccccccccccccccccc Q lcl|NC_010583. 159 KAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALF--DELPMSSKILTMLVEPEAGRATWVDASKFGTDETVGDEVK 236 (458) Q Consensus 159 ~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~--~~~~~~~~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~ 236 (458) .++ -.-+.++..+.+.+...+.-..+. ++...++...+||......-..+..-++ -.....+ T Consensus 1 Mai-----------n~a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY~R~~g-----~~~g~v~ 64 (290) T protein:vir:78 1 MAI-----------NYVDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAHTRNKG-----YNEGSAS 64 (290) T ss_pred Cch-----------hHHHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCcccccccCCC-----cccCccc Confidence 000 001234444444444443333332 2233455678888776543332221111 1112234 Q ss_pred ccceeeeeehhheeeeehhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeecc Q lcl|NC_010583. 237 GQLTEISFKTYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEA 314 (458) Q Consensus 237 ~~f~~v~~~~~k~~~~~~is~ell~ds--~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~ 314 (458) .++...+++..+.-.+.-=... .+.+ ...+...+.+...+.++-.+|...+.- |.+.+...+....... T Consensus 65 ~~~et~tl~qdR~~~F~vD~~D-vDEt~~~~~~~nv~~ef~~~~v~PEiDayr~sk--------la~~a~~~~~~~~~t~ 135 (290) T protein:vir:78 65 NTNKSYTIDFDRDVEFFVDVMD-VDETGQALSAANVTKEFNSRHAGPEMDAYRFSK--------LATAAKTNSNSVAEEI 135 (290) T ss_pred cceeeEEeeccccceeeccccc-hhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHH--------HHhhhhccCccccccc Confidence 4566666666665443210001 1111 234666677777777887888765521 1111111111101111 Q ss_pred ccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceecccc- Q lcl|NC_010583. 315 KADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYF- 393 (458) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~- 393 (458) +....+..+.++...+..-...+-..++.|..+..|.....-.. .+...........+..++|.|.+|+.++.- T Consensus 136 ----t~~n~~~~i~~~~~~ldevp~~~rvl~vtp~~~~lL~~~~~f~r-~~~~~~~~~~~i~~~V~~idG~~ii~vps~~ 210 (290) T protein:vir:78 136 ----TKDNVFTKLKAAIRKVKKYGTQNLVMYVSPDVMAALELSDDFVR-AINVQNIGPSSIETRITAIDGTRIVEVEAED 210 (290) T ss_pred ----CHHHHHHHHHHHHHHHHhcCCCCeEEEECHHHHHHHhhChhhhc-cccccccccccccceeeeecCcEEEEecccc Confidence 11233344444445554422233344667877777654322221 111111112223556688999999865421 Q ss_pred -------------cccccCCceEEEEEece-EEEEecceeEEeecccccCCceEEEEEEe--eccEEecccc---eEEEE Q lcl|NC_010583. 394 -------------PAKAASAEFAVIVYKDN-FVMPRQRAVTVERERQAGKQRDAYYVTQR--VNLQRYFENG---VVSGA 454 (458) Q Consensus 394 -------------~~~~~~~~~~~~~~~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~r--~d~~~~~~~a---fv~l~ 454 (458) +...+.....++...+. +.+..-..+++.. |......-+|....| .|.-|.+.+. |+... T Consensus 211 r~~t~~~f~~G~~~~~~ak~in~ii~~~~a~i~~~K~~~~~~~~-P~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~ 289 (290) T protein:vir:78 211 RFYDTFDFTDGYKPAAGAKKLNFLLVNKGSVVGGAKHASIYLHA-PGSVGQGDGWLYQYRVYHDIFVLDQQKDGVIASTE 289 (290) T ss_pred hhhhhhhhcccccccCCccceeEEEEcCCceeeeeeeeEEEeeC-CCCCcCcceeeeeeeeeeeeeeeccccCeeEEEee Confidence 11111122222222222 1122222233322 222222223444444 4444444422 22222 Q ss_pred e Q lcl|NC_010583. 455 Y 455 (458) Q Consensus 455 ~ 455 (458) + T Consensus 290 ~ 290 (290) T protein:vir:78 290 V 290 (290) T ss_pred C Confidence 2 No 230 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=48.85 E-value=0.66 Score=21.58 Aligned_cols=292 Identities=13% Similarity=0.103 Sum_probs=113.3 Q ss_pred hcccccccCccccchhHHHHHHHHHHhccchhh-hcc-----------eeeec-cCceEEEEecCCCccccccccccccc Q lcl|NC_010583. 162 NGSSSVSMSSEAYETIFSTRIIRDLQKELVVGA-LFD-----------ELPMS-SKILTMLVEPEAGRATWVDASKFGTD 228 (458) Q Consensus 162 ~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~-~~~-----------~~~~~-~~~~~~p~~~~~~~a~~v~e~~~~~e 228 (458) +..+....+.......++..++......++... +.. -+.-. +..+++.... .-...+|... + T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~-~L~g~gv~Gd----~ 75 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSV-HLRGKPTYGD----A 75 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeee-ecccCCcccC----c Confidence 223333333334345555566555544444333 321 01101 1112221111 1111112111 1 Q ss_pred ccccccccccceeeeeehhheeeeehhhHHHHh-ccHHHHHHHHHHHHHHHHHHHHHHHHh-ccCC---CCc-------c Q lcl|NC_010583. 229 ETVGDEVKGQLTEISFKTYKLAAKSFITDETEE-DAIFSLLPLLRKRLIEAHAVSIEEAFM-SGNG---TGQ-------P 296 (458) Q Consensus 229 ~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds~~~~~~~i~~~la~~~~~~~d~~~l-~G~g---~~~-------p 296 (458) .-...+....|.+-++.+.-+.+-+.....+-+ -++++|...-++.|..-+....|..++ +-.| .+. + T Consensus 76 ~leGnee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~ 155 (364) T protein:vir:93 76 RVEGKEESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDF 155 (364) T ss_pred eeeccccceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCc Confidence 122344566676666666666555544333333 368899999999999999999999665 2222 221 1 Q ss_pred ccccccc--cccccce------eeccccchhhHHHHHHHHHHHhhhhhhhc----------------ccceeEechhHHH Q lcl|NC_010583. 297 KGLLKLA--ADDGAKV------VTEAKADGSVLVTAKTISKLRRKLGRHGL----------------KLSKLVLIVSMDA 352 (458) Q Consensus 297 ~Gi~~~~--~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~ 352 (458) +++.... ..+..-+ ......+.++..+...+..+...+..... ..-..++||..+. T Consensus 156 ~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~ 235 (364) T protein:vir:93 156 TGYAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQAT 235 (364) T ss_pred ccccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhh Confidence 1111100 0000000 01112222333444444444333222110 1114677888888 Q ss_pred HHHhhhcccccccccc-----ccccccccccCCeeecccceecccccc---cccCCceEE---EEEe-ceEEE--Eecce Q lcl|NC_010583. 353 YYDLLEDEEWQDVAQV-----GNDAVKLQGQVGRIYGLPVVVSEYFPA---KAASAEFAV---IVYK-DNFVM--PRQRA 418 (458) Q Consensus 353 ~l~~~~d~~~~~~~~~-----~~~~~~~~~~~~~l~G~pv~~~~~~~~---~~~~~~~~~---~~~~-~~~~i--~~~~~ 418 (458) .|....++.|+-+... +...+.-.|..+++.|.+|+-...++. .++++...+ +..| .++.+ +...+ T Consensus 236 ~Lr~~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGaQA~~~a~g~~~g 315 (364) T protein:vir:93 236 DMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANG 315 (364) T ss_pred hhhhcCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccccccCccccchhhheecceeeEEEeecCCC Confidence 8875444333322211 111122335567778888875544431 112222111 1111 12222 22233 Q ss_pred eEEee-ccccc-CCceEEEEEEeeccE-Eecc---cceEEEEeecC Q lcl|NC_010583. 419 VTVER-ERQAG-KQRDAYYVTQRVNLQ-RYFE---NGVVSGAYAAA 458 (458) Q Consensus 419 ~~i~~-~~~~~-~~~~~~~~~~r~d~~-~~~~---~afv~l~~aaa 458 (458) ++-.+ +..+. .|...+-+..-+|++ ...+ -++..+..+|. T Consensus 316 ~~~~w~Ee~~D~gn~~~i~~~~i~G~kK~rF~~~DfGvi~idtaa~ 361 (364) T protein:vir:93 316 LRFDWEETVKDYGNEPAIAAGFIAGMKKARFNNKDFGVISIDTAAK 361 (364) T ss_pred CCceeeecccCCCCchhhhhhhHhhhhhcccCCccceEEEeccccc Confidence 33321 11111 122222221122221 1111 11222222222 No 231 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=46.94 E-value=0.72 Score=21.37 Aligned_cols=357 Identities=12% Similarity=0.068 Sum_probs=117.4 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccch Q lcl|NC_010583. 70 VKNLDEKSKKSAELFA-QTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDV 148 (458) Q Consensus 70 i~~~~e~~~~~~e~~~-~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~ 148 (458) ++...+..+......+ +...+.. ..-++ .-.....+...+... .+....++....+|..++.+-.. T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~-------~~~~~-----~~~a~l~enq~~~~~-~~~~~~~~~~~~~~~~~l~ea~~ 67 (528) T protein:vir:66 1 MKTTKELMEKWSPLLENEKLPEIA-------TASKQ-----KLVAKILESQEADFA-VDPIYKDEKVVEAFGGFIAEAEV 67 (528) T ss_pred CcchHHHHHHhHHhhcCCCcchhc-------chhhh-----hhhhhhhhhhHHHhh-cccchhhHHHHHhhhhhhhhhcc Confidence 1111111111111100 0000000 00000 000000000000000 00011111222222222211100 Q ss_pred hHH-HHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCc-------eEEEEec-------- Q lcl|NC_010583. 149 FET-EHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKI-------LTMLVEP-------- 212 (458) Q Consensus 149 ~~~-~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-------~~~p~~~-------- 212 (458) ... ..........+.++.... .=|.. .. ++++.-+..+..+++-|-||+++. ..|+-.. T Consensus 68 ~~~~~~~~~~i~es~~t~~v~~---~~P~L-i~-lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~~eA 142 (528) T protein:vir:66 68 AGDHGYDASQIAAGQTTGAITN---VGPAV-IG-MVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGAREA 142 (528) T ss_pred cccccccchhcccccccccccc---CchhH-HH-HHHHHHHhhhhhhhheeecCCchhhhheeeeeeecCCccccccccc Confidence 000 000000000011111111 11211 11 333344455555666666665520 0000000 Q ss_pred ------------------------------------C---CC--------------------------c----------- Q lcl|NC_010583. 213 ------------------------------------E---AG--------------------------R----------- 216 (458) Q Consensus 213 ------------------------------------~---~~--------------------------~----------- 216 (458) . +. . T Consensus 143 fh~~~g~ea~fsea~t~~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~ 222 (528) T protein:vir:66 143 FHPMYAPDAFHSSLAAKEATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTGDSVTPQKVGSESEDEVVMK 222 (528) T ss_pred ccccccccccccccccccccccCCccceeecccccccccccceeeecccccceeeeccccccccccCccccccccccccc Confidence 0 00 0 Q ss_pred ------cccccccc--cccc---------ccccccccccceeeeeehhheeeeehhhHHHHhc----cHHHHHHHHHHHH Q lcl|NC_010583. 217 ------ATWVDASK--FGTD---------ETVGDEVKGQLTEISFKTYKLAAKSFITDETEED----AIFSLLPLLRKRL 275 (458) Q Consensus 217 ------a~~v~e~~--~~~e---------~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~d----s~~~~~~~i~~~l 275 (458) .+-++.+- ..+| +...++-..+++.++++++..+-...+|-||.+| ...|.+++|.+-| T Consensus 223 ~~a~~~~~~~~~Gm~Ta~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNIL 302 (528) T protein:vir:66 223 LIEEGKLAEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAIL 302 (528) T ss_pred ccccccceecccccchhhhhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHHHHHHH Confidence 00000000 0000 1113344455677777888788888999999998 2478999999999 Q ss_pred HHHHHHHHHHHHhcc---CCCCcccccccc----ccccccceeecccc-chhhHHHHHHH----HHHHhhhhh-hhccc- Q lcl|NC_010583. 276 IEAHAVSIEEAFMSG---NGTGQPKGLLKL----AADDGAKVVTEAKA-DGSVLVTAKTI----SKLRRKLGR-HGLKL- 341 (458) Q Consensus 276 a~~~~~~~d~~~l~G---~g~~~p~Gi~~~----~~~~~~~~~~~~~~-~~~~~~~~~~~----~~~~~~~~~-~~~~~- 341 (458) +..|...|++.||.- ...-.-+|+... +............+ -+.. ..+..| -.....+.. ..+.. T Consensus 303 StEImlEINREii~~i~~~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~-e~~k~L~~~i~~~an~I~~~T~r~~g 381 (528) T protein:vir:66 303 ANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAG-ESFKSLIYQIDKEAAEIARQTGRGAG 381 (528) T ss_pred HHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeecccccccccchhHH-HHHHHHHHHHHHHHHHHHHhhccccc Confidence 999999999999631 111011121110 11111111111111 1111 111222 222222222 12223 Q ss_pred ceeEechhHHHHHHhhh--cccccccccccccccccc-ccCCeeec-ccceecccccccccCCceEEEEEe-ceEEEEec Q lcl|NC_010583. 342 SKLVLIVSMDAYYDLLE--DEEWQDVAQVGNDAVKLQ-GQVGRIYG-LPVVVSEYFPAKAASAEFAVIVYK-DNFVMPRQ 416 (458) Q Consensus 342 ~~~~~~~~~~~~l~~~~--d~~~~~~~~~~~~~~~~~-~~~~~l~G-~pv~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~ 416 (458) .-.++++.....|.... ++...+..+.....+... -..|.|.| ++|++..+.|. +.+++++. +.- T Consensus 382 n~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~KG~~~----- 451 (528) T protein:vir:66 382 NFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQ-----DYFTVGYKGDNE----- 451 (528) T ss_pred cEEEEchHHHHHHhhccccccccccccccccccCCCCceeEEEecCceEEEecCCCCc-----ceEEEEEeCCcc----- Confidence 34566777777776532 222222111111111111 11245554 88988877652 23333321 110 Q ss_pred ceeEEeecccc----------cCCceEEEEEEeeccEEecccceEEEE-ee-cC Q lcl|NC_010583. 417 RAVTVERERQA----------GKQRDAYYVTQRVNLQRYFENGVVSGA-YA-AA 458 (458) Q Consensus 417 ~~~~i~~~~~~----------~~~~~~~~~~~r~d~~~~~~~afv~l~-~a-aa 458 (458) ..-.+.+.||. .+-+-.+-...|+++. ++| |+.-+ -+ .+ T Consensus 452 ~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~-vNP--~~~~~~~~~~~ 502 (528) T protein:vir:66 452 MDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIG-INP--FADSKSQEPSA 502 (528) T ss_pred cccceeecccccceeeEeeCCccccceeeeeeeecee-ecC--cccccCccccc Confidence 00011222221 1222222233344432 233 22211 00 11 No 232 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=45.25 E-value=0.78 Score=21.19 Aligned_cols=273 Identities=12% Similarity=0.050 Sum_probs=105.5 Q ss_pred ccCccccchhHHHHHHHHHHhccc--hhhhc--ceeeeccCceEEEEecCCCccccccccc--cccccccccccccccee Q lcl|NC_010583. 168 SMSSEAYETIFSTRIIRDLQKELV--VGALF--DELPMSSKILTMLVEPEAGRATWVDASK--FGTDETVGDEVKGQLTE 241 (458) Q Consensus 168 ~~g~~~ip~~~~~~ii~~~~~~~~--l~~~~--~~~~~~~~~~~~p~~~~~~~a~~v~e~~--~~~e~~~~~~~~~~f~~ 241 (458) .+..+-..+.+...+.+.+...+. .+... .+.-.++...+||......-..+-..+. +.. .+.+.++.. T Consensus 1 Mantl~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~g~~~~~-----g~v~~~~et 75 (312) T protein:vir:10 1 MANTLAYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGSANAYVG-----GDVKFEYET 75 (312) T ss_pred CCcchhHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeecccccccccccCCcccc-----cccccccee Confidence 111122335555555444443322 11111 1223556778888766543332221111 111 122345556 Q ss_pred eeeehhheeeeehhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccceeeccccchh Q lcl|NC_010583. 242 ISFKTYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEAFMSGNGTGQPKGLLKLAADDGAKVVTEAKADGS 319 (458) Q Consensus 242 v~~~~~k~~~~~~is~ell~ds--~~~~~~~i~~~la~~~~~~~d~~~l~G~g~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 319 (458) .+++..+--.+.-=... .+.+ ...+...+.+...+.+.=.+|...+.- |...+...+.........+.+ T Consensus 76 ~tl~qDR~~~F~vD~mD-vDETn~~~s~anv~~ef~r~~vvPEiDayrfsk--------la~~a~~~~~~~~~~~~~~~T 146 (312) T protein:vir:10 76 KTMTQDRGRKFTLDAMD-VDETNFLVTATTVMGEFQRLKVIPEIDAYRLSR--------LATIAIGIKGDTNVEYSYSVN 146 (312) T ss_pred EEeeecccceeeccccc-hhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHH--------HHhhhhccccccccccccccC Confidence 66666654433210111 1112 133455555556666666777765531 111111111100000011112 Q ss_pred hHHHHHHHHHHHhhhhhhhcc-cceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccceeccc--ccc- Q lcl|NC_010583. 320 VLVTAKTISKLRRKLGRHGLK-LSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEY--FPA- 395 (458) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~--~~~- 395 (458) ....+..+..+...+.....+ +-..++.|.....|.. ...+.............+..+.|.|.||+.++. |.+ T Consensus 147 ~~ni~~~i~~~~~~lde~~vp~~rvl~vTp~~~~lLk~---~~~~~~~~~~~~~~~i~~~V~~iDgv~Ii~VPs~r~~t~ 223 (312) T protein:vir:10 147 SSTIINKIKTGIKIIRENGYNGPLVCHLTYDSMFAIEE---KVLEKLTAVTFAQGGIQTQVPSIDGCALIKTPQNRMYSS 223 (312) T ss_pred HHHHHHHHHHHHHHHHHccCCCceEEEeChHHHHHHhh---hhhceecccccccceeeeeeeeecccEEEEchhhhccce Confidence 233445555666666664433 2234556665555543 222222122222333455567899999996543 210 Q ss_pred -----c--------------ccCCceEEEEEece-EEEEecceeEEeecccccCCceEEEEEEe--eccEEecccc-eEE Q lcl|NC_010583. 396 -----K--------------AASAEFAVIVYKDN-FVMPRQRAVTVERERQAGKQRDAYYVTQR--VNLQRYFENG-VVS 452 (458) Q Consensus 396 -----~--------------~~~~~~~~~~~~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~r--~d~~~~~~~a-fv~ 452 (458) + ++.....++...+. +.+..-..+++. +|......-+|....| .|.-|.+.+. -+. T Consensus 224 ~~f~dG~t~~~~~gg~~~~~~ak~INfiiv~~~a~i~~~K~~~~~if-~P~~~~~~d~~~~~~R~Y~D~fv~~nk~~~Iy 302 (312) T protein:vir:10 224 ILLNDGTTSNQTAGGYLKGTKALDTNFIIAPVDVPLAITKQDKMRIF-DPETNQTANAWSMDYRRYHDLWVTDNKANSVY 302 (312) T ss_pred eeeccCcccccccCceeecCcccccceEEeCCceeeceeeeeeeeee-CCCCCCCcceeeeeeeeeeeeeeeccccCeEE Confidence 0 00000111111111 111111122221 1222222223444444 4555555532 224 Q ss_pred EEeecC Q lcl|NC_010583. 453 GAYAAA 458 (458) Q Consensus 453 l~~aaa 458 (458) +.++.| T Consensus 303 v~~k~a 308 (312) T protein:vir:10 303 ANFKDA 308 (312) T ss_pred EEeecc Confidence 455555 No 233 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=41.60 E-value=0.92 Score=20.78 Aligned_cols=299 Identities=15% Similarity=0.044 Sum_probs=135.4 Q ss_pred hhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce Q lcl|NC_010583. 127 QDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL 206 (458) Q Consensus 127 ~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 206 (458) +........+..|..++.. ......+.. .. ....+.|...+...+.+.+.+.+-+++..+++++.--.. T Consensus 1 m~~~M~~~tr~~~~~y~~~---------~A~~ngv~~-~~-~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~G 69 (358) T protein:vir:78 1 MSQTLTVQAEQRLNKYCDA---------LAKAYGIDI-SK-LDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKG 69 (358) T ss_pred CcccccHHHHHHHHHHHHH---------HHHHhCCCh-hH-ccceeeeChHHHHHHHHHHHHHHHHhhcCccccccccee Confidence 1111122223333333221 000001100 01 122455666677778889999999999999999885443 Q ss_pred -EEEEecCCCcccccccccccccccccccccccceeeeeehhheeeeehhhHHHHhccH-----HHHHHHHHHHHHHHHH Q lcl|NC_010583. 207 -TMLVEPEAGRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAKSFITDETEEDAI-----FSLLPLLRKRLIEAHA 280 (458) Q Consensus 207 -~~p~~~~~~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~-----~~~~~~i~~~la~~~~ 280 (458) ++....+++-++-..- ..+.....++.-.+...+.---+.|+.+.|+... .+|...+++.+.+.++ T Consensus 70 e~v~lg~~g~iagrt~t--------r~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~A 141 (358) T protein:vir:78 70 QVVQVGVGQLYTGRKKG--------GRFKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFA 141 (358) T ss_pred eEEeecCCcccceecCC--------CccccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHh Confidence 3333444554433221 1112222333344444444445678888888653 2699999999998888 Q ss_pred HHHHHHHhccCCC----C---c------cccccccccc------------cc-cceeeccccchhhHHHHHHHHHHH-hh Q lcl|NC_010583. 281 VSIEEAFMSGNGT----G---Q------PKGLLKLAAD------------DG-AKVVTEAKADGSVLVTAKTISKLR-RK 333 (458) Q Consensus 281 ~~~d~~~l~G~g~----~---~------p~Gi~~~~~~------------~~-~~~~~~~~~~~~~~~~~~~~~~~~-~~ 333 (458) .-.=.--+||+.. + . .+|++...-. .+ .....+.++++...+ ..+.+++ .. T Consensus 142 LD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLD--alV~D~~~~l 219 (358) T protein:vir:78 142 LDMLRVGWNGVSAADDTDPTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLD--EMASDLINTT 219 (358) T ss_pred hccceecccceeeccCCChhhCcCccccchHHHHHHHhhchhhhhccccccCceeecCCCCCccccHH--HHHHHHHhcc Confidence 7766667777631 1 1 3566643211 11 111111112221111 1122333 34 Q ss_pred hhhhhccc--ceeEechhHHH--HHHhhhccccccccccccccccccccCCeeecccceecccccccccCCceEEEEEec Q lcl|NC_010583. 334 LGRHGLKL--SKLVLIVSMDA--YYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKD 409 (458) Q Consensus 334 ~~~~~~~~--~~~~~~~~~~~--~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~ 409 (458) +++.++.. .+.+|-..... ++..+. ..+.|....- +. ....++-|+|.+..+++|.. -+++--++ T Consensus 220 I~~~~~~d~dLVvivG~dLla~k~~~l~n-~~~~pTE~~A--a~---~i~k~iGGlpa~~~PfFP~~-----~ilVT~L~ 288 (358) T protein:vir:78 220 IDPLFQQDPRLVVLVGTDLVAAAQAKLYS-EATKPSEQIA--AQ---QLAKSIAGRKAYIPPFFPGK-----RMVVTTLD 288 (358) T ss_pred CChHHhcCCCEEEEEchhhhhHHhhhHhh-cCCCcHHHHH--HH---HHHHHhCCCeEEEccccCCC-----ceEEeecc Confidence 45555554 34445444433 232222 2223332211 11 11146789999999999952 23333344 Q ss_pred eEEEEecc-ee--EEeec-------ccccCCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 410 NFVMPRQR-AV--TVERE-------RQAGKQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 410 ~~~i~~~~-~~--~i~~~-------~~~~~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) ++.|.... .. .+... .|- +-...|.++..--+..++.-.|......|. T Consensus 289 NLsIY~Q~gs~RR~~~d~p~r~riE~y~-s~Ne~YvVEd~~~~a~iE~i~v~~~~~pa~ 346 (358) T protein:vir:78 289 NLHCYTQRGTRKRKADDNQDSKSFDNQY-WRMEGYALGEHKAYGGFEEADIEIGADPAV 346 (358) T ss_pred ccEEEEecCcEEEEEEeccccccccchh-hhcceeeeeccccEEEEeeeeeeeCCCCCc Confidence 44333222 11 12111 121 222344444433334444333332222222 No 234 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=40.26 E-value=0.98 Score=20.63 Aligned_cols=300 Identities=14% Similarity=0.080 Sum_probs=108.9 Q ss_pred hhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhcc--chhh Q lcl|NC_010583. 117 DSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKEL--VVGA 194 (458) Q Consensus 117 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~--~l~~ 194 (458) ....+....... ...++...+|.. .....-.+-.+|+.+--+.+.++|..+..... .+.+ T Consensus 1 ~~~~~n~~~~~~-~~~e~~~Ks~tt-----------------gy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~ 62 (464) T protein:vir:80 1 MTEKKNTERQLT-SVQEEVIKGFTT-----------------GYGITPESQTDAAALRREFLDDQITMLTWADGDLSFYR 62 (464) T ss_pred CCcchhhHhhcC-cccHHHHHHHHh-----------------CCccCcccccCcchhhhhhhhhhhheeeecccchhhhh Confidence 000000000000 000000011110 01111112222333333344444433322211 2233 Q ss_pred hcceeeeccCceEEEEecCC---Ccccccccccccccccccccccccceeeeeehhheeee--ehhhHHHHhccHHHHHH Q lcl|NC_010583. 195 LFDELPMSSKILTMLVEPEA---GRATWVDASKFGTDETVGDEVKGQLTEISFKTYKLAAK--SFITDETEEDAIFSLLP 269 (458) Q Consensus 195 ~~~~~~~~~~~~~~p~~~~~---~~a~~v~e~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~--~~is~ell~ds~~~~~~ 269 (458) -....|..+-...|-..... ..+.+++|+ ..++.++|++...+...+-+... +-|-..+ .++..+-.. T Consensus 63 di~k~~a~STV~~y~~~~~~G~~g~~~f~~E~------g~~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~l-vn~~~d~~~ 135 (464) T protein:vir:80 63 DITKRPATSTVAKYDVYLAHGRVGHTRFTREI------GVAPISDPNLRQKTVNMKYVSDTKNMSIATGL-VNNIEDPMR 135 (464) T ss_pred hcCCchhhhhhhhhheeeccCccccccccccc------cccccCCCceEEEEEEeeeeecceeeeeehhh-hcchhhHHH Confidence 33444444444444433322 334444554 56677889999888776644432 2333333 344556666 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCC---C-------ccccccccccccccceeeccccchhhHHHHHHHHHHHhhhhhhhc Q lcl|NC_010583. 270 LLRKRLIEAHAVSIEEAFMSGNGT---G-------QPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGL 339 (458) Q Consensus 270 ~i~~~la~~~~~~~d~~~l~G~g~---~-------~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 339 (458) .+.++--..++..++.+.|+|+.. + +..||.+.....+. . ...+.. .+...+-.+......+|. T Consensus 136 ~~~~dai~~va~tiE~a~FyGds~l~~~~~~~~gleFDGl~~lI~~~NV--i-DarG~~---Ls~~~ln~Aa~~i~~~fG 209 (464) T protein:vir:80 136 ILTDDAISVVAKTIEWASFYGDSDLSENPDAGSGLEFDGLAKLIDKHNV--L-DAKGAS---LTEALLNQASVLVGKGYG 209 (464) T ss_pred HHHHHHHHHHHHHHHHHHhhhccccCCCCCCccccchhhhHhhcCCCce--e-ecCCCC---cCHHHHhhhhhhhhcccC Confidence 777777788999999999999742 1 45566655433221 1 122221 112233333344455777 Q ss_pred ccceeEechhHHHHH-HhhhccccccccccccccccccccCCeeecccce--ecccccccccCCceEEEEE-----eceE Q lcl|NC_010583. 340 KLSKLVLIVSMDAYY-DLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVV--VSEYFPAKAASAEFAVIVY-----KDNF 411 (458) Q Consensus 340 ~~~~~~~~~~~~~~l-~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~--~~~~~~~~~~~~~~~~~~~-----~~~~ 411 (458) .+...+|+..+.+.+ +..-+.+-+... ... .+...|+||- ++.. ..+.-.. ..++.+ .+.- T Consensus 210 t~TD~~lp~~v~a~f~n~~l~~q~~~~~--~n~-------~~~~~G~~v~~f~sa~-G~i~L~~-s~~m~~~~~ld~~~~ 278 (464) T protein:vir:80 210 TPTDAYMPIGVQADFVNQQLDRQVQVIS--DNG-------QNATMGFNVKGFNSAR-GFIRLHG-STVMELEQILDENRM 278 (464) T ss_pred ChhhcccchhHHHHHHhhhcCceeEEEc--CCC-------Ccceeeeecccccccc-cceeccC-ccccCcccccccccc Confidence 777788888877554 443333322221 111 1124455542 1110 0000000 000000 0000 Q ss_pred ---EEEecceeEEeeccccc-CCceE-EEEEEeeccEEecccceE-EEEeecC Q lcl|NC_010583. 412 ---VMPRQRAVTVERERQAG-KQRDA-YYVTQRVNLQRYFENGVV-SGAYAAA 458 (458) Q Consensus 412 ---~i~~~~~~~i~~~~~~~-~~~~~-~~~~~r~d~~~~~~~afv-~l~~aaa 458 (458) .......+....+.... ..... ..+...+-+.+.+..+=- ..+...+ T Consensus 279 ~~~~apaapsvt~tv~~~~~g~f~~~~~~~~~~Ykv~~vn~~GeS~ps~~~~~ 331 (464) T protein:vir:80 279 QLPNAPQKATVKATLEAGTKGKFRDEDLTIDTEYKVVVVSDDAESAPSDVASV 331 (464) T ss_pred cCCCCcCCceeEEEecCCcccCCccccccceeEEEEEEECCCCccccceeeee Confidence 00001111111111100 00000 000001111111111100 0011111 No 235 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=36.06 E-value=1.2 Score=20.16 Aligned_cols=318 Identities=14% Similarity=0.021 Sum_probs=122.9 Q ss_pred hhhhhhhhcchhhhhhHHHHHHHHHhhhccchh---HHHH----HHH---------------H-hhhh-hcccccccCcc Q lcl|NC_010583. 117 DSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVF---ETEH----GKA---------------H-IKAV-NGSSSVSMSSE 172 (458) Q Consensus 117 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~---~~~~----~~~---------------~-~~a~-~~~~~~~~g~~ 172 (458) ..+.-...+.. ....+.....|...+.++-.. +.-. ... . .... +.....++..+ T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~PN~~~p~l~~i~~g~~~~~~~~t~~w~~d~l~~~~~~~ta~~~a~~T~i 79 (418) T protein:vir:96 1 MSVYAGIFNTT-LNPQELNMKSFAGTILRRVPNGSAPLLAMTSVVGSTTAKASTHGYFSKTMVFASAVVTAEALADATVL 79 (418) T ss_pred CceeeeecccC-CChhhhchhhhhhhhhhhcCCcccchhhhhcccCccccceeEEEEEeeEeeeeeEEEEEEEecCceEE Confidence 11111111110 011111122222222111000 0000 000 0 0000 01111122224 Q ss_pred ccchhHHHHHHHHHHhccch-----hhhcceeeeccCceEEEEecCCCccccccccc-------ccccccccccccccce Q lcl|NC_010583. 173 AYETIFSTRIIRDLQKELVV-----GALFDELPMSSKILTMLVEPEAGRATWVDASK-------FGTDETVGDEVKGQLT 240 (458) Q Consensus 173 ~ip~~~~~~ii~~~~~~~~l-----~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e~~-------~~~e~~~~~~~~~~f~ 240 (458) .+++.- . ++....+ ..+..+..+.+..++.-+...+..+..+..+. ..+|++.. ++ T Consensus 80 ~V~~~~---~---f~~~~l~~~~~~~EvirVtsVng~~lTV~RG~~~t~aa~iaag~~~~~ig~~~eEGsd~------~t 147 (418) T protein:vir:96 80 TVENSD---G---LTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRIAAAIIAANTKLIVIGTAFEEGSQR------PT 147 (418) T ss_pred EecCCc---c---cccccEEEEecCCeEEEEEEEeCCEEEEEEccCCeeeeeeecCceEEEeecCccccccc------CC Confidence 444321 1 2233322 23345555666666666666665444444332 23344332 22 Q ss_pred eeeeehhheeeeehhhHHHHhccHHH-----------HHHHHHHHHHHHHHHHHHHHHhccC---C--CCcc-------- Q lcl|NC_010583. 241 EISFKTYKLAAKSFITDETEEDAIFS-----------LLPLLRKRLIEAHAVSIEEAFMSGN---G--TGQP-------- 296 (458) Q Consensus 241 ~v~~~~~k~~~~~~is~ell~ds~~~-----------~~~~i~~~la~~~~~~~d~~~l~G~---g--~~~p-------- 296 (458) .....+..+.-+..|-++...-|... +.....+.|.+. ...++.+++.|. | ++.| T Consensus 148 a~~~k~~~vsN~tQIf~e~vsVSgTAqA~v~qaGvsn~~~~e~d~l~~~-kv~iE~ali~g~~~~~~~ng~p~~~t~R~m 226 (418) T protein:vir:96 148 ARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDFH-ATEQETAIFFGQAFMGTYNGQPLHTTQGIV 226 (418) T ss_pred cceecceeccchhheehhhhhhhhhhhhhhhhcCcchhHHHHHHHHHHH-HHHHHHhhhccccccCCCCCcccccccchh Confidence 22333444444555555544433221 222223344444 446788888886 2 2223 Q ss_pred ccccccccccccceeeccccchhhHHHHHHHHHHHhhhhh---hhcccc-----eeEechhHHHHHHhhhcccccccccc Q lcl|NC_010583. 297 KGLLKLAADDGAKVVTEAKADGSVLVTAKTISKLRRKLGR---HGLKLS-----KLVLIVSMDAYYDLLEDEEWQDVAQV 368 (458) Q Consensus 297 ~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~-----~~~~~~~~~~~l~~~~d~~~~~~~~~ 368 (458) .||+...+... ....+. ...+.+.+.++....-. ....+. ..++++.....+.++-. .-+..-+. T Consensus 227 ~gI~~f~~~Nv---i~ag~~---~~~t~d~L~~~~~~a~~~g~n~G~~~~~~~y~~~V~a~~k~~I~k~~~-~I~~~~~e 299 (418) T protein:vir:96 227 DAIRQYAPDNV---NAMPNP---TAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFFG-EVTVTQRE 299 (418) T ss_pred HHHHhhccccc---cccCCC---CcCCHHHHHHHHHHHHhhcCCCCCcccceEEEEEeChHHHHHHhhhhc-eeEecccc Confidence 45555543221 111111 11233334443333211 111111 13567777788887642 22221111 Q ss_pred ccccccccccCCeeec-ccceecccccccccCCceEEEEEeceEEEEec--ceeEEeeccccc----------------- Q lcl|NC_010583. 369 GNDAVKLQGQVGRIYG-LPVVVSEYFPAKAASAEFAVIVYKDNFVMPRQ--RAVTVERERQAG----------------- 428 (458) Q Consensus 369 ~~~~~~~~~~~~~l~G-~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~~--~~~~i~~~~~~~----------------- 428 (458) ...+... -...+-+| ++|++++.+|+..--.+.+++.|.+.+.+..= .+.. .+.+.. T Consensus 300 n~~G~vv-~~~~Td~G~v~ii~n~~~pad~I~~g~mlVvD~~~vkL~yL~~R~~~--~E~l~k~G~~~~~~~~~~~~~~~ 376 (418) T protein:vir:96 300 TSYGMVF-TEWKFFKGRLIIKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAK--VENYGQGGGENKSGATDYSYGHG 376 (418) T ss_pred ceeceEE-EEEEeeccEEEEEecCCCCccccCcceEEEEecCceEEEEecCCCcc--chhcccCCCcccccccccccccc Confidence 1111110 01112235 68888888886543444566666664433321 1111 111111 Q ss_pred CCceEEEEEEeeccEEecccceEEEEe-ecC Q lcl|NC_010583. 429 KQRDAYYVTQRVNLQRYFENGVVSGAY-AAA 458 (458) Q Consensus 429 ~~~~~~~~~~r~d~~~~~~~afv~l~~-aaa 458 (458) .|...=.....+.+++.+|++.++++- .-| T Consensus 377 ~D~~~G~l~~Eltle~~N~~a~a~itgl~~~ 407 (418) T protein:vir:96 377 VDAQGGSLTSEWALELLNPQGCAVITGLQKA 407 (418) T ss_pred cccccCEEEEEEEEEeecccccEEeeccccc Confidence 122222245577789999999887652 223 No 236 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=34.53 E-value=1.3 Score=19.98 Aligned_cols=318 Identities=13% Similarity=0.099 Sum_probs=112.9 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhhhh------ccccc Q lcl|NC_010583. 94 TIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKAVN------GSSSV 167 (458) Q Consensus 94 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a~~------~~~~~ 167 (458) .. -++..+.. ..++..+.+....-.-.+....+.. ......+++. .-.+- T Consensus 1 ~~--~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~k~a~t~gy~~~~~~~ 56 (514) T protein:vir:10 1 MY--TQDKTKDI-------------MKKSFFGGDRAVAFDTNKEDILNEN---------LPENVKKSAFTAGHSITPDTQ 56 (514) T ss_pred CC--ccchhhHH-------------HhhhhcccceeeeecCcHHHHHHHh---------cchhhhhhhhccccccCCccc Confidence 00 00000000 0000000000000000000000000 0000111110 11112 Q ss_pred ccCccccchhHHHHHHHHHHhc--cchhhhcceeeeccCceEEEEecC---CCcccccccccccccccccccccccceee Q lcl|NC_010583. 168 SMSSEAYETIFSTRIIRDLQKE--LVVGALFDELPMSSKILTMLVEPE---AGRATWVDASKFGTDETVGDEVKGQLTEI 242 (458) Q Consensus 168 ~~g~~~ip~~~~~~ii~~~~~~--~~l~~~~~~~~~~~~~~~~p~~~~---~~~a~~v~e~~~~~e~~~~~~~~~~f~~v 242 (458) ++|+.+--+.+.+++..+.... -.+.+-....|+.+-...|-.... ...+.+++|+ ...+.++|++... T Consensus 57 t~gaAlR~EsLd~~l~~Lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~------gi~~~~d~~~~rk 130 (514) T protein:vir:10 57 TDGAANRIESLNRDLKVTTWGERDFTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEI------GIGDVNNPNERQR 130 (514) T ss_pred cCccchhhhhhccceeEeeecCcchhhhhhcCCchhhHHHhhhhhhcccCccccccccccc------ccCcCCCcceEEE Confidence 2222222222222222111111 111222222333333223322222 1233444444 4667789999999 Q ss_pred eeehhheeeeehhhHHH-HhccHHHHHHHHHHHHHHHHHHHHHHHHhccCC---C------Cccccccccccccccceee Q lcl|NC_010583. 243 SFKTYKLAAKSFITDET-EEDAIFSLLPLLRKRLIEAHAVSIEEAFMSGNG---T------GQPKGLLKLAADDGAKVVT 312 (458) Q Consensus 243 ~~~~~k~~~~~~is~el-l~ds~~~~~~~i~~~la~~~~~~~d~~~l~G~g---~------~~p~Gi~~~~~~~~~~~~~ 312 (458) .+..+-++.-..+|.-+ +.++..+......+.--..++..++.++|+|+. + .+..||.+.....+. . T Consensus 131 ~~~~k~l~~~~~vS~~~~l~n~i~d~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~lI~~~Nv--I- 207 (514) T protein:vir:10 131 TINIKYIVDTHVTSIALQRANTIVDSLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFKLIAPENH--I- 207 (514) T ss_pred EEeeeeeeeeeeeeehhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHHhhcCCCe--E- Confidence 98888887665555432 345777888888888889999999999999974 2 135678777644332 2 Q ss_pred ccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhccccccccccccccccccccCCeeecccce--ec Q lcl|NC_010583. 313 EAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDAVKLQGQVGRIYGLPVV--VS 390 (458) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~l~G~pv~--~~ 390 (458) ...+.. .....+..+......+|..+...+|+..+.+.|..--.+.-|-+.+. .+. +...|.||- ++ T Consensus 208 DarG~~---Ls~~~ln~aA~~i~~gfGt~TD~ylp~~vka~f~~~~~~~qRV~~~~----n~~----~~~~G~~v~~f~s 276 (514) T protein:vir:10 208 DLRGGR---LSPAALNMAARKIGEGFGTPTDAYMPIGIKADFVNQHLNGQRVMLPG----QTG----GMTTGLDIDKFLS 276 (514) T ss_pred ecCCCC---ccHHHHhhhhhhhhcccCChhheeCchHHHHHHhhcccCcceEEeec----Ccc----ceeeeeeccceeE Confidence 222221 11223333333445567777788888888877654333222222111 100 112222221 00 Q ss_pred ccccccccCCceEEEEEeceEE--------EEecceeEEeecccc-----------cCC----------ceEEEEEEee- Q lcl|NC_010583. 391 EYFPAKAASAEFAVIVYKDNFV--------MPRQRAVTVERERQA-----------GKQ----------RDAYYVTQRV- 440 (458) Q Consensus 391 ~~~~~~~~~~~~~~~~~~~~~~--------i~~~~~~~i~~~~~~-----------~~~----------~~~~~~~~r~- 440 (458) .. +........ ++...+.+- -.....+.+...++. .++ ...|++...- T Consensus 277 ~~-G~I~L~gs~-im~~~n~L~~~~~~~~~Ap~~~~va~svT~~~~g~~~~ad~t~~~g~~~~~~~~g~~~sYaVv~~n~ 354 (514) T protein:vir:10 277 AH-GSIRIQGST-IMDSDNKLDFDRPVSPTAPTAPQLSATVTPDGGGLWHEADKTDSKGEVILNKEVGVEQSYVAVMVSR 354 (514) T ss_pred ec-cceeecCCe-eecccccCccCCccCCcCCCCCcceEEEecCcccccCcccccccccccccccccceeEEEEEEEECC Confidence 00 000000000 000000000 000000111111110 011 1112222111 Q ss_pred ccEEecccceEE-----------EEeec-C Q lcl|NC_010583. 441 NLQRYFENGVVS-----------GAYAA-A 458 (458) Q Consensus 441 d~~~~~~~afv~-----------l~~aa-a 458 (458) ++.- -|..++- |+... + T Consensus 355 ~GeS-~ps~~vtaT~a~~~~~i~ltItp~~ 383 (514) T protein:vir:10 355 HGDS-RPSLVQTATPTKKDDAITLTITPNA 383 (514) T ss_pred CCcc-cccceeeeeeeccCceEEEEEEecc Confidence 1111 2222222 22221 0 No 237 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=33.87 E-value=1.3 Score=19.91 Aligned_cols=349 Identities=13% Similarity=0.081 Sum_probs=115.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccc Q lcl|NC_010583. 70 VKNLDEKSKKSAELFAQTVEKQQETIVGLQD--EIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKD 147 (458) Q Consensus 70 i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~--~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~ 147 (458) +.-..++ -.++...-++. +. ++.... ++.-.....+...+..... ..+....--.++.+.+.+ T Consensus 1 ~~~~~~~----------l~~kw~p~l~~-~~~~~i~~~~-~~~~~a~l~enq~~~~~~~-~~~~~~~~~e~~~~~l~e-- 65 (529) T protein:vir:10 1 MSLKTKE----------ILNKWTPLLEG-EGLPEIAGKN-KQALVAQILEAQEKDSKTD-PVYRDDKLIEAFGQSLME-- 65 (529) T ss_pred CccchHH----------HHHHhhHhhcC-Cccchhcchh-hhhhhhhhhhhHHHHhhcc-cccchhhhhhhhhhccch-- Confidence 0000000 00010000000 00 000000 0000000000000000000 000000000111111111 Q ss_pred hhHHHHHHHH-hhhhh-ccccccc-CccccchhHHHHHHHHHHhccchhhhcceeeeccCce-------EEEEecCC--- Q lcl|NC_010583. 148 VFETEHGKAH-IKAVN-GSSSVSM-SSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-------TMLVEPEA--- 214 (458) Q Consensus 148 ~~~~~~~~~~-~~a~~-~~~~~~~-g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-------~~p~~~~~--- 214 (458) .+....+ ....+ ..+++++ -...=|. +. .+++..-+..+..+++-|-||+++.. +|+..... T Consensus 66 ---~~~~~~~~~~~~~ia~s~~t~~v~~~~P~-Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~g 140 (529) T protein:vir:10 66 ---AEVAGDHGYDPTNIAAGQSSGAITNIGPA-VI-GMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGA 140 (529) T ss_pred ---hhcccccccccccccccccccccccccch-hh-hhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCcc Confidence 0000000 00000 0011111 0111121 11 13333334444555555555554321 11110000 Q ss_pred -----------------------------------------------------------------Ccc------------ Q lcl|NC_010583. 215 -----------------------------------------------------------------GRA------------ 217 (458) Q Consensus 215 -----------------------------------------------------------------~~a------------ 217 (458) +.+ T Consensus 141 ~eaf~~~~e~dt~~SG~~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~ 220 (529) T protein:vir:10 141 KEAFHPMYAPDAWHSGLAAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDAL 220 (529) T ss_pred cccccccccccccccccccccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccc Confidence 000 Q ss_pred ----------ccccccccc-----------ccccccccccccceeeeeehhheeeeehhhHHHHhc----cHHHHHHHHH Q lcl|NC_010583. 218 ----------TWVDASKFG-----------TDETVGDEVKGQLTEISFKTYKLAAKSFITDETEED----AIFSLLPLLR 272 (458) Q Consensus 218 ----------~~v~e~~~~-----------~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~d----s~~~~~~~i~ 272 (458) +-.+++... ..+...++-..+++.++++++..+-...+|-||.+| ...|.+++|. T Consensus 221 ~~~~~a~~~~~~~~~gmsTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELs 300 (529) T protein:vir:10 221 VSAKIAAGELAEIAEGMATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELN 300 (529) T ss_pred cccccccccccccccccchhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHH Confidence 000000000 001123334556677777777777888999999998 2478999999 Q ss_pred HHHHHHHHHHHHHHHhc--------cC-CC----CccccccccccccccceeeccccchhhHHHHHHHHH----HHhhhh Q lcl|NC_010583. 273 KRLIEAHAVSIEEAFMS--------GN-GT----GQPKGLLKLAADDGAKVVTEAKADGSVLVTAKTISK----LRRKLG 335 (458) Q Consensus 273 ~~la~~~~~~~d~~~l~--------G~-g~----~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~ 335 (458) +-|+..|...|++.||. |. |- +...|++...... .....-+.. ..+..|.- ..+.+. T Consensus 301 NILStEImlEINReii~~i~~~a~~~~~g~~~~~~~~~gv~d~~~~~-----d~~~~~~~~-e~~~~L~~~i~~~an~I~ 374 (529) T protein:vir:10 301 GILANEVMLEINREVIDWINYTAQVGKSGWTQTVGSAAGVFDFQDPI-----DVRGARWAG-ESYKALLIQIDKEANEIA 374 (529) T ss_pred HHHHHHHHHHhhHHHHHHhhhhceeeeeeeeccccccccceeccccc-----cccccchhH-HHHHHHHHHHHHHHHHHH Confidence 99999999999999996 11 10 1122333221110 000111111 12222222 222222 Q ss_pred h-hhcccce-eEechhHHHHHHhhhccccccccccccccc---cc-cccCCeeec-ccceecccccccccCCceEEEEEe Q lcl|NC_010583. 336 R-HGLKLSK-LVLIVSMDAYYDLLEDEEWQDVAQVGNDAV---KL-QGQVGRIYG-LPVVVSEYFPAKAASAEFAVIVYK 408 (458) Q Consensus 336 ~-~~~~~~~-~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~---~~-~~~~~~l~G-~pv~~~~~~~~~~~~~~~~~~~~~ 408 (458) . ..+..+. .++++.....|.. .+...+|-.+....+. .. .-..|.|.| ++|++..+.|. +.+++++. T Consensus 375 ~~T~rg~~n~vi~S~~Va~~L~~-~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~K 448 (529) T protein:vir:10 375 RQTGRGAGNFIIASRNVVSALAL-VDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQ-----DYFTMGYR 448 (529) T ss_pred HhhccccceEEEEchHHHHHHhh-hccccccccccccccceeecCCceEEEEecCceEEEecCCCCc-----ceEEEEEe Confidence 2 2222333 4667777777762 2222222221111111 10 011244544 78888877652 23333322 Q ss_pred -ceEEEEecceeEEeecccc----------cCCceEEEEEEeeccEEecccceEEEEeec--C Q lcl|NC_010583. 409 -DNFVMPRQRAVTVERERQA----------GKQRDAYYVTQRVNLQRYFENGVVSGAYAA--A 458 (458) Q Consensus 409 -~~~~i~~~~~~~i~~~~~~----------~~~~~~~~~~~r~d~~~~~~~afv~l~~aa--a 458 (458) +.- ..-.+.+.||. .+-+-.+-...|+++. .+| |+..+.-+ + T Consensus 449 G~~~-----~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~-~NP--~~~~~~~~~~~ 503 (529) T protein:vir:10 449 GANN-----LDAGIYYCPYVALTPLRGSDPKNFQPVMGFKTRYAIG-VNP--FAESRTQAPTS 503 (529) T ss_pred CCcc-----cccceeeccccccccccccCCCcccceeeeeeeecee-ecC--ccccccccccc Confidence 110 00112222322 1222233334455432 233 33222111 1 No 238 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=33.03 E-value=1.4 Score=19.81 Aligned_cols=106 Identities=17% Similarity=-0.062 Sum_probs=51.2 Q ss_pred EechhHHHHHHh-hhc------cccccccccccccccccccCCeeecccceecccccccccCCc--eEEEEEec------ Q lcl|NC_010583. 345 VLIVSMDAYYDL-LED------EEWQDVAQVGNDAVKLQGQVGRIYGLPVVVSEYFPAKAASAE--FAVIVYKD------ 409 (458) Q Consensus 345 ~~~~~~~~~l~~-~~d------~~~~~~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~--~~~~~~~~------ 409 (458) +.....|+.+.. +-+ -+..+.+.. +-+.+++|+.-+.+.++|...+--- ..+.++.+ T Consensus 1 vvsdlqfA~~~g~~v~~~aLpRE~aNp~ltG--------~lpV~~~GltWl~tpnlpg~~a~vlDst~lGgmaDE~l~~P 72 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDKALPREQANIVLTG--------SLPVSAYGLTWVTSRHITGTDPWLFDVEQLGGMADEKLLSP 72 (123) T ss_pred CcchhhHHHHhcchhcccccccccCCceEec--------CcceeeeceeeeecCCCCCCccceeehhhhccccccccCCC Confidence 111111111110 000 011222211 2234578888888988884332111 11122222 Q ss_pred eEEEEecceeEEeeccccc--CCceEEEEEEeeccEEecccceEEEEeecC Q lcl|NC_010583. 410 NFVMPRQRAVTVERERQAG--KQRDAYYVTQRVNLQRYFENGVVSGAYAAA 458 (458) Q Consensus 410 ~~~i~~~~~~~i~~~~~~~--~~~~~~~~~~r~d~~~~~~~afv~l~~aaa 458 (458) .|.-.+..++++.+...-. +|+..+|+-------++.|.|.++++-..- T Consensus 73 gya~~~~~Gvevkt~Red~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 73 EFAPAGNTGVEASTERAHQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred cccCCCCcceeEEeeccccCCCCceEEeeeecceeEEecCccceEEeeecC Confidence 3444455567776654444 777777765333346788899998887666 No 239 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=32.59 E-value=1.4 Score=19.76 Aligned_cols=336 Identities=12% Similarity=0.039 Sum_probs=117.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHH-HHHHHhhhccchh Q lcl|NC_010583. 71 KNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEK-LVLLSYMMEKDVF 149 (458) Q Consensus 71 ~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~-~a~~~~~~~~~~~ 149 (458) -+ .+.+.+. ...+-..+...+....-...+.- .-++.+++- ..+.+.+. T Consensus 1 m~---------------~~~l~~~----w~~~l~~~~~~~i~~~~~~~~~~------~~lenq~~~~~~~~~~l~----- 50 (457) T protein:vir:10 1 MS---------------FQNLQEK----WAPVLEHDSLPEIGDSYKKGVVA------QLLENQEKAIAEEGKILT----- 50 (457) T ss_pred Cc---------------hHHHHHH----hhHhhccCccchhhhhHHHHHHH------HHhhhHHHHHHhcccccc----- Confidence 00 0000000 00000000000000000000000 000000000 00000000 Q ss_pred HHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCceEEEEec-------CC-------- Q lcl|NC_010583. 150 ETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKILTMLVEP-------EA-------- 214 (458) Q Consensus 150 ~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~p~~~-------~~-------- 214 (458) +.. ........++++.+-...-|.. .. ++++..+..+..+++-+-||+++..-|--.- +. T Consensus 51 -ea~--~~~g~~~~s~~t~~v~~~~P~L-i~-l~Rra~p~LIa~DIwGVQPmTgPTGLIFAmRsrY~~q~~~~~a~~~EA 125 (457) T protein:vir:10 51 -ETL--QTTGYTGGDTVTGPVAGFDPVL-IS-LIRRSMPQLIAYDIAGVQPMTGPTGLIFAMRTNYGAERNPAAAGYDEA 125 (457) T ss_pred -ccc--cccCCCcccccccccccccchh-hh-hhHHHHhhhhhhhcceeecCCCcceeeeeeeeeecCccccccccccce Confidence 000 0000000011111111112221 11 3344445566666777777766433221100 00 Q ss_pred ----Ccccccc--------------------------------------------cccccc---cccccccccccceeee Q lcl|NC_010583. 215 ----GRATWVD--------------------------------------------ASKFGT---DETVGDEVKGQLTEIS 243 (458) Q Consensus 215 ----~~a~~v~--------------------------------------------e~~~~~---e~~~~~~~~~~f~~v~ 243 (458) +...|-+ ++.... .+...++-..+++.++ T Consensus 126 l~nEadt~fSg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~gmsTA~aE~lgd~~~n~~f~EMaFsIeK~t 205 (457) T protein:vir:10 126 FFNEPNAGFSGGPGAYDPGATGVTNDAEGTNPALLNDSPAGTYEQADDATGMSTATVEALDDSTANTAFREMGFSIEKVT 205 (457) T ss_pred eeeccCcccCcccccccccccccccccccccccccCccccccccccccccchhhhhhhccCCCCCccchhhheeEEEEEE Confidence 0000000 000000 0001122233346777 Q ss_pred eehhheeeeehhhHHHHhc----cHHHHHHHHHHHHHHHHHHHHHHHHhcc--CCC--Cc-----cccccccccccccce Q lcl|NC_010583. 244 FKTYKLAAKSFITDETEED----AIFSLLPLLRKRLIEAHAVSIEEAFMSG--NGT--GQ-----PKGLLKLAADDGAKV 310 (458) Q Consensus 244 ~~~~k~~~~~~is~ell~d----s~~~~~~~i~~~la~~~~~~~d~~~l~G--~g~--~~-----p~Gi~~~~~~~~~~~ 310 (458) ++++..+-...+|-||.+| ...|.+++|.+-|+..|...|++.||.- +-+ +. +.|++... T Consensus 206 VtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~~~~~~~~~gv~dl~------- 278 (457) T protein:vir:10 206 VTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINREVVRTIYTNAVAGAQNNTATAGVFDLD------- 278 (457) T ss_pred EeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeccccccceeeeee------- Confidence 7777777788999999998 2477899999999999999999988853 000 11 23333321 Q ss_pred eeccccchhhHHHHHHH-H---HHHhhh-hhhhcccce-eEechhHHHHHHhhhcccccccccccc----ccccccccCC Q lcl|NC_010583. 311 VTEAKADGSVLVTAKTI-S---KLRRKL-GRHGLKLSK-LVLIVSMDAYYDLLEDEEWQDVAQVGN----DAVKLQGQVG 380 (458) Q Consensus 311 ~~~~~~~~~~~~~~~~~-~---~~~~~~-~~~~~~~~~-~~~~~~~~~~l~~~~d~~~~~~~~~~~----~~~~~~~~~~ 380 (458) ....+.+.... +..+ . +..... ...-+..+. .++++.+..+|..-.--.-.|...... ..+......| T Consensus 279 -~~~~g~~~~e~-~k~L~~~i~~ean~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~p~~~~~~~~~~~d~~~~~~~G 356 (457) T protein:vir:10 279 -VDSNGRWSVEK-FKGLLFQIERDANAIGHQTRRGKGNILICSADVVSALGMAGVLDYTPALNGNNGLAGVDDTSSTLVG 356 (457) T ss_pred -ccccchhhHHH-HHHHHHHHHHHHHHHHHhhccccceEEEEchhHHHHHhhcccccccchhhccccccccccccceeEE Confidence 11111222111 2222 1 222222 122233333 466777777776422111111111110 0112222245 Q ss_pred eeec-ccceecccccccccCCceEEEEEe-ceEEEEecceeEEeecccc----------cCCceEEEEEEeeccEEeccc Q lcl|NC_010583. 381 RIYG-LPVVVSEYFPAKAASAEFAVIVYK-DNFVMPRQRAVTVERERQA----------GKQRDAYYVTQRVNLQRYFEN 448 (458) Q Consensus 381 ~l~G-~pv~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~~~~~i~~~~~~----------~~~~~~~~~~~r~d~~~~~~~ 448 (458) .|+| ++|++..+.... +..+.+++++. +.- ..-.+.+.||. .+-+-.+-...|++. +.+|- T Consensus 357 ~l~~r~~vy~D~Ya~~n-s~~dy~~vG~KG~~~-----~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l-~~NP~ 429 (457) T protein:vir:10 357 TLNGRIKVYVDPYSANV-ADKHFYVAGYKGTSP-----YDAGLFYCPYVPLQQVRAINPDTFQPKIGFKTRYGM-VSNPF 429 (457) T ss_pred EecCCeEEEEecccccC-CccceEEEEEeCCcc-----eecceeecccccccccCccCCccccceeeeeeeeee-eeccc Confidence 6664 788887654311 11223333321 110 00112222222 123333344456665 55554 Q ss_pred ceEEEEeecC Q lcl|NC_010583. 449 GVVSGAYAAA 458 (458) Q Consensus 449 afv~l~~aaa 458 (458) +-- ++.+.+ T Consensus 430 ~~~-~~~~~~ 438 (457) T protein:vir:10 430 AGG-LTQGSG 438 (457) T ss_pred ccc-cccccc Confidence 321 221111 No 240 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=29.40 E-value=1.7 Score=19.37 Aligned_cols=353 Identities=12% Similarity=0.080 Sum_probs=118.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHH Q lcl|NC_010583. 58 EDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKL 137 (458) Q Consensus 58 e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 137 (458) ...+ .+.+.=....+. +...+..... ++.+ .....+...+... ......++.-.. T Consensus 1 ~~~~----------~l~~kw~p~l~~--~~~~~i~~~~---~~~i---------~~~~~en~~~~~~-~~~~~~~~~~~~ 55 (519) T protein:vir:10 1 MKKN----------ALVQKWSALLEN--EALPEIVGAS---KQAI---------IAKIFENQEQDIL-TAPEYRDEKISE 55 (519) T ss_pred Cchh----------HHHHHhHHhhcc--cccchhhhhh---hHHH---------HHHHHHHHHHHhh-hcccccchHHHH Confidence 1100 010000000000 0000000000 0000 0000000000000 000001111111 Q ss_pred HHHHhhhccchhHHHHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce-------EEEE Q lcl|NC_010583. 138 VLLSYMMEKDVFETEHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL-------TMLV 210 (458) Q Consensus 138 a~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-------~~p~ 210 (458) +|..++.+.... ....-. ...+..++++++-...-|.. .. ++.+..+..+..+++.+-||+++.. +|+- T Consensus 56 ~~~~~l~e~~~~-~~~~~~-~t~i~~~~~t~~v~~~~P~l-~~-l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n 131 (519) T protein:vir:10 56 AFGSFLTEAEIG-GDHGYD-ATNIAAGQTSGAVTQIGPAV-MG-MVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGK 131 (519) T ss_pred HHhhhcchhccC-CccccC-ccccccccccccccccchhH-HH-HHHHHHHhhhhhhhheeecCCchhhhhheeeeeecC Confidence 222221110000 000000 00000011111000111211 11 3334445555666677777665322 2221 Q ss_pred ecCC------------Ccccccc--------------------------------------------------------- Q lcl|NC_010583. 211 EPEA------------GRATWVD--------------------------------------------------------- 221 (458) Q Consensus 211 ~~~~------------~~a~~v~--------------------------------------------------------- 221 (458) .... +++.|-+ T Consensus 132 ~~~~~~g~ea~~~~nEadt~fSG~~~~~~~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~ 211 (519) T protein:vir:10 132 DPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVT 211 (519) T ss_pred CccccccccccccccccccccCccccccccccccccccccccccccccccccccceeccccccccCCCCcCccccccccc Confidence 1100 0000000 Q ss_pred ------------ccccc--cc---------ccccccccccceeeeeehhheeeeehhhHHHHhc----cHHHHHHHHHHH Q lcl|NC_010583. 222 ------------ASKFG--TD---------ETVGDEVKGQLTEISFKTYKLAAKSFITDETEED----AIFSLLPLLRKR 274 (458) Q Consensus 222 ------------e~~~~--~e---------~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~d----s~~~~~~~i~~~ 274 (458) ++... .| +...++-..+++.+++.++..+-...+|-||.+| ...|.+++|.+- T Consensus 212 ~~~~~~~~~~~~~gmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNI 291 (519) T protein:vir:10 212 ALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGI 291 (519) T ss_pred cccccccccccccccccchhhccccCCCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHH Confidence 00000 00 0112334455667777777777788999999998 247899999999 Q ss_pred HHHHHHHHHHHHHhccCC-CC--ccc----------cccccccccccceeeccccchhhHHHHHHH----HHHHhhhhh- Q lcl|NC_010583. 275 LIEAHAVSIEEAFMSGNG-TG--QPK----------GLLKLAADDGAKVVTEAKADGSVLVTAKTI----SKLRRKLGR- 336 (458) Q Consensus 275 la~~~~~~~d~~~l~G~g-~~--~p~----------Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~- 336 (458) |+..|...|++.||.-=. +. ... |++...... ...++-+.. ..+..| -.....+.. T Consensus 292 LSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~aGv~d~~~~~-----d~~~~rw~~-e~~k~L~~~i~~~an~I~~~ 365 (519) T protein:vir:10 292 LATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPI-----DIRGARWAG-ESFKALLFQIDKEAAEIARQ 365 (519) T ss_pred HHHHHHHHhhHHHHhhhhhhhhcceeecccCcccccceeeccccc-----ccccchHHH-HHHHHHHHHHHHHHHHHHHh Confidence 999999999999994111 01 111 333211100 001111111 111122 222222222 Q ss_pred hhccc-ceeEechhHHHHHHhhhc--ccccccccccccccc-ccccCCeeec-ccceecccccccccCCceEEEEEe-ce Q lcl|NC_010583. 337 HGLKL-SKLVLIVSMDAYYDLLED--EEWQDVAQVGNDAVK-LQGQVGRIYG-LPVVVSEYFPAKAASAEFAVIVYK-DN 410 (458) Q Consensus 337 ~~~~~-~~~~~~~~~~~~l~~~~d--~~~~~~~~~~~~~~~-~~~~~~~l~G-~pv~~~~~~~~~~~~~~~~~~~~~-~~ 410 (458) ..+.. .-.+|++.....|....- +...-..+.....+. .....|.|.| ++|++..+.|. +.+++++. +. T Consensus 366 T~r~~gn~ii~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~KG~~ 440 (519) T protein:vir:10 366 TGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS-----DYFTIGYKGSN 440 (519) T ss_pred hccccccEEEEchHHHHHHhhccchhccccccccccccccCCCceEEEEecCceEEEecCCCCc-----ceEEEEEecCc Confidence 12233 345678887788775541 110000001111111 0011244554 78988877663 23333321 10 Q ss_pred EEEEecceeEEeecccc----------cCCceEEEEEEeeccEEecccceEEEEee--------------cC Q lcl|NC_010583. 411 FVMPRQRAVTVERERQA----------GKQRDAYYVTQRVNLQRYFENGVVSGAYA--------------AA 458 (458) Q Consensus 411 ~~i~~~~~~~i~~~~~~----------~~~~~~~~~~~r~d~~~~~~~afv~l~~a--------------aa 458 (458) - ..-.+.+.||. .+-+-.+-...|+++. .+| |+..... .+ T Consensus 441 ~-----~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~-~NP--~~~~~~~~~~~~i~~g~~~~a~~ 504 (519) T protein:vir:10 441 E-----MDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG-INP--FADPAAQAPTKRIQNGMPDIVNS 504 (519) T ss_pred c-----cccceeeccccccccccccCCccccceeeeeeeecee-ecC--cccccccCccceeccCchhhhcc Confidence 0 00112222332 1223333334455442 233 3211100 01 No 241 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=28.26 E-value=1.8 Score=19.23 Aligned_cols=138 Identities=12% Similarity=0.037 Sum_probs=22.1 Q ss_pred CcchHHHHHHHHHHHHHH---HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH--HHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLA---KSLEGLTAAQKA---AEAKRLREEQEEKELARMNDLVSKAVGED-RK--RLEEALDLVK 71 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~---~~~~~l~~~~~~---~~~~~~~~e~~~~~~~~~~~~~~~~~~e~-~~--~~~~~~~~i~ 71 (458) |..++.+........++. .....+...... +...+....+..+...+++........+. ++ +.+....+++ T Consensus 559 ~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E~q~~q~e~e 638 (705) T protein:vir:88 559 ILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLAEIE 638 (705) T ss_pred HHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 433333332111111111 011111000000 00000000011111111111111111111 01 0111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhc Q lcl|NC_010583. 72 NLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMME 145 (458) Q Consensus 72 ~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~ 145 (458) .... ......++.+...++......+.....+..+..........+......... .......+.+++ T Consensus 639 ~~~~----~~~~~~~e~~~~~a~~~~~~~~~e~e~~~~e~e~~~e~~q~~~~~~~~~~~---~~~~k~~~~~rr 705 (705) T protein:vir:88 639 LKKQ----EAVLQQREMALKEAELQLERDRFTWERARNEAEYHLEATQARAAYIGDGKV---PETKKPTKAVRR 705 (705) T ss_pred HHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhH---HHHHHHHHHhcC Confidence 1100 001111111111111111111111111111111111111111111111111 112223344444 No 242 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=27.66 E-value=1.8 Score=19.16 Aligned_cols=100 Identities=13% Similarity=0.144 Sum_probs=12.7 Q ss_pred CcchHHHHHHHH-------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEEL-------------------GLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRK 61 (458) Q Consensus 1 ~~~~~~~~~~~~-------------------~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~ 61 (458) +.- ++++.+.+ ...++..+...+..+....+.. ..+..+...+.+....+...+. . T Consensus 593 ~p~-~~el~e~lr~~~~~~~~~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~---~~qa~ae~~~Aqae~~qa~~e~-~ 667 (711) T protein:vir:10 593 WPG-ADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQAD---MAQAEADTAQAQADMLKAQLET-E 667 (711) T ss_pred CCC-HHHHHHHHHhhcCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHH-H Confidence 100 11111111 0011111111110000000000 0000000000000000000000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 62 RLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSL 105 (458) Q Consensus 62 ~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 105 (458) ........++.+...-........++.....+++...+.+..+. T Consensus 668 ~~q~q~~~~~~~aq~~~~~~qq~~~~l~~~qaelq~~q~~~~q~ 711 (711) T protein:vir:10 668 EAQKQLAMIEDMAQGGDVVYQQVRELVAQALAEITASQANVTEQ 711 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 00000011111100000011111122222222222222222221 No 243 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=27.40 E-value=1.8 Score=19.12 Aligned_cols=367 Identities=12% Similarity=0.076 Sum_probs=75.9 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAAEAKRLREEQEEKELARMNDLVSKAVGEDRKRLEEALDLVKNLDEKSKKS 80 (458) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~i~~~~e~~~~~ 80 (458) |||+|.++++... ++.++.+.+.+..+. ...+ .+++ ...+.. ..+.++..+++++...+...+. T Consensus 1 m~~~lk~l~~~~~--el~~~~~~~k~~~~~--~~~~-~e~~---~~~l~~--------~~~~l~~~~~~~~~~~~~~~~~ 64 (401) T protein:vir:44 1 MAVDIKDVEQVAQ--ELQQKFDDFKAKNDK--RVEA-IEQE---KGKLAG--------QVETLNGKLSELENLKSDLEKE 64 (401) T ss_pred CCccHHHHHHHHH--HHHHHHHHHHHHHHH--HHHH-HHHH---HHHHHH--------HHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999755 555554433222111 1101 0111 001111 1111222222222222211111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhHHHHHHHHHhhhccchhHHHHHHHHhhh Q lcl|NC_010583. 81 AELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDEVEKLVLLSYMMEKDVFETEHGKAHIKA 160 (458) Q Consensus 81 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~a 160 (458) ..... .. ....... ...+.+......+... ....... .+..++. .. T Consensus 65 ~~~~~----~~---~~~~~~~-----~~~e~~~a~~~~lr~~---~~~~~~~-~e~~a~~-----------------~~- 110 (401) T protein:vir:44 65 LLELK----RP---ARGAQNK-----VAAEHKDAFVGFLRKG---REDGLRD-LERKALQ-----------------VG- 110 (401) T ss_pred HHHhh----cc---ccccccc-----hhHHHHHHHHHHHhhh---hhhhhHH-HHHHHhh-----------------cC- Confidence 10000 00 0000000 0000000000000000 0000000 0000000 00 Q ss_pred hhcccccccCccccchhHHHH-----HHHHHHhccchhhhcceeeecc-CceEEEEecCCCccccccccccccccccccc Q lcl|NC_010583. 161 VNGSSSVSMSSEAYETIFSTR-----IIRDLQKELVVGALFDELPMSS-KILTMLVEPEAGRATWVDASKFGTDETVGDE 234 (458) Q Consensus 161 ~~~~~~~~~g~~~ip~~~~~~-----ii~~~~~~~~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~e~~~~~e~~~~~~ 234 (458) ....++....-.+.+ .+... ++..+....++..-...+|+.. +....++..+.... +.. .+.-....- T Consensus 111 ~~~~GG~~iP~~~~~-~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~----~~~-~~~~~~v~~ 184 (401) T protein:vir:44 111 TDEDGGYAVPEELDR-SILSLLKDEVVMRQEATVITVGGSDYKKLVNLGGTASGWVGETDTRS----QTA-TSRLGLIEP 184 (401) T ss_pred CCCCCceeccHhHHH-HHHHHHHhhhhhhhhceeeecCCCceEEEEecCCccceeeccccccC----ccc-cccceeeee Confidence 000000000111111 11111 1111111111111111233222 22222232222111 100 011111101 Q ss_pred cccccee-eeeehhheeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHH--------HHhccCCCCccccccccccc Q lcl|NC_010583. 235 VKGQLTE-ISFKTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE--------AFMSGNGTGQPKGLLKLAAD 305 (458) Q Consensus 235 ~~~~f~~-v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~~~~~~~d~--------~~l~G~g~~~p~Gi~~~~~~ 305 (458) .--++.. +.++-. + +.-|..-+. --+...|.+.++..+..++=. .|++..+...-.+....... T Consensus 185 ~~~k~~~~~~iS~e-l---l~ds~~~l~---~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~ 257 (401) T protein:vir:44 185 FMGEIYGNPQATQK-M---LDDAFFNVE---AWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKL 257 (401) T ss_pred ehhheeeehhhhHH-H---HhcchHHHH---HHHHHHHHHHHHHHHHhhhhccCCCCccceeeccccccccccccccccc Confidence 1111111 111111 1 122222222 136677777777777766542 23332221110000000000 Q ss_pred cccceeeccccchhhHHHHHHHHHHHhhhhhhhcccceeEechhHHHHHHhhhcc--------------ccccccccccc Q lcl|NC_010583. 306 DGAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDE--------------EWQDVAQVGND 371 (458) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~d~--------------~~~~~~~~~~~ 371 (458) ...++.......+.+.. ..+..+...+........+--+........+....+ .|+|+.....- T Consensus 258 ~~~~t~~~~~~~~d~i~--~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~ 335 (401) T protein:vir:44 258 QHIVSGEATAVTADAII--KLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIAENEQM 335 (401) T ss_pred cccccccccccCHHHHH--HHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEecCc Confidence 11111111111111111 111111111111110000000111111111222112 23332211110 Q ss_pred cccccccCCeeecccceecccccccccCCceEEEEEeceEEEEe-----cceeEEeecccccCCceEEEEEEeeccEEec Q lcl|NC_010583. 372 AVKLQGQVGRIYGLPVVVSEYFPAKAASAEFAVIVYKDNFVMPR-----QRAVTVERERQAGKQRDAYYVTQRVNLQRYF 446 (458) Q Consensus 372 ~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~~~~~~i~~-----~~~~~i~~~~~~~~~~~~~~~~~r~d~~~~~ 446 (458) +..+.. +.+|++-+ ... ...+++...+.+.. ..-+.+....++. ...+.....+ .+.. T Consensus 336 --p~~~~~----~~~i~~Gd------~~~-~~~i~~~~~~~~~~~~~~~~~~v~~~a~~r~d--~~~~~~~a~~--~l~~ 398 (401) T protein:vir:44 336 --PDIAAD----AKAIAFGN------FKR-GYTIVDRIGTRILRDPYTNKPFVGFYTTKRTG--GMLVDSQAIK--LLKI 398 (401) T ss_pred --CCccCC----ccEEEEee------hhc-cEEEEEecceEEeeeccccCCcEEEEEEEEec--cEEecccceE--EEEe Confidence 000000 01111110 000 11122222222110 0001111111111 0000000000 0111 Q ss_pred ccc Q lcl|NC_010583. 447 ENG 449 (458) Q Consensus 447 ~~a 449 (458) ..| T Consensus 399 ~aa 401 (401) T protein:vir:44 399 AAA 401 (401) T ss_pred ecC Confidence 111 No 244 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=26.88 E-value=1.9 Score=19.06 Aligned_cols=113 Identities=9% Similarity=0.070 Sum_probs=12.7 Q ss_pred CcchH---HHHHHHHHHH--------HHH-HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDI---NKLKEELGLG--------DLA-KSLEGLTAAQKAAEAKRLR-EEQEEKELARMNDLVSKAVGEDRKRLEEAL 67 (458) Q Consensus 1 ~~~~~---~~~~~~~~~~--------~~~-~~~~~l~~~~~~~~~~~~~-~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~ 67 (458) ..||+ +++.+.+... .+. ++...-...+..++...++ .++..+...++++...++.........++. T Consensus 581 ~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~ 660 (714) T protein:vir:99 581 NLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQ 660 (714) T ss_pred HhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222 2333322110 000 0000000000000000000 001111111111111111110000000000 Q ss_pred HH-----HHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh Q lcl|NC_010583. 68 DL-----VKNLDEKSKKSAELFAQTV---EKQQETIVGLQDEIKSLLAAREGRSFVGDSVAK 121 (458) Q Consensus 68 ~~-----i~~~~e~~~~~~e~~~~~~---~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~ 121 (458) .. .+....... .....+.+ ..++......+.+..+. .+. ..+.+.- T Consensus 661 ~~~~~~~~~~~~~~~~--~a~~a~~~~~~~~~~~~~~~~~~q~~q~---~~~---~~~~~~~ 714 (714) T protein:vir:99 661 REVALTQGQRYVDALN--QAHTAEIITGVQNMEQEQDVLQQQMLYT---LQQ---RMNEMSL 714 (714) T ss_pred HHHHHHHHHHHHHHHH--HHHHHHHHHhHhhhhhhhHHHHHHHHHH---HHH---HHHhcCC Confidence 00 000000000 00000000 00000000001110000 000 0000000 No 245 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=26.88 E-value=1.9 Score=19.06 Aligned_cols=113 Identities=9% Similarity=0.070 Sum_probs=12.7 Q ss_pred CcchH---HHHHHHHHHH--------HHH-HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDI---NKLKEELGLG--------DLA-KSLEGLTAAQKAAEAKRLR-EEQEEKELARMNDLVSKAVGEDRKRLEEAL 67 (458) Q Consensus 1 ~~~~~---~~~~~~~~~~--------~~~-~~~~~l~~~~~~~~~~~~~-~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~ 67 (458) ..||+ +++.+.+... .+. ++...-...+..++...++ .++..+...++++...++.........++. T Consensus 581 ~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~ 660 (714) T protein:vir:81 581 NLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQ 660 (714) T ss_pred HhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222 2333322110 000 0000000000000000000 001111111111111111110000000000 Q ss_pred HH-----HHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh Q lcl|NC_010583. 68 DL-----VKNLDEKSKKSAELFAQTV---EKQQETIVGLQDEIKSLLAAREGRSFVGDSVAK 121 (458) Q Consensus 68 ~~-----i~~~~e~~~~~~e~~~~~~---~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~ 121 (458) .. .+....... .....+.+ ..++......+.+..+. .+. ..+.+.- T Consensus 661 ~~~~~~~~~~~~~~~~--~a~~a~~~~~~~~~~~~~~~~~~q~~q~---~~~---~~~~~~~ 714 (714) T protein:vir:81 661 REVALTQGQRYVDALN--QAHTAEIITGVQNMEQEQDVLQQQMLYT---LQQ---RMNEMSL 714 (714) T ss_pred HHHHHHHHHHHHHHHH--HHHHHHHHHhHhhhhhhhHHHHHHHHHH---HHH---HHHhcCC Confidence 00 000000000 00000000 00000000001110000 000 0000000 No 246 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=26.88 E-value=1.9 Score=19.06 Aligned_cols=113 Identities=9% Similarity=0.070 Sum_probs=12.7 Q ss_pred CcchH---HHHHHHHHHH--------HHH-HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDI---NKLKEELGLG--------DLA-KSLEGLTAAQKAAEAKRLR-EEQEEKELARMNDLVSKAVGEDRKRLEEAL 67 (458) Q Consensus 1 ~~~~~---~~~~~~~~~~--------~~~-~~~~~l~~~~~~~~~~~~~-~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~ 67 (458) ..||+ +++.+.+... .+. ++...-...+..++...++ .++..+...++++...++.........++. T Consensus 581 ~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~ 660 (714) T protein:vir:27 581 NLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQ 660 (714) T ss_pred HhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222 2333322110 000 0000000000000000000 001111111111111111110000000000 Q ss_pred HH-----HHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh Q lcl|NC_010583. 68 DL-----VKNLDEKSKKSAELFAQTV---EKQQETIVGLQDEIKSLLAAREGRSFVGDSVAK 121 (458) Q Consensus 68 ~~-----i~~~~e~~~~~~e~~~~~~---~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~ 121 (458) .. .+....... .....+.+ ..++......+.+..+. .+. ..+.+.- T Consensus 661 ~~~~~~~~~~~~~~~~--~a~~a~~~~~~~~~~~~~~~~~~q~~q~---~~~---~~~~~~~ 714 (714) T protein:vir:27 661 REVALTQGQRYVDALN--QAHTAEIITGVQNMEQEQDVLQQQMLYT---LQQ---RMNEMSL 714 (714) T ss_pred HHHHHHHHHHHHHHHH--HHHHHHHHHhHhhhhhhhHHHHHHHHHH---HHH---HHHhcCC Confidence 00 000000000 00000000 00000000001110000 000 0000000 No 247 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=26.88 E-value=1.9 Score=19.06 Aligned_cols=113 Identities=9% Similarity=0.070 Sum_probs=12.7 Q ss_pred CcchH---HHHHHHHHHH--------HHH-HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDI---NKLKEELGLG--------DLA-KSLEGLTAAQKAAEAKRLR-EEQEEKELARMNDLVSKAVGEDRKRLEEAL 67 (458) Q Consensus 1 ~~~~~---~~~~~~~~~~--------~~~-~~~~~l~~~~~~~~~~~~~-~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~ 67 (458) ..||+ +++.+.+... .+. ++...-...+..++...++ .++..+...++++...++.........++. T Consensus 581 ~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~ 660 (714) T protein:vir:10 581 NLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQ 660 (714) T ss_pred HhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222 2333322110 000 0000000000000000000 001111111111111111110000000000 Q ss_pred HH-----HHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh Q lcl|NC_010583. 68 DL-----VKNLDEKSKKSAELFAQTV---EKQQETIVGLQDEIKSLLAAREGRSFVGDSVAK 121 (458) Q Consensus 68 ~~-----i~~~~e~~~~~~e~~~~~~---~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~ 121 (458) .. .+....... .....+.+ ..++......+.+..+. .+. ..+.+.- T Consensus 661 ~~~~~~~~~~~~~~~~--~a~~a~~~~~~~~~~~~~~~~~~q~~q~---~~~---~~~~~~~ 714 (714) T protein:vir:10 661 REVALTQGQRYVDALN--QAHTAEIITGVQNMEQEQDVLQQQMLYT---LQQ---RMNEMSL 714 (714) T ss_pred HHHHHHHHHHHHHHHH--HHHHHHHHHhHhhhhhhhHHHHHHHHHH---HHH---HHHhcCC Confidence 00 000000000 00000000 00000000001110000 000 0000000 No 248 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=26.88 E-value=1.9 Score=19.06 Aligned_cols=113 Identities=9% Similarity=0.070 Sum_probs=12.7 Q ss_pred CcchH---HHHHHHHHHH--------HHH-HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010583. 1 MTIDI---NKLKEELGLG--------DLA-KSLEGLTAAQKAAEAKRLR-EEQEEKELARMNDLVSKAVGEDRKRLEEAL 67 (458) Q Consensus 1 ~~~~~---~~~~~~~~~~--------~~~-~~~~~l~~~~~~~~~~~~~-~e~~~~~~~~~~~~~~~~~~e~~~~~~~~~ 67 (458) ..||+ +++.+.+... .+. ++...-...+..++...++ .++..+...++++...++.........++. T Consensus 581 ~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~ 660 (714) T protein:vir:32 581 NLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQ 660 (714) T ss_pred HhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222 2333322110 000 0000000000000000000 001111111111111111110000000000 Q ss_pred HH-----HHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh Q lcl|NC_010583. 68 DL-----VKNLDEKSKKSAELFAQTV---EKQQETIVGLQDEIKSLLAAREGRSFVGDSVAK 121 (458) Q Consensus 68 ~~-----i~~~~e~~~~~~e~~~~~~---~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~ 121 (458) .. .+....... .....+.+ ..++......+.+..+. .+. ..+.+.- T Consensus 661 ~~~~~~~~~~~~~~~~--~a~~a~~~~~~~~~~~~~~~~~~q~~q~---~~~---~~~~~~~ 714 (714) T protein:vir:32 661 REVALTQGQRYVDALN--QAHTAEIITGVQNMEQEQDVLQQQMLYT---LQQ---RMNEMSL 714 (714) T ss_pred HHHHHHHHHHHHHHHH--HHHHHHHHHhHhhhhhhhHHHHHHHHHH---HHH---HHHhcCC Confidence 00 000000000 00000000 00000000001110000 000 0000000 No 249 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=20.54 E-value=2.8 Score=18.17 Aligned_cols=346 Identities=13% Similarity=0.073 Sum_probs=111.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcchhhhhhH--HH Q lcl|NC_010583. 58 EDRKRLEEALDLVKNLDEKSKKSAELFAQTVEKQQETIVGLQDEIKSLLAAREGRSFVGDSVAKALYGTQDAFEDE--VE 135 (458) Q Consensus 58 e~~~~~~~~~~~i~~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~--~~ 135 (458) +. -..+ .+.+.=....|.+. .. ++.. .-++.... .....+++...+ .+ T Consensus 1 ~~-----~~~~---~l~~kw~p~l~~~~--~~-----------~i~~----~~~~~~~a-----~l~enq~~~~~~~~~~ 50 (529) T protein:vir:10 1 MS-----LKNK---EILNKWTPLLEGEG--LP-----------EIAG----KNKQALVA-----QILEAQEKDSKSDPVY 50 (529) T ss_pred Cc-----ccHH---HHHHHhHHHhcCCc--cc-----------hhcc----chhhhhhh-----hhhhhhHHHHhhcccc Confidence 00 0000 00000000000000 00 0000 00000000 000000000000 00 Q ss_pred -----HHHHHHhhhccchhHH-HHHHHHhhhhhcccccccCccccchhHHHHHHHHHHhccchhhhcceeeeccCce--- Q lcl|NC_010583. 136 -----KLVLLSYMMEKDVFET-EHGKAHIKAVNGSSSVSMSSEAYETIFSTRIIRDLQKELVVGALFDELPMSSKIL--- 206 (458) Q Consensus 136 -----~~a~~~~~~~~~~~~~-~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~--- 206 (458) -.++...+.+...... ..........+.++.... .=|. +. .+++..-+..+..+++-|-||+++.. T Consensus 51 ~~~~~~e~~~~~l~~~~~~~~~~~~~~~i~est~t~~v~~---~~P~-Li-~lvRra~p~LIa~DIwGVQPMTgPTGLIF 125 (529) T protein:vir:10 51 RDDKLIEAFGQSLMEAEVAGDHGYDPTNIAAGQSSGAITN---IGPA-VI-GMVRRAIPSLIAFDIAGVQPMTGPTGQVF 125 (529) T ss_pred chhhhhhhhhcccchhhccccccccccccccccccccccc---cCch-hh-hhHHHHHhhhhhheeeeeecCCchhhhhh Confidence 0001111110000000 000000000011111111 1121 11 13333444555555666666654211 Q ss_pred ----EEEEecCC-------------------------------------------------------------------- Q lcl|NC_010583. 207 ----TMLVEPEA-------------------------------------------------------------------- 214 (458) Q Consensus 207 ----~~p~~~~~-------------------------------------------------------------------- 214 (458) .|+-.... T Consensus 126 AMRsrY~~~~~~~~~~eaf~~~y~Pda~~sga~~~ga~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~ 205 (529) T protein:vir:10 126 ALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGAS 205 (529) T ss_pred hhheeecCCccccccccccccccccccccccccccccccccCccccccccccccccccCcceeeeecccceecccccccc Confidence 11100000 Q ss_pred C----------------------ccccccccccc-----------ccccccccccccceeeeeehhheeeeehhhHHHHh Q lcl|NC_010583. 215 G----------------------RATWVDASKFG-----------TDETVGDEVKGQLTEISFKTYKLAAKSFITDETEE 261 (458) Q Consensus 215 ~----------------------~a~~v~e~~~~-----------~e~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ 261 (458) . ..+-++++-.. ..+...++-..+++.++++++.-+-...+|-||.+ T Consensus 206 ~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~Gm~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQ 285 (529) T protein:vir:10 206 VTVGTNETGEALDKLINAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQ 285 (529) T ss_pred cccCccccCcccccccccccccccccccccccchhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHH Confidence 0 00000001000 00112334445567777777777778899999999 Q ss_pred c----cHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCC-C---------ccccccccccccccceeeccccchhhHHHH Q lcl|NC_010583. 262 D----AIFSLLPLLRKRLIEAHAVSIEEAFMSG---NGT-G---------QPKGLLKLAADDGAKVVTEAKADGSVLVTA 324 (458) Q Consensus 262 d----s~~~~~~~i~~~la~~~~~~~d~~~l~G---~g~-~---------~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (458) | ...|.+++|.+-|+..|...|++.||.- ... + ...|++...... .....-+.. ..+ T Consensus 286 DLKAVHGLDAEtELsNILStEImlEINReii~~l~~~a~~~k~~g~~~~~~~~Gv~d~~~~~-----~~~~~~~~~-e~~ 359 (529) T protein:vir:10 286 DLRAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTKTDGSASGVFDFQDPI-----DVRGARWAG-ESY 359 (529) T ss_pred HHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHhHhhhhhhhhcccccccccccceeecccCc-----cccccchHH-HHH Confidence 8 2477899999999999999999988843 000 1 122333221110 000111111 122 Q ss_pred HHHH----HHHhhhhh-hhcccce-eEechhHHHHHHhhhcccccccc---cccc-ccccccccCCeeec-ccceecccc Q lcl|NC_010583. 325 KTIS----KLRRKLGR-HGLKLSK-LVLIVSMDAYYDLLEDEEWQDVA---QVGN-DAVKLQGQVGRIYG-LPVVVSEYF 393 (458) Q Consensus 325 ~~~~----~~~~~~~~-~~~~~~~-~~~~~~~~~~l~~~~d~~~~~~~---~~~~-~~~~~~~~~~~l~G-~pv~~~~~~ 393 (458) ..|. ...+.+.. ..+..+. .++++.....|.. .+...++-. ..+. .........|.|.| ++|++..+. T Consensus 360 k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~-~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~ 438 (529) T protein:vir:10 360 KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALAL-IDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYA 438 (529) T ss_pred HHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHh-hhhhccccccccccccccccCCceEEEEecCceEEEecCCC Confidence 2222 22222222 2222233 4667777777763 111111111 1100 01111112345554 788888776 Q ss_pred cccccCCceEEEEEe-ceEEEEecceeEEeecccc----------cCCceEEEEEEeeccEEecccceEEEEee------ Q lcl|NC_010583. 394 PAKAASAEFAVIVYK-DNFVMPRQRAVTVERERQA----------GKQRDAYYVTQRVNLQRYFENGVVSGAYA------ 456 (458) Q Consensus 394 ~~~~~~~~~~~~~~~-~~~~i~~~~~~~i~~~~~~----------~~~~~~~~~~~r~d~~~~~~~afv~l~~a------ 456 (458) |. +.+++++. +.- ..-.+.+.||. .+-+-.+-...|+++.+ +| |+.-+.- T Consensus 439 ~~-----dy~~vG~KG~~~-----~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP--~~~~~~~~~~~r~ 505 (529) T protein:vir:10 439 RQ-----DYFTMGYRGANN-----LDAGIYYCPYVALTPLRGSDPKNFQPVMGFKTRYAIGV-NP--FAESRTQAPQGRI 505 (529) T ss_pred Cc-----ceEEEEEeCCcc-----cccceeeccccccccccccCCCcccceeeeeeeeceee-cC--ccccccccccccc Confidence 52 23333321 110 00112222322 12222333334554322 22 2111100 Q ss_pred ----------------------cC Q lcl|NC_010583. 457 ----------------------AA 458 (458) Q Consensus 457 ----------------------aa 458 (458) .= T Consensus 506 ~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 506 TSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred cCCcchhhhcCccceeEEeeeccC Confidence 00 Done!