Query lcl|NC_019921.1_cdsid_YP_007237157.1 [gene=capsid] [protein=phage major capsid protein] [protein_id=YP_007237157.1] [location=22805..23950] Match_columns 381 No_of_seqs 174 out of 549 Neff 8.4 Searched_HMMs 1612 Date Thu Nov 7 17:47:10 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_28 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_28_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:100632 Length: 381 100.0 1E-102 9E-106 578.9 40.1 381 1-381 1-381 (381) 2 protein:vir:101291 Length: 381 100.0 1E-101 9E-105 573.5 40.3 381 1-381 1-381 (381) 3 protein:vir:9509 Length: 381 # 100.0 1E-101 9E-105 573.5 40.3 381 1-381 1-381 (381) 4 protein:vir:78350 Length: 383 100.0 1.9E-93 1.2E-96 528.9 38.6 376 1-376 1-383 (383) 5 protein:vir:9643 Length: 377 # 100.0 6.9E-92 4.3E-95 520.3 39.8 368 1-368 1-377 (377) 6 protein:vir:98635 Length: 377 100.0 6.8E-87 4.2E-90 493.0 37.8 368 1-368 1-377 (377) 7 protein:vir:95963 Length: 395 100.0 6.8E-86 4.2E-89 487.5 39.1 377 1-381 1-389 (395) 8 protein:vir:4092 Length: 390 # 100.0 6.2E-76 3.9E-79 432.9 39.0 369 1-381 4-388 (390) 9 protein:vir:80128 Length: 466 100.0 3.3E-68 2.1E-71 390.5 33.4 373 1-381 21-461 (466) 10 protein:vir:95376 Length: 425 100.0 3.6E-67 2.3E-70 384.8 36.0 353 1-372 8-425 (425) 11 protein:vir:4456 Length: 401 # 100.0 5.6E-66 3.5E-69 378.3 35.5 353 1-368 1-401 (401) 12 protein:vir:100247 Length: 425 100.0 3.6E-65 2.2E-68 373.9 35.0 355 1-369 21-425 (425) 13 protein:vir:485 Length: 407 # 100.0 1.3E-64 8.1E-68 370.8 36.9 357 1-375 1-407 (407) 14 protein:vir:1328 Length: 392 # 100.0 9.1E-65 5.6E-68 371.7 31.9 346 1-369 1-392 (392) 15 protein:vir:6242 Length: 390 # 100.0 9.3E-64 5.8E-67 366.1 30.6 344 1-369 4-390 (390) 16 protein:vir:6212 Length: 434 # 100.0 3.3E-62 2.1E-65 357.6 32.2 346 1-373 1-434 (434) 17 protein:vir:4511 Length: 409 # 100.0 7.1E-61 4.4E-64 350.3 33.5 345 1-371 1-409 (409) 18 protein:vir:8102 Length: 543 # 100.0 9.1E-61 5.7E-64 349.7 32.1 345 1-369 143-543 (543) 19 protein:vir:101650 Length: 497 100.0 3.5E-60 2.2E-63 346.5 33.0 364 1-374 7-497 (497) 20 protein:vir:7855 Length: 497 # 100.0 3.5E-60 2.2E-63 346.5 33.0 364 1-374 7-497 (497) 21 protein:vir:1433 Length: 435 # 100.0 6.6E-59 4.1E-62 339.6 33.7 350 1-371 1-435 (435) 22 protein:vir:80376 Length: 435 100.0 9.7E-59 6E-62 338.6 34.6 350 1-372 1-435 (435) 23 protein:vir:78640 Length: 352 100.0 2.4E-59 1.5E-62 342.0 30.5 337 3-374 1-352 (352) 24 protein:vir:105038 Length: 428 100.0 1.7E-58 1.1E-61 337.3 35.2 349 1-369 1-428 (428) 25 protein:vir:1268 Length: 397 # 100.0 1.3E-58 8.3E-62 337.9 34.6 329 1-368 1-397 (397) 26 protein:vir:9361 Length: 402 # 100.0 1.7E-59 1E-62 342.8 29.3 340 1-374 16-402 (402) 27 protein:vir:81070 Length: 390 100.0 2.3E-58 1.4E-61 336.6 35.3 339 1-366 1-390 (390) 28 protein:vir:10364 Length: 390 100.0 1.3E-58 7.8E-62 338.0 33.6 340 1-366 1-390 (390) 29 protein:vir:93881 Length: 387 100.0 7.8E-59 4.9E-62 339.1 31.7 340 1-374 1-387 (387) 30 protein:vir:81160 Length: 371 100.0 3.3E-58 2.1E-61 335.7 35.1 328 1-368 1-371 (371) 31 protein:vir:97053 Length: 390 100.0 3.9E-58 2.4E-61 335.3 34.6 339 1-366 1-390 (390) 32 protein:vir:100135 Length: 418 100.0 5.7E-58 3.5E-61 334.4 33.6 344 1-371 16-418 (418) 33 protein:vir:2685 Length: 387 # 100.0 2.1E-58 1.3E-61 336.8 30.0 340 1-374 1-387 (387) 34 protein:vir:94424 Length: 387 100.0 2.1E-58 1.3E-61 336.8 30.0 340 1-374 1-387 (387) 35 protein:vir:96978 Length: 387 100.0 2.1E-58 1.3E-61 336.8 30.0 340 1-374 1-387 (387) 36 protein:vir:4339 Length: 395 # 100.0 1.5E-57 9.6E-61 332.0 34.6 343 1-368 1-395 (395) 37 protein:vir:1025 Length: 408 # 100.0 2.3E-57 1.4E-60 331.1 33.9 343 1-381 5-406 (408) 38 protein:vir:102119 Length: 404 100.0 7.7E-57 4.8E-60 328.2 35.5 344 1-372 1-404 (404) 39 protein:vir:104256 Length: 458 100.0 1.1E-56 6.9E-60 327.3 35.7 348 1-370 24-458 (458) 40 protein:vir:4953 Length: 397 # 100.0 1.6E-57 9.6E-61 332.0 29.9 339 1-380 1-397 (397) 41 protein:vir:7409 Length: 408 # 100.0 1.4E-56 8.9E-60 326.7 34.9 343 1-381 5-406 (408) 42 protein:vir:1886 Length: 385 # 100.0 2.2E-56 1.3E-59 325.8 34.9 340 1-369 1-385 (385) 43 protein:vir:191 Length: 385 # 100.0 2.2E-56 1.3E-59 325.8 34.9 340 1-369 1-385 (385) 44 protein:vir:105004 Length: 392 100.0 2.5E-56 1.5E-59 325.5 34.9 335 1-376 1-392 (392) 45 protein:vir:107593 Length: 392 100.0 2.5E-56 1.5E-59 325.5 34.9 335 1-376 1-392 (392) 46 protein:vir:102082 Length: 392 100.0 2.5E-56 1.5E-59 325.5 34.9 335 1-376 1-392 (392) 47 protein:vir:102873 Length: 392 100.0 2.5E-56 1.5E-59 325.5 34.9 335 1-376 1-392 (392) 48 protein:vir:7771 Length: 330 # 100.0 1.1E-57 7E-61 332.8 27.2 290 65-375 1-330 (330) 49 protein:vir:81227 Length: 413 100.0 2.6E-56 1.6E-59 325.3 33.3 346 1-372 1-413 (413) 50 protein:vir:4997 Length: 397 # 100.0 6.4E-56 3.9E-59 323.2 34.6 341 1-380 1-397 (397) 51 protein:vir:3991 Length: 404 # 100.0 4.2E-55 2.6E-58 318.7 36.2 341 1-379 1-404 (404) 52 protein:vir:4830 Length: 397 # 100.0 5.2E-56 3.2E-59 323.7 30.7 339 3-380 1-397 (397) 53 protein:vir:4700 Length: 415 # 100.0 4.9E-55 3E-58 318.3 35.2 350 3-379 1-415 (415) 54 protein:vir:4600 Length: 415 # 100.0 4.9E-55 3E-58 318.3 35.2 350 3-379 1-415 (415) 55 protein:vir:79987 Length: 415 100.0 5.9E-55 3.7E-58 317.9 35.4 350 3-379 1-415 (415) 56 protein:vir:81100 Length: 415 100.0 5.9E-55 3.7E-58 317.9 35.4 350 3-379 1-415 (415) 57 protein:vir:98339 Length: 415 100.0 5.9E-55 3.7E-58 317.9 35.4 350 3-379 1-415 (415) 58 protein:vir:9410 Length: 415 # 100.0 7.2E-55 4.5E-58 317.4 34.7 350 3-379 1-415 (415) 59 protein:vir:3845 Length: 395 # 100.0 9.9E-55 6.1E-58 316.7 35.3 341 1-379 1-395 (395) 60 protein:vir:96762 Length: 632 100.0 2.1E-55 1.3E-58 320.4 31.4 345 1-369 224-632 (632) 61 protein:vir:4226 Length: 326 # 100.0 1.9E-56 1.2E-59 326.1 25.5 293 54-373 1-326 (326) 62 protein:vir:97148 Length: 324 100.0 3.4E-56 2.1E-59 324.7 26.9 301 37-381 1-324 (324) 63 protein:vir:41 Length: 299 # N 100.0 2.2E-56 1.4E-59 325.7 24.8 272 60-369 1-299 (299) 64 protein:vir:5739 Length: 366 # 100.0 2.8E-56 1.7E-59 325.2 24.4 336 5-369 1-366 (366) 65 protein:vir:2430 Length: 318 # 100.0 3.9E-55 2.4E-58 318.9 29.0 287 59-373 1-318 (318) 66 protein:vir:1383 Length: 421 # 100.0 4.4E-54 2.7E-57 313.1 32.7 341 1-381 1-402 (421) 67 protein:vir:104085 Length: 320 100.0 1.1E-54 7E-58 316.4 28.6 289 59-372 1-320 (320) 68 protein:vir:105905 Length: 304 100.0 3.9E-55 2.4E-58 318.9 25.9 279 65-369 1-304 (304) 69 protein:vir:94142 Length: 304 100.0 3.9E-55 2.4E-58 318.9 25.9 279 65-369 1-304 (304) 70 protein:vir:80684 Length: 315 100.0 9.8E-55 6.1E-58 316.7 28.1 280 76-379 1-315 (315) 71 protein:vir:78830 Length: 324 100.0 4.9E-55 3E-58 318.3 25.7 301 37-381 1-324 (324) 72 protein:vir:96392 Length: 324 100.0 4.9E-55 3E-58 318.3 25.7 301 37-381 1-324 (324) 73 protein:vir:94673 Length: 419 100.0 1.1E-53 6.7E-57 311.0 32.8 347 1-370 1-419 (419) 74 protein:vir:3870 Length: 400 # 100.0 3E-53 1.8E-56 308.6 33.6 325 1-369 1-400 (400) 75 protein:vir:78223 Length: 333 100.0 2.9E-54 1.8E-57 314.1 27.6 284 70-369 1-333 (333) 76 protein:vir:2344 Length: 397 # 100.0 3.1E-54 1.9E-57 313.9 27.3 288 60-381 1-319 (397) 77 protein:vir:9309 Length: 324 # 100.0 2.7E-54 1.7E-57 314.2 27.0 301 36-381 1-324 (324) 78 protein:vir:99749 Length: 324 100.0 2.9E-54 1.8E-57 314.1 26.0 301 37-381 1-324 (324) 79 protein:vir:100884 Length: 389 100.0 4.8E-53 3E-56 307.4 32.2 332 3-375 1-389 (389) 80 protein:vir:9704 Length: 394 # 100.0 7.8E-53 4.9E-56 306.2 32.8 328 1-373 1-394 (394) 81 protein:vir:4856 Length: 293 # 100.0 4.8E-54 3E-57 312.9 25.3 270 72-380 1-293 (293) 82 protein:vir:103955 Length: 324 100.0 5.1E-54 3.2E-57 312.7 25.3 301 20-381 1-324 (324) 83 protein:vir:8187 Length: 311 # 100.0 1.7E-53 1.1E-56 309.9 27.8 270 78-369 1-311 (311) 84 protein:vir:100172 Length: 394 100.0 6.2E-53 3.9E-56 306.8 30.7 335 3-379 1-394 (394) 85 protein:vir:95763 Length: 297 100.0 6.6E-54 4.1E-57 312.1 25.2 275 65-373 1-297 (297) 86 protein:vir:101607 Length: 379 100.0 4.6E-52 2.9E-55 302.0 34.6 332 1-370 1-379 (379) 87 protein:vir:2504 Length: 305 # 100.0 1.1E-53 6.8E-57 310.9 25.6 283 76-374 1-305 (305) 88 protein:vir:8420 Length: 477 # 100.0 1.2E-52 7.6E-56 305.2 31.0 354 1-377 8-477 (477) 89 protein:vir:1084 Length: 437 # 100.0 3.7E-52 2.3E-55 302.6 31.0 338 1-379 29-437 (437) 90 protein:vir:96223 Length: 324 100.0 1.4E-52 8.8E-56 304.8 28.7 301 37-381 1-324 (324) 91 protein:vir:93616 Length: 645 100.0 1.1E-51 6.5E-55 300.1 32.1 344 1-376 193-645 (645) 92 protein:vir:962 Length: 397 # 100.0 2.3E-52 1.4E-55 303.7 28.1 322 1-368 15-397 (397) 93 protein:vir:78523 Length: 338 100.0 3.3E-52 2.1E-55 302.8 27.8 288 64-372 1-338 (338) 94 protein:vir:9759 Length: 303 # 100.0 1.9E-51 1.2E-54 298.6 27.1 271 78-370 1-303 (303) 95 protein:vir:9574 Length: 300 # 100.0 2.2E-51 1.4E-54 298.3 26.3 267 76-370 1-300 (300) 96 protein:vir:1638 Length: 298 # 100.0 5.5E-51 3.4E-54 296.1 26.8 268 80-367 1-298 (298) 97 protein:vir:94771 Length: 298 100.0 1.3E-49 7.8E-53 288.7 26.5 265 80-367 1-298 (298) 98 protein:vir:99920 Length: 311 100.0 1.1E-49 6.8E-53 289.0 25.6 271 76-368 1-311 (311) 99 protein:vir:4159 Length: 315 # 100.0 3.7E-47 2.3E-50 275.1 23.0 284 53-367 1-315 (315) 100 protein:vir:4197 Length: 314 # 100.0 3.7E-46 2.3E-49 269.7 23.3 279 66-371 1-314 (314) 101 protein:vir:3158 Length: 321 # 100.0 7.8E-41 4.8E-44 240.5 25.2 296 57-379 1-321 (321) 102 protein:vir:97397 Length: 517 100.0 8.4E-37 5.2E-40 218.4 26.7 340 1-376 131-517 (517) 103 protein:vir:4074 Length: 480 # 100.0 1.7E-33 1E-36 200.3 17.9 320 1-373 131-480 (480) 104 protein:vir:9820 Length: 272 # 99.9 9.8E-29 6.1E-32 174.1 22.3 258 76-371 1-272 (272) 105 protein:vir:3033 Length: 272 # 99.9 9.8E-29 6.1E-32 174.1 22.3 258 76-371 1-272 (272) 106 protein:vir:94933 Length: 330 99.7 3.2E-19 2E-22 122.0 18.4 288 61-371 1-330 (330) 107 protein:vir:93742 Length: 274 99.7 4.5E-18 2.8E-21 115.7 20.3 260 76-376 1-274 (274) 108 protein:vir:3613 Length: 272 # 99.7 3.7E-18 2.3E-21 116.1 18.0 255 76-370 1-272 (272) 109 protein:vir:80930 Length: 278 99.6 1.6E-16 9.9E-20 107.2 19.7 263 76-371 1-278 (278) 110 protein:vir:96123 Length: 274 99.6 5.7E-16 3.5E-19 104.1 19.6 260 76-374 1-274 (274) 111 protein:vir:105334 Length: 276 99.6 7E-16 4.3E-19 103.7 19.1 262 76-377 1-276 (276) 112 protein:vir:99424 Length: 360 99.6 1.1E-15 6.8E-19 102.6 20.1 309 36-372 1-360 (360) 113 protein:vir:94494 Length: 274 99.5 2.1E-15 1.3E-18 101.0 20.3 260 76-376 1-274 (274) 114 protein:vir:97433 Length: 274 99.5 2.1E-15 1.3E-18 101.0 20.3 260 76-376 1-274 (274) 115 protein:vir:96833 Length: 275 99.5 1.1E-15 6.8E-19 102.6 18.5 261 74-372 1-275 (275) 116 protein:vir:1239 Length: 274 # 99.4 2.8E-14 1.7E-17 94.9 19.2 260 76-376 1-274 (274) 117 protein:vir:96262 Length: 274 99.4 9.4E-14 5.8E-17 92.0 19.3 260 76-376 1-274 (274) 118 protein:vir:95898 Length: 274 99.4 9.4E-14 5.8E-17 92.0 19.3 260 76-376 1-274 (274) 119 protein:vir:79928 Length: 393 99.4 3.5E-13 2.2E-16 88.8 22.2 342 1-377 1-393 (393) 120 protein:vir:97255 Length: 310 99.4 2.3E-13 1.4E-16 89.9 19.1 271 62-369 1-310 (310) 121 protein:vir:95107 Length: 270 99.3 1.3E-12 7.9E-16 85.8 18.7 258 76-376 1-270 (270) 122 protein:vir:739 Length: 231 # 99.0 6.3E-11 3.9E-14 76.5 15.4 219 110-370 1-231 (231) 123 protein:vir:2201 Length: 345 # 98.9 1.1E-10 6.8E-14 75.2 15.0 289 62-368 1-345 (345) 124 protein:vir:93858 Length: 400 98.9 6E-10 3.7E-13 71.1 17.5 334 1-366 35-400 (400) 125 protein:vir:80213 Length: 334 98.9 3.6E-10 2.2E-13 72.4 15.4 289 57-370 1-334 (334) 126 protein:vir:10450 Length: 344 98.8 2E-10 1.2E-13 73.8 13.6 287 57-368 1-344 (344) 127 protein:vir:102605 Length: 273 98.8 1.3E-09 8.1E-13 69.3 18.0 254 82-370 1-273 (273) 128 protein:vir:105822 Length: 273 98.8 1.3E-09 8.1E-13 69.3 18.0 254 82-370 1-273 (273) 129 protein:vir:3364 Length: 347 # 98.8 4.4E-10 2.8E-13 71.9 14.7 291 56-370 1-347 (347) 130 protein:vir:7990 Length: 273 # 98.8 2.7E-09 1.7E-12 67.5 17.9 252 76-370 1-273 (273) 131 protein:vir:94576 Length: 347 98.7 1.2E-09 7.6E-13 69.5 15.2 285 56-368 1-347 (347) 132 protein:vir:8885 Length: 347 # 98.7 1.2E-09 7.5E-13 69.5 13.4 292 56-369 1-347 (347) 133 protein:vir:1541 Length: 347 # 98.6 3.8E-09 2.4E-12 66.8 15.1 298 56-370 1-347 (347) 134 protein:vir:78739 Length: 332 98.6 9.6E-10 5.9E-13 70.0 11.7 276 70-368 1-332 (332) 135 protein:vir:8324 Length: 410 # 98.6 8.4E-08 5.2E-11 59.4 19.9 324 1-370 47-410 (410) 136 protein:vir:95318 Length: 328 98.5 2.9E-08 1.8E-11 61.9 17.4 237 57-312 1-328 (328) 137 protein:vir:94711 Length: 347 98.5 2.7E-09 1.7E-12 67.5 10.7 287 57-369 1-347 (347) 138 protein:vir:103323 Length: 364 98.5 1.3E-07 7.8E-11 58.4 19.0 296 65-381 1-349 (364) 139 protein:vir:6324 Length: 335 # 98.5 2.8E-08 1.7E-11 62.0 15.4 294 62-378 1-335 (335) 140 protein:vir:7019 Length: 401 # 98.5 1.1E-08 7.1E-12 64.1 12.9 292 65-381 1-349 (401) 141 protein:vir:103759 Length: 330 98.4 2.5E-08 1.5E-11 62.3 14.6 237 57-312 1-330 (330) 142 protein:vir:78935 Length: 335 98.4 5E-08 3.1E-11 60.6 16.2 294 57-378 1-335 (335) 143 protein:vir:108211 Length: 318 98.4 6E-08 3.7E-11 60.2 14.7 287 65-378 1-318 (318) 144 protein:vir:100057 Length: 375 98.4 9.2E-08 5.7E-11 59.2 15.7 295 65-376 1-375 (375) 145 protein:vir:80180 Length: 381 98.3 1.1E-07 7.1E-11 58.6 15.9 285 73-381 1-317 (381) 146 protein:vir:99675 Length: 324 98.3 5.5E-08 3.4E-11 60.4 12.6 259 109-381 1-309 (324) 147 protein:vir:107826 Length: 331 98.2 1.8E-07 1.1E-10 57.6 14.7 236 57-312 1-331 (331) 148 protein:vir:98525 Length: 331 98.2 1.8E-07 1.1E-10 57.6 14.7 236 57-312 1-331 (331) 149 protein:vir:107388 Length: 331 98.2 1.8E-07 1.1E-10 57.6 14.7 236 57-312 1-331 (331) 150 protein:vir:5974 Length: 324 # 98.2 9.1E-07 5.7E-10 53.7 17.8 271 76-381 1-298 (324) 151 protein:vir:97031 Length: 402 98.2 2.7E-07 1.7E-10 56.6 14.2 296 65-381 1-345 (402) 152 protein:vir:94622 Length: 341 98.1 3.2E-07 2E-10 56.2 14.2 279 70-372 1-341 (341) 153 protein:vir:9927 Length: 295 # 98.1 6.2E-07 3.9E-10 54.6 14.3 259 76-376 1-295 (295) 154 protein:vir:102944 Length: 330 98.0 4.6E-06 2.8E-09 49.9 17.8 283 76-381 1-310 (330) 155 protein:vir:7324 Length: 335 # 98.0 1.6E-06 1E-09 52.3 15.2 239 57-313 1-335 (335) 156 protein:vir:105645 Length: 400 97.9 1.8E-06 1.1E-09 52.1 13.8 295 65-381 1-345 (400) 157 protein:vir:1583 Length: 351 # 97.8 1E-05 6.4E-09 47.9 17.5 279 76-381 1-308 (351) 158 protein:vir:103285 Length: 296 97.8 1.7E-05 1E-08 46.8 18.4 273 65-368 1-296 (296) 159 protein:vir:3136 Length: 322 # 97.7 4.9E-06 3E-09 49.7 13.8 280 76-372 1-322 (322) 160 protein:vir:80068 Length: 301 97.7 2.8E-05 1.7E-08 45.6 21.7 277 78-367 1-301 (301) 161 protein:vir:9875 Length: 296 # 97.7 3E-06 1.9E-09 50.9 12.4 267 59-374 1-296 (296) 162 protein:vir:106647 Length: 303 97.6 3.9E-06 2.4E-09 50.2 11.8 263 65-379 1-303 (303) 163 protein:vir:107687 Length: 319 97.2 0.00013 8.3E-08 41.8 20.4 288 62-367 1-319 (319) 164 protein:vir:102655 Length: 322 97.1 0.00016 9.7E-08 41.4 15.4 276 71-369 1-322 (322) 165 protein:vir:8843 Length: 317 # 96.7 0.00032 2E-07 39.7 14.1 281 78-370 1-317 (317) 166 protein:vir:104342 Length: 314 96.3 0.0008 5E-07 37.6 19.1 287 48-368 1-314 (314) 167 protein:vir:1153 Length: 338 # 95.5 0.0019 1.2E-06 35.5 18.1 291 65-369 1-338 (338) 168 protein:vir:79642 Length: 329 94.4 0.0044 2.7E-06 33.5 19.9 299 37-370 1-329 (329) 169 protein:vir:270 Length: 341 # 93.6 0.007 4.3E-06 32.4 17.0 302 61-377 1-341 (341) 170 protein:vir:99075 Length: 392 93.3 0.0079 4.9E-06 32.1 15.9 267 82-381 1-318 (392) 171 protein:vir:79157 Length: 339 92.3 0.012 7.5E-06 31.1 20.0 293 65-373 1-339 (339) 172 protein:vir:100331 Length: 342 92.0 0.013 8.2E-06 30.9 17.7 293 65-371 1-342 (342) 173 protein:vir:98566 Length: 355 90.7 0.019 1.2E-05 30.0 19.3 298 65-377 1-355 (355) 174 protein:vir:78777 Length: 358 90.7 0.019 1.2E-05 30.0 19.4 307 61-381 1-356 (358) 175 protein:vir:1781 Length: 221 # 90.5 0.011 7.1E-06 31.2 9.0 192 158-381 1-221 (221) 176 protein:vir:104011 Length: 337 89.9 0.023 1.5E-05 29.5 17.9 290 65-370 1-337 (337) 177 protein:vir:1829 Length: 355 # 89.8 0.024 1.5E-05 29.4 19.7 299 65-378 1-355 (355) 178 protein:vir:79171 Length: 337 89.4 0.026 1.6E-05 29.2 17.8 290 65-370 1-337 (337) 179 protein:vir:96792 Length: 315 89.0 0.029 1.8E-05 29.0 13.0 262 76-381 1-288 (315) 180 protein:vir:95131 Length: 325 88.3 0.033 2E-05 28.7 15.2 271 65-381 1-301 (325) 181 protein:vir:5694 Length: 357 # 87.9 0.036 2.2E-05 28.5 18.4 300 65-381 1-356 (357) 182 protein:vir:98856 Length: 343 87.4 0.039 2.4E-05 28.3 18.4 292 65-375 1-343 (343) 183 protein:vir:2016 Length: 357 # 85.1 0.055 3.4E-05 27.5 18.4 300 65-381 1-356 (357) 184 protein:vir:6061 Length: 357 # 84.8 0.058 3.6E-05 27.4 18.3 300 65-381 1-356 (357) 185 protein:vir:80446 Length: 367 84.6 0.059 3.7E-05 27.3 16.1 288 62-381 1-343 (367) 186 protein:vir:3746 Length: 336 # 81.6 0.085 5.2E-05 26.5 16.6 287 65-374 1-336 (336) 187 protein:vir:78186 Length: 337 81.6 0.085 5.2E-05 26.5 18.4 290 65-370 1-337 (337) 188 protein:vir:1663 Length: 393 # 80.6 0.094 5.8E-05 26.2 12.4 333 1-366 1-393 (393) 189 protein:vir:3783 Length: 336 # 79.1 0.11 6.7E-05 25.9 16.4 287 65-373 1-336 (336) 190 protein:vir:93966 Length: 400 78.1 0.12 7.3E-05 25.7 12.2 338 1-366 1-400 (400) 191 protein:vir:348 Length: 321 # 71.8 0.19 0.00012 24.5 14.2 286 63-370 1-321 (321) 192 protein:vir:103463 Length: 521 70.6 0.21 0.00013 24.3 14.3 345 1-381 1-504 (521) 193 protein:vir:94989 Length: 349 60.6 0.37 0.00023 23.0 17.1 286 76-381 1-323 (349) 194 protein:vir:5255 Length: 304 # 57.8 0.43 0.00026 22.6 16.3 272 81-365 1-304 (304) 195 protein:vir:94800 Length: 319 57.5 0.43 0.00027 22.6 18.2 286 52-381 1-307 (319) 196 protein:vir:97331 Length: 319 57.5 0.43 0.00027 22.6 18.2 286 52-381 1-307 (319) 197 protein:vir:78387 Length: 349 53.1 0.54 0.00033 22.1 16.9 286 76-381 1-323 (349) 198 protein:vir:861 Length: 318 # 47.8 0.69 0.00043 21.5 9.3 298 33-366 1-318 (318) 199 protein:vir:108303 Length: 418 45.1 0.78 0.00048 21.2 19.2 266 79-381 1-319 (418) 200 protein:vir:79548 Length: 652 40.4 0.97 0.0006 20.6 19.6 341 1-369 240-652 (652) 201 protein:vir:107120 Length: 329 39.9 1 0.00062 20.6 20.0 291 53-381 1-319 (329) 202 protein:vir:3525 Length: 423 # 34.5 1.3 0.0008 20.0 16.1 269 76-381 1-318 (423) 203 protein:vir:7214 Length: 521 # 30.8 1.6 0.00096 19.5 14.9 347 1-381 1-504 (521) 204 protein:vir:78558 Length: 336 29.8 1.6 0.001 19.4 14.0 309 20-370 1-336 (336) 205 protein:vir:3643 Length: 336 # 29.0 1.7 0.0011 19.3 16.1 303 37-370 1-336 (336) 206 protein:vir:100603 Length: 529 26.1 2 0.0012 19.0 14.5 347 1-381 1-513 (529) 207 protein:vir:101557 Length: 336 21.9 2.5 0.0016 18.4 15.2 309 20-370 1-336 (336) 208 protein:vir:105374 Length: 423 21.0 2.7 0.0017 18.2 17.3 270 82-381 1-318 (423) No 1 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=1.5e-102 Score=578.86 Aligned_cols=381 Identities=98% Similarity=1.367 Sum_probs=364.3 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHHHhhcc Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNV 80 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~ 80 (381) |+|++.++++++++++.+++++.+.++++.+.+....+++.++..+..+.+++++++..++.+.++++|+++++++++++ T Consensus 1 m~~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~l~~~e~~~~~~~~~~t 80 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQTLSANQRNFFMDINKSV 80 (381) T ss_pred CchhHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhcccccccCHHHHHHHHHHhhcC Confidence 99999999999999999999887777777777888888888888888888999999999999999999999999999999 Q ss_pred CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEe Q lcl|NC_019921. 81 NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAF 160 (381) Q Consensus 81 ~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~ 160 (381) +++|||+||+++.++|++.|++.||||++|++++++++.++|+.++.+.+.|++|.++++++++|+|++++|.+||++++ T Consensus 81 ~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~ 160 (381) T protein:vir:10 81 GYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAF 160 (381) T ss_pred CCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCcceEEEeecCCcceEEeecccccccccCccceeEeecceeEEee Confidence 99999999999999999999999999999999999999999999999999999999888888899999999999999999 Q ss_pred eeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhH Q lcl|NC_019921. 161 VVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATV 240 (381) Q Consensus 161 ~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~ 240 (381) ++||++||+||.+||++||+++++++|+++++.+|++|||++||+||++++.....+..++.++++..+++++.+...++ T Consensus 161 i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~ 240 (381) T protein:vir:10 161 VVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATV 240 (381) T ss_pred ccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCCCceeeeecCCccccccccccccccccccccccchhhHH Confidence 99999999999999999999999999999999999999999999999999888888888888888888999999999999 Q ss_pred HHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccccCCCceeEecCCCCCCcEEEEeecceE Q lcl|NC_019921. 241 NELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKVLTYVKGLYD 320 (381) Q Consensus 241 ~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l~~G~pVv~s~~~p~~~i~fgd~~~y~ 320 (381) +.+.++++.++.......+.|++|++|+|||.|++.+++.++.++++|+|+|.+|+|+||+++++||+++|+||||++|+ T Consensus 241 ~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~~lp~g~~vv~~~~~p~~~i~fGDfs~Y~ 320 (381) T protein:vir:10 241 NELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKVLTYVKGLYD 320 (381) T ss_pred HHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCCCCceeecCCCCceeEEcCCCCcCcEEEEEcccEE Confidence 99999999998888888888999999999999999999988999999999999999999999999999999999999999 Q ss_pred EEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 321 GYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 321 i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) |++|++++|++|+|.+|.+|+++||+++|+||+|++++||+|++|++.+++|+++.+.+|| T Consensus 321 i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:10 321 GYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEDTEETL 381 (381) T ss_pred EEEecccEEEeechhhhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCCccccccccccC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=1.4e-101 Score=573.53 Aligned_cols=381 Identities=100% Similarity=1.381 Sum_probs=365.5 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHHHhhcc Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNV 80 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~ 80 (381) |+|++.++++++++++.+++++.+.++++.+.+..+++.+.++..++.+.++++++...++++.++++|+++|+++.+++ T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~~~~~~~ 80 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNV 80 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHHHHHHHhccc Confidence 99999999999999999999887777777778888888888888888888999999999999999999999999999999 Q ss_pred CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEe Q lcl|NC_019921. 81 NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAF 160 (381) Q Consensus 81 ~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~ 160 (381) +++|||+||++++++|++.|++.||||++|++++++++.++|+.++.+.+.|++|.++++++++|+|++++|.+|||+++ T Consensus 81 ~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~ 160 (381) T protein:vir:10 81 NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAF 160 (381) T ss_pred CCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcceEEEEecCCcceeeecccccccccccccceeeeecceeEEee Confidence 99999999999999999999999999999999999999999999999999999999888878899999999999999999 Q ss_pred eeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhH Q lcl|NC_019921. 161 VVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATV 240 (381) Q Consensus 161 ~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~ 240 (381) ++||++||+||.+||++||+++++++|+++++.+|++|+|++||+||++++....+++++++++++..++++..++..++ T Consensus 161 ~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~ 240 (381) T protein:vir:10 161 VVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATV 240 (381) T ss_pred chhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhH Confidence 99999999999999999999999999999999999999999999999999998888889998888899999999999999 Q ss_pred HHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccccCCCceeEecCCCCCCcEEEEeecceE Q lcl|NC_019921. 241 NELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKVLTYVKGLYD 320 (381) Q Consensus 241 ~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l~~G~pVv~s~~~p~~~i~fgd~~~y~ 320 (381) +.+.++++.++.........|+++++|+|||.|++.++++.+.++++|+|+|.+|+|++|+++++||+++|+||||++|+ T Consensus 241 ~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l~~g~~vv~s~~~p~~~iifgDfs~Y~ 320 (381) T protein:vir:10 241 NELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKVLTYVKGLYD 320 (381) T ss_pred HHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecCCCCceEEecCCCCcCcEEEEecccEE Confidence 99999999999888888889999999999999999999998999999999999999999999999999999999999999 Q ss_pred EEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 321 GYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 321 i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) |++|++++|++|+|.+|.+|+++||+++|+||+|++++||||++|++++.+++++++.+|| T Consensus 321 i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:10 321 GYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) T ss_pred EEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCcccccccC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=1.4e-101 Score=573.53 Aligned_cols=381 Identities=100% Similarity=1.381 Sum_probs=365.5 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHHHhhcc Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNV 80 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~ 80 (381) |+|++.++++++++++.+++++.+.++++.+.+..+++.+.++..++.+.++++++...++++.++++|+++|+++.+++ T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~~~~~~~ 80 (381) T protein:vir:95 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNV 80 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHHHHHHHhccc Confidence 99999999999999999999887777777778888888888888888888999999999999999999999999999999 Q ss_pred CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEe Q lcl|NC_019921. 81 NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAF 160 (381) Q Consensus 81 ~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~ 160 (381) +++|||+||++++++|++.|++.||||++|++++++++.++|+.++.+.+.|++|.++++++++|+|++++|.+|||+++ T Consensus 81 ~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~ 160 (381) T protein:vir:95 81 NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAF 160 (381) T ss_pred CCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcceEEEEecCCcceeeecccccccccccccceeeeecceeEEee Confidence 99999999999999999999999999999999999999999999999999999999888878899999999999999999 Q ss_pred eeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhH Q lcl|NC_019921. 161 VVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATV 240 (381) Q Consensus 161 ~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~ 240 (381) ++||++||+||.+||++||+++++++|+++++.+|++|+|++||+||++++....+++++++++++..++++..++..++ T Consensus 161 ~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~ 240 (381) T protein:vir:95 161 VVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATV 240 (381) T ss_pred chhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhH Confidence 99999999999999999999999999999999999999999999999999998888889998888899999999999999 Q ss_pred HHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccccCCCceeEecCCCCCCcEEEEeecceE Q lcl|NC_019921. 241 NELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKVLTYVKGLYD 320 (381) Q Consensus 241 ~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l~~G~pVv~s~~~p~~~i~fgd~~~y~ 320 (381) +.+.++++.++.........|+++++|+|||.|++.++++.+.++++|+|+|.+|+|++|+++++||+++|+||||++|+ T Consensus 241 ~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l~~g~~vv~s~~~p~~~iifgDfs~Y~ 320 (381) T protein:vir:95 241 NELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKVLTYVKGLYD 320 (381) T ss_pred HHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecCCCCceEEecCCCCcCcEEEEecccEE Confidence 99999999999888888889999999999999999999998999999999999999999999999999999999999999 Q ss_pred EEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 321 GYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 321 i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) |++|++++|++|+|.+|.+|+++||+++|+||+|++++||||++|++++.+++++++.+|| T Consensus 321 i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:95 321 GYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) T ss_pred EEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCcccccccC Confidence 9999999999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=1.9e-93 Score=528.95 Aligned_cols=376 Identities=57% Similarity=0.952 Sum_probs=341.3 Q ss_pred CchhHHHHHH---HHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHhhccccccCHHHHHHH Q lcl|NC_019921. 1 MTINLSETFA---NAKNEFINAVNNGEPQERQNELYGDMINQLFEETK----LQAKAEAERVSSLPKSAQSLSANQRSFF 73 (381) Q Consensus 1 mt~el~~~~~---~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~lt~~e~~~~ 73 (381) |+++++++.. ++++++.+.+++.+.++++.+.+.++.+.+.++.. ++.+...+......++.+.++.+|++++ T Consensus 1 M~~kl~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~lt~~e~~~~ 80 (383) T protein:vir:78 1 MTIKLKNNLANYEEKRTAFVNAVKNEDTQEIQNKAYVEMVDAMAADIMEQAKKEARQEADAYISASRTDKNITNEEIKFF 80 (383) T ss_pred CchhHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhHHHHHHH Confidence 9999988774 66777777777777777777777777766554443 3444555666778888899999999999 Q ss_pred HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceEEeecccccccccCcceeeEeec Q lcl|NC_019921. 74 MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAI 153 (381) Q Consensus 74 ~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~ 153 (381) ++++++++++|||+||++++++|++.|+++||||++|++++++|+.++|+.++.+.+.|++|.++++++++|+|++++|. T Consensus 81 ~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~ 160 (383) T protein:vir:78 81 NDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGLRTKFLKSETSGVAVWGKIFGEIKGQLDATFSDEESI 160 (383) T ss_pred HHHhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCCceEEEEEcCCcceEEeecccccccccCcceeeEeec Confidence 99999999999999999999999999999999999999999999999999999999999999988887889999999999 Q ss_pred ceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecc Q lcl|NC_019921. 154 QNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTF 233 (381) Q Consensus 154 ~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~ 233 (381) +|||+++++||++||+||.+||++||+++++++|+++++.+||+|+|++||+||++++.....+.+++.++.+..+++++ T Consensus 161 ~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (383) T protein:vir:78 161 QNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATGTLTF 240 (383) T ss_pred ceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccCCcccccccccccccccchhhh Confidence 99999999999999999999999999999999999999999999999999999999888778888888888888888888 Q ss_pred cccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccccCCCceeEecCCCCCCcEEE Q lcl|NC_019921. 234 ANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKVLT 313 (381) Q Consensus 234 ~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l~~G~pVv~s~~~p~~~i~f 313 (381) .+...+++.+..+.+..+...++....+.++++|+||+.+++.+++..+.++++|+|++.+|+|++|+++++||+++|+| T Consensus 241 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~t~l~~~~~iv~s~~~p~~~iif 320 (383) T protein:vir:78 241 ANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYTSLNANGVYVTALPFNLNIIESLFVPEKKAIS 320 (383) T ss_pred hhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccchhccCCCCceeeecCCCceEEecCCCCcccEEE Confidence 88888888888877777777777777788999999999999999988888999999999999999999999999999999 Q ss_pred EeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcccccc Q lcl|NC_019921. 314 YVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 314 gd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~ 376 (381) |||++|+|++|++++|++|+|.+|.+|+++||+++|+||+|++++||+|++|++++.++..+| T Consensus 321 gdfs~Y~i~~r~~~~i~~~~~~~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~~~~~~~~~ 383 (383) T protein:vir:78 321 YVAERYDALIGGPLDIGTYDQTLAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNINPAEQTPEG 383 (383) T ss_pred eeccceEEEecccceEEecchhhhhcCceEEEEEEEEcCEEecCCeEEEEEEEecCCCCCCCC Confidence 999999999999999999999999999999999999999999999999999999888877777 No 5 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=6.9e-92 Score=520.34 Aligned_cols=368 Identities=39% Similarity=0.644 Sum_probs=334.7 Q ss_pred CchhHH--HHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHHHhh Q lcl|NC_019921. 1 MTINLS--ETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINK 78 (381) Q Consensus 1 mt~el~--~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~ 78 (381) |||.++ +++.++++++.+++++.+.++++.+.++++.+.+.++..++.+.+.++.+...+..+.++++|+++++++.+ T Consensus 1 M~i~~~~~~~~~e~~~~l~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~ 80 (377) T protein:vir:96 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDK 80 (377) T ss_pred CCccHHHHHHHHHHHHHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHHHHh Confidence 777664 478888999999998877788888889999888988888888888899999899999999999999987654 Q ss_pred -ccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceEEeecccccccccCcceeeEeecceeE Q lcl|NC_019921. 79 -NVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKL 157 (381) Q Consensus 79 -~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl 157 (381) +++++|||+||+++.++|++.+.+.||||++|+++++++.+++|+.++.+.+.|++|.++++++++|+|++++|.+||+ T Consensus 81 ~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl 160 (377) T protein:vir:96 81 NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKL 160 (377) T ss_pred cCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceeEeecccccccccCccceeEeeeeeeE Confidence 5677899999999999999999999999999999999999999999999999999999888878899999999999999 Q ss_pred EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeecccccccccc------ccccceeeeeee Q lcl|NC_019921. 158 TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTE------GAYPEKEEQGTL 231 (381) Q Consensus 158 ~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~------~~~~~~~~~~~~ 231 (381) +++++||++||+||.+|+++||+++++++|+++++.+|++|+|++||+|||+++........ +.++++...+++ T Consensus 161 ~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (377) T protein:vir:96 161 TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADL 240 (377) T ss_pred EeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccccccceeecccccccc Confidence 99999999999999999999999999999999999999999999999999997765443322 234445566777 Q ss_pred cccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccccCCCceeEecCCCCCCcE Q lcl|NC_019921. 232 TFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKV 311 (381) Q Consensus 232 t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l~~G~pVv~s~~~p~~~i 311 (381) +.++++..++.+.++.+.++.+..+.+..+.++++|+|||.|++.+++.+.+++++|+|++.+|+|++|++|++||+++| T Consensus 241 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l~~p~~v~~s~~~p~~~i 320 (377) T protein:vir:96 241 SDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTVLPHGITILESLAVETGKA 320 (377) T ss_pred ccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhccccccccCCCCCceeccCCCceEEecCCCCcccE Confidence 78888899999999999888887777788899999999999999998888899999999999999999999999999999 Q ss_pred EEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEec Q lcl|NC_019921. 312 LTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 312 ~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~ 368 (381) +||||++|+|++|++++|++|+|++|.+|+++||+++|+||+|++++||+|++|++- T Consensus 321 ~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 321 IAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEEcCcEEEEEecccEEEeehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 999999999999999999999999999999999999999999999999999998885 No 6 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=6.8e-87 Score=492.98 Aligned_cols=368 Identities=39% Similarity=0.645 Sum_probs=314.3 Q ss_pred CchhHH--HHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHHH-h Q lcl|NC_019921. 1 MTINLS--ETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDI-N 77 (381) Q Consensus 1 mt~el~--~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~-~ 77 (381) |+|+++ ++++++++++.+++++....+++.+.++.+.+.+.++...+.+.+.++++...+..+.++++|+++++++ . T Consensus 1 M~i~~k~~~~~~~~~~~l~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~ 80 (377) T protein:vir:98 1 MAINLKELPKYREAVAELSAKISAGATSEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDK 80 (377) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHHHHh Confidence 766654 4778888888888887776777788888888888888888888899999999999999999999999866 4 Q ss_pred hccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceEEeecccccccccCcceeeEeecceeE Q lcl|NC_019921. 78 KNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKL 157 (381) Q Consensus 78 ~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl 157 (381) .+++++|||+||+++.++|++.+.+.||||++|++++++|+.++|+.++.+.+.|++|.++++++++|+|++++|.+||+ T Consensus 81 ~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl 160 (377) T protein:vir:98 81 NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKL 160 (377) T ss_pred ccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCcceEEEEecCCcceeEeecccccCcccCccceeEeecceeE Confidence 56788899999999999999999999999999999999999999999999999999999888878899999999999999 Q ss_pred EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeecccccccccccc------ccceeeeeee Q lcl|NC_019921. 158 TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGA------YPEKEEQGTL 231 (381) Q Consensus 158 ~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~------~~~~~~~~~~ 231 (381) +++++||++||+||.+|+++||+++++++|+++++.+|++|+|++||+|||+.+.......... ++++.....+ T Consensus 161 ~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 240 (377) T protein:vir:98 161 TAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADL 240 (377) T ss_pred EeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccccccccccccchhhhHhhh Confidence 9999999999999999999999999999999999999999999999999998765443322221 1111222222 Q ss_pred cccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccccCCCceeEecCCCCCCcE Q lcl|NC_019921. 232 TFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEAGKV 311 (381) Q Consensus 232 t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l~~G~pVv~s~~~p~~~i 311 (381) ....+.........+...++.....+.....++.+|+|||.+++.+++..+.++++|+|++.||+|++|++|++||+++| T Consensus 241 ~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i 320 (377) T protein:vir:98 241 SDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVETGKA 320 (377) T ss_pred hhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCccccccCCCceEEecCCCCcccE Confidence 22222223333333444444444455567789999999999999998888889999999999999999999999999999 Q ss_pred EEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEec Q lcl|NC_019921. 312 LTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 312 ~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~ 368 (381) +||||++|+|++|++++|++|+|++|.+|+++||+++|+||+|++++||++++|.+= T Consensus 321 ~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 321 IAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEEecceeEEeecceEEEeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 999999999999999999999999999999999999999999999999999887774 No 7 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=6.8e-86 Score=487.49 Aligned_cols=377 Identities=44% Similarity=0.765 Sum_probs=322.3 Q ss_pred CchhHH-----HHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHhhccccccCHHHH Q lcl|NC_019921. 1 MTINLS-----ETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAE-----RVSSLPKSAQSLSANQR 70 (381) Q Consensus 1 mt~el~-----~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~lt~~e~ 70 (381) ||+.+. ++++++++++.+++++....+++.+.+.++++.+..+..++.+.+.+ +.....++.+.++.+|+ T Consensus 1 mt~~~~~~e~~~~~~e~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~r~~~~l~~ee~ 80 (395) T protein:vir:95 1 MADMKQNNVKLKNYHEHKKQFANLVQNGASDEEQSKAFGAMFDALSNDLQEEITAEINNRVVDNGILAKRSQDPLTSEER 80 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccchHHH Confidence 776533 33345555666666666666666777777766665444333333222 23445677888999999 Q ss_pred HHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceEEeecccccccccCcceeeE Q lcl|NC_019921. 71 SFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 71 ~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v 150 (381) ++++++.++++++|||+||++++++|++.+++.+|||++|++++++|+.++|+.++.+.+.|++|.++++++++|+|+++ T Consensus 81 ~~~~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 160 (395) T protein:vir:95 81 KFFNDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGIKTRVIKADPAGQAVWGKVFGEIKGQLDAAFREE 160 (395) T ss_pred HHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceEEeecccccCccccccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999888887889999999 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCC--cceEeeeccccccccccccccceeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD--QPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~--~P~Gil~~~~~~~~~~~~~~~~~~~~ 228 (381) +|.+|+++++++||++||+||.+|+++||+++|+++|+++++.+|++|+|++ ||+||++++...... ....... T Consensus 161 ~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~----~~~~~~~ 236 (395) T protein:vir:95 161 NFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTNSGA----VTDKASS 236 (395) T ss_pred eeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeecccccccc----ccccccc Confidence 9999999999999999999999999999999999999999999999999986 799999876543322 1222334 Q ss_pred eeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccccCCCceeEecCCCCC Q lcl|NC_019921. 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALPFNLNVIESTVQEA 308 (381) Q Consensus 229 ~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l~~G~pVv~s~~~p~ 308 (381) ++++..+...++..+.++++.++...+.....+.++++|+|||.|++.+++.+++++++|+|++.++||+||++|++||+ T Consensus 237 ~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~~~~~~~~G~~~~~lg~g~~v~~~~~~p~ 316 (395) T protein:vir:95 237 GTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQARYTYLTANGGFVTVLPYNVTIITSEFVPE 316 (395) T ss_pred chhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCcceeccCCCcceeccCCcceEEEcCCCCC Confidence 45555666677777888888888777777888999999999999999998888889999999999999999999999999 Q ss_pred CcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 309 GKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 309 ~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) ++|+||||++|+|++|++++|++++|.+|.+|+++||+++|+||+|++++||+|++|+++...+.++-+|.|- T Consensus 317 ~~i~fgdfs~y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~~~~~~~~~~~~~~ 389 (395) T protein:vir:95 317 GKLVAFVTDRYNAVRGGGLTVKKFDQTLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVASAPRRQTSAGGTT 389 (395) T ss_pred CcEEEEecccEEEEEecceEEEeccchhhhCCcEEEEEEEEECCEEeccccEEEEEeeccCCCCCCCCCCCCC Confidence 9999999999999999999999999999999999999999999999999999999999988888888888777 No 8 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=6.2e-76 Score=432.89 Aligned_cols=369 Identities=27% Similarity=0.423 Sum_probs=286.7 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHhhccccccCHHHHHHHHH Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEA-----ERVSSLPKSAQSLSANQRSFFMD 75 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~lt~~e~~~~~~ 75 (381) |.- +.++..+.++++.+++++....+++.+.+.++.+.+.++...+.+.+. .......++.+.++.++|+++++ T Consensus 4 L~e-~~~e~~e~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~ 82 (390) T protein:vir:40 4 LDK-KDSETLNISTAFLNAIKEGATEAEQVTAFTNMAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESKYYNE 82 (390) T ss_pred HHH-HHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHHHHHH Confidence 222 334455666667777766666666667777776666554444433322 22344566778899999999987 Q ss_pred Hhh-ccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc-eEEEEecCCcceEEeecccccccccCcceeeEeec Q lcl|NC_019921. 76 INK-NVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAI 153 (381) Q Consensus 76 ~~~-~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~-~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~ 153 (381) +.. ++.++||++||++++++|++.+++.+||+++|+++|+++. ..+|+.++.+.+.|++|+++++++++++|++++|. T Consensus 83 ~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~ 162 (390) T protein:vir:40 83 VIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKIQTG 162 (390) T ss_pred HHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEcCCcceeeeccccccCccccccceeeEee Confidence 655 5577899999999999999999999999999999998765 77899999999999999988887789999999999 Q ss_pred ceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecc Q lcl|NC_019921. 154 QNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTF 233 (381) Q Consensus 154 ~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~ 233 (381) +|+++++++||++||+||.+|+++||+++|+++++++++.+|++|+|+++|.||++.+.......... .....++. T Consensus 163 ~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~----~~~~~~t~ 238 (390) T protein:vir:40 163 MYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPV----KTATPLTD 238 (390) T ss_pred eeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeecccccccccccc----ccccccch Confidence 99999999999999999999999999999999999999999999999999999998765433222111 11122333 Q ss_pred cccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHH-hhhhhccCCCCccccc-cCCCceeEecCCCCCCcE Q lcl|NC_019921. 234 ANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEV-QAQYTHLNANGVYVTA-LPFNLNVIESTVQEAGKV 311 (381) Q Consensus 234 ~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~-~~~~~~~~~~G~~~~~-l~~G~pVv~s~~~p~~~i 311 (381) .+...++..+. ..+ ......+.++++|+||+.|++.. ......++++|+|+|. +++|+||+++++||+++| T Consensus 239 ~~~~~~~~~l~---~~~----~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~v~~~~~~g~pvv~~~~~p~~~i 311 (390) T protein:vir:40 239 LTPATLATKVM---LPL----TDNGKKSVSDAILVINPADYWSKIYAATSYMTPQGVWVTGILPVPLEIVQSVAVPVGKA 311 (390) T ss_pred hhHHHHHHHHH---HHh----hcchhhhhcCceEEEcchhHHHHHHHHhhccCCCCccccccCCCceeEEEcCCCCCCcE Confidence 33222222222 221 22233456789999999997654 3455678999999987 458999999999999999 Q ss_pred EEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccc-------cCcccC Q lcl|NC_019921. 312 LTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALE-------GTEETL 381 (381) Q Consensus 312 ~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~-------~~~~~~ 381 (381) +||||++|++++|++++|++++|.+|.+|+++||+++|+||++++++||++++++-++..+++. ..|+|- T Consensus 312 ~~Gd~s~~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 388 (390) T protein:vir:40 312 VAGRAKDYFMGIGSEQVIRTSTEYRLLDDETLYYAKQYANGRPKDNSSFLVFDITGLEGSPAIDVNVVNNATPSETP 388 (390) T ss_pred EEEeeceEEEEeecceEEEecchhhhhcCcEEEEEEEEeCCEEecccceEEEEeeccCCCCCCCcceeeCCCCCCCC Confidence 9999999999999999999999999999999999999999999999999998888776653321 122222 No 9 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=3.3e-68 Score=390.51 Aligned_cols=373 Identities=12% Similarity=0.136 Sum_probs=253.1 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhhhHH------HHHHHHHHH-------HHHHHHHHHH-HHH----------------- Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNGEPQE------RQNELYGDM-------INQLFEETKL-QAK----------------- 49 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~~~~~------~~~~~~~~~-------~~~~~~~~~~-~~~----------------- 49 (381) --++.++++.+..+++..++.+...++ ++...++.. .+.+.++... +.+ T Consensus 21 el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~~le~el~e~~~~~~~~~~~~~~ 100 (466) T protein:vir:80 21 ELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIKELENELEQLNNKEPKNNSEPAQ 100 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCchhHH Confidence 001122222222222222221111100 000011110 0001000000 000 Q ss_pred --HHH----------HHHHH-----hhccccccCHHHHHHHHHH----hh-ccCCCCceeccHHHHHHHHHHHHhhhhhh Q lcl|NC_019921. 50 --AEA----------ERVSS-----LPKSAQSLSANQRSFFMDI----NK-NVNYKEEKLLPEETIDRIFEDLTTNHPLL 107 (381) Q Consensus 50 --~~~----------~~~~~-----~~~~~~~lt~~e~~~~~~~----~~-~~~~~gg~lvP~~~~~~I~~~l~~~~~l~ 107 (381) ... .+... ..+....+..+++.++... .. .+.++|+++||+++++.|++.+++++||+ T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~ 180 (466) T protein:vir:80 101 VSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLI 180 (466) T ss_pred HHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhh Confidence 000 00000 0000011112222222221 22 23455678999999999999999999999 Q ss_pred hhceeEecCCceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHH Q lcl|NC_019921. 108 ADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAF 187 (381) Q Consensus 108 ~~~~v~~~~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~ 187 (381) ++|++.+++|..++|+....+.+.|++|++.++ +++|+|++|++.+|+++++++||++||+||.+|+++||+++|+++| T Consensus 181 ~~~~v~~~~g~~~~~~~~~~~~a~wv~E~~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~ 259 (466) T protein:vir:80 181 SKVRLRPLKGTARQNIAGAIPEGVWTEAVANLN-ELSLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNLADEILDAIGQAI 259 (466) T ss_pred hheeeeecCceeEeeeecCCcceeecccccccc-cccccccceeecceeeeeehhhhHHHHhcchHHHHHHHHHHHHHHH Confidence 999999999999999999889999999988876 5789999999999999999999999999999999999999999999 Q ss_pred HHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccccch----------hHHHHHHHHHHhhhccccc Q lcl|NC_019921. 188 AVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRA----------TVNELTQVFKYHSTNEKGK 257 (381) Q Consensus 188 ~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~----------~~~~l~~l~~~l~~~~~~~ 257 (381) +++++.+||+|+|+++|+|||+.+............ ....+..+... ....+.++...+ ...+ T Consensus 260 ~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 332 (466) T protein:vir:80 260 GFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTK----APAWTNLSTTNLLKIDPTGKSAEEFFSELVLKL---SKAR 332 (466) T ss_pred HHHHhhheeeccCCCCcceeeecccccccccccccc----cccccccchhhhhhhhhhccchhhHHHHHHHHH---Hhhh Confidence 999999999999999999999876544333221111 01111111111 111122221111 1123 Q ss_pred cccccCceEEEEchhhHHHHhhhhhccCCCCcccccc-----CCCceeEecCCCCCCcEEEEeecceEEEeecceEEEee Q lcl|NC_019921. 258 SVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL-----PFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKF 332 (381) Q Consensus 258 ~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l-----~~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~ 332 (381) ....+++.+|+||+.++..+++.....+++|.|++.. .+|+||+++++||+++++||||++|+|++|++++|.+| T Consensus 333 ~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~~~~~i~G~pvv~s~~~~~~~~~~g~~~~y~i~~r~~~~i~~~ 412 (466) T protein:vir:80 333 ANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLNNTMPIVGGDIVILDFIPDNDIIGGYGSLYLLAERADIKLAQS 412 (466) T ss_pred ccccCCceeEEecchhHHHhhcccccccCCccccccCCCcccccccceeecCccCccceeeeccccEEEEeecceEEEec Confidence 4456677889999999888887766678888888653 47999999999999999999999999999999999999 Q ss_pred hhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 333 KETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 333 ~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) +|.+|.+|+++||+++|+||+|++++|||++++....+...++.+|++- T Consensus 413 ~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~~~~~~~~~~~ 461 (466) T protein:vir:80 413 EHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPTTSITFAPDEA 461 (466) T ss_pred hhhhhhcCcEEEEEEEEEccEEeccCceEEEEecCCCcccceeeecCcC Confidence 9999999999999999999999999999998877777777777777777 No 10 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=3.6e-67 Score=384.83 Aligned_cols=353 Identities=17% Similarity=0.180 Sum_probs=245.2 Q ss_pred CchhHHH-------------HHHHHHHHHHHHHhhhhhHHHHH------HHHHHHHHHHHHHH---HHHHHH---HHHH- Q lcl|NC_019921. 1 MTINLSE-------------TFANAKNEFINAVNNGEPQERQN------ELYGDMINQLFEET---KLQAKA---EAER- 54 (381) Q Consensus 1 mt~el~~-------------~~~~~~~~~~~~~k~~~~~~~~~------~~~~~~~~~~~~~~---~~~~~~---~~~~- 54 (381) |.-|+++ ++.++++++..++.+.+.+++.. +.++...+.+.+.. ..+... +.++ T Consensus 8 ~~~el~~~~~~l~el~~~~~el~~~~~el~~~~e~ak~eee~~~l~~ei~~le~e~~~l~~~~~~le~~~~~~~~~l~~~ 87 (425) T protein:vir:95 8 LTKKIEQRKAALDELVKREQELQAKAAELEQAIEEAQTEEEVSAVEEEVAKLEDERNELNEKKSKLEGEIAQLEDELEQI 87 (425) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 1111111 11111111111111111111000 00000001100000 000000 0000 Q ss_pred ----------------------------HHHhhccccccCHHHHHHHHHHhh-ccCCCCceeccHHHHHHHHHHHHhhhh Q lcl|NC_019921. 55 ----------------------------VSSLPKSAQSLSANQRSFFMDINK-NVNYKEEKLLPEETIDRIFEDLTTNHP 105 (381) Q Consensus 55 ----------------------------~~~~~~~~~~lt~~e~~~~~~~~~-~~~~~gg~lvP~~~~~~I~~~l~~~~~ 105 (381) ......+....+.+.+++.+.+.. .+.++||++||+++.+.|++.+++.++ T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~ 167 (425) T protein:vir:95 88 NSKQPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGDYTT 167 (425) T ss_pred hhhccchhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHHhhhh Confidence 000001111122233333333332 355679999999999999999999999 Q ss_pred hhhhceeEecCCceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHH Q lcl|NC_019921. 106 LLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEE 185 (381) Q Consensus 106 l~~~~~v~~~~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~ 185 (381) |+++|++++++|++++|+..+.+.+.|++|+++.+....++|++|++.+|+++++++||++||+||.+++++||+++|++ T Consensus 168 i~~~~~~~~~~g~~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ 247 (425) T protein:vir:95 168 LYPLVDKIRVKGTTRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIAR 247 (425) T ss_pred HHHhhceeecCceeEEEEecCCccccccccccccccccccccceeeeeheeeeeeehhhHHHHhccHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999887666689999999999999999999999999999999999999999 Q ss_pred HHHHHHhhheeeccCC--CcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccC Q lcl|NC_019921. 186 AFAVALETAFLKGTGK--DQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKG 263 (381) Q Consensus 186 ~~~~~~~~a~i~G~G~--~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~ 263 (381) +|++++|.+||+|+|+ ++|+||++.++....... .....+++.+.+++..+.. ...+.+ T Consensus 248 ~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~-----~~~~~~ 308 (425) T protein:vir:95 248 AIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTV--------------EADNNLLKNLVKQIGLIDT-----GDDSVG 308 (425) T ss_pred HHHHHHHHHhhccCCCCccccceeeccccccccccc--------------ccccchHHHHHHHHHhhhh-----hccccC Confidence 9999999999999996 489999986554322111 1112244555555443321 112347 Q ss_pred ceEEEEchhhHHH-HhhhhhccCCCCccccccC-------CCceeEecCCCCCCcEEEEeecceEEEeecceEEEeehhh Q lcl|NC_019921. 264 NVTMVVNPSDAFE-VQAQYTHLNANGVYVTALP-------FNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKET 335 (381) Q Consensus 264 ~a~~~mn~~t~~~-~~~~~~~~~~~G~~~~~l~-------~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~ 335 (381) +++|+||+.|++. +......+|.+|+|+|.+| ||+||++|++||++.|+||||++|++++|++++|.+|+|. T Consensus 309 ~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~~~~~~~l~G~pvv~~~~~~~~~i~~Gd~~~~~~~~~~~~~i~~~~~~ 388 (425) T protein:vir:95 309 EIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLPNLRTPDLLGLRVVFNNFLDDDTVLFGEFEQYTLVERENITIDSSTHV 388 (425) T ss_pred ceEEEEeChHHHHHHHHHHhhcCCCCceeeccCCCCCccccceeeEEcCcCCCccEEEEecccEEEEeecceEEEeeccc Confidence 8899999999765 4445567899999998643 7999999999999999999999999999999999999999 Q ss_pred hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcc Q lcl|NC_019921. 336 LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 336 ~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~ 372 (381) +|.+|+++||++.|+||+|++|+||++++++.....+ T Consensus 389 ~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~g~ 425 (425) T protein:vir:95 389 KFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPVQGA 425 (425) T ss_pred ccccCceEEEEEEeeCcEeecccceEEEEecCcCCCC Confidence 9999999999999999999999999998877633333 No 11 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=5.6e-66 Score=378.33 Aligned_cols=353 Identities=12% Similarity=0.122 Sum_probs=250.0 Q ss_pred CchhHHHHHHHHHHHHHHHHhh---hhhH-----HHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHH-hh---c-ccccc Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNN---GEPQ-----ERQNELYGDMINQLFEETKL--QAKAEAERVSS-LP---K-SAQSL 65 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~---~~~~-----~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~-~~---~-~~~~l 65 (381) ||+++++ +.+..+++...+++ ...+ +++........+.+...... ....+.++... .. . ..... T Consensus 1 m~~~lk~-l~~~~~el~~~~~~~k~~~~~~~~~~e~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (401) T protein:vir:44 1 MAVDIKD-VEQVAQELQQKFDDFKAKNDKRVEAIEQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPARGAQNKV 79 (401) T ss_pred CCccHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccch Confidence 9988653 33333333332221 1100 11111112222222111110 00000000000 00 0 11111 Q ss_pred CHHHHHHH-----------------HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCC Q lcl|NC_019921. 66 SANQRSFF-----------------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETS 127 (381) Q Consensus 66 t~~e~~~~-----------------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~~ 127 (381) ..++++.| .++..+++++||++||++++++|++.+++.++|+++|+++++++ ..++|+..+. T Consensus 80 ~~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 159 (401) T protein:vir:44 80 AAEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGG 159 (401) T ss_pred hHHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCC Confidence 22223222 23456677899999999999999999999999999999999865 4789999888 Q ss_pred cceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEe Q lcl|NC_019921. 128 GVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGL 207 (381) Q Consensus 128 ~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gi 207 (381) +.+.|++|++..+....++|++|++.+||++++++||+|||+||.+|+++||.++|+++++++++.+|++|+|+++|+|| T Consensus 160 ~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gi 239 (401) T protein:vir:44 160 TASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGF 239 (401) T ss_pred ccceeeccccccCccccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCcccee Confidence 99999999888776677999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCC Q lcl|NC_019921. 208 NRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNAN 287 (381) Q Consensus 208 l~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~ 287 (381) |+............. .......+......+++.+.+++..+ +..|+.|++|+||+.++..+.. ++|.+ T Consensus 240 l~~~~~~~~~~~~~~--~~~~~~~t~~~~~~~~d~i~~~~~~l-------~~~~~~~a~~v~n~~~~~~L~~---lkd~~ 307 (401) T protein:vir:44 240 LAYESTEESDKARAF--GKLQHIVSGEATAVTADAIIKLIYTL-------RKAHRTGAKFMMNNNSLFAIRL---LKDTE 307 (401) T ss_pred ecccccccccccccc--ccccccccccccccCHHHHHHHHHhc-------chhhhcCCEEEEcHHHHHHHHH---hhccC Confidence 976543322211111 11111222333445677777777654 4568899999999999877765 47889 Q ss_pred Cccccc---------cCCCceeEecCCCCCC-----cEEEEeecc-eEEEeecceEEEeehhhhhhcCceEEEEEEEEcC Q lcl|NC_019921. 288 GVYVTA---------LPFNLNVIESTVQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKETLALDDMDLYTAKQFAYG 352 (381) Q Consensus 288 G~~~~~---------l~~G~pVv~s~~~p~~-----~i~fgd~~~-y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dG 352 (381) |+|+|. ..+|+||+.+++||.. .|+||||++ |.+.+|.++++.+++ ++.+|+++||+..|+|| T Consensus 308 G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~--~~~~~~v~~~a~~r~d~ 385 (401) T protein:vir:44 308 GNYLWRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDP--YTNKPFVGFYTTKRTGG 385 (401) T ss_pred CceeecCCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEeeec--cccCCcEEEEEEEEecc Confidence 999874 2489999999999852 288999987 999999999987654 57899999999999999 Q ss_pred EEecCceEEEEEEEec Q lcl|NC_019921. 353 KAKDNKVAAVWKLDLK 368 (381) Q Consensus 353 k~~~~~Afvv~~~~~~ 368 (381) ++++++||++++++-+ T Consensus 386 ~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 386 MLVDSQAIKLLKIAAA 401 (401) T ss_pred EEecccceEEEEeecC Confidence 9999999999877766 No 12 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=3.6e-65 Score=373.91 Aligned_cols=355 Identities=14% Similarity=0.127 Sum_probs=246.1 Q ss_pred Cc---hhHHHHHHHHHHHHHHHHhh-hh-hHHHHHHHHHHH------------HHHHHHHHHH-HHHHHHHHHH--Hh-- Q lcl|NC_019921. 1 MT---INLSETFANAKNEFINAVNN-GE-PQERQNELYGDM------------INQLFEETKL-QAKAEAERVS--SL-- 58 (381) Q Consensus 1 mt---~el~~~~~~~~~~~~~~~k~-~~-~~~~~~~~~~~~------------~~~~~~~~~~-~~~~~~~~~~--~~-- 58 (381) |. +|++++..+..+++.+.++. .+ ..+++.+.++.. .+.+..+... +...+..... .. T Consensus 21 ~~~~l~e~ra~~~~e~~~l~~~~~~~~~~~k~~~~~~~~~~~~~~~~~e~~~~~~~~~~ei~~~~~~~~~~~~~~~~~~~ 100 (425) T protein:vir:10 21 VPRGIISVRAEGPTEVKALIENLQKAFHDFKAEHTKQLDAVKAGLPTSDALAKVDKVSADLEALQAAVDEANIKIAAAQM 100 (425) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 21 22232222222222222111 00 001111111100 0011101000 0000000000 00 Q ss_pred -hcccc-ccCHHHHHHH----------HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEec Q lcl|NC_019921. 59 -PKSAQ-SLSANQRSFF----------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSE 125 (381) Q Consensus 59 -~~~~~-~lt~~e~~~~----------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~~~~p~~~ 125 (381) ..... ..+.+.++.| ++++.+++++||++||+++++.|++.+++.++|+++|++++++ +..++|+.. T Consensus 101 ~~~~~~~~~~~~~~~af~~~l~~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~ 180 (425) T protein:vir:10 101 GANGVKPLRDPEYTEAFKAHVKRGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNM 180 (425) T ss_pred ccccccccccHHHHHHHHHHhhhhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEc Confidence 00111 1222333333 3456778899999999999999999999999999999999976 568999999 Q ss_pred CCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcce Q lcl|NC_019921. 126 TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI 205 (381) Q Consensus 126 ~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~ 205 (381) +.+.+.|++|++..+....++|+++++.+|+++++++||+|||+||.+++++||.+++++++++++|.+|++|+|+++|. T Consensus 181 ~~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~ 260 (425) T protein:vir:10 181 GGTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPN 260 (425) T ss_pred CCcceeeeccccccccccccccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcc Confidence 99999999998887765668999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC Q lcl|NC_019921. 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN 285 (381) Q Consensus 206 Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~ 285 (381) ||++.++............. ....+.......++.+.++...+ +..|+++++|+||+.++..+.. ++| T Consensus 261 Gil~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~d~l~~l~~~l-------~~~~~~~a~~vmn~~~~~~L~~---lkD 328 (425) T protein:vir:10 261 GLLTYIAGGANAAKHPFGAI--EVVNSGAAADITSDGIIDLVYDL-------PSAFTGNARFAMNRNTQRQVRK---LKD 328 (425) T ss_pred eeeecccccccccccccccc--ccccccccccccHHHHHHHHhhh-------hhhhccCCEEEEchHHHHHHHH---hhc Confidence 99987765443322111111 11112223344566677766543 4578899999999999887765 478 Q ss_pred CCCccccc---------cCCCceeEecCCCCC-----CcEEEEeecc-eEEEeecceEEEeehhhhhhcCceEEEEEEEE Q lcl|NC_019921. 286 ANGVYVTA---------LPFNLNVIESTVQEA-----GKVLTYVKGL-YDGYLAGGINVQKFKETLALDDMDLYTAKQFA 350 (381) Q Consensus 286 ~~G~~~~~---------l~~G~pVv~s~~~p~-----~~i~fgd~~~-y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~ 350 (381) .+|+|+|. ..+|+||+++++||. ..|+||||++ |.+++|.++++.. +.++.+|+++||+..|+ T Consensus 329 ~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~--d~~~~~~~~~~~~~~r~ 406 (425) T protein:vir:10 329 GQGNYLWQPSYVAGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVLR--DPYTAKPYVLFYTTKRV 406 (425) T ss_pred CCCceeeccCccCCCCceecceeeEEecCcCCccCCccEEEEEehhccEEEEEecceEEEe--cccccCCcEEEEEEEEe Confidence 99999874 347999999999995 2389999998 8999999988754 55688999999999999 Q ss_pred cCEEecCceEEEEEEEecC Q lcl|NC_019921. 351 YGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 351 dGk~~~~~Afvv~~~~~~~ 369 (381) ||++++++||++++++-+. T Consensus 407 d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 407 GGGLLNPEPMRAMKVAASE 425 (425) T ss_pred ccEeecccceEEEEeeccC Confidence 9999999999986666555 No 13 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=1.3e-64 Score=370.83 Aligned_cols=357 Identities=12% Similarity=0.085 Sum_probs=250.1 Q ss_pred Cch-h-HHHHHHHHHHHHHHHHhhhhh-----HHHHHHHHHHHHHHHHHHHHHH------------------------HH Q lcl|NC_019921. 1 MTI-N-LSETFANAKNEFINAVNNGEP-----QERQNELYGDMINQLFEETKLQ------------------------AK 49 (381) Q Consensus 1 mt~-e-l~~~~~~~~~~~~~~~k~~~~-----~~~~~~~~~~~~~~~~~~~~~~------------------------~~ 49 (381) |+- + +++..++.++++.+ +++... .+++........+.+....... .. T Consensus 1 l~~~k~l~~~i~e~~~~~~~-~k~~~~~~~~~~e~~~~~l~~~~e~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (407) T protein:vir:48 1 MADVKDVEQVAQELQRKFDD-FKEKNDKRIDAIEQEKGKLAGEVETLNGKLAELENLKSDLEAELAEVKRPAGGTQNKVA 79 (407) T ss_pred CchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchh Confidence 331 1 22222222222211 111100 0011111111111111110000 00 Q ss_pred HHHHHHHHhh--c-cccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEec Q lcl|NC_019921. 50 AEAERVSSLP--K-SAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSE 125 (381) Q Consensus 50 ~~~~~~~~~~--~-~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~ 125 (381) .++.+++... + ....++..|++ ++..+++++||++||++++++|++.++++++|+++|+++++++ ..++|+.. T Consensus 80 ~e~~~a~~~~l~~g~~~~~~~~e~~---a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 156 (407) T protein:vir:48 80 SEHKEAFIGFMRKGREDGLRELERK---ALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNL 156 (407) T ss_pred hHHHHHHHHHHhccchhhhhHHHHH---hhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEec Confidence 1111111110 0 11122233332 4456778899999999999999999999999999999999754 68999999 Q ss_pred CCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcce Q lcl|NC_019921. 126 TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI 205 (381) Q Consensus 126 ~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~ 205 (381) +.+.+.|++|++..+....++|+++++.+|+++++++||+|||+||.+|+++||.++|+++++++++.+|++|+|+++|+ T Consensus 157 ~~~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~ 236 (407) T protein:vir:48 157 GGTTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPK 236 (407) T ss_pred CCcceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccc Confidence 99999999998887766679999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC Q lcl|NC_019921. 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN 285 (381) Q Consensus 206 Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~ 285 (381) ||++................. ...+......+++.+.++.+.+ +..|+++++|+||+.++..+.. ++| T Consensus 237 Gil~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~d~i~~l~~~l-------~~~~~~~a~~v~n~~~~~~L~~---lkD 304 (407) T protein:vir:48 237 GFLAYESTDEDDKTRAFGKLQ--HIASGAASGVTADAIIKLIYTL-------RKAHRSGAKFMMNNSSLFAIRL---LKD 304 (407) T ss_pred eeeeccccccccccccccccc--ccccccccccChHHHHHHHHhh-------chhhhcCCEEEEcHHHHHHHHH---hhc Confidence 999765543322211111111 1112223344567777776654 4568899999999999877654 478 Q ss_pred CCCccccc---------cCCCceeEecCCCCC-----CcEEEEeecc-eEEEeecceEEEeehhhhhhcCceEEEEEEEE Q lcl|NC_019921. 286 ANGVYVTA---------LPFNLNVIESTVQEA-----GKVLTYVKGL-YDGYLAGGINVQKFKETLALDDMDLYTAKQFA 350 (381) Q Consensus 286 ~~G~~~~~---------l~~G~pVv~s~~~p~-----~~i~fgd~~~-y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~ 350 (381) .+|+|+|+ ..+|+||+++++||+ ..|+||||++ |.+++|.+++|.+++ ++.+|+++||+..|+ T Consensus 305 ~~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~--~~~~~~~~~~~~~r~ 382 (407) T protein:vir:48 305 NDGNYLWRPGIELGQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDP--YTNKPFVGFYTTKRT 382 (407) T ss_pred cCCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEeec--cccCCcEEEEEEEEe Confidence 99999874 248999999999996 2388999987 999999999987754 577899999999999 Q ss_pred cCEEecCceEEEEEEEecCCccccc Q lcl|NC_019921. 351 YGKAKDNKVAAVWKLDLKGHKPALE 375 (381) Q Consensus 351 dGk~~~~~Afvv~~~~~~~~~~~~~ 375 (381) ||++++++||+++++.-+..+.+.- T Consensus 383 d~~v~~~~a~~~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 383 GGMLVDSQAIKLMKIGAATRQKAAA 407 (407) T ss_pred ccEEecccceEEEEeeccCCCCCCC Confidence 9999999999998887777775555 No 14 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=9.1e-65 Score=371.69 Aligned_cols=346 Identities=16% Similarity=0.082 Sum_probs=248.0 Q ss_pred CchhHHHHHHHHHHHHHHHHhh-------hhhHHHHHHHHH---HHHHHHHHHHHHHHHHH----HHHHHHhhccc---- Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNN-------GEPQERQNELYG---DMINQLFEETKLQAKAE----AERVSSLPKSA---- 62 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~-------~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~----~~~~~~~~~~~---- 62 (381) |...+.+++.++++++...++. .+..+++.+.++ ...+.+.++..+..... ........... T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~l~~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~ 80 (392) T protein:vir:13 1 MDATTLSANFEARERATAELRSLTDEFAGKEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVTSLLSGLQGSGSG 80 (392) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccc Confidence 7766666666666554433332 111122222222 22223322222111110 00000000000 Q ss_pred --cccC-----------HHHHHHH---HHHhhccCCCCceeccHHHHHHHHHHH-HhhhhhhhhceeEecCC--ceEEEE Q lcl|NC_019921. 63 --QSLS-----------ANQRSFF---MDINKNVNYKEEKLLPEETIDRIFEDL-TTNHPLLADLGIKNAGL--RLKFLK 123 (381) Q Consensus 63 --~~lt-----------~~e~~~~---~~~~~~~~~~gg~lvP~~~~~~I~~~l-~~~~~l~~~~~v~~~~g--~~~~p~ 123 (381) +... ..+++.+ .....++.+++|.++|+++.+.++..+ ...++++++++++++++ .+.+|+ T Consensus 81 ~~~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 160 (392) T protein:vir:13 81 AQRSADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTV 160 (392) T ss_pred hhhhhhHHHHHHHhccchhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEE Confidence 0000 1111111 122334556667777877877777665 44567788899888743 478999 Q ss_pred ecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCc Q lcl|NC_019921. 124 SETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQ 203 (381) Q Consensus 124 ~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~ 203 (381) .++.+.+.|++|+++.+ +++++|+++++.+||++++++||++||+||.+|+++||.++|+++++++++.+|++|+|+++ T Consensus 161 ~~~~~~a~~v~E~~~~~-~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~ 239 (392) T protein:vir:13 161 ITGRATAGIVGETAEIP-ESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQ 239 (392) T ss_pred EcCCcceeeeccccccc-ccccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCcc Confidence 99999999999988875 57899999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhc Q lcl|NC_019921. 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 204 P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~ 283 (381) |+||++..+....... +.......++.+.+++..+ +..|+++++|+||+.++..+.. + T Consensus 240 p~Gil~~~~~~~~~~~------------~~~~~~~~~d~l~~~~~~l-------~~~~~~~a~~v~n~~~~~~l~~---l 297 (392) T protein:vir:13 240 PRGILTDATGANAAFG------------EADADSKVSDALIDLFHEV-------PSAYRKNAKFVVNDLRAAQMRK---L 297 (392) T ss_pred cccccccccccccccc------------ccccccccHHHHHHHHHhh-------hhhhhcCCEEEEcHHHHHHHHH---h Confidence 9999986543222111 0011123455666665543 4468889999999999887764 4 Q ss_pred cCCCCcccccc---------CCCceeEecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEE Q lcl|NC_019921. 284 LNANGVYVTAL---------PFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKA 354 (381) Q Consensus 284 ~~~~G~~~~~l---------~~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~ 354 (381) ++.+|+|+|.. .+|+||+++++||++.|+||||++|++++|++++++++.+.+|.+|+++||++.|+||++ T Consensus 298 kd~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~~i~~Gdf~~~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~~~ 377 (392) T protein:vir:13 298 KDANGQYLWQSALTVGAPDTFNGKVVETDDGMPADKVLFADLSKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLL 377 (392) T ss_pred hccCCceeecCCcCCCCCceecceeeEEcCCCCCCcEEEeeccceeEEeecceEEEeeccccccCCcEEEEEEEEeccEE Confidence 88999998753 479999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCceEEEEEEEecC Q lcl|NC_019921. 355 KDNKVAAVWKLDLKG 369 (381) Q Consensus 355 ~~~~Afvv~~~~~~~ 369 (381) ++++||++++++-++ T Consensus 378 ~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 378 VDARGAKVLTVTPAA 392 (392) T ss_pred ecccceEEEEeeccC Confidence 999999999998877 No 15 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=9.3e-64 Score=366.15 Aligned_cols=344 Identities=17% Similarity=0.098 Sum_probs=237.6 Q ss_pred Cchh-HHHHHHHHHH---HHHHHHhhhhhHHHHHHHHHHH---HHHHHHHHHHHHHH----HHHHHHHhhccc------c Q lcl|NC_019921. 1 MTIN-LSETFANAKN---EFINAVNNGEPQERQNELYGDM---INQLFEETKLQAKA----EAERVSSLPKSA------Q 63 (381) Q Consensus 1 mt~e-l~~~~~~~~~---~~~~~~k~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~----~~~~~~~~~~~~------~ 63 (381) |+|+ |+++..++.. ++.+..++.+..+++.+.++.+ .+.+.+++...... +..+........ + T Consensus 4 ~~l~~l~e~r~~~~~e~~~L~~~~~~~~lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (390) T protein:vir:62 4 TTLSANFEARERATAELRTLTDEFAGKEMTDEAREKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSGSGAQR 83 (390) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchh Confidence 5544 3233322222 2222222212222333333332 22222222211111 011111110000 0 Q ss_pred ccCHH-----------HHHHHH---HHhhccCCCCceeccHHHHH-HHHHHHHhhhhhhhhceeEecCC--ceEEEEecC Q lcl|NC_019921. 64 SLSAN-----------QRSFFM---DINKNVNYKEEKLLPEETID-RIFEDLTTNHPLLADLGIKNAGL--RLKFLKSET 126 (381) Q Consensus 64 ~lt~~-----------e~~~~~---~~~~~~~~~gg~lvP~~~~~-~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~ 126 (381) ....+ +++.+. ....++.+++|.++|+++.+ .|++.++..++|+++|+++++++ .+++|+.++ T Consensus 84 ~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~ 163 (390) T protein:vir:62 84 SADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITG 163 (390) T ss_pred hcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcC Confidence 01111 111111 11223444445555555554 45566777778899999998753 478999999 Q ss_pred CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceE Q lcl|NC_019921. 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIG 206 (381) Q Consensus 127 ~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~G 206 (381) .+.+.|++|+++++ +++++|+++++.+|+++++++||+|||+||.+|+++||+++++++|++++|.+|++|+| +|+| T Consensus 164 ~~~a~wv~E~~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G--~p~G 240 (390) T protein:vir:62 164 RSSASIVGETAEIP-ESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTG--QPRG 240 (390) T ss_pred Ccceeeeccccccc-ccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCC--cccc Confidence 99999999988876 57899999999999999999999999999999999999999999999999999999987 7999 Q ss_pred eeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCC Q lcl|NC_019921. 207 LNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA 286 (381) Q Consensus 207 il~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~ 286 (381) |++.......... .......+++.+.++++.+ ...|+++++|+||+.++..+.. +++. T Consensus 241 i~~~~~~~~~~~~------------~~~~~~~~~~~l~~~~~~l-------~~~~~~~a~~vmn~~~~~~L~~---lkd~ 298 (390) T protein:vir:62 241 ILTDASPATATFL------------ATDTDSKVSDALIDLFHEV-------PSAYRANAKYVVNDLRAAQMRK---LKDA 298 (390) T ss_pred cccccccccccee------------cccccccchHHHHHHHHhh-------hhhhhcCCEEEEchHHHHHHHH---hhcc Confidence 9986543221110 0011123455666665544 3467889999999999877764 4789 Q ss_pred CCcccccc---------CCCceeEecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecC Q lcl|NC_019921. 287 NGVYVTAL---------PFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDN 357 (381) Q Consensus 287 ~G~~~~~l---------~~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~ 357 (381) +|+|+|+. .+|+||+++++||++.|+||||++|+++++++++++++.|.+|.+|+++||+..|+||+++++ T Consensus 299 ~g~~l~~~~~~~g~~~~l~G~Pv~~~~~~p~~~i~~gd~s~~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~ 378 (390) T protein:vir:62 299 NGQYLWQSGLTVGAPSLFNGKVVETDDGMPADKILFADLSKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDA 378 (390) T ss_pred CCCeeecCCcCCCccceecccceEEecCCCCccEEEeeccceeEEeecceEEEeeccccccCCcEEEEEEEEeCcEeech Confidence 99999852 479999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEEEecC Q lcl|NC_019921. 358 KVAAVWKLDLKG 369 (381) Q Consensus 358 ~Afvv~~~~~~~ 369 (381) +||++++++-++ T Consensus 379 ~A~~~l~~~~~a 390 (390) T protein:vir:62 379 RGAKVLTVTPGA 390 (390) T ss_pred hheEEEEeecCC Confidence 999998888776 No 16 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=3.3e-62 Score=357.61 Aligned_cols=346 Identities=14% Similarity=0.055 Sum_probs=241.9 Q ss_pred CchhH-HHHHHHHHHHHHHHHh----hhhhHHHH-------HHHHHHHHHHHHHHHHH-------HHH------------ Q lcl|NC_019921. 1 MTINL-SETFANAKNEFINAVN----NGEPQERQ-------NELYGDMINQLFEETKL-------QAK------------ 49 (381) Q Consensus 1 mt~el-~~~~~~~~~~~~~~~k----~~~~~~~~-------~~~~~~~~~~~~~~~~~-------~~~------------ 49 (381) |+|+. .++...+.++....++ ..+...++ .+.+....+.+.++... +.+ T Consensus 1 M~l~el~~~~~~~~~~~~a~l~~~~~~~~~~~ee~~~~~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~~~~~~~~ 80 (434) T protein:vir:62 1 MNLKEILNASLTRTKSRLAELQGKVEKNEVRSEELAAVKAEVEQLTKEIQTISEELAKLEEKEKEEDPAKKKDDDPEKKE 80 (434) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhhhhc Confidence 88763 3334433333322211 11111111 11111111111111100 000 Q ss_pred ---------------HHHHHHHHh-------hccc-cccCHHHHHHHHHHh------------hccCCCCceeccHHHHH Q lcl|NC_019921. 50 ---------------AEAERVSSL-------PKSA-QSLSANQRSFFMDIN------------KNVNYKEEKLLPEETID 94 (381) Q Consensus 50 ---------------~~~~~~~~~-------~~~~-~~lt~~e~~~~~~~~------------~~~~~~gg~lvP~~~~~ 94 (381) .+....... .++. .....+++++|.... ..++++||++||+++++ T Consensus 81 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~GG~lvP~~~~~ 160 (434) T protein:vir:62 81 DPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGLVTGNGSVTIPDFLSK 160 (434) T ss_pred chhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhcccccccceecchhhHH Confidence 000000000 0000 011223444443211 12346799999999999 Q ss_pred HHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceEEeecc--cccccccCcceeeEeecceeEEEeeeccHHhhhcCH Q lcl|NC_019921. 95 RIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIY--GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP 172 (381) Q Consensus 95 ~I~~~l~~~~~l~~~~~v~~~~g~~~~p~~~~~~~a~wv~e~--~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~ 172 (381) .|++.++++++|+++|+++++++++++|+....+.+.|..+. +...++++++|++|++.+|+++++++||++||+||. T Consensus 161 ~Ii~~l~~~~~i~~~~~~~~~~~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~ 240 (434) T protein:vir:62 161 EIITYAQEENFLRRLGTGVKTKENIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLARTG 240 (434) T ss_pred HHHHhhhhhhhhhhhcceeccCCceEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeehhhHHHHHhcch Confidence 999999999999999999999999999998877777776432 233446899999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhheeeccCCCcc-eEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhh Q lcl|NC_019921. 173 AWIERFVRVQIEEAFAVALETAFLKGTGKDQP-IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHS 251 (381) Q Consensus 173 ~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P-~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~ 251 (381) +||++||.++|+++++++++.+||+|+|+++| .|+++...... ......+++.+.++...+ T Consensus 241 ~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~-----------------~~~~~~~~d~l~~l~~~l- 302 (434) T protein:vir:62 241 LPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEF-----------------KTDEKNLYDALVKMKNTP- 302 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccc-----------------cccccchhhHHHHHHhhc- Confidence 99999999999999999999999999999875 56664321110 011123456666665544 Q ss_pred hccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc-----------cCCCceeEecCCCCCCc------EEEE Q lcl|NC_019921. 252 TNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA-----------LPFNLNVIESTVQEAGK------VLTY 314 (381) Q Consensus 252 ~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~-----------l~~G~pVv~s~~~p~~~------i~fg 314 (381) ...|++|++|+||+.++..++. ++|.+|+|+|+ ..+|+||+.+++||.+. |+|| T Consensus 303 ------~~~~~~~a~~v~n~~~~~~L~~---lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~G 373 (434) T protein:vir:62 303 ------VKEVRKKARWVLNTAALTKIET---MKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFG 373 (434) T ss_pred ------chhhhcCCEEEEcHHHHHHHHH---hhccCCCEeeccCCCccCCCCceecceeeEEecCccCccCCCceEEEEe Confidence 4568899999999999887765 47899999984 24799999999998643 8899 Q ss_pred eecceEEEeec-ceEEEeehhhhhhcCceEEEEEEEEcCEEec-CceEEEEEEEecCCccc Q lcl|NC_019921. 315 VKGLYDGYLAG-GINVQKFKETLALDDMDLYTAKQFAYGKAKD-NKVAAVWKLDLKGHKPA 373 (381) Q Consensus 315 d~~~y~i~~r~-~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~-~~Afvv~~~~~~~~~~~ 373 (381) |||+|+|++|. +++++++++.+|.+|+++||++.|+|||++. |++++++++.++.++.+ T Consensus 374 dfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 374 DFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred eccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEeccCCCC Confidence 99999999885 5889999999999999999999999999987 99999999998777777 No 17 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=7.1e-61 Score=350.34 Aligned_cols=345 Identities=14% Similarity=0.096 Sum_probs=249.5 Q ss_pred Cchh-HHHHHHHHHHHHHHHHh---hhhhHHHHHHHHHHH---HHHHHHHHHHHH------------------------- Q lcl|NC_019921. 1 MTIN-LSETFANAKNEFINAVN---NGEPQERQNELYGDM---INQLFEETKLQA------------------------- 48 (381) Q Consensus 1 mt~e-l~~~~~~~~~~~~~~~k---~~~~~~~~~~~~~~~---~~~~~~~~~~~~------------------------- 48 (381) |+|+ |+++.+++.+++.+.+. +....+++.+.++.. ++.+.++..... T Consensus 1 M~l~eL~e~r~~l~~e~~~l~~k~~~~~~t~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (409) T protein:vir:45 1 MKLHELKQKRNTIATDMRALNEKIGDNAWTEEQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDPEN 80 (409) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccCCCCC Confidence 9987 45555554444333221 111112222222221 122111111000 Q ss_pred ---HHH-HHHHHH--hhccccccCHHHHHHHHHH---hhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc- Q lcl|NC_019921. 49 ---KAE-AERVSS--LPKSAQSLSANQRSFFMDI---NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR- 118 (381) Q Consensus 49 ---~~~-~~~~~~--~~~~~~~lt~~e~~~~~~~---~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~- 118 (381) ..+ ..+++. ...+...++.+|++.+.+. ..+++.+||++||+++.++|++.+++.+||+++|+++++++. T Consensus 81 ~~~~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 160 (409) T protein:vir:45 81 NSQQDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGR 160 (409) T ss_pred cchhhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCc Confidence 000 111111 1123456777888876544 446778899999999999999999999999999999998653 Q ss_pred -eEEEEecC-CcceEEeecccccccccCcceeeEeecceeEE-EeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhhe Q lcl|NC_019921. 119 -LKFLKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLT-AFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF 195 (381) Q Consensus 119 -~~~p~~~~-~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~-~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~ 195 (381) ..+|...+ ...+.|++|++..+ +++++|++++|.++|++ ++++||++||+||.+|+++||.++|+++++++++.+| T Consensus 161 ~~~~~~~~~~~~~~~~v~E~~~~~-~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~ 239 (409) T protein:vir:45 161 TMEWATADGTSEVGVLLGENEEAG-EEDTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYL 239 (409) T ss_pred eEEEEeeccCcccccccccccccc-ccccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 45566554 35678999987765 68899999999999996 5789999999999999999999999999999999999 Q ss_pred eeccCCC---cceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEE--EEc Q lcl|NC_019921. 196 LKGTGKD---QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTM--VVN 270 (381) Q Consensus 196 i~G~G~~---~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~--~mn 270 (381) |+|+|++ +|+||++.++...... .....+++.+.++...+ +..|+.++.| +|| T Consensus 240 l~G~G~~~~~~p~Gil~~~~~~~~~~---------------~~~~~~~d~i~~l~~~l-------~~~~~~~a~~~~~~n 297 (409) T protein:vir:45 240 IQGTGAGTPKQPKGLAASVTGTTQTA---------------AANAVKWQEILALKHSI-------DPAYRRGPKFRLAFN 297 (409) T ss_pred hccCCCCCccccceeeeccccccccc---------------cccccchHHHHHHHHhh-------hhhhccCCeEEEEEC Confidence 9999975 8999997654322111 11123345566665544 4456777765 679 Q ss_pred hhhHHHHhhhhhccCCCCccccc---------cCCCceeEecCCCCC-----CcEEEEeecceEEEeecceEEEeehhhh Q lcl|NC_019921. 271 PSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEA-----GKVLTYVKGLYDGYLAGGINVQKFKETL 336 (381) Q Consensus 271 ~~t~~~~~~~~~~~~~~G~~~~~---------l~~G~pVv~s~~~p~-----~~i~fgd~~~y~i~~r~~i~i~~~~~~~ 336 (381) +.++..+.. ++|.+|+|+|. ..+|+||+++++||+ ..|+||||++|++++++++.++.+++.+ T Consensus 298 ~~~~~~l~~---lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~~~~~~~d~~ 374 (409) T protein:vir:45 298 DNTLKLISE---MEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVERY 374 (409) T ss_pred HHHHHHHHH---hhcCCCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhhhheeeccceEEEEeeccc Confidence 999877654 47899999874 348999999999995 2388999999999999999999999999 Q ss_pred hhcCceEEEEEEEEcCEEecCceEEEEEEEecCCc Q lcl|NC_019921. 337 ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 337 ~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~ 371 (381) |.+|+++||+..|+||++++++||++++++-++.. T Consensus 375 ~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 375 AEYDQTGFLAFHRFDCILEDTSAIKALVGKGSVGG 409 (409) T ss_pred ccCCcEEEEEEEEeccEeechhheEEEEeccCCCC Confidence 99999999999999999999999998777665554 No 18 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=9.1e-61 Score=349.74 Aligned_cols=345 Identities=11% Similarity=0.015 Sum_probs=241.3 Q ss_pred CchhH-HHHHHHHHHHH---HHHHhhh--hhHHHHHHHHHHH---HHHHHHHHHHHH-H-----------------HHHH Q lcl|NC_019921. 1 MTINL-SETFANAKNEF---INAVNNG--EPQERQNELYGDM---INQLFEETKLQA-K-----------------AEAE 53 (381) Q Consensus 1 mt~el-~~~~~~~~~~~---~~~~k~~--~~~~~~~~~~~~~---~~~~~~~~~~~~-~-----------------~~~~ 53 (381) |+++. +.+......++ .+.+.+. +...+..+.++.+ .+.+.....+.. + .... T Consensus 143 ~~l~e~~~~~~~~~~e~k~~~e~~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~~~~~~~~~~~~~~ 222 (543) T protein:vir:81 143 DSIEDCRFRDPWNLSEMRTFGRDAEEVKGELRARALSAIEKMQGASDNVRAAATKIIERFDDEDSTLARQCLATSSPAYL 222 (543) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhh Confidence 65542 22222221111 1111110 0000111111111 111111000000 0 0000 Q ss_pred HH---HHhhccccccCHHHHHHHHHHh--hccCCCCceeccHHHHHHHH-HHHHhhhhhhhhceeEecCCceEEEEecCC Q lcl|NC_019921. 54 RV---SSLPKSAQSLSANQRSFFMDIN--KNVNYKEEKLLPEETIDRIF-EDLTTNHPLLADLGIKNAGLRLKFLKSETS 127 (381) Q Consensus 54 ~~---~~~~~~~~~lt~~e~~~~~~~~--~~~~~~gg~lvP~~~~~~I~-~~l~~~~~l~~~~~v~~~~g~~~~p~~~~~ 127 (381) ++ .........++..+++.+.... ..++++||++||++++..|+ +.++..+||++++++.+++|.+.+|+.++. T Consensus 223 ~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~~g~~~~~~~~~~ 302 (543) T protein:vir:81 223 RAWSKMARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVATGDVWHGVSSAA 302 (543) T ss_pred hHHHHHHHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccCCcceEEEEecCC Confidence 00 0011112234444555444332 23567899999999998876 557888999999999999999999999999 Q ss_pred cceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCC-cceE Q lcl|NC_019921. 128 GVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD-QPIG 206 (381) Q Consensus 128 ~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~-~P~G 206 (381) +.+.|++|++..+ +++++|+++++.+++++++++||++||+|+ +|+++||.++|+++++++++.+|++|+|++ +|.| T Consensus 303 ~~a~~v~Eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~G 380 (543) T protein:vir:81 303 VQWSWDAEFEEVS-DDSPEFGQPEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTG 380 (543) T ss_pred cceeecccCcccc-ccccccceeeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccc Confidence 9999999988775 689999999999999999999999999998 699999999999999999999999999985 9999 Q ss_pred eeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCC Q lcl|NC_019921. 207 LNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA 286 (381) Q Consensus 207 il~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~ 286 (381) |++......... .+......+.+.+.++...+ +..|+.+++|+||+.++..+.. +++. T Consensus 381 i~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~l-------~~~~~~~~~~v~n~~~~~~l~~---lkd~ 438 (543) T protein:vir:81 381 IVTALAGTAAEI------------APVTAETFALADVYAVYEQL-------AARHRRQGAWLANNLIYNKIRQ---FDTQ 438 (543) T ss_pred chhhcccccccc------------cccccccccHHHHHHHHHhh-------hccccCCcEEEEcHHHHHHHHH---hhcC Confidence 997643322110 01112223455566655443 4567888999999999887765 4788 Q ss_pred CCcccccc--------CCCceeEecCCCCCCc----------EEEEeecceEEEeecceEEEeehhhh----hhcCceEE Q lcl|NC_019921. 287 NGVYVTAL--------PFNLNVIESTVQEAGK----------VLTYVKGLYDGYLAGGINVQKFKETL----ALDDMDLY 344 (381) Q Consensus 287 ~G~~~~~l--------~~G~pVv~s~~~p~~~----------i~fgd~~~y~i~~r~~i~i~~~~~~~----~~~d~~~~ 344 (381) +|.|+|.. .+|+||+.+++||.+. |+||||++|+|+++++++|.++++.+ |.+++++| T Consensus 439 ~G~~l~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~ 518 (543) T protein:vir:81 439 GGAGLWTTIGNGEPSQLLGRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGTNRRPNGSRGW 518 (543) T ss_pred CCceeccCcCCCCCccccceeeEEeccccccccccccCCcceEEEeeccceeEEeecccEEEEeccccccchhhcCceEE Confidence 99998752 4799999999998642 89999999999999999999998765 45679999 Q ss_pred EEEEEEcCEEecCceEEEEEEEecC Q lcl|NC_019921. 345 TAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 345 r~~~r~dGk~~~~~Afvv~~~~~~~ 369 (381) |++.|+||++++++||++++++.++ T Consensus 519 ~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 519 FAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred EEEEeeccEeecccceEEEEecccC Confidence 9999999999999999998888877 No 19 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=3.5e-60 Score=346.51 Aligned_cols=364 Identities=13% Similarity=0.056 Sum_probs=227.9 Q ss_pred Cch---hHHHHHHHHH---HHHHHHH-----------hhhh-------hHHHHHHH---HHHHHHHHHHHHHHH-H---- Q lcl|NC_019921. 1 MTI---NLSETFANAK---NEFINAV-----------NNGE-------PQERQNEL---YGDMINQLFEETKLQ-A---- 48 (381) Q Consensus 1 mt~---el~~~~~~~~---~~~~~~~-----------k~~~-------~~~~~~~~---~~~~~~~~~~~~~~~-~---- 48 (381) |+- ++.+++.+.+ .++...+ +..+ ..+++.+. .....+.+....... . T Consensus 7 l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~e~~~~~ 86 (497) T protein:vir:10 7 LEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLK 86 (497) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 110 1111111100 0000000 0000 00000000 000001110000000 0 Q ss_pred -------HH--HH---HHHHHhhcccc-------c-cC-------HHHHH----------HHHHHhhccCCCCceeccHH Q lcl|NC_019921. 49 -------KA--EA---ERVSSLPKSAQ-------S-LS-------ANQRS----------FFMDINKNVNYKEEKLLPEE 91 (381) Q Consensus 49 -------~~--~~---~~~~~~~~~~~-------~-lt-------~~e~~----------~~~~~~~~~~~~gg~lvP~~ 91 (381) +. .. .+........+ . .. .+.+. ....+..+++++||++||++ T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~ 166 (497) T protein:vir:10 87 QIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPT 166 (497) T ss_pred hHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchh Confidence 00 00 00000000000 0 00 00000 11223445677899999999 Q ss_pred HHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecC-CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhh Q lcl|NC_019921. 92 TIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLND 169 (381) Q Consensus 92 ~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~-~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ 169 (381) +...|++.+++.++|+++|+++++++ .+++|+.++ .+.+.|++|++..+ +++++|++|++.+||++++++||+|||+ T Consensus 167 ~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~-~s~~~f~~i~~~~~k~a~~~~iS~ell~ 245 (497) T protein:vir:10 167 FLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTITDEGLR 245 (497) T ss_pred hhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccc-cccccceeeEeeeeeeEeecHhHHHHHH Confidence 99999999999999999999998765 589999765 46899999988765 6889999999999999999999999999 Q ss_pred cCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeee----------eeecc------ Q lcl|NC_019921. 170 FGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQ----------GTLTF------ 233 (381) Q Consensus 170 ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~----------~~~t~------ 233 (381) |++ ++++||.++|+++|++++|.+||+|+|+++|+||++..+................ ++.+. T Consensus 246 d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (497) T protein:vir:10 246 DAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDT 324 (497) T ss_pred hHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhH Confidence 986 6999999999999999999999999999999999987654433322211110000 00000 Q ss_pred ---------------------cccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCcccc Q lcl|NC_019921. 234 ---------------------ANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVT 292 (381) Q Consensus 234 ---------------------~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~ 292 (381) ....+..+.+..+...+... ....++....|+||+.++..++. ++|.+|+|+| T Consensus 325 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~vmn~~~~~~l~~---lkd~~G~~i~ 398 (497) T protein:vir:10 325 VASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDI---QLTLFQTPNAVVMNPRDWELLRL---TKDANGQYMG 398 (497) T ss_pred HHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhh---hhhcccCCCeEEEchHHHHHHHH---hhcCCCceec Confidence 00001111111111111111 11233444469999999888765 4889999988 Q ss_pred cc---------------CCCceeEecCCCCCCcEEEEeecc--eEEEeecceEEEeehh--hhhhcCceEEEEEEEEcCE Q lcl|NC_019921. 293 AL---------------PFNLNVIESTVQEAGKVLTYVKGL--YDGYLAGGINVQKFKE--TLALDDMDLYTAKQFAYGK 353 (381) Q Consensus 293 ~l---------------~~G~pVv~s~~~p~~~i~fgd~~~--y~i~~r~~i~i~~~~~--~~~~~d~~~~r~~~r~dGk 353 (381) +. .||+||+++++||+++++||||++ |.|++|++++|.++++ .+|.+|+++||+..|+||. T Consensus 399 ~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~ 478 (497) T protein:vir:10 399 GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLL 478 (497) T ss_pred cCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecce Confidence 53 369999999999999999999987 5689999999999987 4599999999999999999 Q ss_pred EecCceEEEEEEEecCCcccc Q lcl|NC_019921. 354 AKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 354 ~~~~~Afvv~~~~~~~~~~~~ 374 (381) +++|+|||+++++- ...++ T Consensus 479 v~~p~A~~~l~~~~--~~~~~ 497 (497) T protein:vir:10 479 VYRPSAFQLIQLKK--GATGS 497 (497) T ss_pred eeccccEEEEEecC--CccCC Confidence 99999999876643 33333 No 20 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=3.5e-60 Score=346.51 Aligned_cols=364 Identities=13% Similarity=0.056 Sum_probs=227.9 Q ss_pred Cch---hHHHHHHHHH---HHHHHHH-----------hhhh-------hHHHHHHH---HHHHHHHHHHHHHHH-H---- Q lcl|NC_019921. 1 MTI---NLSETFANAK---NEFINAV-----------NNGE-------PQERQNEL---YGDMINQLFEETKLQ-A---- 48 (381) Q Consensus 1 mt~---el~~~~~~~~---~~~~~~~-----------k~~~-------~~~~~~~~---~~~~~~~~~~~~~~~-~---- 48 (381) |+- ++.+++.+.+ .++...+ +..+ ..+++.+. .....+.+....... . T Consensus 7 l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~e~~~~~ 86 (497) T protein:vir:78 7 LEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLK 86 (497) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 110 1111111100 0000000 0000 00000000 000001110000000 0 Q ss_pred -------HH--HH---HHHHHhhcccc-------c-cC-------HHHHH----------HHHHHhhccCCCCceeccHH Q lcl|NC_019921. 49 -------KA--EA---ERVSSLPKSAQ-------S-LS-------ANQRS----------FFMDINKNVNYKEEKLLPEE 91 (381) Q Consensus 49 -------~~--~~---~~~~~~~~~~~-------~-lt-------~~e~~----------~~~~~~~~~~~~gg~lvP~~ 91 (381) +. .. .+........+ . .. .+.+. ....+..+++++||++||++ T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~ 166 (497) T protein:vir:78 87 QIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPT 166 (497) T ss_pred hHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchh Confidence 00 00 00000000000 0 00 00000 11223445677899999999 Q ss_pred HHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecC-CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhh Q lcl|NC_019921. 92 TIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLND 169 (381) Q Consensus 92 ~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~-~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ 169 (381) +...|++.+++.++|+++|+++++++ .+++|+.++ .+.+.|++|++..+ +++++|++|++.+||++++++||+|||+ T Consensus 167 ~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~-~s~~~f~~i~~~~~k~a~~~~iS~ell~ 245 (497) T protein:vir:78 167 FLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYP-FSSEEFARVYEQVGKVANALTITDEGLR 245 (497) T ss_pred hhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccc-cccccceeeEeeeeeeEeecHhHHHHHH Confidence 99999999999999999999998765 589999765 46899999988765 6889999999999999999999999999 Q ss_pred cCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeee----------eeecc------ Q lcl|NC_019921. 170 FGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQ----------GTLTF------ 233 (381) Q Consensus 170 ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~----------~~~t~------ 233 (381) |++ ++++||.++|+++|++++|.+||+|+|+++|+||++..+................ ++.+. T Consensus 246 d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (497) T protein:vir:78 246 DAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDT 324 (497) T ss_pred hHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhH Confidence 986 6999999999999999999999999999999999987654433322211110000 00000 Q ss_pred ---------------------cccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCcccc Q lcl|NC_019921. 234 ---------------------ANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVT 292 (381) Q Consensus 234 ---------------------~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~ 292 (381) ....+..+.+..+...+... ....++....|+||+.++..++. ++|.+|+|+| T Consensus 325 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~vmn~~~~~~l~~---lkd~~G~~i~ 398 (497) T protein:vir:78 325 VASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDI---QLTLFQTPNAVVMNPRDWELLRL---TKDANGQYMG 398 (497) T ss_pred HHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhh---hhhcccCCCeEEEchHHHHHHHH---hhcCCCceec Confidence 00001111111111111111 11233444469999999888765 4889999988 Q ss_pred cc---------------CCCceeEecCCCCCCcEEEEeecc--eEEEeecceEEEeehh--hhhhcCceEEEEEEEEcCE Q lcl|NC_019921. 293 AL---------------PFNLNVIESTVQEAGKVLTYVKGL--YDGYLAGGINVQKFKE--TLALDDMDLYTAKQFAYGK 353 (381) Q Consensus 293 ~l---------------~~G~pVv~s~~~p~~~i~fgd~~~--y~i~~r~~i~i~~~~~--~~~~~d~~~~r~~~r~dGk 353 (381) +. .||+||+++++||+++++||||++ |.|++|++++|.++++ .+|.+|+++||+..|+||. T Consensus 399 ~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~ 478 (497) T protein:vir:78 399 GNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLL 478 (497) T ss_pred cCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecce Confidence 53 369999999999999999999987 5689999999999987 4599999999999999999 Q ss_pred EecCceEEEEEEEecCCcccc Q lcl|NC_019921. 354 AKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 354 ~~~~~Afvv~~~~~~~~~~~~ 374 (381) +++|+|||+++++- ...++ T Consensus 479 v~~p~A~~~l~~~~--~~~~~ 497 (497) T protein:vir:78 479 VYRPSAFQLIQLKK--GATGS 497 (497) T ss_pred eeccccEEEEEecC--CccCC Confidence 99999999876643 33333 No 21 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=6.6e-59 Score=339.57 Aligned_cols=350 Identities=14% Similarity=0.079 Sum_probs=238.5 Q ss_pred Cchh-HHHHHHHHHHHHHHHHh---hhh-hHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH-----------hhcc Q lcl|NC_019921. 1 MTIN-LSETFANAKNEFINAVN---NGE-PQERQNELY---GDMINQLFEETKLQAKAEAERVSS-----------LPKS 61 (381) Q Consensus 1 mt~e-l~~~~~~~~~~~~~~~k---~~~-~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~ 61 (381) |+|+ |+++..++..++.+..+ +.. ..+++.+.+ ...++.+..++......+...... .... T Consensus 1 M~i~eL~e~r~~~~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 80 (435) T protein:vir:14 1 MNVNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPVDPNPTAVAAPA 80 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhhhcc Confidence 9998 44444444433333222 110 111222222 223333333222111111000000 0000 Q ss_pred ----------------------------ccccCHHHHH---------HHHHHhhccCCCCceeccHHHHHHHHHHHHhhh Q lcl|NC_019921. 62 ----------------------------AQSLSANQRS---------FFMDINKNVNYKEEKLLPEETIDRIFEDLTTNH 104 (381) Q Consensus 62 ----------------------------~~~lt~~e~~---------~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~ 104 (381) ........+. .-+.+..+++.+||++||+++.++|++.+++.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~ 160 (435) T protein:vir:14 81 AAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKS 160 (435) T ss_pred ccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhhhc Confidence 0000000000 012334566778999999999999999999999 Q ss_pred hhhhh-ceeEec-CCceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHH--HHHHHHH Q lcl|NC_019921. 105 PLLAD-LGIKNA-GLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPA--WIERFVR 180 (381) Q Consensus 105 ~l~~~-~~v~~~-~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~--~~e~~l~ 180 (381) +++++ ++++++ ++.+++|+.++.+.+.|++|++..+ +++++|+++++.+|+++++++||++||+||.+ +|++||. T Consensus 161 ~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~ 239 (435) T protein:vir:14 161 VVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIP-TTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVV 239 (435) T ss_pred hhhhhcceeeecCCCceEEEEEeCCcceeeeccCcccc-ccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHH Confidence 99997 777775 5679999999999999999988765 68899999999999999999999999999965 5999999 Q ss_pred HHHHHHHHHHHhhheeeccCC-CcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccc Q lcl|NC_019921. 181 VQIEEAFAVALETAFLKGTGK-DQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSV 259 (381) Q Consensus 181 ~~la~~~~~~~~~a~i~G~G~-~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~ 259 (381) ++|+++++++++.+|++|+|+ ++|.||++.......... ............+.+++..+.. .. T Consensus 240 ~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~l~~~~~~-----~~ 303 (435) T protein:vir:14 240 GDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITA-----------SDASTLQKIETDLGKVILALEN-----AD 303 (435) T ss_pred HHHHHHHHHHHHHHhhccCCCCccccceeecccccceecc-----------ccccchhhHHHHHHHHHHHhhh-----cc Confidence 999999999999999999998 489999864322111000 0001111222334444433321 12 Q ss_pred cccCceEEEEchhhHHHHhhhhhccCCCCccccc-----cCCCceeEecCCCCCC--------cEEEEeecceEEEeecc Q lcl|NC_019921. 260 AVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQEAG--------KVLTYVKGLYDGYLAGG 326 (381) Q Consensus 260 ~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~-----l~~G~pVv~s~~~p~~--------~i~fgd~~~y~i~~r~~ 326 (381) .++.+++|+||+.++..+.. +++.+|+|+|. .++|+||++++.||++ .|+||||++|+|++|++ T Consensus 304 ~~~~~~~~v~n~~~~~~L~~---lkd~~G~~l~~~~~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~ 380 (435) T protein:vir:14 304 ANLTQPGWIMAPRTFRFLEG---LRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGDVFIGEEET 380 (435) T ss_pred ccccCCEEEEcHHHHHHHHH---hhccCCceeccCCCCCeeecceeEeeccccccccCCCccceEEEeecccEEEEEecc Confidence 24567889999999877754 47899999873 3489999999999863 58999999999999999 Q ss_pred eEEEeehhh-----------hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCc Q lcl|NC_019921. 327 INVQKFKET-----------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 327 i~i~~~~~~-----------~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~ 371 (381) +++.++++. +|.+|+++||+.+|+|+++++++||++++=-- .+. T Consensus 381 ~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~-~~~ 435 (435) T protein:vir:14 381 LEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVA-WGA 435 (435) T ss_pred cEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCC-CCC Confidence 999999874 48899999999999999999999999866222 221 No 22 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=9.7e-59 Score=338.64 Aligned_cols=350 Identities=14% Similarity=0.087 Sum_probs=239.5 Q ss_pred Cch-hHHHHHHHHHHHHHHHHh---hhh-hHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHh-----------hcc Q lcl|NC_019921. 1 MTI-NLSETFANAKNEFINAVN---NGE-PQERQNEL---YGDMINQLFEETKLQAKAEAERVSSL-----------PKS 61 (381) Q Consensus 1 mt~-el~~~~~~~~~~~~~~~k---~~~-~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~ 61 (381) |+| ||+++..+..+++.+.++ +.. ..+++.+. +...++.+.+++......+....... ... T Consensus 1 M~l~eL~~~r~~~~~~~~~l~~~~~e~~~l~~ee~~~~~~l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~ 80 (435) T protein:vir:80 1 MNVNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAVTASA 80 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhcccc Confidence 999 466655555544433222 110 11122222 22223333332221111000000000 000 Q ss_pred ----------------------------ccccCHHHH---------HHHHHHhhccCCCCceeccHHHHHHHHHHHHhhh Q lcl|NC_019921. 62 ----------------------------AQSLSANQR---------SFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNH 104 (381) Q Consensus 62 ----------------------------~~~lt~~e~---------~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~ 104 (381) ......... ...+.+..+++..||++||+++.++|++.+++.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~ 160 (435) T protein:vir:80 81 AAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKS 160 (435) T ss_pred ccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhhc Confidence 000000000 0112234566778999999999999999999999 Q ss_pred hhhhh-ceeEec-CCceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHH--HHHHHHH Q lcl|NC_019921. 105 PLLAD-LGIKNA-GLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPA--WIERFVR 180 (381) Q Consensus 105 ~l~~~-~~v~~~-~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~--~~e~~l~ 180 (381) +|+++ ++++++ ++..++|+.++.+.+.|++|++..+ +++++|++|++.+|+++++++||++||+||.+ ++++||. T Consensus 161 ~i~~~~~~~v~~~~~~~~~p~~~~~~~a~~v~E~~~~~-~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~ 239 (435) T protein:vir:80 161 VVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIP-TTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVV 239 (435) T ss_pred hhhhccceeeecCCCceEEEEEeCCcceeeeccCcccc-ccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHH Confidence 99998 788775 5679999999999999999987765 68899999999999999999999999999954 7999999 Q ss_pred HHHHHHHHHHHhhheeeccCC-CcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccc Q lcl|NC_019921. 181 VQIEEAFAVALETAFLKGTGK-DQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSV 259 (381) Q Consensus 181 ~~la~~~~~~~~~a~i~G~G~-~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~ 259 (381) ++++++++++++.+|++|+|+ ++|.||++.......... ............+.+++..+.. .. T Consensus 240 ~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~-----------~~~~~~~~~~~d~~~~~~~~~~-----~~ 303 (435) T protein:vir:80 240 GDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITA-----------SDGSTLQKIETDLGKAILALEN-----AD 303 (435) T ss_pred HHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeec-----------ccccchhhHHHHHHHHHHHhhc-----cc Confidence 999999999999999999997 489999875432221110 0011111112223333322211 12 Q ss_pred cccCceEEEEchhhHHHHhhhhhccCCCCccccc-----cCCCceeEecCCCCCC--------cEEEEeecceEEEeecc Q lcl|NC_019921. 260 AVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQEAG--------KVLTYVKGLYDGYLAGG 326 (381) Q Consensus 260 ~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~-----l~~G~pVv~s~~~p~~--------~i~fgd~~~y~i~~r~~ 326 (381) .++.+++|+||+.++..+.. +++.+|+|+|. ..+|+||++++.||++ .|+||||++|+|++|++ T Consensus 304 ~~~~~~~~vmn~~~~~~L~~---lkd~~G~~l~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~ 380 (435) T protein:vir:80 304 ANLTQPGWIMAPRTFRFLEG---LRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFGDVFIGEEET 380 (435) T ss_pred cccccCEEEEcHHHHHHHHh---hhccCCceeccCCCCCeEeeeeeEEeccccccccCCCCcceEEEEEcccEEEEeecc Confidence 35678899999999877654 57899999874 3489999999999863 48999999999999999 Q ss_pred eEEEeehhhh-----------hhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcc Q lcl|NC_019921. 327 INVQKFKETL-----------ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 327 i~i~~~~~~~-----------~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~ 372 (381) ++|++++|.. |..|+++||+..|+|+++++++||++++= + +-.+ T Consensus 381 ~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~-~-~~~~ 435 (435) T protein:vir:80 381 LEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSG-V-AWGA 435 (435) T ss_pred eEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEec-c-CCCC Confidence 9999999863 88999999999999999999999998662 2 2222 No 23 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=2.4e-59 Score=342.00 Aligned_cols=337 Identities=12% Similarity=0.046 Sum_probs=232.2 Q ss_pred hhHHHHHHHHHHHHHHHHhhhhhHHHH-----HHHHHHHHHHHHH--HHHH--HHHHHHHHHHHhhccccccCHHHHHHH Q lcl|NC_019921. 3 INLSETFANAKNEFINAVNNGEPQERQ-----NELYGDMINQLFE--ETKL--QAKAEAERVSSLPKSAQSLSANQRSFF 73 (381) Q Consensus 3 ~el~~~~~~~~~~~~~~~k~~~~~~~~-----~~~~~~~~~~~~~--~~~~--~~~~~~~~~~~~~~~~~~lt~~e~~~~ 73 (381) ||..++++++.+.+..+....+.+..+ .+........... ...+ ....+..+........+.......... T Consensus 1 ~eei~~l~~~~~~l~~~~~~l~~~~d~~e~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~ 80 (352) T protein:vir:78 1 MEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLNDNEKLVKAKAEFYRHAILPNEFEKPSMEAQRLL 80 (352) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccchhhhHHHHHHHHHHHHhhhhHHHHHHhhHHHHH Confidence 777767766666655443322211110 0000000000000 0000 001111111111100000001112233 Q ss_pred HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEE-ecCCcceEEeecccccccccCcceeeEee Q lcl|NC_019921. 74 MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLK-SETSGVAVWGKIYGEIKGQLDAAFSEETA 152 (381) Q Consensus 74 ~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~~~~p~-~~~~~~a~wv~e~~~~~~~~~~~f~~v~l 152 (381) +++..+++++||++||+++.++|++.+++++|||++|+++++++. .+|+ ..+.+.+.|++|++..+ +++++|++|++ T Consensus 81 ~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~~-~~p~~~~~~~~a~~v~E~~~~~-~~~~~f~~v~~ 158 (352) T protein:vir:78 81 HALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL-EIPRVSYTLDDDDFITDVETAK-ELKLKGDTVKF 158 (352) T ss_pred HHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCCc-eEEEEecCCCcccccccccccc-cccccceeeee Confidence 566778899999999999999999999999999999999998765 4555 45567899999987765 57899999999 Q ss_pred cceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhh-heeeccCCCcceEeeeccccccccccccccceeeeeee Q lcl|NC_019921. 153 IQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALET-AFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTL 231 (381) Q Consensus 153 ~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~-a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~ 231 (381) .+|+++++++||++||+||.+|+++||.++|+++++++++. +|.+|+|+++|.|+++..... .+ T Consensus 159 ~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~---------------~~ 223 (352) T protein:vir:78 159 TTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVK---------------EV 223 (352) T ss_pred cceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceeccccc---------------cc Confidence 99999999999999999999999999999999999998655 778999999999998642211 01 Q ss_pred cccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc----cCCCceeEecCCCC Q lcl|NC_019921. 232 TFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA----LPFNLNVIESTVQE 307 (381) Q Consensus 232 t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~----l~~G~pVv~s~~~p 307 (381) ....++|.+.+++..+ +..|++|++|+||+.+++.++.+ ++.+|.|+|. ..+|+||+++++++ T Consensus 224 ---t~~~~~d~i~~~~~~l-------~~~~~~~a~~~mn~~t~~~l~~~---~~~~~~~~~~~~~~~llG~PV~~~~~~~ 290 (352) T protein:vir:78 224 ---EGANMYDAIINALADL-------HEDYRDNATIYMRYADYVKIISV---LSNGTTNFFDTPAEKVFGKPVVFTDAAV 290 (352) T ss_pred ---cccchHHHHHHHHhcc-------ChhhhcCCEEEEehHHHHHHHHH---HhccCCcccccCCccccccceEEecCCC Confidence 1223467777766544 55788999999999998887654 2334455442 23799999999886 Q ss_pred CCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcccc Q lcl|NC_019921. 308 AGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 308 ~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~ 374 (381) +++||||++|++. +.++.++++.+ +.+++++|++..|+||++++++||++++++-++....+ T Consensus 291 --~~~~Gdf~~~~~~-~~~~~~~~~~~--~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~~~~~~ 352 (352) T protein:vir:78 291 --KPIVGDFNYFGIN-YDGTTYDTDKD--VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKESTGSLPS 352 (352) T ss_pred --ceeEeehhhhhhh-hhhheeeeecc--ccCCeeEEEEEeeeCceeechhheEEEEeecccCCCCC Confidence 6899999997654 45677777776 34799999999999999999999998777665444222 No 24 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=1.7e-58 Score=337.28 Aligned_cols=349 Identities=12% Similarity=0.030 Sum_probs=235.2 Q ss_pred CchhHHHHHHHHHHHHHHHHhh---hh-----hHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHh------hccc- Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNN---GE-----PQERQNELYGD---MINQLFEETKLQAKAEAERVSSL------PKSA- 62 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~---~~-----~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~------~~~~- 62 (381) |.- .+++++.++++.+.++. .+ ..+++.+.++. .++.+..++......+....... ..+. T Consensus 1 M~k--l~~L~e~r~~l~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~ 78 (428) T protein:vir:10 1 MPQ--IEELRRQRAGINEQIQALATIEATNGTLTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKPVKATQHGPA 78 (428) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhccc Confidence 544 33444444333222211 10 11222233332 23333322221111110000000 0000 Q ss_pred -------cccCH-HHHH-----------------H---------HHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhh Q lcl|NC_019921. 63 -------QSLSA-NQRS-----------------F---------FMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLA 108 (381) Q Consensus 63 -------~~lt~-~e~~-----------------~---------~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~ 108 (381) ..... ...+ + .......+.+.||++||++++++|++.+++.++|++ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~ 158 (428) T protein:vir:10 79 VIVKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRK 158 (428) T ss_pred cccccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchhhh Confidence 00000 0000 0 000112344578999999999999999999999999 Q ss_pred h-ceeEec-CCceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHH Q lcl|NC_019921. 109 D-LGIKNA-GLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEA 186 (381) Q Consensus 109 ~-~~v~~~-~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~ 186 (381) + ++++++ +|.+++|+.++.+.+.|++|++..+ +++++|++|++.+++++++++||++||+||.+++++||.++|+++ T Consensus 159 ~~~~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~a 237 (428) T protein:vir:10 159 LGARSIPLPNGNMSLPRLAGGATASYTGENQDAK-VSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTA 237 (428) T ss_pred hcceeeecCCcceEEEEEeCCcceeeeccCcccc-ccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHH Confidence 9 777775 5789999999899999999988765 578999999999999999999999999999999999999999999 Q ss_pred HHHHHhhheeeccCCC-cceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCce Q lcl|NC_019921. 187 FAVALETAFLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNV 265 (381) Q Consensus 187 ~~~~~~~a~i~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a 265 (381) ++++++.+|++|+|++ +|.||++..+.......... . . .......+.+.+.+... ......+..++ T Consensus 238 i~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~-----~---~-~~~~~~~~~~~~~~~~~----~~~~~~~~~~~ 304 (428) T protein:vir:10 238 ISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAA-----D---A-AVNLDTIDTYLDSIILM----SMDGNSNMISS 304 (428) T ss_pred HHHHHHHHHhccCCCCccccccccccccccccccccc-----c---c-cccHHHHHHHHHHHHHh----hhccccccccC Confidence 9999999999999985 99999976443222111100 0 0 00111122222222211 11234566788 Q ss_pred EEEEchhhHHHHhhhhhccCCCCcccccc-----CCCceeEecCCCCCC--------cEEEEeecceEEEeecceEEEee Q lcl|NC_019921. 266 TMVVNPSDAFEVQAQYTHLNANGVYVTAL-----PFNLNVIESTVQEAG--------KVLTYVKGLYDGYLAGGINVQKF 332 (381) Q Consensus 266 ~~~mn~~t~~~~~~~~~~~~~~G~~~~~l-----~~G~pVv~s~~~p~~--------~i~fgd~~~y~i~~r~~i~i~~~ 332 (381) +|+||+.++..+.. +++.+|+|+|.. .+|+||+++++||++ .|+||||++|++++++++++.++ T Consensus 305 ~~v~n~~~~~~L~~---lkd~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~i~i~~~ 381 (428) T protein:vir:10 305 GWGMSNRTYMKLFG---LRDGNGNKVYPEMAQGMLKGYPIQRTSAIPANLGEGGKESEIYFADFNDVVIGEDGNMKVDFS 381 (428) T ss_pred EEEEcHHHHHHHHH---hhccCCceeccCCCCCeeeceeeEEeccccccccCCCccceEEEEecceEEEEEecceEEEee Confidence 99999999887765 478999999842 379999999999863 38999999999999999999999 Q ss_pred hhh-----------hhhcCceEEEEEEEEcCEEecCceEEEEEEEecC Q lcl|NC_019921. 333 KET-----------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 333 ~~~-----------~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~ 369 (381) ++. .|.+|+++||+..|+|+++++|+|||+++-.. = T Consensus 382 ~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~-~ 428 (428) T protein:vir:10 382 KEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVL-F 428 (428) T ss_pred cccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccC-C Confidence 874 58899999999999999999999999977322 2 No 25 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=1.3e-58 Score=337.87 Aligned_cols=329 Identities=10% Similarity=0.050 Sum_probs=241.6 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhh---hH--HHHHHHHHHHHHHHHHHHHH-------H--------------------- Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNGE---PQ--ERQNELYGDMINQLFEETKL-------Q--------------------- 47 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~~---~~--~~~~~~~~~~~~~~~~~~~~-------~--------------------- 47 (381) |.|++++++.+.+++..+.+.+.+ .+ .++.+......+.+.++... . T Consensus 1 ~~~~m~k~l~el~~~~~~~~~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:12 1 MPMQMSKKEIALRQQFTEKKQQADKALQEGNTDEARALLDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEGQ 80 (397) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhccc Confidence 888888777766555443322211 00 01111111111111111000 0 Q ss_pred ---------HHHHHHHHHHhhccccccCHHHHHHHH-----HHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeE Q lcl|NC_019921. 48 ---------AKAEAERVSSLPKSAQSLSANQRSFFM-----DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK 113 (381) Q Consensus 48 ---------~~~~~~~~~~~~~~~~~lt~~e~~~~~-----~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~ 113 (381) ...++.+++.....++.+..+++.++. ++..+++++||++||+++.+.|++.+++.+||+++|+++ T Consensus 81 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~ 160 (397) T protein:vir:12 81 RSQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVE 160 (397) T ss_pred ccccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhccee Confidence 001122222222223344555554432 334556788999999999999999999999999999998 Q ss_pred ecC---CceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHH Q lcl|NC_019921. 114 NAG---LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVA 190 (381) Q Consensus 114 ~~~---g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~ 190 (381) +++ +.+.+|+..+.+.+.|++|+++.+..+.++|++|++.+|+++++++||++||+||.+++++||.++|+++++++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~ 240 (397) T protein:vir:12 161 PVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVT 240 (397) T ss_pred eccCCceeEEEEEecCCcceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHH Confidence 874 45778888888999999999887766789999999999999999999999999999999999999999999999 Q ss_pred HhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEc Q lcl|NC_019921. 191 LETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVN 270 (381) Q Consensus 191 ~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn 270 (381) +|.+|++|+|+++|.|+++ ++.+..++.. .....|+++++|+|| T Consensus 241 ~d~~il~G~g~~~~~g~~~------------------------------~~~i~~~~~~------~l~~~~~~~a~~~~n 284 (397) T protein:vir:12 241 RNNLILAAIASLKKVDIDG------------------------------LDGIKKALNV------TLDPMVAPGSIVLTN 284 (397) T ss_pred HHHHHHhcccccccccccc------------------------------HHHHHHHHhh------ccchhhhCCCEEEEc Confidence 9999999999999998853 1222222211 123467889999999 Q ss_pred hhhHHHHhhhhhccCCCCccccc---------cCCCceeEecCC-CCC---C--cEEEEeecc-eEEEeecceEEEeehh Q lcl|NC_019921. 271 PSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTV-QEA---G--KVLTYVKGL-YDGYLAGGINVQKFKE 334 (381) Q Consensus 271 ~~t~~~~~~~~~~~~~~G~~~~~---------l~~G~pVv~s~~-~p~---~--~i~fgd~~~-y~i~~r~~i~i~~~~~ 334 (381) +.++..+.. +++++|+|+|. ..+|+||+++++ +|. + .++||||++ |.+++|++++|+.+++ T Consensus 285 ~~~~~~L~~---lkd~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~ 361 (397) T protein:vir:12 285 QDGYDWLDT---LKDGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDT 361 (397) T ss_pred HHHHHHHHH---hhccCCceeecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeecceEEEEecc Confidence 999877764 47889999874 247999987655 442 2 389999998 6799999999988765 Q ss_pred h--hhhcCceEEEEEEEEcCEEecCceEEEEEEEec Q lcl|NC_019921. 335 T--LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 335 ~--~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~ 368 (381) . .|.+|++.||+..|+||++++++||+++++... T Consensus 362 ~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 362 GAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred ccchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 4 589999999999999999999999999998886 No 26 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=1.7e-59 Score=342.82 Aligned_cols=340 Identities=11% Similarity=0.046 Sum_probs=225.4 Q ss_pred Cc--hhHHHHHHHHHHHHHHH---H----hhhhhHHHHHHHHHHHHHHH-------HHHHHHHHHHH------------- Q lcl|NC_019921. 1 MT--INLSETFANAKNEFINA---V----NNGEPQERQNELYGDMINQL-------FEETKLQAKAE------------- 51 (381) Q Consensus 1 mt--~el~~~~~~~~~~~~~~---~----k~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~------------- 51 (381) |. .|+++++.+.+.++.+. + .+.+...++...+....+.+ .++........ T Consensus 16 mk~l~el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 95 (402) T protein:vir:93 16 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 95 (402) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 54 34554444444333221 1 11111011111111111111 11110000000 Q ss_pred -------------HHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc Q lcl|NC_019921. 52 -------------AERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR 118 (381) Q Consensus 52 -------------~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~ 118 (381) +.+................+..+++..+++++||++||++++++|++.+++++|||++|+++++++ T Consensus 96 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~- 174 (402) T protein:vir:93 96 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG- 174 (402) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCC- Confidence 000000000000000111122345667788999999999999999999999999999999999875 Q ss_pred eEEEEe-cCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHh-hhee Q lcl|NC_019921. 119 LKFLKS-ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE-TAFL 196 (381) Q Consensus 119 ~~~p~~-~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~-~a~i 196 (381) ..+|+. .+.+++.|++|++..+ +++|+|+++++.+|+++++++||++||+||.+|+++||.++|+++++++++ .+|. T Consensus 175 ~~~p~~~~~~~~a~~v~Eg~~~~-~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~ 253 (402) T protein:vir:93 175 LEIPRVSYTLDDDDFITDVETAK-ELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA 253 (402) T ss_pred ceeeeeeccCCcccccccccccc-ccccccceeeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 456764 4567899999987765 578999999999999999999999999999999999999999999999975 4678 Q ss_pred eccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHH Q lcl|NC_019921. 197 KGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFE 276 (381) Q Consensus 197 ~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~ 276 (381) +|+|+++|.|+++..+... .....+++.+.++++.+ +..|+.|++|+||+.|++. T Consensus 254 ~g~g~g~p~g~~~~~~~~~------------------~~~~~~~d~l~~~~~~l-------~~~y~~na~~imn~~t~~~ 308 (402) T protein:vir:93 254 VSPKSGLEHMSFYNGSVKE------------------VEGADMYDAIINALADL-------HEDYRDNATIYMRYADYVK 308 (402) T ss_pred cCCCccccceeeecccccc------------------ccccchHHHHHHHHhcc-------ChhhhcCCEEEEechHHHH Confidence 9999999999986422110 11223466677766543 4578899999999999887 Q ss_pred HhhhhhccCCCCcccccc---CCCceeEecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCE Q lcl|NC_019921. 277 VQAQYTHLNANGVYVTAL---PFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGK 353 (381) Q Consensus 277 ~~~~~~~~~~~G~~~~~l---~~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk 353 (381) ++.. .++.+|.+.++. .+|+||+++++++ +++||||++|++. +.++.+..+++. ..|+++||+..|+||+ T Consensus 309 ~~~~--~~d~~~~~~~~~~~~llG~PV~~t~~~~--~i~~GDf~~~~~~-~~~~~~~~~~~~--~~~~~~~~~~~r~Dg~ 381 (402) T protein:vir:93 309 IISV--LSNGTTNFFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDTDKDV--KKGEYLFVLTAWYDQQ 381 (402) T ss_pred HHHH--HhcCCCcccccCCccccccceEEecCCC--ceeeechhhhhhh-hhhhhhhhhhcc--cCCceEEEEEEEeCcE Confidence 7654 344544444333 3799999999886 6899999995443 234556666664 3599999999999999 Q ss_pred EecCceEEEEEEEecCCcccc Q lcl|NC_019921. 354 AKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 354 ~~~~~Afvv~~~~~~~~~~~~ 374 (381) +++++||++++++-+.....+ T Consensus 382 v~~~~A~~~l~ik~~~~~~~~ 402 (402) T protein:vir:93 382 RTLDSAFRIAKAKENTGPLPS 402 (402) T ss_pred EechhheEEEEeecCCCCCCC Confidence 999999999887654332222 No 27 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=2.3e-58 Score=336.59 Aligned_cols=339 Identities=12% Similarity=0.101 Sum_probs=238.5 Q ss_pred CchhHHHHHHHHHHHHHHHHhh--------hhhHHHHHHHHHHHHHH---HHHHHHH-HHHHHHH--------------- Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNN--------GEPQERQNELYGDMINQ---LFEETKL-QAKAEAE--------------- 53 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~--------~~~~~~~~~~~~~~~~~---~~~~~~~-~~~~~~~--------------- 53 (381) |+- +.+++.+.+++..+.++. .+..++..+.++.+..+ +..+... +.+.... T Consensus 1 m~~-l~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~ 79 (390) T protein:vir:81 1 MTD-ITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVG 79 (390) T ss_pred ChH-HHHHHHHHHHHHHHHHHHHHHHHHhhcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccch Confidence 432 222333333333222221 11111222233332222 2111111 0000000 Q ss_pred ---------HHHHhhc--cccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEE Q lcl|NC_019921. 54 ---------RVSSLPK--SAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKF 121 (381) Q Consensus 54 ---------~~~~~~~--~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~ 121 (381) +...... .......+.+...+....++..++|+++|+++...|++.+++.++|+++|+++++++ .+++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 159 (390) T protein:vir:81 80 DMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEY 159 (390) T ss_pred hhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEE Confidence 0000000 000111222233334445567788999999999999999999999999999999765 4789 Q ss_pred EEecC-CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccC Q lcl|NC_019921. 122 LKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTG 200 (381) Q Consensus 122 p~~~~-~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G 200 (381) |+.++ .+.+.|++|+++.+ +++++|+++++.+|+++++++||++||+|+. ++++||.++|+++++++++.+|++|+| T Consensus 160 ~~~~~~~~~a~~v~Eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~~~~~i~~~l~~~~~~~~d~a~l~G~g 237 (390) T protein:vir:81 160 VQETGFVNNAAIVAEGALKP-ESSLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTG 237 (390) T ss_pred EEEecCCcceeeecCCcccc-cccceeeEEEEeeeEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 98765 46899999988765 6789999999999999999999999999985 799999999999999999999999999 Q ss_pred CCc-ceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhh Q lcl|NC_019921. 201 KDQ-PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) Q Consensus 201 ~~~-P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~ 279 (381) +++ |.||++......... .......++.+.++...+ ...+..+++|+|||.++..+.. T Consensus 238 ~~~~~~Gi~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~v~~~~~~~~l~~ 296 (390) T protein:vir:81 238 ANDGLLGLIPQATTYAAPT--------------TIAGATRVDQLRLAMLQA-------SLAEYNPSGIVINPIDWAAIEL 296 (390) T ss_pred CCCcccceeeccccccccc--------------ccccchhHHHHHHHHHhh-------ccccCCCCEEEEcHHHHHHHHH Confidence 875 999997543221111 111223455566555443 3345677789999999887765 Q ss_pred hhhccCCCCcccccc--------CCCceeEecCCCCCCcEEEEeecc-eEEEeecceEEEeehh-hhhhcCceEEEEEEE Q lcl|NC_019921. 280 QYTHLNANGVYVTAL--------PFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKE-TLALDDMDLYTAKQF 349 (381) Q Consensus 280 ~~~~~~~~G~~~~~l--------~~G~pVv~s~~~p~~~i~fgd~~~-y~i~~r~~i~i~~~~~-~~~~~d~~~~r~~~r 349 (381) +++.+|+|+|.. ++|+||+++++||+++++||||++ |.+++|++++|+.+++ .+|.+|++.||+..| T Consensus 297 ---lkd~~G~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~v~~r~~~r 373 (390) T protein:vir:81 297 ---AKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVGEDFQRNMITVLAEER 373 (390) T ss_pred ---hhcCCCceeecCcccccCceecceeeEEcCCCCCCcEEEEehhceEEEEEecceEEEEecccchhhcCcEEEEEEEe Confidence 478899998753 379999999999999999999998 8999999999999885 689999999999999 Q ss_pred EcCEEecCceEEEEEEE Q lcl|NC_019921. 350 AYGKAKDNKVAAVWKLD 366 (381) Q Consensus 350 ~dGk~~~~~Afvv~~~~ 366 (381) +||++++++|||++++- T Consensus 374 ~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 374 LALVVYRPEALISGSFA 390 (390) T ss_pred eccEEecccceEEEEeC Confidence 99999999999987766 No 28 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=1.3e-58 Score=338.00 Aligned_cols=340 Identities=14% Similarity=0.101 Sum_probs=234.5 Q ss_pred Cchh---HHHHHHHHHHHHHHHHh----hhhhHHHHHHHHHHHH---HHHHHHHHH-HHHHHHHH--------------- Q lcl|NC_019921. 1 MTIN---LSETFANAKNEFINAVN----NGEPQERQNELYGDMI---NQLFEETKL-QAKAEAER--------------- 54 (381) Q Consensus 1 mt~e---l~~~~~~~~~~~~~~~k----~~~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~~~~--------------- 54 (381) |+-. +++++.+...++.+.+. +.+..++..+.++.+. +.+..+..+ +.+.+... T Consensus 1 m~e~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (390) T protein:vir:10 1 MTDITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGD 80 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccchhh Confidence 4322 33333333322222111 1111122222222222 112111111 00000000 Q ss_pred ---------HHHhhc-ccc-ccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEE Q lcl|NC_019921. 55 ---------VSSLPK-SAQ-SLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFL 122 (381) Q Consensus 55 ---------~~~~~~-~~~-~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p 122 (381) ..+... ... ....+.+...+....++..++|.++|+++.+.|++.+++.+||+++|+++++++ .+++| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 160 (390) T protein:vir:10 81 LFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYV 160 (390) T ss_pred hhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEE Confidence 000000 000 001111112222333445566778888899999999999999999999999865 47999 Q ss_pred EecC-CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCC Q lcl|NC_019921. 123 KSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGK 201 (381) Q Consensus 123 ~~~~-~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~ 201 (381) +.++ .+.+.|++|+++.+ +++++|+++++.+|+++++++||++||+|+. ++++||.++|+++++++++.+||+|+|+ T Consensus 161 ~~~~~~~~a~~v~Eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~l~~~~~~~~~~~il~G~G~ 238 (390) T protein:vir:10 161 QETGFVNNAAIVAEGALKP-ESSLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTGA 238 (390) T ss_pred EEecCCcceeeecCCcccc-ccccceeEEEEeeEEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 8765 46899999987765 6789999999999999999999999999986 8999999999999999999999999998 Q ss_pred C-cceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhh Q lcl|NC_019921. 202 D-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ 280 (381) Q Consensus 202 ~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~ 280 (381) + +|.||++........ +.......++.+.++...+ ...++.+++|+|||.++..+.. T Consensus 239 ~~~p~Gi~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~l-------~~~~~~~~~~v~n~~~~~~L~~- 296 (390) T protein:vir:10 239 NDGLLGLIPQATTYAAP--------------TTIAGATRVDQLRLAMLQA-------SLAEYPASGIVINPIDWAAIEL- 296 (390) T ss_pred Ccccccccccccccccc--------------ccccccchHHHHHHHHHhh-------ccccCCCCEEEEcHHHHHHHHH- Confidence 6 599999754322111 1111223455566655444 3356778899999999877764 Q ss_pred hhccCCCCcccccc--------CCCceeEecCCCCCCcEEEEeecc-eEEEeecceEEEeehh-hhhhcCceEEEEEEEE Q lcl|NC_019921. 281 YTHLNANGVYVTAL--------PFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKE-TLALDDMDLYTAKQFA 350 (381) Q Consensus 281 ~~~~~~~G~~~~~l--------~~G~pVv~s~~~p~~~i~fgd~~~-y~i~~r~~i~i~~~~~-~~~~~d~~~~r~~~r~ 350 (381) .++.+|+|+|.. .+|+||+++++||+++++||||++ |.+++|++++|+.+++ .+|.+|++.||+..|+ T Consensus 297 --lkd~~g~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gdf~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~ 374 (390) T protein:vir:10 297 --AKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERL 374 (390) T ss_pred --hhcCCCceeecCCcCcCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEEEee Confidence 478999998763 379999999999999999999998 8899999999999885 6899999999999999 Q ss_pred cCEEecCceEEEEEEE Q lcl|NC_019921. 351 YGKAKDNKVAAVWKLD 366 (381) Q Consensus 351 dGk~~~~~Afvv~~~~ 366 (381) ||++++++||+++++- T Consensus 375 d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 375 ALVVYRPEALISGSFA 390 (390) T ss_pred ccEEeccccEEEEEeC Confidence 9999999999986665 No 29 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=7.8e-59 Score=339.14 Aligned_cols=340 Identities=12% Similarity=0.059 Sum_probs=230.0 Q ss_pred Cc--hhHHHHHHHHHHHHHH-------HHhhhhhHHHHHHHHH-------HHHHHHHHHHHH---HHHH----------- Q lcl|NC_019921. 1 MT--INLSETFANAKNEFIN-------AVNNGEPQERQNELYG-------DMINQLFEETKL---QAKA----------- 50 (381) Q Consensus 1 mt--~el~~~~~~~~~~~~~-------~~k~~~~~~~~~~~~~-------~~~~~~~~~~~~---~~~~----------- 50 (381) |- .|+++++.+.+.++.+ ...+.+...++.+... ...+.+.++... +.+. T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccCCC Confidence 44 2444444443333322 1111111011111111 111111111110 0000 Q ss_pred ------------HHHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc Q lcl|NC_019921. 51 ------------EAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR 118 (381) Q Consensus 51 ------------~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~ 118 (381) ++.+........+......+...+++..+++++|||+||++++++|++.+++++||+++|+++++++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~- 159 (387) T protein:vir:93 81 LNDHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG- 159 (387) T ss_pred cchhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCC- Confidence 0000000000000001111223345667888999999999999999999999999999999999875 Q ss_pred eEEEE-ecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhh-hee Q lcl|NC_019921. 119 LKFLK-SETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALET-AFL 196 (381) Q Consensus 119 ~~~p~-~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~-a~i 196 (381) ..+|+ ..+.+.+.|++|++..+ +++++|+++++.+|+++++++||+|||+||.+|+++||.++++++++++++. +|. T Consensus 160 ~~~p~~~~~~~~a~~v~E~~~~~-~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~ 238 (387) T protein:vir:93 160 LEIPRVSYTLDDDDFITDVETAK-ELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA 238 (387) T ss_pred ceEEEEeecCCccccccCccccc-ccccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 45676 44567899999987765 5789999999999999999999999999999999999999999999999765 678 Q ss_pred eccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHH Q lcl|NC_019921. 197 KGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFE 276 (381) Q Consensus 197 ~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~ 276 (381) +|+|+++|.|++...... ......+++.+.++++.+ +..|+.|++|+||+.+++. T Consensus 239 ~g~g~g~p~g~l~~~~~~------------------~v~~~~~~d~i~~~~~~l-------~~~~~~~a~~~mn~~t~~~ 293 (387) T protein:vir:93 239 VSPKSGLDHMSFYNGSVK------------------EVEGADMYDAIINALADL-------HEDYRDNATIYMRYADYVK 293 (387) T ss_pred cCCCccccceeeeccccc------------------cccccchHHHHHHHHhcc-------ChhhhcCCEEEEechHHHH Confidence 999999999998542111 011223466677766544 4578899999999999877 Q ss_pred HhhhhhccCCCCccccccC---CCceeEecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCE Q lcl|NC_019921. 277 VQAQYTHLNANGVYVTALP---FNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGK 353 (381) Q Consensus 277 ~~~~~~~~~~~G~~~~~l~---~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk 353 (381) ++.. .++++|.|.++.| +|+||+++++++ +++||||++|++. +.++.+.++.+ +.+++++|++..|+||+ T Consensus 294 ~~~~--~~d~~~~~~~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~-~~~~~~~~~~~--~~~~~~~~~~~~r~d~~ 366 (387) T protein:vir:93 294 IISV--LSNGTTNFFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDTDKD--VKKGEYLFVLTAWYDQQ 366 (387) T ss_pred HHHH--HhcCCCcccccCCccccccceEEecCCC--ceeeeehhhhhee-hhhheeeeccc--ccCCceeEEEEeeeCce Confidence 6543 4566676665443 799999999886 5899999997654 55677777665 55799999999999999 Q ss_pred EecCceEEEEEEEecCCcccc Q lcl|NC_019921. 354 AKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 354 ~~~~~Afvv~~~~~~~~~~~~ 374 (381) +++++||++++++-++....+ T Consensus 367 v~~~eA~~~l~~k~~~~~~~~ 387 (387) T protein:vir:93 367 RTLDSAFRIAKAKENTGSLPS 387 (387) T ss_pred eechhheEEEEeecCCCCCCC Confidence 999999999877664443222 No 30 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=3.3e-58 Score=335.69 Aligned_cols=328 Identities=13% Similarity=0.071 Sum_probs=240.7 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhhh--HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHhh---ccccccCHHHHHHH Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNGEP--QERQNELYGDMINQLFEETKLQAKA--EAERVSSLP---KSAQSLSANQRSFF 73 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~---~~~~~lt~~e~~~~ 73 (381) |+-++++..++ +++..+.++.... +.++.......++.+.++....... +..+..... ........++++.| T Consensus 1 M~k~l~~l~e~-~~~~~~e~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (371) T protein:vir:81 1 MPKELRELLEQ-INNKKEEARKLLAENKIEEAKKLKEEIVALQEKFDVAKELYEEQKQTIEDKEPLKPTVQVKENEVEAF 79 (371) T ss_pred CcHHHHHHHHH-HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHH Confidence 99776544444 3333333333211 1122222333333333332221111 111110000 00011112222222 Q ss_pred ---------HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEEEEecCCcceEEeeccccccc Q lcl|NC_019921. 74 ---------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKSETSGVAVWGKIYGEIKG 141 (381) Q Consensus 74 ---------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g---~~~~p~~~~~~~a~wv~e~~~~~~ 141 (381) +++..+++++||++||+++++.|++.+++.++|+++++++++++ ...+++..+.+.+.|++|+++.+. T Consensus 80 ~~~l~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~ 159 (371) T protein:vir:81 80 VNHIRTRFRNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGE 159 (371) T ss_pred HHHHHHHHHHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeecccccccc Confidence 34567788899999999999999999999999999999998753 455677777888999999888776 Q ss_pred ccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeecccccccccccc Q lcl|NC_019921. 142 QLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGA 221 (381) Q Consensus 142 ~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~ 221 (381) .++++|+++++.+||++++++||++||+||.++|++||.+.|+++++++++.+|++|+|++.|.|+.+. T Consensus 160 ~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~~~~~~~----------- 228 (371) T protein:vir:81 160 KATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKAKTAIADL----------- 228 (371) T ss_pred ccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccH----------- Confidence 678999999999999999999999999999999999999999999999999999999999999887521 Q ss_pred ccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc-------- Q lcl|NC_019921. 222 YPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA-------- 293 (381) Q Consensus 222 ~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~-------- 293 (381) +.+..++.. ..+..|+.+++|+||+.++..+.. .++.+|+|+|. T Consensus 229 -------------------~~i~~~~~~------~l~~~~~~~a~~vmn~~~~~~L~~---lkd~~g~~l~~~~~~~~~~ 280 (371) T protein:vir:81 229 -------------------DGLKQIINV------QLDPVFRSTSSVIVNQDAFNWLDT---LKDQNGQYLLQPSISSPTG 280 (371) T ss_pred -------------------HHHHHHHHh------hcchhhhcCCEEEEcHHHHHHHHH---hhccCCCeeeecccCCCCC Confidence 112222111 113467889999999999887765 47889999875 Q ss_pred -cCCCceeEecCCCCC------------CcEEEEeecc-eEEEeecceEEEeehhh--hhhcCceEEEEEEEEcCEEecC Q lcl|NC_019921. 294 -LPFNLNVIESTVQEA------------GKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGKAKDN 357 (381) Q Consensus 294 -l~~G~pVv~s~~~p~------------~~i~fgd~~~-y~i~~r~~i~i~~~~~~--~~~~d~~~~r~~~r~dGk~~~~ 357 (381) ..+|+||+++++||. ..|+||||++ |.+++|.+++|.++++. .|.+|+++||+.+|+||+++++ T Consensus 281 ~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~ 360 (371) T protein:vir:81 281 RQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDD 360 (371) T ss_pred ceecceeEEEecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecc Confidence 347999999999973 2489999998 78899999999999986 5889999999999999999999 Q ss_pred ceEEEEEEEec Q lcl|NC_019921. 358 KVAAVWKLDLK 368 (381) Q Consensus 358 ~Afvv~~~~~~ 368 (381) +||++++++.+ T Consensus 361 ~a~~~~~~~~A 371 (371) T protein:vir:81 361 EAFVFGEVQLA 371 (371) T ss_pred cceEEEEEecC Confidence 99999998887 No 31 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=3.9e-58 Score=335.32 Aligned_cols=339 Identities=12% Similarity=0.094 Sum_probs=238.5 Q ss_pred CchhHHHHHHHHHHHHHHHHhhh--------hhHHHHHHHHHHHH---HHHHHHHHH-HHHHHHH--------------- Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNG--------EPQERQNELYGDMI---NQLFEETKL-QAKAEAE--------------- 53 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~--------~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~~~--------------- 53 (381) |+- +.+++.+.+++..+.++.. +..++..+.++.+. +.+..+... +.+.+.. T Consensus 1 m~~-~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~ 79 (390) T protein:vir:97 1 MTD-ITAKLEATLANVTDSLKAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVG 79 (390) T ss_pred ChH-HHHHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccch Confidence 321 2223333333333332221 11112222222221 111111111 0000000 Q ss_pred ---------HHHHhh--ccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEE Q lcl|NC_019921. 54 ---------RVSSLP--KSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKF 121 (381) Q Consensus 54 ---------~~~~~~--~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~ 121 (381) +..... .+......+.+...+....++..++|++||++++..|++.+++.++|+++|++.++++ .+++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~ 159 (390) T protein:vir:97 80 DMFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEY 159 (390) T ss_pred hhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEE Confidence 000000 0000111112223344455677889999999999999999999999999999999754 5789 Q ss_pred EEecC-CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccC Q lcl|NC_019921. 122 LKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTG 200 (381) Q Consensus 122 p~~~~-~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G 200 (381) |+.++ .+.+.|++|+++.+ +++++|+++++.+|+++++++||++||+|+. ++++||.++++++++++++.+|++|+| T Consensus 160 ~~~~~~~~~a~~v~Eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~ell~ds~-~l~~~i~~~la~a~~~~~d~a~l~G~g 237 (390) T protein:vir:97 160 VQETGFVNNAAIVAEGALKP-ESSLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTG 237 (390) T ss_pred EEEecCCcceeeecCCcccc-ccccceeEEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 98765 46899999988765 6789999999999999999999999999985 799999999999999999999999999 Q ss_pred CC-cceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhh Q lcl|NC_019921. 201 KD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) Q Consensus 201 ~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~ 279 (381) ++ +|.||++......... .......++.+.+++..+ ...|..+++|+||+.++..+.. T Consensus 238 ~~~~p~Gi~~~~~~~~~~~--------------~~~~~~~~d~~~~~~~~~-------~~~~~~~~~~v~n~~~~~~L~~ 296 (390) T protein:vir:97 238 ANDGLLGLIPQATTYAAPT--------------TIAGATRVDQLRLAMLQA-------SLAEYPASGIVINPIDWAAIEL 296 (390) T ss_pred CCccccceeeccccccccc--------------cccccchHHHHHHHHHhh-------ccccCCCCEEEEcHHHHHHHHH Confidence 86 5999997543222111 111233445555554433 3456677889999999887764 Q ss_pred hhhccCCCCcccccc--------CCCceeEecCCCCCCcEEEEeecc-eEEEeecceEEEeehh-hhhhcCceEEEEEEE Q lcl|NC_019921. 280 QYTHLNANGVYVTAL--------PFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKE-TLALDDMDLYTAKQF 349 (381) Q Consensus 280 ~~~~~~~~G~~~~~l--------~~G~pVv~s~~~p~~~i~fgd~~~-y~i~~r~~i~i~~~~~-~~~~~d~~~~r~~~r 349 (381) +++.+|+|+|.. .+|+||+++++||+++++||||++ |.+++|+++.+..+++ .+|.+|+++||+.+| T Consensus 297 ---lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r 373 (390) T protein:vir:97 297 ---AKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEER 373 (390) T ss_pred ---hhcCCCceeecCccCCCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEEEe Confidence 478899998742 379999999999999999999997 8899999999999875 689999999999999 Q ss_pred EcCEEecCceEEEEEEE Q lcl|NC_019921. 350 AYGKAKDNKVAAVWKLD 366 (381) Q Consensus 350 ~dGk~~~~~Afvv~~~~ 366 (381) +||++++++|||++++- T Consensus 374 ~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 374 LALVVYRPEALITGSFA 390 (390) T ss_pred eccEEeccccEEEEEeC Confidence 99999999999987766 No 32 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=5.7e-58 Score=334.41 Aligned_cols=344 Identities=11% Similarity=0.057 Sum_probs=237.2 Q ss_pred CchhHHHHHHHHHHHHHHH---Hhhhh-h---H--------HHHHHHHHHHHHHHH---HHHHHHHHH------------ Q lcl|NC_019921. 1 MTINLSETFANAKNEFINA---VNNGE-P---Q--------ERQNELYGDMINQLF---EETKLQAKA------------ 50 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~---~k~~~-~---~--------~~~~~~~~~~~~~~~---~~~~~~~~~------------ 50 (381) ..-|+.+++++.+.++.+. +++.. . + .+..+.++.+..... +......+. T Consensus 16 ~~~el~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~ 95 (418) T protein:vir:10 16 GDSHPEQVLETVTKELKRIGDEVKSAGEKALAEAKRAGDLGVETKATVDELLIKQGELQARLLEAEQKLARGGGSAELET 95 (418) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccch Confidence 4434444433333333221 11110 0 0 011111111111110 000000000 Q ss_pred -----------HHHHHHHh-hccccccCHHHHHHHH---HHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec Q lcl|NC_019921. 51 -----------EAERVSSL-PKSAQSLSANQRSFFM---DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 51 -----------~~~~~~~~-~~~~~~lt~~e~~~~~---~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~ 115 (381) ...+.... .+... ....+++... .....+.++||++||+++++.|++.+++.++|+++|+++++ T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~ 174 (418) T protein:vir:10 96 PKTLGQLVTESEEMKGMDGSARKSV-RVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQT 174 (418) T ss_pred hhhhhHHhhhHHHHHHHHHHHhhhh-hhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeec Confidence 00000000 00000 0011111111 11233566789999999999999999999999999999998 Q ss_pred CCc-eEEEEecC-CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019921. 116 GLR-LKFLKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALET 193 (381) Q Consensus 116 ~g~-~~~p~~~~-~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~ 193 (381) ++. +++|+..+ .+.+.|++|+++.+ +++++|+++++.+|+++++++||++||+|+. ++++||+++|++++++++|. T Consensus 175 ~~~~~~~~~~~~~~~~a~~v~E~~~~~-~~~~~f~~v~~~~~k~~~~~~is~ell~ds~-~l~~~i~~~l~~a~~~~~d~ 252 (418) T protein:vir:10 175 SSSSIEYTVETGFTNNAAAVAEGAQKP-TSDLKFNLKNQPVRTIAHLFKASRQILDDAP-ALQSYIDGRARYGLQLTEEG 252 (418) T ss_pred cCCceeEEEEecCCCceeeeccCcccc-ccccceeeEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHH Confidence 754 79999766 57899999988765 6789999999999999999999999999985 89999999999999999999 Q ss_pred heeeccCCC-cceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchh Q lcl|NC_019921. 194 AFLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) Q Consensus 194 a~i~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~ 272 (381) +|++|+|++ +|.||++......... .......++.+.+++..+ ...+..+++|+||+. T Consensus 253 a~l~G~g~~~~p~Gi~~~~~~~~~~~--------------~~~~~~~~~~i~~~~~~~-------~~~~~~~~~~v~n~~ 311 (418) T protein:vir:10 253 QILKGDGTGANILGILPQASAFMPSI--------------TLANATPIDKIRLALLQA-------VLAEFPATGIVLNPI 311 (418) T ss_pred HHhccCCCCccccccccccccccccc--------------cccccccHHHHHHHHHhh-------ccccCCCCEEEEcHH Confidence 999999987 5999997643221111 011123345555554433 345667788999999 Q ss_pred hHHHHhhhhhccCCCCccccc--------cCCCceeEecCCCCCCcEEEEeecc-eEEEeecceEEEeehhh--hhhcCc Q lcl|NC_019921. 273 DAFEVQAQYTHLNANGVYVTA--------LPFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDM 341 (381) Q Consensus 273 t~~~~~~~~~~~~~~G~~~~~--------l~~G~pVv~s~~~p~~~i~fgd~~~-y~i~~r~~i~i~~~~~~--~~~~d~ 341 (381) ++..+.. +++.+|+|+|. ..+|+||+.+++||+++++||||++ |+++++++++|..+++. .|.+|+ T Consensus 312 ~~~~L~~---lkd~~G~~i~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f~~~~ 388 (418) T protein:vir:10 312 DWASIEL---TKDSQGRYIVGNPVNGTTPRLWNLPVVETQAMTANEFLVGAFSMAAQIFDRMEIEVLLSTENVDDFEKNM 388 (418) T ss_pred HHHHHHH---hhcCCCceeccccccCCCceecceeeEEcCCCCCCcEEEeeccceEEEEEecceEEEEecccchhhhcCc Confidence 9877654 47889999874 3479999999999999999999998 88999999999998876 489999 Q ss_pred eEEEEEEEEcCEEecCceEEEEEEEecCCc Q lcl|NC_019921. 342 DLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 342 ~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~ 371 (381) +.||+..|+||++++++||++++++-++.. T Consensus 389 ~~~r~~~~~d~~~~~~~a~~~~~~~~~~~g 418 (418) T protein:vir:10 389 VSIRAEERLALAVYRPESFVTGALVEQAGG 418 (418) T ss_pred eEEEEEEeeccEEecccceEEEEeccCCCC Confidence 999999999999999999999888866655 No 33 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=2.1e-58 Score=336.83 Aligned_cols=340 Identities=11% Similarity=0.054 Sum_probs=226.5 Q ss_pred Cc--hhHHHHHHHHHHHHHHH---Hhh----hhhHHHHHHH----HHHH---HHHHHHHHHHHHHHH------------- Q lcl|NC_019921. 1 MT--INLSETFANAKNEFINA---VNN----GEPQERQNEL----YGDM---INQLFEETKLQAKAE------------- 51 (381) Q Consensus 1 mt--~el~~~~~~~~~~~~~~---~k~----~~~~~~~~~~----~~~~---~~~~~~~~~~~~~~~------------- 51 (381) |. .|+++++.+.+.++.+. +++ .+...++... +..+ .+.+.++........ T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 65 24555555444433222 111 1100011111 1111 111111111000000 Q ss_pred -------------HHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc Q lcl|NC_019921. 52 -------------AERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR 118 (381) Q Consensus 52 -------------~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~ 118 (381) +.+................+..+++..+++++||++||++++++|++.+++++|||++|+++++++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~- 159 (387) T protein:vir:26 81 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG- 159 (387) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC- Confidence 000000000000000111122345667788999999999999999999999999999999999876 Q ss_pred eEEEE-ecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHh-hhee Q lcl|NC_019921. 119 LKFLK-SETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE-TAFL 196 (381) Q Consensus 119 ~~~p~-~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~-~a~i 196 (381) ..+|+ ..+.+++.|++|++..+ +++|+|+++++.+|+++++++||+|||+||.+|+++||.++|+++++++++ .+|. T Consensus 160 ~~~p~~~~~~~~a~~v~Eg~~~~-~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~ 238 (387) T protein:vir:26 160 LEIPRVSYTLDDDDFITDVETAK-ELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA 238 (387) T ss_pred ceeeeeeccCCcccccccccccc-ccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 45665 44567899999987665 578999999999999999999999999999999999999999999999975 5678 Q ss_pred eccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHH Q lcl|NC_019921. 197 KGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFE 276 (381) Q Consensus 197 ~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~ 276 (381) +|+|+++|.|++...... ......+++.+.++++.+ +..|+.|++|+||+.+++. T Consensus 239 ~g~g~g~~~g~~~~~~~~------------------~~~~~~~~d~i~~~~~~l-------~~~y~~na~~imn~~t~~~ 293 (387) T protein:vir:26 239 VSPKSGLEHMSFYNGSVK------------------EVEGADMYDAIINALADL-------HEDYRDNATIYMRYADYVK 293 (387) T ss_pred cCCCccccceeeeccccc------------------cccccchHHHHHHHHhcc-------ChhhhcCCEEEEechHHHH Confidence 999999999998542111 011223466677666543 4578899999999999887 Q ss_pred HhhhhhccCCCCccccc---cCCCceeEecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCE Q lcl|NC_019921. 277 VQAQYTHLNANGVYVTA---LPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGK 353 (381) Q Consensus 277 ~~~~~~~~~~~G~~~~~---l~~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk 353 (381) ++.. ..+.+|.+.++ ..+|+||+++++++ +++||||++|++. +.++.+.++++. ..|+++||++.|+||+ T Consensus 294 ~~~~--~~~~~~~~~~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~-~~~~~~~~~~~~--~~~~~~~~~~~r~Dg~ 366 (387) T protein:vir:26 294 IISV--LSNGTTNFFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDTDKDV--KKGEYLFVLTAWYDQQ 366 (387) T ss_pred HHHH--HhcCCCcccccCCccccccceEEecCCC--ceeeechhhhhhh-hhhhhheecccc--cCCceEEEEEEEeCcE Confidence 7653 34444444333 33799999999886 6899999996654 456777777664 4699999999999999 Q ss_pred EecCceEEEEEEEecCCcccc Q lcl|NC_019921. 354 AKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 354 ~~~~~Afvv~~~~~~~~~~~~ 374 (381) +++++||++++++-+.....+ T Consensus 367 v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:26 367 RTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred eechhheEEEEeecCCCCCCC Confidence 999999999776553332222 No 34 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=2.1e-58 Score=336.83 Aligned_cols=340 Identities=11% Similarity=0.054 Sum_probs=226.5 Q ss_pred Cc--hhHHHHHHHHHHHHHHH---Hhh----hhhHHHHHHH----HHHH---HHHHHHHHHHHHHHH------------- Q lcl|NC_019921. 1 MT--INLSETFANAKNEFINA---VNN----GEPQERQNEL----YGDM---INQLFEETKLQAKAE------------- 51 (381) Q Consensus 1 mt--~el~~~~~~~~~~~~~~---~k~----~~~~~~~~~~----~~~~---~~~~~~~~~~~~~~~------------- 51 (381) |. .|+++++.+.+.++.+. +++ .+...++... +..+ .+.+.++........ T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 65 24555555444433222 111 1100011111 1111 111111111000000 Q ss_pred -------------HHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc Q lcl|NC_019921. 52 -------------AERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR 118 (381) Q Consensus 52 -------------~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~ 118 (381) +.+................+..+++..+++++||++||++++++|++.+++++|||++|+++++++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~- 159 (387) T protein:vir:94 81 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG- 159 (387) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC- Confidence 000000000000000111122345667788999999999999999999999999999999999876 Q ss_pred eEEEE-ecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHh-hhee Q lcl|NC_019921. 119 LKFLK-SETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE-TAFL 196 (381) Q Consensus 119 ~~~p~-~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~-~a~i 196 (381) ..+|+ ..+.+++.|++|++..+ +++|+|+++++.+|+++++++||+|||+||.+|+++||.++|+++++++++ .+|. T Consensus 160 ~~~p~~~~~~~~a~~v~Eg~~~~-~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~ 238 (387) T protein:vir:94 160 LEIPRVSYTLDDDDFITDVETAK-ELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA 238 (387) T ss_pred ceeeeeeccCCcccccccccccc-ccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 45665 44567899999987665 578999999999999999999999999999999999999999999999975 5678 Q ss_pred eccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHH Q lcl|NC_019921. 197 KGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFE 276 (381) Q Consensus 197 ~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~ 276 (381) +|+|+++|.|++...... ......+++.+.++++.+ +..|+.|++|+||+.+++. T Consensus 239 ~g~g~g~~~g~~~~~~~~------------------~~~~~~~~d~i~~~~~~l-------~~~y~~na~~imn~~t~~~ 293 (387) T protein:vir:94 239 VSPKSGLEHMSFYNGSVK------------------EVEGADMYDAIINALADL-------HEDYRDNATIYMRYADYVK 293 (387) T ss_pred cCCCccccceeeeccccc------------------cccccchHHHHHHHHhcc-------ChhhhcCCEEEEechHHHH Confidence 999999999998542111 011223466677666543 4578899999999999887 Q ss_pred HhhhhhccCCCCccccc---cCCCceeEecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCE Q lcl|NC_019921. 277 VQAQYTHLNANGVYVTA---LPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGK 353 (381) Q Consensus 277 ~~~~~~~~~~~G~~~~~---l~~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk 353 (381) ++.. ..+.+|.+.++ ..+|+||+++++++ +++||||++|++. +.++.+.++++. ..|+++||++.|+||+ T Consensus 294 ~~~~--~~~~~~~~~~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~-~~~~~~~~~~~~--~~~~~~~~~~~r~Dg~ 366 (387) T protein:vir:94 294 IISV--LSNGTTNFFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDTDKDV--KKGEYLFVLTAWYDQQ 366 (387) T ss_pred HHHH--HhcCCCcccccCCccccccceEEecCCC--ceeeechhhhhhh-hhhhhheecccc--cCCceEEEEEEEeCcE Confidence 7653 34444444333 33799999999886 6899999996654 456777777664 4699999999999999 Q ss_pred EecCceEEEEEEEecCCcccc Q lcl|NC_019921. 354 AKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 354 ~~~~~Afvv~~~~~~~~~~~~ 374 (381) +++++||++++++-+.....+ T Consensus 367 v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:94 367 RTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred eechhheEEEEeecCCCCCCC Confidence 999999999776553332222 No 35 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=2.1e-58 Score=336.83 Aligned_cols=340 Identities=11% Similarity=0.054 Sum_probs=226.5 Q ss_pred Cc--hhHHHHHHHHHHHHHHH---Hhh----hhhHHHHHHH----HHHH---HHHHHHHHHHHHHHH------------- Q lcl|NC_019921. 1 MT--INLSETFANAKNEFINA---VNN----GEPQERQNEL----YGDM---INQLFEETKLQAKAE------------- 51 (381) Q Consensus 1 mt--~el~~~~~~~~~~~~~~---~k~----~~~~~~~~~~----~~~~---~~~~~~~~~~~~~~~------------- 51 (381) |. .|+++++.+.+.++.+. +++ .+...++... +..+ .+.+.++........ T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 65 24555555444433222 111 1100011111 1111 111111111000000 Q ss_pred -------------HHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc Q lcl|NC_019921. 52 -------------AERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR 118 (381) Q Consensus 52 -------------~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~ 118 (381) +.+................+..+++..+++++||++||++++++|++.+++++|||++|+++++++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~- 159 (387) T protein:vir:96 81 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG- 159 (387) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC- Confidence 000000000000000111122345667788999999999999999999999999999999999876 Q ss_pred eEEEE-ecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHh-hhee Q lcl|NC_019921. 119 LKFLK-SETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE-TAFL 196 (381) Q Consensus 119 ~~~p~-~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~-~a~i 196 (381) ..+|+ ..+.+++.|++|++..+ +++|+|+++++.+|+++++++||+|||+||.+|+++||.++|+++++++++ .+|. T Consensus 160 ~~~p~~~~~~~~a~~v~Eg~~~~-~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~ 238 (387) T protein:vir:96 160 LEIPRVSYTLDDDDFITDVETAK-ELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA 238 (387) T ss_pred ceeeeeeccCCcccccccccccc-ccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 45665 44567899999987665 578999999999999999999999999999999999999999999999975 5678 Q ss_pred eccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHH Q lcl|NC_019921. 197 KGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFE 276 (381) Q Consensus 197 ~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~ 276 (381) +|+|+++|.|++...... ......+++.+.++++.+ +..|+.|++|+||+.+++. T Consensus 239 ~g~g~g~~~g~~~~~~~~------------------~~~~~~~~d~i~~~~~~l-------~~~y~~na~~imn~~t~~~ 293 (387) T protein:vir:96 239 VSPKSGLEHMSFYNGSVK------------------EVEGADMYDAIINALADL-------HEDYRDNATIYMRYADYVK 293 (387) T ss_pred cCCCccccceeeeccccc------------------cccccchHHHHHHHHhcc-------ChhhhcCCEEEEechHHHH Confidence 999999999998542111 011223466677666543 4578899999999999887 Q ss_pred HhhhhhccCCCCccccc---cCCCceeEecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCE Q lcl|NC_019921. 277 VQAQYTHLNANGVYVTA---LPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGK 353 (381) Q Consensus 277 ~~~~~~~~~~~G~~~~~---l~~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk 353 (381) ++.. ..+.+|.+.++ ..+|+||+++++++ +++||||++|++. +.++.+.++++. ..|+++||++.|+||+ T Consensus 294 ~~~~--~~~~~~~~~~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~-~~~~~~~~~~~~--~~~~~~~~~~~r~Dg~ 366 (387) T protein:vir:96 294 IISV--LSNGTTNFFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDTDKDV--KKGEYLFVLTAWYDQQ 366 (387) T ss_pred HHHH--HhcCCCcccccCCccccccceEEecCCC--ceeeechhhhhhh-hhhhhheecccc--cCCceEEEEEEEeCcE Confidence 7653 34444444333 33799999999886 6899999996654 456777777664 4699999999999999 Q ss_pred EecCceEEEEEEEecCCcccc Q lcl|NC_019921. 354 AKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 354 ~~~~~Afvv~~~~~~~~~~~~ 374 (381) +++++||++++++-+.....+ T Consensus 367 v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:96 367 RTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred eechhheEEEEeecCCCCCCC Confidence 999999999776553332222 No 36 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=1.5e-57 Score=332.05 Aligned_cols=343 Identities=12% Similarity=0.083 Sum_probs=237.8 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhhhH----HHHHHHHHHHHHHHHHH---HHHHH-----HH-HHHHHHH--hhcccc-- Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNGEPQ----ERQNELYGDMINQLFEE---TKLQA-----KA-EAERVSS--LPKSAQ-- 63 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~~~~----~~~~~~~~~~~~~~~~~---~~~~~-----~~-~~~~~~~--~~~~~~-- 63 (381) |+ ++.+++++.+.++.+..++.+.. +++.+.+....+.+.++ ...+. +. +...... ...... T Consensus 1 m~-~~~k~l~el~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (395) T protein:vir:43 1 MS-DFEKQIGELNASLKQVGDQIKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGEE 79 (395) T ss_pred Ch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Confidence 77 44455555554443322211100 00111111111111000 00000 00 0000000 000000 Q ss_pred -cc-------CHHH-HHHH------------HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc-eEE Q lcl|NC_019921. 64 -SL-------SANQ-RSFF------------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKF 121 (381) Q Consensus 64 -~l-------t~~e-~~~~------------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~-~~~ 121 (381) .. .... +.+. ......++..+|++||++++..|++.+++.++|+++|++.++++. +++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~ 159 (395) T protein:vir:43 80 APKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEY 159 (395) T ss_pred hhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEE Confidence 00 0000 0111 112234567789999999999999999999999999999998764 899 Q ss_pred EEecC-CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccC Q lcl|NC_019921. 122 LKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTG 200 (381) Q Consensus 122 p~~~~-~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G 200 (381) |+.++ .+.+.|++|++..+ +++++|+++++.+|+++++++||++||+|+. ++++||.++|+++++++++.+|++|+| T Consensus 160 ~~~~~~~~~a~~v~E~~~~~-~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~v~~~la~a~~~~~d~~~l~G~g 237 (395) T protein:vir:43 160 VRETGFVNNAAPVSEGTQKP-YSDLTFELENAPVRTIAHLFKASRQILDDAS-ALQSYIDARARYGLMLVEECQLLYGNG 237 (395) T ss_pred EEEecCCCceeeecCCcccc-ccccceeEEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 99765 47899999987655 6789999999999999999999999999975 699999999999999999999999999 Q ss_pred CCcc-eEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhh Q lcl|NC_019921. 201 KDQP-IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) Q Consensus 201 ~~~P-~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~ 279 (381) +++| .||++.......... ........++.+.++...+ ...|..+++|+||+.++..+.. T Consensus 238 ~~~~~~Gi~~~~~~~~~~~~------------~~~~~~~~~~~i~~~~~~~-------~~~~~~~~~~vmn~~~~~~l~~ 298 (395) T protein:vir:43 238 TGANLHGIIPQAQAYAPPSG------------VVVTAEQRIDRIRLAILQA-------QLAEFPASGIVLNPIDWALIEL 298 (395) T ss_pred CCCccccccccccccccccc------------cccccchhHHHHHHHHHhh-------ccccCCCcEEEEcHHHHHHHHH Confidence 9765 899976543222111 1112233455555555443 3456778899999999877754 Q ss_pred hhhccCCCCcccccc--------CCCceeEecCCCCCCcEEEEeecc-eEEEeecceEEEeehhh--hhhcCceEEEEEE Q lcl|NC_019921. 280 QYTHLNANGVYVTAL--------PFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMDLYTAKQ 348 (381) Q Consensus 280 ~~~~~~~~G~~~~~l--------~~G~pVv~s~~~p~~~i~fgd~~~-y~i~~r~~i~i~~~~~~--~~~~d~~~~r~~~ 348 (381) +++.+|+|+|.. .+|+||+.+++||+++++||||++ |.+++|++++|+.+++. .|.+|+++||+.. T Consensus 299 ---lkd~~G~~i~~~~~~~~~~~l~G~pVv~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~ 375 (395) T protein:vir:43 299 ---NKDAENRYIIGSPQNGTTPTLWRLPVVETQAITQDEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEE 375 (395) T ss_pred ---hhccCCceeccccccCCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeccccchhhcCcEEEEEEE Confidence 478899998742 379999999999999999999998 88999999999988865 5889999999999 Q ss_pred EEcCEEecCceEEEEEEEec Q lcl|NC_019921. 349 FAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 349 r~dGk~~~~~Afvv~~~~~~ 368 (381) |+||++++++|||+++++-+ T Consensus 376 r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 376 RLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred eeccEEecccceEEEEeccC Confidence 99999999999999877665 No 37 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=2.3e-57 Score=331.14 Aligned_cols=343 Identities=10% Similarity=0.019 Sum_probs=236.9 Q ss_pred Cchh-HHHHHHHHHHHHHHHHhhhhh-------HHHHHHHHHHHHHHHHHHH---HHHHHH-HHHHHHHhh-cccc---- Q lcl|NC_019921. 1 MTIN-LSETFANAKNEFINAVNNGEP-------QERQNELYGDMINQLFEET---KLQAKA-EAERVSSLP-KSAQ---- 63 (381) Q Consensus 1 mt~e-l~~~~~~~~~~~~~~~k~~~~-------~~~~~~~~~~~~~~~~~~~---~~~~~~-~~~~~~~~~-~~~~---- 63 (381) |+|+ |++++.+..+++.+...+.+. ..++.+.+....+.+..+. ..+... +........ .... T Consensus 5 m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (408) T protein:vir:10 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNK 84 (408) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc Confidence 6664 555555555544333221110 0111111111111111100 000000 000000000 0000 Q ss_pred ---ccCHHHHHH----------------HHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEE Q lcl|NC_019921. 64 ---SLSANQRSF----------------FMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKF 121 (381) Q Consensus 64 ---~lt~~e~~~----------------~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g---~~~~ 121 (381) .......+. ..++..+++++||++||+++++.|++.+++.+||+++|+++++++ .+.+ T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 164 (408) T protein:vir:10 85 SENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVY 164 (408) T ss_pred chhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEE Confidence 001111111 123456778899999999999999999999999999999999753 3455 Q ss_pred EEec-CCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccC Q lcl|NC_019921. 122 LKSE-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTG 200 (381) Q Consensus 122 p~~~-~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G 200 (381) +... ..+.+.|++|+++.+..+.|+|++|++.+|+++++++||++||+||.+|+++||.++|+++++++++.+|++|+| T Consensus 165 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g 244 (408) T protein:vir:10 165 EKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMK 244 (408) T ss_pred eeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 5543 446789999998887667799999999999999999999999999999999999999999999999999999999 Q ss_pred CCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhh Q lcl|NC_019921. 201 KDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ 280 (381) Q Consensus 201 ~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~ 280 (381) +++|.+-. ..++.+.+++.. .....|+++++|+||+.++..+.. T Consensus 245 ~~~~~~~~-----------------------------~~~~~l~~~~~~------~~~~~~~~~a~~v~n~~~~~~l~~- 288 (408) T protein:vir:10 245 AAPKKPTI-----------------------------AKFDDVITMINT------AVDPAIIATSSLLTNQSGLNKLAL- 288 (408) T ss_pred cccccccc-----------------------------ccHHHHHHHHHH------hhhhhhccCCEEEEcHHHHHHHHH- Confidence 88764311 112334333321 123468889999999999887765 Q ss_pred hhccCCCCccccc---------cCCCceeEecC--CCCCC-----cEEEEeecc-eEEEeecceEEEeehhhh--hhcCc Q lcl|NC_019921. 281 YTHLNANGVYVTA---------LPFNLNVIEST--VQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKETL--ALDDM 341 (381) Q Consensus 281 ~~~~~~~G~~~~~---------l~~G~pVv~s~--~~p~~-----~i~fgd~~~-y~i~~r~~i~i~~~~~~~--~~~d~ 341 (381) .++.+|+|+|+ ..+|+||++++ .+|+. .++||||++ |.+++|++++|..+++.+ |.+|+ T Consensus 289 --lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~ 366 (408) T protein:vir:10 289 --VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDT 366 (408) T ss_pred --hhccCCceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEEcccccchhhcCc Confidence 47889999985 34799998854 46652 289999998 789999999999998755 88999 Q ss_pred eEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 342 DLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 342 ~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) +.||+..|+||++++++||++++++-+++....++||-+- T Consensus 367 ~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~~ 406 (408) T protein:vir:10 367 TKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTST 406 (408) T ss_pred eEEEEEEeeccEEeccccEEEEEeeccccCCCCCCCCCcc Confidence 9999999999999999999998877776666666666554 No 38 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=7.7e-57 Score=328.23 Aligned_cols=344 Identities=13% Similarity=0.095 Sum_probs=240.6 Q ss_pred CchhHH---HHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------------- Q lcl|NC_019921. 1 MTINLS---ETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAER----------------------- 54 (381) Q Consensus 1 mt~el~---~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------------- 54 (381) |+-+++ ++++++.+++...+.+.+...++.+.....++.+.++.....+.+..+ T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (404) T protein:vir:10 1 MSKELRELLNQLDSKNKELNSLLNKDGVTAEELNKTSNEIDILQAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIYNG 80 (404) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHH Confidence 996543 333333333333333222211222222222222222221111100000 Q ss_pred -HHHhhc----------cccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC---CceE Q lcl|NC_019921. 55 -VSSLPK----------SAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG---LRLK 120 (381) Q Consensus 55 -~~~~~~----------~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~---g~~~ 120 (381) ...... ........|+ .++..+++++||++||++++++|++.+++.+||+++|++.+++ |.+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~e~---~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~ 157 (404) T protein:vir:10 81 ALFVRAIADNLLKQKNQRGLNLSEKEI---NAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRT 157 (404) T ss_pred HHHHHHHHHHHHHHHHhhhhcchhhHH---hhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceE Confidence 000000 0001112222 2455677889999999999999999999999999999998874 5678 Q ss_pred EEEecCCcceEEeecccccccc-cCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeecc Q lcl|NC_019921. 121 FLKSETSGVAVWGKIYGEIKGQ-LDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGT 199 (381) Q Consensus 121 ~p~~~~~~~a~wv~e~~~~~~~-~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~ 199 (381) +|+..+.+.+.|++|+++.+.+ .+++|+++++.+++++++++||++||+||.+++++||.++++++++++++.+|++|+ T Consensus 158 ~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~ 237 (404) T protein:vir:10 158 YEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGA 237 (404) T ss_pred EEEecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 8998899999999998877644 479999999999999999999999999999999999999999999999999999999 Q ss_pred CCCc-ceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHh Q lcl|NC_019921. 200 GKDQ-PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQ 278 (381) Q Consensus 200 G~~~-P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~ 278 (381) |+++ |.||++..+..+.. ......++.+.+++.. ..+..|+.+++|+|||.++..++ T Consensus 238 g~~~~~~gi~~~~~~~~~~----------------~~~~~~~~~~~~~~~~------~l~~~~~~~~~~v~n~~~~~~L~ 295 (404) T protein:vir:10 238 GGDEHATGIMTANKFKKIT----------------LPKSPALKDFKKCKNV------ELLNVFKATSSWIVNQDGFNYLD 295 (404) T ss_pred CCCCcccceeeccccceee----------------ccccccHHHHHHHHHh------hhhccccCCCEEEEcHHHHHHHH Confidence 9864 67887543221110 1112234444443321 12446788999999999988776 Q ss_pred hhhhccCCCCccccc---------cCCCceeEe-cCCCCCC-----cEEEEeecc-eEEEeecceEEEeehhh--hhhcC Q lcl|NC_019921. 279 AQYTHLNANGVYVTA---------LPFNLNVIE-STVQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKET--LALDD 340 (381) Q Consensus 279 ~~~~~~~~~G~~~~~---------l~~G~pVv~-s~~~p~~-----~i~fgd~~~-y~i~~r~~i~i~~~~~~--~~~~d 340 (381) + +++.+|+|+|. ..+|+||++ ++.+|.+ .++||||++ |.+++|++++|.++++. .|.+| T Consensus 296 ~---lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~ 372 (404) T protein:vir:10 296 S---LEDKTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETN 372 (404) T ss_pred H---hhccCCceeeccCcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEEeccccchhhcC Confidence 5 47888999874 247999975 4556542 389999997 88999999999998864 48899 Q ss_pred ceEEEEEEEEcCEEecCceEEEEEEEecCCcc Q lcl|NC_019921. 341 MDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 341 ~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~ 372 (381) ++.||+.+|+|+++.+++||++++++.++... T Consensus 373 ~~~~~~~~r~d~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 373 TTKARIIMRIDGNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred ceEEEEEEeeccEEecccceEEEEeecccCCC Confidence 99999999999999999999999988855554 No 39 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=1.1e-56 Score=327.35 Aligned_cols=348 Identities=14% Similarity=0.046 Sum_probs=228.8 Q ss_pred CchhHHH-HHHHHHHHHHH-HHh---hh------------hhHHHHHHHHHHHHHH-----------HHHHHHH---HHH Q lcl|NC_019921. 1 MTINLSE-TFANAKNEFIN-AVN---NG------------EPQERQNELYGDMINQ-----------LFEETKL---QAK 49 (381) Q Consensus 1 mt~el~~-~~~~~~~~~~~-~~k---~~------------~~~~~~~~~~~~~~~~-----------~~~~~~~---~~~ 49 (381) |++..+. +..+.+++..+ .++ +. +...++.+.+....+. ..+.... +.. T Consensus 24 ~~~~~k~~e~~~~~ke~~~~~l~~~~e~~~k~~~E~~~~le~~~ee~k~l~ee~~~~~~~~a~~~e~~~~~~~~~~~~~~ 103 (458) T protein:vir:10 24 LTAAQKAQEAERMRKEQEEKELARMNDLVSKAVGEDRKRLEEALELVKSLDEKSKKSNELFAQTVEKQQETIVGLQDEIK 103 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222211 11111111100 000 00 0000000000000000 0000000 000 Q ss_pred -----HHHHHHHHhh-----------------------ccc-cccCHHHHHH--HHHHh-hccCCCCceeccHHHHHHHH Q lcl|NC_019921. 50 -----AEAERVSSLP-----------------------KSA-QSLSANQRSF--FMDIN-KNVNYKEEKLLPEETIDRIF 97 (381) Q Consensus 50 -----~~~~~~~~~~-----------------------~~~-~~lt~~e~~~--~~~~~-~~~~~~gg~lvP~~~~~~I~ 97 (381) .+.+...... ... .....+++.. +.... ..+..+||++||+++++.|+ T Consensus 104 ~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii 183 (458) T protein:vir:10 104 SLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIFSQRII 183 (458) T ss_pred HHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccCccccceehhhHhHHHH Confidence 0000000000 000 0000111111 11111 23456799999999999999 Q ss_pred HHHHhhhhhhhhceeEecCCc-eEEEEecCCcceEEeecccccccc-----cCcceeeEeecceeEEEeeeccHHhhhcC Q lcl|NC_019921. 98 EDLTTNHPLLADLGIKNAGLR-LKFLKSETSGVAVWGKIYGEIKGQ-----LDAAFSEETAIQNKLTAFVVLPKDLNDFG 171 (381) Q Consensus 98 ~~l~~~~~l~~~~~v~~~~g~-~~~p~~~~~~~a~wv~e~~~~~~~-----~~~~f~~v~l~~~kl~~~~~iS~ell~ds 171 (381) +.+++.++|+++|+++|++++ ..+|+..+.+.+.|++|++..+.+ ++++|+++++.+||++++++||++||+|| T Consensus 184 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds 263 (458) T protein:vir:10 184 RDLQKELVVGALFEELPMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEEDA 263 (458) T ss_pred HHHHhhhhHHhhcceeecCCcceEEEEecCCcceeecccccccccccccccccccceeeEeeeeeEEeeehhhHHHHhcc Confidence 999999999999999998765 678999889999999998766543 46789999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhh Q lcl|NC_019921. 172 PAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHS 251 (381) Q Consensus 172 ~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~ 251 (381) .+++++||.++|+++|++++|.+||+|+|+++|+||++.......... ...........+++.+.++...+ T Consensus 264 ~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~--------~~~~~~~~~~~~~~~i~~~~~~l- 334 (458) T protein:vir:10 264 IFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVV--------TEAKADGSVLVTAKTISKLRRKL- 334 (458) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeeccccccccee--------ecccccccccccHHHHHHHHHhh- Confidence 999999999999999999999999999999999999987543322111 00111112233456666665544 Q ss_pred hccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc-------------cCCCceeEecCCCCCC----cEEEE Q lcl|NC_019921. 252 TNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA-------------LPFNLNVIESTVQEAG----KVLTY 314 (381) Q Consensus 252 ~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~-------------l~~G~pVv~s~~~p~~----~i~fg 314 (381) +..|+++++|+||+.++..+.. +++.+|+|+|. ..+|+||+++++||++ .++|| T Consensus 335 ------~~~~~~~~~~v~~~~~~~~l~~---lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~ 405 (458) T protein:vir:10 335 ------GRHGLKLSKLVLIVSMDAYYDL---LEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANSAEFAVI 405 (458) T ss_pred ------hhhhcCCCEEEEcHHHHHHHHh---hcccCCceeeccccccccccCcCceecceeeEEccccccccCCcceEEE Confidence 3467889999999999877654 57888988763 2479999999999974 58999 Q ss_pred eecc-eEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCC Q lcl|NC_019921. 315 VKGL-YDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 315 d~~~-y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~ 370 (381) ||+. |.+++|++++|.+++ ++.+++++||+..|+|+.+.+|+|||+. +++++ T Consensus 406 ~f~~~~~~~~~~~~~v~~d~--~~~~~~~~~~~~~r~~~~v~~~~a~v~~--~~aa~ 458 (458) T protein:vir:10 406 VYKDNFVMPRQRAVTVERER--QAGKQRDAYYVTQRVNLQRYFANGVVSG--TYAAS 458 (458) T ss_pred EecccEEEEEeeceEEEeec--ccCCCceEEEEEEEecceEecccceEEE--eeccC Confidence 9975 899999999998754 5779999999999999999999999874 44444 No 40 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=1.6e-57 Score=332.03 Aligned_cols=339 Identities=12% Similarity=-0.001 Sum_probs=234.6 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhhhH-----------HHHHHHHHHHHHHHHHHHHHHH--HHHHHH-HHH--hhcccc- Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNGEPQ-----------ERQNELYGDMINQLFEETKLQA--KAEAER-VSS--LPKSAQ- 63 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~--~~~~~~-~~~--~~~~~~- 63 (381) |. ..+++.+.++++.+.+++...+ .++.+.....++.+..+..... ..+... ... .....+ T Consensus 1 Mk--~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 78 (397) T protein:vir:49 1 MK--TSNELHDLWVAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKKP 78 (397) T ss_pred Cc--hHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Confidence 54 3333333333333332221111 0111111111111111111000 000000 000 000000 Q ss_pred ------ccCHHHHHHH------------HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC---CceEEE Q lcl|NC_019921. 64 ------SLSANQRSFF------------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG---LRLKFL 122 (381) Q Consensus 64 ------~lt~~e~~~~------------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~---g~~~~p 122 (381) ....++++.| ..+..+++++||++||+++++.|++.+++.++|+++|++++++ +...+| T Consensus 79 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 158 (397) T protein:vir:49 79 LTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYE 158 (397) T ss_pred cccchhHHHHHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEE Confidence 1122233333 2334567788999999999999999999999999999998874 446667 Q ss_pred Eec-CCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCC Q lcl|NC_019921. 123 KSE-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGK 201 (381) Q Consensus 123 ~~~-~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~ 201 (381) +.. ..+.+.|++|+++.++.++++|+++++.+|+++++++||++||+||.+|+++||.++|+++++++++.+|++|+|+ T Consensus 159 ~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~ 238 (397) T protein:vir:49 159 KWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAA 238 (397) T ss_pred eeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 654 4577999999988876678999999999999999999999999999999999999999999999999999999998 Q ss_pred CcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh Q lcl|NC_019921. 202 DQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY 281 (381) Q Consensus 202 ~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~ 281 (381) ++|.+... .++.+.++...+ ...|+.+++|+||+.++..+.. T Consensus 239 ~~~~~~~~-----------------------------~~d~i~~~~~~l-------~~~~~~~a~~vmn~~~~~~l~~-- 280 (397) T protein:vir:49 239 LPTKPTLT-----------------------------KWDDIIDLEAKV-------DPAIKQTSFFLTNTSGFTALKK-- 280 (397) T ss_pred cccccccc-----------------------------cHHHHHHHHHhh-------hhhhcCCCEEEEcHHHHHHHHH-- Confidence 87754321 234455554443 3457888999999999887765 Q ss_pred hccCCCCccccc---------cCCCceeEecC--CCCC-----CcEEEEeecc-eEEEeecceEEEeehhh--hhhcCce Q lcl|NC_019921. 282 THLNANGVYVTA---------LPFNLNVIEST--VQEA-----GKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMD 342 (381) Q Consensus 282 ~~~~~~G~~~~~---------l~~G~pVv~s~--~~p~-----~~i~fgd~~~-y~i~~r~~i~i~~~~~~--~~~~d~~ 342 (381) +++.+|+|+|. ..+|+||++++ .+|. ..++||||++ |.+++|++++++++++. +|.+|++ T Consensus 281 -lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 359 (397) T protein:vir:49 281 -VKNALGDYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGGGAFETDTT 359 (397) T ss_pred -hhcCCCceeeccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEeccccchhhcCce Confidence 47889999875 34799998744 3554 3489999997 78999999999999865 6999999 Q ss_pred EEEEEEEEcCEEecCceEEEEEEEecCCccccccCccc Q lcl|NC_019921. 343 LYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEET 380 (381) Q Consensus 343 ~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~ 380 (381) .||+..|+||++++++||++++++-++..+.+.++--- T Consensus 360 ~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:49 360 KVRVIDRFDVVATDTEAFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred eEEEEeeeCcEEecccceEEEEeecccCCCCCcccccC Confidence 99999999999999999999887776665554443322 No 41 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=1.4e-56 Score=326.74 Aligned_cols=343 Identities=10% Similarity=0.033 Sum_probs=236.3 Q ss_pred Cchh-HHHHHHHHHHHHHHHHhhhhh-------HHHHHHHHHHHHHHHHHHH---HHHHHH-HHHHHHHh-hcccc---- Q lcl|NC_019921. 1 MTIN-LSETFANAKNEFINAVNNGEP-------QERQNELYGDMINQLFEET---KLQAKA-EAERVSSL-PKSAQ---- 63 (381) Q Consensus 1 mt~e-l~~~~~~~~~~~~~~~k~~~~-------~~~~~~~~~~~~~~~~~~~---~~~~~~-~~~~~~~~-~~~~~---- 63 (381) |+|+ |++++.+..+++.+..++.+. ..++.+......+.+.++. .++... +....... ..... T Consensus 5 m~i~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (408) T protein:vir:74 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNK 84 (408) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 6664 444444444433332221110 0111111111111111110 000000 00000000 00000 Q ss_pred ---ccCHHHHHHH----------------HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEE Q lcl|NC_019921. 64 ---SLSANQRSFF----------------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKF 121 (381) Q Consensus 64 ---~lt~~e~~~~----------------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g---~~~~ 121 (381) ....++++.| .++..+++++||++||+++++.|++.+++.++|+++|+++++++ .+.+ T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 164 (408) T protein:vir:74 85 SENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVY 164 (408) T ss_pred hhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEE Confidence 0011111111 12345677889999999999999999999999999999998753 4556 Q ss_pred EEecC-CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccC Q lcl|NC_019921. 122 LKSET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTG 200 (381) Q Consensus 122 p~~~~-~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G 200 (381) ++..+ .+.+.|++|+++.++.++++|+++++.+|+++++++||++||+||.+|+++||.++|++++++++|.+|++|+| T Consensus 165 ~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G 244 (408) T protein:vir:74 165 EKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMG 244 (408) T ss_pred EeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 66544 45678999988887667899999999999999999999999999999999999999999999999999999999 Q ss_pred CCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhh Q lcl|NC_019921. 201 KDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ 280 (381) Q Consensus 201 ~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~ 280 (381) +++|.|... .++.+.++++. ..+..|+++++|+||+.++..+.. T Consensus 245 ~~~~~~~~~-----------------------------~~~~i~~~~~~------~l~~~~~~~a~~v~n~~~~~~l~~- 288 (408) T protein:vir:74 245 TVPKKPTIA-----------------------------NFDDVITMINT------SVDPAIIATSSLLTNQSGLNKLAL- 288 (408) T ss_pred ccccccccc-----------------------------cHHHHHHHHHH------hhhhhhcCCCEEEEcHHHHHHHHH- Confidence 998865321 12233333221 124468889999999999877764 Q ss_pred hhccCCCCccccc---------cCCCceeEecC--CCCC-----CcEEEEeecc-eEEEeecceEEEeehhh--hhhcCc Q lcl|NC_019921. 281 YTHLNANGVYVTA---------LPFNLNVIEST--VQEA-----GKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDM 341 (381) Q Consensus 281 ~~~~~~~G~~~~~---------l~~G~pVv~s~--~~p~-----~~i~fgd~~~-y~i~~r~~i~i~~~~~~--~~~~d~ 341 (381) +++.+|+|+|. ..+|+||+.++ .||. ..++||||++ |.+++|++++++++++. .|.+|+ T Consensus 289 --lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~ 366 (408) T protein:vir:74 289 --VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDT 366 (408) T ss_pred --hhcCCCceEeccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccccchhhcce Confidence 47889999885 24799998765 4663 3489999997 88999999999999875 589999 Q ss_pred eEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 342 DLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 342 ~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) +.||+.+|+||++++++||++++++-....+..+.++-.- T Consensus 367 ~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 406 (408) T protein:vir:74 367 TKIRVIDRFDVKATDSEALVAGSFTAIADQVGNFKTTTST 406 (408) T ss_pred eeEEEEEeeCcEEecccceEEEEeecccCCCCCCCCCccc Confidence 9999999999999999999998887777776666666555 No 42 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=2.2e-56 Score=325.75 Aligned_cols=340 Identities=13% Similarity=0.101 Sum_probs=235.0 Q ss_pred Cch--hHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHH-------HHHH-HHH-HHHHHHhhccc------- Q lcl|NC_019921. 1 MTI--NLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEET-------KLQA-KAE-AERVSSLPKSA------- 62 (381) Q Consensus 1 mt~--el~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~-------~~~~-~~~-~~~~~~~~~~~------- 62 (381) |+= +|++++++..+++.+...+. +++.+......+.+.++. .+.. +.+ ........... T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~---~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (385) T protein:vir:18 1 MSELALIQKAIEESQQKMTQLFDAQ---KAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSF 77 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhh Confidence 663 23333333333332222111 111111111111111111 1000 000 00000000000 Q ss_pred -cccCHHHHHHH------------HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecC-C Q lcl|NC_019921. 63 -QSLSANQRSFF------------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSET-S 127 (381) Q Consensus 63 -~~lt~~e~~~~------------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~-~ 127 (381) +....+.++.+ +.....+...+|.+||++++..|++.+++.++|+++|+++++++ .+++|+..+ . T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 157 (385) T protein:vir:18 78 SERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFT 157 (385) T ss_pred HHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCC Confidence 00000111111 11122344556778899999999999999999999999999865 488999765 5 Q ss_pred cceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcc-eE Q lcl|NC_019921. 128 GVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQP-IG 206 (381) Q Consensus 128 ~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P-~G 206 (381) +.+.|++|++..+ +++++|+++++.+|+++++++||++||+|+. ++++||.++|+++++++++.+|++|+|+++| .| T Consensus 158 ~~a~~v~E~~~~~-~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~G 235 (385) T protein:vir:18 158 NNADVVAEKALKP-ESDITFSKQTANVKTIAHWVQASRQVMDDAP-MLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEG 235 (385) T ss_pred cceeeeccCcccc-ccccceeEEEEeeeeEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccc Confidence 7899999977654 6789999999999999999999999999875 6999999999999999999999999999865 79 Q ss_pred eeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCC Q lcl|NC_019921. 207 LNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA 286 (381) Q Consensus 207 il~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~ 286 (381) |++........ ........++.+.++...+ ...+..+++|+||+.++..++. +++. T Consensus 236 i~~~~~~~~~~--------------~~~~~~~~~d~i~~~~~~l-------~~~~~~~~~~~~~~~~~~~l~~---lkd~ 291 (385) T protein:vir:18 236 LNKVATAYDTS--------------LNATGDTRADIIAHAIYQV-------TESEFSASGIVLNPRDWHNIAL---LKDN 291 (385) T ss_pred ccccccccccc--------------ccccccchHHHHHHHHHhh-------ccccCCCCEEEEcHHHHHHHHH---hhcC Confidence 98654322111 0112234566676665544 3456777899999999887765 4788 Q ss_pred CCccccc--------cCCCceeEecCCCCCCcEEEEeecc-eEEEeecceEEEeehhh--hhhcCceEEEEEEEEcCEEe Q lcl|NC_019921. 287 NGVYVTA--------LPFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGKAK 355 (381) Q Consensus 287 ~G~~~~~--------l~~G~pVv~s~~~p~~~i~fgd~~~-y~i~~r~~i~i~~~~~~--~~~~d~~~~r~~~r~dGk~~ 355 (381) +|+|+|. ..+|+||+++++||+++++||||++ |.++++++++|+.+++. +|.+|++.||+.+|+||+++ T Consensus 292 ~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~ 371 (385) T protein:vir:18 292 EGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHY 371 (385) T ss_pred CCceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEe Confidence 9999874 2379999999999999999999997 89999999999988875 48999999999999999999 Q ss_pred cCceEEEEEEEecC Q lcl|NC_019921. 356 DNKVAAVWKLDLKG 369 (381) Q Consensus 356 ~~~Afvv~~~~~~~ 369 (381) +++||++++++-+. T Consensus 372 ~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 372 RPTAIIKGTFSSGS 385 (385) T ss_pred cccceEEEEeccCC Confidence 99999998877655 No 43 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=2.2e-56 Score=325.75 Aligned_cols=340 Identities=13% Similarity=0.101 Sum_probs=235.0 Q ss_pred Cch--hHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHH-------HHHH-HHH-HHHHHHhhccc------- Q lcl|NC_019921. 1 MTI--NLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEET-------KLQA-KAE-AERVSSLPKSA------- 62 (381) Q Consensus 1 mt~--el~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~-------~~~~-~~~-~~~~~~~~~~~------- 62 (381) |+= +|++++++..+++.+...+. +++.+......+.+.++. .+.. +.+ ........... T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~---~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (385) T protein:vir:19 1 MSELALIQKAIEESQQKMTQLFDAQ---KAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSF 77 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhh Confidence 663 23333333333332222111 111111111111111111 1000 000 00000000000 Q ss_pred -cccCHHHHHHH------------HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecC-C Q lcl|NC_019921. 63 -QSLSANQRSFF------------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSET-S 127 (381) Q Consensus 63 -~~lt~~e~~~~------------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~-~ 127 (381) +....+.++.+ +.....+...+|.+||++++..|++.+++.++|+++|+++++++ .+++|+..+ . T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 157 (385) T protein:vir:19 78 SERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFT 157 (385) T ss_pred HHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCC Confidence 00000111111 11122344556778899999999999999999999999999865 488999765 5 Q ss_pred cceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcc-eE Q lcl|NC_019921. 128 GVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQP-IG 206 (381) Q Consensus 128 ~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P-~G 206 (381) +.+.|++|++..+ +++++|+++++.+|+++++++||++||+|+. ++++||.++|+++++++++.+|++|+|+++| .| T Consensus 158 ~~a~~v~E~~~~~-~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~G 235 (385) T protein:vir:19 158 NNADVVAEKALKP-ESDITFSKQTANVKTIAHWVQASRQVMDDAP-MLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEG 235 (385) T ss_pred cceeeeccCcccc-ccccceeEEEEeeeeEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccc Confidence 7899999977654 6789999999999999999999999999875 6999999999999999999999999999865 79 Q ss_pred eeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCC Q lcl|NC_019921. 207 LNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA 286 (381) Q Consensus 207 il~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~ 286 (381) |++........ ........++.+.++...+ ...+..+++|+||+.++..++. +++. T Consensus 236 i~~~~~~~~~~--------------~~~~~~~~~d~i~~~~~~l-------~~~~~~~~~~~~~~~~~~~l~~---lkd~ 291 (385) T protein:vir:19 236 LNKVATAYDTS--------------LNATGDTRADIIAHAIYQV-------TESEFSASGIVLNPRDWHNIAL---LKDN 291 (385) T ss_pred ccccccccccc--------------ccccccchHHHHHHHHHhh-------ccccCCCCEEEEcHHHHHHHHH---hhcC Confidence 98654322111 0112234566676665544 3456777899999999887765 4788 Q ss_pred CCccccc--------cCCCceeEecCCCCCCcEEEEeecc-eEEEeecceEEEeehhh--hhhcCceEEEEEEEEcCEEe Q lcl|NC_019921. 287 NGVYVTA--------LPFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGKAK 355 (381) Q Consensus 287 ~G~~~~~--------l~~G~pVv~s~~~p~~~i~fgd~~~-y~i~~r~~i~i~~~~~~--~~~~d~~~~r~~~r~dGk~~ 355 (381) +|+|+|. ..+|+||+++++||+++++||||++ |.++++++++|+.+++. +|.+|++.||+.+|+||+++ T Consensus 292 ~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~ 371 (385) T protein:vir:19 292 EGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHY 371 (385) T ss_pred CCceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEe Confidence 9999874 2379999999999999999999997 89999999999988875 48999999999999999999 Q ss_pred cCceEEEEEEEecC Q lcl|NC_019921. 356 DNKVAAVWKLDLKG 369 (381) Q Consensus 356 ~~~Afvv~~~~~~~ 369 (381) +++||++++++-+. T Consensus 372 ~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 372 RPTAIIKGTFSSGS 385 (385) T ss_pred cccceEEEEeccCC Confidence 99999998877655 No 44 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=2.5e-56 Score=325.46 Aligned_cols=335 Identities=14% Similarity=0.081 Sum_probs=239.6 Q ss_pred CchhHH---HHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHH-----------------------HHHHHHH Q lcl|NC_019921. 1 MTINLS---ETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQ-----------------------AKAEAER 54 (381) Q Consensus 1 mt~el~---~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~ 54 (381) |+-+|+ ++++++..++...+.+.+. ++.+......+.+..+.... ...++++ T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~--~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKV--AEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRD 78 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHH Confidence 987654 3333333333322222111 11111111112222211111 1112222 Q ss_pred HHHhhccccccCHHHHHHHH------HHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEEEEec Q lcl|NC_019921. 55 VSSLPKSAQSLSANQRSFFM------DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKSE 125 (381) Q Consensus 55 ~~~~~~~~~~lt~~e~~~~~------~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g---~~~~p~~~ 125 (381) +.......+.++.+++.+.. .+..+++++||++||+++.+.|++.+++.+||+++|+++++++ ...+|+.. T Consensus 79 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~ 158 (392) T protein:vir:10 79 VFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS 158 (392) T ss_pred HHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeec Confidence 33322233344555554432 2345567889999999999999999999999999999998753 45678888 Q ss_pred CCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcce Q lcl|NC_019921. 126 TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI 205 (381) Q Consensus 126 ~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~ 205 (381) +.+.+.|++|+++.+..+.++|++|++.+|+++++++||++||+||.+|+++||.+.|+++++++++.+|++|+|+++|. T Consensus 159 ~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~ 238 (392) T protein:vir:10 159 DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ 238 (392) T ss_pred CCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 88899999999887765679999999999999999999999999999999999999999999999999999999988775 Q ss_pred EeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC Q lcl|NC_019921. 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN 285 (381) Q Consensus 206 Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~ 285 (381) |..+ ++.+.+++.. .....|+.+++|+|||.++..++. +++ T Consensus 239 ~~~~------------------------------~d~i~~~~~~------~l~~~~~~~a~~vm~~~~~~~L~~---lkd 279 (392) T protein:vir:10 239 AIKS------------------------------LDDIKDVLNV------KLDPAISPNAILLTNQDGFNYLDK---LKD 279 (392) T ss_pred CccC------------------------------HHHHHHHHHH------hhhhhhccCCEEEEcHHHHHHHHH---hhc Confidence 5421 2233333211 124468889999999999888765 478 Q ss_pred CCCcccccc---------CCCceeEe-cCCC-C------CC--cEEEEeecc-eEEEeecceEEEeehh--hhhhcCceE Q lcl|NC_019921. 286 ANGVYVTAL---------PFNLNVIE-STVQ-E------AG--KVLTYVKGL-YDGYLAGGINVQKFKE--TLALDDMDL 343 (381) Q Consensus 286 ~~G~~~~~l---------~~G~pVv~-s~~~-p------~~--~i~fgd~~~-y~i~~r~~i~i~~~~~--~~~~~d~~~ 343 (381) .+|+|+|.. .+|+|++. ++.+ | .+ .++||||++ |.+++|++++++++++ .+|.+|+++ T Consensus 280 ~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~ 359 (392) T protein:vir:10 280 KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLD 359 (392) T ss_pred cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceE Confidence 899998842 37876554 3322 2 12 379999998 7899999999999885 479999999 Q ss_pred EEEEEEEcCEEecCceEEEEEEEecCCcccccc Q lcl|NC_019921. 344 YTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 344 ~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~ 376 (381) ||+..|+||++++++||+.++++.+++..+.-| T Consensus 360 ~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 360 LRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEEEEeeccEEecccceEEEEecccccccCCCC Confidence 999999999999999999988888777766666 No 45 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=2.5e-56 Score=325.46 Aligned_cols=335 Identities=14% Similarity=0.081 Sum_probs=239.6 Q ss_pred CchhHH---HHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHH-----------------------HHHHHHH Q lcl|NC_019921. 1 MTINLS---ETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQ-----------------------AKAEAER 54 (381) Q Consensus 1 mt~el~---~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~ 54 (381) |+-+|+ ++++++..++...+.+.+. ++.+......+.+..+.... ...++++ T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~--~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKV--AEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRD 78 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHH Confidence 987654 3333333333322222111 11111111112222211111 1112222 Q ss_pred HHHhhccccccCHHHHHHHH------HHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEEEEec Q lcl|NC_019921. 55 VSSLPKSAQSLSANQRSFFM------DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKSE 125 (381) Q Consensus 55 ~~~~~~~~~~lt~~e~~~~~------~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g---~~~~p~~~ 125 (381) +.......+.++.+++.+.. .+..+++++||++||+++.+.|++.+++.+||+++|+++++++ ...+|+.. T Consensus 79 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~ 158 (392) T protein:vir:10 79 VFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS 158 (392) T ss_pred HHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeec Confidence 33322233344555554432 2345567889999999999999999999999999999998753 45678888 Q ss_pred CCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcce Q lcl|NC_019921. 126 TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI 205 (381) Q Consensus 126 ~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~ 205 (381) +.+.+.|++|+++.+..+.++|++|++.+|+++++++||++||+||.+|+++||.+.|+++++++++.+|++|+|+++|. T Consensus 159 ~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~ 238 (392) T protein:vir:10 159 DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ 238 (392) T ss_pred CCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 88899999999887765679999999999999999999999999999999999999999999999999999999988775 Q ss_pred EeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC Q lcl|NC_019921. 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN 285 (381) Q Consensus 206 Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~ 285 (381) |..+ ++.+.+++.. .....|+.+++|+|||.++..++. +++ T Consensus 239 ~~~~------------------------------~d~i~~~~~~------~l~~~~~~~a~~vm~~~~~~~L~~---lkd 279 (392) T protein:vir:10 239 AIKS------------------------------LDDIKDVLNV------KLDPAISPNAILLTNQDGFNYLDK---LKD 279 (392) T ss_pred CccC------------------------------HHHHHHHHHH------hhhhhhccCCEEEEcHHHHHHHHH---hhc Confidence 5421 2233333211 124468889999999999888765 478 Q ss_pred CCCcccccc---------CCCceeEe-cCCC-C------CC--cEEEEeecc-eEEEeecceEEEeehh--hhhhcCceE Q lcl|NC_019921. 286 ANGVYVTAL---------PFNLNVIE-STVQ-E------AG--KVLTYVKGL-YDGYLAGGINVQKFKE--TLALDDMDL 343 (381) Q Consensus 286 ~~G~~~~~l---------~~G~pVv~-s~~~-p------~~--~i~fgd~~~-y~i~~r~~i~i~~~~~--~~~~~d~~~ 343 (381) .+|+|+|.. .+|+|++. ++.+ | .+ .++||||++ |.+++|++++++++++ .+|.+|+++ T Consensus 280 ~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~ 359 (392) T protein:vir:10 280 KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLD 359 (392) T ss_pred cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceE Confidence 899998842 37876554 3322 2 12 379999998 7899999999999885 479999999 Q ss_pred EEEEEEEcCEEecCceEEEEEEEecCCcccccc Q lcl|NC_019921. 344 YTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 344 ~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~ 376 (381) ||+..|+||++++++||+.++++.+++..+.-| T Consensus 360 ~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 360 LRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEEEEeeccEEecccceEEEEecccccccCCCC Confidence 999999999999999999988888777766666 No 46 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=2.5e-56 Score=325.46 Aligned_cols=335 Identities=14% Similarity=0.081 Sum_probs=239.6 Q ss_pred CchhHH---HHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHH-----------------------HHHHHHH Q lcl|NC_019921. 1 MTINLS---ETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQ-----------------------AKAEAER 54 (381) Q Consensus 1 mt~el~---~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~ 54 (381) |+-+|+ ++++++..++...+.+.+. ++.+......+.+..+.... ...++++ T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~--~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKV--AEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRD 78 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHH Confidence 987654 3333333333322222111 11111111112222211111 1112222 Q ss_pred HHHhhccccccCHHHHHHHH------HHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEEEEec Q lcl|NC_019921. 55 VSSLPKSAQSLSANQRSFFM------DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKSE 125 (381) Q Consensus 55 ~~~~~~~~~~lt~~e~~~~~------~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g---~~~~p~~~ 125 (381) +.......+.++.+++.+.. .+..+++++||++||+++.+.|++.+++.+||+++|+++++++ ...+|+.. T Consensus 79 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~ 158 (392) T protein:vir:10 79 VFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS 158 (392) T ss_pred HHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeec Confidence 33322233344555554432 2345567889999999999999999999999999999998753 45678888 Q ss_pred CCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcce Q lcl|NC_019921. 126 TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI 205 (381) Q Consensus 126 ~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~ 205 (381) +.+.+.|++|+++.+..+.++|++|++.+|+++++++||++||+||.+|+++||.+.|+++++++++.+|++|+|+++|. T Consensus 159 ~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~ 238 (392) T protein:vir:10 159 DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ 238 (392) T ss_pred CCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 88899999999887765679999999999999999999999999999999999999999999999999999999988775 Q ss_pred EeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC Q lcl|NC_019921. 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN 285 (381) Q Consensus 206 Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~ 285 (381) |..+ ++.+.+++.. .....|+.+++|+|||.++..++. +++ T Consensus 239 ~~~~------------------------------~d~i~~~~~~------~l~~~~~~~a~~vm~~~~~~~L~~---lkd 279 (392) T protein:vir:10 239 AIKS------------------------------LDDIKDVLNV------KLDPAISPNAILLTNQDGFNYLDK---LKD 279 (392) T ss_pred CccC------------------------------HHHHHHHHHH------hhhhhhccCCEEEEcHHHHHHHHH---hhc Confidence 5421 2233333211 124468889999999999888765 478 Q ss_pred CCCcccccc---------CCCceeEe-cCCC-C------CC--cEEEEeecc-eEEEeecceEEEeehh--hhhhcCceE Q lcl|NC_019921. 286 ANGVYVTAL---------PFNLNVIE-STVQ-E------AG--KVLTYVKGL-YDGYLAGGINVQKFKE--TLALDDMDL 343 (381) Q Consensus 286 ~~G~~~~~l---------~~G~pVv~-s~~~-p------~~--~i~fgd~~~-y~i~~r~~i~i~~~~~--~~~~~d~~~ 343 (381) .+|+|+|.. .+|+|++. ++.+ | .+ .++||||++ |.+++|++++++++++ .+|.+|+++ T Consensus 280 ~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~ 359 (392) T protein:vir:10 280 KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLD 359 (392) T ss_pred cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceE Confidence 899998842 37876554 3322 2 12 379999998 7899999999999885 479999999 Q ss_pred EEEEEEEcCEEecCceEEEEEEEecCCcccccc Q lcl|NC_019921. 344 YTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 344 ~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~ 376 (381) ||+..|+||++++++||+.++++.+++..+.-| T Consensus 360 ~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 360 LRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEEEEeeccEEecccceEEEEecccccccCCCC Confidence 999999999999999999988888777766666 No 47 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=2.5e-56 Score=325.46 Aligned_cols=335 Identities=14% Similarity=0.081 Sum_probs=239.6 Q ss_pred CchhHH---HHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHH-----------------------HHHHHHH Q lcl|NC_019921. 1 MTINLS---ETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQ-----------------------AKAEAER 54 (381) Q Consensus 1 mt~el~---~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~ 54 (381) |+-+|+ ++++++..++...+.+.+. ++.+......+.+..+.... ...++++ T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~--~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKV--AEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRD 78 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHH Confidence 987654 3333333333322222111 11111111112222211111 1112222 Q ss_pred HHHhhccccccCHHHHHHHH------HHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEEEEec Q lcl|NC_019921. 55 VSSLPKSAQSLSANQRSFFM------DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKSE 125 (381) Q Consensus 55 ~~~~~~~~~~lt~~e~~~~~------~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g---~~~~p~~~ 125 (381) +.......+.++.+++.+.. .+..+++++||++||+++.+.|++.+++.+||+++|+++++++ ...+|+.. T Consensus 79 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~ 158 (392) T protein:vir:10 79 VFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS 158 (392) T ss_pred HHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeec Confidence 33322233344555554432 2345567889999999999999999999999999999998753 45678888 Q ss_pred CCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcce Q lcl|NC_019921. 126 TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI 205 (381) Q Consensus 126 ~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~ 205 (381) +.+.+.|++|+++.+..+.++|++|++.+|+++++++||++||+||.+|+++||.+.|+++++++++.+|++|+|+++|. T Consensus 159 ~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~ 238 (392) T protein:vir:10 159 DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ 238 (392) T ss_pred CCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 88899999999887765679999999999999999999999999999999999999999999999999999999988775 Q ss_pred EeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC Q lcl|NC_019921. 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN 285 (381) Q Consensus 206 Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~ 285 (381) |..+ ++.+.+++.. .....|+.+++|+|||.++..++. +++ T Consensus 239 ~~~~------------------------------~d~i~~~~~~------~l~~~~~~~a~~vm~~~~~~~L~~---lkd 279 (392) T protein:vir:10 239 AIKS------------------------------LDDIKDVLNV------KLDPAISPNAILLTNQDGFNYLDK---LKD 279 (392) T ss_pred CccC------------------------------HHHHHHHHHH------hhhhhhccCCEEEEcHHHHHHHHH---hhc Confidence 5421 2233333211 124468889999999999888765 478 Q ss_pred CCCcccccc---------CCCceeEe-cCCC-C------CC--cEEEEeecc-eEEEeecceEEEeehh--hhhhcCceE Q lcl|NC_019921. 286 ANGVYVTAL---------PFNLNVIE-STVQ-E------AG--KVLTYVKGL-YDGYLAGGINVQKFKE--TLALDDMDL 343 (381) Q Consensus 286 ~~G~~~~~l---------~~G~pVv~-s~~~-p------~~--~i~fgd~~~-y~i~~r~~i~i~~~~~--~~~~~d~~~ 343 (381) .+|+|+|.. .+|+|++. ++.+ | .+ .++||||++ |.+++|++++++++++ .+|.+|+++ T Consensus 280 ~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~ 359 (392) T protein:vir:10 280 KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLD 359 (392) T ss_pred cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceE Confidence 899998842 37876554 3322 2 12 379999998 7899999999999885 479999999 Q ss_pred EEEEEEEcCEEecCceEEEEEEEecCCcccccc Q lcl|NC_019921. 344 YTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 344 ~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~ 376 (381) ||+..|+||++++++||+.++++.+++..+.-| T Consensus 360 ~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 360 LRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEEEEeeccEEecccceEEEEecccccccCCCC Confidence 999999999999999999988888777766666 No 48 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=1.1e-57 Score=332.80 Aligned_cols=290 Identities=12% Similarity=0.040 Sum_probs=234.5 Q ss_pred cCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceEEeeccccccccc Q lcl|NC_019921. 65 LSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIKGQL 143 (381) Q Consensus 65 lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~~~~a~wv~e~~~~~~~~ 143 (381) |..++.+..+ ..++.++|.+||++++++|++.+++.++|+++++++++++ ..++|+.++.+.+.|++|+++++ ++ T Consensus 1 m~~~~~~a~~---~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~ 76 (330) T protein:vir:77 1 MAGSTVPSTQ---VALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVSASWTGEAERKP-IT 76 (330) T ss_pred Ccccccchhh---ccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceeEecCCCccc-cc Confidence 6666655432 3445566778888899999999999999999999999865 58999999999999999987765 67 Q ss_pred CcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCc-ceEeeeccccccccccccc Q lcl|NC_019921. 144 DAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQ-PIGLNRQVQKGVSVTEGAY 222 (381) Q Consensus 144 ~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~-P~Gil~~~~~~~~~~~~~~ 222 (381) +++|+++++.+||++++++||+|||+||.+++++||.++|+++++++++.+|++|+|+++ |.||++.+.......... T Consensus 77 ~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~- 155 (330) T protein:vir:77 77 KGSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTN- 155 (330) T ss_pred cceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeeccc- Confidence 899999999999999999999999999999999999999999999999999999999864 579987654333222111 Q ss_pred cceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCcccccc-------- Q lcl|NC_019921. 223 PEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL-------- 294 (381) Q Consensus 223 ~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l-------- 294 (381) ..+. .......++.+.+++..+ ...+..+++|+||+.++..++. +++.+|+|+|.. T Consensus 156 -----~~~~-~~~~~~~~~~l~~~~~~~-------~~~~~~~~~~vmn~~~~~~l~~---lkd~~G~~l~~~~~~~~~~~ 219 (330) T protein:vir:77 156 -----LTTA-SGPQGNAYLAVNNALSLL-------VNSGKKWTGTLLDNVTEPILNT---AVDGNGRPLFVESTYTEQVG 219 (330) T ss_pred -----cccc-ccccchhHHHHHHHHHhh-------hhcCCCccEEEEcHHHHHHHHH---HhccCCceeecCcccccccc Confidence 0011 111222334444444332 2345667789999999887765 478889988752 Q ss_pred ------CCCceeEecCCCCCCc------EEEEeecceEEEeecceEEEeehhhh------------------hhcCceEE Q lcl|NC_019921. 295 ------PFNLNVIESTVQEAGK------VLTYVKGLYDGYLAGGINVQKFKETL------------------ALDDMDLY 344 (381) Q Consensus 295 ------~~G~pVv~s~~~p~~~------i~fgd~~~y~i~~r~~i~i~~~~~~~------------------~~~d~~~~ 344 (381) .+|+||+.+++||++. ++||||++|+++++++++|++++|.+ |.+|++.| T Consensus 220 ~~~~~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~ 299 (330) T protein:vir:77 220 AIREGRILGRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAV 299 (330) T ss_pred ccCCceecceeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEE Confidence 3699999999999754 79999999999999999999999865 78899999 Q ss_pred EEEEEEcCEEecCceEEEEEEEecCCccccc Q lcl|NC_019921. 345 TAKQFAYGKAKDNKVAAVWKLDLKGHKPALE 375 (381) Q Consensus 345 r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~ 375 (381) |+.+|+|+++++++||++++.+.++.+|+-+ T Consensus 300 r~~~r~d~~v~~~~a~~~i~~~~~~~~~~~~ 330 (330) T protein:vir:77 300 RCEAEFAFMVNDKDAFVKLTDQVAGTDPEEE 330 (330) T ss_pred EEEEEeccEEecccceEEEEeccCCcCCCCC Confidence 9999999999999999999999888888777 No 49 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=2.6e-56 Score=325.33 Aligned_cols=346 Identities=15% Similarity=0.093 Sum_probs=231.5 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhhhH--------HHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHH--------- Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNGEPQ--------ERQNELYGDMINQLFEET-------KLQAKAEAERVS--------- 56 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~~~~--------~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~--------- 56 (381) |==|.++...+..++..+.++....+ ++..+......+.+.+.. ..+......... T Consensus 1 ~~ke~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (413) T protein:vir:81 1 MVKEAGDAPTNAQVAEIAEVKSMVEQFKADEDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRKGEGYKSIGEF 80 (413) T ss_pred ChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhhhhhhhhhhhh Confidence 55555544444443322222211000 000000111111100000 000000000000 Q ss_pred ---------Hhhc-------cccccCHHHHHHHHH--HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc Q lcl|NC_019921. 57 ---------SLPK-------SAQSLSANQRSFFMD--INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR 118 (381) Q Consensus 57 ---------~~~~-------~~~~lt~~e~~~~~~--~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~ 118 (381) ...+ ........+.+.+.. ...+++.++|++||++++++|++.+++.++|+++|++.++++. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 160 (413) T protein:vir:81 81 FAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNT 160 (413) T ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCC Confidence 0000 000111112222211 1234567899999999999999999999999999999998764 Q ss_pred -eEEEEecCC----cceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019921. 119 -LKFLKSETS----GVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALET 193 (381) Q Consensus 119 -~~~p~~~~~----~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~ 193 (381) .++|+.... +.+.|++|+++.+....++|+++++.+|+++++++||++||+|+.. |++||+++|++++++++|. T Consensus 161 ~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~-l~~~i~~~la~~~~~~~d~ 239 (413) T protein:vir:81 161 TIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDYDF-LVSYINARLLEELAIEEER 239 (413) T ss_pred ceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 788876543 4579999988876544589999999999999999999999999965 9999999999999999999 Q ss_pred heeeccCCCcc-eEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchh Q lcl|NC_019921. 194 AFLKGTGKDQP-IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) Q Consensus 194 a~i~G~G~~~P-~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~ 272 (381) +|++|+|+++| .||++....... ........++.+......... +..|+.++ |+||+. T Consensus 240 ~~l~G~G~~~~~~Gi~~~~~~~~~---------------~~~~~~~~~~~i~~~~~~~~~-----~~~~~~~~-~vmn~~ 298 (413) T protein:vir:81 240 QLLLGDGTGNNLTGLLKRDGIQTL---------------AVSNKDELADSIYKAMTNISL-----ATPFQADA-LVINPL 298 (413) T ss_pred HHhccCCCCCcccccccccccccc---------------cccccchhHHHHHHHHHHhhh-----hccCCCcE-EEEcHH Confidence 99999999865 799865332211 111222334444443332211 23455555 999999 Q ss_pred hHHHHhhhhhccCCCCccccc----------------cCCCceeEecCCCCCCcEEEEeecc-eEEEeecceEEEeehhh Q lcl|NC_019921. 273 DAFEVQAQYTHLNANGVYVTA----------------LPFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQKFKET 335 (381) Q Consensus 273 t~~~~~~~~~~~~~~G~~~~~----------------l~~G~pVv~s~~~p~~~i~fgd~~~-y~i~~r~~i~i~~~~~~ 335 (381) ++..+.+ +++.+|+|+|. .+||+||+++++||+++++||||++ |.+++|++++|+.+++. T Consensus 299 ~~~~l~~---lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~ 375 (413) T protein:vir:81 299 DYQELRL---AKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPVVGAFRSAASVLRKGGVRIDSTNTN 375 (413) T ss_pred HHHHHHH---hhccCCceeccccccccccccccccCceecceeeEEcCCCCcccEEEEecccEEEEEEecceEEEEeccc Confidence 9887765 47888998874 2479999999999999999999997 89999999999998876 Q ss_pred --hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcc Q lcl|NC_019921. 336 --LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 336 --~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~ 372 (381) +|.+|+++||+.+|+||++++++||++++++- +.+| T Consensus 376 ~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~-~~~p 413 (413) T protein:vir:81 376 VDDFENNLITVRAEERVGLMVTFPEAIVQLDVAE-VVTP 413 (413) T ss_pred cchhhcCcEEEEEEEeeccEEecccceEEEEecC-CCCC Confidence 69999999999999999999999999977643 3333 No 50 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=6.4e-56 Score=323.19 Aligned_cols=341 Identities=12% Similarity=0.014 Sum_probs=234.0 Q ss_pred Cc-hh-HHHHHHHHHHHHHHHHhhhhh-------HHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHH---Hhhccc---- Q lcl|NC_019921. 1 MT-IN-LSETFANAKNEFINAVNNGEP-------QERQNELYGDMINQLFEETKL--QAKAEAERVS---SLPKSA---- 62 (381) Q Consensus 1 mt-~e-l~~~~~~~~~~~~~~~k~~~~-------~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~---~~~~~~---- 62 (381) |. ++ |++++.+..+++.+..+..+. ..++...+....+.+..+... +...+.+... ...... T Consensus 1 Mk~~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Confidence 44 22 333344444333322111111 111111111111111111100 0000000000 000000 Q ss_pred ---cccCHHHHHH------------HHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC---ceEEEEe Q lcl|NC_019921. 63 ---QSLSANQRSF------------FMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL---RLKFLKS 124 (381) Q Consensus 63 ---~~lt~~e~~~------------~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g---~~~~p~~ 124 (381) .....++++. +..+..+++++||++||+++++.|++.+++.++|+++|+++++++ .+.+|+. T Consensus 81 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 160 (397) T protein:vir:49 81 KNEEEVKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKW 160 (397) T ss_pred chhhHHHHHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEee Confidence 0111222222 234455678889999999999999999999999999999988753 4566665 Q ss_pred c-CCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCc Q lcl|NC_019921. 125 E-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQ 203 (381) Q Consensus 125 ~-~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~ 203 (381) . ..+.+.|++|++..+..+.++|++|++.+|+++++++||++||+||.+|+++||.+++++++++++|.+|++|+|+++ T Consensus 161 ~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~ 240 (397) T protein:vir:49 161 ADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGTLP 240 (397) T ss_pred ccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 4 457799999988877656789999999999999999999999999999999999999999999999999999999987 Q ss_pred ceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhc Q lcl|NC_019921. 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 204 P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~ 283 (381) |.+.. ..++.+.++...+ ...|+.+++|+||+.++..+.. + T Consensus 241 ~~~~~-----------------------------~~~d~i~~~~~~l-------~~~~~~~a~~v~n~~~~~~l~~---l 281 (397) T protein:vir:49 241 NKPTL-----------------------------AKWDDIIDLQAKV-------DPAIKQTSLFLTNTSGFTALKK---V 281 (397) T ss_pred ccccc-----------------------------cCHHHHHHHHHhh-------hhhhcCCCEEEEcHHHHHHHHH---h Confidence 75321 1234455544433 3457889999999999887765 4 Q ss_pred cCCCCccccc---------cCCCceeEecC--CCCC-----CcEEEEeecc-eEEEeecceEEEeehhh--hhhcCceEE Q lcl|NC_019921. 284 LNANGVYVTA---------LPFNLNVIEST--VQEA-----GKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMDLY 344 (381) Q Consensus 284 ~~~~G~~~~~---------l~~G~pVv~s~--~~p~-----~~i~fgd~~~-y~i~~r~~i~i~~~~~~--~~~~d~~~~ 344 (381) ++.+|+|+|. ..+|+||+.++ .||. ..++||||++ |++++|++++|+++++. +|.+|++.| T Consensus 282 kd~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 361 (397) T protein:vir:49 282 KNAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKV 361 (397) T ss_pred hccCCceeecccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEeccccchhhcCeeeE Confidence 7889999874 34799998754 4553 3489999997 88999999999999865 699999999 Q ss_pred EEEEEEcCEEecCceEEEEEEEecCCccccccCccc Q lcl|NC_019921. 345 TAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEET 380 (381) Q Consensus 345 r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~ 380 (381) |+..|+||++++++||++++++-.+.+++.+.+.-- T Consensus 362 ~~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:49 362 RVIDRFDVVSTDTEAFVPASFKAIADQKAKLSTAGA 397 (397) T ss_pred EEEEeeccEEecccceEEEEecccccccCcccccCC Confidence 999999999999999999887776665443332222 No 51 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=4.2e-55 Score=318.71 Aligned_cols=341 Identities=9% Similarity=0.006 Sum_probs=234.8 Q ss_pred Cchh-----HHHHHHHHHHHHHHHHhhh-------hhHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHhhc--cc- Q lcl|NC_019921. 1 MTIN-----LSETFANAKNEFINAVNNG-------EPQERQNELYGDMINQLFEET---KLQAKAEAERVSSLPK--SA- 62 (381) Q Consensus 1 mt~e-----l~~~~~~~~~~~~~~~k~~-------~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~--~~- 62 (381) |++| |++++++.+.++.+..++. +..+++.+.+.+..+....+. ..+......+...... .. T Consensus 1 ~~~~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (404) T protein:vir:39 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (404) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 5554 4444444444332222111 001111111111111111110 0000000000000000 00 Q ss_pred ------cccCHHHHHHH----------------HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--- Q lcl|NC_019921. 63 ------QSLSANQRSFF----------------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--- 117 (381) Q Consensus 63 ------~~lt~~e~~~~----------------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--- 117 (381) .....++++.| +++..+++++||++||+++++.|++.+++.+||+++|+++++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 160 (404) T protein:vir:39 81 PLNKSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG 160 (404) T ss_pred ccccchhhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcc Confidence 00011111111 23345677889999999999999999999999999999998753 Q ss_pred ceEEEEe-cCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhhee Q lcl|NC_019921. 118 RLKFLKS-ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL 196 (381) Q Consensus 118 ~~~~p~~-~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i 196 (381) ...+++. +..+.+.|++|+++.++.++++|+++++.+|+++++++||++||+||.+|+++||.++|+++++++++.+|+ T Consensus 161 ~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il 240 (404) T protein:vir:39 161 SRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (404) T ss_pred eEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3444444 445778999998887766789999999999999999999999999999999999999999999999999999 Q ss_pred eccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHH Q lcl|NC_019921. 197 KGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFE 276 (381) Q Consensus 197 ~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~ 276 (381) +|+|+++|.|... ..+.+.+++... ....|+.+++|+||+.++.. T Consensus 241 ~g~g~~~~~~~~~-----------------------------~~~~i~~~~~~~------~~~~~~~~a~~v~n~~~~~~ 285 (404) T protein:vir:39 241 AAMGTVPKKPTIA-----------------------------KFDDVITMINTS------VDPAIIATSSLLTNQSGLNK 285 (404) T ss_pred hcccccccccccc-----------------------------cHHHHHHHHHHh------hhhhhccCCEEEEcHHHHHH Confidence 9999998865431 122233333211 13467889999999999887 Q ss_pred HhhhhhccCCCCccccc---------cCCCceeEecCC--CCC-----CcEEEEeecc-eEEEeecceEEEeehhh--hh Q lcl|NC_019921. 277 VQAQYTHLNANGVYVTA---------LPFNLNVIESTV--QEA-----GKVLTYVKGL-YDGYLAGGINVQKFKET--LA 337 (381) Q Consensus 277 ~~~~~~~~~~~G~~~~~---------l~~G~pVv~s~~--~p~-----~~i~fgd~~~-y~i~~r~~i~i~~~~~~--~~ 337 (381) +.. +++.+|+|++. ..+|+||+.+++ +|. ..++||||++ |.+++|+++++..+++. +| T Consensus 286 L~~---lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~ 362 (404) T protein:vir:39 286 LAL---VKTAEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAF 362 (404) T ss_pred HHH---hhccCCceeeccCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccchhhh Confidence 765 47889999874 337999998654 453 2489999998 78999999999999976 69 Q ss_pred hcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcc Q lcl|NC_019921. 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 338 ~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~ 379 (381) .+|++.||+.+|+||++.+++||++++++-++....+.++-- T Consensus 363 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~~~~~~~~~~ 404 (404) T protein:vir:39 363 ETDTTKIRVIDRFDVKTTDSEALVAGSFTAIADQVGNFTAGK 404 (404) T ss_pred hhceeeEEEEeeeccEEecccceEEEEeeccccCCCCCCCCC Confidence 999999999999999999999999999888777666555555 No 52 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=5.2e-56 Score=323.68 Aligned_cols=339 Identities=12% Similarity=0.017 Sum_probs=234.0 Q ss_pred hhHHHHHHHHHHHHHHHHhhhhh-----------HHHHHHHHHHHHHHHHHHHHH------HHHHHH-HHHH-Hhhc--- Q lcl|NC_019921. 3 INLSETFANAKNEFINAVNNGEP-----------QERQNELYGDMINQLFEETKL------QAKAEA-ERVS-SLPK--- 60 (381) Q Consensus 3 ~el~~~~~~~~~~~~~~~k~~~~-----------~~~~~~~~~~~~~~~~~~~~~------~~~~~~-~~~~-~~~~--- 60 (381) |+..+++++.+.++.+.+++.+. ..++.+.+...++.+..+... +.+... .+.. .... T Consensus 1 Mk~~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhcccccc Confidence 55555554444443333222111 111111122222111111110 000000 0000 0000 Q ss_pred -cccccCHHHHHH------------HHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc---eEEEE- Q lcl|NC_019921. 61 -SAQSLSANQRSF------------FMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR---LKFLK- 123 (381) Q Consensus 61 -~~~~lt~~e~~~------------~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~---~~~p~- 123 (381) .......++++. ......+++++||++||+++++.|++.+++.++|+++|+++++++. ..++. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 160 (397) T protein:vir:48 81 KSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKW 160 (397) T ss_pred chhhHHHHHHHHHHHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEee Confidence 000111222222 2234456778899999999999999999999999999999987542 33333 Q ss_pred ecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCc Q lcl|NC_019921. 124 SETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQ 203 (381) Q Consensus 124 ~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~ 203 (381) .+..+.+.|++|++..+..++++|++|++.+++++++++||++||+||.+++++||.++|+++++++++.+|++|+|+++ T Consensus 161 ~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~ 240 (397) T protein:vir:48 161 ADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIATLP 240 (397) T ss_pred cCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 34556799999998887666799999999999999999999999999999999999999999999999999999999987 Q ss_pred ceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhc Q lcl|NC_019921. 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 204 P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~ 283 (381) |.|.+ ..++.+.++...+ ...|+.+++|+||+.++..+.. + T Consensus 241 ~~~~~-----------------------------~~~d~i~~~~~~l-------~~~~~~~a~~v~n~~~~~~L~~---l 281 (397) T protein:vir:48 241 TKPTL-----------------------------TKWDDIIDLQAKV-------DPAIKQTSFFLTNTSGFTALKK---V 281 (397) T ss_pred ccccc-----------------------------ccHHHHHHHHHHh-------hhhhcCCCEEEECHHHHHHHHH---h Confidence 75422 1234455554443 3457788999999999887765 4 Q ss_pred cCCCCccccc---------cCCCceeEecC--CCC-----CCcEEEEeecc-eEEEeecceEEEeehhh--hhhcCceEE Q lcl|NC_019921. 284 LNANGVYVTA---------LPFNLNVIEST--VQE-----AGKVLTYVKGL-YDGYLAGGINVQKFKET--LALDDMDLY 344 (381) Q Consensus 284 ~~~~G~~~~~---------l~~G~pVv~s~--~~p-----~~~i~fgd~~~-y~i~~r~~i~i~~~~~~--~~~~d~~~~ 344 (381) ++.+|+|++. ..+|+||+.++ .+| +..++||||++ |.+++|++++++.+++. +|.+|++.| T Consensus 282 kd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 361 (397) T protein:vir:48 282 KNAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKI 361 (397) T ss_pred hcCCCceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccchhhhhcCceeE Confidence 7889999874 24799998754 344 34589999997 67999999999998865 699999999 Q ss_pred EEEEEEcCEEecCceEEEEEEEecCCccccccCccc Q lcl|NC_019921. 345 TAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEET 380 (381) Q Consensus 345 r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~ 380 (381) |+.+|+||++++++||+.++++-+..++...++=.- T Consensus 362 r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:48 362 RVIDRFDVVATDTESFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred EEEeeeccEEecccceEEEEecccccCCCCccccCC Confidence 999999999999999999887776666555544443 No 53 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=4.9e-55 Score=318.33 Aligned_cols=350 Identities=9% Similarity=0.009 Sum_probs=237.4 Q ss_pred hhHHHHHHHHHHHHHH----HHhhhhh-----HHHHHHHHHHHHHHHHHHHHHHH------HHHH--------------- Q lcl|NC_019921. 3 INLSETFANAKNEFIN----AVNNGEP-----QERQNELYGDMINQLFEETKLQA------KAEA--------------- 52 (381) Q Consensus 3 ~el~~~~~~~~~~~~~----~~k~~~~-----~~~~~~~~~~~~~~~~~~~~~~~------~~~~--------------- 52 (381) ||-.++..+..+++.+ ..++... +.++.+.....++++..+..... .... T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:47 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 4433333333333222 2111100 00011111111111211111100 0000 Q ss_pred --------HHHHHhhccccccCHHHHHHHHHH---------hhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec Q lcl|NC_019921. 53 --------ERVSSLPKSAQSLSANQRSFFMDI---------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 53 --------~~~~~~~~~~~~lt~~e~~~~~~~---------~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~ 115 (381) ...............++++.|... ...+..+||++||+++++.|++.+++.++|+++|+++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:47 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeec Confidence 000000001112333444444322 112455789999999999999999999999999999997 Q ss_pred C-CceEEE--EecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019921. 116 G-LRLKFL--KSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE 192 (381) Q Consensus 116 ~-g~~~~p--~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~ 192 (381) + +..++| ...+.+.+.|++|++..++.+.++|++|++.+|+++++++||++||+||.+++++||.++|+++++++++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d 240 (415) T protein:vir:47 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) T ss_pred cCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 5 334555 5566778999999988876678999999999999999999999999999999999999999999999999 Q ss_pred hheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchh Q lcl|NC_019921. 193 TAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) Q Consensus 193 ~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~ 272 (381) .+|++|+|+++|.++.......... +......+++.+.+++..+ ...|+.++.|+||+. T Consensus 241 ~~il~g~g~g~~~~~~~~~~~~~~~--------------~~~~~~~~~~~i~~~~~~~-------~~~~~~~~~~v~n~~ 299 (415) T protein:vir:47 241 KAIIDVITKGSTGSTSSGFEKEGKK--------------LEVKKAKSLDDIKDAINLN-------VKPNYEHNVAIVSQT 299 (415) T ss_pred HHHhhccccCCccccccccccccce--------------eccccccchHHHHHHHHhh-------hhhccCCCEEEEcHH Confidence 9999999999887765432111110 1112223455566665544 234667889999999 Q ss_pred hHHHHhhhhhccCCCCccccc---------cCCCceeEecCCCCCC-----cEEEEeecc-eEEEeecceEEEeehhhhh Q lcl|NC_019921. 273 DAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKETLA 337 (381) Q Consensus 273 t~~~~~~~~~~~~~~G~~~~~---------l~~G~pVv~s~~~p~~-----~i~fgd~~~-y~i~~r~~i~i~~~~~~~~ 337 (381) ++..+.. +++.+|+|+|. ..+|+||+.+++||.+ .++||||++ |++++|+++++..++ | T Consensus 300 ~~~~L~~---lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~---~ 373 (415) T protein:vir:47 300 MFAKLDK---MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD---Y 373 (415) T ss_pred HHHHHHH---hhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeec---c Confidence 9877654 57899999874 3479999999999853 389999998 789999999999987 5 Q ss_pred hcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcc Q lcl|NC_019921. 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 338 ~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~ 379 (381) .++++.||+.+|+||++++++||++++++-++..+-.-|-.- T Consensus 374 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:47 374 MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEeeccCCCCCCccCCC Confidence 677899999999999999999999999887766654433333 No 54 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=4.9e-55 Score=318.33 Aligned_cols=350 Identities=9% Similarity=0.009 Sum_probs=237.4 Q ss_pred hhHHHHHHHHHHHHHH----HHhhhhh-----HHHHHHHHHHHHHHHHHHHHHHH------HHHH--------------- Q lcl|NC_019921. 3 INLSETFANAKNEFIN----AVNNGEP-----QERQNELYGDMINQLFEETKLQA------KAEA--------------- 52 (381) Q Consensus 3 ~el~~~~~~~~~~~~~----~~k~~~~-----~~~~~~~~~~~~~~~~~~~~~~~------~~~~--------------- 52 (381) ||-.++..+..+++.+ ..++... +.++.+.....++++..+..... .... T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:46 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 4433333333333222 2111100 00011111111111211111100 0000 Q ss_pred --------HHHHHhhccccccCHHHHHHHHHH---------hhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec Q lcl|NC_019921. 53 --------ERVSSLPKSAQSLSANQRSFFMDI---------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 53 --------~~~~~~~~~~~~lt~~e~~~~~~~---------~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~ 115 (381) ...............++++.|... ...+..+||++||+++++.|++.+++.++|+++|+++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:46 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeec Confidence 000000001112333444444322 112455789999999999999999999999999999997 Q ss_pred C-CceEEE--EecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019921. 116 G-LRLKFL--KSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE 192 (381) Q Consensus 116 ~-g~~~~p--~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~ 192 (381) + +..++| ...+.+.+.|++|++..++.+.++|++|++.+|+++++++||++||+||.+++++||.++|+++++++++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d 240 (415) T protein:vir:46 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) T ss_pred cCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 5 334555 5566778999999988876678999999999999999999999999999999999999999999999999 Q ss_pred hheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchh Q lcl|NC_019921. 193 TAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) Q Consensus 193 ~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~ 272 (381) .+|++|+|+++|.++.......... +......+++.+.+++..+ ...|+.++.|+||+. T Consensus 241 ~~il~g~g~g~~~~~~~~~~~~~~~--------------~~~~~~~~~~~i~~~~~~~-------~~~~~~~~~~v~n~~ 299 (415) T protein:vir:46 241 KAIIDVITKGSTGSTSSGFEKEGKK--------------LEVKKAKSLDDIKDAINLN-------VKPNYEHNVAIVSQT 299 (415) T ss_pred HHHhhccccCCccccccccccccce--------------eccccccchHHHHHHHHhh-------hhhccCCCEEEEcHH Confidence 9999999999887765432111110 1112223455566665544 234667889999999 Q ss_pred hHHHHhhhhhccCCCCccccc---------cCCCceeEecCCCCCC-----cEEEEeecc-eEEEeecceEEEeehhhhh Q lcl|NC_019921. 273 DAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKETLA 337 (381) Q Consensus 273 t~~~~~~~~~~~~~~G~~~~~---------l~~G~pVv~s~~~p~~-----~i~fgd~~~-y~i~~r~~i~i~~~~~~~~ 337 (381) ++..+.. +++.+|+|+|. ..+|+||+.+++||.+ .++||||++ |++++|+++++..++ | T Consensus 300 ~~~~L~~---lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~---~ 373 (415) T protein:vir:46 300 MFAKLDK---MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD---Y 373 (415) T ss_pred HHHHHHH---hhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeec---c Confidence 9877654 57899999874 3479999999999853 389999998 789999999999987 5 Q ss_pred hcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcc Q lcl|NC_019921. 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 338 ~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~ 379 (381) .++++.||+.+|+||++++++||++++++-++..+-.-|-.- T Consensus 374 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:46 374 MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEeeccCCCCCCccCCC Confidence 677899999999999999999999999887766654433333 No 55 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=5.9e-55 Score=317.87 Aligned_cols=350 Identities=8% Similarity=0.005 Sum_probs=239.5 Q ss_pred hhHHHHHHHHHHHHHHHH----hhhhh--HHHHHHHHHH---HHHHHHHHHHHHHH----H------------------- Q lcl|NC_019921. 3 INLSETFANAKNEFINAV----NNGEP--QERQNELYGD---MINQLFEETKLQAK----A------------------- 50 (381) Q Consensus 3 ~el~~~~~~~~~~~~~~~----k~~~~--~~~~~~~~~~---~~~~~~~~~~~~~~----~------------------- 50 (381) |+..+++.+..+++.+.+ ++.+. .+++.+.++. .++.+..++..... . T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:79 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchh Confidence 444444444444433322 11110 0111111111 11111111110000 0 Q ss_pred ------HHHHHHHhhccccccCHHHHHHHHHH---------hhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec Q lcl|NC_019921. 51 ------EAERVSSLPKSAQSLSANQRSFFMDI---------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 51 ------~~~~~~~~~~~~~~lt~~e~~~~~~~---------~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~ 115 (381) ..............+..++++.+... ...+..+||++||+++.+.|++.+++.+||+++|+++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:79 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeec Confidence 00000000111122334444444221 112455789999999999999999999999999999987 Q ss_pred C-C--ceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019921. 116 G-L--RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE 192 (381) Q Consensus 116 ~-g--~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~ 192 (381) + + .+.+|+..+.+.+.|++|+++.++.+.++|+++++.+|+++++++||++||+||.+|+++||.++|+++++++++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~ 240 (415) T protein:vir:79 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) T ss_pred cCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 4 3 355666777788999999998887678999999999999999999999999999999999999999999999999 Q ss_pred hheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchh Q lcl|NC_019921. 193 TAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) Q Consensus 193 ~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~ 272 (381) .+|++|+|+++|.+.+........ .+......+++.+.+++..+ ...|..+++|+||+. T Consensus 241 ~~il~g~g~g~~~~~~~~~~~~~~--------------~~~~~~~~~~~~i~~~~~~~-------~~~~~~~~~~v~n~~ 299 (415) T protein:vir:79 241 KAIIDVITKGSTGSTSSGFEKEGK--------------KLEVKKAKSLDDIKDAINLN-------VKPNYEHNVAIVSQT 299 (415) T ss_pred HHHhhccccCcccccccccccccc--------------ccccccccchhHHHHHHHhh-------hhhccCCCEEEEcHH Confidence 999999999988776533211111 11122234466666665544 234667889999999 Q ss_pred hHHHHhhhhhccCCCCccccc---------cCCCceeEecCCCCCCc-----EEEEeecc-eEEEeecceEEEeehhhhh Q lcl|NC_019921. 273 DAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEAGK-----VLTYVKGL-YDGYLAGGINVQKFKETLA 337 (381) Q Consensus 273 t~~~~~~~~~~~~~~G~~~~~---------l~~G~pVv~s~~~p~~~-----i~fgd~~~-y~i~~r~~i~i~~~~~~~~ 337 (381) ++..+.. +++.+|+|+|. ..+|+||+.++++|.+. ++||||++ |++++|+++++..+++ T Consensus 300 ~~~~l~~---lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~--- 373 (415) T protein:vir:79 300 MFAKLDK---MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY--- 373 (415) T ss_pred HHHHHHH---hhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEecc--- Confidence 9887764 58899999874 34799999999998543 89999998 7899999999999874 Q ss_pred hcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcc Q lcl|NC_019921. 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 338 ~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~ 379 (381) ..++++||+.+|+||++++++||++++++-++..+-.-|-.- T Consensus 374 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:79 374 MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEEeccCCCCCccccCC Confidence 567789999999999999999999999887665544333222 No 56 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=5.9e-55 Score=317.87 Aligned_cols=350 Identities=8% Similarity=0.005 Sum_probs=239.5 Q ss_pred hhHHHHHHHHHHHHHHHH----hhhhh--HHHHHHHHHH---HHHHHHHHHHHHHH----H------------------- Q lcl|NC_019921. 3 INLSETFANAKNEFINAV----NNGEP--QERQNELYGD---MINQLFEETKLQAK----A------------------- 50 (381) Q Consensus 3 ~el~~~~~~~~~~~~~~~----k~~~~--~~~~~~~~~~---~~~~~~~~~~~~~~----~------------------- 50 (381) |+..+++.+..+++.+.+ ++.+. .+++.+.++. .++.+..++..... . T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:81 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchh Confidence 444444444444433322 11110 0111111111 11111111110000 0 Q ss_pred ------HHHHHHHhhccccccCHHHHHHHHHH---------hhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec Q lcl|NC_019921. 51 ------EAERVSSLPKSAQSLSANQRSFFMDI---------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 51 ------~~~~~~~~~~~~~~lt~~e~~~~~~~---------~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~ 115 (381) ..............+..++++.+... ...+..+||++||+++.+.|++.+++.+||+++|+++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:81 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeec Confidence 00000000111122334444444221 112455789999999999999999999999999999987 Q ss_pred C-C--ceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019921. 116 G-L--RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE 192 (381) Q Consensus 116 ~-g--~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~ 192 (381) + + .+.+|+..+.+.+.|++|+++.++.+.++|+++++.+|+++++++||++||+||.+|+++||.++|+++++++++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~ 240 (415) T protein:vir:81 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) T ss_pred cCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 4 3 355666777788999999998887678999999999999999999999999999999999999999999999999 Q ss_pred hheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchh Q lcl|NC_019921. 193 TAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) Q Consensus 193 ~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~ 272 (381) .+|++|+|+++|.+.+........ .+......+++.+.+++..+ ...|..+++|+||+. T Consensus 241 ~~il~g~g~g~~~~~~~~~~~~~~--------------~~~~~~~~~~~~i~~~~~~~-------~~~~~~~~~~v~n~~ 299 (415) T protein:vir:81 241 KAIIDVITKGSTGSTSSGFEKEGK--------------KLEVKKAKSLDDIKDAINLN-------VKPNYEHNVAIVSQT 299 (415) T ss_pred HHHhhccccCcccccccccccccc--------------ccccccccchhHHHHHHHhh-------hhhccCCCEEEEcHH Confidence 999999999988776533211111 11122234466666665544 234667889999999 Q ss_pred hHHHHhhhhhccCCCCccccc---------cCCCceeEecCCCCCCc-----EEEEeecc-eEEEeecceEEEeehhhhh Q lcl|NC_019921. 273 DAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEAGK-----VLTYVKGL-YDGYLAGGINVQKFKETLA 337 (381) Q Consensus 273 t~~~~~~~~~~~~~~G~~~~~---------l~~G~pVv~s~~~p~~~-----i~fgd~~~-y~i~~r~~i~i~~~~~~~~ 337 (381) ++..+.. +++.+|+|+|. ..+|+||+.++++|.+. ++||||++ |++++|+++++..+++ T Consensus 300 ~~~~l~~---lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~--- 373 (415) T protein:vir:81 300 MFAKLDK---MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY--- 373 (415) T ss_pred HHHHHHH---hhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEecc--- Confidence 9887764 58899999874 34799999999998543 89999998 7899999999999874 Q ss_pred hcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcc Q lcl|NC_019921. 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 338 ~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~ 379 (381) ..++++||+.+|+||++++++||++++++-++..+-.-|-.- T Consensus 374 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:81 374 MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEEeccCCCCCccccCC Confidence 567789999999999999999999999887665544333222 No 57 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=5.9e-55 Score=317.87 Aligned_cols=350 Identities=8% Similarity=0.005 Sum_probs=239.5 Q ss_pred hhHHHHHHHHHHHHHHHH----hhhhh--HHHHHHHHHH---HHHHHHHHHHHHHH----H------------------- Q lcl|NC_019921. 3 INLSETFANAKNEFINAV----NNGEP--QERQNELYGD---MINQLFEETKLQAK----A------------------- 50 (381) Q Consensus 3 ~el~~~~~~~~~~~~~~~----k~~~~--~~~~~~~~~~---~~~~~~~~~~~~~~----~------------------- 50 (381) |+..+++.+..+++.+.+ ++.+. .+++.+.++. .++.+..++..... . T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:98 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchh Confidence 444444444444433322 11110 0111111111 11111111110000 0 Q ss_pred ------HHHHHHHhhccccccCHHHHHHHHHH---------hhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec Q lcl|NC_019921. 51 ------EAERVSSLPKSAQSLSANQRSFFMDI---------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 51 ------~~~~~~~~~~~~~~lt~~e~~~~~~~---------~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~ 115 (381) ..............+..++++.+... ...+..+||++||+++.+.|++.+++.+||+++|+++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:98 81 RTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeec Confidence 00000000111122334444444221 112455789999999999999999999999999999987 Q ss_pred C-C--ceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019921. 116 G-L--RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE 192 (381) Q Consensus 116 ~-g--~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~ 192 (381) + + .+.+|+..+.+.+.|++|+++.++.+.++|+++++.+|+++++++||++||+||.+|+++||.++|+++++++++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~ 240 (415) T protein:vir:98 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) T ss_pred cCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 4 3 355666777788999999998887678999999999999999999999999999999999999999999999999 Q ss_pred hheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchh Q lcl|NC_019921. 193 TAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) Q Consensus 193 ~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~ 272 (381) .+|++|+|+++|.+.+........ .+......+++.+.+++..+ ...|..+++|+||+. T Consensus 241 ~~il~g~g~g~~~~~~~~~~~~~~--------------~~~~~~~~~~~~i~~~~~~~-------~~~~~~~~~~v~n~~ 299 (415) T protein:vir:98 241 KAIIDVITKGSTGSTSSGFEKEGK--------------KLEVKKAKSLDDIKDAINLN-------VKPNYEHNVAIVSQT 299 (415) T ss_pred HHHhhccccCcccccccccccccc--------------ccccccccchhHHHHHHHhh-------hhhccCCCEEEEcHH Confidence 999999999988776533211111 11122234466666665544 234667889999999 Q ss_pred hHHHHhhhhhccCCCCccccc---------cCCCceeEecCCCCCCc-----EEEEeecc-eEEEeecceEEEeehhhhh Q lcl|NC_019921. 273 DAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEAGK-----VLTYVKGL-YDGYLAGGINVQKFKETLA 337 (381) Q Consensus 273 t~~~~~~~~~~~~~~G~~~~~---------l~~G~pVv~s~~~p~~~-----i~fgd~~~-y~i~~r~~i~i~~~~~~~~ 337 (381) ++..+.. +++.+|+|+|. ..+|+||+.++++|.+. ++||||++ |++++|+++++..+++ T Consensus 300 ~~~~l~~---lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~--- 373 (415) T protein:vir:98 300 MFAKLDK---MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY--- 373 (415) T ss_pred HHHHHHH---hhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEecc--- Confidence 9887764 58899999874 34799999999998543 89999998 7899999999999874 Q ss_pred hcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcc Q lcl|NC_019921. 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 338 ~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~ 379 (381) ..++++||+.+|+||++++++||++++++-++..+-.-|-.- T Consensus 374 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:98 374 MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEEeccCCCCCccccCC Confidence 567789999999999999999999999887665544333222 No 58 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=7.2e-55 Score=317.41 Aligned_cols=350 Identities=9% Similarity=0.008 Sum_probs=241.3 Q ss_pred hhHHHHHHHHHHHHHHHHhhhhh---------HHHHHHHHHHHHHHHHHHHHHHH----HHH------------------ Q lcl|NC_019921. 3 INLSETFANAKNEFINAVNNGEP---------QERQNELYGDMINQLFEETKLQA----KAE------------------ 51 (381) Q Consensus 3 ~el~~~~~~~~~~~~~~~k~~~~---------~~~~~~~~~~~~~~~~~~~~~~~----~~~------------------ 51 (381) ||..+++.++.+++.+.+.+... +.++.......++.+..+..... +.+ T Consensus 1 mk~~~el~~~l~el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:94 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccch Confidence 44444444444333322211100 00111111111222211111000 000 Q ss_pred -------HHHHHHhhccccccCHHHHHHHHHH---------hhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec Q lcl|NC_019921. 52 -------AERVSSLPKSAQSLSANQRSFFMDI---------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 52 -------~~~~~~~~~~~~~lt~~e~~~~~~~---------~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~ 115 (381) ...........+.+..+|++.|... ...+..+||++||++++++|++.+++.++|+++|+++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 160 (415) T protein:vir:94 81 STYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV 160 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeec Confidence 0000111112223445555554321 112456789999999999999999999999999999997 Q ss_pred C-C--ceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019921. 116 G-L--RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALE 192 (381) Q Consensus 116 ~-g--~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~ 192 (381) + + .+.+|+..+.+.+.|++|++..++.+.++|+++++.+|+++++++||++||+||.+|+++||.++|+++++++++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~ 240 (415) T protein:vir:94 161 TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRN 240 (415) T ss_pred cCCceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHH Confidence 5 3 345666777888999999988876678999999999999999999999999999999999999999999999999 Q ss_pred hheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchh Q lcl|NC_019921. 193 TAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) Q Consensus 193 ~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~ 272 (381) .+|++|+|+++|.++.......... .......+++.+.+++..+ ...|..++.|+||+. T Consensus 241 ~~il~g~g~g~~~~~~~~~~~~~~~--------------~~~~~~~~~~~i~~~~~~~-------~~~~~~~~~~vmn~~ 299 (415) T protein:vir:94 241 KAIIDVITKGSTGSTSSGFEKEGKK--------------LEVKKAKSLDDIKDAINLN-------VKPNYEHNVAIVSQT 299 (415) T ss_pred HHHhhccccCccccccccccccccc--------------cccccccchHHHHHHHHhh-------hhhccCCCEEEEcHH Confidence 9999999999988765432211110 1112233456666665543 234566889999999 Q ss_pred hHHHHhhhhhccCCCCccccc---------cCCCceeEecCCCCCCc-----EEEEeecc-eEEEeecceEEEeehhhhh Q lcl|NC_019921. 273 DAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEAGK-----VLTYVKGL-YDGYLAGGINVQKFKETLA 337 (381) Q Consensus 273 t~~~~~~~~~~~~~~G~~~~~---------l~~G~pVv~s~~~p~~~-----i~fgd~~~-y~i~~r~~i~i~~~~~~~~ 337 (381) ++..+.. +++.+|+|++. ..+|+||+.+++||.+. ++||||++ |++++|+++++..++ | T Consensus 300 ~~~~l~~---lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~---~ 373 (415) T protein:vir:94 300 MFAKLDK---MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD---Y 373 (415) T ss_pred HHHHHHH---hhccCCCeeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec---c Confidence 9877764 48899999874 24799999999998643 89999998 789999999999987 4 Q ss_pred hcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcc Q lcl|NC_019921. 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 338 ~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~ 379 (381) .++++.||+..|+||++++++||++++++-++..+-.-|-.- T Consensus 374 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:94 374 MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEEeccCCCCCccccCC Confidence 677899999999999999999999999887665543333222 No 59 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=9.9e-55 Score=316.65 Aligned_cols=341 Identities=12% Similarity=0.010 Sum_probs=232.2 Q ss_pred Cchh-HHHHHHHHHHHHHH---HHhhhhhHHH--HHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHhhcccc--- Q lcl|NC_019921. 1 MTIN-LSETFANAKNEFIN---AVNNGEPQER--QNELYGDMINQLFEETKL--------QAKAEAERVSSLPKSAQ--- 63 (381) Q Consensus 1 mt~e-l~~~~~~~~~~~~~---~~k~~~~~~~--~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~--- 63 (381) |+++ |++++.+..+++.+ .++....++. ........++.+.++... +...+..+......... T Consensus 1 M~~~eL~~~~~~~~~~~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (395) T protein:vir:38 1 MNINQLKDAFDMAGQKVQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVNKKP 80 (395) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 9997 77766666544333 2222111110 000000111111111110 00000001000000000 Q ss_pred ----ccCHH-------HHHHH-HHHh--hccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC---CceEEEEe-c Q lcl|NC_019921. 64 ----SLSAN-------QRSFF-MDIN--KNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG---LRLKFLKS-E 125 (381) Q Consensus 64 ----~lt~~-------e~~~~-~~~~--~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~---g~~~~p~~-~ 125 (381) ....+ .++.+ +.+. ..+.++||++||+++++.|++.+++.++|+++|++++++ +.+.++.. + T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 160 (395) T protein:vir:38 81 LPVKDGKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLAD 160 (395) T ss_pred cchhhhhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeecc Confidence 00000 11111 1112 234557999999999999999999999999999998864 34455544 4 Q ss_pred CCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcce Q lcl|NC_019921. 126 TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI 205 (381) Q Consensus 126 ~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~ 205 (381) ..+.+.|++|++..+..+.++|++|++.+|+++++++||++||+|+.+||++||.++|+++++++++.+|++|+|+++|. T Consensus 161 ~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~ 240 (395) T protein:vir:38 161 ITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPKK 240 (395) T ss_pred CCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 45678999998887766679999999999999999999999999999999999999999999999999999999988765 Q ss_pred EeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC Q lcl|NC_019921. 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN 285 (381) Q Consensus 206 Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~ 285 (381) +... .++.+.+++.. .....|+.+++|+||+.++..+.. .++ T Consensus 241 ~~~~-----------------------------~~~~i~~~~~~------~l~~~~~~~a~~v~n~~~~~~L~~---lkd 282 (395) T protein:vir:38 241 PTIS-----------------------------QFDNIKDLENN------TLDPAIESTSSFITNQSGYNILSK---VKD 282 (395) T ss_pred cccc-----------------------------cHHHHHHHHHH------hhhhhhcCCCEEEEcHHHHHHHHH---hhc Confidence 3221 12223333221 124568889999999999877764 478 Q ss_pred CCCccccc---------cCCCceeEecCCCCC------CcEEEEeecc-eEEEeecceEEEeehh--hhhhcCceEEEEE Q lcl|NC_019921. 286 ANGVYVTA---------LPFNLNVIESTVQEA------GKVLTYVKGL-YDGYLAGGINVQKFKE--TLALDDMDLYTAK 347 (381) Q Consensus 286 ~~G~~~~~---------l~~G~pVv~s~~~p~------~~i~fgd~~~-y~i~~r~~i~i~~~~~--~~~~~d~~~~r~~ 347 (381) .+|+|+|. ..+|+||+.+++++. ..|+||||++ |.+++|++++|+.+++ .+|.+|+++||+. T Consensus 283 ~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~ 362 (395) T protein:vir:38 283 ADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFI 362 (395) T ss_pred cCCceeeccCcCCCCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEE Confidence 89999884 247999999887542 2389999997 8899999999999885 5699999999999 Q ss_pred EEEcCEEecCceEEEEEEEecCCccccc-cCcc Q lcl|NC_019921. 348 QFAYGKAKDNKVAAVWKLDLKGHKPALE-GTEE 379 (381) Q Consensus 348 ~r~dGk~~~~~Afvv~~~~~~~~~~~~~-~~~~ 379 (381) .|+|+++.+++||++++++-++.+++.+ .+=- T Consensus 363 ~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 395 (395) T protein:vir:38 363 DRFDVQLIDDGAFAAASFKTVANQAQGTAGTGK 395 (395) T ss_pred EeeccEEecccceEEEEeecccCCCCCccCCCC Confidence 9999999999999998877665553332 2222 No 60 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=2.1e-55 Score=320.36 Aligned_cols=345 Identities=13% Similarity=0.082 Sum_probs=232.6 Q ss_pred CchhHH-------------------HH-------HHHHHHHHHHHHhhhhhHHHHH--HH-------------------- Q lcl|NC_019921. 1 MTINLS-------------------ET-------FANAKNEFINAVNNGEPQERQN--EL-------------------- 32 (381) Q Consensus 1 mt~el~-------------------~~-------~~~~~~~~~~~~k~~~~~~~~~--~~-------------------- 32 (381) |..+.+ ++ +.+.+++..+.+...+...... +. T Consensus 224 ~~~~E~~r~~eI~~l~~~~~~~~~~~~ai~~g~sld~~ra~~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re 303 (632) T protein:vir:96 224 ILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKE 303 (632) T ss_pred hhhhhHHHHHHHHHHHHHhhhhhhHHHHHhccccHHHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHH Confidence 110000 00 1111111122211110000000 00 Q ss_pred --HHHHHHHHHHHHHH---HH--HHHHHHHHHhhccccc---cCHHHHHHHHHHhhccCCCCceeccHHH-HHHHHHHHH Q lcl|NC_019921. 33 --YGDMINQLFEETKL---QA--KAEAERVSSLPKSAQS---LSANQRSFFMDINKNVNYKEEKLLPEET-IDRIFEDLT 101 (381) Q Consensus 33 --~~~~~~~~~~~~~~---~~--~~~~~~~~~~~~~~~~---lt~~e~~~~~~~~~~~~~~gg~lvP~~~-~~~I~~~l~ 101 (381) -..+.+.+...... .. ..+.........+... ..+.+.-.-+++.++++++||++||+++ .+.|++.++ T Consensus 304 ~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr 383 (632) T protein:vir:96 304 LQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILR 383 (632) T ss_pred HHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHh Confidence 00000000000000 00 0000001111111000 0001111123556678889999999886 688999999 Q ss_pred hhhhhhhh-ceeEec-CCceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHH Q lcl|NC_019921. 102 TNHPLLAD-LGIKNA-GLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFV 179 (381) Q Consensus 102 ~~~~l~~~-~~v~~~-~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l 179 (381) ..++++++ +++++. +|++++|+.++.+.++|++|+++.+ +++++|+++++.+|+++++++||++||+||.+++++|| T Consensus 384 ~~s~i~~l~~~~~~~~~g~~~ip~~~~~~~a~wv~E~~~~~-~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i 462 (632) T protein:vir:96 384 NKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQ-DSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLI 462 (632) T ss_pred hcchhhhhcceEeecCCcceEEEEEeCCceeEeecCCcccc-ccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHH Confidence 99999998 677775 5789999999999999999988776 57899999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhheeeccCC-CcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhcccccc Q lcl|NC_019921. 180 RVQIEEAFAVALETAFLKGTGK-DQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKS 258 (381) Q Consensus 180 ~~~la~~~~~~~~~a~i~G~G~-~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~ 258 (381) +++|++++++++|.+||+|+|+ ++|.||++..+..+. +........+.+.++...+.... T Consensus 463 ~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~---------------~~~~~~~~~~~i~~~~~~i~~~~---- 523 (632) T protein:vir:96 463 REDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPAL---------------TYPAGGVDWASVVDMETKISTFN---- 523 (632) T ss_pred HHHHHHHHHHHHHHHhhcccCCCCccceeeecccccce---------------ecccccCCHHHHHHHHHHHhhcc---- Confidence 9999999999999999999996 689999865332111 11112233444555544332211 Q ss_pred ccccCceEEEEchhhHHHHhhhhhccCCCCccccc--cCCCceeEecCCCCCCcEEEEeecceEEEeecceEEEeehhhh Q lcl|NC_019921. 259 VAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA--LPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETL 336 (381) Q Consensus 259 ~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~--l~~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~ 336 (381) ...+++.|+||+.+...++. ...++.+|+|+|. .++|+||+.++.||+++++||||++|+++++++++|.++++.+ T Consensus 524 -~~~~~~~~~~~~~~~~~l~~-~~l~d~~G~~i~~~~~l~G~pv~~s~~ip~~~~~~gd~s~~~i~~~~~~~i~~~~~~~ 601 (632) T protein:vir:96 524 -ADAGRLAYLTSVTQRGAAKK-AQVFDNTGERIWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTK 601 (632) T ss_pred -cccCccEEEEchhHHHHHHH-HhccCCCCceeecCCeecccceEeccccccCcEEEeecceEEEEEecceEEEEccccc Confidence 12457889999987765543 2357889999985 4479999999999999999999999999999999999999999 Q ss_pred hhcCceEEEEEEEEcCEEecCceEEEEEEEecC Q lcl|NC_019921. 337 ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 337 ~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~ 369 (381) |.+|++.||+++|+|+++++++||++++ .++ T Consensus 602 ~~~~~v~~~~~~~~d~~v~~~~af~~~k--~~A 632 (632) T protein:vir:96 602 AASDGLVLRVFQDVDAGVRRKEAFCIAK--KGA 632 (632) T ss_pred cccCceEEEEEeecCceeechhhhhhee--ecC Confidence 9999999999999999999999999854 444 No 61 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=1.9e-56 Score=326.09 Aligned_cols=293 Identities=14% Similarity=0.023 Sum_probs=224.6 Q ss_pred HHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceEE Q lcl|NC_019921. 54 RVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVW 132 (381) Q Consensus 54 ~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~~~~a~w 132 (381) -++...|....+..+|++.+ .++++++|| +||++++++|++.+++.+||+++|+++++++ ..++|+.++.+.+.| T Consensus 1 ~~~~~~r~~~~~~~~e~~a~---~~~~~~~g~-~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~ 76 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVA---QTGDSMFEG-YLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGDVSASW 76 (326) T ss_pred CCCCccchhhhcCcchhhhe---eccccCCcc-eechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCCcceEE Confidence 01111112222334444433 344444444 6999999999999999999999999999865 589999999999999 Q ss_pred eecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccc Q lcl|NC_019921. 133 GKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQ 212 (381) Q Consensus 133 v~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~ 212 (381) ++|+++++ +++++|+++++.+||++++++||+|||+||.+++++||.++|++++++++|.+|++|+|+++|.||+.... T Consensus 77 v~Eg~~~~-~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~ 155 (326) T protein:vir:42 77 IGEGDMKP-ITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTTK 155 (326) T ss_pred ecCCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccc Confidence 99987766 57899999999999999999999999999999999999999999999999999999999999999986543 Q ss_pred cccccccccccceeeeeeecccccchhHHH--HHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCcc Q lcl|NC_019921. 213 KGVSVTEGAYPEKEEQGTLTFANPRATVNE--LTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVY 290 (381) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~--l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~ 290 (381) ......... +.......... +...... ....+..+++|+||+.++..+++ +++++|+| T Consensus 156 ~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~a~~v~n~~~~~~L~~---lkd~~G~~ 215 (326) T protein:vir:42 156 EVSLVDPDG----------TGSNADLTVYDAVAVNALSL-------LVNAGKKWTHTLLDDITEPILNG---AKDKSGRP 215 (326) T ss_pred ccceeeccc----------ccccccchhHHHHHHHHHhh-------hhhhccCccEEEEeHHHHHHHHH---hhccCCce Confidence 322211110 11111111111 1111111 13346678899999999887765 47888888 Q ss_pred cccc--------------CCCceeEecCCCCCCc--EEEEeecceEEEeecceEEEeehhhh--------------hhcC Q lcl|NC_019921. 291 VTAL--------------PFNLNVIESTVQEAGK--VLTYVKGLYDGYLAGGINVQKFKETL--------------ALDD 340 (381) Q Consensus 291 ~~~l--------------~~G~pVv~s~~~p~~~--i~fgd~~~y~i~~r~~i~i~~~~~~~--------------~~~d 340 (381) +|.. ++|+||+.+++||+++ ++||||++|+++++++++|+++++.+ |.+| T Consensus 216 l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d 295 (326) T protein:vir:42 216 LFIESTYTEENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHN 295 (326) T ss_pred eeccccccCccccccCceeeeeeEEEcCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcC Confidence 8642 4799999999999987 46899999999999999999999876 8889 Q ss_pred ceEEEEEEEEcCEEecCceEEEEEEEecCCccc Q lcl|NC_019921. 341 MDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPA 373 (381) Q Consensus 341 ~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~ 373 (381) +++||+.+|+|+++++++||+.++-+. ...| T Consensus 296 ~~~~r~~~~~d~~v~~~~a~~~l~~~~--~~~~ 326 (326) T protein:vir:42 296 LVAVRVEAEYAFHCNDKDAFVKLTNVD--ATEA 326 (326) T ss_pred cEEEEEEEEeccEEecccceEEEeecc--ccCC Confidence 999999999999999999998755333 3333 No 62 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=3.4e-56 Score=324.67 Aligned_cols=301 Identities=14% Similarity=0.047 Sum_probs=230.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC Q lcl|NC_019921. 37 INQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG 116 (381) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~ 116 (381) +++. +.. +.+..+..... .+.+.+++.......+||++||++++++|++.+++.++|+++|++++++ T Consensus 1 ~~~~--~~~---~~~~~~f~~~~--------~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~ 67 (324) T protein:vir:97 1 MEQT--QKL---KLNLQHFASNN--------VKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME 67 (324) T ss_pred Cccc--hhH---HHHHHHHHHhh--------hhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeecc Confidence 0000 000 00000000000 1111223333445667899999999999999999999999999999986 Q ss_pred C-ceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhhe Q lcl|NC_019921. 117 L-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF 195 (381) Q Consensus 117 g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~ 195 (381) + .+++|+.++.+.+.|++|++..+ +++++|+++++.+||++++++||+|||+||.+++++||.+++++++++++|.+| T Consensus 68 ~~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~ 146 (324) T protein:vir:97 68 GTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAG 146 (324) T ss_pred CCceEEEEEecCcceeEeccCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHh Confidence 5 58999999999999999988765 678999999999999999999999999999999999999999999999999999 Q ss_pred eeccCCC-cceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhH Q lcl|NC_019921. 196 LKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDA 274 (381) Q Consensus 196 i~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~ 274 (381) |+|+|++ +|.||+.......... ....+++.+.++...+ ...++.+++|+||+.++ T Consensus 147 l~G~g~~~~~~gi~~~~~~~~~~~----------------~~~~~~~~i~~~~~~l-------~~~~~~~~~~v~n~~~~ 203 (324) T protein:vir:97 147 ILNQGNNPFGKSIAQSIEKTNKVI----------------KGDFTQDNIIDLEALL-------EDDELEANAFISKTQNR 203 (324) T ss_pred hccCCCCccCccccccccccceec----------------cccCCHHHHHHHHHhh-------hhccCCCCEEEEcHHHH Confidence 9999986 7889886543222111 1223455666665544 33566778999999998 Q ss_pred HHHhhhhhccCCCCccccc-----cCCCceeEecCCCC--CCcEEEEeecceEEEeecceEEEeehhhh----------- Q lcl|NC_019921. 275 FEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQE--AGKVLTYVKGLYDGYLAGGINVQKFKETL----------- 336 (381) Q Consensus 275 ~~~~~~~~~~~~~G~~~~~-----l~~G~pVv~s~~~p--~~~i~fgd~~~y~i~~r~~i~i~~~~~~~----------- 336 (381) ..++. .++++|+|++. ..+|+||+.+++++ ++.++||||++|++++|++++|++++|.. T Consensus 204 ~~L~~---lkd~~g~~~~~~~~~~tl~G~PV~~~~~~~~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~ 280 (324) T protein:vir:97 204 SLLRK---IVDPETKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTP 280 (324) T ss_pred HHHHH---hhcCCCceeecCCCCccccceeeEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccc Confidence 87764 47888988764 34899999988755 56699999999999999999999998854 Q ss_pred ---hhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 337 ---ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 337 ---~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) |.+|+++||+.+|+|+++++++||++++...++. +.||-.+ T Consensus 281 ~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~----~~~~~~~ 324 (324) T protein:vir:97 281 VNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKT----DSVPGEV 324 (324) T ss_pred hhhhhcCcEEEEEEEEeccEEecccceEEEEeccCCC----CCCCCCC Confidence 8999999999999999999999999877655433 3455555 No 63 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=2.2e-56 Score=325.68 Aligned_cols=272 Identities=11% Similarity=0.066 Sum_probs=226.2 Q ss_pred ccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc-eEEEEecCCcceEEeecccc Q lcl|NC_019921. 60 KSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKFLKSETSGVAVWGKIYGE 138 (381) Q Consensus 60 ~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~-~~~p~~~~~~~a~wv~e~~~ 138 (381) .| +++....+.++||++||++++++|++.+++.++|+++|+++|+++. .++|+.+. +.+.|++|+++ T Consensus 1 ~g-----------~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~-~~a~~v~E~~~ 68 (299) T protein:vir:41 1 MG-----------FNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMSG-VGAFWVDEAER 68 (299) T ss_pred CC-----------cCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEcC-CceeeeecCcc Confidence 11 1233344566888999999999999999999999999999998665 67787654 77999999877 Q ss_pred cccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccc Q lcl|NC_019921. 139 IKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVT 218 (381) Q Consensus 139 ~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~ 218 (381) .+ +++++|+++++.+||++++++||+|||+||.+++++||.+.|++++++++|.+|++|+|+++|.||++......... T Consensus 69 ~~-~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~ 147 (299) T protein:vir:41 69 IQ-TSKPTFTKAKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLV 147 (299) T ss_pred cc-ccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceee Confidence 65 57899999999999999999999999999999999999999999999999999999999999999997644332211 Q ss_pred cccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc----- Q lcl|NC_019921. 219 EGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA----- 293 (381) Q Consensus 219 ~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~----- 293 (381) . .....++.+.+++..+ ...+..+++|+||+.++..+.+ .++.+|+|++. T Consensus 148 ~---------------~~~~~~~~l~~~~~~l-------~~~~~~~~~~v~n~~~~~~L~~---lkd~~G~~l~~~~~~~ 202 (299) T protein:vir:41 148 E---------------ETANKYDDLNEAIGLI-------EAEDLEPNGIATIRKQRVKYRS---TKDGNGMPIFNTATSN 202 (299) T ss_pred c---------------cccccHHHHHHHHHhh-------hcccCCcCEEEEcHHHHHHHHH---hhccCCceeecCCcCC Confidence 1 1112345565555443 2345667889999999888765 47888998874 Q ss_pred ---cCCCceeEecCCCCCCc----EEEEeecceEEEeecceEEEeehhhh--------------hhcCceEEEEEEEEcC Q lcl|NC_019921. 294 ---LPFNLNVIESTVQEAGK----VLTYVKGLYDGYLAGGINVQKFKETL--------------ALDDMDLYTAKQFAYG 352 (381) Q Consensus 294 ---l~~G~pVv~s~~~p~~~----i~fgd~~~y~i~~r~~i~i~~~~~~~--------------~~~d~~~~r~~~r~dG 352 (381) ..+|+||+.+++||.++ ++||||++|++++|++++++++++.+ |.+|++.||+.+|+|+ T Consensus 203 ~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~ 282 (299) T protein:vir:41 203 GVDDVLGLPIAYTPKYTFGDKDISELVGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGF 282 (299) T ss_pred CCceecceeeEEecccCCCCCceEEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEecc Confidence 24799999999999876 89999999999999999999999875 7899999999999999 Q ss_pred EEecCceEEEEEEEecC Q lcl|NC_019921. 353 KAKDNKVAAVWKLDLKG 369 (381) Q Consensus 353 k~~~~~Afvv~~~~~~~ 369 (381) ++++++||++++.+-+. T Consensus 283 ~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 283 MVVKDEAFSAVQPKAGN 299 (299) T ss_pred EEecccceEEEEeccCC Confidence 99999999998877766 No 64 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=2.8e-56 Score=325.15 Aligned_cols=336 Identities=13% Similarity=0.041 Sum_probs=228.4 Q ss_pred HHHHHHHHHHHH--HH-HHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHHHhhccC Q lcl|NC_019921. 5 LSETFANAKNEF--IN-AVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVN 81 (381) Q Consensus 5 l~~~~~~~~~~~--~~-~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~ 81 (381) |..+...-.+.- .. .+...+.+..+...+.++...+........+... . ...... .++. -..+. .+. T Consensus 1 ~a~~~a~~~~~~~~~~~~~~~~~~~~~kg~~~~~~~~a~a~~~g~~~~a~~--~-a~~~~~----~~~~--~~a~~-~~~ 70 (366) T protein:vir:57 1 MAAAVAVPVKAHSVAPGIIIKEELQQYKGAGMTRMVMSIAAGKGNLADAAK--F-AATELG----DTGL--SMAIS-TAA 70 (366) T ss_pred CcccccccccccccccccccccccccccchhHHHHHHHHHhcccchhHHHH--H-HHHhhc----chhh--hhhcc-ccc Confidence 111111111100 00 0000011111112223332222111111111100 0 000000 0111 11223 344 Q ss_pred CCCceeccHHHHHHHHHHHHhhhhhhhh-ceeEec-CCceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEE Q lcl|NC_019921. 82 YKEEKLLPEETIDRIFEDLTTNHPLLAD-LGIKNA-GLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTA 159 (381) Q Consensus 82 ~~gg~lvP~~~~~~I~~~l~~~~~l~~~-~~v~~~-~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~ 159 (381) .+||++||+++.++|++.+++.++++++ ++++++ +|.+++|+.++.+.++|++|+++.+ +++++|+++++.+||+++ T Consensus 71 ~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~~~a~wv~E~~~~~-~s~~~f~~i~~~~~k~~~ 149 (366) T protein:vir:57 71 GSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGGATAGYVGEGKDVV-ATGATFDDVKLSAKTMIA 149 (366) T ss_pred cCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCCcceeeeccCcccc-ccccceeEEEEeeEEEEE Confidence 5799999999999999999999999998 888875 5679999999999999999987765 578999999999999999 Q ss_pred eeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCC-cceEeeeccccccccccccccceeeeeeecccccch Q lcl|NC_019921. 160 FVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRA 238 (381) Q Consensus 160 ~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~ 238 (381) +++||+|||+||.+++++||+++|++++++++|.+|++|+|++ +|+||++..+....... ..+ +..+ .. T Consensus 150 ~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~-------~~~--t~~~-~~ 219 (366) T protein:vir:57 150 LVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVA-------WTG--TAIN-LT 219 (366) T ss_pred eehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceee-------ccc--cccc-hh Confidence 9999999999999999999999999999999999999999974 99999976543221111 000 1111 11 Q ss_pred hHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc-----cCCCceeEecCCCCCC---- Q lcl|NC_019921. 239 TVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQEAG---- 309 (381) Q Consensus 239 ~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~-----l~~G~pVv~s~~~p~~---- 309 (381) .++.+.+.+... ......++.++.|+||+.++..++. +++.+|+|+|. ..+|+||+.+++||++ T Consensus 220 ~~~~~~~~~~~~----~~~~~~~~~~a~~vmn~~~~~~L~~---lkd~~G~~l~~~~~~g~l~G~Pvv~s~~ip~~~~~~ 292 (366) T protein:vir:57 220 TIDEYLDSLILK----HMDSNSNMIRCGWGLSNRTYMTLFG---LRDGNGNKVYPEMSQGILKGYPIQRTSAIPANLGDD 292 (366) T ss_pred hHHHHHHHHHHh----hhccccccccCEEEecHHHHHHHHh---hhccCCceeccCCCCCeecceeeEEccccccccccC Confidence 122222222111 1123457788999999999887765 47899999984 3489999999999962 Q ss_pred ----cEEEEeecceEEEeecceEEEeehhh-----------hhhcCceEEEEEEEEcCEEecCceEEEEEEEecC Q lcl|NC_019921. 310 ----KVLTYVKGLYDGYLAGGINVQKFKET-----------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 310 ----~i~fgd~~~y~i~~r~~i~i~~~~~~-----------~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~ 369 (381) .|+||||++|+|++|++++|+.+++. .|.+|+++||+.+|+|+++.+++||++++ .+.= T Consensus 293 ~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt-~~~~ 366 (366) T protein:vir:57 293 GNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGT-GVIW 366 (366) T ss_pred CCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEe-cccC Confidence 38999999999999999999998874 37789999999999999999999999876 2212 No 65 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=3.9e-55 Score=318.86 Aligned_cols=287 Identities=11% Similarity=0.008 Sum_probs=231.7 Q ss_pred hccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceEEeeccc Q lcl|NC_019921. 59 PKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYG 137 (381) Q Consensus 59 ~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~~~~a~wv~e~~ 137 (381) .+.+..+..+++.. ...+++++|.+||++++++|++.+++.++|+++|+++++++ ..++|+.++.+.+.|++|++ T Consensus 1 ~~~~~~~~~e~~~~----~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~ 76 (318) T protein:vir:24 1 MAAGTAFAVDHAQI----AQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVSAQWIGEGD 76 (318) T ss_pred CCCCCCCCHHHHHh----hcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceEEecCCc Confidence 33345566666643 33455677889999999999999999999999999999865 58999999999999999988 Q ss_pred ccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeecccccccc Q lcl|NC_019921. 138 EIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSV 217 (381) Q Consensus 138 ~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~ 217 (381) +++ +++++|+++++.+||+++++++|+|||+||.+++++||.++|++++++++|.+|++|+|+++|.|++..+...... T Consensus 77 ~~~-~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~ 155 (318) T protein:vir:24 77 MKP-ITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIA 155 (318) T ss_pred ccc-ccccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCccccccccccccc Confidence 776 5789999999999999999999999999999999999999999999999999999999999999998654321111 Q ss_pred ccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc---- Q lcl|NC_019921. 218 TEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---- 293 (381) Q Consensus 218 ~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~---- 293 (381) .. + .......+.+.++...+ ...+..+++|+||+.++..+.. .++.+|+|+|. T Consensus 156 ~~------------~-~~~~~~~~~~~~~~~~~-------~~~~~~~~~~v~n~~~~~~L~~---lkd~~G~~l~~~~~~ 212 (318) T protein:vir:24 156 DT------------T-GATTVYDQVAVNGLSLL-------VNDGKKWTHTLLDDITEPILNG---AKDQNGRPLFIESTY 212 (318) T ss_pred cc------------c-cccchHHHHHHHHHHhh-------ccccCCCCEEEEcHHHHHHHHH---hhccCCceeecCccc Confidence 10 0 00111122233333222 3456778899999999887764 47888988764 Q ss_pred ----------cCCCceeEecCCCCCCc--EEEEeecceEEEeecceEEEeehhhh--------------hhcCceEEEEE Q lcl|NC_019921. 294 ----------LPFNLNVIESTVQEAGK--VLTYVKGLYDGYLAGGINVQKFKETL--------------ALDDMDLYTAK 347 (381) Q Consensus 294 ----------l~~G~pVv~s~~~p~~~--i~fgd~~~y~i~~r~~i~i~~~~~~~--------------~~~d~~~~r~~ 347 (381) .++|+||+.++++|+++ ++||||++|+++++++++|+.++|.. |.+|++.||+. T Consensus 213 ~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~ 292 (318) T protein:vir:24 213 GEAASPFRSGRIVARPTILSDHVVEGTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVE 292 (318) T ss_pred cCccccccCceEEEEeeEEeCCCCCCccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEE Confidence 24689999999999876 58999999999999999999999865 88999999999 Q ss_pred EEEcCEEecCceEEEEEEEecCCccc Q lcl|NC_019921. 348 QFAYGKAKDNKVAAVWKLDLKGHKPA 373 (381) Q Consensus 348 ~r~dGk~~~~~Afvv~~~~~~~~~~~ 373 (381) +|+|+++++++||++++...+++..- T Consensus 293 ~r~d~~v~~~~a~~~i~~~~a~~~~~ 318 (318) T protein:vir:24 293 AEYAFHCNDAEAFVALTNVVSGGGEG 318 (318) T ss_pred EEEccEEecccceEEEEeeccCCCCC Confidence 99999999999999877665554433 No 66 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=4.4e-54 Score=313.10 Aligned_cols=341 Identities=9% Similarity=-0.058 Sum_probs=228.2 Q ss_pred CchhH-HHHHHHHHHHHHHHHh-------hhhhH--HHHHHHHHHHHHHHHHHHHHHHH--HHHHH-----HHH-hhccc Q lcl|NC_019921. 1 MTINL-SETFANAKNEFINAVN-------NGEPQ--ERQNELYGDMINQLFEETKLQAK--AEAER-----VSS-LPKSA 62 (381) Q Consensus 1 mt~el-~~~~~~~~~~~~~~~k-------~~~~~--~~~~~~~~~~~~~~~~~~~~~~~--~~~~~-----~~~-~~~~~ 62 (381) |+|+. .+++.++++++.+..+ +...+ .++.+......+.+.++...... ..... ... ..... T Consensus 1 Mn~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (421) T protein:vir:13 1 MNLFERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGG 80 (421) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 99773 3334444444433322 11000 01111111122222221111000 00000 000 00000 Q ss_pred c----ccCHH-----HHHHHH-HH----------hhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEE Q lcl|NC_019921. 63 Q----SLSAN-----QRSFFM-DI----------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKF 121 (381) Q Consensus 63 ~----~lt~~-----e~~~~~-~~----------~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~~~~ 121 (381) + ..... +++.|. .+ ...++++||++||+++++.|++.+++.++|+++|++++++ +..++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~ 160 (421) T protein:vir:13 81 RVIINGDSKEEKRSLQLSAMSKTIRGIQLSEEERDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKM 160 (421) T ss_pred ccccccchhHHHHHHHHHHHHHhhhccchhHHHhhccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEE Confidence 0 01111 111111 11 1235667999999999999999999999999999999976 45788 Q ss_pred EEecCCc--ceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeecc Q lcl|NC_019921. 122 LKSETSG--VAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGT 199 (381) Q Consensus 122 p~~~~~~--~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~ 199 (381) |+....+ .+.|++|+++.+ +++++|++|++.+|+++++++||++||+||.+|+++||.++|+++++++++.++ T Consensus 161 ~~~~~~~~~~~~~~~E~~~~~-~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i---- 235 (421) T protein:vir:13 161 PVRAGASVDKLANLAKDTELV-KAMLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEI---- 235 (421) T ss_pred EEeecCCccceeecccccccc-ccccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhH---- Confidence 8765544 466788977654 579999999999999999999999999999999999999999999999887655 Q ss_pred CCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhh Q lcl|NC_019921. 200 GKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) Q Consensus 200 G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~ 279 (381) .++|.|+++..+. ..++.+.+++..+ ...|+.+++|+||+.++..+.. T Consensus 236 -~~~~~g~~~~~~~------------------------~~~d~i~~~~~~l-------~~~~~~~a~~v~n~~~~~~l~~ 283 (421) T protein:vir:13 236 -VKQAKAVLAEETI------------------------NDYAGLVKTINSL-------VPNARKRAIIVTNSDGRAYLDG 283 (421) T ss_pred -hhhhhhccccccc------------------------cchHHHHHHHHHh-------hhhhcCCCEEEEcHHHHHHHHH Confidence 4678998743210 1234455554443 2356778999999999877764 Q ss_pred hhhccCCCCccccc--------cCCCceeEecCCCCCC-----cEEEEeecc-eEEEeecceEEEeehhhhhhcCceEEE Q lcl|NC_019921. 280 QYTHLNANGVYVTA--------LPFNLNVIESTVQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKETLALDDMDLYT 345 (381) Q Consensus 280 ~~~~~~~~G~~~~~--------l~~G~pVv~s~~~p~~-----~i~fgd~~~-y~i~~r~~i~i~~~~~~~~~~d~~~~r 345 (381) +++.+|+|+|. ..+|+||+++++||.+ .++||||++ |.+++|++++|+++++.+|.+|+++|| T Consensus 284 ---lkd~~G~~i~~~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~f~~~~~~~r 360 (421) T protein:vir:13 284 ---LMDKQGRPLLKELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEAGYTKNETIAR 360 (421) T ss_pred ---hhcCCCceeecCcCCCCCceecceeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEeecccccccCeeEEE Confidence 47899999985 3489999999999853 389999998 889999999999999999999999999 Q ss_pred EEEEEcCEEecCceEEEEEEEecCCc------cccccCcccC Q lcl|NC_019921. 346 AKQFAYGKAKDNKVAAVWKLDLKGHK------PALEGTEETL 381 (381) Q Consensus 346 ~~~r~dGk~~~~~Afvv~~~~~~~~~------~~~~~~~~~~ 381 (381) +..|+||++++++||+.+.+.-.+.- |+.+.+.++. T Consensus 361 ~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~~~~~~~~~~ 402 (421) T protein:vir:13 361 IIERFDVNSPLDKSSDAEKIRKFGVIVKLQEVLKSSPRSGKN 402 (421) T ss_pred EEeeecceeecchhhheeeecccceeeccccccCCCCcCCCC Confidence 99999999999999876554433322 2222222222 No 67 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=1.1e-54 Score=316.35 Aligned_cols=289 Identities=12% Similarity=0.012 Sum_probs=226.9 Q ss_pred hccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEecCCcceEEeeccc Q lcl|NC_019921. 59 PKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYG 137 (381) Q Consensus 59 ~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~~~~p~~~~~~~a~wv~e~~ 137 (381) ...+..+..+++. +...++.++|.+||++++++|++.+++.++|+++|++++++ +..++|+.++.+.+.|++|++ T Consensus 1 ~~~~~~~~~~~~~----~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~ 76 (320) T protein:vir:10 1 MAAGTAFQVDHAQ----IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGEGD 76 (320) T ss_pred CCCCccCCHHHHH----hhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecCCc Confidence 1122233334332 33445566777899999999999999999999999999975 468999999999999999988 Q ss_pred ccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeecccccccc Q lcl|NC_019921. 138 EIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSV 217 (381) Q Consensus 138 ~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~ 217 (381) +++ +++++|+++++.+||++++++||+|||+||.+++++||.+++++++++++|.+|++|+|+++|.|++......... T Consensus 77 ~~~-~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~ 155 (320) T protein:vir:10 77 MKP-ITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLA 155 (320) T ss_pred ccc-ccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccce Confidence 776 6789999999999999999999999999999999999999999999999999999999999999887544332222 Q ss_pred ccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc---- Q lcl|NC_019921. 218 TEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---- 293 (381) Q Consensus 218 ~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~---- 293 (381) ..+. .+........+.+.++...+ ...++.+++|+||+.++..++. +++.+|+|++. T Consensus 156 ~~~~---------~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~v~n~~~~~~L~~---lkd~~G~~l~~~~~~ 216 (320) T protein:vir:10 156 DPGG---------ATASDLTAYDAVAVNGLSLL-------VNAKKKWTHTLLDDIVEPILNG---AKDKNGRPLFIESTY 216 (320) T ss_pred eccc---------ccccccccHHHHHHHHHhhh-------hcccCCCcEEEEcHHHHHHHHH---hhccCCceeeccccc Confidence 1111 11111111112233332222 3467888999999999887764 47788888764 Q ss_pred ----------cCCCceeEecCCCCCCc--EEEEeecceEEEeecceEEEeehhhh--------------hhcCceEEEEE Q lcl|NC_019921. 294 ----------LPFNLNVIESTVQEAGK--VLTYVKGLYDGYLAGGINVQKFKETL--------------ALDDMDLYTAK 347 (381) Q Consensus 294 ----------l~~G~pVv~s~~~p~~~--i~fgd~~~y~i~~r~~i~i~~~~~~~--------------~~~d~~~~r~~ 347 (381) .++|+||+.+++||+++ ++||||++|++++|+++++++++|.+ |.+|++.||+. T Consensus 217 ~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~ 296 (320) T protein:vir:10 217 TDENSPFRAGRIVSRPTILSDHVADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVE 296 (320) T ss_pred cCccccccCceeeeeeeEecCCCCCCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEE Confidence 24799999999999987 57899999999999999999999865 88999999999 Q ss_pred EEEcCEEecCceEEEEEEEecCCcc Q lcl|NC_019921. 348 QFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 348 ~r~dGk~~~~~Afvv~~~~~~~~~~ 372 (381) +|+|+++++++||++++- ++++.+ T Consensus 297 ~~~d~~v~~~~a~~~l~~-~~ap~~ 320 (320) T protein:vir:10 297 AEYAFHNNDKDAFVKLTN-VVTPDA 320 (320) T ss_pred EeeccEEecccceEEEEe-ccCCCC Confidence 999999999999998763 333333 No 68 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=3.9e-55 Score=318.90 Aligned_cols=279 Identities=14% Similarity=0.042 Sum_probs=226.3 Q ss_pred cCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceEEeeccccccccc Q lcl|NC_019921. 65 LSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIKGQL 143 (381) Q Consensus 65 lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~~~~a~wv~e~~~~~~~~ 143 (381) +..++. ++.+..++++||++||++++++|++.+++.++|+++|+++++++ ..++|+.++.+.+.|++|+++.+ ++ T Consensus 1 ma~~~~---~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~-~~ 76 (304) T protein:vir:10 1 MATPTY---TPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQ-TS 76 (304) T ss_pred Cccccc---ccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcccc-cc Confidence 222211 23345567789999999999999999999999999999999865 58999999999999999988765 57 Q ss_pred CcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeecccccccccccccc Q lcl|NC_019921. 144 DAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYP 223 (381) Q Consensus 144 ~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~ 223 (381) +++|+++++.++|++++++||+|||+||.+|+++||.++|+++++++++.+|++|+|+++|.|++.......... T Consensus 77 ~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~----- 151 (304) T protein:vir:10 77 KPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEE----- 151 (304) T ss_pred cceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccc----- Confidence 899999999999999999999999999999999999999999999999999999999999987753211100000 Q ss_pred ceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCcccccc----CCCce Q lcl|NC_019921. 224 EKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL----PFNLN 299 (381) Q Consensus 224 ~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l----~~G~p 299 (381) ...+.......++.+.++...+ ...+..+++|+||+.++..++. .++.+|+|+|.. .+|+| T Consensus 152 -----~~~~~~~~~~~~~~i~~~~~~l-------~~~~~~~~~~v~~~~~~~~L~~---lkd~~G~~l~~~~~~~l~G~P 216 (304) T protein:vir:10 152 -----KGNVVTDTNNLYVDLSALMATI-------EDEELDPNGVLTTRSFRSKMRN---ALDANDRPLFDANGNEIMGLP 216 (304) T ss_pred -----cccccccccchHHHHHHHHHHh-------hhccCCcCEEEEcHHHHHHHHH---hhccCCcEeecCCCcccccee Confidence 0111122334466666665544 3456778899999999988765 478899998753 47999 Q ss_pred eEecCCCCCC----cEEEEeecceEEEeecceEEEeehhh----------------hhhcCceEEEEEEEEcCEEecCce Q lcl|NC_019921. 300 VIESTVQEAG----KVLTYVKGLYDGYLAGGINVQKFKET----------------LALDDMDLYTAKQFAYGKAKDNKV 359 (381) Q Consensus 300 Vv~s~~~p~~----~i~fgd~~~y~i~~r~~i~i~~~~~~----------------~~~~d~~~~r~~~r~dGk~~~~~A 359 (381) |+.+++||.+ .++||||++|++++|+++++++++|. .|.+|++.||+.+|+|+++++++| T Consensus 217 V~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a 296 (304) T protein:vir:10 217 LSYTGADVYDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEA 296 (304) T ss_pred eEEecccccCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccc Confidence 9999999853 48999999999999999999999984 499999999999999999999999 Q ss_pred EEEEEEEecC Q lcl|NC_019921. 360 AAVWKLDLKG 369 (381) Q Consensus 360 fvv~~~~~~~ 369 (381) |++++ .+. T Consensus 297 ~~~l~--~a~ 304 (304) T protein:vir:10 297 FATLK--PTE 304 (304) T ss_pred eEEEE--ecC Confidence 99744 444 No 69 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=3.9e-55 Score=318.90 Aligned_cols=279 Identities=14% Similarity=0.042 Sum_probs=226.3 Q ss_pred cCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceEEeeccccccccc Q lcl|NC_019921. 65 LSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIKGQL 143 (381) Q Consensus 65 lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~~~~a~wv~e~~~~~~~~ 143 (381) +..++. ++.+..++++||++||++++++|++.+++.++|+++|+++++++ ..++|+.++.+.+.|++|+++.+ ++ T Consensus 1 ma~~~~---~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~-~~ 76 (304) T protein:vir:94 1 MATPTY---TPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQ-TS 76 (304) T ss_pred Cccccc---ccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcccc-cc Confidence 222211 23345567789999999999999999999999999999999865 58999999999999999988765 57 Q ss_pred CcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeecccccccccccccc Q lcl|NC_019921. 144 DAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYP 223 (381) Q Consensus 144 ~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~ 223 (381) +++|+++++.++|++++++||+|||+||.+|+++||.++|+++++++++.+|++|+|+++|.|++.......... T Consensus 77 ~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~----- 151 (304) T protein:vir:94 77 KPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEE----- 151 (304) T ss_pred cceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccc----- Confidence 899999999999999999999999999999999999999999999999999999999999987753211100000 Q ss_pred ceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCcccccc----CCCce Q lcl|NC_019921. 224 EKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL----PFNLN 299 (381) Q Consensus 224 ~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l----~~G~p 299 (381) ...+.......++.+.++...+ ...+..+++|+||+.++..++. .++.+|+|+|.. .+|+| T Consensus 152 -----~~~~~~~~~~~~~~i~~~~~~l-------~~~~~~~~~~v~~~~~~~~L~~---lkd~~G~~l~~~~~~~l~G~P 216 (304) T protein:vir:94 152 -----KGNVVTDTNNLYVDLSALMATI-------EDEELDPNGVLTTRSFRSKMRN---ALDANDRPLFDANGNEIMGLP 216 (304) T ss_pred -----cccccccccchHHHHHHHHHHh-------hhccCCcCEEEEcHHHHHHHHH---hhccCCcEeecCCCcccccee Confidence 0111122334466666665544 3456778899999999988765 478899998753 47999 Q ss_pred eEecCCCCCC----cEEEEeecceEEEeecceEEEeehhh----------------hhhcCceEEEEEEEEcCEEecCce Q lcl|NC_019921. 300 VIESTVQEAG----KVLTYVKGLYDGYLAGGINVQKFKET----------------LALDDMDLYTAKQFAYGKAKDNKV 359 (381) Q Consensus 300 Vv~s~~~p~~----~i~fgd~~~y~i~~r~~i~i~~~~~~----------------~~~~d~~~~r~~~r~dGk~~~~~A 359 (381) |+.+++||.+ .++||||++|++++|+++++++++|. .|.+|++.||+.+|+|+++++++| T Consensus 217 V~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a 296 (304) T protein:vir:94 217 LSYTGADVYDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEA 296 (304) T ss_pred eEEecccccCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccc Confidence 9999999853 48999999999999999999999984 499999999999999999999999 Q ss_pred EEEEEEEecC Q lcl|NC_019921. 360 AAVWKLDLKG 369 (381) Q Consensus 360 fvv~~~~~~~ 369 (381) |++++ .+. T Consensus 297 ~~~l~--~a~ 304 (304) T protein:vir:94 297 FATLK--PTE 304 (304) T ss_pred eEEEE--ecC Confidence 99744 444 No 70 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=9.8e-55 Score=316.67 Aligned_cols=280 Identities=10% Similarity=-0.005 Sum_probs=218.7 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEecCCcceEEeecccccccccCcceeeEeecc Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQ 154 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~ 154 (381) |..+++++||++||++++++|++.+++.|+||++|++++++ +.+++|+.++.+.++|++|++.++ +++++|+++++.+ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~-~s~~~f~~v~l~~ 79 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKP-SASVDVSAFTAQP 79 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeCCcccc-ccccceeeeEeee Confidence 77888999999999999999999999999999999999986 468999999999999999987765 6789999999999 Q ss_pred eeEEEeeeccHHhhhcCHHH----HHHHHHHHHHHHHHHHHhhheeeccCCC---cceEeeeccccccccccccccceee Q lcl|NC_019921. 155 NKLTAFVVLPKDLNDFGPAW----IERFVRVQIEEAFAVALETAFLKGTGKD---QPIGLNRQVQKGVSVTEGAYPEKEE 227 (381) Q Consensus 155 ~kl~~~~~iS~ell~ds~~~----~e~~l~~~la~~~~~~~~~a~i~G~G~~---~P~Gil~~~~~~~~~~~~~~~~~~~ 227 (381) ||++++++||+|||+|+..+ |++||.+++++++++++|.+|++|+|.+ .|.|+.+.+...... T Consensus 80 ~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~---------- 149 (315) T protein:vir:80 80 IKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNI---------- 149 (315) T ss_pred eeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccce---------- Confidence 99999999999999988876 8899999999999999999999998753 234444322111100 Q ss_pred eeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhc--cCCCCccccc--------cCCC Q lcl|NC_019921. 228 QGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH--LNANGVYVTA--------LPFN 297 (381) Q Consensus 228 ~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~--~~~~G~~~~~--------l~~G 297 (381) ... .......+.+++..+. ...+..+..|+||+.++..++++++. ++.+|+|+|. ..+| T Consensus 150 -~~~----~~~~~~d~~~~~~~~~------~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G 218 (315) T protein:vir:80 150 -VDA----TDSATADLVKAVGLIA------GAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRG 218 (315) T ss_pred -eec----cccchHHHHHHHHHHh------hccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecc Confidence 000 1111223334443332 12344455799999999888876543 3456777652 3589 Q ss_pred ceeEecCCCCCC---------cEEEEeecceEEEeecceEEEeehhh--------hhhcCceEEEEEEEEcCEEecCceE Q lcl|NC_019921. 298 LNVIESTVQEAG---------KVLTYVKGLYDGYLAGGINVQKFKET--------LALDDMDLYTAKQFAYGKAKDNKVA 360 (381) Q Consensus 298 ~pVv~s~~~p~~---------~i~fgd~~~y~i~~r~~i~i~~~~~~--------~~~~d~~~~r~~~r~dGk~~~~~Af 360 (381) +||+.+++||++ .++||||++|.++.|++++++++++. +|.+|+++||+.+|+|+++++++|| T Consensus 219 ~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~ 298 (315) T protein:vir:80 219 LNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSF 298 (315) T ss_pred eeeEecCcCCcccccccccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccce Confidence 999999999864 37899999999999999999988763 5899999999999999999999999 Q ss_pred EEEEEEecCCccccccCcc Q lcl|NC_019921. 361 AVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 361 vv~~~~~~~~~~~~~~~~~ 379 (381) ++++-+.+ ..++.-+.+ T Consensus 299 ~~l~~~~a--~~~~~~~~~ 315 (315) T protein:vir:80 299 AVVKEKAA--PKPNPPAEN 315 (315) T ss_pred EEEeeccC--CCCCCCCCC Confidence 98665443 333333333 No 71 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=4.9e-55 Score=318.34 Aligned_cols=301 Identities=13% Similarity=0.031 Sum_probs=229.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC Q lcl|NC_019921. 37 INQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG 116 (381) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~ 116 (381) +++. +..++.. +.+.. ...+.+.+++.......+||++||+++.++|++.+++.++|++++++++++ T Consensus 1 ~~~~--~~~~~~~----~~~~~-------~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~ 67 (324) T protein:vir:78 1 MEQT--QKLKLNL----QHFAS-------NNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME 67 (324) T ss_pred CCcc--hhhhHHH----HHHHH-------HhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeecc Confidence 0000 0000000 00000 001112233334455678899999999999999999999999999999986 Q ss_pred C-ceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhhe Q lcl|NC_019921. 117 L-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF 195 (381) Q Consensus 117 g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~ 195 (381) + .+++|+.++.+.+.|++|++..+ +++++|+++++.+||++++++||+|||+||.+|+++||.++|++++++++|.++ T Consensus 68 ~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~ 146 (324) T protein:vir:78 68 GTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAG 146 (324) T ss_pred CCceEEEEEecCcceeEecCCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH Confidence 5 58999999999999999987765 678999999999999999999999999999999999999999999999999999 Q ss_pred eeccCCC-cceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhH Q lcl|NC_019921. 196 LKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDA 274 (381) Q Consensus 196 i~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~ 274 (381) |+|+|++ +|.||++.+....... ....+++.+.++...+ ...+...++|+||+.++ T Consensus 147 l~G~g~~~~~~gi~~~~~~~~~~~----------------~~~~t~~~i~~~~~~l-------~~~~~~~~~~vmn~~~~ 203 (324) T protein:vir:78 147 ILNQGNNPFGKSIAQSIEKTNKVI----------------KGDFTQDNIIDLEALL-------EDDELEANAFISKTQNR 203 (324) T ss_pred hccCCCCCcCccccccccccceec----------------cccccHHHHHHHHHhh-------hhccCCCCEEEEcHHHH Confidence 9999975 6888876443222111 1123455666655444 23466777899999998 Q ss_pred HHHhhhhhccCCCCccccc-----cCCCceeEecCCCC--CCcEEEEeecceEEEeecceEEEeehhhh----------- Q lcl|NC_019921. 275 FEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQE--AGKVLTYVKGLYDGYLAGGINVQKFKETL----------- 336 (381) Q Consensus 275 ~~~~~~~~~~~~~G~~~~~-----l~~G~pVv~s~~~p--~~~i~fgd~~~y~i~~r~~i~i~~~~~~~----------- 336 (381) ..++. .++.+|+|++. ..+|+||+.+++++ ++.++||||++|+++++++++++.++|.. T Consensus 204 ~~L~~---l~d~~G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~ 280 (324) T protein:vir:78 204 SLLRK---IVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTP 280 (324) T ss_pred HHHHH---hhccCCCeeecCCCCCcccceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccc Confidence 87754 47788888753 34799999877654 56799999999999999999999998853 Q ss_pred ---hhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 337 ---ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 337 ---~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) |.+|++.||+.+|+|+++++++||++++- ..+.+|-||-.+ T Consensus 281 ~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~----a~~~~~~~~~~~ 324 (324) T protein:vir:78 281 VNLFEQDMVALRATMHVALHIADDKAFAKLVP----ADKRTDSVPGEV 324 (324) T ss_pred hhhhhcCcEEEEEEEEEccEEecccceEEEec----ccccCCCCCCCC Confidence 89999999999999999999999997553 344455566666 No 72 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=4.9e-55 Score=318.34 Aligned_cols=301 Identities=13% Similarity=0.031 Sum_probs=229.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC Q lcl|NC_019921. 37 INQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG 116 (381) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~ 116 (381) +++. +..++.. +.+.. ...+.+.+++.......+||++||+++.++|++.+++.++|++++++++++ T Consensus 1 ~~~~--~~~~~~~----~~~~~-------~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~ 67 (324) T protein:vir:96 1 MEQT--QKLKLNL----QHFAS-------NNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME 67 (324) T ss_pred CCcc--hhhhHHH----HHHHH-------HhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeecc Confidence 0000 0000000 00000 001112233334455678899999999999999999999999999999986 Q ss_pred C-ceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhhe Q lcl|NC_019921. 117 L-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF 195 (381) Q Consensus 117 g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~ 195 (381) + .+++|+.++.+.+.|++|++..+ +++++|+++++.+||++++++||+|||+||.+|+++||.++|++++++++|.++ T Consensus 68 ~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~ 146 (324) T protein:vir:96 68 GTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAG 146 (324) T ss_pred CCceEEEEEecCcceeEecCCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH Confidence 5 58999999999999999987765 678999999999999999999999999999999999999999999999999999 Q ss_pred eeccCCC-cceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhH Q lcl|NC_019921. 196 LKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDA 274 (381) Q Consensus 196 i~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~ 274 (381) |+|+|++ +|.||++.+....... ....+++.+.++...+ ...+...++|+||+.++ T Consensus 147 l~G~g~~~~~~gi~~~~~~~~~~~----------------~~~~t~~~i~~~~~~l-------~~~~~~~~~~vmn~~~~ 203 (324) T protein:vir:96 147 ILNQGNNPFGKSIAQSIEKTNKVI----------------KGDFTQDNIIDLEALL-------EDDELEANAFISKTQNR 203 (324) T ss_pred hccCCCCCcCccccccccccceec----------------cccccHHHHHHHHHhh-------hhccCCCCEEEEcHHHH Confidence 9999975 6888876443222111 1123455666655444 23466777899999998 Q ss_pred HHHhhhhhccCCCCccccc-----cCCCceeEecCCCC--CCcEEEEeecceEEEeecceEEEeehhhh----------- Q lcl|NC_019921. 275 FEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQE--AGKVLTYVKGLYDGYLAGGINVQKFKETL----------- 336 (381) Q Consensus 275 ~~~~~~~~~~~~~G~~~~~-----l~~G~pVv~s~~~p--~~~i~fgd~~~y~i~~r~~i~i~~~~~~~----------- 336 (381) ..++. .++.+|+|++. ..+|+||+.+++++ ++.++||||++|+++++++++++.++|.. T Consensus 204 ~~L~~---l~d~~G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~ 280 (324) T protein:vir:96 204 SLLRK---IVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTP 280 (324) T ss_pred HHHHH---hhccCCCeeecCCCCCcccceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccc Confidence 87754 47788888753 34799999877654 56799999999999999999999998853 Q ss_pred ---hhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 337 ---ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 337 ---~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) |.+|++.||+.+|+|+++++++||++++- ..+.+|-||-.+ T Consensus 281 ~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~----a~~~~~~~~~~~ 324 (324) T protein:vir:96 281 VNLFEQDMVALRATMHVALHIADDKAFAKLVP----ADKRTDSVPGEV 324 (324) T ss_pred hhhhhcCcEEEEEEEEEccEEecccceEEEec----ccccCCCCCCCC Confidence 89999999999999999999999997553 344455566666 No 73 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=1.1e-53 Score=310.97 Aligned_cols=347 Identities=12% Similarity=0.073 Sum_probs=223.0 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhh-------hHHHH-HHH---HHHHHHHHHHHHHHHH-HHHHHHHHHh----hc---- Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNGE-------PQERQ-NEL---YGDMINQLFEETKLQA-KAEAERVSSL----PK---- 60 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~~-------~~~~~-~~~---~~~~~~~~~~~~~~~~-~~~~~~~~~~----~~---- 60 (381) |+.+ +++++.+.++.......+ ...++ .+. +....+.+..+..... ..+....... .. T Consensus 1 m~~~--~~lee~~a~l~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (419) T protein:vir:94 1 MPPT--PTLEEQRAALLARLDDTSLTTEQVQEIVAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEA 78 (419) T ss_pred CCHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 6654 233333333322111110 00011 111 1111111111100000 0000000000 00 Q ss_pred -ccccc----------------------CHHHHHHHHHH------hhc-cCCCCceeccHHHHHHHHHHHHhhhhhhhhc Q lcl|NC_019921. 61 -SAQSL----------------------SANQRSFFMDI------NKN-VNYKEEKLLPEETIDRIFEDLTTNHPLLADL 110 (381) Q Consensus 61 -~~~~l----------------------t~~e~~~~~~~------~~~-~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~ 110 (381) ..+.. ..+.+...... ..+ ....|++++|+.+.+.|+..+.....|+++| T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~ 158 (419) T protein:vir:94 79 GTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLL 158 (419) T ss_pred ccccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcc Confidence 00000 00000000000 111 2234556677777777777888888999999 Q ss_pred eeEecCC-ceEEEEecC--------CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHH Q lcl|NC_019921. 111 GIKNAGL-RLKFLKSET--------SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRV 181 (381) Q Consensus 111 ~v~~~~g-~~~~p~~~~--------~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~ 181 (381) ++.++++ .+++|+.++ .+.+.|++|++..+ +++++|+++++.+|+++++++||++||+|+. ++++||.+ T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~ 236 (419) T protein:vir:94 159 DQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKP-QSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQG 236 (419) T ss_pred eeeeccCCceeeeeeccccccccccCcccceecCCcccc-ccccceeeEEeeeeeEEEeehhhHHHHHhHH-HHHHHHHH Confidence 9999765 467776543 34688999987755 6889999999999999999999999999875 79999999 Q ss_pred HHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccc Q lcl|NC_019921. 182 QIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAV 261 (381) Q Consensus 182 ~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~ 261 (381) +|++++++++|.+||+|+|+++|+||++............ .........++.+.+++..+ ...+ T Consensus 237 ~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~---------~~~~t~~~~~~~l~~~~~~~-------~~~~ 300 (419) T protein:vir:94 237 RLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKP---------TAPATDEPPLVDIRRAKTVA-------EIAG 300 (419) T ss_pred HHHHHHHHHHHHHHHhccCcccccceeccccccccccccc---------ccccccchhHHHHHHHHHhh-------hhcc Confidence 9999999999999999999999999997544322221111 11122334456666655443 2345 Q ss_pred cCceEEEEchhhHHHHhhhhhccCCCC-ccccc---------cCCCceeEecCCCCCCcEEEEeecc-eEEEeecceEEE Q lcl|NC_019921. 262 KGNVTMVVNPSDAFEVQAQYTHLNANG-VYVTA---------LPFNLNVIESTVQEAGKVLTYVKGL-YDGYLAGGINVQ 330 (381) Q Consensus 262 ~~~a~~~mn~~t~~~~~~~~~~~~~~G-~~~~~---------l~~G~pVv~s~~~p~~~i~fgd~~~-y~i~~r~~i~i~ 330 (381) ..+.+|+||+.++..+... ++.+| .|.+. ..+|+||+++++||+++++||||++ |.+++|++++++ T Consensus 301 ~~~~~~v~n~~~~~~l~~~---k~~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~ 377 (419) T protein:vir:94 301 FPPDGVVVHPQDWESIELD---QAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVL 377 (419) T ss_pred CCCCEEEEcHHHHHHHHHH---hhcCCCceeecCCcccCCCccccceeeEEcCCCCCccEEEeeccceEEEEEecceEEE Confidence 6677899999998887654 44333 33321 4489999999999999999999998 889999999999 Q ss_pred eehhh--hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCC Q lcl|NC_019921. 331 KFKET--LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 331 ~~~~~--~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~ 370 (381) .+++. .|.+|+++||+..|+||++++++||++++++-+.+ T Consensus 378 ~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 378 MTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred EeccccchhhcCcEEEEEEEeeccEEeccccEEEEEeccCCC Confidence 99876 49999999999999999999999999877655444 No 74 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=3e-53 Score=308.55 Aligned_cols=325 Identities=11% Similarity=0.001 Sum_probs=221.0 Q ss_pred Cchh-HHHHHHHHHHHHHHHHhhhhhH-----------H------HHHHHHHHHH---HHHHHHHHHHHH-HHHHHHHHh Q lcl|NC_019921. 1 MTIN-LSETFANAKNEFINAVNNGEPQ-----------E------RQNELYGDMI---NQLFEETKLQAK-AEAERVSSL 58 (381) Q Consensus 1 mt~e-l~~~~~~~~~~~~~~~k~~~~~-----------~------~~~~~~~~~~---~~~~~~~~~~~~-~~~~~~~~~ 58 (381) |+|+ +.++.+++.+++.+.+.....+ + +..+.++... +.+.+....... .+..+.... T Consensus 1 ~~l~e~i~e~~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~ 80 (400) T protein:vir:38 1 MTLDEKLAAVKKQLDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSG 80 (400) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 6665 3333333322322222111000 0 0001111111 111111100000 000000000 Q ss_pred hcccc-----------------------------------ccCHHHHHHHHHHhhc-cCCCCceeccHHHHHHHHHHHHh Q lcl|NC_019921. 59 PKSAQ-----------------------------------SLSANQRSFFMDINKN-VNYKEEKLLPEETIDRIFEDLTT 102 (381) Q Consensus 59 ~~~~~-----------------------------------~lt~~e~~~~~~~~~~-~~~~gg~lvP~~~~~~I~~~l~~ 102 (381) ..... .....+.++...+..+ ++++||++||+++++.|++.+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~ 160 (400) T protein:vir:38 81 KKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQT 160 (400) T ss_pred ccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHh Confidence 00000 0000000111222333 56789999999999999999999 Q ss_pred hhhhhhhceeEecC-CceEEEEec-CCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHH Q lcl|NC_019921. 103 NHPLLADLGIKNAG-LRLKFLKSE-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVR 180 (381) Q Consensus 103 ~~~l~~~~~v~~~~-g~~~~p~~~-~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~ 180 (381) +++|+++|++++++ +..++|+.. +.+.+.|++|+++.+..++++|++|++.+|+++++++||++||+||.+++++||. T Consensus 161 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~ 240 (400) T protein:vir:38 161 VVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLIA 240 (400) T ss_pred hhhhhhcceeEeccCcceEEEEEecCCCccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHHH Confidence 99999999999976 457888765 4567889999888877789999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhcccccccc Q lcl|NC_019921. 181 VQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVA 260 (381) Q Consensus 181 ~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~ 260 (381) ++++++++.+++.+|++|+|+++|.|+.+ ++.+.+++... .. T Consensus 241 ~~l~~~~~~~~~~~i~~~~~~~~~~~~~~------------------------------~~~~~~~~~~~--------~~ 282 (400) T protein:vir:38 241 QNGQQIKVNTTNGAVATLLKGFTAKTISS------------------------------VDDLKHINNVD--------LD 282 (400) T ss_pred HHHHHHHHHHHHHhhhhcccccccccccc------------------------------HHHHHHHHHhh--------hh Confidence 99999999999999999999877765531 22333332211 11 Q ss_pred ccCceEEEEchhhHHHHhhhhhccCCCCccccc---------cCCCceeEecCCCCCC---c--EEEEeecc-eEEEeec Q lcl|NC_019921. 261 VKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQEAG---K--VLTYVKGL-YDGYLAG 325 (381) Q Consensus 261 ~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~---------l~~G~pVv~s~~~p~~---~--i~fgd~~~-y~i~~r~ 325 (381) +..+++|+||+.++..+.. +++.+|+|+|. ..+|+||+.+++||.+ + ++||||++ |++++|+ T Consensus 283 ~~~~a~~v~~~~~~~~l~~---lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~ 359 (400) T protein:vir:38 283 PAYSRVIIASQSFYNFLDT---VKDGNGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRA 359 (400) T ss_pred hhhCcEEEEcHHHHHHHHH---hhccCCCeeeecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeec Confidence 2236889999999887764 47889999985 2479999999999853 2 89999998 7899999 Q ss_pred ceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecC Q lcl|NC_019921. 326 GINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 326 ~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~ 369 (381) ++++.++++.+|. ++||+++|+||++++++||+.++++-++ T Consensus 360 ~~~~~~~~~~~~~---~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 360 DFMVRWVDDQIYG---QFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred ceEEEEecccccc---eeEEEEEEeccEEecccceEEEEeecCC Confidence 9999999987664 6899999999999999999998887666 No 75 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=2.9e-54 Score=314.11 Aligned_cols=284 Identities=11% Similarity=0.027 Sum_probs=220.7 Q ss_pred HHHHHHH---hhccCCC------CceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEecCCcceEEeeccc-- Q lcl|NC_019921. 70 RSFFMDI---NKNVNYK------EEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYG-- 137 (381) Q Consensus 70 ~~~~~~~---~~~~~~~------gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~~~~p~~~~~~~a~wv~e~~-- 137 (381) ...++++ ..+.... ++.+||+++.++|++.+++.++|+++|++++++ +..++|+.++.+.+.|++|+. T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg~~~ 80 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEeecCcccc Confidence 1222222 2222223 344899999999999999999999999999976 468999999999999998753 Q ss_pred -----ccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcce---Eeee Q lcl|NC_019921. 138 -----EIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI---GLNR 209 (381) Q Consensus 138 -----~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~---Gil~ 209 (381) +..++++++|+++++.+||++++++||+|||+||.+++++||+++|+++|++++|.+|++|+|+++|. |+++ T Consensus 81 ~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~ 160 (333) T protein:vir:78 81 EQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDT 160 (333) T ss_pred cccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcccccccc Confidence 23346789999999999999999999999999999999999999999999999999999999987654 5543 Q ss_pred ccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCc Q lcl|NC_019921. 210 QVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGV 289 (381) Q Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~ 289 (381) ....... +. . ..........++.+.+++..+. ...+.....|+|||.++..+++....+|.+|. T Consensus 161 ~~~~~~~-~~--------~-~~~~~~~~~~~~~i~~~~~~~~------~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~ 224 (333) T protein:vir:78 161 DNVIANT-TN--------V-DYLQETGDPLLDRLLDGYDLVS------ANTDVEFNGWAVDPRFRAHLLRAQAYRDANGN 224 (333) T ss_pred ccccccc-cc--------c-cccccccchhHHHHHHHHHhhc------cccccCceEEEEcchHHHHHHHHhhhcCCCCc Confidence 2211110 00 0 0011122334555555554332 12344556799999999888887788999999 Q ss_pred cccc---------cCCCceeEecCCCCCC---------cEEEEeecceEEEeecceEEEeehhh-----------hhhcC Q lcl|NC_019921. 290 YVTA---------LPFNLNVIESTVQEAG---------KVLTYVKGLYDGYLAGGINVQKFKET-----------LALDD 340 (381) Q Consensus 290 ~~~~---------l~~G~pVv~s~~~p~~---------~i~fgd~~~y~i~~r~~i~i~~~~~~-----------~~~~d 340 (381) |+|. .++|+||+++++||++ .++||||++|++++|++++|+++++. .|.+| T Consensus 225 ~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 304 (333) T protein:vir:78 225 VDPSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTN 304 (333) T ss_pred eeecCccccCCCceeeceeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcC Confidence 9875 2379999999999964 48999999999999999999999873 58899 Q ss_pred ceEEEEEEEEcCEEecCceEEEEEEEecC Q lcl|NC_019921. 341 MDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 341 ~~~~r~~~r~dGk~~~~~Afvv~~~~~~~ 369 (381) ++.||+.+|+|+++++++||++++-..++ T Consensus 305 ~v~~r~~~r~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 305 QIAILIEVTFGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred cEEEEEEEEEccEEecccceEEEeccCCC Confidence 99999999999999999999987644433 No 76 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=3.1e-54 Score=313.92 Aligned_cols=288 Identities=12% Similarity=-0.003 Sum_probs=224.3 Q ss_pred ccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceEEeecccc Q lcl|NC_019921. 60 KSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGE 138 (381) Q Consensus 60 ~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~~~~a~wv~e~~~ 138 (381) .+ ..++.+.. ..+ +.+++|.+||++++++|++.+++.++|+++++++++++ .+++|+.+..+.+.|++|+++ T Consensus 1 ~g---~~~e~~~~---~~~-~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~ 73 (397) T protein:vir:23 1 MG---FSADHSQI---AQT-KDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWIGEGDM 73 (397) T ss_pred CC---cCHHHHHH---hhc-cCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEecCCcc Confidence 11 23333322 223 33344446677789999999999999999999999865 589999999999999999877 Q ss_pred cccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccc Q lcl|NC_019921. 139 IKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVT 218 (381) Q Consensus 139 ~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~ 218 (381) ++ +++++|+++++.+||++++++||+|||+||.+|+++||+++|++++++++|.+||+|+|+.+|.+.+......... T Consensus 74 ~~-~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~- 151 (397) T protein:vir:23 74 KP-ITKGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQS- 151 (397) T ss_pred cc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceee- Confidence 65 5789999999999999999999999999999999999999999999999999999999998765544322111110 Q ss_pred cccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc----- Q lcl|NC_019921. 219 EGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA----- 293 (381) Q Consensus 219 ~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~----- 293 (381) . ......+.+.+....+ ...++.++.|+||+.++..+++ .++.+|+|+|. T Consensus 152 -----------~----~~~~~~~~~~~~~~~l-------~~~~~~~a~~vmn~~~~~~L~~---lkd~~G~~i~~~~~~~ 206 (397) T protein:vir:23 152 -----------I----SPNAYQGLGVSGLTKL-------VTDGKKWTHTLLDDTVEPVLNG---SVDANGRPLFVESTYE 206 (397) T ss_pred -----------e----cccchhHHHHHHHHhh-------hhcccCCCEEEEcHHHHHHHHH---hhccCCceeecccccc Confidence 0 0111222333333222 2346678899999999887765 47888998864 Q ss_pred ---------cCCCceeEecCCCCCCc--EEEEeecceEEEeecceEEEeehhhh--------------hhcCceEEEEEE Q lcl|NC_019921. 294 ---------LPFNLNVIESTVQEAGK--VLTYVKGLYDGYLAGGINVQKFKETL--------------ALDDMDLYTAKQ 348 (381) Q Consensus 294 ---------l~~G~pVv~s~~~p~~~--i~fgd~~~y~i~~r~~i~i~~~~~~~--------------~~~d~~~~r~~~ 348 (381) ..+|+||+.+++||+++ ++||||++|++++++++.+++++|.. |.+|++.||+.+ T Consensus 207 ~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~ 286 (397) T protein:vir:23 207 SLTTPFREGRILGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEA 286 (397) T ss_pred cccccccCceeeeeeEEEeCCCCCCceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEe Confidence 23799999999999987 47899999999999999999998864 889999999999 Q ss_pred EEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 349 FAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 349 r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) |+|+++++++||+.++.+....+.....+|.|= T Consensus 287 r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~~ 319 (397) T protein:vir:23 287 EYGLLINDVNAFVKLTFDPVLTTYALDLDGASA 319 (397) T ss_pred eeccceecccceEEEeeccccceeeecccccCc Confidence 999999999999998887766665544444433 No 77 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=2.7e-54 Score=314.23 Aligned_cols=301 Identities=13% Similarity=0.036 Sum_probs=228.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec Q lcl|NC_019921. 36 MINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~ 115 (381) |.+ . .+.+.+....... ..+.+.+++.......++|++||++++++|++.+++.++|+++|+++++ T Consensus 1 ~~~-~-----~~~~~~~~~f~~~--------~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~ 66 (324) T protein:vir:93 1 MEQ-T-----QKLKLNLQHFASN--------NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPM 66 (324) T ss_pred Cch-h-----HHHHHHHHHHHHh--------hhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeec Confidence 000 0 0001111111000 0111223333334455677899999999999999999999999999998 Q ss_pred CC-ceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019921. 116 GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETA 194 (381) Q Consensus 116 ~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a 194 (381) ++ .+++|+.++.+.+.|++|++..+ +++++|+++++.+||++++++||+|||+||.+++++||++++++++++++|.+ T Consensus 67 ~~~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a 145 (324) T protein:vir:93 67 EGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEA 145 (324) T ss_pred cCCceEEEEEecCcceeeecCCcccc-ccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHH Confidence 65 58999999999999999987765 57899999999999999999999999999999999999999999999999999 Q ss_pred eeeccCCC-cceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhh Q lcl|NC_019921. 195 FLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSD 273 (381) Q Consensus 195 ~i~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t 273 (381) +|+|+|++ +|.|++..+........ ...+++.+.++...+ ...+...++|+||+.+ T Consensus 146 ~l~G~g~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~i~~~~~~l-------~~~~~~~~~~v~n~~~ 202 (324) T protein:vir:93 146 GILNQGNNPFGKSIAQSIEKTNKVIK----------------GDFTQDNIIDLEALL-------EDDELEANAFISKTQN 202 (324) T ss_pred HhcCCCCCCcCccccccccccceecc----------------ccccHHHHHHHHHhh-------hhccCCCCEEEEcHHH Confidence 99999975 78888865433222111 123355566655544 2345667789999999 Q ss_pred HHHHhhhhhccCCCCccccc-----cCCCceeEecCCC--CCCcEEEEeecceEEEeecceEEEeehhhh---------- Q lcl|NC_019921. 274 AFEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQ--EAGKVLTYVKGLYDGYLAGGINVQKFKETL---------- 336 (381) Q Consensus 274 ~~~~~~~~~~~~~~G~~~~~-----l~~G~pVv~s~~~--p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~---------- 336 (381) +..++. .++.+|+|++. ..+|+||+.+.++ +++.++||||++|+++++++++|+.++|.. T Consensus 203 ~~~L~~---l~d~~G~~~~~~~~~~~l~G~PVv~~~~~~~~~~~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:93 203 RSLLRK---IVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred HHHHHH---hhCCCCCeeecCCCCCcccceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeeccccccccccccc Confidence 887764 47888988753 3479999987664 456799999999999999999999999854 Q ss_pred ----hhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 337 ----ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 337 ----~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) |.+|+++||+.+|+|+++++++|||+++... +.++-||-.+ T Consensus 280 ~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~----~~~~~~~~~~ 324 (324) T protein:vir:93 280 PVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD----KRTDSVPGEV 324 (324) T ss_pred chhhhhcCcEEEEEEEEeccEEecccceEEEeccc----ccCCCCCCCC Confidence 8899999999999999999999999765333 3344466666 No 78 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=2.9e-54 Score=314.12 Aligned_cols=301 Identities=13% Similarity=0.044 Sum_probs=227.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC Q lcl|NC_019921. 37 INQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG 116 (381) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~ 116 (381) +++. ++.+.+.+ .+... .. ....+++.......++|.+||++++++|++.+++.++|+++|+++|++ T Consensus 1 ~~k~-~~~~~~~~-~~~~~---~~--------~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~ 67 (324) T protein:vir:99 1 MEQT-QKLKLNLQ-HFASN---NV--------KPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPME 67 (324) T ss_pred CCCc-hHhhHHHH-HHHHH---hh--------hhhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeecc Confidence 0000 00000000 00000 00 001112222334456677999999999999999999999999999986 Q ss_pred C-ceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhhe Q lcl|NC_019921. 117 L-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF 195 (381) Q Consensus 117 g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~ 195 (381) + .+++|+.++.+.+.|++|++..+ +++++|+++++.+||++++++||+|||+||.+++++||.++++++++++++.+| T Consensus 68 ~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~ 146 (324) T protein:vir:99 68 GTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAG 146 (324) T ss_pred CCceEEEEEecCcceeEeccCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHh Confidence 4 58999999889999999987765 578999999999999999999999999999999999999999999999999999 Q ss_pred eeccCCC-cceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhH Q lcl|NC_019921. 196 LKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDA 274 (381) Q Consensus 196 i~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~ 274 (381) |+|+|++ +|.|+++.+....... ....+.+.+.++...+ ...+..+++|+|||.++ T Consensus 147 l~G~g~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~i~~~~~~l-------~~~~~~~~~~v~n~~~~ 203 (324) T protein:vir:99 147 ILNQGNNPFGKSIAQSIEKTNKVI----------------KGDFTQDNIIDLEALL-------EDDELEANAFISKTQNR 203 (324) T ss_pred hhcCCCCccCccccccccccceec----------------cccCCHHHHHHHHHhh-------hhccCCCCEEEEcHHHH Confidence 9999986 7888876543322111 1123355566665544 23456677899999998 Q ss_pred HHHhhhhhccCCCCccccc-----cCCCceeEecCCCCC--CcEEEEeecceEEEeecceEEEeehhhh----------- Q lcl|NC_019921. 275 FEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQEA--GKVLTYVKGLYDGYLAGGINVQKFKETL----------- 336 (381) Q Consensus 275 ~~~~~~~~~~~~~G~~~~~-----l~~G~pVv~s~~~p~--~~i~fgd~~~y~i~~r~~i~i~~~~~~~----------- 336 (381) ..++. +++.+|+|++. .++|+||+.++.++. +.++||||++|+++++++++|+.++|.. T Consensus 204 ~~L~~---l~d~~g~~~~~~~~~~~l~G~PVv~~~~~~~~~~~~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 280 (324) T protein:vir:99 204 SLLRK---IVDPETKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTP 280 (324) T ss_pred HHHHH---hhcCCCceeecCCCCccccceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccc Confidence 87764 47788888753 347999999888764 5599999999999999999999999854 Q ss_pred ---hhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 337 ---ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 337 ---~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) |.+|++.||+.+|+|+++++++||++++....+.++ +|--. T Consensus 281 ~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~----~~~~~ 324 (324) T protein:vir:99 281 VNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDS----VPGEV 324 (324) T ss_pred hhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCC----CCCCC Confidence 889999999999999999999999988766655543 33333 No 79 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=4.8e-53 Score=307.42 Aligned_cols=332 Identities=11% Similarity=-0.022 Sum_probs=223.7 Q ss_pred hhHHHHHHHHHHHHHHHHhhh-h-------hHHHHHHHHHHHHHHHHHHH---HHHHHH-HHHHHHH-----hhcc---c Q lcl|NC_019921. 3 INLSETFANAKNEFINAVNNG-E-------PQERQNELYGDMINQLFEET---KLQAKA-EAERVSS-----LPKS---A 62 (381) Q Consensus 3 ~el~~~~~~~~~~~~~~~k~~-~-------~~~~~~~~~~~~~~~~~~~~---~~~~~~-~~~~~~~-----~~~~---~ 62 (381) |+..++..+.+++..+.+++. + ...++.+......+....+. .++... +...... .... . T Consensus 1 meeL~~~~~~~~~~~~e~~~~l~~~~~~~~~~~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (389) T protein:vir:10 1 MDKLQTLFNDVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKALEAEKPAEPKTEPKDDGSKKG 80 (389) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Confidence 554444444443333222211 0 00011111111111111111 000000 0000000 0000 0 Q ss_pred cc----cCHHHHHHHH-----------HHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecC Q lcl|NC_019921. 63 QS----LSANQRSFFM-----------DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSET 126 (381) Q Consensus 63 ~~----lt~~e~~~~~-----------~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~ 126 (381) .. ....+++.+. .+..+++++||++||+++++.|++.++++++|+++|+++|+++ ..++|+... T Consensus 81 ~~~~~~~~~~~~~~~~~~lr~~~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 160 (389) T protein:vir:10 81 TDLSKKPIDAKKKAINDFIHSHGKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKR 160 (389) T ss_pred cccchhHHHHHHHHHHHHhhcchhhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEec Confidence 00 1111222222 3345677889999999999999999999999999999999764 477777643 Q ss_pred -CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcce Q lcl|NC_019921. 127 -SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI 205 (381) Q Consensus 127 -~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~ 205 (381) .+.+.|++|+++.++.++++|+++++.+|++++++++|++||+||.+|+++||.++|+++++++++.+|++|+|++.|. T Consensus 161 ~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~ 240 (389) T protein:vir:10 161 ATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFTAK 240 (389) T ss_pred CCCccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 4556788998888877899999999999999999999999999999999999999999999999999999999988776 Q ss_pred EeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC Q lcl|NC_019921. 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN 285 (381) Q Consensus 206 Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~ 285 (381) |..+. ..++.+.+++... .+..| ++.|+||+.++..++. +++ T Consensus 241 ~~~~~---------------------------~~~d~l~~~~~~~------~~~~~--~a~~~~n~~~~~~L~~---lkd 282 (389) T protein:vir:10 241 KTTTD---------------------------TLVDSLKHILNVD------LDPAY--SRALVVTQSLFNTLDT---LKD 282 (389) T ss_pred ccccc---------------------------ccHHHHHHHHHhh------hhhhh--CcEEEecHHHHHHHHH---hhc Confidence 54321 1234444443321 11223 5789999999877764 578 Q ss_pred CCCccccc-------------cCCCceeEecCC-CC-C--C--cEEEEeecc-eEEEeecceEEEeehhhhhhcCceEEE Q lcl|NC_019921. 286 ANGVYVTA-------------LPFNLNVIESTV-QE-A--G--KVLTYVKGL-YDGYLAGGINVQKFKETLALDDMDLYT 345 (381) Q Consensus 286 ~~G~~~~~-------------l~~G~pVv~s~~-~p-~--~--~i~fgd~~~-y~i~~r~~i~i~~~~~~~~~~d~~~~r 345 (381) .+|+|+|. ..+|+||++.++ ++ . + .++||||++ |.+++|++++|.++++.+|. +.|| T Consensus 283 ~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~---~~~~ 359 (389) T protein:vir:10 283 KNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDSKIYG---KYLG 359 (389) T ss_pred cCCCeeeecCcccccccccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEeecccccc---ceEE Confidence 89999873 248999976544 33 2 2 289999998 89999999999999987776 4789 Q ss_pred EEEEEcCEEecCceEEEEEEEecCCccccc Q lcl|NC_019921. 346 AKQFAYGKAKDNKVAAVWKLDLKGHKPALE 375 (381) Q Consensus 346 ~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~ 375 (381) +++|+||++++++||+.+++.-++...+.. T Consensus 360 ~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 389 (389) T protein:vir:10 360 AAFRFGVQKADSKAGYFVTNTDVPGSALGK 389 (389) T ss_pred EEEEeccEEecccceEEEEeeccCCCCCCC Confidence 999999999999999998877655555444 No 80 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=7.8e-53 Score=306.25 Aligned_cols=328 Identities=11% Similarity=-0.035 Sum_probs=223.3 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhhhH---------HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhh-----cccccc Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNGEPQ---------ERQNELYGDMINQLFEETKLQAK-AEAERVSSLP-----KSAQSL 65 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~-----~~~~~l 65 (381) |.-+.-+++.+++.++.+.+.+...+ .++.+.....++.+.++...... .+........ ...... T Consensus 1 M~~~~l~el~~~l~e~~~~i~~~~~e~~~~~~~~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~ 80 (394) T protein:vir:97 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 77765555555554444433221111 11111111122222221111000 0000000000 000000 Q ss_pred ---CHHHH---------------------------HHH------HHH-hhccCCCCceeccHHHHHHHHHHHHhhhhhhh Q lcl|NC_019921. 66 ---SANQR---------------------------SFF------MDI-NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLA 108 (381) Q Consensus 66 ---t~~e~---------------------------~~~------~~~-~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~ 108 (381) ..+++ ... +.. ...+..+||++||+++++.|++.+++.++|++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~ 160 (394) T protein:vir:97 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) T ss_pred chhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhh Confidence 00000 000 000 12256679999999999999999999999999 Q ss_pred hceeEecC-CceEEEEec-CCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHH Q lcl|NC_019921. 109 DLGIKNAG-LRLKFLKSE-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEA 186 (381) Q Consensus 109 ~~~v~~~~-g~~~~p~~~-~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~ 186 (381) +|++++++ +...+|+.. +.+.+.|++|+++.+..++++|++|++.+|+++++++||++||+||.+++++||.++++++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~~ 240 (394) T protein:vir:97 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) T ss_pred hceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHHH Confidence 99999975 457888764 4567899999988876678999999999999999999999999999999999999999999 Q ss_pred HHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceE Q lcl|NC_019921. 187 FAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVT 266 (381) Q Consensus 187 ~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~ 266 (381) ++++++.+|++|.|++.|.|.. .++.+.+++.... .+..++. T Consensus 241 ~~~~~~~~i~~g~~~~~~~~~~------------------------------~~~~~~~~~~~~~--------~~~~~a~ 282 (394) T protein:vir:97 241 KVNTTNDAIAKVLKSFTTKTVK------------------------------NLDEIKALLNGGF--------DPAYNVS 282 (394) T ss_pred HHHHHHHHHhhccccccccccc------------------------------cHHHHHHHHHhhh--------hhhhCCE Confidence 9999999999998876654432 1233444433211 1233678 Q ss_pred EEEchhhHHHHhhhhhccCCCCccccc---------cCCCceeEec--CCCCCCcEEEEeecc-eEEEeecceEEEeehh Q lcl|NC_019921. 267 MVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVIES--TVQEAGKVLTYVKGL-YDGYLAGGINVQKFKE 334 (381) Q Consensus 267 ~~mn~~t~~~~~~~~~~~~~~G~~~~~---------l~~G~pVv~s--~~~p~~~i~fgd~~~-y~i~~r~~i~i~~~~~ 334 (381) |+||+.++..+.. ++|.+|+|+|. ..+|+||+++ ..++++.++||||++ |.+++|++++++.+++ T Consensus 283 ~v~n~~~~~~l~~---lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~ 359 (394) T protein:vir:97 283 LIVSQSFYQTLDT---LKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADN 359 (394) T ss_pred EEEcHHHHHHHHH---hhccCCCeeeecCcCCCCCceeccceeEEecccccCCccEEEeeccccEEEEEecceEEEEecc Confidence 9999999877754 57899999985 3479999884 456778899999998 8899999999999887 Q ss_pred hhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccc Q lcl|NC_019921. 335 TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPA 373 (381) Q Consensus 335 ~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~ 373 (381) .++ .++||+++|+||++++++||+.++++- .++|- T Consensus 360 ~~~---~~~~~~~~r~d~~v~~~~a~~~~~~~~-~~~p~ 394 (394) T protein:vir:97 360 EIY---GQYLQAVLRFGVSKVDDKAGYYVTFTP-EPLPL 394 (394) T ss_pred ccc---ceeEEEEEEEccEEecccceEEEEecc-cccCC Confidence 665 468999999999999999999887753 22222 No 81 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=4.8e-54 Score=312.88 Aligned_cols=270 Identities=12% Similarity=0.005 Sum_probs=229.0 Q ss_pred HHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC---CceEEEEec-CCcceEEeecccccccccCcce Q lcl|NC_019921. 72 FFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG---LRLKFLKSE-TSGVAVWGKIYGEIKGQLDAAF 147 (381) Q Consensus 72 ~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~---g~~~~p~~~-~~~~a~wv~e~~~~~~~~~~~f 147 (381) .++++..+++++||++||++++++|++.++++++|+++|++++++ +.+.+|... ..+.+.|++|+++.++.++++| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 667778889999999999999999999999999999999998864 446677664 4678999999988876678999 Q ss_pred eeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceee Q lcl|NC_019921. 148 SEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEE 227 (381) Q Consensus 148 ~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~ 227 (381) +++++.+||++++++||+|||+||.+|+++||.++++++++++++.+|++|+|+..+.+ T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~--------------------- 139 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTKP--------------------- 139 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccccc--------------------- Confidence 99999999999999999999999999999999999999999999999999987543210 Q ss_pred eeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc---------cCCCc Q lcl|NC_019921. 228 QGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNL 298 (381) Q Consensus 228 ~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~---------l~~G~ 298 (381) ....++.+.++...+ ...|+.+++|+||+.++..++. +++.+|+|+|. ..+|+ T Consensus 140 --------~~~~~d~i~~~~~~l-------~~~~~~~a~~vmn~~~~~~L~~---lkd~~g~~l~~~~~~~~~~~~l~G~ 201 (293) T protein:vir:48 140 --------TLTKWDDIIDLEAKV-------DPAIKQTSFFLTNTSGFTALKK---VKNALGDYLMERDVKSPTGYSIAGF 201 (293) T ss_pred --------cccCHHHHHHHHHhh-------hhhhcCCCEEEEcHHHHHHHHH---hhccCCceEeecCcCCCCCceecce Confidence 012345566655544 3457889999999999887765 47889999885 24799 Q ss_pred eeEecC--CCCCC-----cEEEEeecc-eEEEeecceEEEeehh--hhhhcCceEEEEEEEEcCEEecCceEEEEEEEec Q lcl|NC_019921. 299 NVIEST--VQEAG-----KVLTYVKGL-YDGYLAGGINVQKFKE--TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 299 pVv~s~--~~p~~-----~i~fgd~~~-y~i~~r~~i~i~~~~~--~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~ 368 (381) ||+.++ .+|.. .++||||++ |.+++|++++++++++ .+|.+|+++||+.+|+||++++++||++++++-+ T Consensus 202 Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 281 (293) T protein:vir:48 202 AVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAI 281 (293) T ss_pred eeEEecccccCCccCCceEEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeecc Confidence 997754 44532 389999998 7899999999999885 5799999999999999999999999999999999 Q ss_pred CCccccccCccc Q lcl|NC_019921. 369 GHKPALEGTEET 380 (381) Q Consensus 369 ~~~~~~~~~~~~ 380 (381) ...|++.+|+.- T Consensus 282 ~~~~~~~~~~~~ 293 (293) T protein:vir:48 282 ADQKGNIGSTAV 293 (293) T ss_pred ccCCccccccCC Confidence 999999999988 No 82 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=5.1e-54 Score=312.74 Aligned_cols=301 Identities=13% Similarity=0.049 Sum_probs=227.4 Q ss_pred HhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHH Q lcl|NC_019921. 20 VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFED 99 (381) Q Consensus 20 ~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~ 99 (381) ++..+ + .+.+..+.......+. .+++.......++|.+||++++++|++. T Consensus 1 ~~~~~-----------------~-----~~~~~~~f~~~~~~~~--------~~~a~~~~~~~~~~~liP~~~~~~ii~~ 50 (324) T protein:vir:10 1 MEQTQ-----------------K-----LKLNLQHFASNNVKPQ--------VFNPDNVMMHEKKDGTLLNDFTTPILQE 50 (324) T ss_pred CCCch-----------------H-----HHHHHHHHHHHhhccc--------eecccceeccCCCcceechhHHHHHHHH Confidence 00000 0 0000000000000011 1112223345566789999999999999 Q ss_pred HHhhhhhhhhceeEecC-CceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHH Q lcl|NC_019921. 100 LTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERF 178 (381) Q Consensus 100 l~~~~~l~~~~~v~~~~-g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~ 178 (381) +++.++|+++|++++++ +.+++|+.++.+.+.|++|+++.+ +++++|+++++.+||++++++||+|||+||.+++++| T Consensus 51 ~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~ 129 (324) T protein:vir:10 51 VMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEE 129 (324) T ss_pred HHhhchhhhhcceeeccCCceEEEEEeCCcceeEeccCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHH Confidence 99999999999999986 458999998889999999988765 5789999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhheeeccCCC-cceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccc Q lcl|NC_019921. 179 VRVQIEEAFAVALETAFLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGK 257 (381) Q Consensus 179 l~~~la~~~~~~~~~a~i~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~ 257 (381) |.+++++++++++|.++|+|+|++ .|.||++.+....... ....+.+.+.++...+ T Consensus 130 i~~~l~~ai~~~~d~a~l~G~g~~~~~~~i~~~~~~~~~~~----------------~~~~t~~~i~~~~~~l------- 186 (324) T protein:vir:10 130 MKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVI----------------KGDFTQDNIIDLEALL------- 186 (324) T ss_pred HHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceec----------------cccCCHHHHHHHHHhh------- Confidence 999999999999999999999986 7899886544322211 1123455666665544 Q ss_pred cccccCceEEEEchhhHHHHhhhhhccCCCCccccc-----cCCCceeEecCCCC--CCcEEEEeecceEEEeecceEEE Q lcl|NC_019921. 258 SVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQE--AGKVLTYVKGLYDGYLAGGINVQ 330 (381) Q Consensus 258 ~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~-----l~~G~pVv~s~~~p--~~~i~fgd~~~y~i~~r~~i~i~ 330 (381) ...++..++|+||+.++..++. .++.+|+|++. .++|+||+.++.++ ++.++||||++|+++++++++|+ T Consensus 187 ~~~~~~~~~~v~n~~~~~~L~~---l~d~~g~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~ 263 (324) T protein:vir:10 187 EDDELEANAFISKTQNRSLLRK---IVDPETKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYK 263 (324) T ss_pred hhccCCCCEEEEcHHHHHHHHH---hhccCCceeecCCCCccccceeEEeecCCCCCcceEEEEecccEEEEEecCcEEE Confidence 2346677789999999887764 47788888753 34899999988766 45699999999999999999999 Q ss_pred eehhhh--------------hhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 331 KFKETL--------------ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 331 ~~~~~~--------------~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) +++|.. |.+|+++||+.+|+|+++++++||++++...++.+. ||--. T Consensus 264 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~----~~~~~ 324 (324) T protein:vir:10 264 IDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDS----VPGEV 324 (324) T ss_pred EeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCC----CCCCC Confidence 999853 889999999999999999999999987765544432 33333 No 83 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=1.7e-53 Score=309.89 Aligned_cols=270 Identities=11% Similarity=-0.009 Sum_probs=214.9 Q ss_pred hccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEecCCcceEEeecccccccccCcceeeEeeccee Q lcl|NC_019921. 78 KNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNK 156 (381) Q Consensus 78 ~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~k 156 (381) -.+.++||++||+++.++|++.+++.++|+++|++++++ +..++|+.++.+.+.|++|+++.+ +++++|++++|.+|| T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~-~~~~~f~~v~l~~~k 79 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKS-ESTATFAPVTAIPRK 79 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccc-cccceeeEEEEeeEE Confidence 456778999999999999999999999999999999986 669999999999999999987766 689999999999999 Q ss_pred EEEeeeccHHhhh---cCHHHHHHHHHHHHHHHHHHHHhhheeeccCCC---cceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 157 LTAFVVLPKDLND---FGPAWIERFVRVQIEEAFAVALETAFLKGTGKD---QPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 157 l~~~~~iS~ell~---ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~---~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++++++||+|||+ |+.+++++||+++++++|++++|.+|++|+|.+ .|.||++.+..+..... T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~----------- 148 (311) T protein:vir:81 80 VQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVE----------- 148 (311) T ss_pred EEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeee----------- Confidence 9999999999995 677889999999999999999999999998643 46788765433222111 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc---------cCCCceeE Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~---------l~~G~pVv 301 (381) .+..........+..++..+. ...+ ....|+||+.++..+++ +++.+|+|+|. ..+|+||+ T Consensus 149 ~~~~~~~~~~~~i~~~~~~~~------~~~~-~~~~~vmn~~~~~~l~~---lkd~~G~~l~~~~~~~~~~~tl~G~Pv~ 218 (311) T protein:vir:81 149 LTTGTSATPDLAVEAAVGLVL------GDNL-SPDGVALDNTFSFMLAT---QRDSQGRKLYPELGFGTDVASFAGLNAA 218 (311) T ss_pred ecccccchHHHHHHHHHHHhh------hcCC-CceEEEEcHHHHHHHHh---hhccCCCeeecCccccCCCceecceeEE Confidence 111111111122222222211 1112 22359999999888765 47899999874 24799999 Q ss_pred ecCCCCCC------------------cEEEEeecceEEEeecceEEEeehhh-------hhhcCceEEEEEEEEcCEEec Q lcl|NC_019921. 302 ESTVQEAG------------------KVLTYVKGLYDGYLAGGINVQKFKET-------LALDDMDLYTAKQFAYGKAKD 356 (381) Q Consensus 302 ~s~~~p~~------------------~i~fgd~~~y~i~~r~~i~i~~~~~~-------~~~~d~~~~r~~~r~dGk~~~ 356 (381) .++.||.+ .++||||++|+++.|++++++++++. +|.+|++.||+..|+|++|++ T Consensus 219 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~ 298 (311) T protein:vir:81 219 VSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMS 298 (311) T ss_pred ecccccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeec Confidence 99999843 36899999999999999999998763 599999999999999999999 Q ss_pred CceEEEEEEEecC Q lcl|NC_019921. 357 NKVAAVWKLDLKG 369 (381) Q Consensus 357 ~~Afvv~~~~~~~ 369 (381) ++||++++-...+ T Consensus 299 ~~a~~~l~~a~~~ 311 (311) T protein:vir:81 299 TDAFAVVRDADES 311 (311) T ss_pred ccceEEEEeeccC Confidence 9999987766555 No 84 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=6.2e-53 Score=306.80 Aligned_cols=335 Identities=10% Similarity=-0.008 Sum_probs=224.7 Q ss_pred hhHHHHHHHHHHHHHHHHhhh--------hhHHHH----HHHHHHHHH---HHHHHHHHHHHHH-----HHHHH-Hhhcc Q lcl|NC_019921. 3 INLSETFANAKNEFINAVNNG--------EPQERQ----NELYGDMIN---QLFEETKLQAKAE-----AERVS-SLPKS 61 (381) Q Consensus 3 ~el~~~~~~~~~~~~~~~k~~--------~~~~~~----~~~~~~~~~---~~~~~~~~~~~~~-----~~~~~-~~~~~ 61 (381) |+..+++.+++++..+.+++. +...++ .+.++.... .+..+........ ..... ...+. T Consensus 1 M~~l~~l~~~~~~~~~e~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 80 (394) T protein:vir:10 1 MDKLQTLFNEVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPVDNAQPN 80 (394) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhccc Confidence 443333333333222222111 000011 111111111 1111110000000 00000 00000 Q ss_pred ccc----cCHHHHHHHHH------------HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEe Q lcl|NC_019921. 62 AQS----LSANQRSFFMD------------INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKS 124 (381) Q Consensus 62 ~~~----lt~~e~~~~~~------------~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~ 124 (381) ... ....+++.+.. ....++++||++||++++++|++.++++++|+++|+++++++ ..++|+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 160 (394) T protein:vir:10 81 GTDLKKKPIDAKKKAINDFIHSHGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPIL 160 (394) T ss_pred ccchhhhHHHHHHHHHHHHHhccchhhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEE Confidence 100 11222333322 234677889999999999999999999999999999999865 4778876 Q ss_pred cC-CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCc Q lcl|NC_019921. 125 ET-SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQ 203 (381) Q Consensus 125 ~~-~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~ 203 (381) .. .+.+.|++|+++.++.++++|++|++.+|+++++++||++||+||.+|+++||.++|+++++++++.+|++|+|+++ T Consensus 161 ~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~ 240 (394) T protein:vir:10 161 KRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFT 240 (394) T ss_pred ecCCCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 53 46788999988887778899999999999999999999999999999999999999999999999999999999988 Q ss_pred ceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhc Q lcl|NC_019921. 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 204 P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~ 283 (381) |.++.+. .+++.+.+++... .+..| +++|+||+.++..+.. + T Consensus 241 ~~~~~~~---------------------------~~~d~l~~~~~~~------~~~~~--~a~~vmn~~~~~~l~~---l 282 (394) T protein:vir:10 241 AKATTTD---------------------------TLVDSLKHILNVD------LDPAY--SRALVVTQSLFNTLDT---L 282 (394) T ss_pred ccccccc---------------------------ccHHHHHHHHHhh------hhhhc--cCEEEecHHHHHHHHH---h Confidence 8765421 1233444433211 11223 5789999999887765 4 Q ss_pred cCCCCccccc-------------cCCCceeEecCCC--CC--C--cEEEEeecc-eEEEeecceEEEeehhhhhhcCceE Q lcl|NC_019921. 284 LNANGVYVTA-------------LPFNLNVIESTVQ--EA--G--KVLTYVKGL-YDGYLAGGINVQKFKETLALDDMDL 343 (381) Q Consensus 284 ~~~~G~~~~~-------------l~~G~pVv~s~~~--p~--~--~i~fgd~~~-y~i~~r~~i~i~~~~~~~~~~d~~~ 343 (381) ++.+|+|+|. .++|+||++++++ |. + .|+||||++ |++++++++++.++++..|. ++ T Consensus 283 kd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~~~---~~ 359 (394) T protein:vir:10 283 KDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSKIYG---RY 359 (394) T ss_pred hccCCCeeeeccccccccCCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEecccccc---ee Confidence 7889998863 2489999876543 32 2 289999998 88999999999999987765 57 Q ss_pred EEEEEEEcCEEecCceEEEEEEEecCCccccccCcc Q lcl|NC_019921. 344 YTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 344 ~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~ 379 (381) ||+++|+||++++++||++++++-+ .++++.++-- T Consensus 360 ~~~~~r~d~~~~~~~ai~~~~~~~~-~~~~~~~~~~ 394 (394) T protein:vir:10 360 LGAAFRFGVKQADSNAGYFVTNTDA-ASGSTSGTGK 394 (394) T ss_pred EEEEEEeccEEeccccEEEEEeecc-cCCCCCCCCC Confidence 9999999999999999999887664 3344444444 No 85 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=6.6e-54 Score=312.15 Aligned_cols=275 Identities=13% Similarity=0.021 Sum_probs=224.5 Q ss_pred cCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEeecccccccc Q lcl|NC_019921. 65 LSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKIYGEIKGQ 142 (381) Q Consensus 65 lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~a~wv~e~~~~~~~ 142 (381) ++.+..+ ..+..+.+++|.+||++++++|++.+++.++|+++|+++++++ ...+|+..+.+.+.|++|+++.+ + T Consensus 1 m~~~~~~---~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~-~ 76 (297) T protein:vir:95 1 MTVQTFN---PENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKIK-T 76 (297) T ss_pred CCccccc---cccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCcccc-c Confidence 3333332 3334456678889999999999999999999999999999764 46788888889999999988765 5 Q ss_pred cCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccc Q lcl|NC_019921. 143 LDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAY 222 (381) Q Consensus 143 ~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~ 222 (381) ++++|+++++.+||++++++||+|||+||.+|+++||++++++++++++|.+|++|+|+++|.||++.+........ T Consensus 77 ~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~--- 153 (297) T protein:vir:95 77 DKPEVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIG--- 153 (297) T ss_pred cccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecc--- Confidence 78999999999999999999999999999999999999999999999999999999999999999875433221111 Q ss_pred cceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCcccccc----CCCc Q lcl|NC_019921. 223 PEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL----PFNL 298 (381) Q Consensus 223 ~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l----~~G~ 298 (381) ...+++.+.++...+ ...+..+++|+||+.++..++. .++.+|+|+|.. ++|+ T Consensus 154 -------------~~~t~~~i~~~~~~l-------~~~~~~~~~~v~~~~~~~~L~~---l~d~~G~~i~~~~~~~l~G~ 210 (297) T protein:vir:95 154 -------------GPINYDNILKLQDAL-------YDADVEPNAFVSKIQNRSALRE---ARDGNKVSIYDKAANTIDGI 210 (297) T ss_pred -------------cccCHHHHHHHHHHh-------hhccCCcCEEEEcHHHHHHHHH---hhccCCceeecCCCCcccce Confidence 112345565655544 2345667889999999887764 478889998864 3799 Q ss_pred eeEecCC--CCCCcEEEEeecceEEEeecceEEEeehhhh--------------hhcCceEEEEEEEEcCEEecCceEEE Q lcl|NC_019921. 299 NVIESTV--QEAGKVLTYVKGLYDGYLAGGINVQKFKETL--------------ALDDMDLYTAKQFAYGKAKDNKVAAV 362 (381) Q Consensus 299 pVv~s~~--~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~--------------~~~d~~~~r~~~r~dGk~~~~~Afvv 362 (381) ||+.+.. +++++++||||++|+++++++++++.+++.. |.+|++.||+.+|+|+++++++||++ T Consensus 211 Pv~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~ 290 (297) T protein:vir:95 211 TTVDLKSARFEKGDLLAGDFDNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAK 290 (297) T ss_pred eeEeecCCCCCCceEEEEecccEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEE Confidence 9987654 5678899999999999999999999999865 88999999999999999999999997 Q ss_pred EEEEecCCccc Q lcl|NC_019921. 363 WKLDLKGHKPA 373 (381) Q Consensus 363 ~~~~~~~~~~~ 373 (381) ++ ..+|. T Consensus 291 l~----~at~~ 297 (297) T protein:vir:95 291 LT----PAERV 297 (297) T ss_pred Ee----ecCCC Confidence 54 22333 No 86 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=4.6e-52 Score=302.01 Aligned_cols=332 Identities=11% Similarity=0.022 Sum_probs=221.0 Q ss_pred CchhHHHHHHHHH-HHHHHHHhhhhhH-HHHHHHHHHHH--------HHHHHHHHH-HHHHHHHHHHHhhccc-cccCHH Q lcl|NC_019921. 1 MTINLSETFANAK-NEFINAVNNGEPQ-ERQNELYGDMI--------NQLFEETKL-QAKAEAERVSSLPKSA-QSLSAN 68 (381) Q Consensus 1 mt~el~~~~~~~~-~~~~~~~k~~~~~-~~~~~~~~~~~--------~~~~~~~~~-~~~~~~~~~~~~~~~~-~~lt~~ 68 (381) |++...++..+.. .++.........+ +...+...+.+ +.+..+... +.+.+..+........ .....+ T Consensus 1 m~~~e~~~~~~~~~~~l~~~~~~~~~e~~~~~e~~~~~~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~ 80 (379) T protein:vir:10 1 MEALEIKVALEAIKGQVDSKSSAQALEVKGLIEALEAKMTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAKSEDKSDS 80 (379) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccchh Confidence 7766443332222 2222111100000 00011111111 111100000 0000000000001100 000000 Q ss_pred HH-------HHHH------HH------hhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecC-- Q lcl|NC_019921. 69 QR-------SFFM------DI------NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSET-- 126 (381) Q Consensus 69 e~-------~~~~------~~------~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~-- 126 (381) .. +... .. ...++++++.+||++++..|++.+++.++|+++|+++++++ .+++|+.++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 160 (379) T protein:vir:10 81 LVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRENGAG 160 (379) T ss_pred HHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEeecCC Confidence 00 0000 00 11234566678999999999999999999999999999865 489998764 Q ss_pred CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceE Q lcl|NC_019921. 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIG 206 (381) Q Consensus 127 ~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~G 206 (381) .+.+.|++|++..+ +++++|++|++.+|+++++++||++||+|+. ++++||.++++++++++++.+|+.|+|++.+.+ T Consensus 161 ~~~~~~v~Eg~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~-~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~ 238 (379) T protein:vir:10 161 EGAIGAQVEGATKG-QKDYDISMIDVNTDFIAGFTRYSKKMANNLP-FLTSFIPNALRRDYAKAENAAFNAVLAANATAS 238 (379) T ss_pred CcccccccCCcccc-ccccceeeeEeeeeeEEeeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 45678999987665 6789999999999999999999999999986 599999999999999999999999998765544 Q ss_pred eeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCC Q lcl|NC_019921. 207 LNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA 286 (381) Q Consensus 207 il~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~ 286 (381) ..+. .....++.+.+++..+ ...+..+..|+|||.++..++. .+++ T Consensus 239 ~~~~------------------------~~~~~~d~i~~~~~~~-------~~~~~~~~~~vmn~~~~~~l~~---lkd~ 284 (379) T protein:vir:10 239 TEII------------------------TNKNKVEMLINEIAKQ-------ENLDFPVTAIVLRPTDYYDILV---TQKS 284 (379) T ss_pred cccc------------------------cCcccHHHHHHHHHhh-------hhccCCCCEEEEcHHHHHHHHH---hhcc Confidence 3321 0112234444444333 2245566779999999887765 4788 Q ss_pred CCccccc-----------cCCCceeEecCCCCCCcEEEEeecceEEEeecceEEEeehhh--hhhcCceEEEEEEEEcCE Q lcl|NC_019921. 287 NGVYVTA-----------LPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGK 353 (381) Q Consensus 287 ~G~~~~~-----------l~~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~--~~~~d~~~~r~~~r~dGk 353 (381) +|+|+|+ .++|+||+++++||+|+++||||++|.+.+|.++.|+.+++. +|.+|++.||+.+|+|++ T Consensus 285 ~G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~ 364 (379) T protein:vir:10 285 VGAGYGLPGVVTQDNGVLRINGIPLFRATWLAANKYYVGDWTRVTKVTTEGLSLEFSEVEGTNFVKNNITARIEAQVALA 364 (379) T ss_pred CCceeccCCccCCCCCcceecceeeEecCCCCCCceEEeecccEEEEEEeceEEEEeecccccccCCcEEEEEEEEeccE Confidence 9999874 247999999999999999999999999999999999888765 599999999999999999 Q ss_pred EecCceEEEEEEEecCC Q lcl|NC_019921. 354 AKDNKVAAVWKLDLKGH 370 (381) Q Consensus 354 ~~~~~Afvv~~~~~~~~ 370 (381) +++|+|||.+++. +. T Consensus 365 v~~p~a~v~~~~~--~~ 379 (379) T protein:vir:10 365 VEQPAALIFGDFT--AV 379 (379) T ss_pred EecCccEEEEEec--CC Confidence 9999999875544 44 No 87 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=1.1e-53 Score=310.94 Aligned_cols=283 Identities=13% Similarity=0.028 Sum_probs=217.7 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceEEeecccccc----cccCcceeeE Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIK----GQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~~~~a~wv~e~~~~~----~~~~~~f~~v 150 (381) |...++++||++||++++++|++.+++.+||+++++++++++ ..++|+.++.+.+.|++|++..+ +.++++|+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~i 80 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceeeE Confidence 777888899999999999999999999999999999999865 58999999999999999987643 3468999999 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++.+||++++++||+|||+||.+|+++||+++|++++++++|.+|++|+|++++.+............... . T Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~-------~- 152 (305) T protein:vir:25 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAV-------E- 152 (305) T ss_pred EeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccc-------c- Confidence 99999999999999999999999999999999999999999999999999765544432222111111100 0 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc--cCCCceeEecCCCCC Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA--LPFNLNVIESTVQEA 308 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~--l~~G~pVv~s~~~p~ 308 (381) +........+.+..+........ ...+..+ .|+||+.++..+++ .++.+|+|+|+ ..+|+||++++.+|. T Consensus 153 -~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~-~~v~~~~~~~~l~~---lkd~~G~~i~~~~~l~G~Pv~~~~~~~~ 224 (305) T protein:vir:25 153 -VVGGVANESDIVGATNRAAKAVA---SAGWAPD-TLLSSLALRYEVAN---IRDANGNPVFRDDSFAGFRTFFNRNGAW 224 (305) T ss_pred -ccccchhhhHHHHHHHHHHHhhh---hcccccc-eeEecHHHHHHHHH---hhccCCceeecCCcccccceEEcCccCC Confidence 00011111111122221111111 1123333 49999999888765 47899999986 458999999999984 Q ss_pred ----CcEEEEeecceEEEeecceEEEeehhh----------hhhcCceEEEEEEEEcCEEecCceEEEEEEEecC-Cccc Q lcl|NC_019921. 309 ----GKVLTYVKGLYDGYLAGGINVQKFKET----------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG-HKPA 373 (381) Q Consensus 309 ----~~i~fgd~~~y~i~~r~~i~i~~~~~~----------~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~-~~~~ 373 (381) +.++||||++|+++++++++|+.+++. .|.+|++.+|+..|+|+.+++++||+.++..-.+ .+|+ T Consensus 225 ~~~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~pa 304 (305) T protein:vir:25 225 DADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) T ss_pred CCCccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccCCC Confidence 358999999999999999999999875 4788999999999999999999999976644222 2444 Q ss_pred c Q lcl|NC_019921. 374 L 374 (381) Q Consensus 374 ~ 374 (381) + T Consensus 305 ~ 305 (305) T protein:vir:25 305 A 305 (305) T ss_pred C Confidence 4 No 88 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=1.2e-52 Score=305.18 Aligned_cols=354 Identities=10% Similarity=0.028 Sum_probs=225.1 Q ss_pred CchhHHHHHHHHHHHHHH----HHhhhhhH------HHHHHHHHHHHHHHHHHHH------HHHHH----HHHH------ Q lcl|NC_019921. 1 MTINLSETFANAKNEFIN----AVNNGEPQ------ERQNELYGDMINQLFEETK------LQAKA----EAER------ 54 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~----~~k~~~~~------~~~~~~~~~~~~~~~~~~~------~~~~~----~~~~------ 54 (381) |+.++. ++++++.++.. .+.+.+.+ +++.+.+....+.+..+.. .+.+. .... T Consensus 8 m~~~i~-eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~~~~~~~~~~~~~~~~ 86 (477) T protein:vir:84 8 LRALRA-AAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIRELESEIERSGKLEAE 86 (477) T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhh Confidence 665432 33333333322 22111111 1111112111111111000 00000 0000 Q ss_pred ---------------------------H-HHhhccccccCH---------------HHHHHHH-----HHhhccCCCCce Q lcl|NC_019921. 55 ---------------------------V-SSLPKSAQSLSA---------------NQRSFFM-----DINKNVNYKEEK 86 (381) Q Consensus 55 ---------------------------~-~~~~~~~~~lt~---------------~e~~~~~-----~~~~~~~~~gg~ 86 (381) . ....++...... +++.... ....+++..||+ T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~ 166 (477) T protein:vir:84 87 TKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGY 166 (477) T ss_pred hhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcce Confidence 0 000000000000 0000000 001234567889 Q ss_pred eccHHH-HHHHHHHHHhhhhhhhhceeEecC---CceEEEEecCC-cceEEeeccccc----ccccCcceeeEeecceeE Q lcl|NC_019921. 87 LLPEET-IDRIFEDLTTNHPLLADLGIKNAG---LRLKFLKSETS-GVAVWGKIYGEI----KGQLDAAFSEETAIQNKL 157 (381) Q Consensus 87 lvP~~~-~~~I~~~l~~~~~l~~~~~v~~~~---g~~~~p~~~~~-~~a~wv~e~~~~----~~~~~~~f~~v~l~~~kl 157 (381) +||+++ .++|++.++..++|+++|++++++ +.+++|+..++ ..+.|++|++.. .++++++|+++++.+|++ T Consensus 167 lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~ 246 (477) T protein:vir:84 167 AVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTI 246 (477) T ss_pred eeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeE Confidence 988875 688999999999999999988754 46899986544 567899997643 346788999999999999 Q ss_pred EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCC-CcceEeeeccccccccccccccceeeeeeeccccc Q lcl|NC_019921. 158 TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGK-DQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANP 236 (381) Q Consensus 158 ~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~-~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~ 236 (381) +++++||++||+||.+++++||.++|++++++++|.+||+|+|+ ++|.||++........... ...+.... T Consensus 247 ~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~--------~~~t~~~~ 318 (477) T protein:vir:84 247 AGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATS--------AGSALEKH 318 (477) T ss_pred EeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccc--------cccchhhH Confidence 99999999999999999999999999999999999999999997 5999999754322111110 01111222 Q ss_pred chhHHHHHHHHHHhhhcccccccccc-CceEEEEchhhHHHHhhhhhccCCCCccccc---------------------- Q lcl|NC_019921. 237 RATVNELTQVFKYHSTNEKGKSVAVK-GNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------------------- 293 (381) Q Consensus 237 ~~~~~~l~~l~~~l~~~~~~~~~~~~-~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~---------------------- 293 (381) ..+++.+.++...+ ...|. +...|+||+.++..++++ ++.+|+|+|. T Consensus 319 ~~~~~~i~~~~~~~-------~~~~~~~~~~~v~~~~~~~~l~~l---kd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~ 388 (477) T protein:vir:84 319 QIIYQKIADAIQRV-------HTSRFLEPEVIVMHPRRWASFHAI---FAGDDRPLIVPSGPGFNNLGVLTEVASQRVVG 388 (477) T ss_pred HHHHHHHHHHHhhc-------cccccCCccEEEEcHHHHHHHHHh---hccCCCeeeecCcccccccccccccccccccc Confidence 23334444433322 22343 445799999998877654 7889998874 Q ss_pred cCCCceeEecCCCCCC--------cEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEec-CceEEEEE Q lcl|NC_019921. 294 LPFNLNVIESTVQEAG--------KVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKD-NKVAAVWK 364 (381) Q Consensus 294 l~~G~pVv~s~~~p~~--------~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~-~~Afvv~~ 364 (381) ..+|+||++++.||++ .|+||||++|++++ .++++.++++.++.++++.|+...++++++++ |+|||+++ T Consensus 389 ~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~-~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t 467 (477) T protein:vir:84 389 QMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFE-SSVRMRALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIG 467 (477) T ss_pred hhcccceEecCcccccccccCCcceEEEEEeceEEEEe-eceeEEeccccccccceeeeeehhhhhhhhhccccceEEee Confidence 2369999999999964 48999999998887 57999999999999999999999999998886 99999755 Q ss_pred EEecCCccccccC Q lcl|NC_019921. 365 LDLKGHKPALEGT 377 (381) Q Consensus 365 ~~~~~~~~~~~~~ 377 (381) .++.+ +.+++ T Consensus 468 --~~~~~-~~~~~ 477 (477) T protein:vir:84 468 --GTALT-APTFA 477 (477) T ss_pred --ccccc-ccccC Confidence 43333 33344 No 89 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=3.7e-52 Score=302.56 Aligned_cols=338 Identities=10% Similarity=-0.016 Sum_probs=215.9 Q ss_pred CchhHH----HHHHHHHHHHHHHHhhhhh-----------HHHH-------------HHHHHHHHHHHH---HHHHHH-- Q lcl|NC_019921. 1 MTINLS----ETFANAKNEFINAVNNGEP-----------QERQ-------------NELYGDMINQLF---EETKLQ-- 47 (381) Q Consensus 1 mt~el~----~~~~~~~~~~~~~~k~~~~-----------~~~~-------------~~~~~~~~~~~~---~~~~~~-- 47 (381) +..+.. ++..+..+++.+.+++.+. ..++ ............ ...... T Consensus 29 ~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~~~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~ 108 (437) T protein:vir:10 29 ESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDDSDLVAPELEENSADNEEDDPEKLKTETKSEAE 108 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111100 0000000011111100000 0000 000000000000 000000 Q ss_pred -----HHHHHHHHHHh-----hccccccCHHHHHHH---------HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhh Q lcl|NC_019921. 48 -----AKAEAERVSSL-----PKSAQSLSANQRSFF---------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLA 108 (381) Q Consensus 48 -----~~~~~~~~~~~-----~~~~~~lt~~e~~~~---------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~ 108 (381) ......+.... ..........+++.+ ......++.+||++||+++.+.|. .+++.++|++ T Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i~-~~~~~~~l~~ 187 (437) T protein:vir:10 109 KDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIALKDGKVIIPETILTPEK-EVHQFPRLGS 187 (437) T ss_pred HHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcccccccccchHHHHHHHH-Hhhhhhhhhh Confidence 00000000000 000000111111111 123445778999999999998775 5688999999 Q ss_pred hceeEecC-CceEEEEec-CCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHH Q lcl|NC_019921. 109 DLGIKNAG-LRLKFLKSE-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEA 186 (381) Q Consensus 109 ~~~v~~~~-g~~~~p~~~-~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~ 186 (381) +|++++++ +...+|+.. ..+.+.|++|++..+..++++|++|++.+|+++++++||++||+||.+||++||.++|+++ T Consensus 188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~ 267 (437) T protein:vir:10 188 LVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITPILWDLKTYTGGYVFSQELISDSSYDWQAELQSRLIEL 267 (437) T ss_pred cceeEeeccCceeeEEeeccccccccccccccccccccccceeeeeehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHH Confidence 99999875 557788764 4577899999888876778999999999999999999999999999999999999999999 Q ss_pred HHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceE Q lcl|NC_019921. 187 FAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVT 266 (381) Q Consensus 187 ~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~ 266 (381) ++++++.+|++|+|+++|.+..+. ..+.+.+++.. ..+..|+.+++ T Consensus 268 ~~~~~~~~i~~g~g~~~~~~~~~~----------------------------~~~~~~~~~~~------~l~~~~~~~~~ 313 (437) T protein:vir:10 268 RDNTDDSLIITALTDGIKKTTSTY----------------------------LLGDLKKVLNV------TLKPQDSAAAS 313 (437) T ss_pred HHHHHHHHHhhhhccccccccccc----------------------------chhhHHHHHHh------hhhhhhhcCCE Confidence 999999999999998877543210 11223333221 12456888999 Q ss_pred EEEchhhHHHHhhhhhccCCCCccccc---------cCCCceeEecCCC--CC---Cc--EEEEeecc-eEEEeecceEE Q lcl|NC_019921. 267 MVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQ--EA---GK--VLTYVKGL-YDGYLAGGINV 329 (381) Q Consensus 267 ~~mn~~t~~~~~~~~~~~~~~G~~~~~---------l~~G~pVv~s~~~--p~---~~--i~fgd~~~-y~i~~r~~i~i 329 (381) |+||+.++..+.. +++.+|+|+|. ..+|+||+++++| |. ++ ++||||++ |.+++|+++++ T Consensus 314 ~~~~~~~~~~l~~---lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~ 390 (437) T protein:vir:10 314 IVMSQSAYNLFDM---ATDAMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLTEITG 390 (437) T ss_pred EEEcHHHHHHHHH---hhccCCCeeeccCccCCCCcccccceeEEecccccCCcCCCceEEEEeeccccEEEEeeeceEE Confidence 9999999877754 47899999984 2489999987765 53 22 89999997 77999999999 Q ss_pred EeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcc Q lcl|NC_019921. 330 QKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 330 ~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~ 379 (381) ..+++ +..+.+.+|+.+|+||++++++|||+++.++.+.+ .+..|+- T Consensus 391 ~~~~~--~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~-~~~~~~~ 437 (437) T protein:vir:10 391 QFQDT--YDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVT-VVQSTAV 437 (437) T ss_pred EEecc--cccccceeeEEEEEccEEecccceEEEEeeccccc-cCCCCCC Confidence 98775 56678899999999999999999999887753322 2222222 No 90 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=1.4e-52 Score=304.82 Aligned_cols=301 Identities=13% Similarity=0.039 Sum_probs=224.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC Q lcl|NC_019921. 37 INQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG 116 (381) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~ 116 (381) +++.++. + .....+.+... .. ..+++.......++|++||++++++|++.+++.++|++++++++++ T Consensus 1 ~~~~~~~-~-~~~~~f~~~~~---~~--------~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~ 67 (324) T protein:vir:96 1 MEQTQKL-K-LNLQHFASNNV---KP--------QVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME 67 (324) T ss_pred CCcchhh-h-HHHHHHHHhhh---hh--------hhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecc Confidence 0000000 0 00000000000 00 0111222233456778999999999999999999999999999986 Q ss_pred C-ceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhhe Q lcl|NC_019921. 117 L-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF 195 (381) Q Consensus 117 g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~ 195 (381) + .+++|+.++.+.+.|++|++..+ +++++|+++++.+||++++++||+|||+||.+++++||.+++++++++++|.+| T Consensus 68 ~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~ 146 (324) T protein:vir:96 68 GTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAG 146 (324) T ss_pred CCceEEEEEecCcceeeecCCcccc-ccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHh Confidence 5 58999999889999999987765 578999999999999999999999999999999999999999999999999999 Q ss_pred eeccCCC-cceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhH Q lcl|NC_019921. 196 LKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDA 274 (381) Q Consensus 196 i~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~ 274 (381) |+|+|++ .|.|++.......... ....+++.+.++...+ ...+..+++|+||+.++ T Consensus 147 l~G~g~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~i~~~~~~i-------~~~~~~~~~~i~n~~~~ 203 (324) T protein:vir:96 147 ILNQGNNPFGKSIAQSIKKTNKVI----------------KGDFTQDNIIDLEALL-------EDDELEANAFISKTQNR 203 (324) T ss_pred hhcCCCCCcCccccccccccceec----------------ccccchHHHHHHHHhh-------hhccCCCCEEEEcHHHH Confidence 9999975 6788875433221111 1122345566555443 23456677899999998 Q ss_pred HHHhhhhhccCCCCccccc-----cCCCceeEecCCCC--CCcEEEEeecceEEEeecceEEEeehhh------------ Q lcl|NC_019921. 275 FEVQAQYTHLNANGVYVTA-----LPFNLNVIESTVQE--AGKVLTYVKGLYDGYLAGGINVQKFKET------------ 335 (381) Q Consensus 275 ~~~~~~~~~~~~~G~~~~~-----l~~G~pVv~s~~~p--~~~i~fgd~~~y~i~~r~~i~i~~~~~~------------ 335 (381) ..++. .++.+|+|+.. ..+|+||+.+..++ ++.++||||++|+++++++++|+.++|. T Consensus 204 ~~L~~---lkd~~G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 280 (324) T protein:vir:96 204 SLLRK---IVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTP 280 (324) T ss_pred HHHHH---hhCCCCCeeecCCCCCcccceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccc Confidence 77654 47888988653 34799999877654 5669999999999999999999999885 Q ss_pred --hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 336 --LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 336 --~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) +|.+|++.||+.+|+|+++++++||++++. ..+.++-||-.+ T Consensus 281 ~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~----a~~~~~~~~~~~ 324 (324) T protein:vir:96 281 VNLFEQDMVALRATMHVALHIADDKAFAKLVP----ADKRTDSVPGEV 324 (324) T ss_pred hhhhhcCcEEEEEEEEeccEEecccceEEEec----ccccCCCCCCCC Confidence 488999999999999999999999997663 344444466666 No 91 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=1.1e-51 Score=300.05 Aligned_cols=344 Identities=15% Similarity=0.124 Sum_probs=226.0 Q ss_pred CchhH-HHHHHHHHHHHHHHHhhh----h-----hHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHH------------ Q lcl|NC_019921. 1 MTINL-SETFANAKNEFINAVNNG----E-----PQERQNELYGD---MINQLFEETKLQAKAEAERV------------ 55 (381) Q Consensus 1 mt~el-~~~~~~~~~~~~~~~k~~----~-----~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~------------ 55 (381) |++.. .+++.+.+.++.+.++.. . ..+++.+.++. .++.+..+.......+...+ T Consensus 193 ~~~~e~i~~l~~~ra~~~~~~~~l~~~a~~~g~~l~aee~~~~d~l~aei~~l~~~i~r~e~~e~~~a~~a~pv~~~~~~ 272 (645) T protein:vir:93 193 MNIGEQIKSFENKRAALAASLEEVMTKAAEEGRTLDVEEEEHYDNTAAEIRQVDAHLKRLRELEAGKAATAQPVKQAGNG 272 (645) T ss_pred cchhhhhhhhhHHHHHHHHHhhhhhhhHhhhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 66432 223333333332222111 0 00112222222 22222222111000000000 Q ss_pred -----------------------------HHhhccccccCHHH---------HHH---H-HHHh----hccCCCCceecc Q lcl|NC_019921. 56 -----------------------------SSLPKSAQSLSANQ---------RSF---F-MDIN----KNVNYKEEKLLP 89 (381) Q Consensus 56 -----------------------------~~~~~~~~~lt~~e---------~~~---~-~~~~----~~~~~~gg~lvP 89 (381) .....+.. ..+.+ .+. + .++. .++.+.||+++| T Consensus 273 ~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~-~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp 351 (645) T protein:vir:93 273 NVAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGVR-SEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEY 351 (645) T ss_pred ccccccccccccchhhhhhhhhHHHHHHHHHhcccch-hHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccCc Confidence 00000000 00000 000 0 1111 223456999999 Q ss_pred HHHHHHHHHHHHhhhhhhhhceeE-e----cCCceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeecc Q lcl|NC_019921. 90 EETIDRIFEDLTTNHPLLADLGIK-N----AGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLP 164 (381) Q Consensus 90 ~~~~~~I~~~l~~~~~l~~~~~v~-~----~~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS 164 (381) +++..+|++.++..+++++++... + +.+++++|+.++.+.++|++|++..+ +++++|+++++.+||+++++++| T Consensus 352 ~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~-~s~~~f~~v~l~~~kla~~~~iS 430 (645) T protein:vir:93 352 QEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKP-LTKFDFESITFSHAKVSAIAVLT 430 (645) T ss_pred hhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCcccc-ccccceeEEEEeeEEEEEeehhH Confidence 999999999999999999986543 2 24678999999999999999987765 67999999999999999999999 Q ss_pred HHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCC----cceEeeeccccccccccccccceeeeeeecccccchhH Q lcl|NC_019921. 165 KDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD----QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATV 240 (381) Q Consensus 165 ~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~----~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~ 240 (381) +|||+||.+++++||+++++++|++++|.+||+|+|++ +|.||+....... . ..... T Consensus 431 ~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~~--~-----------------~~~~~ 491 (645) T protein:vir:93 431 EELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKGTA--S-----------------SGNPD 491 (645) T ss_pred HHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccccccc--c-----------------ccchH Confidence 99999999999999999999999999999999998864 6888864321110 0 00111 Q ss_pred HHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc-------cCCCceeEecCCCCCCcEEE Q lcl|NC_019921. 241 NELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA-------LPFNLNVIESTVQEAGKVLT 313 (381) Q Consensus 241 ~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~-------l~~G~pVv~s~~~p~~~i~f 313 (381) ..+..++..+.. ......+++|+|||.++..+.. +++++|+|++. ..+|+||+.+++||+ +++| T Consensus 492 ~d~~~~~~~~~~-----a~~~~~~a~~vmn~~~~~~L~~---lkd~~G~~~~~~~~~~~~tL~G~PV~~s~~vp~-~~~~ 562 (645) T protein:vir:93 492 ADAEAAFGQFVA-----ANLQPTGAVWLMSSTNALALSM---RKNALGQKEYPDMTLLGGSFQGLPVIVSQYVGD-QLVL 562 (645) T ss_pred HHHHHHHHHHHh-----cCCCccccEEEEcHHHHHHHHh---ccccCCceeecCCCCCCceeeceeeEEeccCCc-ceeE Confidence 223333332211 1123457889999999877754 47889988752 237999999999996 4789 Q ss_pred EeecceEEEeecceEEEeehhhh----------------------hhcCceEEEEEEEEcCEEecCceEEEEEEEecCCc Q lcl|NC_019921. 314 YVKGLYDGYLAGGINVQKFKETL----------------------ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 314 gd~~~y~i~~r~~i~i~~~~~~~----------------------~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~ 371 (381) |||++|++++++++.|..+++.. |.+|+++||+.+|+|+++++|+|||+++ -+.-. T Consensus 563 gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt--~~~~g 640 (645) T protein:vir:93 563 VNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVIT--GVNYG 640 (645) T ss_pred eccccEEEEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEe--cccCC Confidence 99999999999999998877642 8899999999999999999999999755 33444 Q ss_pred ccccc Q lcl|NC_019921. 372 PALEG 376 (381) Q Consensus 372 ~~~~~ 376 (381) .+.-| T Consensus 641 ~~~~~ 645 (645) T protein:vir:93 641 SASGG 645 (645) T ss_pred cccCC Confidence 55555 No 92 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=2.3e-52 Score=303.68 Aligned_cols=322 Identities=11% Similarity=0.038 Sum_probs=214.5 Q ss_pred CchhHHHHHHHHHHHHHH-------HHhhhhhHHHH---HHHH---HHHHHHHHHHHHH---HHHH-------------- Q lcl|NC_019921. 1 MTINLSETFANAKNEFIN-------AVNNGEPQERQ---NELY---GDMINQLFEETKL---QAKA-------------- 50 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~-------~~k~~~~~~~~---~~~~---~~~~~~~~~~~~~---~~~~-------------- 50 (381) |.-+ .+++.++++++.. .+.+...+++. .+.+ ...++.+.+.... +.+. T Consensus 15 l~~~-l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~~l~~~~~~~~~~~~~ 93 (397) T protein:vir:96 15 RSSE-IDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQDLEDELAKAADPTDQ 93 (397) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh Confidence 1111 1122222222222 22211111100 0111 1111111111110 0000 Q ss_pred ----HHHHHHHhhccccccCHHHHHHHHH---------HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC Q lcl|NC_019921. 51 ----EAERVSSLPKSAQSLSANQRSFFMD---------INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL 117 (381) Q Consensus 51 ----~~~~~~~~~~~~~~lt~~e~~~~~~---------~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g 117 (381) ...+.............+++..+.. ....+..+||++||+++.+.|++ +.+.+++++.|+++++++ T Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~ 172 (397) T protein:vir:96 94 KPKDGEKRKMKKFKVTEEELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PKDIVDLSKYVRSVPVNS 172 (397) T ss_pred hhHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHH-hhhhhhHHHhhhhccccc Confidence 0000000000011111122222221 12345678999999999999997 678889999999988754 Q ss_pred -ceEEEEec-CCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhhe Q lcl|NC_019921. 118 -RLKFLKSE-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF 195 (381) Q Consensus 118 -~~~~p~~~-~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~ 195 (381) ...+|+.. +.+.+.|++|++..+..++++|++|++.+|++++++++|++||+||.+|+++||.++++++++++++.+| T Consensus 173 ~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i 252 (397) T protein:vir:96 173 ASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADI 252 (397) T ss_pred cceeEEEEeccCCccccccccccccccccccccceeecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 46666643 3466789999888876789999999999999999999999999999999999999999999999999999 Q ss_pred eeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHH Q lcl|NC_019921. 196 LKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAF 275 (381) Q Consensus 196 i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~ 275 (381) ++|+|.++|.|+.+ ++.+.+++.... + .+ .+++|+||+.++. T Consensus 253 ~~g~g~~~~~~~~~------------------------------~d~~~~~~~~~~------~-~~-~~a~~v~n~~~~~ 294 (397) T protein:vir:96 253 AAVLKTATAKSVVG------------------------------VDGLKDLINKEI------K-KV-YDVKLFISASMYS 294 (397) T ss_pred hhcccccccccccc------------------------------hHHHHHHHHHhh------h-hh-cCcEEEEcHHHHH Confidence 99999998877542 233334433211 1 12 2678999999988 Q ss_pred HHhhhhhccCCCCccccc---------cCCCceeEecCCC-CC-----CcEEEEeecc-eEEEeecceEEEeehhhhhhc Q lcl|NC_019921. 276 EVQAQYTHLNANGVYVTA---------LPFNLNVIESTVQ-EA-----GKVLTYVKGL-YDGYLAGGINVQKFKETLALD 339 (381) Q Consensus 276 ~~~~~~~~~~~~G~~~~~---------l~~G~pVv~s~~~-p~-----~~i~fgd~~~-y~i~~r~~i~i~~~~~~~~~~ 339 (381) .++. +++.+|+|+|. ..+|+||+.++++ |. ..++||||++ |++++|+++++..+++.+| T Consensus 295 ~l~~---lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~-- 369 (397) T protein:vir:96 295 ELDK---LKDKNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDNNIY-- 369 (397) T ss_pred HHHH---hhccCCCeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEeecceEEEEeccccc-- Confidence 8765 47899999984 3479999876543 32 2389999998 7899999999999998665 Q ss_pred CceEEEEEEEEcCEEecCceEEEEEEEec Q lcl|NC_019921. 340 DMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 340 d~~~~r~~~r~dGk~~~~~Afvv~~~~~~ 368 (381) .++||+++|+||+|++++|||+++++.+ T Consensus 370 -~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 370 -GQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred -ceeEEEEEEEccEEecccceEEEEeecC Confidence 5789999999999999999999999997 No 93 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=3.3e-52 Score=302.81 Aligned_cols=288 Identities=11% Similarity=0.021 Sum_probs=216.6 Q ss_pred ccCHHHHHHHHHHhhcc------CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcc------- Q lcl|NC_019921. 64 SLSANQRSFFMDINKNV------NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGV------- 129 (381) Q Consensus 64 ~lt~~e~~~~~~~~~~~------~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~~~~------- 129 (381) .-+-.| +.....+. .+.++.+||++++++|++.+++.++|+++|+++++++ .+++|+.+..+. T Consensus 1 ~~~~~e---~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~ 77 (338) T protein:vir:78 1 MATLNE---LAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVG 77 (338) T ss_pred CcchHH---hhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeeccc Confidence 001111 11222222 2234558999999999999999999999999999865 589999876554 Q ss_pred -eEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCC---cce Q lcl|NC_019921. 130 -AVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD---QPI 205 (381) Q Consensus 130 -a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~---~P~ 205 (381) +.|++|++..+ +++++|+++++.+||++++++||+|||+||.+++++||.++|++++++++|.+|++|+|++ +|. T Consensus 78 ~~~~~~Eg~~~~-~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~ 156 (338) T protein:vir:78 78 TSNEQREGGTKP-LSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQ 156 (338) T ss_pred cccccccccccc-ccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccc Confidence 44556766554 6789999999999999999999999999999999999999999999999999999999975 567 Q ss_pred EeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC Q lcl|NC_019921. 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN 285 (381) Q Consensus 206 Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~ 285 (381) ||++.......... ..........++.+.++...+..+ .......|+||+.++..+......+| T Consensus 157 gi~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~m~~~~~~~L~~~~~l~d 220 (338) T protein:vir:78 157 GIDTNNVIVNTTNV----------DYLQTGTTPLLDRFLDGYDLVSAN------TDVDFNGWAADPRYRARLLRSQAYRD 220 (338) T ss_pred cccccccccccccc----------ccccccchhhHHHHHHHHHHhhhh------ccccceEEEEchHHHHHHHHHhhhcc Confidence 77654322111100 011122234455555554443221 12344569999999988877777899 Q ss_pred CCCccccc---------cCCCceeEecCCCCCC---------cEEEEeecceEEEeecceEEEeehhh------------ Q lcl|NC_019921. 286 ANGVYVTA---------LPFNLNVIESTVQEAG---------KVLTYVKGLYDGYLAGGINVQKFKET------------ 335 (381) Q Consensus 286 ~~G~~~~~---------l~~G~pVv~s~~~p~~---------~i~fgd~~~y~i~~r~~i~i~~~~~~------------ 335 (381) .+|+|+|. ..+|+||+.+++||++ .++||||++|++++|++++|+++++. T Consensus 221 ~~g~~l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 300 (338) T protein:vir:78 221 ANGNVDPTRINLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQT 300 (338) T ss_pred CCCceeecccccCCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccc Confidence 99999874 2379999999999852 37899999999999999999999874 Q ss_pred --hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcc Q lcl|NC_019921. 336 --LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 336 --~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~ 372 (381) +|.+|+++||+.+|+|+++++++||++++- ..++.+ T Consensus 301 ~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~-~~~~~~ 338 (338) T protein:vir:78 301 VSMWQTNQIAILIEVTFGWLLGDKQAFVKFVD-DEDPDA 338 (338) T ss_pred hhhhhcCcEEEEEEEEeccEeecccceEEEec-ccCCCC Confidence 488999999999999999999999998663 223333 No 94 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=1.9e-51 Score=298.61 Aligned_cols=271 Identities=10% Similarity=-0.052 Sum_probs=211.6 Q ss_pred hccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEecCCcceEEeecccccccccCcceeeEeeccee Q lcl|NC_019921. 78 KNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNK 156 (381) Q Consensus 78 ~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~k 156 (381) -++.++||++||++++++|++.+++.|+|+++|++++++ +..++|+.++.+.+.|++|+++.+ +++++|+++++.+|| T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~~-~s~~~f~~v~l~~~k 79 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGKKT-HGGLSLEPVTIVPIK 79 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEeecCcccc-ccccceeeEEeeeEE Confidence 457778999999999999999999999999999999986 569999999999999999987765 688999999999999 Q ss_pred EEEeeeccHHhh---hcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEee-eccccccccccccccceeeeeeec Q lcl|NC_019921. 157 LTAFVVLPKDLN---DFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLN-RQVQKGVSVTEGAYPEKEEQGTLT 232 (381) Q Consensus 157 l~~~~~iS~ell---~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil-~~~~~~~~~~~~~~~~~~~~~~~t 232 (381) +++++++|+||| .|+.+++++||++++++++++++|.+|++|+|+..+.+.. ........... .... T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~---------~~~~ 150 (303) T protein:vir:97 80 VEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVT---------QVVK 150 (303) T ss_pred EEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccc---------cccc Confidence 999999999999 5788999999999999999999999999997653332211 10000000000 0001 Q ss_pred ccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc----------cCCCceeEe Q lcl|NC_019921. 233 FANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA----------LPFNLNVIE 302 (381) Q Consensus 233 ~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~----------l~~G~pVv~ 302 (381) .......++.+.+++..+. ..+..+..|+||+.++..++. +++.+|+|+|. ..+|+||+. T Consensus 151 ~~~~~~~~~~i~~~~~~~~-------~~~~~~~~~vmn~~~~~~L~~---lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~ 220 (303) T protein:vir:97 151 FTESEDADANIEAAVNLIQ-------GAEGVVTGLAMDTEFSTALAK---VTNGEMGPKMYPELAWGANPDSINGLKSSV 220 (303) T ss_pred cccccchHHHHHHHHHHHh-------hcCCCccEEEEcHHHHHHHHH---hhccCCCeEEecCccCCCCCceecceeeEE Confidence 1112234455555554432 234455679999999887764 47889988763 357999999 Q ss_pred cCCCCCC--------cEEEEeecc-eEEEeecceEEEeehhh--------hhhcCceEEEEEEEEcCEEecCceEEEEEE Q lcl|NC_019921. 303 STVQEAG--------KVLTYVKGL-YDGYLAGGINVQKFKET--------LALDDMDLYTAKQFAYGKAKDNKVAAVWKL 365 (381) Q Consensus 303 s~~~p~~--------~i~fgd~~~-y~i~~r~~i~i~~~~~~--------~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~ 365 (381) +++||.+ .++||||+. |.+++|++++++.+++. +|.+|+++||+.+|+|+++++++||++++ T Consensus 221 s~~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~- 299 (303) T protein:vir:97 221 NTTVGAGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVT- 299 (303) T ss_pred ecccCCccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEee- Confidence 9999853 278999965 89999999999988753 59999999999999999999999999643 Q ss_pred EecCC Q lcl|NC_019921. 366 DLKGH 370 (381) Q Consensus 366 ~~~~~ 370 (381) .++. T Consensus 300 -~~~~ 303 (303) T protein:vir:97 300 -KGEV 303 (303) T ss_pred -CCCC Confidence 3333 No 95 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=2.2e-51 Score=298.30 Aligned_cols=267 Identities=10% Similarity=-0.007 Sum_probs=210.1 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEecCCcceEEeecccccccccCcceeeEeecc Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQ 154 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~ 154 (381) |..+ ...+|++||++++.+|++.+++.|+++++|++++++ ++.++|+.++.+.+.|++|+++.+ +++++|+++++.+ T Consensus 1 ma~~-t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~-~s~~~f~~v~l~~ 78 (300) T protein:vir:95 1 MSEA-QLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDIDIVAENGKKT-HGGVSLDPVTIVP 78 (300) T ss_pred Cccc-ccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEeeCCcccc-cccccceeeEeee Confidence 4444 445678999999999999999999999999999975 568999999889999999987665 6889999999999 Q ss_pred eeEEEeeeccHHhh---hcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCC--cc---eEeeecccccccccccccccee Q lcl|NC_019921. 155 NKLTAFVVLPKDLN---DFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD--QP---IGLNRQVQKGVSVTEGAYPEKE 226 (381) Q Consensus 155 ~kl~~~~~iS~ell---~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~--~P---~Gil~~~~~~~~~~~~~~~~~~ 226 (381) ||++++++||+||| +|+.+++++||++++++++++++|.+|++|++.+ ++ .|+......... T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~---------- 148 (300) T protein:vir:95 79 LKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQ---------- 148 (300) T ss_pred EEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccce---------- Confidence 99999999999999 5788999999999999999999999999997543 33 333211111000 Q ss_pred eeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc---------cCCC Q lcl|NC_019921. 227 EQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFN 297 (381) Q Consensus 227 ~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~---------l~~G 297 (381) ... .......+.+.++...+. ..+..+++|+|||.++..++. +++.+|+|+|. ..+| T Consensus 149 ---~~~-~~~~~~~~~i~~~~~~~~-------~~~~~~~~~vmn~~~~~~L~~---lkd~~G~~i~~~~~~~~~~~~l~G 214 (300) T protein:vir:95 149 ---TVP-FKDTNPDESMEDAVGMID-------GSERDITGAILDPIFTTALSK---MKNAEGGKLYPELAWGGVPDAING 214 (300) T ss_pred ---eec-ccccchHHHHHHHHHHhh-------hcCCCccEEEECHHHHHHHHH---hhccCCCeeccCccccCCCceecc Confidence 000 111233455555555442 234556689999999887765 47899999873 2379 Q ss_pred ceeEecCCCCCCc------EEEEeecc-eEEEeecceEEEeehhh--------hhhcCceEEEEEEEEcCEEecCceEEE Q lcl|NC_019921. 298 LNVIESTVQEAGK------VLTYVKGL-YDGYLAGGINVQKFKET--------LALDDMDLYTAKQFAYGKAKDNKVAAV 362 (381) Q Consensus 298 ~pVv~s~~~p~~~------i~fgd~~~-y~i~~r~~i~i~~~~~~--------~~~~d~~~~r~~~r~dGk~~~~~Afvv 362 (381) +||+.++.||.+. ++||||++ |.++.|++++++++++. +|.+|+++||+.+|+|+++++++||+. T Consensus 215 ~Pv~~s~~v~~~~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~ 294 (300) T protein:vir:95 215 LAVDKNRTVSYSQTDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFAR 294 (300) T ss_pred eeeEEecCCCCCCCCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEE Confidence 9999999998643 67899998 55899999999988753 599999999999999999999999997 Q ss_pred EEEEecCC Q lcl|NC_019921. 363 WKLDLKGH 370 (381) Q Consensus 363 ~~~~~~~~ 370 (381) + +-++. T Consensus 295 l--~~~~g 300 (300) T protein:vir:95 295 I--VKTGG 300 (300) T ss_pred E--ecCCC Confidence 4 45455 No 96 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=5.5e-51 Score=296.12 Aligned_cols=268 Identities=9% Similarity=-0.043 Sum_probs=210.0 Q ss_pred cCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEecCCcceEEeecccccccccCcceeeEeecceeEE Q lcl|NC_019921. 80 VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLT 158 (381) Q Consensus 80 ~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~ 158 (381) -..+||++||++++++|++.+++.++++++|++++++ +..++|+.++.+.+.|++|+++.+ +++++|+++++.+||++ T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~-~~~~~f~~v~l~~~k~a 79 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKT-HGGVTLAPQTMVPIKVE 79 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEecCCcccc-ccccceeEEEEeeeeEE Confidence 3467789999999999999999999999999999975 569999999999999999987765 67899999999999999 Q ss_pred EeeeccHHhh---hcCHHHHHHHHHHHHHHHHHHHHhhheeeccC--CCcceEeeeccccccccccccccceeeeeeecc Q lcl|NC_019921. 159 AFVVLPKDLN---DFGPAWIERFVRVQIEEAFAVALETAFLKGTG--KDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTF 233 (381) Q Consensus 159 ~~~~iS~ell---~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G--~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~ 233 (381) +++++|+||| .|+.++|++||.+++++++++++|.+|++|+| +++|.++.............. . .. T Consensus 80 ~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~-----~----~~ 150 (298) T protein:vir:16 80 YGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKV-----E----AP 150 (298) T ss_pred EeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccccccccc-----c----cc Confidence 9999999999 46778999999999999999999999999964 556655432111111110000 0 00 Q ss_pred cccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc---------cCCCceeEecC Q lcl|NC_019921. 234 ANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVIEST 304 (381) Q Consensus 234 ~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~---------l~~G~pVv~s~ 304 (381) ......++.+.+++..+ ...+..++.|+||+.++..++. +++.+|+|+|. ..+|+||+.++ T Consensus 151 ~~~~~~~~~i~~~~~~~-------~~~~~~~~~~vmn~~~~~~l~~---lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~ 220 (298) T protein:vir:16 151 RGIADPNGAIENAVELL-------TGVDADVTGIAINPSFRSALAK---QKDLQDNALFPELKWGATPDTINGLPVDVNK 220 (298) T ss_pred cccccHHHHHHHHHHHh-------hhcCCCccEEEEcHHHHHHHHH---hhccCCCeeecCcccCCCCceecceeeEEec Confidence 11112233444444433 2234566789999999887765 47889999874 24799999999 Q ss_pred CCCCC------cEEEEeecc-eEEEeecceEEEeehh--------hhhhcCceEEEEEEEEcCEEecCceEEEEEEEe Q lcl|NC_019921. 305 VQEAG------KVLTYVKGL-YDGYLAGGINVQKFKE--------TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDL 367 (381) Q Consensus 305 ~~p~~------~i~fgd~~~-y~i~~r~~i~i~~~~~--------~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~ 367 (381) +||++ .++||||++ |.++.|++++++.+++ .+|.+|+++||+.+|+|+++++++||++++-.. T Consensus 221 ~v~~~~~~~~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 221 TVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ccccccCCCccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 99863 378999998 5689999999998875 268999999999999999999999999764222 No 97 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=1.3e-49 Score=288.68 Aligned_cols=265 Identities=10% Similarity=-0.036 Sum_probs=207.2 Q ss_pred cCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEecCCcceEEeecccccccccCcceeeEeecceeEE Q lcl|NC_019921. 80 VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLT 158 (381) Q Consensus 80 ~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~ 158 (381) -..+||++||+++.++|++.++++++|+++|++++++ +.+++|+.++.+.+.|++|+++.+ +++++|+++++.+||++ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~-~~~~~f~~v~l~~~k~~ 79 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKT-HGGVTLAPQTMVPIKVE 79 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCcccc-ccccceeEEEEeeeEEE Confidence 3347789999999999999999999999999999976 568999999999999999987765 68999999999999999 Q ss_pred EeeeccHHhhh---cCHHHHHHHHHHHHHHHHHHHHhhheeeccC--CCc---ceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 159 AFVVLPKDLND---FGPAWIERFVRVQIEEAFAVALETAFLKGTG--KDQ---PIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 159 ~~~~iS~ell~---ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G--~~~---P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++++||+|||. |+.+++++||.+++++++++++|.+|++|.+ +++ +.|+........... T Consensus 80 ~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~------------ 147 (298) T protein:vir:94 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKV------------ 147 (298) T ss_pred EeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccccccccc------------ Confidence 99999999995 6678999999999999999999999999943 222 233221111100000 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc---------cCCCceeE Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~---------l~~G~pVv 301 (381) ..........+.+.+++..+ ...+..++.|+||+.++..++++ ++.+|+|+|. ..+|+||+ T Consensus 148 ~~~~~~~~~~~~i~~~~~~~-------~~~~~~~~~~vmn~~~~~~l~~l---kd~~G~~l~~~~~~~~~~~tl~G~PV~ 217 (298) T protein:vir:94 148 EAPRGIADPNGAIENAVELL-------TGVDADVTGIAINPSFRSALAKQ---KDLQGNALFPELKWGATPDTINGLPVD 217 (298) T ss_pred ccccccccHHHHHHHHHHhh-------hhcCCCccEEEEcHHHHHHHHHh---hccCCCeeecCcccCCCCceecceeeE Confidence 00011112334455554443 23355667899999998887654 7889999874 24799999 Q ss_pred ecCCCCCC------cEEEEeecc-eEEEeecceEEEeehh--------hhhhcCceEEEEEEEEcCEEecCceEEEEEEE Q lcl|NC_019921. 302 ESTVQEAG------KVLTYVKGL-YDGYLAGGINVQKFKE--------TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) Q Consensus 302 ~s~~~p~~------~i~fgd~~~-y~i~~r~~i~i~~~~~--------~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~ 366 (381) .+++||++ .++||||++ |.++.|++++++.+++ .+|.+|+++||+.+|+|+++.+++||++++-. T Consensus 218 ~~~~v~~~~~~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~ 297 (298) T protein:vir:94 218 VNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) T ss_pred EecccccccCCCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEec Confidence 99999863 378999999 6689999999988764 26899999999999999999999999975422 Q ss_pred e Q lcl|NC_019921. 367 L 367 (381) Q Consensus 367 ~ 367 (381) . T Consensus 298 t 298 (298) T protein:vir:94 298 N 298 (298) T ss_pred C Confidence 2 No 98 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=1.1e-49 Score=289.01 Aligned_cols=271 Identities=11% Similarity=-0.024 Sum_probs=206.3 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEecCCcceEEeecccccccccCcceeeEeecc Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQ 154 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~ 154 (381) |. +.+++||++||++++++|++.+++.++|+++|++++++ +..++|+.++.+.+.|++|+++++ +++++|+++++.+ T Consensus 1 Ma-t~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~-~~~~~f~~v~l~~ 78 (311) T protein:vir:99 1 MA-TFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQKS-STTGEFDFVTSTP 78 (311) T ss_pred Cc-eecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCcccc-cccceeeEEEEee Confidence 33 44568899999999999999999999999999999986 569999999999999999988776 6789999999999 Q ss_pred eeEEEeeeccHHhh---hcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEee---eccccccccccccccceeee Q lcl|NC_019921. 155 NKLTAFVVLPKDLN---DFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLN---RQVQKGVSVTEGAYPEKEEQ 228 (381) Q Consensus 155 ~kl~~~~~iS~ell---~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil---~~~~~~~~~~~~~~~~~~~~ 228 (381) ||++++++||+||| +|+.++|++||++++++++++++|.+|++|+|++++.++. ..+...... T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~----------- 147 (311) T protein:vir:99 79 KKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKR----------- 147 (311) T ss_pred EEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccce----------- Confidence 99999999999999 5788999999999999999999999999999987665543 211111100 Q ss_pred eeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc---------cCCCce Q lcl|NC_019921. 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA---------LPFNLN 299 (381) Q Consensus 229 ~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~---------l~~G~p 299 (381) .+.+..........+.+++..+... ...+..++ |+||+.++..+.. +++.+|+|+|. ..+|+| T Consensus 148 ~~~~~~~~~~~~~~i~~~~~~~~~~----~~~~~~~~-~vmn~~~~~~L~~---lkd~~G~~l~~~~~~~~~~~~l~G~P 219 (311) T protein:vir:99 148 VELTADTIANPDLAIEAAVGLLVAN----GHPTPVNG-LALHPSIAWGLST---ARYTDGRKKFPELGLGIGVSSFEGID 219 (311) T ss_pred eeccccccchhHHHHHHHHHHHhhh----ccCCCccE-EEEcHHHHHHHHh---hhccCCCeeecCcccCCCCceeccee Confidence 0011111111112222233222111 12233333 9999999887765 47899999984 247999 Q ss_pred eEecCCCCCC----------------cEEEEeecc-eEEEeecceEEEeehhh-------hhhcCceEEEEEEEEcCEEe Q lcl|NC_019921. 300 VIESTVQEAG----------------KVLTYVKGL-YDGYLAGGINVQKFKET-------LALDDMDLYTAKQFAYGKAK 355 (381) Q Consensus 300 Vv~s~~~p~~----------------~i~fgd~~~-y~i~~r~~i~i~~~~~~-------~~~~d~~~~r~~~r~dGk~~ 355 (381) |+.++.+|.+ .+++|||++ +.+.++.+++++.+++. .|.+|+++||+.+|+|++++ T Consensus 220 v~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~ 299 (311) T protein:vir:99 220 ASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVF 299 (311) T ss_pred eEeecccccccccccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceec Confidence 9999988732 257899998 66899999999987752 48999999999999999988 Q ss_pred cCceEEEEEEEec Q lcl|NC_019921. 356 DNKVAAVWKLDLK 368 (381) Q Consensus 356 ~~~Afvv~~~~~~ 368 (381) ++ |||+++-..+ T Consensus 300 ~~-~~v~~~~~~A 311 (311) T protein:vir:99 300 TD-RFVVIENAVA 311 (311) T ss_pred Ch-hHeeeecccC Confidence 75 6776554443 No 99 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=3.7e-47 Score=275.15 Aligned_cols=284 Identities=11% Similarity=0.006 Sum_probs=209.5 Q ss_pred HHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc--eEEEEec----C Q lcl|NC_019921. 53 ERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR--LKFLKSE----T 126 (381) Q Consensus 53 ~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~--~~~p~~~----~ 126 (381) .-.+..-++++. .+..+++. .++.+|||++|++. +++++.+.+.||+|++|++++..+. ..++... . T Consensus 1 ~~~~~~~~~~~~-----~~~~k~~t-~~d~~Gg~l~P~~~-~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~ 73 (315) T protein:vir:41 1 MLTIEDIRGGKP-----FEIVPKID-VPDLGRGVLSVDRF-GEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDV 73 (315) T ss_pred CcccchhhcCCh-----hhhhhhcC-CcCCCCceechHHH-HHHHHHHHhhhhhhhhceeeeccccccccccccccCccc Confidence 001111111111 11223333 34567999999886 5688999999999999998764332 3333221 1 Q ss_pred CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHH--HHHHHHHHHHHHHHHHHHhhheeeccCC--- Q lcl|NC_019921. 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPA--WIERFVRVQIEEAFAVALETAFLKGTGK--- 201 (381) Q Consensus 127 ~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~--~~e~~l~~~la~~~~~~~~~a~i~G~G~--- 201 (381) .+...|.++.++ .++++|+|++++|.+|++++.++||+++|+|+.+ |+|+||.++++++|++.++.+|++|||+ T Consensus 74 ~~g~~~~~~~~~-~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~ 152 (315) T protein:vir:41 74 GPGRDETGQKLA-PPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSD 152 (315) T ss_pred ccccccccCcCC-CCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcC Confidence 123457777554 4568899999999999999999999999999975 9999999999999999999999999995 Q ss_pred ---CcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc---CceEEEEchhhHH Q lcl|NC_019921. 202 ---DQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK---GNVTMVVNPSDAF 275 (381) Q Consensus 202 ---~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~---~~a~~~mn~~t~~ 275 (381) ++|.|+|+.......... .+........+.+.++++.+ +..|+ +|++|+||+.|+. T Consensus 153 p~~~~~~G~l~~a~~~~~~~~-----------~~~~a~~~~~d~l~~l~~sl-------~~~yr~~~~~~~~imn~~t~~ 214 (315) T protein:vir:41 153 PLLRMSDGWLKLASEKLTESD-----------VDPEAEDWPMNLFDTMIESL-------PTPYRNNLPNMKFYVTWDIYR 214 (315) T ss_pred ccccccccceecccccccccc-----------cccccccccHHHHHHHHHhc-------ChHHhhcCCceEEEEcHHHHH Confidence 477899986543221111 11111223456677777655 45565 5789999999998 Q ss_pred HHhhhhhccCCCCcccccc---------CCCceeEecCCCC-----CCcEEEEeecceEEEeecceEEEeehhhhhhcCc Q lcl|NC_019921. 276 EVQAQYTHLNANGVYVTAL---------PFNLNVIESTVQE-----AGKVLTYVKGLYDGYLAGGINVQKFKETLALDDM 341 (381) Q Consensus 276 ~~~~~~~~~~~~G~~~~~l---------~~G~pVv~s~~~p-----~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~ 341 (381) .++++ ++++|+|+|.. .+|+||+.+++|| ++.|+||||++|+++++.+++++++.+ +.++. T Consensus 215 ~~rkl---k~~~g~~lw~~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~--a~~~~ 289 (315) T protein:vir:41 215 AYRDA---LKGRETGLGDQALTGANSILYDGRPVQYVPALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYD--AEMRL 289 (315) T ss_pred HHHHH---hccCCCccccchhhcCCCceecccceEecccccccCCCCccEEEecccceEEEeccccEEEeeec--CCCCc Confidence 87764 67889998852 3699999999996 466999999999999999999988765 45678 Q ss_pred eEEEEEEEEcCEEecCceEEEEEEEe Q lcl|NC_019921. 342 DLYTAKQFAYGKAKDNKVAAVWKLDL 367 (381) Q Consensus 342 ~~~r~~~r~dGk~~~~~Afvv~~~~~ 367 (381) +.|....|+||+.++.++.|+..+++ T Consensus 290 ~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 290 TKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred eEEEEEEEeceeEEeccceeEeeeeC Confidence 89999999999999999988888888 No 100 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=3.7e-46 Score=269.66 Aligned_cols=279 Identities=11% Similarity=0.001 Sum_probs=210.6 Q ss_pred CHHHHHHHHHHh--hccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec--CCceEEEEecCC----cceEEeeccc Q lcl|NC_019921. 66 SANQRSFFMDIN--KNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA--GLRLKFLKSETS----GVAVWGKIYG 137 (381) Q Consensus 66 t~~e~~~~~~~~--~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~--~g~~~~p~~~~~----~~a~wv~e~~ 137 (381) -.+.++.++... ...+.+||||+|+++ +++++.+.+.+|+|++++++++ +....+|+-..+ +...|.++.. T Consensus 1 ~~~~~~~~~~~k~it~~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~~ 79 (314) T protein:vir:41 1 MDFLNKPFQITPKIDVPDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTKV 79 (314) T ss_pred CchhhhHHHhhcccccccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCccceeecccccCcccccccccccCCc Confidence 223344444322 224567999999997 5799999999999999999865 335677764322 2345666644 Q ss_pred ccccccCcceeeEeecceeEEEeeeccHHhhhcCHH--HHHHHHHHHHHHHHHHHHhhheeeccCC--------CcceEe Q lcl|NC_019921. 138 EIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPA--WIERFVRVQIEEAFAVALETAFLKGTGK--------DQPIGL 207 (381) Q Consensus 138 ~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~--~~e~~l~~~la~~~~~~~~~a~i~G~G~--------~~P~Gi 207 (381) +. ++++|+|++++|.+||+...++||+++|+|+.+ |||+||...++++|++.++.+|++|||+ ++|.|| T Consensus 80 ~~-~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~ 158 (314) T protein:vir:41 80 AP-TADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGW 158 (314) T ss_pred cC-CcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhh Confidence 43 468999999999999999999999999999986 9999999999999999999999999995 378999 Q ss_pred eeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc---CceEEEEchhhHHHHhhhhhcc Q lcl|NC_019921. 208 NRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK---GNVTMVVNPSDAFEVQAQYTHL 284 (381) Q Consensus 208 l~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~---~~a~~~mn~~t~~~~~~~~~~~ 284 (381) ++........ .+........+.+.++++.+ +..|+ ++++|+||+.|+..+++. + T Consensus 159 l~~a~~~~~~-------------~~~~~~~~~~~~~~~l~~sl-------~~~yr~~~~~~~~~m~~~t~~~~r~~---l 215 (314) T protein:vir:41 159 MKLAGNQYTD-------------AEPEDENWPLNLFDGMMDEL-------DTRYLQLKPRMKFYVSNEIYNGYRKQ---L 215 (314) T ss_pred hhhcccceee-------------cCccccccHHHHHHHHHHhc-------CchhhcCCCceEEEecHHHHHHHHHH---H Confidence 9754322111 11122234456666776655 55565 477899999998887764 3 Q ss_pred CCCCccccc---------cCCCceeEecCCCC-----CCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEE Q lcl|NC_019921. 285 NANGVYVTA---------LPFNLNVIESTVQE-----AGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFA 350 (381) Q Consensus 285 ~~~G~~~~~---------l~~G~pVv~s~~~p-----~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~ 350 (381) +.+|.++|. ..+|+||+.+++|| ++.|+||||++|+++++..+++.+ +.++.++++.|.+..|+ T Consensus 216 ~~~~~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~~--~~~a~~~~~~~~~~~r~ 293 (314) T protein:vir:41 216 LVRETGLGDSALIGATGLQYDGIPIQYVPALDALGDDKARALLTVPTNLVYGFWRNIRIEP--KRDAAMRRTEYIASLRA 293 (314) T ss_pred hccCCcccchhhhCCCCceecceeeEecccccccCCCCceEEEechhheEEEeeceeEEee--cccCcCCeEEEEEEEEe Confidence 344555442 23699999999885 467999999999988888777765 45578999999999999 Q ss_pred cCEEecCceEEEEEEEecCCc Q lcl|NC_019921. 351 YGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 351 dGk~~~~~Afvv~~~~~~~~~ 371 (381) |+...+.+|.|+..+..+... T Consensus 294 d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 294 DCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred ceEEEEcCcEEEEEeeccCCC Confidence 999999999998888877666 No 101 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=7.8e-41 Score=240.47 Aligned_cols=296 Identities=11% Similarity=0.077 Sum_probs=209.2 Q ss_pred HhhccccccCHHHHHHHH-HHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceEEee Q lcl|NC_019921. 57 SLPKSAQSLSANQRSFFM-DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGK 134 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~-~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~~~~a~wv~ 134 (381) +..+ .+...-++... ......+.++|++||+++.++|++.+.+.++++++++++++.. ...+|....++...|++ T Consensus 1 ~~~k---~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~~~~~ 77 (321) T protein:vir:31 1 MASR---TINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERHRRPQ 77 (321) T ss_pred CchH---HHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCcccccc Confidence 0000 00000000100 0112245678999999999999999999999999999999754 56778766666667776 Q ss_pred c-ccccccccCcceeeEeecceeEEEeeeccHHhhhcCH--HHHHHHHHHHHHHHHHHHHhhheeeccCCCcc------e Q lcl|NC_019921. 135 I-YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP--AWIERFVRVQIEEAFAVALETAFLKGTGKDQP------I 205 (381) Q Consensus 135 e-~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~--~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P------~ 205 (381) + +....+.++|+|+++++..|++.+.++||+++|+|+. +|+++||.+.++++|++.++.++++|+|..+| . T Consensus 78 ~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~ 157 (321) T protein:vir:31 78 DEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQND 157 (321) T ss_pred cccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccch Confidence 4 3333446789999999999999999999999999985 59999999999999999999999999998765 5 Q ss_pred EeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc--CceEEEEchhhHHHHhhhhhc Q lcl|NC_019921. 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 206 Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~--~~a~~~mn~~t~~~~~~~~~~ 283 (381) |+++.+...... .+........+.+.++.+.+ +..|+ ++++|+||+.++..++... T Consensus 158 G~l~~a~~~~~~-------------~~~~~~~~~~d~l~~l~~~l-------~~~yr~~~~~v~im~~~~~~~~~~~l-- 215 (321) T protein:vir:31 158 GFITVAEGDVET-------------IDAADDILDNDLVIRTIAGL-------DSKYRARMNPALIVSEDQLLSYHYTL-- 215 (321) T ss_pred hhhhhhcccccc-------------ccccccccCHHHHHHHHHhc-------cHhHhcCCCeEEEechHHHHHHHHHH-- Confidence 777643322111 11112223456666666554 44555 5789999999987665432 Q ss_pred cCCCCccccc---------cCCCceeEecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhc-CceEEEE--EEEEc Q lcl|NC_019921. 284 LNANGVYVTA---------LPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALD-DMDLYTA--KQFAY 351 (381) Q Consensus 284 ~~~~G~~~~~---------l~~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~-d~~~~r~--~~r~d 351 (381) .+.++ ++|. .++|+||+.+++||++.|+|+||+++.++.+++++++++.+..... ....|+. ..++| T Consensus 216 ~~~~~-~~~~~~l~~~~~~tl~G~pvv~~~~mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 294 (321) T protein:vir:31 216 TDRDT-PLGDNVIMGEADVNPFSFPIIGSGLWPDDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDD 294 (321) T ss_pred hcCCC-ccccchhhccccccccceeEEEcCCCCCCcEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeecc Confidence 22222 2221 2479999999999999999999999999999999998877755433 3345554 44678 Q ss_pred CEEecCceEEEEEEEecCCccccccCcc Q lcl|NC_019921. 352 GKAKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 352 Gk~~~~~Afvv~~~~~~~~~~~~~~~~~ 379 (381) +.+-+.+|+++++ .|..+...++.+|- T Consensus 295 ~~ve~~~a~a~~~-~i~~~~~~~~~~~~ 321 (321) T protein:vir:31 295 FAIENTEAVVLAE-GLGDPLEHLEEETS 321 (321) T ss_pred eeEeccccEEEEe-cCCcchhcccCCCC Confidence 8888888988877 66665555555555 No 102 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=8.4e-37 Score=218.37 Aligned_cols=340 Identities=12% Similarity=0.065 Sum_probs=198.5 Q ss_pred CchhHHHHHH-HHHHHHHHHHhh----hhhHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHhhccccc--cCH- Q lcl|NC_019921. 1 MTINLSETFA-NAKNEFINAVNN----GEPQERQNELYGDMINQLFEETKLQ-----AKAEAERVSSLPKSAQS--LSA- 67 (381) Q Consensus 1 mt~el~~~~~-~~~~~~~~~~k~----~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~--lt~- 67 (381) +.++-..+.+ ...++..+.+++ .++.++..+.+..+.+.+.+...+. .+.+..+......+... ... T Consensus 131 ~~vke~~~~e~~~~~~~~a~~ee~~e~~~k~~el~a~l~~~~~~~~~~~~e~~~~l~a~~~~~~~~~~~~~~~~~~~~~~ 210 (517) T protein:vir:97 131 TYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPE 210 (517) T ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHhhhhcccccccccch Confidence 2211111000 000111111111 0111111111222111111110000 00000000000000000 000 Q ss_pred --H-----HHHHHHHHh-------------hccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCC Q lcl|NC_019921. 68 --N-----QRSFFMDIN-------------KNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETS 127 (381) Q Consensus 68 --~-----e~~~~~~~~-------------~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~~~~p~~~~~ 127 (381) + +........ .....-||+++|..+...|...+...+++++++++.+++ ...+|..... T Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~-~~~~~~~~~~ 289 (517) T protein:vir:97 211 ATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP-TLVVGGDNAL 289 (517) T ss_pred hhHHHHHHHHHHHHHHhcccccccceeeeecccccccccccchHHHHHHHHhhhhhccceeeeeecccc-ceeeeccccc Confidence 0 000000000 012234789999999999999999999999988876653 4567777777 Q ss_pred cceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHH----HHHHHHHHHHHHHHHHHhhheeeccCCC- Q lcl|NC_019921. 128 GVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAW----IERFVRVQIEEAFAVALETAFLKGTGKD- 202 (381) Q Consensus 128 ~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~----~e~~l~~~la~~~~~~~~~a~i~G~G~~- 202 (381) ..+.|+.|++. +++++++|+++++.+|++++++++|++||+||.+| |++||.++|+++|+++++.+||+|+|++ T Consensus 290 ~~a~~~~eG~~-kp~s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~ 368 (517) T protein:vir:97 290 TQGTGHTTGTD-KTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGV 368 (517) T ss_pred ceeeeeecCCc-ccccccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCc Confidence 77889998654 56789999999999999999999999999999998 9999999999999999999999999987 Q ss_pred cceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhh Q lcl|NC_019921. 203 QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT 282 (381) Q Consensus 203 ~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~ 282 (381) .+.|++........ .+........+.+..+...+ .. ..+++|+|||.|+..+++ T Consensus 369 ~~~gi~~~a~~~~~--------------~~~~~~~~~~d~i~~l~~a~--------~~-a~~a~~vmn~~t~~~I~k--- 422 (517) T protein:vir:97 369 SETQIYPVVGDAWA--------------TNVTGTTNIQELLEKLSVAT--------PK-AADSTLVIHRNDLAAIRF--- 422 (517) T ss_pred cccccccccccccc--------------ccccccchHHHHHHHHHHHh--------hh-ccCCEEEECHHHHHHHHH--- Confidence 45677643211100 01111122223222221111 11 136789999999888875 Q ss_pred ccCCCCccccccC---------CCceeEecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCE Q lcl|NC_019921. 283 HLNANGVYVTALP---------FNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGK 353 (381) Q Consensus 283 ~~~~~G~~~~~l~---------~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk 353 (381) +||.+|+|+|+.. ||..-+ -+.++.+....+.++.|+++++.|+.+.++-+ ...+++.|+..+|++|. T Consensus 423 lKD~~G~Yl~~~~~~~~~~~~l~G~~~~-~~~~~~~~~~~~~~~~y~i~~~~g~~~~~~fd--~~~n~~~f~~~~~~~g~ 499 (517) T protein:vir:97 423 LKDKNGNYVFPVGVSNQTIATHFGFNRL-VQSVAVDEKTAVSLSGYVTNGSRGMEFEQGTI--LVENNKEYLFEMPISGS 499 (517) T ss_pred hhcCCCCeeccCcCCcccccccCCcccc-ccccccCceeEeeccccEEEeecceeeeeeee--cccCceeEeeeeeeccc Confidence 4899999999643 231101 12334455566677889999999887665433 34688999999999999 Q ss_pred EecCceEEEEEEEecCCcccccc Q lcl|NC_019921. 354 AKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 354 ~~~~~Afvv~~~~~~~~~~~~~~ 376 (381) +..+++|++..++ |.+-| T Consensus 500 i~~~~r~a~~~~~-----p~~~~ 517 (517) T protein:vir:97 500 LEYKGTTAYGTYT-----PPVAG 517 (517) T ss_pred cccccceEEEEEc-----CCCCC Confidence 9999999974433 33333 No 103 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=100.00 E-value=1.7e-33 Score=200.29 Aligned_cols=320 Identities=11% Similarity=0.043 Sum_probs=171.9 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHhhccccccCHHHHHH Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQ--------AKAEAERVSSLPKSAQSLSANQRSF 72 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~lt~~e~~~ 72 (381) |..+.. ++...+..+..+.....+++...+.+..+......... ......+.... ..+ ...+..+ T Consensus 131 ~~~~e~---~e~~~e~~e~~~~~~el~akl~el~k~~ee~k~~~~~~~~~~~~~~~~~~e~r~~~~--~~~--~~~e~~~ 203 (480) T protein:vir:40 131 MGANET---QEIMKQAIEAGVKVRELEAKVEELNKEREELKKEREASIPSEKPEDAERKFMRELGS--KMA--EMPEQGF 203 (480) T ss_pred hhhHHH---HHHHHhhhhhhhhhhhHHHHHHHHHhHHHHHhhhhhhhccccchhhhhhHHHHHHHH--Hhc--cchhhhh Confidence 111111 11111111111110000000000000000000000000 00000000000 000 0011112 Q ss_pred HHHH----hhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceEEeecccccccc-cCcce Q lcl|NC_019921. 73 FMDI----NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQ-LDAAF 147 (381) Q Consensus 73 ~~~~----~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~~~~p~~~~~~~a~wv~e~~~~~~~-~~~~f 147 (381) +..+ ..+...++|+ +|+.+...+.......+|+...+.+...++ ....|++|..+...+ ....+ T Consensus 204 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~----------~~~~~~~e~~~~~~~~~~~~~ 272 (480) T protein:vir:40 204 LREFANGADLNVVNSLGS-ITSKYARKSGIYDGAMKARFQGLTLAEDGV----------DDTFISGTFKAGTDKNKSQTA 272 (480) T ss_pred hhhhhhhccccccccccc-cccchhhheeechhhhhhhhhcceeeeccc----------cceeeeeeeeccccccccccc Confidence 2111 1122233444 555566655555566666666655444332 235566664433222 12234 Q ss_pred eeEeec---ceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeecc--CCCcceEeeeccccccccccccc Q lcl|NC_019921. 148 SEETAI---QNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGT--GKDQPIGLNRQVQKGVSVTEGAY 222 (381) Q Consensus 148 ~~v~l~---~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~--G~~~P~Gil~~~~~~~~~~~~~~ 222 (381) ....+. +|++++++++|+++|+|+. +|++||.++|+++|+++++.+||+|+ |+++|.||.+..... + T Consensus 273 ~~~~~~~~~v~~l~~~~k~t~~lLDDa~-~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~~~~~---~---- 344 (480) T protein:vir:40 273 TKRSLRPQMAEAYLQMDKATVRGVNDSG-ALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGLKTATDGW---T---- 344 (480) T ss_pred ccchhhHHHHHHHHHhHHHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceeecccc---c---- Confidence 444444 5899999999999999987 89999999999999999999999995 456788886431100 0 Q ss_pred cceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCce-EEEEchhhHHHHhhhhhccCCCCcccccc------- Q lcl|NC_019921. 223 PEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNV-TMVVNPSDAFEVQAQYTHLNANGVYVTAL------- 294 (381) Q Consensus 223 ~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a-~~~mn~~t~~~~~~~~~~~~~~G~~~~~l------- 294 (381) . ..+..+.+..|++.+ +.+|+.++ .|+|||.|+..++++ ||.+|+|+|+. T Consensus 345 -----~-------~~~~~d~id~L~~al-------~~~y~~~a~~~vmn~~t~~~I~kl---KD~~G~Yi~q~~~~~~~~ 402 (480) T protein:vir:40 345 -----K-------QIEYTDLFEGITDAV-------AECSISDAITIVMSPQTFAELRKA---KGTDGHSRFNELATKEQI 402 (480) T ss_pred -----c-------cchhHHHHHHHHHhh-------hHHhhCCCCEEEECHHHHHHHHHh---hcCCCCeeccCcccccCc Confidence 0 011123344454443 45677777 699999999888764 89999999964 Q ss_pred --CCCceeEec-CCCCCCcEEEEeec-ceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCC Q lcl|NC_019921. 295 --PFNLNVIES-TVQEAGKVLTYVKG-LYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 295 --~~G~pVv~s-~~~p~~~i~fgd~~-~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~ 370 (381) +||+||+++ ..||++....|.++ +|.++||+ ++.. +..-+..++.-|....|++|.+..|+||.+++.+-. T Consensus 403 ~~llG~pvv~~~~~~~~~~~~~~~~~~~~~~~d~~-~~~~--~~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~-- 477 (480) T protein:vir:40 403 AQSFGAVNLETRVWMPKDEVAVYNHDEYVLIGDLN-VENY--NDFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGS-- 477 (480) T ss_pred ceecccceeeeeccccCCcceeeeCCccEEEEecc-ccee--cccccccchhhhhhhhhhceeeEccccEEEEEeccC-- Confidence 279998765 56777765555554 46788875 4433 333344567789999999999999999999665542 Q ss_pred ccc Q lcl|NC_019921. 371 KPA 373 (381) Q Consensus 371 ~~~ 373 (381) =.. T Consensus 478 ~~~ 480 (480) T protein:vir:40 478 LGV 480 (480) T ss_pred cCC Confidence 222 No 104 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.94 E-value=9.8e-29 Score=174.14 Aligned_cols=258 Identities=16% Similarity=0.082 Sum_probs=196.1 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe----cCCc-eEEEEecCCcceEEeecccccccccCcceeeE Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN----AGLR-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~----~~g~-~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v 150 (381) |...+...+..++|+.+++.|.+.+.+...+.+++.+.. .+|. +++|+....+.+.|++|+++++ .++++|+++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~-~~~~~~~~~ 79 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIP-MTQLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCccc-ccccccceE Confidence 443344556689999999999999999988888876532 2343 8999988888999999988775 578999999 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++.+|+++..+++|+++..++..|+.+++.+.+++.+++.+|..++..- +... .. T Consensus 80 ~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~---------~~a~--~~-------------- 134 (272) T protein:vir:98 80 TMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDAL---------SKST--QT-------------- 134 (272) T ss_pred EEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHh---------cccc--cc-------------- Confidence 9999999999999999999999999999999999999999999987531 1000 00 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhc--cC--CCCc--ccc---ccCCCceeE Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH--LN--ANGV--YVT---ALPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~--~~--~~G~--~~~---~l~~G~pVv 301 (381) .....+++.+.++...+. ..+....+|+|||.++..+++.... .+ ..|. ... +...|+||+ T Consensus 135 ---~~~~~t~d~i~da~~~l~-------~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi 204 (272) T protein:vir:98 135 ---VEATATVDGVSKALDIFN-------DEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIV 204 (272) T ss_pred ---cccccCHHHHHHHHHHHh-------ccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEE Confidence 011123455555544442 1123345799999998887653211 11 1111 111 234699999 Q ss_pred ecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCc Q lcl|NC_019921. 302 ESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 302 ~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~ 371 (381) .|++||++++++.+...+.+..+++++++.+++. ..+.+.+++..|++.++++++++|+++++-++-+ T Consensus 205 ~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~~--~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 205 RSRKCPKGTAYMVRKGALRIMLKRNTMVETDRDI--TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred EcCCCCcceEEEEcCCeEEEEecCCceeeecccc--ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 9999999999888888888899999999887764 5688999999999999999999999888876666 No 105 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.94 E-value=9.8e-29 Score=174.14 Aligned_cols=258 Identities=16% Similarity=0.082 Sum_probs=196.1 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe----cCCc-eEEEEecCCcceEEeecccccccccCcceeeE Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN----AGLR-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~----~~g~-~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v 150 (381) |...+...+..++|+.+++.|.+.+.+...+.+++.+.. .+|. +++|+....+.+.|++|+++++ .++++|+++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~-~~~~~~~~~ 79 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIP-MTQLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCccc-ccccccceE Confidence 443344556689999999999999999988888876532 2343 8999988888999999988775 578999999 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++.+|+++..+++|+++..++..|+.+++.+.+++.+++.+|..++..- +... .. T Consensus 80 ~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~---------~~a~--~~-------------- 134 (272) T protein:vir:30 80 TMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDAL---------SKST--QT-------------- 134 (272) T ss_pred EEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHh---------cccc--cc-------------- Confidence 9999999999999999999999999999999999999999999987531 1000 00 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhc--cC--CCCc--ccc---ccCCCceeE Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH--LN--ANGV--YVT---ALPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~--~~--~~G~--~~~---~l~~G~pVv 301 (381) .....+++.+.++...+. ..+....+|+|||.++..+++.... .+ ..|. ... +...|+||+ T Consensus 135 ---~~~~~t~d~i~da~~~l~-------~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi 204 (272) T protein:vir:30 135 ---VEATATVDGVSKALDIFN-------DEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIV 204 (272) T ss_pred ---cccccCHHHHHHHHHHHh-------ccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEE Confidence 011123455555544442 1123345799999998887653211 11 1111 111 234699999 Q ss_pred ecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCc Q lcl|NC_019921. 302 ESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 302 ~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~ 371 (381) .|++||++++++.+...+.+..+++++++.+++. ..+.+.+++..|++.++++++++|+++++-++-+ T Consensus 205 ~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~~--~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 205 RSRKCPKGTAYMVRKGALRIMLKRNTMVETDRDI--TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred EcCCCCcceEEEEcCCeEEEEecCCceeeecccc--ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 9999999999888888888899999999887764 5688999999999999999999999888876666 No 106 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.73 E-value=3.2e-19 Score=121.98 Aligned_cols=288 Identities=11% Similarity=0.047 Sum_probs=191.0 Q ss_pred cccccCHHHHHHHH---------HHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec-CCceEEEEecCCcce Q lcl|NC_019921. 61 SAQSLSANQRSFFM---------DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLRLKFLKSETSGVA 130 (381) Q Consensus 61 ~~~~lt~~e~~~~~---------~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~-~g~~~~p~~~~~~~a 130 (381) ..+.-++.-+-.+. +|.+-+-...+.+.|.+....|+|.+.+.++|++.+.+..+ ++...+++....+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~p~l~m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~~lp~a 80 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTHQFPELKMPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNRENVLGDV 80 (330) T ss_pred CceecCCccccceeehhccccccchhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceeeeeecCCcc Confidence 11111111110000 12223445567889999999999999999999999988776 455788888889999 Q ss_pred EEeecccccccccCcceeeEeecceeEEEeeeccHHhh--hcCHHHHHHHHHHHHHHHHHHHHhhheeeccCC-CcceEe Q lcl|NC_019921. 131 VWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLN--DFGPAWIERFVRVQIEEAFAVALETAFLKGTGK-DQPIGL 207 (381) Q Consensus 131 ~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell--~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~-~~P~Gi 207 (381) .|...++..+++...+|.+++...+.+.+.+.|...+. ..+..|...+-.....+++++..+..|||||++ +++.|| T Consensus 81 ~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~~~F~GL 160 (330) T protein:vir:94 81 QFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTGNSFQGM 160 (330) T ss_pred eeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccch Confidence 99998887766555689999999999999999999994 557789999999999999999999999999976 577799 Q ss_pred eeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhh----- Q lcl|NC_019921. 208 NRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT----- 282 (381) Q Consensus 208 l~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~----- 282 (381) ++.+........+. + + ...+.+.+..|+.... ..-....+|+||+.+...+.+..- T Consensus 161 ~~~~~~~q~i~tg~--------~-g---g~~T~d~LDeLl~~v~-------~~~g~~~~~l~n~a~~r~I~a~~R~~~~~ 221 (330) T protein:vir:94 161 MGLVAASQTISAGA--------N-G---GTLTFELLDQLLDLVK-------DKDGQVDYLMSSFAMRRKYFSLLRALGGA 221 (330) T ss_pred hhcCCcccEEecCC--------C-C---CCCCHHHHHHHHHHhc-------CCCCCCcEEEechhHHHHHHHHHHhccCC Confidence 87664433221111 0 1 1122333433332210 001124578999998777765422 Q ss_pred -----ccCCCCccccccCCCceeEecCCCCCC----------cEEE---Eee--cceEEEee----cceEEEeehhhhhh Q lcl|NC_019921. 283 -----HLNANGVYVTALPFNLNVIESTVQEAG----------KVLT---YVK--GLYDGYLA----GGINVQKFKETLAL 338 (381) Q Consensus 283 -----~~~~~G~~~~~l~~G~pVv~s~~~p~~----------~i~f---gd~--~~y~i~~r----~~i~i~~~~~~~~~ 338 (381) ..+..|.++..- .|+||+.++.+|.+ .|++ |+- .+.+++.. .|++++.-- .--. T Consensus 222 ~v~~~~~~~~G~~v~~~-~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G-~~~~ 299 (330) T protein:vir:94 222 AIGEVMTLPSGRQIPTY-RGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVG-AKEN 299 (330) T ss_pred CCCCcccccCCCEEeee-CCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCC-Cccc Confidence 122334443221 38899999999863 2444 432 34566663 366664411 1123 Q ss_pred cCceEEEEEEEEcCEEecCceEEEEEEEecCCc Q lcl|NC_019921. 339 DDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 339 ~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~ 371 (381) ++..-|+..++++..+..++|+.+++ ..... T Consensus 300 k~v~~~~v~~y~~~av~~~~a~~~L~--~V~~g 330 (330) T protein:vir:94 300 ADETITRVKMYCGFANFSQLGLAAIK--GLIPG 330 (330) T ss_pred cceeeEEEEEeeeeEEechhheeeec--cccCC Confidence 46678999999999999999988754 33333 No 107 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.69 E-value=4.5e-18 Score=115.66 Aligned_cols=260 Identities=13% Similarity=0.042 Sum_probs=184.5 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEecCCcceEEeecccccccccCcceeeE Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~----~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v 150 (381) |..+...-+-.++|+.+.+.+.+.+.....+.+++.+... +| .+++|+....+.+.|..|+++++ ..+.++++. T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~-~~~it~~~~ 79 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCccc-cccccccee Confidence 3333334455789999999999999888777777766432 24 58899987777888999988776 457889999 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++..++.+..+.++.+...++..|+.+.+.+.+++++++.++..++..-.+.... + . T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~-----------~-~----------- 136 (274) T protein:vir:93 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT-----------V-N----------- 136 (274) T ss_pred EEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc-----------c-c----------- Confidence 9999999988999999999999999999999999999999999887532111100 0 0 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh---hccCC-CCccc-----cccCCCceeE Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY---THLNA-NGVYV-----TALPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~---~~~~~-~G~~~-----~~l~~G~pVv 301 (381) .....++.+.+....+.. ..+.. .+++|||..+..+++.. ....+ .|..+ -+...|++|+ T Consensus 137 ----~~~~~~d~i~dA~~~l~d------~~~~~-~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:93 137 ----ADITKLNGLQSAIDKFND------EDLEP-MVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIV 205 (274) T ss_pred ----ccccCHHHHHHHHHHhhh------ccCCc-cEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEE Confidence 011234455555444421 11222 35899999998887532 11221 12211 1123599999 Q ss_pred ecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcccccc Q lcl|NC_019921. 302 ESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 302 ~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~ 376 (381) .|+.||.+++++.....+.+..+.++.++..++... ....+++..+++.++++++++|+++ .++. ++|- T Consensus 206 ~s~~~p~~t~~l~~~gai~~~~~~~~~vE~~Rd~~~--~~d~i~~~~~y~~~~~~~~~~v~~t--~~~~--s~~~ 274 (274) T protein:vir:93 206 RTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKIT--KGSG--SLEM 274 (274) T ss_pred EcCCCCcceEEEEeCCeEEEEecCCcccccccchhh--cccEEEEEEEEEEEEEcCCceEEEe--eCcc--ccCC Confidence 999999999888887777777788888887776544 6789999999999999999988644 3333 3333 No 108 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.68 E-value=3.7e-18 Score=116.15 Aligned_cols=255 Identities=14% Similarity=0.100 Sum_probs=175.8 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEecCCcceEEeecccccccccCcceeeE Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~----~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v 150 (381) |..+...-...++|+-+.+.+.+.+.....+.+++.+-+. +| .+++|+....+.+.|..|+++++. ...+.++. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~-~~lt~~~~ 79 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISL-DKIGTTTK 79 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccCh-hhcCCcce Confidence 3322333344688999999999999888777787766442 23 488999887788889999887764 56788999 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++..++.+..+.++.+...++..|+.+.+.++++..+++.++..++..- +. ... T Consensus 80 ~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l---------~~---~~~-------------- 133 (272) T protein:vir:36 80 SVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAA---------KT---TSQ-------------- 133 (272) T ss_pred eEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHh---------cc---ccc-------------- Confidence 9999999888999999999999999999999999999999998876421 00 000 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhc---cCCCCccc--c---ccCCCceeEe Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH---LNANGVYV--T---ALPFNLNVIE 302 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~---~~~~G~~~--~---~l~~G~pVv~ 302 (381) .......++.+.++...+... ... ..+++|||..++.+++...+ .+..|..+ . +...|++|+. T Consensus 134 --~~~~~~~~d~i~~A~~~lgd~------~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~ 204 (272) T protein:vir:36 134 --TVSTKANVDGVQAALDIFNDE------DAQ-AYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVR 204 (272) T ss_pred --cccccccHHHHHHHHHHhhhc------CCC-ceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEE Confidence 001123445555555444211 111 23589999999888753322 12223221 1 1236999999 Q ss_pred cCCCCCCcEEEEe----ecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCC Q lcl|NC_019921. 303 STVQEAGKVLTYV----KGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 303 s~~~p~~~i~fgd----~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~ 370 (381) |+.||.++..+.. ...+.++..++++++..++.. .....+++..+++.++++++++|+++ +++. T Consensus 205 s~~~p~~~~~~~~~~~~~gA~~~~~~~~~~vE~~R~~~--~~~d~i~~~~~y~~~v~~~~~vv~~t--~~g~ 272 (272) T protein:vir:36 205 SKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIV--TKTTVITADEHYAAYLYDLTKVVNIT--FTGV 272 (272) T ss_pred eCCCCCCceeEEEEEecccceeeeecCCcccccccchh--hcCcEEEEEEEEEEEEEcCccEEEEe--ecCC Confidence 9999988743221 122445566788887666544 45678999999999999999988755 4444 No 109 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.60 E-value=1.6e-16 Score=107.19 Aligned_cols=263 Identities=13% Similarity=0.014 Sum_probs=177.3 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEecCCcceEEeecccccccccCcceeeE Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~----~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v 150 (381) |...+..-+..++|+.+.+.+.+.+.....+.+++.+-.. +| .+++|+....+.+.+..++..++. .+.++++. T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~-~~lt~~~~ 79 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDY-SALETESV 79 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCcc-ccccccee Confidence 3333344456799999999999999887777777654331 23 488999877777888888777764 57888999 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeecc-CCCcceEeeeccccccccccccccceeeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGT-GKDQPIGLNRQVQKGVSVTEGAYPEKEEQG 229 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~-G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~ 229 (381) ++..++.+..+.++.+...++..|+.+.+.+.++..+++..+..++..- |.. ..... T Consensus 80 ~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~------------~~~~~---------- 137 (278) T protein:vir:80 80 KHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTT------------LEVKG---------- 137 (278) T ss_pred eEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccc------------ccccc---------- Confidence 9999998888999999999999999999999999999999999877532 110 00000 Q ss_pred eecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh---hccCC---CCcccc---ccCCCcee Q lcl|NC_019921. 230 TLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY---THLNA---NGVYVT---ALPFNLNV 300 (381) Q Consensus 230 ~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~---~~~~~---~G~~~~---~l~~G~pV 300 (381) ..+.......++.+.+....+.. .. ... ..+++|||..++.+++.. ....+ +|.... +...|++| T Consensus 138 ~~t~~~~~~~~~~~~da~~~l~~----~~-~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~V 211 (278) T protein:vir:80 138 AINIGLIDKIENTFTDAPDAIED----ES-ITT-TGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEI 211 (278) T ss_pred ccccchhhhHHHHHHHHHHhhcc----cC-CCc-ccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeE Confidence 00111111223333333333211 11 112 235789999988886542 11111 221111 12359999 Q ss_pred EecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCc Q lcl|NC_019921. 301 IESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) Q Consensus 301 v~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~ 371 (381) +.|+.||.++.++..-..+.++..+++.++..++.. ..+..+++..+++.++++++++|+++ ..+.. T Consensus 212 i~s~~~p~~t~~l~~~gAi~~~~~~~~~vE~~Rd~~--~~~d~i~~~~~yg~~v~~~~~~v~it--~~a~~ 278 (278) T protein:vir:80 212 VRTKKLADGNALAVKAGALKTFLKRNLLAESGRDMD--HKLTKFNADQHYAVALVDETKAVKVV--PVAGN 278 (278) T ss_pred EEcCCCCcceEEEEeccceeeeecCCcccccccchh--hccceeeeeeEEEEEEEcCcceEEEe--eccCC Confidence 999999999876665566666777788887766544 46789999999999999999988654 44443 No 110 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.57 E-value=5.7e-16 Score=104.15 Aligned_cols=260 Identities=13% Similarity=0.027 Sum_probs=178.3 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEecCCcceEEeecccccccccCcceeeE Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~----~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v 150 (381) |......-...++|+-++..+.+.+.....+.+++++-+. +| .+++|+....+.+....++.+++. .+.+++.. T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~-~~it~~~~ 79 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPV-DQIGTSKR 79 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCch-hhccccee Confidence 3333333456789999999999998877766677665432 23 488998776667777777776654 56788999 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++..++.+..+.++.+...++..|+.+.+.+.++..+++..+..++.--... +. . T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a---------~~--~-------------- 134 (274) T protein:vir:96 80 EAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA---------TL--T-------------- 134 (274) T ss_pred EEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcC---------CC--C-------------- Confidence 9999998888999999999999999999999999999999999877531100 00 0 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh---hccCC-CCccc-----cccCCCceeE Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY---THLNA-NGVYV-----TALPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~---~~~~~-~G~~~-----~~l~~G~pVv 301 (381) ...+..+++.+.+....+... .+. ...++|||..+..+++.. +...+ .|+.. -+...|++|+ T Consensus 135 --~~~~~~~~d~i~dA~~~l~d~------~~~-~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~~Vi 205 (274) T protein:vir:96 135 --VEADITKLDGLQTAIDKFNDE------DLE-PMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIV 205 (274) T ss_pred --cCcccccHHHHHHHHHHhccc------CCC-ceEEEeCHHHHHHHHhcccccccccccccccceeecccceecCeeEE Confidence 001112355566655544211 122 235789999988886642 12221 12111 1122599999 Q ss_pred ecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcccc Q lcl|NC_019921. 302 ESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 302 ~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~ 374 (381) .|+.+|.++.++.....+.+....++.++..++.. .....+++..+++.++++++++|+++ .+...... T Consensus 206 ~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~--~~~d~i~~~~~yg~~~~~~~~vv~~t--~~~~~~~~ 274 (274) T protein:vir:96 206 RSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDAS--RKSTALYSDKHYVAYLYDESKVVKIT--KGAGDEVM 274 (274) T ss_pred EcCCCCcceEEEEeCcceeeeecCCcccccccchh--hcccEEEEeeEEEEEEEcCccEEEEE--cCcccccC Confidence 99999999876666666666777888887666544 46789999999999999999988754 33333333 No 111 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.56 E-value=7e-16 Score=103.67 Aligned_cols=262 Identities=13% Similarity=0.060 Sum_probs=183.4 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe----cCC-ceEEEEecCCcceEEeecccccccccCcceeeE Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN----AGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~----~~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v 150 (381) |..+...-...++|+-+...+.+.+.+...+.+++.+-+ .+| .+.+|.....+.+.++.|+.+++. ...++++. T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~-~~lt~~~~ 79 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPV-DKIETNRR 79 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCc-ccccccee Confidence 332233344568899999999999988888888877644 234 489999877788888999887764 56788999 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ....++.+..+.++.+....+..|..+.+.+.++..+++..+..++. .++..+. .. T Consensus 80 ~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~---------~l~~~~~--~~------------- 135 (276) T protein:vir:10 80 EAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLE---------ALRGTKL--TV------------- 135 (276) T ss_pred eEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHH---------HHhcccc--cc------------- Confidence 99999999999999999999999999999999999999999987763 1111000 00 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh---hccCCC-Ccc-c----cccCCCceeE Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY---THLNAN-GVY-V----TALPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~---~~~~~~-G~~-~----~~l~~G~pVv 301 (381) .....+++.+.+....+... .+.. .+.+|||..+..+++.. +...++ |.. + -+...|++|+ T Consensus 136 ---~~~~~t~d~i~~A~~~lgd~------~~~~-~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi 205 (276) T protein:vir:10 136 ---SADIGTLAGLEAAIDTFDDE------DLEP-MVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGAVIV 205 (276) T ss_pred ---cccccCHHHHHHHHHHhccc------cCcc-cEEEEcHHHHHHHHHhccccccccccccccceeccccceecceeEE Confidence 00112345555555544211 1222 35789999999887642 122221 111 1 1234699999 Q ss_pred ecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccC Q lcl|NC_019921. 302 ESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGT 377 (381) Q Consensus 302 ~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~ 377 (381) .|+.+|++++++.......++..+++.++..++... ....+++..++..+++++..+|++ +. +.....+++ T Consensus 206 ~s~~~p~~t~~l~~~gAi~~~~~~~~~vE~dRd~~~--~~d~i~~~~~y~~~~~~~~~vv~~--t~-~~~~~~~~~ 276 (276) T protein:vir:10 206 RSKKLDEGEAILAKRGAVKLITKRDFFLETDRDPST--KTTALYSDKHYVAYLYDESKAVKV--TK-GAGTTDSGA 276 (276) T ss_pred EcCCCCcceEEEEeccceeeeecCCceeecccchhh--cccEEEEeeEEEEEEEcCcceEEE--ec-CCcCCcCCC Confidence 999999998766655556667788888888776554 578899999999999999997764 44 334445555 No 112 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.56 E-value=1.1e-15 Score=102.58 Aligned_cols=309 Identities=10% Similarity=0.041 Sum_probs=170.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec Q lcl|NC_019921. 36 MINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA 115 (381) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~ 115 (381) |-+ +....+.+.++...+.+. .=+.+.=||.+++++...++++.+.+.+++++.++++++ T Consensus 1 ~~~---~~~~~~~~n~~~~~i~k~-----------------~it~~~l~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~ 60 (360) T protein:vir:99 1 MSS---NSTIDSVRNQNMNSLSQK-----------------DIGLAELDGFQLPVDVTEEFLERMQKGVQILGMADTMTL 60 (360) T ss_pred Ccc---hhHHHHHhhhHHHHHHhh-----------------hccccccCceeecHHHHHHHHHHHhhccchhhhcceeec Confidence 000 000011111111111111 111122246788999999999999999999999999886 Q ss_pred CC-ceEEEEecCCcce-EEeecccccccccCcceeeEeec-ceeEEEeeeccHHhhhcC----HHHHHHHHHHHHHHHHH Q lcl|NC_019921. 116 GL-RLKFLKSETSGVA-VWGKIYGEIKGQLDAAFSEETAI-QNKLTAFVVLPKDLNDFG----PAWIERFVRVQIEEAFA 188 (381) Q Consensus 116 ~g-~~~~p~~~~~~~a-~wv~e~~~~~~~~~~~f~~v~l~-~~kl~~~~~iS~ell~ds----~~~~e~~l~~~la~~~~ 188 (381) .. ...+++-.-+..- .-..|.+......+++...+.+. .+++.....++.+-+.+. ...++..|.+.++++++ T Consensus 61 ~s~~~ei~kig~G~r~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~ 140 (360) T protein:vir:99 61 ARLEMEVPQFGVPRLSGHTRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYG 140 (360) T ss_pred ccccccccccccceeeccccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHH Confidence 43 2333321111100 00112222222234455556663 345666667777766654 23567999999999999 Q ss_pred HHHhhheeeccCC---------Ccc-----eEeeeccccccc------ccccccc-ceeeeeeec---cc------c-cc Q lcl|NC_019921. 189 VALETAFLKGTGK---------DQP-----IGLNRQVQKGVS------VTEGAYP-EKEEQGTLT---FA------N-PR 237 (381) Q Consensus 189 ~~~~~a~i~G~G~---------~~P-----~Gil~~~~~~~~------~~~~~~~-~~~~~~~~t---~~------~-~~ 237 (381) +-++.-.++|+.. +.| .|+|+....... .+.++.. ++....+.+ .+ + .. T Consensus 141 ~Dle~l~~~g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 220 (360) T protein:vir:99 141 NDLGLMGIRAGASSGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQP 220 (360) T ss_pred HHHHHHHhhccchhcccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhcccccccccc Confidence 9999999988743 234 488766532210 0000000 000000000 00 0 00 Q ss_pred hhHHHHHHHHHHhhhccccccccccC----ceEEEEchhhHHHHhhhhhccC-CCCc--cc---cccCCCceeEecCCCC Q lcl|NC_019921. 238 ATVNELTQVFKYHSTNEKGKSVAVKG----NVTMVVNPSDAFEVQAQYTHLN-ANGV--YV---TALPFNLNVIESTVQE 307 (381) Q Consensus 238 ~~~~~l~~l~~~l~~~~~~~~~~~~~----~a~~~mn~~t~~~~~~~~~~~~-~~G~--~~---~~l~~G~pVv~s~~~p 307 (381) .....+.++++.| |..|++ +..|+|++.+...++....-+. +-|. .. ...++|+||+..+.|| T Consensus 221 ~~~~lf~~~~~~L-------p~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~t~LGd~~l~g~~~~~~~Gipi~~v~~~p 293 (360) T protein:vir:99 221 VDTSLFNETIQTL-------DSRYRESDAYSPVLMTSPNQVQSYTMSLTEREDPLGSAVIFGDSDITPFSYDLVGVNGFP 293 (360) T ss_pred chHHHHHHHHHhc-------chhhhcCcccceEEEccCchHHHHHHHHhccCcccchhheecccccccceeeeEEcCCCC Confidence 1122233444444 555765 4589999998776665433222 1111 11 1235799999999999 Q ss_pred CCcEEEEeecceEEEeecceEEEeehhh-hhhcCc--eEEEEEEEEcCEEecCceEEEEEEEecCCcc Q lcl|NC_019921. 308 AGKVLTYVKGLYDGYLAGGINVQKFKET-LALDDM--DLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 308 ~~~i~fgd~~~y~i~~r~~i~i~~~~~~-~~~~d~--~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~ 372 (381) ++.++|-++++.+++....++|+.+.+- +..+.. +.|-....+|...-+.+|.|+++ .+..+++ T Consensus 294 d~~~mlT~p~NLi~g~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt-~~~~~~~ 360 (360) T protein:vir:99 294 DEYMMFTDPNNLAFGLYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVT-DLETPTA 360 (360) T ss_pred CCceEEeccCceeEEeeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEe-cCCCCCC Confidence 9999999999999999999999765432 211222 33334456788878888988765 6655555 No 113 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.54 E-value=2.1e-15 Score=101.02 Aligned_cols=260 Identities=13% Similarity=0.033 Sum_probs=179.5 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEecCCcceEEeecccccccccCcceeeE Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~----~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v 150 (381) |..+...-...++|+-+...+.+.+.......+++.+-+. +| .+++|+....+.+....++.+++. ...+.++. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccc-ccccccee Confidence 3323333445789999999999988777666677765432 34 488998776677777888777754 56788899 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++..++.+..+.++.+-...+..|+.+.+.+.++.++++..+..++.--.+. ..... T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a------------~~~~~----------- 136 (274) T protein:vir:94 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA------------KLTVN----------- 136 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc------------Ccccc----------- Confidence 9999998888999999999999999999999999999999999877531110 00000 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh---hccCC-CCccc-----cccCCCceeE Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY---THLNA-NGVYV-----TALPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~---~~~~~-~G~~~-----~~l~~G~pVv 301 (381) .....++.+.+....+.. ..+. ....+|||..+..+++.. ....+ .|..+ -+...|++|+ T Consensus 137 ----~~~~~~d~i~dA~~~l~d------~~~~-~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:94 137 ----ADITKLNGLQSAIDKFND------EDLE-PMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIV 205 (274) T ss_pred ----ccccCHHHHHHHHHHhhc------cCCC-ceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEE Confidence 011234555555544421 1122 235789999998887531 12222 12211 1123599999 Q ss_pred ecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcccccc Q lcl|NC_019921. 302 ESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 302 ~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~ 376 (381) .|+.||.++.++.......++.++++.++..++... ....+++..+++.++++++.+|+++...+ ++|- T Consensus 206 ~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~--~~d~i~~~~~y~~~~~~~~~vv~~t~~~~----~~~~ 274 (274) T protein:vir:94 206 RTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGSG----SLEM 274 (274) T ss_pred EcCCCCcceEEEEeCcceEeeecCCceeccccchhh--cccEEEEEEEEEEEEEcCCceEEEecCcc----cccC Confidence 999999999877766667777788888888777554 56789999999999999999887663333 2222 No 114 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.54 E-value=2.1e-15 Score=101.02 Aligned_cols=260 Identities=13% Similarity=0.033 Sum_probs=179.5 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEecCCcceEEeecccccccccCcceeeE Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~----~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v 150 (381) |..+...-...++|+-+...+.+.+.......+++.+-+. +| .+++|+....+.+....++.+++. ...+.++. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccc-ccccccee Confidence 3323333445789999999999988777666677765432 34 488998776677777888777754 56788899 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++..++.+..+.++.+-...+..|+.+.+.+.++.++++..+..++.--.+. ..... T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a------------~~~~~----------- 136 (274) T protein:vir:97 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA------------KLTVN----------- 136 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc------------Ccccc----------- Confidence 9999998888999999999999999999999999999999999877531110 00000 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh---hccCC-CCccc-----cccCCCceeE Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY---THLNA-NGVYV-----TALPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~---~~~~~-~G~~~-----~~l~~G~pVv 301 (381) .....++.+.+....+.. ..+. ....+|||..+..+++.. ....+ .|..+ -+...|++|+ T Consensus 137 ----~~~~~~d~i~dA~~~l~d------~~~~-~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:97 137 ----ADITKLNGLQSAIDKFND------EDLE-PMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIV 205 (274) T ss_pred ----ccccCHHHHHHHHHHhhc------cCCC-ceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEE Confidence 011234555555544421 1122 235789999998887531 12222 12211 1123599999 Q ss_pred ecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcccccc Q lcl|NC_019921. 302 ESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 302 ~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~ 376 (381) .|+.||.++.++.......++.++++.++..++... ....+++..+++.++++++.+|+++...+ ++|- T Consensus 206 ~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~--~~d~i~~~~~y~~~~~~~~~vv~~t~~~~----~~~~ 274 (274) T protein:vir:97 206 RTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKITKGSG----SLEM 274 (274) T ss_pred EcCCCCcceEEEEeCcceEeeecCCceeccccchhh--cccEEEEEEEEEEEEEcCCceEEEecCcc----cccC Confidence 999999999877766667777788888888777554 56789999999999999999887663333 2222 No 115 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.54 E-value=1.1e-15 Score=102.59 Aligned_cols=261 Identities=13% Similarity=0.022 Sum_probs=178.4 Q ss_pred HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEecCCcceEEeecccccccccCccee Q lcl|NC_019921. 74 MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFS 148 (381) Q Consensus 74 ~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~----~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~ 148 (381) .++.. ...-...++|+-+...+.+.+.....+.+++.+-+. +| .+++|+....+.+.+..++++++. .+.+++ T Consensus 1 ~~~~~-~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~ 78 (275) T protein:vir:96 1 MALEN-MTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPI-DLIETK 78 (275) T ss_pred CCCcc-cchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcch-hhcccc Confidence 11111 122233678999999999999988888888876542 23 489998877778888888887764 567888 Q ss_pred eEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeee Q lcl|NC_019921. 149 EETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) Q Consensus 149 ~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~ 228 (381) +.+...++.+..+.++.+....+..|+.+...+.++..+++.++..++.--++. .. T Consensus 79 ~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a------------~~------------ 134 (275) T protein:vir:96 79 KRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGA------------TL------------ 134 (275) T ss_pred eeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc------------cc------------ Confidence 999999999989999999999988899999999999999999998876421110 00 Q ss_pred eeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh---hccCC-CCccc--c---ccCCCce Q lcl|NC_019921. 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY---THLNA-NGVYV--T---ALPFNLN 299 (381) Q Consensus 229 ~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~---~~~~~-~G~~~--~---~l~~G~p 299 (381) +......+++.+.+....+... .+.. ..++|||..+..+++.. +...+ .|..+ . +...|.+ T Consensus 135 ---~~~~~~~~~d~i~dA~~~lgd~------~~~~-~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~ 204 (275) T protein:vir:96 135 ---KVEADITKLAGLQTAIDKFNDE------DLEP-MVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAI 204 (275) T ss_pred ---cccccccCHHHHHHHHHHhccc------cCCc-cEEEeCHHHHHHHHhcccccccccccccccceeccccceecCee Confidence 0001123456666665555311 1222 35889999998887642 22222 12111 1 1235999 Q ss_pred eEecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcc Q lcl|NC_019921. 300 VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 300 Vv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~ 372 (381) |+.|+.+|.++.++.....+.++.+.++.++..++... ....+++..++..+++++++.|+++.+=+++.- T Consensus 205 Vi~s~~~p~~t~~i~~~gA~~~~~~~~~~vE~~Rd~~~--~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 205 IVRSNKIKEGEAILAKRGAVKLITKRDFFLETERHASH--KSTALFSDKHYVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred EEEeCCCCcceEEEEeccceeeeecCCcccccccchhh--cCcEEEEeEEEEEEEEcCccEEEEEecccccCC Confidence 99999999998665545555566677888887776543 678999999999999999998765532222222 No 116 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.44 E-value=2.8e-14 Score=94.89 Aligned_cols=260 Identities=13% Similarity=0.031 Sum_probs=175.4 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe----cCC-ceEEEEecCCcceEEeecccccccccCcceeeE Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN----AGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~----~~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v 150 (381) |..+...-...++|+-+...+.+.+.....+.+++.+-. .+| .+++|+....+.+....++.+++. ...+.++. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccch-hhccccee Confidence 222233334468999999999998877766666766532 234 488998776677777788777654 46778888 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) .+..++.+..+.++.+-...+..|+.+.+.+.++..+++..+..++.--.+. .. T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a------------~~-------------- 133 (274) T protein:vir:12 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA------------KL-------------- 133 (274) T ss_pred eEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcc------------cc-------------- Confidence 8888898888999999888888899999999999999999999876532110 00 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh---hccCCCC-ccc--c---ccCCCceeE Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY---THLNANG-VYV--T---ALPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~---~~~~~~G-~~~--~---~l~~G~pVv 301 (381) +.......++.+.+....|... .... .+.+|||..+..+++.. +...+++ .-+ . +...|++|+ T Consensus 134 -~~~~~a~~~d~i~dA~~~lgd~------~~~~-~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:12 134 -TVNADITKLNGLQSAIDKFNDE------DLEP-MVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIV 205 (274) T ss_pred -cccccccCHHHHHHHHHHhccc------cccc-cEEEeCHHHHHHHHhhhhhhccccccccccceecccceeecCeeEE Confidence 0001112355555555544211 1222 35789999998887632 2222221 111 1 123599999 Q ss_pred ecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcccccc Q lcl|NC_019921. 302 ESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 302 ~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~ 376 (381) .|+.||.+..++.-...+.++..+++.++..++... ....+++..++..++++++..|+++ ..+.++|- T Consensus 206 ~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~--~~d~i~~~~~y~~~~~~~~~vv~~t----~~~~~~~~ 274 (274) T protein:vir:12 206 RSNKLEAGTAILAKKGAVKLILKRDFFLEVARDAST--KTTALYSDKHYVAYLYDESKAVKIT----KGSGSLEM 274 (274) T ss_pred EeCCCCcceEEEEeccceeeeecCCceeccccchhh--cccEEEeeeEEEEEEEcCCceEEEE----cCCccccC Confidence 999999988654434445555678888888777654 5579999999999999999988655 33444444 No 117 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.39 E-value=9.4e-14 Score=92.00 Aligned_cols=260 Identities=13% Similarity=0.051 Sum_probs=173.9 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEecCCcceEEeecccccccccCcceeeE Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~----~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v 150 (381) |..+...=...++|+-+...+.+.+.....+.+++.+-+. +| .+++|+....+.+....++.+++. ...+.+.. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccch-hhccccee Confidence 2222222334688999999999988877666677655432 24 589998776677777788777654 46778888 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++..++.+..+.++.+-...+..|+.+.+.+.++..+++..+..++.--.+.. ... T Consensus 80 ~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~-----------~~~------------- 135 (274) T protein:vir:96 80 EAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK-----------LTV------------- 135 (274) T ss_pred EEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------ccc------------- Confidence 88888888889999999888888999999999999999999987764211100 000 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh---hccCCC-Cc-cc-c---ccCCCceeE Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY---THLNAN-GV-YV-T---ALPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~---~~~~~~-G~-~~-~---~l~~G~pVv 301 (381) ......++.+.+....|... ... ..+.+|||..+..+++.. +...++ |. .+ . +...|++|+ T Consensus 136 ---~~~~~~~d~i~~A~~~lgd~------~~~-~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:96 136 ---EADITKLTGLQTAIDKFNDE------DLE-PMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIV 205 (274) T ss_pred ---cccccCHHHHHHHHHHhccc------ccc-ccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEE Confidence 00112355555555544211 122 235789999998887632 122222 11 11 1 122599999 Q ss_pred ecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcccccc Q lcl|NC_019921. 302 ESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 302 ~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~ 376 (381) .|+.+|.+..++.-...+.++..+++.++..++.. ...+.+++..+++.++++++..|+++ ..+..+|- T Consensus 206 ~s~~~~~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~--~~~d~i~~~~~y~~~~~~~~~~v~~t----k~~~~~~~ 274 (274) T protein:vir:96 206 RSNKLEAGTAILAKKGAVKLITKRDFFLETDRDPS--TKTTALYSDKHYVAYLYDESKAVKIT----KGSGSLEM 274 (274) T ss_pred EeCCCCCceEEEEeccceeeeecCCcccccccccc--cccCEEEEeEEEEEEEEcCCcEEEEE----cCCccccC Confidence 99999998854444444555567788888777654 46788999999999999999988655 33344443 No 118 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.39 E-value=9.4e-14 Score=92.00 Aligned_cols=260 Identities=13% Similarity=0.051 Sum_probs=173.9 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEecCCcceEEeecccccccccCcceeeE Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~----~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v 150 (381) |..+...=...++|+-+...+.+.+.....+.+++.+-+. +| .+++|+....+.+....++.+++. ...+.+.. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~~~ 79 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPT-DILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccch-hhccccee Confidence 2222222334688999999999988877666677655432 24 589998776677777788777654 46778888 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ++..++.+..+.++.+-...+..|+.+.+.+.++..+++..+..++.--.+.. ... T Consensus 80 ~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~-----------~~~------------- 135 (274) T protein:vir:95 80 EAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK-----------LTV------------- 135 (274) T ss_pred EEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------ccc------------- Confidence 88888888889999999888888999999999999999999987764211100 000 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh---hccCCC-Cc-cc-c---ccCCCceeE Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY---THLNAN-GV-YV-T---ALPFNLNVI 301 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~---~~~~~~-G~-~~-~---~l~~G~pVv 301 (381) ......++.+.+....|... ... ..+.+|||..+..+++.. +...++ |. .+ . +...|++|+ T Consensus 136 ---~~~~~~~d~i~~A~~~lgd~------~~~-~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:95 136 ---EADITKLTGLQTAIDKFNDE------DLE-PMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIV 205 (274) T ss_pred ---cccccCHHHHHHHHHHhccc------ccc-ccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEE Confidence 00112355555555544211 122 235789999998887632 122222 11 11 1 122599999 Q ss_pred ecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcccccc Q lcl|NC_019921. 302 ESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 302 ~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~ 376 (381) .|+.+|.+..++.-...+.++..+++.++..++.. ...+.+++..+++.++++++..|+++ ..+..+|- T Consensus 206 ~s~~~~~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~--~~~d~i~~~~~y~~~~~~~~~~v~~t----k~~~~~~~ 274 (274) T protein:vir:95 206 RSNKLEAGTAILAKKGAVKLITKRDFFLETDRDPS--TKTTALYSDKHYVAYLYDESKAVKIT----KGSGSLEM 274 (274) T ss_pred EeCCCCCceEEEEeccceeeeecCCcccccccccc--cccCEEEEeEEEEEEEEcCCcEEEEE----cCCccccC Confidence 99999998854444444555567788888777654 46788999999999999999988655 33344443 No 119 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.39 E-value=3.5e-13 Score=88.84 Aligned_cols=342 Identities=11% Similarity=-0.024 Sum_probs=174.2 Q ss_pred CchhHHHHHHHHHHHHHHHHhhh---hh-HHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhccccccCHHH Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNG---EP-QERQNELYGDMINQLF-------EETKLQAKAEAERVSSLPKSAQSLSANQ 69 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~---~~-~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~lt~~e 69 (381) |.-=|+ .+++. +. +.++.....+|.+... +-..++...+-.+...+...+.. ...| T Consensus 1 ~~~~~~------------~~~~~~~~~~~~~e~k~lr~~me~~et~~e~~~~~~~~~~~e~el~E~f~Kmm~G~~-p~~e 67 (393) T protein:vir:79 1 MENWLK------------QLKESGFTETQVQEQKSLRTRMERGETLAEADANKLALNEEETQILESFAKMMEGET-PTNE 67 (393) T ss_pred CchHHH------------HHHhccCchhHHHHHHHHHHHhhhhhhhhhhhhhhhhcchhHHHHHHHHHHHhcCCC-chhh Confidence 211111 11110 11 1111112222211100 00011111111111111111111 1111 Q ss_pred HHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec-CCceEEEEecCCcceEEeecccccccc--cCcc Q lcl|NC_019921. 70 RSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLRLKFLKSETSGVAVWGKIYGEIKGQ--LDAA 146 (381) Q Consensus 70 ~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~-~g~~~~p~~~~~~~a~wv~e~~~~~~~--~~~~ 146 (381) .+.... -++.++..+||..+++-|.+........-.+...... .|+..+...-+.--+.-++|+++.+.. +..+ T Consensus 68 V~~~e~---mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g~~Ra~~IgEGgE~~~~sld~~T 144 (393) T protein:vir:79 68 VNLREF---MATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSIGIMRAYDVAEGQEIPEDSIDWQT 144 (393) T ss_pred eehhhh---hcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccchheeeeccccccccccccchhhhc Confidence 111111 2455677999999999998844333322233333333 333222222223345567777777653 3479 Q ss_pred eeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCC-c--ceEeeecccccccccccccc Q lcl|NC_019921. 147 FSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD-Q--PIGLNRQVQKGVSVTEGAYP 223 (381) Q Consensus 147 f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~-~--P~Gil~~~~~~~~~~~~~~~ 223 (381) |+.+++...|++..+.+|+||+.||..|+-++..+...++|++..+.-.+++.-++ + .-|+.+...... ++ -.. T Consensus 145 ~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahp--tG-r~~ 221 (393) T protein:vir:79 145 HESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHT--TG-LDK 221 (393) T ss_pred CCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCcccee--ec-CCc Confidence 99999999999999999999999999999999999999999999999999988653 3 345554322211 11 000 Q ss_pred ceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhh----hhccCCCCccc-------- Q lcl|NC_019921. 224 EKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ----YTHLNANGVYV-------- 291 (381) Q Consensus 224 ~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~----~~~~~~~G~~~-------- 291 (381) .....|+ ...+++.+++... ++.-|. ..+++|+|--+..+.+. .+.+++-|+|. T Consensus 222 ~~~qNGT-------lSleDllDm~~av------~~~hyt-~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~ 287 (393) T protein:vir:79 222 NGVQNDT-------FSAEDFLDLIIAV------MANEYT-PSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSM 287 (393) T ss_pred ccccccc-------ccHHHHHHHHHHH------hcccCC-cceEEEcCchhhhhhhhhhhcceeeccccccCccccchhh Confidence 0011122 2334444443322 233344 35689999765444321 12344555542 Q ss_pred --------cccCCCceeEecCCCCCCc------EEEEeecce-EEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEec Q lcl|NC_019921. 292 --------TALPFNLNVIESTVQEAGK------VLTYVKGLY-DGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKD 356 (381) Q Consensus 292 --------~~l~~G~pVv~s~~~p~~~------i~fgd~~~y-~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~ 356 (381) ..+||...|+.|+.+|=++ ....|-... ++-.+-+++.++-++- ..|.+-++-++|++-.+.+ T Consensus 288 algp~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk--~rdiq~iKl~ERYG~gvLn 365 (393) T protein:vir:79 288 ALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEK--ARGLQNIKMIERYGIGILN 365 (393) T ss_pred hhchhhhccccccceeEEEecccccccccceeeEEEeecCCceEEEEecCcceeccccc--cccceeeeeeeeeceeeee Confidence 2367889999999999332 111222221 1122335555554442 3578889999999987777 Q ss_pred Cc-eEE-EEEEEecCCccc--c---ccC Q lcl|NC_019921. 357 NK-VAA-VWKLDLKGHKPA--L---EGT 377 (381) Q Consensus 357 ~~-Afv-v~~~~~~~~~~~--~---~~~ 377 (381) .. |++ ...++++..-++ + .|+ T Consensus 366 ~gkaiavakNI~~~k~y~~P~~~~~~~~ 393 (393) T protein:vir:79 366 EGKAIAVAKNISMDKSYAEPMLIKNVGN 393 (393) T ss_pred CCceEEEEecceeecccccchhhhccCC Confidence 65 333 333444333221 1 111 No 120 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.36 E-value=2.3e-13 Score=89.88 Aligned_cols=271 Identities=14% Similarity=0.099 Sum_probs=167.6 Q ss_pred ccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceE-----Eeec Q lcl|NC_019921. 62 AQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAV-----WGKI 135 (381) Q Consensus 62 ~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~~~~a~-----wv~e 135 (381) +..+|-.|. +.+.+......|+|.+.+.|+|++.+.+.++.| ...+.+...-+.+. |-.- T Consensus 1 mpaltLaea--------------~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~ 66 (310) T protein:vir:97 1 MASVTLAES--------------AKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFS 66 (310) T ss_pred CcccchHHH--------------hhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCccccccccccc Confidence 222332222 245678899999999999999999998887654 45566554433332 2221 Q ss_pred ccccccccCcceeeEeecceeEEEeeeccHHhhhc--C-HHHHHHHHHHHHHHHHHHHHhhheeeccCCCcce-Eeeecc Q lcl|NC_019921. 136 YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--G-PAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI-GLNRQV 211 (381) Q Consensus 136 ~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~d--s-~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~-Gil~~~ 211 (381) ... ..++..+|.+++...+-+.+.+.|...+.+- + ..|...+=.+...+++.+..+..|||||.+++|. ||++.+ T Consensus 67 ~~g-~~~~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~ 145 (310) T protein:vir:97 67 GAG-AGKAAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLC 145 (310) T ss_pred CCC-ccccccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcC Confidence 111 2356789999999999999999999877653 3 5566666677888999999999999999977666 998766 Q ss_pred ccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhh--------- Q lcl|NC_019921. 212 QKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT--------- 282 (381) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~--------- 282 (381) ........+. -.........|++.+.+..+ -....+++||+.+...+.+..- T Consensus 146 ~~~q~i~~~~---------~gg~~t~d~LDeLl~~v~~~----------~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~ 206 (310) T protein:vir:97 146 ASGQKATTGA---------TGSAISFAILDELMDLVVDK----------DGQVDYLTMHARTLRSYKALLRALGGASINE 206 (310) T ss_pred CccceeecCC---------CCCCCCHHHHHHHHHHHhcC----------CCCCCEEEecHHHHHHHHHHHHHhcCCCCCC Confidence 4432221110 00111112333333332111 1123469999988666543211 Q ss_pred -ccCCCCccccccCCCceeEecCCCCCC----------cEE---EEee--cceEEEe----ecceEEEeehhhhhhcCce Q lcl|NC_019921. 283 -HLNANGVYVTALPFNLNVIESTVQEAG----------KVL---TYVK--GLYDGYL----AGGINVQKFKETLALDDMD 342 (381) Q Consensus 283 -~~~~~G~~~~~l~~G~pVv~s~~~p~~----------~i~---fgd~--~~y~i~~----r~~i~i~~~~~~~~~~d~~ 342 (381) ..+..|+++... -|+|++.|+.+|.+ .|+ ||+. .+-+++. ..|+++..--+. =.++.. T Consensus 207 ~~~~~~G~~v~~~-~GiPi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~-~~~~v~ 284 (310) T protein:vir:97 207 VVELPSGAEVPAY-SGTPIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGES-EDSDEH 284 (310) T ss_pred ccccCCCCEEeee-CCeEEEEeCccCCCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcc-cCCcce Confidence 133455554322 38999999999853 144 4543 2344443 235555542111 124567 Q ss_pred EEEEEEEEcCEEecCceEEEEEEEecC Q lcl|NC_019921. 343 LYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 343 ~~r~~~r~dGk~~~~~Afvv~~~~~~~ 369 (381) -||..+++.-.+..++|++++. .+.. T Consensus 285 ~~~V~~Y~~~av~~~~A~a~L~-~V~~ 310 (310) T protein:vir:97 285 IWRVKWYCGLALFSEKGLACAD-GITN 310 (310) T ss_pred eEEEEEeeeEEEecccceeeec-cccC Confidence 7899999999999999999876 4444 No 121 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.27 E-value=1.3e-12 Score=85.79 Aligned_cols=258 Identities=9% Similarity=-0.001 Sum_probs=169.4 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec----CC-ceEEEEecCCcceEEeecccccccccCcceeeE Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA----GL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~----~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v 150 (381) |... .-...++|+-+.+.+.+.+.+...+.+++.+-+. +| .+.+|.....+.+.-+.|+.+++. ...++++- T Consensus 1 Ma~T--~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~-~~lt~~~~ 77 (270) T protein:vir:95 1 MTQT--KKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDT-TQMSMTTT 77 (270) T ss_pred CCce--ehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccch-hhcccchh Confidence 2211 1223579999999999999888888888776442 34 488998777777776777777654 46778888 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) ....++.+..+.++.+-...+..|....+.+.++..+++..+..++. .++...... T Consensus 78 ~a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~---------~l~~a~~~~--------------- 133 (270) T protein:vir:95 78 KVTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIA---------ELNKSKQTA--------------- 133 (270) T ss_pred eeeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHH---------Hhccccccc--------------- Confidence 88889988899999998888878899999999999999999987652 111110000 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC---CCCccc---cccCCCceeEecC Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN---ANGVYV---TALPFNLNVIEST 304 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~---~~G~~~---~~l~~G~pVv~s~ 304 (381) +.....+.+.+.+..+- + ......+.+|||.+++.+++...+.. .++.-. -....|++|+.++ T Consensus 134 ----~~~~t~~~~~dA~~~lg---d----~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig~~~G~~Viv~s 202 (270) T protein:vir:95 134 ----TVSADATGILDAIEVFN---S----ENDEDYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLVEIVGVSDIVKS 202 (270) T ss_pred ----ccccCHHHHHHHHHHhc---c----ccCCCcEEEEcHHHHHHHHhhhcccccccccchhcccccceecceeEEEeC Confidence 01122344444333321 1 11122358899999998876433222 222111 1122588988777 Q ss_pred CCC-CCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcccccc Q lcl|NC_019921. 305 VQE-AGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 305 ~~p-~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~ 376 (381) .+| +++.++.-.....++...++.++..++... ..+.+++..++..+++++..+|+++++-++ +++- T Consensus 203 ~~~~~~~~~l~~~gAi~~~~~~~~~vEtdRd~~~--~~d~i~~~~~y~v~~~~~skvv~~t~~~a~---~~~~ 270 (270) T protein:vir:95 203 KRVSENTAFLQRYGAMEIVNKKKPEAYTDFDILK--RTHLLSTNYHYSVNLKDETGVVKVTFKPSG---SLEM 270 (270) T ss_pred CCCCceeEEEEeccceeeeecCCceeeeccchhh--cccEEEeeeEEEEEEEccceEEEEEecCCC---CcCC Confidence 665 555554444445666777888888777654 567889999999999999998876654322 2222 No 122 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=98.97 E-value=6.3e-11 Score=76.50 Aligned_cols=219 Identities=14% Similarity=0.101 Sum_probs=149.1 Q ss_pred ceeEecCCceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHH Q lcl|NC_019921. 110 LGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAV 189 (381) Q Consensus 110 ~~v~~~~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~ 189 (381) -+-++.|-.+.+|.. .+.+.-+.|+.+++. ...++++-+...++.+.-++|+.+-...+..|..+...+.++.+|++ T Consensus 1 ~~~~~~Gdtit~P~~--iGda~~v~eG~~i~~-~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENGINLANLCEYPND--IGDAADVAEGGEISL-DKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred CccccCCceEEeccc--ccchhhhcCCCcCCh-hhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 233455667899976 456677888887764 45778899999999999999999999988899999999999999999 Q ss_pred HHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEE Q lcl|NC_019921. 190 ALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVV 269 (381) Q Consensus 190 ~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~m 269 (381) ++|..++.= + ..... + .+...+++.+.+.+..+-- ......+.+| T Consensus 78 kvD~di~~~---------~---~~a~l---------------~-~~~~~t~d~i~~A~~~fgd-------e~~~~~vivv 122 (231) T protein:vir:73 78 KVDDDLLKA---------A---KTTSQ---------------T-VSTKANVDGVQAALDIFND-------EDAQAYVLIV 122 (231) T ss_pred hhhHHHHHh---------h---ccccc---------------c-ccccccHHHHHHHHHHhcc-------ccccceEEEE Confidence 999987631 0 00000 0 0111345555554444321 1122335789 Q ss_pred chhhHHHHhhhhhc---cC--CCCcccc---ccCCCceeEecCCCCCCcEEEEe----ecceEEEeecceEEEeehhhhh Q lcl|NC_019921. 270 NPSDAFEVQAQYTH---LN--ANGVYVT---ALPFNLNVIESTVQEAGKVLTYV----KGLYDGYLAGGINVQKFKETLA 337 (381) Q Consensus 270 n~~t~~~~~~~~~~---~~--~~G~~~~---~l~~G~pVv~s~~~p~~~i~fgd----~~~y~i~~r~~i~i~~~~~~~~ 337 (381) ||.+++.+++.... .. .++...+ +...|++|+.|+.+|.+..++.. .....+....++.++..++... T Consensus 123 ~p~~~~~Lrk~~~~~~~~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~~ 202 (231) T protein:vir:73 123 NPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVT 202 (231) T ss_pred cchHHHhhhhccchhhhhhhhccceeeecccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeeccccccc Confidence 99999988763221 11 1111111 12259999999999998765432 2336677788888887776444 Q ss_pred hcCceEEEEEEEEcCEEecCceEEEEEEEecCC Q lcl|NC_019921. 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 338 ~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~ 370 (381) ..+.+++.+++..+++++..+|+ +++++. T Consensus 203 --k~~~i~~~~~y~v~l~~~~~vv~--~t~~g~ 231 (231) T protein:vir:73 203 --KTTVITADEHYAAYLYDLTKVVN--ITFTGV 231 (231) T ss_pred --cccEEEEeEEEEEEEEcCccEEE--EEeecC Confidence 56889999999999999999776 555555 No 123 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=98.92 E-value=1.1e-10 Score=75.19 Aligned_cols=289 Identities=13% Similarity=0.048 Sum_probs=153.0 Q ss_pred ccccCHHHHHHHHHHhhccCC--CCc--eeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEecCCcceEEeec Q lcl|NC_019921. 62 AQSLSANQRSFFMDINKNVNY--KEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGKI 135 (381) Q Consensus 62 ~~~lt~~e~~~~~~~~~~~~~--~gg--~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~-~~~p~~~~~~~a~wv~e 135 (381) ...++...+- .+..+... .|. .+-=+.+..++.+.....+.+++++++.++. |+ +++|+- +..++..... T Consensus 1 ~~~~~~~~~~---~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G~~~~~~~~~ 76 (345) T protein:vir:22 1 MASMTGGQQM---GTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAP 76 (345) T ss_pred Ccccccchhc---ccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-cceEEEeeec Confidence 1101000000 00111111 111 2334889999999999999999999988864 44 677765 4444555555 Q ss_pred ccccccc-cCcceeeEeecceeE-EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheee----ccC-----CCcc Q lcl|NC_019921. 136 YGEIKGQ-LDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTG-----KDQP 204 (381) Q Consensus 136 ~~~~~~~-~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~----G~G-----~~~P 204 (381) +.++..+ .+++..+.+|...++ +....|..-=--++..|+.+.+.++.+.++++..|.+++. +.. ++.| T Consensus 77 G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~ 156 (345) T protein:vir:22 77 GENLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENI 156 (345) T ss_pred CCCCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 4554332 346677755544333 3334444333345778999999999999999999987762 111 1223 Q ss_pred eEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhcc Q lcl|NC_019921. 205 IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL 284 (381) Q Consensus 205 ~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~ 284 (381) -|+-..........+ . ..+-........++.+.++...| +.+..+..+ -+.+++|..++.+.....+. T Consensus 157 ~~~~~~~~~~~~~~g-~------~~t~~~~~~~~~~~ai~~a~~~L----de~~VP~~~-R~~vv~P~~y~~Ll~~~~~~ 224 (345) T protein:vir:22 157 EGLGTATVIETTQNK-A------ALTDQVALGKEIIAALTKARAAL----TKNYVPAAD-RVFYCDPDSYSAILAALMPN 224 (345) T ss_pred ccccccccccccccc-c------cccccccCHHHHHHHHHHHHHHh----hhcCCCccC-CEEEeChHHHHHHhcccccc Confidence 332111110000000 0 00000011223344444433333 333333333 45688999888775432221 Q ss_pred C----CCCccccc---cCCCceeEecCCCCCCcE--------------------------------EEEeecceEEEeec Q lcl|NC_019921. 285 N----ANGVYVTA---LPFNLNVIESTVQEAGKV--------------------------------LTYVKGLYDGYLAG 325 (381) Q Consensus 285 ~----~~G~~~~~---l~~G~pVv~s~~~p~~~i--------------------------------~fgd~~~y~i~~r~ 325 (381) + +++.+.++ ...|.+|++|+.+|.+.+ +|+..+....+... T Consensus 225 ~~~~~~~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~ 304 (345) T protein:vir:22 225 AANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLR 304 (345) T ss_pred ccccccccccccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeee Confidence 1 12223222 225999999998874210 11122222334444 Q ss_pred ceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEec Q lcl|NC_019921. 326 GINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 326 ~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~ 368 (381) +++++..++..... ..+++++-++.++++|+|++++++|+. T Consensus 305 ~~~~e~~r~~~~~~--d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 305 DLALERARRANFQA--DQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred cceeeeeechhHHH--HHHHHHHhcCCcccccceeEEEEEeeC Confidence 55565554433222 378888899999999999999999997 No 124 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=98.88 E-value=6e-10 Score=71.14 Aligned_cols=334 Identities=13% Similarity=0.114 Sum_probs=170.2 Q ss_pred CchhHHHHHHHH--HHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHhhccccccCH Q lcl|NC_019921. 1 MTINLSETFANA--KNEFINAVNNGEPQERQNELYGDMINQLFEETK-----------LQAKAEAERVSSLPKSAQSLSA 67 (381) Q Consensus 1 mt~el~~~~~~~--~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~lt~ 67 (381) =.+|.++..++. .+++.+.+-+...+..+ .++..+...+... .++..++.+..+. ..+++ T Consensus 35 ~g~~~~~~~~~~~k~~el~kT~Sel~~ei~k---~e~eln~~~E~~Kgk~~mtefLkT~~A~~~fa~~l~~----nsg~s 107 (400) T protein:vir:93 35 SGFEVKNAIEDLPKVQELEKTLSENSIEIIK---IENELNAQEEKPKGKDKMTNFIESQNAVTEFFDVLKK----NSGKS 107 (400) T ss_pred hccchhhhhhhchhHHHHHHHHHHhHHHHHH---HhhhhhhhhhhcccchhHHHhhhhHHHHHHHHHHHHh----hcCCc Confidence 222222222221 12222222221111111 1111111111100 0111112111111 22333 Q ss_pred HHHHHHH-HHhhccC--CCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecCCcceEEeecccccccccC Q lcl|NC_019921. 68 NQRSFFM-DINKNVN--YKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKGQLD 144 (381) Q Consensus 68 ~e~~~~~-~~~~~~~--~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g~~~~p~~~~~~~a~wv~e~~~~~~~~~ 144 (381) +-+.+.. .+.+.+- .+--..+|.-+...|-..+.++.|+++...+.++++-+. -.......-+|+--.+..+.++. T Consensus 108 d~knaW~A~l~E~gvt~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p~l~V-~~~~dt~~qa~gHk~G~~K~eq~ 186 (400) T protein:vir:93 108 EIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLV-SRSFDSANEAQVHKDGQTKTEQA 186 (400) T ss_pred chhhhhhhhhhhcccccCCchhhcchHHHHHHHHhhhccCCcccceeeecCCceee-ecchhhhcccceeccCCccccee Confidence 3333332 2222222 344457899999999999999999999999998854432 22233333567665566677778 Q ss_pred cceeeEeecceeEEEeeeccHHhhh--cCHHHHHHHHHHHHHHHHHH-HHhhheeeccCCCcceEee--ecccccccccc Q lcl|NC_019921. 145 AAFSEETAIQNKLTAFVVLPKDLND--FGPAWIERFVRVQIEEAFAV-ALETAFLKGTGKDQPIGLN--RQVQKGVSVTE 219 (381) Q Consensus 145 ~~f~~v~l~~~kl~~~~~iS~ell~--ds~~~~e~~l~~~la~~~~~-~~~~a~i~G~G~~~P~Gil--~~~~~~~~~~~ 219 (381) .+|..-+|.+.-+|.+..+..-..+ ++.-.|-.||-++|...+-. +.+.|++-|+|++...|+- +++...... T Consensus 187 ~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~d-- 264 (400) T protein:vir:93 187 ATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKI-- 264 (400) T ss_pred eeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHHHHHhhhheeecccccccCCCcchhhhhhhhhh-- Confidence 8999999998777776666433333 23345799999999999996 5799999999988665551 111111000 Q ss_pred ccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccc------ Q lcl|NC_019921. 220 GAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTA------ 293 (381) Q Consensus 220 ~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~------ 293 (381) ++ .....+++.+ ..+...+.+... +. ...+...+|+|.+++.++. .++++|++... T Consensus 265 -t~-kt~~a~~~~~---qdl~E~~~d~~~---------~~-aad~~~Iv~s~d~~A~L~~---lk~a~~~a~f~~~n~d~ 326 (400) T protein:vir:93 265 -TT-KAKSAGKTPF---ADAIEEAVDFVR---------PT-AGRRYLIVKAEDRKALLDE---LRQATANANVRIKNDDT 326 (400) T ss_pred -hh-hhhhcCCccH---HHHHHHHHhhhh---------hc-cCCceeEEeccchHHHHHH---hcCCcceeeeeeccccc Confidence 00 0001111111 111222222211 11 2234557788888776764 36777776431 Q ss_pred ---cCCCc-eeEecCCCCCC-cEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEE Q lcl|NC_019921. 294 ---LPFNL-NVIESTVQEAG-KVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) Q Consensus 294 ---l~~G~-pVv~s~~~p~~-~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~ 366 (381) .-||. .++....+|.. ..+..|-+.|+ .-+|++=.-+-+..+ ++-.+..-..+.|-+.-+++-++.++. T Consensus 327 ~IA~~fGv~~Lv~~Tr~~~~kp~V~VDek~~i--~~~~~~t~~sf~~~t--Ns~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 327 EIASEVGVDEIIVYTGSKALKPTVLVDQKYHI--DMQDLTKVDAFEWKT--NSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred hhhhhcccceeeeeccCCCCCceeeeehhhhc--cccCceeccceeeee--ccceEEeeeeeccceecccceeeEeeC Confidence 12443 23333444432 23334544444 444454444444444 444455555699999999999987776 No 125 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=98.85 E-value=3.6e-10 Score=72.36 Aligned_cols=289 Identities=11% Similarity=-0.024 Sum_probs=155.3 Q ss_pred HhhccccccCHHHHHHHHHHhhccC-CCCceecc-HHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEecCCcceEE Q lcl|NC_019921. 57 SLPKSAQSLSANQRSFFMDINKNVN-YKEEKLLP-EETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVW 132 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~-~~gg~lvP-~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~-~~~p~~~~~~~a~w 132 (381) +.-.....++ +.+.+ +++-+-++ +.+..++.+.+...+.+++++.+.++. |+ +++|+- +..++.. T Consensus 1 m~~~~~~~~t----------~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~i-G~~~~~~ 69 (334) T protein:vir:80 1 MTYPAANTHT----------RPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRV-GASTIAG 69 (334) T ss_pred CCCCcCCCcc----------ccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeee-cceeeee Confidence 1111111111 11111 22223444 999999999999999999999988864 44 777864 4455555 Q ss_pred eecccccccccCcceeeEeeccee-EEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhhee----eccCCCcceEe Q lcl|NC_019921. 133 GKIYGEIKGQLDAAFSEETAIQNK-LTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL----KGTGKDQPIGL 207 (381) Q Consensus 133 v~e~~~~~~~~~~~f~~v~l~~~k-l~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i----~G~G~~~P~Gi 207 (381) ..-+.++.. ...+-++++|.... ++....|..-=--++..|+.+.+.++++.++++..|.+++ .|.....|.+. T Consensus 70 ~~~g~~l~~-~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~ 148 (334) T protein:vir:80 70 RKAGEELVV-QKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHL 148 (334) T ss_pred ecCCCCCCC-CCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 555555443 23455677777665 3455555554445677899999999999999999998764 33333333211 Q ss_pred eeccccccccccccccceeeeeeecccccchhHHHHH----HHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhc Q lcl|NC_019921. 208 NRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELT----QVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 208 l~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~----~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~ 283 (381) -..- ..|..... ..+.+........+.+. .+...|. ....|..-...-+.+|+|..++.++...-+ T Consensus 149 ~~~~------~~G~~~~~--~~~g~~~~~~~~~~~l~~a~~~a~~~L~--e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~ 218 (334) T protein:vir:80 149 KPAF------HDGILLPS--TISGLAADAAADADVLVAAHRQGVEAMV--FRDLGDQLMSEGVTLLDPVIFSFLLEHDRL 218 (334) T ss_pred cccc------cCCcceee--cccccccchhhhHHHHHHHHHHHHHHHH--hcCCCCCcCCceEEEeChHHHHHHhccccc Confidence 0000 00000000 01111122222333333 3333332 122222212234578999998888653222 Q ss_pred cC-----CC--Cccccc---cCCCceeEecCCCCCCc-----------EEEEeecceE--EEee--------cceEEEee Q lcl|NC_019921. 284 LN-----AN--GVYVTA---LPFNLNVIESTVQEAGK-----------VLTYVKGLYD--GYLA--------GGINVQKF 332 (381) Q Consensus 284 ~~-----~~--G~~~~~---l~~G~pVv~s~~~p~~~-----------i~fgd~~~y~--i~~r--------~~i~i~~~ 332 (381) .| +. ..+.+. ...|.+|+.|..+|... +.-|||+.-. +.-+ .++..+.. T Consensus 219 ~n~d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~ 298 (334) T protein:vir:80 219 MNVEFGAKEGGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFW 298 (334) T ss_pred ccceeccccccccccceeEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeee Confidence 22 11 223332 22599999999999642 3345554422 1122 22333322 Q ss_pred h-hhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCC Q lcl|NC_019921. 333 K-ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 333 ~-~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~ 370 (381) + +.+|.. ..++++-++.++++|+|+++++|++..+ T Consensus 299 ~~~~~~~d---~i~~~~a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 299 EEKKDFGH---YLDTFQSYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred echhhHHH---HHHHHHHcCCceeccceEEEEEEeeecC Confidence 2 222322 3344556799999999999999999887 No 126 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=98.84 E-value=2e-10 Score=73.76 Aligned_cols=287 Identities=13% Similarity=0.035 Sum_probs=149.8 Q ss_pred Hh-hccccccCHHHHHHHHHHhhccCCCC-c---eeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEecCCcc Q lcl|NC_019921. 57 SL-PKSAQSLSANQRSFFMDINKNVNYKE-E---KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGV 129 (381) Q Consensus 57 ~~-~~~~~~lt~~e~~~~~~~~~~~~~~g-g---~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~-~~~p~~~~~~~ 129 (381) +. ..++... |. .+.....| | .+-=+.+..++.+.....+.+++++++.++. |+ +++|+- +..+ T Consensus 1 ma~~~~~~~~--------n~-~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~i-G~~~ 70 (344) T protein:vir:10 1 MANMTGGQQL--------GT-NQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQ 70 (344) T ss_pred CccccccccC--------Cc-ccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEee-ceeE Confidence 00 0000000 00 00000000 0 1223899999999999999999999988864 44 677865 3344 Q ss_pred eEEeecccccccc-cCcceeeEeecceeE-EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhhee----eccCCCc Q lcl|NC_019921. 130 AVWGKIYGEIKGQ-LDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL----KGTGKDQ 203 (381) Q Consensus 130 a~wv~e~~~~~~~-~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i----~G~G~~~ 203 (381) +.....+.++... .++.=.+++|...++ +....|..-=--++..|+.+.+.++.+.++++..|.+++ .+..... T Consensus 71 ~~~~~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~ 150 (344) T protein:vir:10 71 AAYLAPGENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVES 150 (344) T ss_pred EEeeecCCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Confidence 4444544444432 234446655655442 343444443334577889999999999999999998774 2222222 Q ss_pred -----ceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHh Q lcl|NC_019921. 204 -----PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQ 278 (381) Q Consensus 204 -----P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~ 278 (381) |.|.-+..... ... ...+..++..+.+.+.+.+..+......+..+..+ -+.+++|..++.+. T Consensus 151 ~~~~~~~g~~~~~~~~----------~~~-~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~g-R~~vv~P~~y~~Ll 218 (344) T protein:vir:10 151 QYNENITGLGTATVIE----------TTQ-DKTTLTDQVALGKEIIAALTKARAALTKNYVPSSD-RVFYCDPDSYSAIL 218 (344) T ss_pred ccccccccccccceee----------ccc-ccccccchhhhHHHHHHHHHHHHHHHhhcCCCccC-CEEEeChHHHHHHh Confidence 22211100000 000 00111122233333333333323222333334333 34678999888775 Q ss_pred hhhhccC----CCCccccc---cCCCceeEecCCCCCCcE---------------------EEEeecce----------E Q lcl|NC_019921. 279 AQYTHLN----ANGVYVTA---LPFNLNVIESTVQEAGKV---------------------LTYVKGLY----------D 320 (381) Q Consensus 279 ~~~~~~~----~~G~~~~~---l~~G~pVv~s~~~p~~~i---------------------~fgd~~~y----------~ 320 (381) ....+.+ +++.+.++ ...|.+|+.|+.+|.+.+ ..++|+.- . T Consensus 219 ~~~~~~~~~~~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~ 298 (344) T protein:vir:10 219 AALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVG 298 (344) T ss_pred hcccccccccccccceeeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhh Confidence 4322111 22333332 125999999999985321 11233331 1 Q ss_pred EEeecceEEEeehh-hhhhcCceEEEEEEEEcCEEecCceEEEEEEEec Q lcl|NC_019921. 321 GYLAGGINVQKFKE-TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 321 i~~r~~i~i~~~~~-~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~ 368 (381) .+.-.+++++..++ .+|. ..+++++-++.++++|++.++++++-+ T Consensus 299 ~v~~~~~~~e~~r~~~~~~---d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 299 TVKLRDLALERARRANFQA---DQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred hhhhccceeecccchhHHH---HHHHHHhhcccceecccceEEEEeecC Confidence 22234456655443 3333 367788899999999999998888876 No 127 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=98.83 E-value=1.3e-09 Score=69.29 Aligned_cols=254 Identities=10% Similarity=0.022 Sum_probs=142.1 Q ss_pred CCCceeccHHHHHHHHHHHHhhhhhhhhceeE----ecCC-ceEEEEecCCcceEEeecccccccccCcceeeEeeccee Q lcl|NC_019921. 82 YKEEKLLPEETIDRIFEDLTTNHPLLADLGIK----NAGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNK 156 (381) Q Consensus 82 ~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~----~~~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~k 156 (381) =.--.++|+.+...+.+.+...+.+.++++.- ...| .+++|+....+.+....+++.+.. .+.+-..+++...+ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCc-cccccceEEEEEee Confidence 11124689999999999999988887776431 1123 578888666555555555444332 23344556665544 Q ss_pred E-EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccc Q lcl|NC_019921. 157 L-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFAN 235 (381) Q Consensus 157 l-~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~ 235 (381) . +.-+.|+..-...+..+++++ .+..+++++.+.|..++. .+... ...... .+..+ T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~alA~~vD~~i~~---------~~~~a--~~~~~~-----------~~~~~ 136 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIAD---------MLVDN--GTALTG-----------SAPTD 136 (273) T ss_pred eeecceEeecHHHhhhhccHHHH-HHHHHHHHHHHHHHHHHH---------HHhcc--cccccc-----------ccccc Confidence 3 444567765555566789885 556789999999876642 11000 000000 00112 Q ss_pred cchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh-hccC-----CCCcccc---ccCCCceeEecCCC Q lcl|NC_019921. 236 PRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY-THLN-----ANGVYVT---ALPFNLNVIESTVQ 306 (381) Q Consensus 236 ~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~-~~~~-----~~G~~~~---~l~~G~pVv~s~~~ 306 (381) +....+.+.++...|. ....+- .+-+++++|..+..++... .+.+ .++.+.. +..+|.+|+.|..+ T Consensus 137 ~~~~~~~i~~a~~~ld----~~~vP~-~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~l 211 (273) T protein:vir:10 137 ADDAFDLIAKALKELT----KANVPN-VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL 211 (273) T ss_pred hhHHHHHHHHHHHHhh----hcCCCc-CCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEeccc Confidence 2233455555444442 112222 3456789999988776421 1222 1122211 12369999999999 Q ss_pred CCCc---EEEEeecceEEEeec-ceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCC Q lcl|NC_019921. 307 EAGK---VLTYVKGLYDGYLAG-GINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 307 p~~~---i~fgd~~~y~i~~r~-~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~ 370 (381) |.+. ++.+.-+......+. .++..+.... | .+.+++.+++++++++|+++|+ |+-+++ T Consensus 212 p~~~~~~~~~~~~~A~~~a~q~~~~e~~r~~~~-~---~~~v~~~~~yg~~v~~~~~~~~--l~~~g~ 273 (273) T protein:vir:10 212 RDTDDEQFVAFHPSAAAYVSQIDTVEALRDQDS-F---SDRIRALHVYGGKVVRPTGVVV--FNKTGS 273 (273) T ss_pred ccCCccEEEEEeccceeeeeeeehhhcccCCCc-c---eeeeeeeeeeeeeEeccceEEE--EeccCC Confidence 9643 445544433332221 2222232222 2 4679999999999999999876 455444 No 128 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=98.83 E-value=1.3e-09 Score=69.29 Aligned_cols=254 Identities=10% Similarity=0.022 Sum_probs=142.1 Q ss_pred CCCceeccHHHHHHHHHHHHhhhhhhhhceeE----ecCC-ceEEEEecCCcceEEeecccccccccCcceeeEeeccee Q lcl|NC_019921. 82 YKEEKLLPEETIDRIFEDLTTNHPLLADLGIK----NAGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNK 156 (381) Q Consensus 82 ~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~----~~~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~k 156 (381) =.--.++|+.+...+.+.+...+.+.++++.- ...| .+++|+....+.+....+++.+.. .+.+-..+++...+ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCc-cccccceEEEEEee Confidence 11124689999999999999988887776431 1123 578888666555555555444332 23344556665544 Q ss_pred E-EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccc Q lcl|NC_019921. 157 L-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFAN 235 (381) Q Consensus 157 l-~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~ 235 (381) . +.-+.|+..-...+..+++++ .+..+++++.+.|..++. .+... ...... .+..+ T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~alA~~vD~~i~~---------~~~~a--~~~~~~-----------~~~~~ 136 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIAD---------MLVDN--GTALTG-----------SAPTD 136 (273) T ss_pred eeecceEeecHHHhhhhccHHHH-HHHHHHHHHHHHHHHHHH---------HHhcc--cccccc-----------ccccc Confidence 3 444567765555566789885 556789999999876642 11000 000000 00112 Q ss_pred cchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh-hccC-----CCCcccc---ccCCCceeEecCCC Q lcl|NC_019921. 236 PRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY-THLN-----ANGVYVT---ALPFNLNVIESTVQ 306 (381) Q Consensus 236 ~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~-~~~~-----~~G~~~~---~l~~G~pVv~s~~~ 306 (381) +....+.+.++...|. ....+- .+-+++++|..+..++... .+.+ .++.+.. +..+|.+|+.|..+ T Consensus 137 ~~~~~~~i~~a~~~ld----~~~vP~-~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~l 211 (273) T protein:vir:10 137 ADDAFDLIAKALKELT----KANVPN-VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL 211 (273) T ss_pred hhHHHHHHHHHHHHhh----hcCCCc-CCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEeccc Confidence 2233455555444442 112222 3456789999988776421 1222 1122211 12369999999999 Q ss_pred CCCc---EEEEeecceEEEeec-ceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCC Q lcl|NC_019921. 307 EAGK---VLTYVKGLYDGYLAG-GINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 307 p~~~---i~fgd~~~y~i~~r~-~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~ 370 (381) |.+. ++.+.-+......+. .++..+.... | .+.+++.+++++++++|+++|+ |+-+++ T Consensus 212 p~~~~~~~~~~~~~A~~~a~q~~~~e~~r~~~~-~---~~~v~~~~~yg~~v~~~~~~~~--l~~~g~ 273 (273) T protein:vir:10 212 RDTDDEQFVAFHPSAAAYVSQIDTVEALRDQDS-F---SDRIRALHVYGGKVVRPTGVVV--FNKTGS 273 (273) T ss_pred ccCCccEEEEEeccceeeeeeeehhhcccCCCc-c---eeeeeeeeeeeeeEeccceEEE--EeccCC Confidence 9643 445544433332221 2222232222 2 4679999999999999999876 455444 No 129 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.81 E-value=4.4e-10 Score=71.86 Aligned_cols=291 Identities=14% Similarity=0.039 Sum_probs=148.4 Q ss_pred HHhhccccccCHHHHHHHHHHhhccC-CCCc---eeccHHHHHHHHHHHHhhhhhhhhceeEec-CCc-eEEEEecCCcc Q lcl|NC_019921. 56 SSLPKSAQSLSANQRSFFMDINKNVN-YKEE---KLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLR-LKFLKSETSGV 129 (381) Q Consensus 56 ~~~~~~~~~lt~~e~~~~~~~~~~~~-~~gg---~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~-~g~-~~~p~~~~~~~ 129 (381) +....+++.+. .+.+.+ +.|. ..| +.+..++.+.....+.+++++++.+. +|+ +++|+-.... T Consensus 1 ~~~~~~~~~~~---------t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~~t- 69 (347) T protein:vir:33 1 MANIQGGQQIG---------TNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTK- 69 (347) T ss_pred CCCCccCcccc---------cccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccccccccceeEeeecccee- Confidence 11111111110 011111 1221 134 89999999999999999999988764 454 6677654433 Q ss_pred eEEeeccccccc-ccCcceeeEeecceeE-EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhhee-----eccCCC Q lcl|NC_019921. 130 AVWGKIYGEIKG-QLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL-----KGTGKD 202 (381) Q Consensus 130 a~wv~e~~~~~~-~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i-----~G~G~~ 202 (381) +.....+.++.. ..++...+.+|...++ +....|..-=--++..|+.+.+.++.+.++++..|..|+ .+.... T Consensus 70 ~~~~~~g~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~ 149 (347) T protein:vir:33 70 AAYLKPGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPD 149 (347) T ss_pred eeeecCCCCCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Confidence 333333333322 2234556766654433 333344443334567789999999999999999999886 223322 Q ss_pred cceEeeeccccccccccccccceeeeeeeccccc----chhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHh Q lcl|NC_019921. 203 QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANP----RATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQ 278 (381) Q Consensus 203 ~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~----~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~ 278 (381) .|.+......... +.. ....++.+..++ ...++.+.++...| +.+..+-.+ -+.+++|..++.++ T Consensus 150 ~~~~~~~~~~~~~----~~~--~~~~~tg~~~d~~~~a~~i~~~i~~a~~~L----de~~VP~~g-R~~vv~P~~y~~Ll 218 (347) T protein:vir:33 150 GSNENIEGLGKPT----VLT--LVKPTTGSLTDPVELGKAIIAQLTIARASL----TKNYVPAAD-RTFYTTPDNYSAIL 218 (347) T ss_pred ccccccccccccc----ccc--ccccccccccchhhhHHHHHHHHHHHHHHH----hhcCCCccC-cEEEeCHHHHHHHh Confidence 3322211110000 000 000111111222 22333333333333 223333333 45788999888776 Q ss_pred hhhhc--cC--CCCccccc---cCCCceeEecCCCCCCcE-------EE---------------Eeecc----------e Q lcl|NC_019921. 279 AQYTH--LN--ANGVYVTA---LPFNLNVIESTVQEAGKV-------LT---------------YVKGL----------Y 319 (381) Q Consensus 279 ~~~~~--~~--~~G~~~~~---l~~G~pVv~s~~~p~~~i-------~f---------------gd~~~----------y 319 (381) ....+ .+ +++.+.++ ...|.+|+.|+.+|.+.+ .. ++|+. . T Consensus 219 ~~~~~~~~d~~~~~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~ 298 (347) T protein:vir:33 219 AALMPNAANYQALLDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAV 298 (347) T ss_pred ccccccccccccccccccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhh Confidence 43221 11 12233332 236999999999986422 11 11111 1 Q ss_pred EEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCC Q lcl|NC_019921. 320 DGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 320 ~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~ 370 (381) ..+.-.++++++..+... -...+++.+.++.++++|++.|.+.++-..- T Consensus 299 g~v~~~~~~~e~~r~~~~--~~d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 299 GTVKLKDLALERARRANY--QADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred eeeeeeceeeeeccchhh--hhHhhhhhhhcCCceecccceEEEecCCCCC Confidence 122334445554443322 2357889999999999999988765544322 No 130 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=98.78 E-value=2.7e-09 Score=67.53 Aligned_cols=252 Identities=10% Similarity=0.026 Sum_probs=143.6 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeE----ecCC-ceEEEEecCCcceEEeecccccccccCcceeeE Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK----NAGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~----~~~g-~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v 150 (381) |. . -.++|+.++..+.+.+.....+.++++.- ...| .+++|+....+.+....++..+.. .+.....+ T Consensus 1 MA--~----~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~-~~~~~~~~ 73 (273) T protein:vir:79 1 MA--F----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGV 73 (273) T ss_pred Cc--c----hhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCc-cccccceE Confidence 11 1 13689999999999999988777776432 1223 588898766555555555554432 34555677 Q ss_pred eecceeE-EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheee---ccCCCcceEeeecccccccccccccccee Q lcl|NC_019921. 151 TAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK---GTGKDQPIGLNRQVQKGVSVTEGAYPEKE 226 (381) Q Consensus 151 ~l~~~kl-~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~---G~G~~~P~Gil~~~~~~~~~~~~~~~~~~ 226 (381) ++...+. +.-+.|+..-...+..|+++++ +..+.+++++.|..++. +.++..+ . T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~~~~~~~~~-~~~~~ala~~vD~~i~~~~~~a~~~~~--------------~------- 131 (273) T protein:vir:79 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEAYT-RAGATALATDTDKFIADMLVDNGTALT--------------G------- 131 (273) T ss_pred EEEEeeecccceeeccHHHHhhcccHHHHH-HHHHHHHHHHHHHHHHHHHhhcccccc--------------c------- Confidence 7776553 4456777755555677898854 56789999999876542 1111100 0 Q ss_pred eeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhh-ccC-----CCCcccc---ccCCC Q lcl|NC_019921. 227 EQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT-HLN-----ANGVYVT---ALPFN 297 (381) Q Consensus 227 ~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~-~~~-----~~G~~~~---~l~~G 297 (381) .+..++....+.+.++...|. ....+- .+-+++++|..+..++.... +.+ .++.+.. +..+| T Consensus 132 ----~~~~~~~~~~~~i~~a~~~ld----~~~vP~-~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G 202 (273) T protein:vir:79 132 ----SAPSDADDAFDLIASALKELT----KANVPN-VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLG 202 (273) T ss_pred ----ccccchhhHHHHHHHHHHHhh----hccCCc-cCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEec Confidence 001112223444554444332 112222 23467899998887754211 111 1111211 12369 Q ss_pred ceeEecCCCCCCc---EEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCC Q lcl|NC_019921. 298 LNVIESTVQEAGK---VLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 298 ~pVv~s~~~p~~~---i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~ 370 (381) .+|+.|..+|.+. ++.|.-+......+ ...++..++- ..-.+.+++.+++++++++|+++|+ |+-+++ T Consensus 203 ~~i~~s~~lp~~~~~~~~a~~~~A~~~a~~-~~~~e~~r~~--~~~~~~v~~~~~yg~~v~~p~~vv~--~~~~g~ 273 (273) T protein:vir:79 203 ARIVESNNLRDTDDEQFVAFHPSAAAYVSQ-IDTVEALRDQ--DSFSDRIRALHVYGGKVVRPTGVVV--FNKTGS 273 (273) T ss_pred eEEEecccccccCceEEEEEeccceeeeee-hhhhhcccCc--ccceeeeeeeeeeeeEEecCceEEE--EeccCC Confidence 9999999999653 33443333333222 2233322211 1225679999999999999999876 455444 No 131 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=98.75 E-value=1.2e-09 Score=69.45 Aligned_cols=285 Identities=13% Similarity=0.017 Sum_probs=152.4 Q ss_pred HHhhccccccCHHHHHHHHHHhhccC-CCCce--eccHHHHHHHHHHHHhhhhhhhhceeEec-CCc-eEEEEecCCcce Q lcl|NC_019921. 56 SSLPKSAQSLSANQRSFFMDINKNVN-YKEEK--LLPEETIDRIFEDLTTNHPLLADLGIKNA-GLR-LKFLKSETSGVA 130 (381) Q Consensus 56 ~~~~~~~~~lt~~e~~~~~~~~~~~~-~~gg~--lvP~~~~~~I~~~l~~~~~l~~~~~v~~~-~g~-~~~p~~~~~~~a 130 (381) +....++..+. .+.+.+ ++|.. +-=+.+..++.+.+...+.+++++++.++ +|+ +++|+-.. .++ T Consensus 1 ma~~~~~~~~~---------t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG~-~~~ 70 (347) T protein:vir:94 1 MANMNGGQQMG---------KDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLGR-TKA 70 (347) T ss_pred CCccccccccc---------cccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeeccc-eeE Confidence 11111111110 011111 22221 23389999999999999999999988775 454 67775433 444 Q ss_pred EEeecccccccc-cCcceeeEeecceeE-EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheee----ccCCC-- Q lcl|NC_019921. 131 VWGKIYGEIKGQ-LDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTGKD-- 202 (381) Q Consensus 131 ~wv~e~~~~~~~-~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~----G~G~~-- 202 (381) .....+.++... .+++.++++|....+ +....|..-=--++.+|+.+.+.++.+.++++..|.+|+. +.... T Consensus 71 ~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~ 150 (347) T protein:vir:94 71 AYLQPGENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTA 150 (347) T ss_pred eeeecCcCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 445554444332 356778777776555 4445555444445777899999999999999999988752 21111 Q ss_pred -------cceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHH Q lcl|NC_019921. 203 -------QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAF 275 (381) Q Consensus 203 -------~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~ 275 (381) .|.|..-.+..... ........+...++.+.++...|. .+..+.. .-+.+++|..++ T Consensus 151 ~~~~~~g~~~~~~v~i~~~~~-----------~~~~~~~~~~~~~d~i~~a~~~Ld----e~dVP~~-~R~~vv~P~~y~ 214 (347) T protein:vir:94 151 NNENIAGLGKAHVLEVGDQAT-----------LQGDQVKLGQAIIAQLTLARAKLT----GNYVPSS-DRVFYTTPDNYS 214 (347) T ss_pred cccccccCCcceeEeeecccc-----------ccccccccHHHHHHHHHHHHHHhh----hcCCCCC-CCEEEeChHHHH Confidence 11111000000000 000011223344455554444432 2222222 344566888888 Q ss_pred HHhhhhhccCCCC----cccc---ccCCCceeEecCCCCCCcE-------------------------EEEeecc--eEE Q lcl|NC_019921. 276 EVQAQYTHLNANG----VYVT---ALPFNLNVIESTVQEAGKV-------------------------LTYVKGL--YDG 321 (381) Q Consensus 276 ~~~~~~~~~~~~G----~~~~---~l~~G~pVv~s~~~p~~~i-------------------------~fgd~~~--y~i 321 (381) .+++.......+. .+.+ ....|.+|+.|+++|.+.+ +=+||+. .++ T Consensus 215 ~LLk~~~~~~~~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~ 294 (347) T protein:vir:94 215 AILAALMPNAANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLF 294 (347) T ss_pred HHHHhhcccccccccccccccceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEE Confidence 8875322111111 1111 1225899999999995321 0122222 122 Q ss_pred --------EeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEec Q lcl|NC_019921. 322 --------YLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 322 --------~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~ 368 (381) +.-.++++++.++..... ..++++.-++..+++|++.+++.++-+ T Consensus 295 ~~~~A~~tv~~~~~~~e~~~~~~~~~--~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 295 NHRSAVGTVKLKDMALERARRANFQA--DQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred echhhhhhhhhcccceeeeechhhhh--hhhhhhhhhcCcccccceeEEEEecCC Confidence 223444555544433322 367788888999999999999888776 No 132 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=98.69 E-value=1.2e-09 Score=69.49 Aligned_cols=292 Identities=12% Similarity=0.013 Sum_probs=152.1 Q ss_pred HHhhccccccCHHHHHHHHHHhhccC-CCCc--eeccHHHHHHHHHHHHhhhhhhhhceeEec-CCc-eEEEEecCCcce Q lcl|NC_019921. 56 SSLPKSAQSLSANQRSFFMDINKNVN-YKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLR-LKFLKSETSGVA 130 (381) Q Consensus 56 ~~~~~~~~~lt~~e~~~~~~~~~~~~-~~gg--~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~-~g~-~~~p~~~~~~~a 130 (381) +....++..+ ..+.+.+ +++. .+-=+.+..++.......+.+++++++.++ +|+ +++|+-... ++ T Consensus 1 ~a~~~~~~~~---------~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~~-~~ 70 (347) T protein:vir:88 1 MANATGGQQI---------GANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRT-KG 70 (347) T ss_pred CCCcccchhh---------hccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecce-ee Confidence 1111111110 0111222 2222 123389999999999988989999988774 454 677764443 33 Q ss_pred EEeecccccccc-cCcceeeEeecceeE-EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheee----ccCCCcc Q lcl|NC_019921. 131 VWGKIYGEIKGQ-LDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTGKDQP 204 (381) Q Consensus 131 ~wv~e~~~~~~~-~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~----G~G~~~P 204 (381) .....+.++... .++..++++|...++ +....|..-=--++..|+.+.+.++.+.++++..|.+++. +.....+ T Consensus 71 ~~~~~g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~ 150 (347) T protein:vir:88 71 YYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA 150 (347) T ss_pred eeeccccCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 433443333322 346677888877665 4555666555556677899999999999999999998752 2111100 Q ss_pred e-EeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhc Q lcl|NC_019921. 205 I-GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 205 ~-Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~ 283 (381) . +.......+.....++. ...+.........++.+.++...|. ....+-. .-+++++|..+..++..... T Consensus 151 ~~~~~~g~~~~~~~~~~~~----~~~~~~~~~~~~~~~~i~~a~~~Ld----e~~VP~~-gR~~vv~P~~y~~Ll~~~~~ 221 (347) T protein:vir:88 151 SNENIAGLGQAVVLNIGAA----ADLVDVEARGKAILKGLTLARARLT----KNYVPAG-DRRFYCAPEDYSAILSALMP 221 (347) T ss_pred cccccCCcccccccccccc----ccccchhhhHHHHHHHHHHHHHHHh----hcCCCCC-CCEEEeCHHHHHHHhcchhh Confidence 0 11111111100000000 0000001112223444444443332 2222222 34578899888777542211 Q ss_pred ----cCCCCccccc---cCCCceeEecCCCCCCc---E----------------------EEEeecc--eEEEe------ Q lcl|NC_019921. 284 ----LNANGVYVTA---LPFNLNVIESTVQEAGK---V----------------------LTYVKGL--YDGYL------ 323 (381) Q Consensus 284 ----~~~~G~~~~~---l~~G~pVv~s~~~p~~~---i----------------------~fgd~~~--y~i~~------ 323 (381) .++.+.+.++ ...|.+|+.++++|.+. . +.+|++. .++.- T Consensus 222 ~~~~~~~~~~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~ 301 (347) T protein:vir:88 222 NAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGT 301 (347) T ss_pred hhhhhccccchhcceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhh Confidence 1122233222 22589999999998421 1 1223433 12211 Q ss_pred --ecceEEEeehhh-hhhcCceEEEEEEEEcCEEecCceEEEEEEEecC Q lcl|NC_019921. 324 --AGGINVQKFKET-LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 324 --r~~i~i~~~~~~-~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~ 369 (381) -.++.++..++. +|. ..+++++.++.++++|++.++++++-++ T Consensus 302 v~~~d~~~e~~r~~~~~~---d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 302 VKLKDMALERARRPEFQA---DQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred eecccceeeeeechhhHH---HHhhhhhhhcCceeccceEEEEEeCCCC Confidence 223344433322 232 4789999999999999999998888777 No 133 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.65 E-value=3.8e-09 Score=66.75 Aligned_cols=298 Identities=12% Similarity=0.044 Sum_probs=147.1 Q ss_pred HHhhccccccCHHHHHHHHHHhhcc-CCCCc--eeccHHHHHHHHHHHHhhhhhhhhceeEec-CCc-eEEEEecCCcce Q lcl|NC_019921. 56 SSLPKSAQSLSANQRSFFMDINKNV-NYKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLR-LKFLKSETSGVA 130 (381) Q Consensus 56 ~~~~~~~~~lt~~e~~~~~~~~~~~-~~~gg--~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~-~g~-~~~p~~~~~~~a 130 (381) +..-.++..+.. +.+. ++.+. .+-=+.+..++.......|.+++++++.++ +|+ +++|+-.. .++ T Consensus 1 ma~~~~~~~~~t---------~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~-~t~ 70 (347) T protein:vir:15 1 MANIQGGQQIGT---------NQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGR-TKA 70 (347) T ss_pred CCccccCCcccc---------ccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccc-eee Confidence 000001110000 0011 11111 123378899999999999989999988775 454 67776554 334 Q ss_pred EEeeccccccc-ccCcceeeEeecceeE-EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheee---ccCCCcce Q lcl|NC_019921. 131 VWGKIYGEIKG-QLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK---GTGKDQPI 205 (381) Q Consensus 131 ~wv~e~~~~~~-~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~---G~G~~~P~ 205 (381) .-...+.++.. ..+.+..+++|...++ +.-..|..-=-.++..|+.+.+.++.+.++++..|..|+. +-....| T Consensus 71 ~~~~~g~~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~- 149 (347) T protein:vir:15 71 AYLKPGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPD- 149 (347) T ss_pred eeeccCCCCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc- Confidence 33343333322 2234556766654433 3333443333335677899999999999999999988862 1101111 Q ss_pred EeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC Q lcl|NC_019921. 206 GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN 285 (381) Q Consensus 206 Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~ 285 (381) ....... .+-+..........+....++....+.+.+.+.......+.+..+..+ -+.+++|..++.++......+ T Consensus 150 ~~~~~~~---~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~g-R~~vv~P~~y~~LL~~~~~~~ 225 (347) T protein:vir:15 150 ASNENIE---GLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAAD-RTFYTTPDNYSAILAALMPNA 225 (347) T ss_pred ccccccc---ccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccC-CEEEeCHHHHHHHhccccccc Confidence 0000000 000000000000111122233333444444443333222333333333 446789998888765332221 Q ss_pred ----CCCccccc---cCCCceeEecCCCCCCcE----------------------EEEeecc----------eEEEeecc Q lcl|NC_019921. 286 ----ANGVYVTA---LPFNLNVIESTVQEAGKV----------------------LTYVKGL----------YDGYLAGG 326 (381) Q Consensus 286 ----~~G~~~~~---l~~G~pVv~s~~~p~~~i----------------------~fgd~~~----------y~i~~r~~ 326 (381) +.+.+.++ ..+|.+|+.|+.+|.+.+ +-++|+. ...+..++ T Consensus 226 ~d~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~ 305 (347) T protein:vir:15 226 ANYQALIDHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKD 305 (347) T ss_pred ccccccccccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeec Confidence 11222222 236999999999984321 0111111 11233344 Q ss_pred eEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCC Q lcl|NC_019921. 327 INVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 327 i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~ 370 (381) ++++++.+... -...+++...++.++++|++.|.+.++-..- T Consensus 306 ~~~e~~~~~~~--~~d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 306 LALERARRANY--QADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred eeeeecccchh--hhhhhehhhhcCCceeccccEEEEecCCCCC Confidence 55655543322 2367888889999999999988765544322 No 134 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.64 E-value=9.6e-10 Score=70.03 Aligned_cols=276 Identities=13% Similarity=0.066 Sum_probs=145.0 Q ss_pred HHHHHHHhhc-------cCCCCc---eeccHHHHHHHHHHHHhhhhhhhhceeEec-CCc-eEEEEecCCcceEEeeccc Q lcl|NC_019921. 70 RSFFMDINKN-------VNYKEE---KLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLR-LKFLKSETSGVAVWGKIYG 137 (381) Q Consensus 70 ~~~~~~~~~~-------~~~~gg---~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~-~g~-~~~p~~~~~~~a~wv~e~~ 137 (381) .-+++.++.- +++++. .+.=+.+..++.+.....|.+++++++.++ +|+ +++|+-.... +.-...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig~~~-~~~~~~g~ 79 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLS-AGYHTPGT 79 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEecccee-EeeecCCC Confidence 1223333211 122222 133389999999999999999999888775 344 7777764433 33333333 Q ss_pred ccccccCcceeeEeeccee-EEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheee----ccCCCcceEeeeccc Q lcl|NC_019921. 138 EIKGQLDAAFSEETAIQNK-LTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTGKDQPIGLNRQVQ 212 (381) Q Consensus 138 ~~~~~~~~~f~~v~l~~~k-l~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~----G~G~~~P~Gil~~~~ 212 (381) ++....+++=.+++|...+ ++....|..-=-.++..|+.+.+.++.+.++++..|..++. +...+-|.+..- T Consensus 80 ~l~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~--- 156 (332) T protein:vir:78 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP--- 156 (332) T ss_pred CCCCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCcccccc--- Confidence 3322222333444444433 23333343222225667899999999999999999987752 222221111100 Q ss_pred cccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhh-------ccC Q lcl|NC_019921. 213 KGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT-------HLN 285 (381) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~-------~~~ 285 (381) ++.. ...+.....++...++.+.++...|. .+..+..+ -+.+++|..++.+++..+ ..+ T Consensus 157 g~~~---------~~~~~~~~~~~~~~~~~i~~a~~~Ld----e~~VP~~g-R~~vv~P~~y~~Ll~~~d~~~~n~~~~~ 222 (332) T protein:vir:78 157 GGFH---------VNIGAGNTNDAQAIVDGFFEAAAVLD----ERSAPQEG-RVAVLSPRQYYSLISSVDTNILNREIGN 222 (332) T ss_pred cccc---------cccCCccccCHHHHHHHHHHHHHHHh----hcCCCccC-CEEEeCHHHHHHHHhhcCceeeeeeccc Confidence 0000 00000111234445555655555443 22333333 345679999888865211 123 Q ss_pred CCCccccc----cCCCceeEecCCCCCCc--------------EEEEeecc----------eEEEeecceEEEeeh---- Q lcl|NC_019921. 286 ANGVYVTA----LPFNLNVIESTVQEAGK--------------VLTYVKGL----------YDGYLAGGINVQKFK---- 333 (381) Q Consensus 286 ~~G~~~~~----l~~G~pVv~s~~~p~~~--------------i~fgd~~~----------y~i~~r~~i~i~~~~---- 333 (381) .+|....+ ...|.+|+.|+.+|... .+-|+|+. .......++.+++.. T Consensus 223 ~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~ 302 (332) T protein:vir:78 223 SQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFN 302 (332) T ss_pred cccceecceeeeEEeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccc Confidence 34433322 23599999999999532 12233433 212222334443222 Q ss_pred hhhhhcCceEEEEEEEEcCEEecCceEEEEEEEec Q lcl|NC_019921. 334 ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 334 ~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~ 368 (381) +.+| ...+++.+.++.++++|++.++++ -+ T Consensus 303 ~~~~---~d~i~~~~~~G~~v~rPe~~v~l~--~a 332 (332) T protein:vir:78 303 VQYQ---GDLIVGKLAMGCGSLRTSVAGSFQ--AA 332 (332) T ss_pred hhhh---HhhhhhhhhhcCceecccceEEEe--eC Confidence 2223 357889999999999999988754 33 No 135 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=98.55 E-value=8.4e-08 Score=59.38 Aligned_cols=324 Identities=13% Similarity=0.104 Sum_probs=155.3 Q ss_pred CchhHHHHHHHHHHHH--HHHHhhhhhHHHHHHHHHHHH----HHHHHH---HHHHHH--HHHHHHHHhhccccccCHHH Q lcl|NC_019921. 1 MTINLSETFANAKNEF--INAVNNGEPQERQNELYGDMI----NQLFEE---TKLQAK--AEAERVSSLPKSAQSLSANQ 69 (381) Q Consensus 1 mt~el~~~~~~~~~~~--~~~~k~~~~~~~~~~~~~~~~----~~~~~~---~~~~~~--~~~~~~~~~~~~~~~lt~~e 69 (381) |.-|.+.+.+...+.. ...+... +.+...+...+ ....+. ..-|.| .|+..+......+ ...- T Consensus 47 ~~~e~~~~~e~~en~~e~~~~~~~~---~~E~Rs~~~~i~~~~~~~r~~p~~~~veyRSaGE~lkal~~~~~G---d~~A 120 (410) T protein:vir:83 47 MVAECRGRMEQIKNQMEQAQEVNRI---AFETRSKGQAVDAAISAMRGSPVGTEVEYRSAGEYMLDMWNSAQG---NASA 120 (410) T ss_pred ccccccCcccchhhhhHHHHHHHHH---HHHHHHHHHHHHhhhccCcCCCCCCCcccccHHHHHHHHhccCCc---hHHH Confidence 2222111111111100 0000000 00000111100 000000 000111 1111111111111 1111 Q ss_pred HHHHHH---H-hhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceE-Ee------eccc Q lcl|NC_019921. 70 RSFFMD---I-NKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAV-WG------KIYG 137 (381) Q Consensus 70 ~~~~~~---~-~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g-~~~~p~~~~~~~a~-wv------~e~~ 137 (381) .+.++. . ..+...+--..||+++....++.+.+..|+.++....|..| .+..|+.+..++.+ .+ .|++ T Consensus 121 ~~~~e~~r~a~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd 200 (410) T protein:vir:83 121 ADRLEVYARAADHQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQGVAGGASDEKT 200 (410) T ss_pred HHHHHHHHHhhccCcccccccccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeecccccccccccccccccccc Confidence 121222 1 12222222346788899999999999999999977777665 46777765544321 11 1333 Q ss_pred ccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhh---eeeccCCCcceEeeeccccc Q lcl|NC_019921. 138 EIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETA---FLKGTGKDQPIGLNRQVQKG 214 (381) Q Consensus 138 ~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a---~i~G~G~~~P~Gil~~~~~~ 214 (381) +.+ -...+|+..+-..+.++++..+|++-++-|...+.+..-+.+..+.+++-+.+ +|.++ .+. T Consensus 201 ~L~-~gKl~~~t~tA~ikTyGGyt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t--------~t~---- 267 (410) T protein:vir:83 201 ELD-SQKMVIDRLTVNAKTLGGYVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALAST--------STG---- 267 (410) T ss_pred ccc-ccceeeeeccceeehhcCcccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHh--------hhh---- Confidence 332 23455556666788999999999999999999999999999977777766543 33322 000 Q ss_pred cccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC-----CCC- Q lcl|NC_019921. 215 VSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN-----ANG- 288 (381) Q Consensus 215 ~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~-----~~G- 288 (381) .. ..+..+++.++..+.+.. .+ .+...+ -.+--...++|..+-...+.....+ ..| T Consensus 268 ~~-------------a~~~~Tad~~~~~i~da~-~~-v~da~~---~~~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~Gf 329 (410) T protein:vir:83 268 AV-------------GYGNATADNVASAIWQAA-GA-VYTAVK---GMGRLVIAIAPDVLGDFGPLFAPVNPTNAHSTGF 329 (410) T ss_pred hh-------------hhhhccHHHHHHHHHHHH-HH-Hhhhhc---cceeeeEEechhhhhhccceeeccCCCCcccccc Confidence 00 001112233333332211 11 111100 0111124566665433322111111 112 Q ss_pred ------ccccccCCCceeEecCCCCCCcEEEEeecceEEEeecc--eEEEeehhhhhhcCceEEEEEEEEcCEEecCceE Q lcl|NC_019921. 289 ------VYVTALPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGG--INVQKFKETLALDDMDLYTAKQFAYGKAKDNKVA 360 (381) Q Consensus 289 ------~~~~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~--i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Af 360 (381) .-+.+...++||+..+..++|++.|.|.+...+...++ +.+...+-+...++-.+|-++ .++++++. T Consensus 330 g~~~lg~gi~G~~~~ipVvm~~~a~AgTA~f~~~~Ai~~~eS~~gp~qL~d~~i~nLt~~ySgY~a~-----a~~~~~gl 404 (410) T protein:vir:83 330 EAGRFGQGVMGSISGIPVVMSAALGSGDAYLFSTAAIECFEQRVGTLQVVEPSVFGLQVAYAGYFST-----LVVNEDAI 404 (410) T ss_pred cccccccchhhhhcccceEEecCCCcCeeeEeccceeeeeecCCceeEeeCCchhhhhhhheeeeee-----ccccccce Confidence 22334456899999999999999999999988888775 777766655555554454433 34555564 Q ss_pred EEEEEEecCC Q lcl|NC_019921. 361 AVWKLDLKGH 370 (381) Q Consensus 361 vv~~~~~~~~ 370 (381) + -++++ T Consensus 405 i----Pv~g~ 410 (410) T protein:vir:83 405 V----PLVGS 410 (410) T ss_pred e----eeccC Confidence 3 22233 No 136 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.55 E-value=2.9e-08 Score=61.93 Aligned_cols=237 Identities=15% Similarity=0.200 Sum_probs=141.5 Q ss_pred HhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-C-ceEEEEecCCcceEEee Q lcl|NC_019921. 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-L-RLKFLKSETSGVAVWGK 134 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g-~~~~p~~~~~~~a~wv~ 134 (381) +...+...+|--|- . ..+-|......|+|.+.+.+||+..+.+.... + .....+.++-|+++|.. T Consensus 1 m~~~~~~~~TL~e~------A-------kr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~fR~ 67 (328) T protein:vir:95 1 MAVKGLTALTLADW------G-------KRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSATWRL 67 (328) T ss_pred CCccccccccHHHH------H-------hhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCceeee Confidence 11111112222210 0 01336678889999999999999999988763 3 46788899999999999 Q ss_pred cccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHH---HHHHHHHHHHHhhheeeccCCCcceEe---e Q lcl|NC_019921. 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVR---VQIEEAFAVALETAFLKGTGKDQPIGL---N 208 (381) Q Consensus 135 e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~---~~la~~~~~~~~~a~i~G~G~~~P~Gi---l 208 (381) -+...+ ++.+++.+++-..+-+.+.+.|.+.+.+... +..+|-. ....++++......||+||.+..|.++ - T Consensus 68 lN~g~~-~s~~tt~q~t~~l~ilgg~~eVDr~la~~~G-n~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~ 145 (328) T protein:vir:95 68 LNYGVQ-PSKSTTVQVTDSVGMLETYAEVDKSLADLNG-NTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLS 145 (328) T ss_pred cCCccC-cccceeEEEEEEEEEEecceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchh Confidence 988876 5788999999999999999999999998764 4555544 458899999999999999988777654 2 Q ss_pred eccc---cc-------ccccc-----------------cccccee-------eeeeecc--------------------- Q lcl|NC_019921. 209 RQVQ---KG-------VSVTE-----------------GAYPEKE-------EQGTLTF--------------------- 233 (381) Q Consensus 209 ~~~~---~~-------~~~~~-----------------~~~~~~~-------~~~~~t~--------------------- 233 (381) +... .. ...++ |.||... ..|..+. T Consensus 146 ~R~~~~s~~~a~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl 225 (328) T protein:vir:95 146 SRYSSLSAGNAQNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGL 225 (328) T ss_pred hhcCccccccccceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeee Confidence 2111 00 00011 1111110 0000000 Q ss_pred -------------c-----ccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC--------CC Q lcl|NC_019921. 234 -------------A-----NPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN--------AN 287 (381) Q Consensus 234 -------------~-----~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~--------~~ 287 (381) . ..+.....+.++ +.......|...+++.+|+||+.-...+++....+. -. T Consensus 226 ~i~d~r~vvrI~NId~~~l~~~~~~~~l~~l---m~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~ 302 (328) T protein:vir:95 226 ALRDWRYVVRIANIDVSNLSEPSSAANIAKL---MVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETE 302 (328) T ss_pred EEcCcccEEEEecCcccccccccChhhHHHH---HHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccC Confidence 0 000111112221 111122335566788899999876655554322221 23 Q ss_pred CccccccCCCceeEecCCCCCCc--EE Q lcl|NC_019921. 288 GVYVTALPFNLNVIESTVQEAGK--VL 312 (381) Q Consensus 288 G~~~~~l~~G~pVv~s~~~p~~~--i~ 312 (381) |..++ ..+|+||..++++-... ++ T Consensus 303 g~~~t-~~~gipir~~dai~~tE~~vv 328 (328) T protein:vir:95 303 GEWWT-SFRGVPIRETDALLETEARVV 328 (328) T ss_pred Cccee-EECCeEEEEEeeeecCccccC Confidence 33332 33688988888764322 22 No 137 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.51 E-value=2.7e-09 Score=67.54 Aligned_cols=287 Identities=12% Similarity=0.012 Sum_probs=140.8 Q ss_pred HhhccccccCHHHHHHHHHHhhccC-CCCc--eeccHHHHHHHHHHHHhhhhhhhhceeEec-CCc-eEEEEecCCcceE Q lcl|NC_019921. 57 SLPKSAQSLSANQRSFFMDINKNVN-YKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLR-LKFLKSETSGVAV 131 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~-~~gg--~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~-~g~-~~~p~~~~~~~a~ 131 (381) +.--....+ . .+.+.+ ++|. .+-=+.|..+++......+.+++++++.++ +|+ +++|+-.. .++. T Consensus 1 m~~~~~~~~--------~-t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG~-~tv~ 70 (347) T protein:vir:94 1 MANVPGQKI--------G-TDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMGR-TSGV 70 (347) T ss_pred CCCCCcccc--------c-cccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecccc-eeee Confidence 110000000 0 011111 2222 122378899999988888888999888875 344 67777543 3344 Q ss_pred Eeecccccccc-cCcceeeEeecceeE-EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheee----ccC-CCcc Q lcl|NC_019921. 132 WGKIYGEIKGQ-LDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTG-KDQP 204 (381) Q Consensus 132 wv~e~~~~~~~-~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~----G~G-~~~P 204 (381) -...+.++... .+.+=.+++|...++ +....|..-=--++..|+.+.+.++.+.++++..|.+|+. ... +..+ T Consensus 71 ~~t~G~~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~ 150 (347) T protein:vir:94 71 YLAPGERLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAAS 150 (347) T ss_pred eecCCCCcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 34433333211 112224544443333 2222333222224667899999999999999999987752 111 1111 Q ss_pred eEeeeccccccccccccccceeeeeeecc-cc----cchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhh Q lcl|NC_019921. 205 IGLNRQVQKGVSVTEGAYPEKEEQGTLTF-AN----PRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) Q Consensus 205 ~Gil~~~~~~~~~~~~~~~~~~~~~~~t~-~~----~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~ 279 (381) .+.......+... ..++... .+ ....++.+.++...| +.+..+-. +-+.+++|..++.++. T Consensus 151 ~~~~~g~~~~s~~---------~~~~~~~~~~~~~~~~~~~~~i~~a~~~L----de~~VP~~-~R~~vv~P~~~~~Ll~ 216 (347) T protein:vir:94 151 NENIAGLGTASVL---------EVGKKADLDTPAKLGEAIIGQLTIARAKL----TSNYVPAG-DRYFYTTPDNYSAILA 216 (347) T ss_pred ccccCCCccccee---------eccccccccchhhhHHHHHHHHHHHHHHH----hhcCCCCC-CcEEEeCHHHHHHHhc Confidence 1111110000000 0001111 11 122333333333333 22222323 3457899998887754 Q ss_pred hhhccC----CCCccccc---cCCCceeEecCCCCCCc-----------EEE-------Eee-cceE------------- Q lcl|NC_019921. 280 QYTHLN----ANGVYVTA---LPFNLNVIESTVQEAGK-----------VLT-------YVK-GLYD------------- 320 (381) Q Consensus 280 ~~~~~~----~~G~~~~~---l~~G~pVv~s~~~p~~~-----------i~f-------gd~-~~y~------------- 320 (381) .....+ +++....+ ..+|.+|++|+.+|.+. +.- +++ ..|. T Consensus 217 ~~~~~~~~~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~ 296 (347) T protein:vir:94 217 ALMPNAANYAALIDPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHR 296 (347) T ss_pred cchhhhhhccccccccccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeeh Confidence 322111 12222211 23799999999999421 111 111 1121 Q ss_pred ----EEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecC Q lcl|NC_019921. 321 ----GYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 321 ----i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~ 369 (381) .+...+++++..++.... ...+++++.++.++++|++.++++++-+. T Consensus 297 ~A~~~v~~~~~~~e~~r~~~~~--~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 297 SAVGTVKLRDLALERDRDVDAQ--GDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred hhhhhhhcccccccchhchhhH--HHHhhhhhhhcCcccccceeEEEEecCCC Confidence 112233455544433222 24789999999999999999988777544 No 138 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.47 E-value=1.3e-07 Score=58.41 Aligned_cols=296 Identities=9% Similarity=-0.041 Sum_probs=148.2 Q ss_pred cCHHHHHHHHHHhhc-c-CCCCceec-cHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEecCCcceEEeeccccc Q lcl|NC_019921. 65 LSANQRSFFMDINKN-V-NYKEEKLL-PEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGKIYGEI 139 (381) Q Consensus 65 lt~~e~~~~~~~~~~-~-~~~gg~lv-P~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~-~~~p~~~~~~~a~wv~e~~~~ 139 (381) +|- .|.+... . +++.-+-+ =+.+..++.+.....+.++++..+.++. |+ +++|+-.. .++....-+.+. T Consensus 1 ms~-----~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG~-~~~~~~~~G~~l 74 (364) T protein:vir:10 1 MSN-----PNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIGE-TELQVLSPGKSP 74 (364) T ss_pred CCC-----cccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeeee-eEEeeeccCccc Confidence 100 0111100 0 11111233 4889999999999999999999988864 43 77887533 333333333333 Q ss_pred ccccCcceeeEeecceeE-EEeeeccHHhhhcCHHH-HHHHHHHHHHHHHHHHHhhheee---ccCCCcceEeeeccccc Q lcl|NC_019921. 140 KGQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAW-IERFVRVQIEEAFAVALETAFLK---GTGKDQPIGLNRQVQKG 214 (381) Q Consensus 140 ~~~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~-~e~~l~~~la~~~~~~~~~a~i~---G~G~~~P~Gil~~~~~~ 214 (381) .. ..+.-++.+|....+ ++...|-.=---++.+| +.+.+..++++++++..|..++. --+...-.+...... T Consensus 75 d~-~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~-- 151 (364) T protein:vir:10 75 DA-SPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPR-- 151 (364) T ss_pred CC-CCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCc-- Confidence 32 345566766666443 33333333222356677 78899999999999999998741 011000000000000 Q ss_pred cccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC------CCC Q lcl|NC_019921. 215 VSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN------ANG 288 (381) Q Consensus 215 ~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~------~~G 288 (381) .. .+|.. .. ...+..+..+.+..+.+.+.........+..+..+ -+.+|+|..++.++....+.| ++| T Consensus 152 ~~-~~g~~---i~-~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~-R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~ 225 (364) T protein:vir:10 152 VA-GHGFS---IH-IVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSE-LCGLMPWTAFNCLRDADRIVDKSYTIAASD 225 (364) T ss_pred cc-CCcce---ee-ecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccc-cEEEeChHHHHHHhcCCccccccccccCCC Confidence 00 00000 00 01112223334444444333333333344444444 567899999888765322221 344 Q ss_pred ccccc---cCCCceeEecCCCCCC--------cE---------------EEEeecce--EEE--------eecceEEEee Q lcl|NC_019921. 289 VYVTA---LPFNLNVIESTVQEAG--------KV---------------LTYVKGLY--DGY--------LAGGINVQKF 332 (381) Q Consensus 289 ~~~~~---l~~G~pVv~s~~~p~~--------~i---------------~fgd~~~y--~i~--------~r~~i~i~~~ 332 (381) .|.++ ...|.||+.|+.+|.. .+ ..+||... +++ .-.++..+.. T Consensus 226 ~~~~G~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~ 305 (364) T protein:vir:10 226 NTVDGFVLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIF 305 (364) T ss_pred ccccceeEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeee Confidence 55543 2369999999999841 00 11444332 222 2345555554 Q ss_pred hh-hhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 333 KE-TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 333 ~~-~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) ++ .++. ....+++-++.++.+|+|+++++. -.+.+|++.- +++ T Consensus 306 ~~~~~~~---~~ida~~a~G~g~lRPeaa~~i~~-~~~~~~~~~~--~~~ 349 (364) T protein:vir:10 306 YEKKEKT---WYIDTFLAEGAIPDRWEAVAVVTA-ADTAELATDH--NAI 349 (364) T ss_pred eccceee---eeeeeehcccCcccCccceEEEEe-cCCCCCccch--hhh Confidence 33 2232 233455668999999999987642 1222232221 111 No 139 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.47 E-value=2.8e-08 Score=61.99 Aligned_cols=294 Identities=11% Similarity=0.030 Sum_probs=152.6 Q ss_pred ccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEecCCcceEEeeccccc Q lcl|NC_019921. 62 AQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGKIYGEI 139 (381) Q Consensus 62 ~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~-~~~p~~~~~~~a~wv~e~~~~ 139 (381) ...++.--|- -..+..++-- +-=+.+..++.+.+...+.++++..+.++. |+ +++|+- +..++....-+.++ T Consensus 1 ms~~~~~tr~----~~~~s~~d~a-l~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l 74 (335) T protein:vir:63 1 MSFLNDLTRP----NYAGKNADVD-IHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEEL 74 (335) T ss_pred CCCcccchhh----hcccccchhh-eehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeee-eeeeeecccCCcCc Confidence 1101000000 0011222222 333999999999999999999999988864 44 677775 43444444444444 Q ss_pred ccccCcceeeEeecceeE-EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhhee----eccCCCcceEeeeccccc Q lcl|NC_019921. 140 KGQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL----KGTGKDQPIGLNRQVQKG 214 (381) Q Consensus 140 ~~~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i----~G~G~~~P~Gil~~~~~~ 214 (381) .. ..+..++..|....+ ++...|..----++.+|+.+.+..++++++++..|.+++ .+-+...|.++=... T Consensus 75 ~~-~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~--- 150 (335) T protein:vir:63 75 ER-SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAF--- 150 (335) T ss_pred CC-CCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCc--- Confidence 33 234556766666554 344444444444677899999999999999999999764 444433332211000 Q ss_pred cccccccccceeeeeeecccccchhHHHHHHHHHHhhhc--cccccccccCceEEEEchhhHHHHhhhhhc-----cCCC Q lcl|NC_019921. 215 VSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTN--EKGKSVAVKGNVTMVVNPSDAFEVQAQYTH-----LNAN 287 (381) Q Consensus 215 ~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~--~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~-----~~~~ 287 (381) ..|... ...++........+.+...+..+... ....|..-...-+.+|+|..++.++...-+ .+++ T Consensus 151 ---~~G~~~----~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~ 223 (335) T protein:vir:63 151 ---SPGVLE----KLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATG 223 (335) T ss_pred ---CCCcce----eeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcccccccccccccc Confidence 001000 00111112222233343333222222 222222112235678999998887653222 2233 Q ss_pred C--ccccc---cCCCceeEecCCCCCCcE-----------EEEeecceE----------EEeecceEEEeehh-hhhhcC Q lcl|NC_019921. 288 G--VYVTA---LPFNLNVIESTVQEAGKV-----------LTYVKGLYD----------GYLAGGINVQKFKE-TLALDD 340 (381) Q Consensus 288 G--~~~~~---l~~G~pVv~s~~~p~~~i-----------~fgd~~~y~----------i~~r~~i~i~~~~~-~~~~~d 340 (381) | .|.++ ...|.||+.|+.+|.+.+ .-|||+... .+.-.++..+...+ ..|. T Consensus 224 ~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~-- 301 (335) T protein:vir:63 224 ATNDYVKSRVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFS-- 301 (335) T ss_pred ccccccCceeEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccchhh-- Confidence 3 24443 236999999999995432 234554322 22223333332222 2233 Q ss_pred ceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCc Q lcl|NC_019921. 341 MDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 341 ~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~ 378 (381) ..+.++.-++.++++|+|++++++ ..-++...|- T Consensus 302 -~~i~~~~a~G~g~lRPe~a~~i~~---tg~~~~~~~~ 335 (335) T protein:vir:63 302 -WVLDTFQMYNIGARRPDTAGAIEL---KGIGAFDITA 335 (335) T ss_pred -HHhHHHHHcCCcccccceEEEEEE---cCCCceeecC Confidence 244555558999999999988664 3334444444 No 140 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.45 E-value=1.1e-08 Score=64.12 Aligned_cols=292 Identities=10% Similarity=0.021 Sum_probs=152.7 Q ss_pred cCHHHHHHHHHHhh---ccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEecCCcceEEeeccccc Q lcl|NC_019921. 65 LSANQRSFFMDINK---NVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGKIYGEI 139 (381) Q Consensus 65 lt~~e~~~~~~~~~---~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~-~~~p~~~~~~~a~wv~e~~~~ 139 (381) +|- .|..+. ++..+-=.+.=+.+..++.+.....+.++++..++++. |+ +++|+- +..++....-+.+. T Consensus 1 Ms~-----~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G~s~~~~~~pG~~l 74 (401) T protein:vir:70 1 MST-----PNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSP 74 (401) T ss_pred CCC-----CccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEeeeecCCCCc Confidence 100 000000 01111112456888999999999999999999998874 44 677775 43444444443444 Q ss_pred ccccCcceeeEeecceeE-EEeeeccHHhhhcCHHH-HHHHHHHHHHHHHHHHHhhheee-----ccCC-----CcceEe Q lcl|NC_019921. 140 KGQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAW-IERFVRVQIEEAFAVALETAFLK-----GTGK-----DQPIGL 207 (381) Q Consensus 140 ~~~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~-~e~~l~~~la~~~~~~~~~a~i~-----G~G~-----~~P~Gi 207 (381) .. ..+..++..|....+ ++...|..=---++.+| +.+.+.+++++++++..|..++. |-.. ..|.|. T Consensus 75 d~-~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~ 153 (401) T protein:vir:70 75 AA-TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVK 153 (401) T ss_pred CC-CCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcC Confidence 33 345666766665444 34444433333456677 78899999999999999986631 2110 111111 Q ss_pred eeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhh--h---h Q lcl|NC_019921. 208 NRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ--Y---T 282 (381) Q Consensus 208 l~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~--~---~ 282 (381) .+|..........-+..++..+.+.+.++...| ..+..+. +..+.++.|.-|..++.. . . T Consensus 154 ----------~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~L----dEkdVP~-~r~vvl~pp~~Ys~Ll~~d~L~nrd 218 (401) T protein:vir:70 154 ----------GHGFSINVEVAEGEALVNPQYVMAAVEFALEQQ----LEQEVDI-SDVAILMPWRYFNVLRDADRIVDKT 218 (401) T ss_pred ----------CCceEEeccccccccccCHHHHHHHHHHHHHHH----HhcCCCc-cceEEEcCHHHHHHHHhcCcccchh Confidence 001000000000111123333343343433332 2223333 345566665555444432 1 1 Q ss_pred cc-CCCCccccc---cCCCceeEecCCCCCCc--E---------------EEEeecc--eEEEeec--------ceEEEe Q lcl|NC_019921. 283 HL-NANGVYVTA---LPFNLNVIESTVQEAGK--V---------------LTYVKGL--YDGYLAG--------GINVQK 331 (381) Q Consensus 283 ~~-~~~G~~~~~---l~~G~pVv~s~~~p~~~--i---------------~fgd~~~--y~i~~r~--------~i~i~~ 331 (381) +. .++|.|+++ ...|+||+.|+++|.+. | .-|||+. -+++.+. +++-+. T Consensus 219 ~~~s~~g~~~~G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~ 298 (401) T protein:vir:70 219 YTISQSGATIQGFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDI 298 (401) T ss_pred hccccCCccccceEEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccch Confidence 11 235666654 33699999999999521 1 0134433 1222222 222222 Q ss_pred ehh-hhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCc---ccC Q lcl|NC_019921. 332 FKE-TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE---ETL 381 (381) Q Consensus 332 ~~~-~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~---~~~ 381 (381) .++ ..|..- ..+++-++-.+.+|+|.+|++.+....+++.++++ .|. T Consensus 299 ~~d~r~~~~~---id~~~a~g~g~~RPeaa~vv~~k~~~~~~~~~~~~~~~~~~ 349 (401) T protein:vir:70 299 FYEKKEKTYY---IDTFMAEGAIPDRWEAVSVVTTKRNTTTGAVEGTDGAQHTI 349 (401) T ss_pred hhhhhhhHHH---HHHHHHhCCcccchhheEEEeecCcccccccccCCcchhhh Confidence 222 222222 22555578899999999999999999999999988 333 No 141 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.45 E-value=2.5e-08 Score=62.31 Aligned_cols=237 Identities=17% Similarity=0.196 Sum_probs=139.8 Q ss_pred HhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec--CCceEEEEecCCcceEEee Q lcl|NC_019921. 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA--GLRLKFLKSETSGVAVWGK 134 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~--~g~~~~p~~~~~~~a~wv~ 134 (381) +.......+|--| +.+ .+-|......|+|.+.+.++|+..+.+... +....-.+..+-|++.|-. T Consensus 1 m~~~~~~a~TL~e------~AK-------r~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR~ 67 (330) T protein:vir:10 1 MATLSTNNPTMAD------VAK-------RLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRK 67 (330) T ss_pred CCcCCCCcccHHH------HHh-------hcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhhh Confidence 1111112222221 111 133566778899999999999999887643 2223345667788999999 Q ss_pred cccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHH---HHHHHHHHHHHHHhhheeeccCCCcceEee--- Q lcl|NC_019921. 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERF---VRVQIEEAFAVALETAFLKGTGKDQPIGLN--- 208 (381) Q Consensus 135 e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~---l~~~la~~~~~~~~~a~i~G~G~~~P~Gil--- 208 (381) -+...+ ++.+++.+++-..+-+.+...|.+.|.+... |..+| -.....+++++.....||+||-+..|.++. T Consensus 68 lN~g~~-~s~~tt~qvt~~l~ilgg~~eVDr~la~~~G-n~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~ 145 (330) T protein:vir:10 68 LYGGVL-PNKSSTAQVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLS 145 (330) T ss_pred cCCccc-cccceEEEEEEEeEEecchhhhhhHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchh Confidence 988776 4679999999999999999999999988654 45555 445588999999999999999877776542 Q ss_pred ecccc------cccc----c-----------------cccccce-------eeee--eecccc----------------- Q lcl|NC_019921. 209 RQVQK------GVSV----T-----------------EGAYPEK-------EEQG--TLTFAN----------------- 235 (381) Q Consensus 209 ~~~~~------~~~~----~-----------------~~~~~~~-------~~~~--~~t~~~----------------- 235 (381) +..+. .... + .|.||.. ...| ++...+ T Consensus 146 kR~~~~ta~~~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~ 225 (330) T protein:vir:10 146 PRYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDI 225 (330) T ss_pred hhcCCCCCCchhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeee Confidence 21110 0000 0 1111211 0111 111010 Q ss_pred ----------------------cchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhcc--------C Q lcl|NC_019921. 236 ----------------------PRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL--------N 285 (381) Q Consensus 236 ----------------------~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~--------~ 285 (381) .+...+.|+++ +....+..|...+++.+|+||+.-...+......+ + T Consensus 226 Gl~i~d~r~vvRI~NIdvs~l~~~~~~~~li~l---m~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~ 302 (330) T protein:vir:10 226 GLTLRDWRYVARVCNIDVSDLATSANAQALIKY---MIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWET 302 (330) T ss_pred eeEEeCcccEEEEeecccccCCCCccHHHHHHH---HHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeee Confidence 11112223332 22223445666778899999987766665543322 2 Q ss_pred CCCccccccCCCceeEecCCCCCCc--EE Q lcl|NC_019921. 286 ANGVYVTALPFNLNVIESTVQEAGK--VL 312 (381) Q Consensus 286 ~~G~~~~~l~~G~pVv~s~~~p~~~--i~ 312 (381) ..|..++ ...|+||..++++-... ++ T Consensus 303 ~~g~~~t-~~~gipir~~Dail~tE~~vv 330 (330) T protein:vir:10 303 VSGERVM-TFDGIPVQRTDALLNTESRVV 330 (330) T ss_pred cCCeeeE-EECCeEEEEEeeeecCccccC Confidence 2344432 22588888888764322 22 No 142 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.45 E-value=5e-08 Score=60.61 Aligned_cols=294 Identities=12% Similarity=0.043 Sum_probs=150.4 Q ss_pred HhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEecCCcceEEee Q lcl|NC_019921. 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGK 134 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~-~~~p~~~~~~~a~wv~ 134 (381) +.. ...+|.. -..+.+++- .+-=+.+..++.+.+...+.++++..+.++. |+ +++|+- +...+.... T Consensus 1 ms~--~~~~t~~-------~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~ 69 (335) T protein:vir:78 1 MSF--LNDLTRP-------NYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRR 69 (335) T ss_pred CCc--ccccccc-------ccccccchh-hhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-eeeeecccc Confidence 000 0000000 001122222 2334899999999999999999999988864 44 778865 444444444 Q ss_pred cccccccccCcceeeEeecceeE-EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhhee----eccCCCcceEeee Q lcl|NC_019921. 135 IYGEIKGQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL----KGTGKDQPIGLNR 209 (381) Q Consensus 135 e~~~~~~~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i----~G~G~~~P~Gil~ 209 (381) -+.+... ..+..++..|....+ ++...|..----++.+|+.+.+.+++++++++..|++++ .+.+...|..+=. T Consensus 70 pG~~l~~-~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~ 148 (335) T protein:vir:78 70 AGEELER-SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLED 148 (335) T ss_pred cCcccCC-CCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCC Confidence 4444432 234556666666443 333444443334677899999999999999999999764 3433322221100 Q ss_pred ccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhc--cccccccccCceEEEEchhhHHHHhhhhhc---- Q lcl|NC_019921. 210 QVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTN--EKGKSVAVKGNVTMVVNPSDAFEVQAQYTH---- 283 (381) Q Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~--~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~---- 283 (381) ....|.. ....++...+...++.+.+.+...... ....|..-...-+.+|+|..++.++...-+ T Consensus 149 ------~~~~G~~----~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~ 218 (335) T protein:vir:78 149 ------AFSPGVL----EKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVE 218 (335) T ss_pred ------CcCCCcc----eeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccccccccc Confidence 0000000 000111111222233333333322211 222333222345688999998887653222 Q ss_pred -cCCCC--ccccc---cCCCceeEecCCCCCCcEE-----------EEeecc----------eEEEeecceEEEeehh-h Q lcl|NC_019921. 284 -LNANG--VYVTA---LPFNLNVIESTVQEAGKVL-----------TYVKGL----------YDGYLAGGINVQKFKE-T 335 (381) Q Consensus 284 -~~~~G--~~~~~---l~~G~pVv~s~~~p~~~i~-----------fgd~~~----------y~i~~r~~i~i~~~~~-~ 335 (381) .+++| .|.++ ...|.||+.|+.+|.+.+. -+||+. ...+.-.++..+...+ . T Consensus 219 ~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~ 298 (335) T protein:vir:78 219 YQATGATNDYVKSRVAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHD 298 (335) T ss_pred ccccccccccccceeEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccc Confidence 22333 24443 2369999999999964321 123322 2222223333333322 2 Q ss_pred hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCc Q lcl|NC_019921. 336 LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 336 ~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~ 378 (381) .|. ..+.++.-++.++++|+|.|+++ +++. ++...|- T Consensus 299 ~~~---~~i~~~~a~G~g~lRPe~a~~i~--~tg~-~~~~~~~ 335 (335) T protein:vir:78 299 QFS---WVLDTFQMYNIGARRPDTAGAIE--LKGI-EAFDITA 335 (335) T ss_pred hhh---HhhhHHHHcCCcccCcceEEEEE--ecCC-CcccccC Confidence 233 24455556899999999988755 4333 3333333 No 143 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=98.36 E-value=6e-08 Score=60.20 Aligned_cols=287 Identities=11% Similarity=-0.033 Sum_probs=147.3 Q ss_pred cCHHHHHHHHHHhhccCCCCceec------cHHHHHHHHHHHHhhhhhhhhceeE-ec-CCceEEEEecC---CcceEEe Q lcl|NC_019921. 65 LSANQRSFFMDINKNVNYKEEKLL------PEETIDRIFEDLTTNHPLLADLGIK-NA-GLRLKFLKSET---SGVAVWG 133 (381) Q Consensus 65 lt~~e~~~~~~~~~~~~~~gg~lv------P~~~~~~I~~~l~~~~~l~~~~~v~-~~-~g~~~~p~~~~---~~~a~wv 133 (381) ++.- .- ...+..++..+| |+-+-++|.+.++..-..-.+.+.. .. ++-+.+..... ...+.-+ T Consensus 1 ~~~~-----~~-i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~V 74 (318) T protein:vir:10 1 MTAP-----TG-IVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADV 74 (318) T ss_pred CCCC-----Cc-ceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhc Confidence 1110 00 000111233333 7777777777765544322333322 22 23333333221 2345567 Q ss_pred ecccccccccCcceeeEee-cceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccc Q lcl|NC_019921. 134 KIYGEIKGQLDAAFSEETA-IQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQ 212 (381) Q Consensus 134 ~e~~~~~~~~~~~f~~v~l-~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~ 212 (381) .|+++++ ...++++.-.+ ..+|.+.-++||.|++..+..+..+-....++++|++..|...+.- |.+..+ T Consensus 75 aEggEiP-~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~da--------l~sa~t 145 (318) T protein:vir:10 75 AEFGEIP-VSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKAL--------LQSPIV 145 (318) T ss_pred cCccccc-ccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHH--------Hhcccc Confidence 8988887 56778877777 4479999999999999999999999999999999999988765431 101000 Q ss_pred cccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhc-----cCCC Q lcl|NC_019921. 213 KGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH-----LNAN 287 (381) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~-----~~~~ 287 (381) ... +.++.|...... ....+++...+....+-..............|..+. .+|||.++..+++...+ .+.+ T Consensus 146 ~~~-~~s~~w~~~~~~-~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdt-IVlhP~~~~~l~~n~~~~~~y~~~a~ 222 (318) T protein:vir:10 146 PTL-AVPTAWDNGGKV-RTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDT-IVMHYALLPILMDNENFMKVYERNAN 222 (318) T ss_pred ccc-cCCcCCCCcccc-cccchhhhhhhhhhhhhhhhhhhhhhhhccCcccee-eEECHHHHHHHhcchhhhhhhhccch Confidence 000 011111100000 000001110000000000000000011233466554 67999998877432211 1111 Q ss_pred -----Cccc---cccCCCceeEecCCCCCCcEEEEeecce-EEEeecceEEEeehh-----hhhhcCceEEEEEEEEcCE Q lcl|NC_019921. 288 -----GVYV---TALPFNLNVIESTVQEAGKVLTYVKGLY-DGYLAGGINVQKFKE-----TLALDDMDLYTAKQFAYGK 353 (381) Q Consensus 288 -----G~~~---~~l~~G~pVv~s~~~p~~~i~fgd~~~y-~i~~r~~i~i~~~~~-----~~~~~d~~~~r~~~r~dGk 353 (381) ..|. .+.++|+.|+.|..+|.++++..+-... .+.|-.+++...... ..-.+.....|++++---- T Consensus 223 ~~~~~~~~tg~~~g~~lGl~vi~s~~~p~~~alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~ 302 (318) T protein:vir:10 223 YVSTAPDWTGNFPGSVMGLNVIRSRTFPIDRVLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRALA 302 (318) T ss_pred hhhhcccccccccceeeceEEeecCccCCCeeEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehheeeeee Confidence 1111 1234799999999999999877664442 345777777654331 1112333455666666667 Q ss_pred EecCceEEEEEEEecCCccccccCc Q lcl|NC_019921. 354 AKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 354 ~~~~~Afvv~~~~~~~~~~~~~~~~ 378 (381) +.+|+|.+.++ -- +|| T Consensus 303 V~~PkA~~~it--gi-------~~~ 318 (318) T protein:vir:10 303 VDQPKAALWLT--GI-------VTP 318 (318) T ss_pred eeCcceeEEEe--ec-------cCC Confidence 89999966444 21 222 No 144 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.36 E-value=9.2e-08 Score=59.16 Aligned_cols=295 Identities=13% Similarity=0.057 Sum_probs=148.6 Q ss_pred cCHHHHHHHHHHhhccCC-CCc-----eeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEecCCcceEEeecc Q lcl|NC_019921. 65 LSANQRSFFMDINKNVNY-KEE-----KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGKIY 136 (381) Q Consensus 65 lt~~e~~~~~~~~~~~~~-~gg-----~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~-~~~p~~~~~~~a~wv~e~ 136 (381) ++.--...+-..+.++.. -|| .+-=+.+..++.+.....+.+++++++.++. |+ +++|+-.. .++....-+ T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG~-~t~~~~t~G 79 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTGR-MTSSFHTPG 79 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeeee-eEEeeecCC Confidence 000000000001111111 111 2334888999999999999999999988764 44 66776533 333333322 Q ss_pred cccc--cccCcceeeEeecceeE-EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhhee----eccCCCcceEeee Q lcl|NC_019921. 137 GEIK--GQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL----KGTGKDQPIGLNR 209 (381) Q Consensus 137 ~~~~--~~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i----~G~G~~~P~Gil~ 209 (381) .++. +..+++-.+.+|...++ +....|..-=--++..|+.+.+.++.+.++++..|.+++ .|-....|.+.-. T Consensus 80 ~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~~ 159 (375) T protein:vir:10 80 TPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSATN 159 (375) T ss_pred cCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 3221 11233333333443333 333344433333577789999999999999999998775 2333333322110 Q ss_pred ccccccccccccccceeeeeeecc----cccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhh--- Q lcl|NC_019921. 210 QVQKGVSVTEGAYPEKEEQGTLTF----ANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT--- 282 (381) Q Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~t~----~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~--- 282 (381) .. ..|... ...+..+. .++..+++.+.++...| ..+..+-.+ -+.+++|..++.++...+ T Consensus 160 ~~------~~Gg~~--i~~~sg~~~~~~~ta~~~~~ai~~a~~~L----de~~VP~~~-R~~vv~P~~y~~Ll~~~d~~~ 226 (375) T protein:vir:10 160 FV------EPGGTQ--IRVGSGTNESDAFTASALVNAFYDAAAAM----DEKGVSSQG-RCAVLNPRQYYALIQDIGSNG 226 (375) T ss_pred cc------ccCcce--eeeccccccccccCHHHHHHHHHHHHHHH----hhcCCCCCC-CEEEeChHHHHHHHhcCCccc Confidence 00 000000 00111111 23344445555444443 333334333 446789998877754311 Q ss_pred ccC----CCCcccc---ccCCCceeEecCCCCCCcE-------------------------------------EEEee-- Q lcl|NC_019921. 283 HLN----ANGVYVT---ALPFNLNVIESTVQEAGKV-------------------------------------LTYVK-- 316 (381) Q Consensus 283 ~~~----~~G~~~~---~l~~G~pVv~s~~~p~~~i-------------------------------------~fgd~-- 316 (381) +.+ .+|.+.+ ....|.+|+.|..+|...+ +-+|| T Consensus 227 ~~n~d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~ 306 (375) T protein:vir:10 227 LVNRDVQGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAEL 306 (375) T ss_pred eeeecccccceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccc Confidence 111 1222222 1235899999999984211 11233 Q ss_pred -c----------ceEEEeecceEEEeeh-hhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcccccc Q lcl|NC_019921. 317 -G----------LYDGYLAGGINVQKFK-ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEG 376 (381) Q Consensus 317 -~----------~y~i~~r~~i~i~~~~-~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~ 376 (381) + ....+.-.++.++++. ++.-.+-...+++++-++..+.+|+|+|. |+..++.++- + T Consensus 307 ~~~~~~~~~~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~--l~~~~~~~~~-~ 375 (375) T protein:vir:10 307 GAKSCGLIFQKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVE--LYIGATAPSA-F 375 (375) T ss_pred cCceEEEEEchhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEE--EecCcCcccc-C Confidence 1 1122223456666653 23344456788999999999999999776 4554433222 2 No 145 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=98.35 E-value=1.1e-07 Score=58.63 Aligned_cols=285 Identities=11% Similarity=-0.011 Sum_probs=141.7 Q ss_pred HHHHhhccCC---------CCceeccHHHHHHHHHHHHhhhhhhhhceeEec---CC-ceEEEEecCCcceEEeeccccc Q lcl|NC_019921. 73 FMDINKNVNY---------KEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA---GL-RLKFLKSETSGVAVWGKIYGEI 139 (381) Q Consensus 73 ~~~~~~~~~~---------~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~---~g-~~~~p~~~~~~~a~wv~e~~~~ 139 (381) +..+. +++. .--.+||+.+..+|.+.+.....+.++++.... .| .+++|+.. .+++....++..+ T Consensus 1 ~~~~~-~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g-~~~a~d~~~g~~i 78 (381) T protein:vir:80 1 MATIQ-GTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS-RAAVYDKQPQTPV 78 (381) T ss_pred Cceec-ccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC-cceeeeecCCCcc Confidence 11111 1100 011368999999999999888887777654332 33 47888755 4566667766655 Q ss_pred ccccCcceeeEeecceeE-EEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeecc--CCCcceEe-eecccccc Q lcl|NC_019921. 140 KGQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGT--GKDQPIGL-NRQVQKGV 215 (381) Q Consensus 140 ~~~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~--G~~~P~Gi-l~~~~~~~ 215 (381) .. .+.+..++++...+. +.-+.|+..-...+..|+.+.+.+.++.++++..|..++.-- ....+.+. .+.. . T Consensus 79 ~~-~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~---~ 154 (381) T protein:vir:80 79 NL-QARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYD---T 154 (381) T ss_pred cc-cccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc---c Confidence 43 344556666666443 455788887777788899999999999999999999886321 11111110 0000 0 Q ss_pred ccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCC--------- Q lcl|NC_019921. 216 SVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA--------- 286 (381) Q Consensus 216 ~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~--------- 286 (381) ....+ ......+......+++.+.++...|.. . ..+.. +-+.+++|..+..++.-..+.+. T Consensus 155 ~i~~~-----~~~~~~t~~~~~~t~~~i~~a~~~Lde--~--~VP~e-gR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~ 224 (381) T protein:vir:80 155 TLGDG-----TVNAHLTGTPAPLTYAALLLAKQKLDE--A--DVPQE-GRIVMVSPAQYIDLLSINQFISVDFSQVKPVT 224 (381) T ss_pred ccccc-----ccccccccchhhHHHHHHHHHHHHHhh--c--CCCcC-CcEEEeCHHHHHHHhhchhhhhhhhccchhhh Confidence 00000 001111222334455666665554432 1 22222 34678999998887643211111 Q ss_pred CCccccccCCCceeEecCCCCCCcE-----EEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEE Q lcl|NC_019921. 287 NGVYVTALPFNLNVIESTVQEAGKV-----LTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAA 361 (381) Q Consensus 287 ~G~~~~~l~~G~pVv~s~~~p~~~i-----~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afv 361 (381) +|. + +..+|.+|+.|..+|.+.+ .+|-..... ..+.-.. ..-.|..+....+.....|.+.... -+. T Consensus 225 ~G~-I-g~i~G~~Vv~Sn~lp~~~~t~~~~~agap~~~~----~~~~~~~-~~g~~s~~a~av~~~k~yd~~~~~~-~~~ 296 (381) T protein:vir:80 225 SGV-V-GTILGMEVIVTTQIGINSLTGYVNGQGAPTQPT----PGVLGSP-YLPDQAGTANVVNTGSASDLAVSLS-YFG 296 (381) T ss_pred cee-e-eEEcceEEEeecccccccccceeeecccccccc----ccccccc-cccccccceeeeeeeeeeceeeeee-ecc Confidence 122 1 1337999999999997532 122111100 0000000 0111223344555555555554332 222 Q ss_pred EEEEEecCCcccc-ccCcccC Q lcl|NC_019921. 362 VWKLDLKGHKPAL-EGTEETL 381 (381) Q Consensus 362 v~~~~~~~~~~~~-~~~~~~~ 381 (381) +-..+-+...++. .+|-.+. T Consensus 297 ~~~~~g~~~~~~~~~~~~~~~ 317 (381) T protein:vir:80 297 LPVFSGAGATAADGGQTLGSF 317 (381) T ss_pred ceeeecceeeecCCCceeeee Confidence 2222222222111 1111111 No 146 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.27 E-value=5.5e-08 Score=60.39 Aligned_cols=259 Identities=13% Similarity=-0.026 Sum_probs=124.3 Q ss_pred hceeEecCCceEEEEecCCcceEEeeccccccc-ccCcceee--EeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHH Q lcl|NC_019921. 109 DLGIKNAGLRLKFLKSETSGVAVWGKIYGEIKG-QLDAAFSE--ETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEE 185 (381) Q Consensus 109 ~~~v~~~~g~~~~p~~~~~~~a~wv~e~~~~~~-~~~~~f~~--v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~ 185 (381) +++-+.-+..+++|+-. ..++....-+.++.. ..+..=.+ |++...++.. ..|..-=--++.+|+.+...++.+. T Consensus 1 ~vr~i~~g~s~~~~~iG-~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~-~~VdDiD~~qa~~Dlr~e~s~~~G~ 78 (324) T protein:vir:99 1 MTRTITSGKSAQFPVMG-RTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTD-VLIYDIEDAMNHYDVRSEYSTQMGE 78 (324) T ss_pred CeeeeecCceEEEeeee-eeEeccccCCCCcCCCcCCcCcccEEEEecchhhhh-hhhhhHHHHhcCccchhHHHHHHHH Confidence 43333333347788753 333333333333321 11222244 4444443333 3333333334778999999999999 Q ss_pred HHHHHHhhheee----ccCCCcceEeeeccccccccccccccceeeeeeec-ccccchhHHHHHHHHHHhhhcccccccc Q lcl|NC_019921. 186 AFAVALETAFLK----GTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLT-FANPRATVNELTQVFKYHSTNEKGKSVA 260 (381) Q Consensus 186 ~~~~~~~~a~i~----G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t-~~~~~~~~~~l~~l~~~l~~~~~~~~~~ 260 (381) ++++..|.+++. +.....|.+-- .....++........+... ..++...++.+.++...| ..+..+ T Consensus 79 aLA~~~Dq~i~~~~a~~~~~~a~~~~~-----~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~L----de~~VP 149 (324) T protein:vir:99 79 ALAMAADVANYAEMAKLVNSRKETTNE-----NIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAF----AKKYIP 149 (324) T ss_pred HHHHHHHHHHHHHHHHhhhcccccccC-----CcccCCccceecccccccccccCHHHHHHHHHHHHHHH----hhcCCC Confidence 999999987641 11111111000 0000000000000000000 112333444444444443 333333 Q ss_pred ccCceEEEEchhhHHHHhhhhhccC----CCCcccccc---CCCceeEecCCCCCCcE---------------------- Q lcl|NC_019921. 261 VKGNVTMVVNPSDAFEVQAQYTHLN----ANGVYVTAL---PFNLNVIESTVQEAGKV---------------------- 311 (381) Q Consensus 261 ~~~~a~~~mn~~t~~~~~~~~~~~~----~~G~~~~~l---~~G~pVv~s~~~p~~~i---------------------- 311 (381) -.+ -+.+++|..++.+.......+ +.|.+.++. ..|.+|+.|+++|...+ T Consensus 150 ~~g-R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~ 228 (324) T protein:vir:99 150 AGD-RTFYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTT 228 (324) T ss_pred CCC-CEEEeChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCcccccccccccccccccccccccccccc Confidence 333 456889998876654322111 234343332 26999999999996321 Q ss_pred ---EEEeec----------ceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCc Q lcl|NC_019921. 312 ---LTYVKG----------LYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) Q Consensus 312 ---~fgd~~----------~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~ 378 (381) +-+|++ ....+.-.+++.+.+.+... -...+++++-++.++++||+++++++.-......+..-| T Consensus 229 ~~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~--~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~~~~~~ 306 (324) T protein:vir:99 229 TGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPEY--QADQIIAKYAMGHGGLRPEAVGAIIFEDGETPAVAPDVI 306 (324) T ss_pred ccccccccCceeEEEEehhheEEEeeecceecceechhh--HHHhhhhhhhhcCcccccceEEEEEEccCccccccchhh Confidence 112221 12233334445554443322 225678888899999999999988775543333333333 Q ss_pred ccC Q lcl|NC_019921. 379 ETL 381 (381) Q Consensus 379 ~~~ 381 (381) .++ T Consensus 307 ~~~ 309 (324) T protein:vir:99 307 TGV 309 (324) T ss_pred hhh Confidence 333 No 147 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=98.23 E-value=1.8e-07 Score=57.55 Aligned_cols=236 Identities=14% Similarity=0.099 Sum_probs=134.9 Q ss_pred HhhccccccCHHHHHHHHHHhhccCCCCceeccH-HHHHHHHHHHHhhhhhhhhceeEecC--CceEEEEecCCcceEEe Q lcl|NC_019921. 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPE-ETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAVWG 133 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~-~~~~~I~~~l~~~~~l~~~~~v~~~~--g~~~~p~~~~~~~a~wv 133 (381) +...+...+|--| ..+..+ |. .+...|+|.+.+.+||+..+.+.... ......+.++-|++.|. T Consensus 1 m~~~~~~~~TL~e------~Ak~~~-------~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR 67 (331) T protein:vir:10 1 MPTLSTTNPTLAD------VAARMT-------PDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWR 67 (331) T ss_pred CCccccCcccHHH------HHHhcC-------cchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhh Confidence 1111111111111 111111 33 34567999999999999999987542 22445677888999999 Q ss_pred ecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHH---HHHHHHHHHHHHhhheeeccCCCcceEe--- Q lcl|NC_019921. 134 KIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFV---RVQIEEAFAVALETAFLKGTGKDQPIGL--- 207 (381) Q Consensus 134 ~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l---~~~la~~~~~~~~~a~i~G~G~~~P~Gi--- 207 (381) .-+...+ ++.+++.+++-..+-+.+.+.|.+.|.+... +..+|- ...+.++++......||+||-+..|.++ T Consensus 68 ~lN~g~~-~s~~tt~q~t~~l~ilgg~~eVDk~la~~~G-n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL 145 (331) T protein:vir:10 68 KLNYGVQ-PEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGL 145 (331) T ss_pred ccCCccC-cccceeEEEEEEEEEeccceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccc Confidence 9988776 5788999999999999999999999988755 466554 4458889999999999999977666654 Q ss_pred eecc---cc---ccc----ccc-----------------cccccee-------eee--eecc------------------ Q lcl|NC_019921. 208 NRQV---QK---GVS----VTE-----------------GAYPEKE-------EQG--TLTF------------------ 233 (381) Q Consensus 208 l~~~---~~---~~~----~~~-----------------~~~~~~~-------~~~--~~t~------------------ 233 (381) -+.. +. ... .++ |.||... ..| ++.. T Consensus 146 ~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~G 225 (331) T protein:vir:10 146 TPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIG 225 (331) T ss_pred hhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeee Confidence 2211 10 000 001 1111111 000 0000 Q ss_pred --------------c-------ccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCC------ Q lcl|NC_019921. 234 --------------A-------NPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA------ 286 (381) Q Consensus 234 --------------~-------~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~------ 286 (381) . ++.+.+|. +..+....+..|...+++.+|+||+.-...++.....+++ T Consensus 226 l~i~d~r~v~ri~NIdvs~l~~~~~~~~dl----~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~ 301 (331) T protein:vir:10 226 LTLRDWRYVVRIANVDVSELTKNASAGADL----IDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTM 301 (331) T ss_pred eEEcCcccEEEEeccchhccCCCcchhhhH----HHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeee Confidence 0 01111222 2222222233455567889999998776666544332221 Q ss_pred ---CCccccccCCCceeEecCCCCCCc--EE Q lcl|NC_019921. 287 ---NGVYVTALPFNLNVIESTVQEAGK--VL 312 (381) Q Consensus 287 ---~G~~~~~l~~G~pVv~s~~~p~~~--i~ 312 (381) .|..++ ...|+||..++++-... ++ T Consensus 302 ~~~~g~~~t-~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 302 EEIAGKKVV-AFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred eecCCccee-EECCeeEEEeeeeecCccccC Confidence 122222 22588888887764322 22 No 148 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=98.23 E-value=1.8e-07 Score=57.55 Aligned_cols=236 Identities=14% Similarity=0.099 Sum_probs=134.9 Q ss_pred HhhccccccCHHHHHHHHHHhhccCCCCceeccH-HHHHHHHHHHHhhhhhhhhceeEecC--CceEEEEecCCcceEEe Q lcl|NC_019921. 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPE-ETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAVWG 133 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~-~~~~~I~~~l~~~~~l~~~~~v~~~~--g~~~~p~~~~~~~a~wv 133 (381) +...+...+|--| ..+..+ |. .+...|+|.+.+.+||+..+.+.... ......+.++-|++.|. T Consensus 1 m~~~~~~~~TL~e------~Ak~~~-------~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR 67 (331) T protein:vir:98 1 MPTLSTTNPTLAD------VAARMT-------PDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWR 67 (331) T ss_pred CCccccCcccHHH------HHHhcC-------cchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhh Confidence 1111111111111 111111 33 34567999999999999999987542 22445677888999999 Q ss_pred ecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHH---HHHHHHHHHHHHhhheeeccCCCcceEe--- Q lcl|NC_019921. 134 KIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFV---RVQIEEAFAVALETAFLKGTGKDQPIGL--- 207 (381) Q Consensus 134 ~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l---~~~la~~~~~~~~~a~i~G~G~~~P~Gi--- 207 (381) .-+...+ ++.+++.+++-..+-+.+.+.|.+.|.+... +..+|- ...+.++++......||+||-+..|.++ T Consensus 68 ~lN~g~~-~s~~tt~q~t~~l~ilgg~~eVDk~la~~~G-n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL 145 (331) T protein:vir:98 68 KLNYGVQ-PEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGL 145 (331) T ss_pred ccCCccC-cccceeEEEEEEEEEeccceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccc Confidence 9988776 5788999999999999999999999988755 466554 4458889999999999999977666654 Q ss_pred eecc---cc---ccc----ccc-----------------cccccee-------eee--eecc------------------ Q lcl|NC_019921. 208 NRQV---QK---GVS----VTE-----------------GAYPEKE-------EQG--TLTF------------------ 233 (381) Q Consensus 208 l~~~---~~---~~~----~~~-----------------~~~~~~~-------~~~--~~t~------------------ 233 (381) -+.. +. ... .++ |.||... ..| ++.. T Consensus 146 ~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~G 225 (331) T protein:vir:98 146 TPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIG 225 (331) T ss_pred hhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeee Confidence 2211 10 000 001 1111111 000 0000 Q ss_pred --------------c-------ccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCC------ Q lcl|NC_019921. 234 --------------A-------NPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA------ 286 (381) Q Consensus 234 --------------~-------~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~------ 286 (381) . ++.+.+|. +..+....+..|...+++.+|+||+.-...++.....+++ T Consensus 226 l~i~d~r~v~ri~NIdvs~l~~~~~~~~dl----~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~ 301 (331) T protein:vir:98 226 LTLRDWRYVVRIANVDVSELTKNASAGADL----IDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTM 301 (331) T ss_pred eEEcCcccEEEEeccchhccCCCcchhhhH----HHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeee Confidence 0 01111222 2222222233455567889999998776666544332221 Q ss_pred ---CCccccccCCCceeEecCCCCCCc--EE Q lcl|NC_019921. 287 ---NGVYVTALPFNLNVIESTVQEAGK--VL 312 (381) Q Consensus 287 ---~G~~~~~l~~G~pVv~s~~~p~~~--i~ 312 (381) .|..++ ...|+||..++++-... ++ T Consensus 302 ~~~~g~~~t-~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:98 302 EEIAGKKVV-AFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred eecCCccee-EECCeeEEEeeeeecCccccC Confidence 122222 22588888887764322 22 No 149 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=98.23 E-value=1.8e-07 Score=57.55 Aligned_cols=236 Identities=14% Similarity=0.099 Sum_probs=134.9 Q ss_pred HhhccccccCHHHHHHHHHHhhccCCCCceeccH-HHHHHHHHHHHhhhhhhhhceeEecC--CceEEEEecCCcceEEe Q lcl|NC_019921. 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPE-ETIDRIFEDLTTNHPLLADLGIKNAG--LRLKFLKSETSGVAVWG 133 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~-~~~~~I~~~l~~~~~l~~~~~v~~~~--g~~~~p~~~~~~~a~wv 133 (381) +...+...+|--| ..+..+ |. .+...|+|.+.+.+||+..+.+.... ......+.++-|++.|. T Consensus 1 m~~~~~~~~TL~e------~Ak~~~-------~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR 67 (331) T protein:vir:10 1 MPTLSTTNPTLAD------VAARMT-------PDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWR 67 (331) T ss_pred CCccccCcccHHH------HHHhcC-------cchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhh Confidence 1111111111111 111111 33 34567999999999999999987542 22445677888999999 Q ss_pred ecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHH---HHHHHHHHHHHHhhheeeccCCCcceEe--- Q lcl|NC_019921. 134 KIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFV---RVQIEEAFAVALETAFLKGTGKDQPIGL--- 207 (381) Q Consensus 134 ~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l---~~~la~~~~~~~~~a~i~G~G~~~P~Gi--- 207 (381) .-+...+ ++.+++.+++-..+-+.+.+.|.+.|.+... +..+|- ...+.++++......||+||-+..|.++ T Consensus 68 ~lN~g~~-~s~~tt~q~t~~l~ilgg~~eVDk~la~~~G-n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL 145 (331) T protein:vir:10 68 KLNYGVQ-PEKSRTVQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGL 145 (331) T ss_pred ccCCccC-cccceeEEEEEEEEEeccceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccc Confidence 9988776 5788999999999999999999999988755 466554 4458889999999999999977666654 Q ss_pred eecc---cc---ccc----ccc-----------------cccccee-------eee--eecc------------------ Q lcl|NC_019921. 208 NRQV---QK---GVS----VTE-----------------GAYPEKE-------EQG--TLTF------------------ 233 (381) Q Consensus 208 l~~~---~~---~~~----~~~-----------------~~~~~~~-------~~~--~~t~------------------ 233 (381) -+.. +. ... .++ |.||... ..| ++.. T Consensus 146 ~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~G 225 (331) T protein:vir:10 146 TPRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIG 225 (331) T ss_pred hhhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeee Confidence 2211 10 000 001 1111111 000 0000 Q ss_pred --------------c-------ccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCC------ Q lcl|NC_019921. 234 --------------A-------NPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA------ 286 (381) Q Consensus 234 --------------~-------~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~------ 286 (381) . ++.+.+|. +..+....+..|...+++.+|+||+.-...++.....+++ T Consensus 226 l~i~d~r~v~ri~NIdvs~l~~~~~~~~dl----~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~ 301 (331) T protein:vir:10 226 LTLRDWRYVVRIANVDVSELTKNASAGADL----IDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTM 301 (331) T ss_pred eEEcCcccEEEEeccchhccCCCcchhhhH----HHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeee Confidence 0 01111222 2222222233455567889999998776666544332221 Q ss_pred ---CCccccccCCCceeEecCCCCCCc--EE Q lcl|NC_019921. 287 ---NGVYVTALPFNLNVIESTVQEAGK--VL 312 (381) Q Consensus 287 ---~G~~~~~l~~G~pVv~s~~~p~~~--i~ 312 (381) .|..++ ...|+||..++++-... ++ T Consensus 302 ~~~~g~~~t-~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 302 EEIAGKKVV-AFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred eecCCccee-EECCeeEEEeeeeecCccccC Confidence 122222 22588888887764322 22 No 150 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.20 E-value=9.1e-07 Score=53.70 Aligned_cols=271 Identities=8% Similarity=-0.035 Sum_probs=137.1 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhh---------ceeEe--cCCc-eEEEEecCC-cceEEeecccccccc Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLAD---------LGIKN--AGLR-LKFLKSETS-GVAVWGKIYGEIKGQ 142 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~---------~~v~~--~~g~-~~~p~~~~~-~~a~wv~e~~~~~~~ 142 (381) |. +..-...++|+-|..-+.+.+.+.+.+++- ..... .+|. +.+|....- +.+.-+.++.+++.+ T Consensus 1 MA--~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~ 78 (324) T protein:vir:59 1 MA--YTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQ 78 (324) T ss_pred CC--ceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchh Confidence 22 222244678998888887777777666432 12222 2343 788887553 455555666665543 Q ss_pred cCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccc Q lcl|NC_019921. 143 LDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAY 222 (381) Q Consensus 143 ~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~ 222 (381) ..+-++-.-..+..+.-..++.+-..-+.-|....+.+.+++...+..+..+|.-- +|++......... T Consensus 79 -~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l-----~g~~~~~~~~~~~----- 147 (324) T protein:vir:59 79 -KINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAEL-----AGVFSNDDMKDNK----- 147 (324) T ss_pred -hcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhhhccccccce----- Confidence 33333434444445545567765555567788888999999999888887665411 1222111000000 Q ss_pred cceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhh---ccCCCCccccccCCCce Q lcl|NC_019921. 223 PEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HLNANGVYVTALPFNLN 299 (381) Q Consensus 223 ~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~---~~~~~G~~~~~l~~G~p 299 (381) .... ...+.....+.+.+....+ . .....-.+|+||+.++..+++... .+.++|...-....|++ T Consensus 148 ----~dvs-a~~~~~~s~~~l~~A~~~~---G----D~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~~~~G~~ 215 (324) T protein:vir:59 148 ----LDIS-GTADGIYSAETFVDASYKL---G----DHESLLTAIGMHSATMASAVKQDLIEFVKDSQSGIRFPTYMNKR 215 (324) T ss_pred ----eeee-ccccceecHHHHHHHHHHh---C----CcccCcEEEEEchHHHHHHHHhhhhhhccccccCceeeeecccE Confidence 0000 0011112334444444332 1 112234579999999999876422 23444443333447999 Q ss_pred eEecCCCCCCc----------EEEEeecceEEEe-ecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEec Q lcl|NC_019921. 300 VIESTVQEAGK----------VLTYVKGLYDGYL-AGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) Q Consensus 300 Vv~s~~~p~~~----------i~fgd~~~y~i~~-r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~ 368 (381) |+.++.||... .+|+.-. ..+.. +.++.++..++. ..+++.+....++- +.+..+..-+-... T Consensus 216 VivdD~~p~~~~~~~~~~y~s~l~~~GA-i~~~~~~~~v~vE~dRd~--~~g~~~l~~r~~~~---~~p~G~s~~~~~~~ 289 (324) T protein:vir:59 216 VIVDDSMPVETLEDGTKVFTSYLFGAGA-LGYAEGQPEVPTETARNA--LGSQDILINRKHFV---LHPRGVKFTENAMA 289 (324) T ss_pred EEEeCCCCccccCCCCceEEEEEEecCe-EEEeecCCCcceecccCc--cccceEEEEeeEEE---eEeeeEEecccccC Confidence 99999998421 2333211 22222 334555555553 34566666666643 33334332111111 Q ss_pred CCccccccCcccC Q lcl|NC_019921. 369 GHKPALEGTEETL 381 (381) Q Consensus 369 ~~~~~~~~~~~~~ 381 (381) +.. .|...| T Consensus 290 ~~s----Pt~~~L 298 (324) T protein:vir:59 290 GTT----PTDEEL 298 (324) T ss_pred CCC----CChhhh Confidence 111 122222 No 151 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.15 E-value=2.7e-07 Score=56.60 Aligned_cols=296 Identities=8% Similarity=-0.044 Sum_probs=147.4 Q ss_pred cCHHHHHHHHHHhhc-c-CCCCce-eccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEecCCcceEEeeccccc Q lcl|NC_019921. 65 LSANQRSFFMDINKN-V-NYKEEK-LLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGKIYGEI 139 (381) Q Consensus 65 lt~~e~~~~~~~~~~-~-~~~gg~-lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~-~~~p~~~~~~~a~wv~e~~~~ 139 (381) +|- .|.+... . +++.-+ +-=+.+..++.+.....+.++++..+.++. |+ +++|+-.. .++....-+.+. T Consensus 1 Ms~-----~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG~-~~a~y~~~G~~l 74 (402) T protein:vir:97 1 MST-----PNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGE-TELQVLAPGQSP 74 (402) T ss_pred CCC-----cccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEee-eEEeeecccccc Confidence 100 0111100 0 111112 334889999999999999999999988864 44 67787533 333333333333 Q ss_pred ccccCcceeeEeecceeE-EEeeeccHHhhhcCHHH-HHHHHHHHHHHHHHHHHhhheee---ccC--CCcceEeeeccc Q lcl|NC_019921. 140 KGQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAW-IERFVRVQIEEAFAVALETAFLK---GTG--KDQPIGLNRQVQ 212 (381) Q Consensus 140 ~~~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~-~e~~l~~~la~~~~~~~~~a~i~---G~G--~~~P~Gil~~~~ 212 (381) .. ..+..++..|....+ ++...|..----++.+| +.+.+.+++++++++..|..++. -.| ...|.+=.. . T Consensus 75 dg-~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~--~ 151 (402) T protein:vir:97 75 NA-TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKP--R 151 (402) T ss_pred CC-CCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccC--c Confidence 32 345556666665443 33333332222255677 78899999999999999997742 111 111110000 0 Q ss_pred cccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhc-----c-CC Q lcl|NC_019921. 213 KGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH-----L-NA 286 (381) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~-----~-~~ 286 (381) ... .+. ....+.+...+......+.+.+..+......+..+..+ -+.+++|..++.++....+ . .+ T Consensus 152 -~~~--~g~----s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~d-Rv~vv~P~~y~~Ll~~~rl~n~d~~~~~ 223 (402) T protein:vir:97 152 -VKG--HGF----SINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISD-VAIMMPWKFFNALRDADRIVDKTYTISQ 223 (402) T ss_pred -ccc--ccc----ccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccc-cEEEeChHHHHHHhhcccccchhhcccc Confidence 000 000 00001122222233333333333333222334445555 4788999988877643211 1 24 Q ss_pred CCccccc---cCCCceeEecCCCCCCc--E---------------EEEeecc--eEEEeecceE--------EEee-hhh Q lcl|NC_019921. 287 NGVYVTA---LPFNLNVIESTVQEAGK--V---------------LTYVKGL--YDGYLAGGIN--------VQKF-KET 335 (381) Q Consensus 287 ~G~~~~~---l~~G~pVv~s~~~p~~~--i---------------~fgd~~~--y~i~~r~~i~--------i~~~-~~~ 335 (381) +|.|.++ ...|.||+.|+++|.+. + .-||++. .+++.+..+- -+.. +.. T Consensus 224 ~g~~~~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r 303 (402) T protein:vir:97 224 SGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK 303 (402) T ss_pred CCccccceeEEEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchh Confidence 5556654 33699999999999521 1 1144432 2333332222 1211 122 Q ss_pred hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 336 LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 336 ~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) .|..- +.+++-++-.+.+|+|..|+.++- ..+++.++-|-|= T Consensus 304 ~~~~~---id~~~a~G~g~~RPeaa~vv~~~~-~~t~~~~~~~~~~ 345 (402) T protein:vir:97 304 EKTYY---IDTFMAEGAIPDRWEAVSVVTTKR-DATTGDAGGPGDD 345 (402) T ss_pred HHHHH---HHHHHHhCCcccCccceEEEEEec-ccccccCCccccc Confidence 22222 234445688899999999998877 3333333333222 No 152 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=98.13 E-value=3.2e-07 Score=56.19 Aligned_cols=279 Identities=8% Similarity=-0.058 Sum_probs=141.4 Q ss_pred HHHHHHHhhc--cCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe---cCC-ceEEEEecCCcceEEeeccccccccc Q lcl|NC_019921. 70 RSFFMDINKN--VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN---AGL-RLKFLKSETSGVAVWGKIYGEIKGQL 143 (381) Q Consensus 70 ~~~~~~~~~~--~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~---~~g-~~~~p~~~~~~~a~wv~e~~~~~~~~ 143 (381) ..+.|.+.-. +.+.-...||+-++.+|.+.+.....+.++++-.+ .+| .+++|+.. .+++.-..++..+.. . T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g-~~~~~d~~~~~~i~~-~ 78 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS-ELGVEDKATDVPVGV-Q 78 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC-cceeeeecCCCcccc-c Confidence 1111111100 11112235899999999999988887777765332 234 47888754 344444444444432 2 Q ss_pred CcceeeEeecc-eeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeecc--CCCcceEeeeccccccccccc Q lcl|NC_019921. 144 DAAFSEETAIQ-NKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGT--GKDQPIGLNRQVQKGVSVTEG 220 (381) Q Consensus 144 ~~~f~~v~l~~-~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~--G~~~P~Gil~~~~~~~~~~~~ 220 (381) +.+-.++++.. +..+.-+.|+..-..++..|+.+.+.+..++++++..|..++.-- ++.++.+-. T Consensus 79 ~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~------------ 146 (341) T protein:vir:94 79 PVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNV------------ 146 (341) T ss_pred cccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCcc------------ Confidence 33345566666 333555778876666778899999999999999999998876321 111111100 Q ss_pred cccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC--CCCc--ccc---c Q lcl|NC_019921. 221 AYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN--ANGV--YVT---A 293 (381) Q Consensus 221 ~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~--~~G~--~~~---~ 293 (381) .... ....+........+.+.++...|. ....+..+ -+.+++|..+..++....+.+ ..|+ ... + T Consensus 147 -~~~~--~~~~t~~~~~~~~~~i~~a~~~Ld----e~~VP~~g-R~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig 218 (341) T protein:vir:94 147 -FSSS--NGAITGNGQAFSFAVFLAARRLLL----EADVPEEK-IVLLISPGQESALFTIPQFISKDFINNAPIAQGQIG 218 (341) T ss_pred -ccCc--cccccCchhhhhHHHHHHHHHHHh----hcCCCccC-CEEEeCHHHHHHHhhchhhhhhhccccchhheeeee Confidence 0000 000111111223344444444332 22223233 456789998888764211111 1111 111 1 Q ss_pred cCCCceeEecCCCCCCcEE-----------------------E----Eeecce--EEEeec---ceEEEe---------- Q lcl|NC_019921. 294 LPFNLNVIESTVQEAGKVL-----------------------T----YVKGLY--DGYLAG---GINVQK---------- 331 (381) Q Consensus 294 l~~G~pVv~s~~~p~~~i~-----------------------f----gd~~~y--~i~~r~---~i~i~~---------- 331 (381) ..+|.+|+.|..+|.+... + +|+..+ +++-+. ++.+.. T Consensus 219 ~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~ 298 (341) T protein:vir:94 219 SLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSK 298 (341) T ss_pred eEeceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccc Confidence 2369999999999964311 0 011111 111111 111000 Q ss_pred ----ehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcc Q lcl|NC_019921. 332 ----FKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 332 ----~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~ 372 (381) .-+..-..-...+++.+-++.++++|++.| .|+..+.+- T Consensus 299 ~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v--~~~~~~~~~ 341 (341) T protein:vir:94 299 APRVTQSFENREQVWLMVGRQAYGARLYRPLHAV--NIHTTGDTV 341 (341) T ss_pred cccccccchhhhhhhhhhhhhhhcccccCcceeE--EEecCcCCC Confidence 001111123456788899999999999965 344433322 No 153 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.05 E-value=6.2e-07 Score=54.61 Aligned_cols=259 Identities=11% Similarity=0.005 Sum_probs=145.0 Q ss_pred HhhccCCCCceeccH---HHHHHHHHHHHhhhhhhhhceeEec--CCceEEEEecCCcceEEeecccccccccCccee-- Q lcl|NC_019921. 76 INKNVNYKEEKLLPE---ETIDRIFEDLTTNHPLLADLGIKNA--GLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFS-- 148 (381) Q Consensus 76 ~~~~~~~~gg~lvP~---~~~~~I~~~l~~~~~l~~~~~v~~~--~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~-- 148 (381) |.+..-...--++|. ++.++.-+.+.+...+++..+.+|+ +..+++|+..-.+.+.-|+|+++++ -+..+.. T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Ip-lskvt~~~~ 79 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIP-LSKVTRTKD 79 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCcccc-hhhheeeee Confidence 222211111224433 3445554455555556666677886 3468999988788888899988876 3444443 Q ss_pred -eEeecceeEEEeeeccHHhhhcCH-HHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeecccccccccccccccee Q lcl|NC_019921. 149 -EETAIQNKLTAFVVLPKDLNDFGP-AWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKE 226 (381) Q Consensus 149 -~v~l~~~kl~~~~~iS~ell~ds~-~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~ 226 (381) ..++..+|+..- +|.|-++.|. -+-...-.+.|..+++..++..|+.=-.++ . T Consensus 80 ~t~t~kikK~rK~--tTdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lkta------------t----------- 134 (295) T protein:vir:99 80 KDYTVKWFKKRRA--TTAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTK------------P----------- 134 (295) T ss_pred eeeEEEeeeeccc--ccHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccC------------c----------- Confidence 366666777764 4899885444 346778888999999999998887621110 0 Q ss_pred eeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhc--cCC--CC-ccccccCCCce-e Q lcl|NC_019921. 227 EQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH--LNA--NG-VYVTALPFNLN-V 300 (381) Q Consensus 227 ~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~--~~~--~G-~~~~~l~~G~p-V 300 (381) .+. ..+..-..+..+...+..... .+....+.+|||.+.+.+++.... +.+ -| +|+-. .+|.. | T Consensus 135 --~t~---tg~~lq~a~a~~~~al~~f~E----e~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~n-fLG~q~I 204 (295) T protein:vir:99 135 --TKV---KGVGLQKALSASWAKLATFNE----FEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLKN-FLGMQNV 204 (295) T ss_pred --eee---ehhhHHHHHHHhhhhhhhccc----ccCCceEEEEehHHHHHHHhccccccchhhhhhhhhhhh-hhccceE Confidence 000 011122233333333333222 233456899999999988753222 222 12 34322 25875 9 Q ss_pred EecCCCCCCcEEEEee---cceEEEee-cceEEEeehhhhhhcCceEEEEEEEE-------------cCE---EecCceE Q lcl|NC_019921. 301 IESTVQEAGKVLTYVK---GLYDGYLA-GGINVQKFKETLALDDMDLYTAKQFA-------------YGK---AKDNKVA 360 (381) Q Consensus 301 v~s~~~p~~~i~fgd~---~~y~i~~r-~~i~i~~~~~~~~~~d~~~~r~~~r~-------------dGk---~~~~~Af 360 (381) +.|..+|+|+++.--- ..|.+..+ +++. .-..+..|++|+.|..+- .|- |=..++. T Consensus 205 I~S~kv~~G~~~aT~~~Ni~~ay~~~~~g~l~----~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgi 280 (295) T protein:vir:99 205 IVMPSVPEGKIYSTAVENLVFASLNVKGGDLG----GLFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGV 280 (295) T ss_pred EEcccCCCceEEEeeccceEEEEecCCchhhh----hhhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceE Confidence 9999999999875432 33333333 3333 334455788888887542 122 2234466 Q ss_pred EEEEEEecCCc-ccccc Q lcl|NC_019921. 361 AVWKLDLKGHK-PALEG 376 (381) Q Consensus 361 vv~~~~~~~~~-~~~~~ 376 (381) |+.++. +.+ |..-| T Consensus 281 v~~tI~--~~~~~~~~~ 295 (295) T protein:vir:99 281 VEATIE--AAAVPGIGG 295 (295) T ss_pred EEEEEe--cCcCCCCCC Confidence 655553 333 23333 No 154 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=97.98 E-value=4.6e-06 Score=49.86 Aligned_cols=283 Identities=7% Similarity=-0.079 Sum_probs=129.1 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe---------cCCc-eEEEEecCC-cceEEeeccc-cccccc Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN---------AGLR-LKFLKSETS-GVAVWGKIYG-EIKGQL 143 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~---------~~g~-~~~p~~~~~-~~a~wv~e~~-~~~~~~ 143 (381) |...+..-...++|+.|..-+.+.+.+.+.+++-.-+.+ .+|. +.+|.-..- +.+.-+.++. .++... T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~k 80 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETGK 80 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccchhh Confidence 332223334467899888888777777666644221211 2343 788976543 4443344432 343221 Q ss_pred CcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeecccccccccccccc Q lcl|NC_019921. 144 DAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYP 223 (381) Q Consensus 144 ~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~ 223 (381) .+=++-.-..++.+.-..++..-..-+.-|....+.+.+++...+..+..++.- -.|++.......... . T Consensus 81 -i~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~-----l~gvf~~~~~~~~~~----~ 150 (330) T protein:vir:10 81 -ITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIAT-----LNGIFATGTAGEKGA----L 150 (330) T ss_pred -cccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHH-----HHhhhhhhhcccchh----h Confidence 222222333334444455555555556778888899999988777776655431 012322111000000 0 Q ss_pred ceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhh---ccCCCCccccccCCCcee Q lcl|NC_019921. 224 EKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HLNANGVYVTALPFNLNV 300 (381) Q Consensus 224 ~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~---~~~~~G~~~~~l~~G~pV 300 (381) ........+...+...++.+.+....+ . .....-.+|+||+.++..+++... .+.++|...-....|++| T Consensus 151 ~~~~~~~~~~~~a~~s~~~l~~A~~~~---G----D~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~~~~G~~V 223 (330) T protein:vir:10 151 EETHVSDQSKASTGIDAGMVLDAKQLL---G----DSADQVTAIAMHSAVYTKLQKDNLIQYIQPTTATINIPTYLGYRV 223 (330) T ss_pred hhhheecccccccccCHHHHHHHHHHh---c----cccccceEEEEcHHHHHHHHHhhhhhhhcccccCcccccccceEE Confidence 000001111112222334444433322 1 112234579999999999886432 234444322233359999 Q ss_pred EecCCCCCCc-----EEEEeecceEEEe---ecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEE--EecCC Q lcl|NC_019921. 301 IESTVQEAGK-----VLTYVKGLYDGYL---AGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKL--DLKGH 370 (381) Q Consensus 301 v~s~~~p~~~-----i~fgd~~~y~i~~---r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~--~~~~~ 370 (381) +.++.||... .+|+.-. +.+.+ ...+.++..++.. .+++.+....++- +.+..+..-.- ...+. T Consensus 224 ivdD~~p~~~~~yt~yl~~~GA-i~~~~~~~~~~v~~EtdRd~~--~g~~~l~~r~~~~---~hp~G~s~~~~~~~~~~~ 297 (330) T protein:vir:10 224 IIDDGIAPTGDIYTSYLFRTGS-IGLNTGNPSGLTTFETSREAA--KGNDMIYTRRALV---MHPYGVKWTGAEVDAGNI 297 (330) T ss_pred EEeCCCCCCCCceeEEEEecCc-eeeecccCCccccccccCCcc--ccceEEEEeeEEE---eeeeeeeecccccccCcC Confidence 9999999422 2333211 22222 2224455555543 3445555554432 33333332110 11111 Q ss_pred ccccc--cCcccC Q lcl|NC_019921. 371 KPALE--GTEETL 381 (381) Q Consensus 371 ~~~~~--~~~~~~ 381 (381) .|+-+ +++... T Consensus 298 sPt~~~L~~~~NW 310 (330) T protein:vir:10 298 TPSNADLAKFKNW 310 (330) T ss_pred CcChHHhcCCcCc Confidence 22111 111111 No 155 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=97.97 E-value=1.6e-06 Score=52.31 Aligned_cols=239 Identities=14% Similarity=0.095 Sum_probs=133.0 Q ss_pred HhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEec--CCceEEEEecCCcceEEee Q lcl|NC_019921. 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNA--GLRLKFLKSETSGVAVWGK 134 (381) Q Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~--~g~~~~p~~~~~~~a~wv~ 134 (381) +...+...+|--|. .+. +-|......|+|.+.+.++|+..+.+... +....-.+..+-|.+.|-. T Consensus 1 m~~~~~~a~TL~E~------Akr-------~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~fR~ 67 (335) T protein:vir:73 1 MALIGQTLPSLLDI------YNR-------TDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPVWRR 67 (335) T ss_pred CCcCCCCchhHHHH------Hhh-------cCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCchhhh Confidence 21111122222111 111 22455667799999999999999887643 2223345667788999999 Q ss_pred cccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHH---HHHHHHHHHHHhhheeeccCCCcceEee--- Q lcl|NC_019921. 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVR---VQIEEAFAVALETAFLKGTGKDQPIGLN--- 208 (381) Q Consensus 135 e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~---~~la~~~~~~~~~a~i~G~G~~~P~Gil--- 208 (381) -+...+ ++.+++.+++-..+-+.+...|.+.|.+.+. |..+|-. ....++++......||+||-+..|.++. T Consensus 68 lN~g~~-~s~~tt~qvt~~l~ilgg~~eVDr~La~~~G-n~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~ 145 (335) T protein:vir:73 68 YNQGVQ-PTKTQTVPVTDTTGMLYDLGFVDKALADRSN-NAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLA 145 (335) T ss_pred cCCccc-cccceEEEEEEEEEEecchhhhhHHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchh Confidence 988776 4679999999999999999999998877654 4554444 4588999999999999999877776543 Q ss_pred ecc---c------------ccccc---------------ccccccceee-------eeeecccc---------------- Q lcl|NC_019921. 209 RQV---Q------------KGVSV---------------TEGAYPEKEE-------QGTLTFAN---------------- 235 (381) Q Consensus 209 ~~~---~------------~~~~~---------------~~~~~~~~~~-------~~~~t~~~---------------- 235 (381) +.. . .+.+. ..|.||.... .|..+..+ T Consensus 146 kR~~~~st~~a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~ 225 (335) T protein:vir:73 146 PRFNTLSTSKAASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWD 225 (335) T ss_pred hhhcCccccccCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeee Confidence 211 1 00000 0011221110 00000000 Q ss_pred -----------------------c-chhHHHHHHH-HHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC----- Q lcl|NC_019921. 236 -----------------------P-RATVNELTQV-FKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN----- 285 (381) Q Consensus 236 -----------------------~-~~~~~~l~~l-~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~----- 285 (381) . ......|.++ +..+. .+..|..-.++.+|+||+.-...+......+. T Consensus 226 ~Gl~i~d~r~vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~--~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~ 303 (335) T protein:vir:73 226 IGLSVRDWRSISRICNIDVTTLTKDASTGADLISMMVDAYY--ARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLT 303 (335) T ss_pred eeeEEeCcccEEEEeecccccccccccchhhHHhhHHHHHH--HHhccCCCCCceEEEechHHHHHHHHHHhccCceeee Confidence 0 0011112221 11110 01224445577899999876665654433221 Q ss_pred ---CCCccccccCCCceeEecCCCCCCc--EEE Q lcl|NC_019921. 286 ---ANGVYVTALPFNLNVIESTVQEAGK--VLT 313 (381) Q Consensus 286 ---~~G~~~~~l~~G~pVv~s~~~p~~~--i~f 313 (381) ..|..++-. .|+||..++++-... +.. T Consensus 304 ~~~~~g~~~t~~-~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 304 IEEYGGKKIVSF-LGIPIRRVDAILNTESAVTA 335 (335) T ss_pred eeccCCceeEEE-CCeEEEEEeeeecCcccccC Confidence 122222222 488888887764322 222 No 156 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=97.87 E-value=1.8e-06 Score=52.11 Aligned_cols=295 Identities=8% Similarity=0.005 Sum_probs=147.3 Q ss_pred cCHHHHHHHHHHhh---ccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEecCCcceEEeeccccc Q lcl|NC_019921. 65 LSANQRSFFMDINK---NVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGKIYGEI 139 (381) Q Consensus 65 lt~~e~~~~~~~~~---~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~-~~~p~~~~~~~a~wv~e~~~~ 139 (381) +|- .|..+. ++..+--.+.=+.+..++.+.....+.++++..++++. |+ +++|+- +...+....-+.++ T Consensus 1 Ms~-----~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G~s~a~y~~pG~~l 74 (400) T protein:vir:10 1 MST-----PNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSP 74 (400) T ss_pred CCC-----CccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEEeeecCCCCc Confidence 100 000000 11111113456888999999999999999999998874 44 677775 44555555554444 Q ss_pred ccccCcceeeEeecceeE-EEeeeccHHhhhcCHHH-HHHHHHHHHHHHHHHHHhhheee----cc-C-CCcceEeeecc Q lcl|NC_019921. 140 KGQLDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAW-IERFVRVQIEEAFAVALETAFLK----GT-G-KDQPIGLNRQV 211 (381) Q Consensus 140 ~~~~~~~f~~v~l~~~kl-~~~~~iS~ell~ds~~~-~e~~l~~~la~~~~~~~~~a~i~----G~-G-~~~P~Gil~~~ 211 (381) .. +.+..++..|....+ ++...|..=---++.+| +.+.+..++++++++..|++++. +. - +..|.|.-... T Consensus 75 dg-~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~ 153 (400) T protein:vir:10 75 AA-TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVK 153 (400) T ss_pred CC-CCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCcc Confidence 33 345666766666444 45555544334456677 78999999999999999987652 10 0 12222211000 Q ss_pred ccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhh--hhccC---- Q lcl|NC_019921. 212 QKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ--YTHLN---- 285 (381) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~--~~~~~---- 285 (381) ..+.+ ........-...++..+...+.++...| ..+..++ ...+.++.|..|..++.. .+.++ T Consensus 154 ~~g~s------~~v~~~~~~~~~~~~~l~~A~~~A~~~L----dEkdVP~-~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s 222 (400) T protein:vir:10 154 GHGFS------VNVEVNEGEALVNPQYVMAAVEFALEQQ----LEQEVDI-SDVAILMPWRYFNVLRDADRIVDKSYTIS 222 (400) T ss_pred ccccc------eeecccccccccCHHHHHHHHHHHHHHH----HhcCCCc-cceEEEcCHHHHHHHHhCCcccchhcccc Confidence 00000 0000000001112222222232332222 2223333 345566666555455432 11111 Q ss_pred CCCccccc---cCCCceeEecCCCCCCc-------E----------EEEeecce--EEEeecceE--------EEee-hh Q lcl|NC_019921. 286 ANGVYVTA---LPFNLNVIESTVQEAGK-------V----------LTYVKGLY--DGYLAGGIN--------VQKF-KE 334 (381) Q Consensus 286 ~~G~~~~~---l~~G~pVv~s~~~p~~~-------i----------~fgd~~~y--~i~~r~~i~--------i~~~-~~ 334 (381) ++|.|+++ ...|+||+.|+.+|... + .-|||+.- +++.+..+- -+.. ++ T Consensus 223 ~~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~ 302 (400) T protein:vir:10 223 QSGATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEK 302 (400) T ss_pred CCCccccceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccch Confidence 24556654 23699999999998521 1 12455442 223333222 2211 22 Q ss_pred hhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 335 TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 335 ~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) ..|.. .+.+++-++-.+.+|+|.+|++.+-. .++++-+-|-.- T Consensus 303 r~~~~---~id~~~a~G~g~~RPeaa~vv~~~~~-~~~~~~~~~~~~ 345 (400) T protein:vir:10 303 KEKTY---YIDTFMSEGAIPDRWEAVSVVTTKRQ-STGAVDSGNAAQ 345 (400) T ss_pred hhHHH---HHHHHHHhCCcccchhheEEEEecCC-cccccccCcchh Confidence 22332 23344557889999999999887663 344444333222 No 157 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=97.83 E-value=1e-05 Score=47.92 Aligned_cols=279 Identities=10% Similarity=-0.029 Sum_probs=129.6 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhce------e---EecCCc-eEEEEecCC-cceEEeecccccccccC Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLG------I---KNAGLR-LKFLKSETS-GVAVWGKIYGEIKGQLD 144 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~------v---~~~~g~-~~~p~~~~~-~~a~wv~e~~~~~~~~~ 144 (381) |. +..-...++|+.|..-+.+...+.+.+++-.- + ...+|. +.+|.-..- +.+.-+.++.++..+.- T Consensus 1 MA--~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~ki 78 (351) T protein:vir:15 1 MA--ETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNL 78 (351) T ss_pred CC--ceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchhee Confidence 22 22224467899888877777766666544211 1 112343 788986543 45555566666554322 Q ss_pred cceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccc Q lcl|NC_019921. 145 AAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPE 224 (381) Q Consensus 145 ~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~ 224 (381) .+..+ .-..+..+.-..++..-..-+.-|....+.+.+++...+..+..+|.-- +|++....... +... + T Consensus 79 tt~~~-~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l-----~gv~~~~~~~~---~~~~-d 148 (351) T protein:vir:15 79 TSGKQ-QGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVL-----KGVMGVTKIAN---SKVY-D 148 (351) T ss_pred cccce-eEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhhchhhcc---ccee-c Confidence 22222 2233334444566665555566688888999999988888877765410 12221100000 0000 0 Q ss_pred eeeeeeecccccchhHHHHHHHHHHhhhccccccccccC-ceEEEEchhhHHHHhhhhh---ccCCCCccccccCCCcee Q lcl|NC_019921. 225 KEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKG-NVTMVVNPSDAFEVQAQYT---HLNANGVYVTALPFNLNV 300 (381) Q Consensus 225 ~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~-~a~~~mn~~t~~~~~~~~~---~~~~~G~~~~~l~~G~pV 300 (381) +...+..+.....+.+.+....+- + .... -.+|+||+.++..+++... .+.++|...-..-.|++| T Consensus 149 ---~t~~~~~~~~is~~~l~~A~~~~G---D----~~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~t~~G~~V 218 (351) T protein:vir:15 149 ---QTKVSPSEPMFGAKGFTGAIGLMG---D----LQDTAFGAIAVNSATYSLMKVQGLIETIQPQNGATPFEAYNGLRI 218 (351) T ss_pred ---cccccccccccCHHHHHHHHHHhc---c----ccccceEEEEEChHHHHHHHhhhhhhhccccccCcccceecceEE Confidence 000011222233455555444331 1 1111 2679999999998875432 344444322222259999 Q ss_pred EecCCCCCC----------cEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEe--c Q lcl|NC_019921. 301 IESTVQEAG----------KVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDL--K 368 (381) Q Consensus 301 v~s~~~p~~----------~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~--~ 368 (381) +.++.||.. ..+||.-. ..+.. ++..++..++.....++.......++ .+.+..+..-.-.. . T Consensus 219 ivdD~~p~~~~~~~~~~ytsyl~~~GA-i~~~~-~~~~ve~~rd~~~~~g~d~l~~r~~~---~~hp~G~s~~~~~~~~~ 293 (351) T protein:vir:15 219 VLDDDIEIDLTDKTKPVSTSYIFAPGA-VRYST-NMRSTETKYDPLINGGQDVIVQKRVG---TIHVAGTSIKASFSPSK 293 (351) T ss_pred EEcCCCccccCCCCCceeEEEEEecce-eeeec-CCcCcceeecccCCCCceEEEEeeee---eeeeeeeeecccccccC Confidence 999999842 12333211 11122 22334444444444444444443332 24444433211000 0 Q ss_pred CCccccc--cCcccC Q lcl|NC_019921. 369 GHKPALE--GTEETL 381 (381) Q Consensus 369 ~~~~~~~--~~~~~~ 381 (381) ...|+-+ +++... T Consensus 294 ~~sPt~~~L~~~~NW 308 (351) T protein:vir:15 294 ASFPTIDELAKSSTW 308 (351) T ss_pred cCCcChHHhcCCccc Confidence 1112111 111111 No 158 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=97.82 E-value=1.7e-05 Score=46.76 Aligned_cols=273 Identities=12% Similarity=0.077 Sum_probs=145.3 Q ss_pred cCHHHHHHHHHHhhccCCCCceecc--HHHHHHHHHHHHhhhhhhhhceeEe-cCC---ceEEEEecCCcceEEeecccc Q lcl|NC_019921. 65 LSANQRSFFMDINKNVNYKEEKLLP--EETIDRIFEDLTTNHPLLADLGIKN-AGL---RLKFLKSETSGVAVWGKIYGE 138 (381) Q Consensus 65 lt~~e~~~~~~~~~~~~~~gg~lvP--~~~~~~I~~~l~~~~~l~~~~~v~~-~~g---~~~~p~~~~~~~a~wv~e~~~ 138 (381) |+- .. .+..|-+++. +.+...|++.....-..+.++.+.. .+. .+.+...+..+.+.|.+..+. T Consensus 1 ~~~---------~~-a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~ 70 (296) T protein:vir:10 1 MGV---------DK-ADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTD 70 (296) T ss_pred Ccc---------cc-hhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCcc Confidence 111 10 1222223332 2345555554433323333333332 111 234455566677888776544 Q ss_pred cccccCcceeeEeecceeEEEeeeccHHhhhcC---HHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeecccccc Q lcl|NC_019921. 139 IKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFG---PAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGV 215 (381) Q Consensus 139 ~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds---~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~ 215 (381) .-+..+..+.+.....+.++.-+.++.+=|+.+ ..+++.--....++++++.+|..+++|+....-.|+|+.+.... T Consensus 71 dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~ 150 (296) T protein:vir:10 71 DLPLVDALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINN 150 (296) T ss_pred ccceeeccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCcc Confidence 223456677888888888888888887766644 56788888889999999999999999998878889998766433 Q ss_pred ccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCcc----c Q lcl|NC_019921. 216 SVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVY----V 291 (381) Q Consensus 216 ~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~----~ 291 (381) .+..+.|. ++...++++..++..+.....+ +.+...++++|.-+..+... .++.|.- + T Consensus 151 ~~~~~~W~-----------~~t~i~~Di~~~~~~l~~~s~g----~~~p~~l~L~p~~~~~L~~~---~~~~~~t~l~~i 212 (296) T protein:vir:10 151 VVSGGSWS-----------QPTTAVSDITSLLDIIETSTNG----QHRATHLLLPTTARRIMQNL---VPGTSVSYGEFF 212 (296) T ss_pred ccccCCcc-----------CHHHHHHHHHHHHHHHHHhhCc----eecceeEEeCHHHHHHHhhc---cCCCCccHHHHH Confidence 33322221 1223345555555444332222 22334688888776555321 1222321 1 Q ss_pred cccCCCceeEecCCCCC----C--cEEEEeec-ce-EEEeecceEEEeehhhhhhcCceEEEEEEEEc-CEEecCceEEE Q lcl|NC_019921. 292 TALPFNLNVIESTVQEA----G--KVLTYVKG-LY-DGYLAGGINVQKFKETLALDDMDLYTAKQFAY-GKAKDNKVAAV 362 (381) Q Consensus 292 ~~l~~G~pVv~s~~~p~----~--~i~fgd~~-~y-~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~d-Gk~~~~~Afvv 362 (381) .....+..|+....+.. + .+++.+.+ +| .+...+.++... .....-...+++..|+. ..+..|.|+++ T Consensus 213 k~~~~~l~i~~~~~l~~a~~~g~~~~v~~~~~~~~~~~~v~~~~~~~~---~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~ 289 (296) T protein:vir:10 213 RQNNSGVTVEFVQYLNDYNGTGTSAAIAYEKDPNNMAIEIPEATNALP---AQPKDLHFKIPVTSKATGLIVYRPLTMAV 289 (296) T ss_pred HHhcCCceEEEeeeeccCCCCcceEEEEEEcCCceEEEEcCcceeeec---ccccCceEEEeeEeeEEEEEEECCceeEE Confidence 11112334443333321 1 13333322 23 233334443322 11223345677788886 67888999987 Q ss_pred EE-EEec Q lcl|NC_019921. 363 WK-LDLK 368 (381) Q Consensus 363 ~~-~~~~ 368 (381) ++ |+++ T Consensus 290 ~dGI~~~ 296 (296) T protein:vir:10 290 MKGITFA 296 (296) T ss_pred EeeeecC Confidence 64 2333 No 159 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=97.70 E-value=4.9e-06 Score=49.70 Aligned_cols=280 Identities=13% Similarity=0.063 Sum_probs=139.1 Q ss_pred HhhccCC-CCce-eccHHHHHHHHHHHHhhhhhhhhceeEe--cCCceEEEEecCCcceEEeecccccc-cccCcceeeE Q lcl|NC_019921. 76 INKNVNY-KEEK-LLPEETIDRIFEDLTTNHPLLADLGIKN--AGLRLKFLKSETSGVAVWGKIYGEIK-GQLDAAFSEE 150 (381) Q Consensus 76 ~~~~~~~-~gg~-lvP~~~~~~I~~~l~~~~~l~~~~~v~~--~~g~~~~p~~~~~~~a~wv~e~~~~~-~~~~~~f~~v 150 (381) |..+..+ ++-. .+|+.++.+|..-+.+.-....++++.. .|-.++||.-......... +++.+. .+.+.+=-.+ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~~tV~dY~-~~~~i~~d~ltt~~~~l 79 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGTPVVRSRP-EQGDFTFDNLDTGEISI 79 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEecccccccccccc-CCCCcccccCCCceEEE Confidence 4444333 3334 4599999999877766544334444433 2335777765443332221 112211 1112221256 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheee--ccCCCcceEeeeccccccccccccccceeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK--GTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~--G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~ 228 (381) .+...|+.++. |+.+. .++..||.+...++.+.+++...|..+.. =+|..+-.++- ...+..+... . T Consensus 80 ~IDq~KYfaf~-VdDD~-~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~-----~p~vin~~~~----~ 148 (322) T protein:vir:31 80 ILRDEVYAGNA-ISKKL-RQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQN-----DPNVINGVPH----R 148 (322) T ss_pred EEehhhhhccc-cchhH-HHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccC-----CcceecCCcc----c Confidence 66777777654 88855 56788999999999999999988876522 11111100000 0000011000 0 Q ss_pred eeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh----hcc-------CCCC----ccccc Q lcl|NC_019921. 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY----THL-------NANG----VYVTA 293 (381) Q Consensus 229 ~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~----~~~-------~~~G----~~~~~ 293 (381) ...+..++...++.+.++...| +....+..+ -..+++|.-+..+.... ..+ +.+| +..-+ T Consensus 149 iv~~gt~~~~ay~~lv~l~~kL----dkanVP~~g-R~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg 223 (322) T protein:vir:31 149 FVGTGTDQTMDVTDFSRVNYVM----TQSKMPMGG-MIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVR 223 (322) T ss_pred eeccCCCchhhHHHHHHHHHHh----ccccCCCCC-eEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHH Confidence 0112223444566666665444 333333333 34567787654332110 111 2233 22223 Q ss_pred cCCCceeEecCCCCCCc--EEEE---------eecceE-EEeecceEE----Eeehhhh-h---hcCceEEEEEEEEcCE Q lcl|NC_019921. 294 LPFNLNVIESTVQEAGK--VLTY---------VKGLYD-GYLAGGINV----QKFKETL-A---LDDMDLYTAKQFAYGK 353 (381) Q Consensus 294 l~~G~pVv~s~~~p~~~--i~fg---------d~~~y~-i~~r~~i~i----~~~~~~~-~---~~d~~~~r~~~r~dGk 353 (381) ..+|..|+.|..+|+++ |..| -.+-+. +-+.+-... ++.+... | ..+...+|+.+|.+.+ T Consensus 224 ~~~GF~V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~~~~d~~~~~~~~g~g 303 (322) T protein:vir:31 224 SVYGIDLFVSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYNDDLNTATTARWGNG 303 (322) T ss_pred HHhceeeeeeccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCccccccceeeeeeecce Confidence 34689999999998543 2222 222221 111111000 0111011 1 1345679999999999 Q ss_pred EecCceEEEEEEEecCCcc Q lcl|NC_019921. 354 AKDNKVAAVWKLDLKGHKP 372 (381) Q Consensus 354 ~~~~~Afvv~~~~~~~~~~ 372 (381) ++++|..+++.-.+.+.+- T Consensus 304 ~~r~e~l~~~~a~~~~~~~ 322 (322) T protein:vir:31 304 LVRDENLVCVLANADKVTF 322 (322) T ss_pred eecccceEEEEeccccccC Confidence 9999998877655543332 No 160 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=97.70 E-value=2.8e-05 Score=45.58 Aligned_cols=277 Identities=8% Similarity=-0.044 Sum_probs=147.7 Q ss_pred hccCCCCceecc--HHHHHHHHHHHHhhhhhhhhceeEe-cC-C--ceEEEEecCCcceEEeecccccccccCcceeeEe Q lcl|NC_019921. 78 KNVNYKEEKLLP--EETIDRIFEDLTTNHPLLADLGIKN-AG-L--RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEET 151 (381) Q Consensus 78 ~~~~~~gg~lvP--~~~~~~I~~~l~~~~~l~~~~~v~~-~~-g--~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~ 151 (381) -.+.++|.+++- +.+...|++.+...-..|.++.+.. .+ + .+.+...+..+.+.|.+..+..-+..+..+++.. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 233444543322 2345555665555544555544432 22 1 1345556667778888765543334566778888 Q ss_pred ecceeEEEeeeccHHhhhcC---HHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeee Q lcl|NC_019921. 152 AIQNKLTAFVVLPKDLNDFG---PAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) Q Consensus 152 l~~~kl~~~~~iS~ell~ds---~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~ 228 (381) ...+.++.-+.++..=|+.+ ..++..--....++++++.+|..+++|+....-.||++.+........... .... T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~--~~~~ 158 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTG--VGNV 158 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCcc--cccc Confidence 88888888888887766655 567888888999999999999999999988888999987653222111000 0001 Q ss_pred eeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCcccc----ccCCCceeEecC Q lcl|NC_019921. 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVT----ALPFNLNVIEST 304 (381) Q Consensus 229 ~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~----~l~~G~pVv~s~ 304 (381) ......+++..++++..++..+....++. .....++|+|.-+..+... ...+..|..+. .-..+..|+..+ T Consensus 159 ~~w~~~t~~ei~~di~~~~~~l~~~s~g~----~~p~~L~L~p~~~~~L~~~-~~~~~~~~tvl~~l~~~~~~~~I~~~p 233 (301) T protein:vir:80 159 SKWEKKTAEQIIDEIGEAHTKITVLPGYG----TASLKLCLPPKQFELINKK-RYSNEDSRSVLKVLQDNAWFSAIVRVP 233 (301) T ss_pred cccccCCHHHHHHHHHHHHHHHHHhcCce----ecccEEEecHHHHHhhhhc-cccCCCCeeHHHHHHHHcCcceEEEcc Confidence 11122344455566666655543322221 2234588998877666422 12233332221 111233444443 Q ss_pred CCCC----Cc--E-EEEeecce-EEEeecceEEEeehhhhhhcCceEE--EEEEEEcC-EEecCceEEEEEEEe Q lcl|NC_019921. 305 VQEA----GK--V-LTYVKGLY-DGYLAGGINVQKFKETLALDDMDLY--TAKQFAYG-KAKDNKVAAVWKLDL 367 (381) Q Consensus 305 ~~p~----~~--i-~fgd~~~y-~i~~r~~i~i~~~~~~~~~~d~~~~--r~~~r~dG-k~~~~~Afvv~~~~~ 367 (381) .... ++ + ++-+=.++ .+...+.++.. . ....-..| .+..|+.| .+..|.|+++++ =| T Consensus 234 ~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~--~---~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~-GI 301 (301) T protein:vir:80 234 DLAGMGTAGSDSFAVIHDSNETAELIIPMDITRH--P---EEYSFPRTKVPFEERTAGVVVRFPAAIVRVD-GI 301 (301) T ss_pred eeccCCCCcccEEEEEecCCcEEEEEecCceeee--c---ceecCceeEeeeeeeeEEEEEEccceEEEEe-cC Confidence 3321 11 2 22221122 22223333221 1 11111223 34566655 788899988765 33 No 161 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=97.68 E-value=3e-06 Score=50.86 Aligned_cols=267 Identities=10% Similarity=-0.031 Sum_probs=141.4 Q ss_pred hccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEE-EEecCCcceEEeec Q lcl|NC_019921. 59 PKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKF-LKSETSGVAVWGKI 135 (381) Q Consensus 59 ~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~-p~~~~~~~a~wv~e 135 (381) ....|.. +|+ +-....+=+...--+|.+++-+.+.+..-++...+.+|++. .++. |..+-...++-|+| T Consensus 1 ~~~~~~~-~e~-------nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaE 72 (296) T protein:vir:98 1 MVTSRTY-PEE-------NLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPE 72 (296) T ss_pred CCCcccc-CcC-------CCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceeeeeccccccC Confidence 0000000 000 00112222233345677777776766666677778888743 4544 44566677788999 Q ss_pred ccccccccCccee---eEeecceeEEEeeeccHHhhhcCH-HHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeecc Q lcl|NC_019921. 136 YGEIKGQLDAAFS---EETAIQNKLTAFVVLPKDLNDFGP-AWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQV 211 (381) Q Consensus 136 ~~~~~~~~~~~f~---~v~l~~~kl~~~~~iS~ell~ds~-~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~ 211 (381) +++++ -+..+-. ..++..+|+..- +|.|-++.|. -+-...-.+.|..+++..++..|+.=-.++ T Consensus 73 Ge~Ip-lskvt~~~~~t~t~~ikK~rK~--tTdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~Lkta--------- 140 (296) T protein:vir:98 73 GEVIP-LSKVERKIHSEKKIELKKYRKA--TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG--------- 140 (296) T ss_pred Ccccc-hhhheeeecceEEEEeeccccc--cCHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcc--------- Confidence 88876 3444433 366667777765 4999985444 346677888899999999988877521100 Q ss_pred ccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCC---CC Q lcl|NC_019921. 212 QKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA---NG 288 (381) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~---~G 288 (381) +. +.+ ...+..-..+.+.+-.+....... -....+.+|||.+.+.+++-..+..+ .+ T Consensus 141 T~----------------t~~-~t~~~lQ~Ala~~~~~l~~~fede---d~~~~V~FVnP~D~a~ylg~a~it~qt~fG~ 200 (296) T protein:vir:98 141 TG----------------TQD-ALGAGLQGALASAWGKLQVLFEDY---GSERAIVFANSLDVAEYIAKAGITTQTAFGL 200 (296) T ss_pred cc----------------eee-echhhHHHHHHHHhhhhhhhcccc---CCCceEEEEehHHHHHHhcCCccchhheech Confidence 00 000 001111122222222222111111 01356789999999988754333211 24 Q ss_pred ccccccCCCceeEecCCCCCCcEEEEee---cceEEEeecceEEEeehhhhhhcCceEEEEEEEE-------------cC Q lcl|NC_019921. 289 VYVTALPFNLNVIESTVQEAGKVLTYVK---GLYDGYLAGGINVQKFKETLALDDMDLYTAKQFA-------------YG 352 (381) Q Consensus 289 ~~~~~l~~G~pVv~s~~~p~~~i~fgd~---~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~-------------dG 352 (381) +|+... +|..|+.|..+|+|+++.--- ..|.+..++| +.....-+..|++|+.|..+- .| T Consensus 201 tyl~nf-LG~~II~S~kV~~G~~~~T~~~Ni~~ay~~~~~~---~l~~~f~~~~d~tglIGv~h~~~~~~~t~eT~~~~~ 276 (296) T protein:vir:98 201 TYLVDF-TGTVIISTNDVTKGEIWATVPENIIFAYINPNNS---ELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSG 276 (296) T ss_pred hhhhhc-cccEEEEcCcCCCceEEEeeecceEEEeeccccc---chhhhhccccccccceEEEeccccceeeehhHhHhH Confidence 455322 477899999999999875433 3333444433 333444455688888887542 12 Q ss_pred E---EecCceEEEEEEEecCCcccc Q lcl|NC_019921. 353 K---AKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 353 k---~~~~~Afvv~~~~~~~~~~~~ 374 (381) - |=..++.|+.++ +++. T Consensus 277 ~~lfpE~~dgiv~~tI-----~~~~ 296 (296) T protein:vir:98 277 MLMYPERIDGIVKVTL-----TPGV 296 (296) T ss_pred HHhcccccceEEEEEe-----cCCC Confidence 1 222345554444 2222 No 162 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=97.58 E-value=3.9e-06 Score=50.21 Aligned_cols=263 Identities=11% Similarity=-0.009 Sum_probs=141.6 Q ss_pred cCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC--Cce---EEEEecCCcceEEeeccccc Q lcl|NC_019921. 65 LSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG--LRL---KFLKSETSGVAVWGKIYGEI 139 (381) Q Consensus 65 lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~--g~~---~~p~~~~~~~a~wv~e~~~~ 139 (381) ++.++. + ....+=+..+--+|.+++-+.+.+..-++...+.+|+. ..+ ++|..+..+.++-|+|++.+ T Consensus 1 M~~e~n-----l--~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~I 73 (303) T protein:vir:10 1 MSAENN-----L--INVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVI 73 (303) T ss_pred CCCCcC-----C--cchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCccc Confidence 111110 0 01111122334566777766666666667777888864 334 44544555677789998888 Q ss_pred ccccCcce---eeEeecceeEEEeeeccHHhhhcCHH-HHHHHHHHHHHHHHHHHHhhheee----ccCCCcceEeeecc Q lcl|NC_019921. 140 KGQLDAAF---SEETAIQNKLTAFVVLPKDLNDFGPA-WIERFVRVQIEEAFAVALETAFLK----GTGKDQPIGLNRQV 211 (381) Q Consensus 140 ~~~~~~~f---~~v~l~~~kl~~~~~iS~ell~ds~~-~~e~~l~~~la~~~~~~~~~a~i~----G~G~~~P~Gil~~~ 211 (381) + -+..+- ...++..+|+..-+ |.|-+..|.. +-...--+.|...++..++..|+. ++|+.+ T Consensus 74 p-lskvt~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~-------- 142 (303) T protein:vir:10 74 P-LTKVTREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGK-------- 142 (303) T ss_pred c-hhhheeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccccc-------- Confidence 6 344443 24677788887744 9999854443 466777788888888888887764 211100 Q ss_pred ccccccccccccceeeeeeecccccchhHHHHHHHHH----HhhhccccccccccCceEEEEchhhHHHHhhhhhccCCC Q lcl|NC_019921. 212 QKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFK----YHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNAN 287 (381) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~----~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~ 287 (381) . + ...+..++.+...+. .+.... .. ..+.+.+|||.+.+.+++.......+ T Consensus 143 -----~------------t---~~t~~s~~glq~Al~~~~~kl~~~~----ed-~~~~V~FvNP~Daa~yl~~A~i~~~~ 197 (303) T protein:vir:10 143 -----R------------T---NKTKLSAENLQGALSKGRANLSVLL----DD-EITPIAFVNPNDTAEYLANGFINSTG 197 (303) T ss_pred -----c------------c---cceeecHHHHHHHHHhhhhhccccc----cc-cccEEEEEchHHHHHHhhcCCcchhh Confidence 0 0 001112222222221 121111 11 23568999999999887643332211 Q ss_pred ---C-ccccccCCCceeEecCCCCCCcEEEEe---ecceEEEeecceEEEeehhhhhhcCceEEEEEEEE---------- Q lcl|NC_019921. 288 ---G-VYVTALPFNLNVIESTVQEAGKVLTYV---KGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFA---------- 350 (381) Q Consensus 288 ---G-~~~~~l~~G~pVv~s~~~p~~~i~fgd---~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~---------- 350 (381) | +|+-. .+|..|+.|..+|+|+++.-- ...|.+..++.+ ..-.-+..|++|+.|..+- T Consensus 198 t~fG~n~L~n-fLG~~II~S~kv~~G~~~~T~~~Ni~~ay~~~~g~l----~~~f~~t~D~tglIGv~h~~~~~~~t~eT 272 (303) T protein:vir:10 198 AQFGVNLLTP-YVGVKIVEFADVPQGEVWMTVAENLNVAYANPRGEL----SRAFAFATDATGFVGVLHDIQPQRLTSDT 272 (303) T ss_pred hhhhhhhhhh-hhcceEEEeccCCCceEEEeeccceEEEEecCchhh----hhhhhhccccccceEEEeccccceeeehh Confidence 2 23322 246678999999999987543 333344444322 2344466788888887542 Q ss_pred ---cCE---EecCceEEEEEEEecCCccccccCcc Q lcl|NC_019921. 351 ---YGK---AKDNKVAAVWKLDLKGHKPALEGTEE 379 (381) Q Consensus 351 ---dGk---~~~~~Afvv~~~~~~~~~~~~~~~~~ 379 (381) .|- |=..++.|+ ..|.+.+ .++.|. T Consensus 273 ~~~~~~~lfpE~~dgiv~--~ti~~~e--~~~~~~ 303 (303) T protein:vir:10 273 IYASAISMFPENIDAVIK--VTIKKDE--AGELPS 303 (303) T ss_pred HhHhHHHhcccccceEEE--EEEeccc--cCCCCC Confidence 122 223346554 4444433 122333 No 163 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=97.20 E-value=0.00013 Score=41.82 Aligned_cols=288 Identities=11% Similarity=0.061 Sum_probs=145.5 Q ss_pred ccccCHHHHH--HH----HH--HhhccCCCCceecc---HHHHHHHHHHHHhhhhhhhhceeEec-C-Cc--eEEEEecC Q lcl|NC_019921. 62 AQSLSANQRS--FF----MD--INKNVNYKEEKLLP---EETIDRIFEDLTTNHPLLADLGIKNA-G-LR--LKFLKSET 126 (381) Q Consensus 62 ~~~lt~~e~~--~~----~~--~~~~~~~~gg~lvP---~~~~~~I~~~l~~~~~l~~~~~v~~~-~-g~--~~~p~~~~ 126 (381) .+.+.-+|.+ .. .. +......+.|++.- +.+...|++.....-..+.++.+.+. + +. +.+...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~ 80 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDK 80 (319) T ss_pred CCCcchhHHhhHHHHHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeecc Confidence 1111111100 00 00 11112222333333 23344555544444334444444322 2 11 34445566 Q ss_pred CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcC---HHHHHHHHHHHHHHHHHHHHhhheeeccCCCc Q lcl|NC_019921. 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFG---PAWIERFVRVQIEEAFAVALETAFLKGTGKDQ 203 (381) Q Consensus 127 ~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds---~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~ 203 (381) .+.+.|.+..+..-+..+..+++.....+.++.-+.++..=|+.+ ..++..--....++++++.+|.-+++|+...+ T Consensus 81 ~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g 160 (319) T protein:vir:10 81 VGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSAPHK 160 (319) T ss_pred ccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccccc Confidence 677888876544323456677788888888888888887666644 56788888889999999999999999998888 Q ss_pred ceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhc Q lcl|NC_019921. 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) Q Consensus 204 P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~ 283 (381) -.|||+.+.......+. + ... ..++++..++++..++..+.....+. .....++++|.-+..+... T Consensus 161 ~~GLlN~p~~~~~~~~~-~---~~~---~t~t~~~i~~di~~~~~~l~~~s~g~----~~p~~L~L~p~~~~~L~~~--- 226 (319) T protein:vir:10 161 IVSVFNHPNITKITSGK-W---IDV---STMKPETAEAELTQAIETIETITRGQ----HRATNILIPPSMRKVLAIR--- 226 (319) T ss_pred ceeEEeCCCceeeecCC-C---CCc---cccCHHHHHHHHHHHHHHHHHhcCce----eeceEEEecHHHHHhhhcc--- Confidence 89999876533221111 0 011 11234455566666666554332222 2334588888877665321 Q ss_pred cCCCCccc----cccCCCceeEecCCCCC----Cc--EEEEee-cceE-EEeecceEEEeehhhhhhcCceEEEEEEEEc Q lcl|NC_019921. 284 LNANGVYV----TALPFNLNVIESTVQEA----GK--VLTYVK-GLYD-GYLAGGINVQKFKETLALDDMDLYTAKQFAY 351 (381) Q Consensus 284 ~~~~G~~~----~~l~~G~pVv~s~~~p~----~~--i~fgd~-~~y~-i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~d 351 (381) .++.|..+ .....++.|........ ++ +++... .+|+ +.....++.... +.+-. .....+..|+. T Consensus 227 ~~~~~~t~l~~lk~~~~~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~-e~~~l--~~~~~~~~r~~ 303 (319) T protein:vir:10 227 MPETTMSYLDYFKSQNSGIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPA-QPKDL--HFKVPCTSKCT 303 (319) T ss_pred cCCCCeeHHHHHHHhcCCceEEEeeeecccCCCcceEEEEEecCCceEEEecCcceeeeee-eecCc--eEEEeeeeeeE Confidence 22333222 11112444554443331 22 233322 2232 333333333211 11111 12333455555 Q ss_pred -CEEecCceEEEEEEEe Q lcl|NC_019921. 352 -GKAKDNKVAAVWKLDL 367 (381) Q Consensus 352 -Gk~~~~~Afvv~~~~~ 367 (381) ..+..|.|+++++ =| T Consensus 304 Gv~i~~P~ai~~~d-GI 319 (319) T protein:vir:10 304 GLTIYRPMTIVLIT-GV 319 (319) T ss_pred EEEEEccceeEeee-cC Confidence 4467888988755 33 No 164 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=97.08 E-value=0.00016 Score=41.44 Aligned_cols=276 Identities=10% Similarity=-0.056 Sum_probs=130.0 Q ss_pred HHHHHHhhccCCCCceecc----HHHHHHHHHHHHhh-hhhhhhceeEecCCce---EEEEecCCcceEEeecccc---- Q lcl|NC_019921. 71 SFFMDINKNVNYKEEKLLP----EETIDRIFEDLTTN-HPLLADLGIKNAGLRL---KFLKSETSGVAVWGKIYGE---- 138 (381) Q Consensus 71 ~~~~~~~~~~~~~gg~lvP----~~~~~~I~~~l~~~-~~l~~~~~v~~~~g~~---~~p~~~~~~~a~wv~e~~~---- 138 (381) -.++++..+.+.=. ..|| +++.+++.-...+. +.|++-++..+-++.. ..+.....+. ++.+.. T Consensus 1 ~~~~~~~~~~~~Ms-~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~ 76 (322) T protein:vir:10 1 MKLNAIMSMLPLIA-GDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDA---VKRKRSRQQS 76 (322) T ss_pred Ccccceeeeeeeee-chhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccc---cccccccccc Confidence 11111111111000 0133 45555554443333 5566666644433221 2222111111 111000 Q ss_pred -----cccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeecc-CCCcceEeeeccc Q lcl|NC_019921. 139 -----IKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGT-GKDQPIGLNRQVQ 212 (381) Q Consensus 139 -----~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~-G~~~P~Gil~~~~ 212 (381) ..+...-.++.........+....|.+.-+.....|..+...+..+.+++++.|..|+.|- |... +|- .+ T Consensus 77 ~d~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~-~~~---~g 152 (322) T protein:vir:10 77 ADGTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPAS-IKG---TG 152 (322) T ss_pred cCcccCCCccccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccc-ccc---cc Confidence 0011111234444444444555688887777788999999999999999999999887632 1110 000 00 Q ss_pred cccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCC------ Q lcl|NC_019921. 213 KGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA------ 286 (381) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~------ 286 (381) ..+ ..++.. .+.......+.+.+..+...|... ..+-.++-..+++|..+..++.-..+.+. T Consensus 153 --t~v---~~~ss~---~i~~g~~g~t~~kl~~a~~~l~~~----dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~ 220 (322) T protein:vir:10 153 --QPV---EFLATQ---EIGDGTKPISFDYVTEITERFLEN----EIEPEVSKVIVIGPTQARKLLQITEATSADYTSAM 220 (322) T ss_pred --ccc---ccCCCc---ccccCccchhHHHHHHHHHHHHhc----CCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccch Confidence 000 000000 011111233455555555554322 22223334567788887777642222111 Q ss_pred ----CCccccccCCCceeEecCCCCCC------------------cEEEEeecceEEEeecceEEEeehhhhhhcCceEE Q lcl|NC_019921. 287 ----NGVYVTALPFNLNVIESTVQEAG------------------KVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLY 344 (381) Q Consensus 287 ----~G~~~~~l~~G~pVv~s~~~p~~------------------~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~ 344 (381) +|. +. ..+|..++.++.+|.. ..+++.-+....+....+..+.+.+-. ......+ T Consensus 221 ~l~~~G~-ig-~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~-~~~a~~I 297 (322) T protein:vir:10 221 DLQSKGI-IT-NWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPS-ASFAWRI 297 (322) T ss_pred hhhhcCe-ee-eeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCC-cchhhhh Confidence 121 21 2268999999988832 122333344444555555544322111 1123446 Q ss_pred EEEEEEcCEEecCceEEEEEEEecC Q lcl|NC_019921. 345 TAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 345 r~~~r~dGk~~~~~Afvv~~~~~~~ 369 (381) ++.+-++++.++++.+|.+..+-+= T Consensus 298 ~~~~~~Ga~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 298 YSAFTADCVRVEDEHIFKLRLKNSL 322 (322) T ss_pred hhhhhhCceEeccCcEEEEEEeccC Confidence 7778899999999998876654432 No 165 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=96.66 E-value=0.00032 Score=39.74 Aligned_cols=281 Identities=11% Similarity=0.017 Sum_probs=138.0 Q ss_pred hccCCC-----CceeccHHHHHHHHHHHHhhhhhhhhceeEecC-CceEEEEecCCcce-EEeeccccccc-ccCcceee Q lcl|NC_019921. 78 KNVNYK-----EEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LRLKFLKSETSGVA-VWGKIYGEIKG-QLDAAFSE 149 (381) Q Consensus 78 ~~~~~~-----gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~-g~~~~p~~~~~~~a-~wv~e~~~~~~-~~~~~f~~ 149 (381) -..++. .....=+++.+.|...=....|+.+++...+.. -.+.|+.++-...+ .-..|+++.+. ...+ .. T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~--r~ 78 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSF--TT 78 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccC--CE Confidence 111111 112234678888888777788998886554433 34677754322221 22234443221 1111 11 Q ss_pred EeecceeE-EEeeeccHHhhhcCHHH---HHHHHHHHHHHHHHHHHhhheeecc-----CC-C---cceEeeeccccccc Q lcl|NC_019921. 150 ETAIQNKL-TAFVVLPKDLNDFGPAW---IERFVRVQIEEAFAVALETAFLKGT-----GK-D---QPIGLNRQVQKGVS 216 (381) Q Consensus 150 v~l~~~kl-~~~~~iS~ell~ds~~~---~e~~l~~~la~~~~~~~~~a~i~G~-----G~-~---~P~Gil~~~~~~~~ 216 (381) ..-+.-.+ ...+.||..+..-+... .-+|=..+-...+.+-+|.+||+|. |+ . +-.||+..+..... T Consensus 79 ~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~ 158 (317) T protein:vir:88 79 MLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGS 158 (317) T ss_pred EeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCce Confidence 11122222 34456666555543333 3344444455667788999999985 21 2 33577765432211 Q ss_pred cc-cccccceeeeeeecccc-cchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhh------ccCCCC Q lcl|NC_019921. 217 VT-EGAYPEKEEQGTLTFAN-PRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT------HLNANG 288 (381) Q Consensus 217 ~~-~~~~~~~~~~~~~t~~~-~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~------~~~~~G 288 (381) .. .|..+........+... ...+-+.+.++++.+-.. ....+. ++||+...-.+-+... ..+.+. T Consensus 159 ~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~------Gg~~~~-i~v~a~~k~~i~~~~~~~~~~i~~~~~~ 231 (317) T protein:vir:88 159 LGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRN------GGQANS-IQTSSSIKKAISKNMKGRATEITLDASD 231 (317) T ss_pred eccCccccccCCCccccccccccccHHHHHHHHHHHHhc------CCCCCE-EEeChHHHHHHHHHhcCCceeEEEcccC Confidence 11 11110000000111111 112334444444433211 112233 4678765444432210 001111 Q ss_pred c-c-----ccccCCC-ceeEecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEEecCceEE Q lcl|NC_019921. 289 V-Y-----VTALPFN-LNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAA 361 (381) Q Consensus 289 ~-~-----~~~l~~G-~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afv 361 (381) . + ....+|| ++++.+.+||++++++.|+++.-+.-=.++..+.+-. ..|...+.....+.=++.+++|.. T Consensus 232 ~~~g~~v~~~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~~e~laK---tGd~~k~~i~~E~tLe~~N~~a~a 308 (317) T protein:vir:88 232 NRIAQTVDVYESDFGKYTIRANRWFHENTLFVFDPKMHSLCYLRPFFQHELAK---TGDSEKRQLLVEYTFRVNNEKSGA 308 (317) T ss_pred eEEEEEEEEEEeCCeEEEEEeCCCCCCCeEEEEcccccceeecccceeeccCC---CcccceeEEEEEEEEEEcCcccee Confidence 1 1 1124577 4789999999999999999886553334454443222 246667777888888999999998 Q ss_pred EEEEEecCC Q lcl|NC_019921. 362 VWKLDLKGH 370 (381) Q Consensus 362 v~~~~~~~~ 370 (381) ++..--+.. T Consensus 309 ~i~~l~~~~ 317 (317) T protein:vir:88 309 LIRDVVAQL 317 (317) T ss_pred EEEEecccC Confidence 876322222 No 166 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=96.27 E-value=0.0008 Score=37.56 Aligned_cols=287 Identities=13% Similarity=0.095 Sum_probs=140.1 Q ss_pred HHHHHHHHHHhhccccccCHHHHHH---HHHHh-hccCCCCceecc--HHHHHHHHHHHHhhhhhhhhceeEecCC---- Q lcl|NC_019921. 48 AKAEAERVSSLPKSAQSLSANQRSF---FMDIN-KNVNYKEEKLLP--EETIDRIFEDLTTNHPLLADLGIKNAGL---- 117 (381) Q Consensus 48 ~~~~~~~~~~~~~~~~~lt~~e~~~---~~~~~-~~~~~~gg~lvP--~~~~~~I~~~l~~~~~l~~~~~v~~~~g---- 117 (381) ...+++ .+..+. ...+. ...++.|-+++. +.+...|++.....-.-+.++.+.+..+ T Consensus 1 ~~~~~~-------------~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~e 67 (314) T protein:vir:10 1 MAIKFD-------------AEQAKITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAK 67 (314) T ss_pred CccchH-------------HHHHHHHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCcee Confidence 011111 000000 01111 112222334443 2344455553333222223332222111 Q ss_pred ceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcC---HHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019921. 118 RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFG---PAWIERFVRVQIEEAFAVALETA 194 (381) Q Consensus 118 ~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds---~~~~e~~l~~~la~~~~~~~~~a 194 (381) .+.+...+..+.+.|.+..+..-+..+..+++.....+.++.-+.++..=|.-+ ..++..--....+.++...+|.. T Consensus 68 t~~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i 147 (314) T protein:vir:10 68 YFEYPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKL 147 (314) T ss_pred EEEeeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceE Confidence 234555666777888887554223456678888888888888888877666644 56788888889999999999999 Q ss_pred eeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhH Q lcl|NC_019921. 195 FLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDA 274 (381) Q Consensus 195 ~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~ 274 (381) +++|+....-.|+|+.+.......+..| ++++..++++..++..+.....+. . ....+++.|.-+ T Consensus 148 ~f~G~~~~g~~GLlN~p~v~~~~~~~~W-----------aT~~ei~~Di~~~~~~l~~~s~g~---~-~p~~l~Lpp~~~ 212 (314) T protein:vir:10 148 VWSGSAPHGIVSVFDQPNINNVVATPNW-----------SVPQNAIDDVTAMIDAVESSTQGL---H-HVTDILLPASAR 212 (314) T ss_pred EEeecccccceeEeecCCCccccCCCCc-----------ccHHHHHHHHHHHHHHHHHhcCcc---c-cceeEEecHHHH Confidence 9999988888999987653322111111 133444566666665554332221 1 223577888765 Q ss_pred HHHhhhhhccCCCCcccc----ccCCCceeEecCCCCC----Cc--EEEEee-cceE-EEeecceEEEeehhhhhhcCce Q lcl|NC_019921. 275 FEVQAQYTHLNANGVYVT----ALPFNLNVIESTVQEA----GK--VLTYVK-GLYD-GYLAGGINVQKFKETLALDDMD 342 (381) Q Consensus 275 ~~~~~~~~~~~~~G~~~~----~l~~G~pVv~s~~~p~----~~--i~fgd~-~~y~-i~~r~~i~i~~~~~~~~~~d~~ 342 (381) ..+... .+..|.-+. .-..++.|.....+.. ++ +++.+- ..++ +.....++.-. -+.+-. .. T Consensus 213 ~~L~~~---~~~~~~tvl~~l~~n~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~-~e~~~~--~~ 286 (314) T protein:vir:10 213 RVMQGL---VPQTNLSYGELFTRNNPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTNVLP-AQPKDL--HF 286 (314) T ss_pred Hhhccc---ccCCCccHHHHHHHhCCCcEEEEcccccccCCCcceEEEEEecCCcEEEEecCccceeec-ceecCc--eE Confidence 444221 122232111 1112344444333321 11 222222 2222 22223332211 011111 12 Q ss_pred EEEEEEEEc-CEEecCceEEEEE-EEec Q lcl|NC_019921. 343 LYTAKQFAY-GKAKDNKVAAVWK-LDLK 368 (381) Q Consensus 343 ~~r~~~r~d-Gk~~~~~Afvv~~-~~~~ 368 (381) ...+..|+. ..+..|.|+++++ |+++ T Consensus 287 ~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 287 RYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred EEcceeeeEEEEEECcceeEeeeeeecC Confidence 233455664 5677899988655 3333 No 167 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=95.55 E-value=0.0019 Score=35.54 Aligned_cols=291 Identities=12% Similarity=0.042 Sum_probs=141.4 Q ss_pred cCHHHHHHHHHHhh-------ccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEee- Q lcl|NC_019921. 65 LSANQRSFFMDINK-------NVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGK- 134 (381) Q Consensus 65 lt~~e~~~~~~~~~-------~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~a~wv~- 134 (381) |+.+-|+.|+++.. -.+....+.|.+.....+...+.+.+-+++.++++++.- +-++....+++-++=.. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdT 80 (338) T protein:vir:11 1 MRNETRKQFDAYLAQLAKLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSGTIASRTDT 80 (338) T ss_pred CCHHHHHHHHHHHHHHHHHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCccccccccC Confidence 33344443333211 122345678999999999999999999999999988752 12333333333332211 Q ss_pred -cccccccccCc-ceeeEeecceeEEEeeeccHHhhhc--CHHHHHHHHHHHHHHHHHHHHhhheeeccCC-------Cc Q lcl|NC_019921. 135 -IYGEIKGQLDA-AFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTGK-------DQ 203 (381) Q Consensus 135 -e~~~~~~~~~~-~f~~v~l~~~kl~~~~~iS~ell~d--s~~~~e~~l~~~la~~~~~~~~~a~i~G~G~-------~~ 203 (381) ..+++.+ .++ .++.-.+..++.-.-..|+.+.|+. ...|+..-+++.+.++++.-.=.--++|+-. .- T Consensus 81 ~~~~~R~~-~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~~n 159 (338) T protein:vir:11 81 TGDGVRKP-RDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRAAN 159 (338) T ss_pred CCCCcccc-ccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhC Confidence 1122322 222 4455555666666667899999983 2347888999999999887666666777651 12 Q ss_pred c------eEeeecccc---ccccccccccceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc--CceEEEEchh Q lcl|NC_019921. 204 P------IGLNRQVQK---GVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNPS 272 (381) Q Consensus 204 P------~Gil~~~~~---~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~--~~a~~~mn~~ 272 (381) | +|.|..+-. ...-..+....+..++.... ..+..|..++..+... ..+..|+ +..+.+|.+. T Consensus 160 PllqDVNkGWlQ~~Re~ap~rv~~~~~~~~~i~i~~g~~----gdy~nLDalV~d~~~~--lI~~~~~~d~dLVvivG~d 233 (338) T protein:vir:11 160 PLLQDVNIGWFQQYRNNAPARVLKEGKTTGKVVVGNGAD----ADYKNLDALVFDVVSS--LIDPWHRRDPGLVVILGRE 233 (338) T ss_pred cCccccchhHHHHHHhhhhhhhhhcccccceeeecCCCC----CccccHHHHHHHHHhc--cCChHHhcCCCEEEEEchh Confidence 3 344422211 00111122222222222111 1122233333322111 0122333 3578888876 Q ss_pred hHHHHhhhhhccCCC------Cccc--cccCCCceeEecCCCCCCcEEEEeecceEEE-eecceEEEe--ehhh----hh Q lcl|NC_019921. 273 DAFEVQAQYTHLNAN------GVYV--TALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGINVQK--FKET----LA 337 (381) Q Consensus 273 t~~~~~~~~~~~~~~------G~~~--~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i~-~r~~i~i~~--~~~~----~~ 337 (381) -.+.-..+...+... ++-+ ....-|+|.+.-+++|++.+++=-|++.-|+ .++..+=.. .++. -+ T Consensus 234 Lladk~~~l~n~~~~ptE~~Aa~~~~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y 313 (338) T protein:vir:11 234 LVHDKYFPMVNKDQPATEKIATDLILSQKRMGGLPPVEVPYVPEKGLMVTTLKNLSLYWQIGGRRRYLKEVPEKNRIENY 313 (338) T ss_pred hhHHHHhHHHhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccch Confidence 555333222222111 1111 1123489999999999999988777774332 333333111 1111 01 Q ss_pred hcCceEEEEEEEEcCEEecCceEEEEEEEecC Q lcl|NC_019921. 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 338 ~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~ 369 (381) ..-..+|..--+.-+-.++ .++++. T Consensus 314 ~s~Ne~YvVEd~~~~a~ie-------ni~~~~ 338 (338) T protein:vir:11 314 ESSNDAYVVEDYGLGCLVE-------NIEVAE 338 (338) T ss_pred hhhccceeeeccccEEEee-------cceecC Confidence 1111233222221111111 223322 No 168 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=94.44 E-value=0.0044 Score=33.50 Aligned_cols=299 Identities=11% Similarity=0.035 Sum_probs=141.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHH--H-HH-----hhccCCCCceecc--HHHHHHHHHHHHhhhhh Q lcl|NC_019921. 37 INQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFF--M-DI-----NKNVNYKEEKLLP--EETIDRIFEDLTTNHPL 106 (381) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~--~-~~-----~~~~~~~gg~lvP--~~~~~~I~~~l~~~~~l 106 (381) +. .. ++. +.+..++..+. . .. .......+.+++. +.+...|++.....-.. T Consensus 1 ~~---~~------------~~~----~~~~~d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~ 61 (329) T protein:vir:79 1 MR---GN------------IMS----KEMKYDEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSA 61 (329) T ss_pred Cc---cc------------hhh----hhhccchhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccch Confidence 00 00 000 00111111100 0 00 0011111223332 23455566554444334 Q ss_pred hhhceeEecC--C--ceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcC---HHHHHHHH Q lcl|NC_019921. 107 LADLGIKNAG--L--RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFG---PAWIERFV 179 (381) Q Consensus 107 ~~~~~v~~~~--g--~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds---~~~~e~~l 179 (381) +.++.+.+.. + .+.+...+..+.+.|.+..+..-+..+..+.+-....+.++.-+.++..=|+-+ ..++..-- T Consensus 62 ~~~i~i~~~~~~~~~~~t~~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k 141 (329) T protein:vir:79 62 LRVFPVTSELSDTDKTFEYQTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRK 141 (329) T ss_pred hhhcccccCCCCceeEEEeeeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHH Confidence 4444333221 1 234555566677888876443223445566676777777777778876655544 56788888 Q ss_pred HHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccc Q lcl|NC_019921. 180 RVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSV 259 (381) Q Consensus 180 ~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~ 259 (381) ....++++.+.+|.-+++|++..+-.|+|+.+.....+.+ .+. .......+++..++++..++..+.....+. T Consensus 142 ~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~-~~~----~~~w~~kt~~ei~~di~~~~~~l~~~s~g~-- 214 (329) T protein:vir:79 142 ANAAQNAHDQLVNHLVFKGSKPHKIISVFEHPNLTTINSA-GWN----NAAGTGKKPETAQDELEQAIEKIETLTNGQ-- 214 (329) T ss_pred HHHHHHHHHHhhccEEEeecccccceeeecCCCccccccC-CCC----CccccccCHHHHHHHHHHHHHHHHHhcCce-- Confidence 8899999999999999999988888999987664332221 110 011223345555666666655554332221 Q ss_pred cccCceEEEEchhhHHHHhhhhhccCCCCcccc----ccCCCceeEecCCCC-C---C--cEEEEeecc-eE-EEeecce Q lcl|NC_019921. 260 AVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVT----ALPFNLNVIESTVQE-A---G--KVLTYVKGL-YD-GYLAGGI 327 (381) Q Consensus 260 ~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~----~l~~G~pVv~s~~~p-~---~--~i~fgd~~~-y~-i~~r~~i 327 (381) ...-.++|+|.-+..+.. ..++.|.-+. ....++.|.....+. + + .++..+.+. ++ +.....+ T Consensus 215 --~~p~~L~Lpp~~~~~L~~---~~~~~~~tvl~~lk~~~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~ 289 (329) T protein:vir:79 215 --HRANMILIPPSMRKVLMV---RMPETTMSYLDYFKQQNGGITIESISELEDIDGAGTKAALVYEKDPMNMSIEIPEAF 289 (329) T ss_pred --ecccEEEecHHHHHHhhc---ccCCCCccHHHHHHHhCCCcEEEEcccccccCCCCceEEEEEecCCceEEEecCcce Confidence 123357888876544422 1223332221 111133343333221 1 1 133333322 32 2222333 Q ss_pred EEEeehhhhhhcCceEEEEEEEEc-CEEecCceEEEEEEEecCC Q lcl|NC_019921. 328 NVQKFKETLALDDMDLYTAKQFAY-GKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 328 ~i~~~~~~~~~~d~~~~r~~~r~d-Gk~~~~~Afvv~~~~~~~~ 370 (381) +... -+.+-.. ....+..|+. ..+..|.|++.++ =|... T Consensus 290 ~~l~-~q~~~~~--~~v~~~~r~~Gv~i~~P~ai~~~d-GI~~~ 329 (329) T protein:vir:79 290 NMLT-AQPKDLH--FKVPCTSKCTGLTIYRPLTLVLIK-GLVVG 329 (329) T ss_pred eeee-ceecCce--EEEceeeeEEEEEEECcceeeeee-eeeeC Confidence 3221 1111111 2233455555 4567788887766 33333 No 169 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=93.60 E-value=0.007 Score=32.40 Aligned_cols=302 Identities=8% Similarity=-0.042 Sum_probs=144.5 Q ss_pred cccccCHHHHHHHHHHhhc-------cCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceE Q lcl|NC_019921. 61 SAQSLSANQRSFFMDINKN-------VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAV 131 (381) Q Consensus 61 ~~~~lt~~e~~~~~~~~~~-------~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~a~ 131 (381) ....|+.+-|+.|+.+... .+....+.|-+.+...+.+.+.+.|-+++.++++++.- +-.+....+++-++ T Consensus 1 m~~~m~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iag 80 (341) T protein:vir:27 1 MSQILTQSAREYMDNFAQQLAKSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGVSGLYTG 80 (341) T ss_pred CcccccHHHHHHHHHHHHHHHHHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeecccccceee Confidence 1112344444444433221 12234577888899999999999999999999988752 12233333333332 Q ss_pred EeecccccccccCcceeeEeecceeEEEeeeccHHhhhcC-----HHHHHHHHHHHHHHHHHHHHhhheeeccCC----- Q lcl|NC_019921. 132 WGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFG-----PAWIERFVRVQIEEAFAVALETAFLKGTGK----- 201 (381) Q Consensus 132 wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds-----~~~~e~~l~~~la~~~~~~~~~a~i~G~G~----- 201 (381) =..- ++. ..++..+.-.+..++.-.-+.|+.+.|+.= ..|+..-+++.+.++++.-.=.--++|+-. T Consensus 81 rtdt--~R~-~r~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~~Td 157 (341) T protein:vir:27 81 RKAG--GRF-TKQVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEADTD 157 (341) T ss_pred ccCC--Cce-ecccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeeccCCC Confidence 2211 122 223455556666666666678888888732 367889999999999988776666777651 Q ss_pred --Ccc------eEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc--CceEEEEch Q lcl|NC_019921. 202 --DQP------IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNP 271 (381) Q Consensus 202 --~~P------~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~--~~a~~~mn~ 271 (381) .-| +|.|..+-.. .+...........+ ....+..|..++..+... ..+..++ +..+.+|.+ T Consensus 158 ~~anPllqDVNkGWlQ~~Re~-a~~rVl~~~~~~~g------~~gdy~nLDAlV~D~~~~--lI~~~~~~d~dLVvivG~ 228 (341) T protein:vir:27 158 PSANPLGQDVNEGWIAFVKNR-KASQVVDVDVYFDE------TNGDYRTLDAMASDIINN--QIHPMFRNDPRLTVFVGS 228 (341) T ss_pred hhhcccccccchhHHHHHHhh-cccceeccceeecc------CCCccccHHHHHHHHHhc--ccChHHhcCCCEEEEEch Confidence 123 3444322211 11100000011111 112222333333332111 0123333 356788886 Q ss_pred hhHHHHhhhhhccC-CC-----CccccccCCCceeEecCCCCCCcEEEEeecceEEE-eecceE--EEeehhh-hhhcCc Q lcl|NC_019921. 272 SDAFEVQAQYTHLN-AN-----GVYVTALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGIN--VQKFKET-LALDDM 341 (381) Q Consensus 272 ~t~~~~~~~~~~~~-~~-----G~~~~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i~-~r~~i~--i~~~~~~-~~~~d~ 341 (381) .-.+.-..+...+. .+ ++-+....-|+|.+.-+++|++.+++=-|++.-|+ ..+..+ ++-.++. ++.+.+ T Consensus 229 dLla~k~~~l~n~~~~ptE~~Aa~~i~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ye 308 (341) T protein:vir:27 229 GLIGAAQAKLYDKADKPSEQIAAQKLDKTIAGRPAYVPPFLPDNAMVVTIPENLQVLTQHGTAQRKAKHESDRKRSKTHT 308 (341) T ss_pred hhhhhhhhhhhccCCCCHHHHHHHHHHHhhCCCeEEEccccCCCceEEeeccceEEEEecCcEEEEEEeccccccccchh Confidence 65543322222221 11 11122233589999999999999988777774333 233322 1111111 122223 Q ss_pred eEEEEEEEEcCEEecCceEEEEEEEecCCccccccC Q lcl|NC_019921. 342 DLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGT 377 (381) Q Consensus 342 ~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~ 377 (381) .+|+.-- +++- ..=.|..+++..++.--.++-+ T Consensus 309 s~YvVEd-yg~~--~~~~~~~vkl~~~~~~~~~~~~ 341 (341) T protein:vir:27 309 GAWKVTQ-WVCW--KRSPLTTQKKSTSALNHRSERN 341 (341) T ss_pred hhheeeh-hhhh--hhccccccccCccccccccccC Confidence 3443322 2211 1111222222222222333333 No 170 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=93.33 E-value=0.0079 Score=32.10 Aligned_cols=267 Identities=11% Similarity=0.022 Sum_probs=117.7 Q ss_pred CCCceeccHHHHHHHHHHHHhhhhhhhhceeE---ecC---C-ceEEEEecCCcceEEee-----cccccccccCcceee Q lcl|NC_019921. 82 YKEEKLLPEETIDRIFEDLTTNHPLLADLGIK---NAG---L-RLKFLKSETSGVAVWGK-----IYGEIKGQLDAAFSE 149 (381) Q Consensus 82 ~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~---~~~---g-~~~~p~~~~~~~a~wv~-----e~~~~~~~~~~~f~~ 149 (381) =..-.++|+-++.++++.|++...+.++++.- ..+ | .++||+.... .+.+.. .++.+.. .+..=+. T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~-~~~~~~~~~~~~~~~~~~-~~~~~~~ 78 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPS-RGHTRKLRGAGAERNLTV-SDFTEDS 78 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccc-cceeeeccccccCCcccc-cccccce Confidence 11235889999999999999988887776432 111 3 3788764432 222221 1111111 1222233 Q ss_pred Eeec--ceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceee Q lcl|NC_019921. 150 ETAI--QNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEE 227 (381) Q Consensus 150 v~l~--~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~ 227 (381) +++. .+++. -+.++.+-+..+..|+..-+.+...++++..+|..++.-- .+.|.+.. T Consensus 79 ~~~~id~~k~~-~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~-~~a~~~~~------------------- 137 (392) T protein:vir:99 79 FPVTLTDVAYH-LGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYEAA------------------- 137 (392) T ss_pred EEEEEeeeeec-ceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-hccccccc------------------- Confidence 4444 44443 4556666666667788878888889999999998765311 11111100 Q ss_pred eeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccC-----CCCc--ccc---ccCCC Q lcl|NC_019921. 228 QGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN-----ANGV--YVT---ALPFN 297 (381) Q Consensus 228 ~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~-----~~G~--~~~---~l~~G 297 (381) ...+..++...++.+.++...|... ..| .+ -+++++|..+..+++...+.+ .++. +.. +..+| T Consensus 138 -~~~~~~~~~~~~~~i~~a~~~L~~~--~vP---~~-R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G 210 (392) T protein:vir:99 138 -GAVHEVAPDEFFKGVNGARRALNEL--YIP---QG-RVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYG 210 (392) T ss_pred -ccccccChhhhHHHHHHHHHHHhhc--CCC---CC-CEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeee Confidence 0001122334455666665554321 222 23 356788887777654211111 1111 111 12368 Q ss_pred ceeEecCCCCCCcEEEEeecceEEEeecceEE-------------------EeehhhhhhcCceEEEEEEEEcCEEecCc Q lcl|NC_019921. 298 LNVIESTVQEAGKVLTYVKGLYDGYLAGGINV-------------------QKFKETLALDDMDLYTAKQFAYGKAKDNK 358 (381) Q Consensus 298 ~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i-------------------~~~~~~~~~~d~~~~r~~~r~dGk~~~~~ 358 (381) .+|+.+..+|.+..+.+..+.+....+..... .++.......|...+.. ..+...+... T Consensus 211 ~~v~~s~~~~~~t~~a~~~~a~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~--~~g~~~v~~~ 288 (392) T protein:vir:99 211 YEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDT--YFGLKVVEDP 288 (392) T ss_pred eEEEeecccccccceeeeccccccccccccccccccceeEEecccceecceeecccceeeccccccce--eEEEEEEeec Confidence 89999999998765544333332222211111 01011111111111100 0001111100 Q ss_pred ---eEEEEEEEecCCccccccCc-----ccC Q lcl|NC_019921. 359 ---VAAVWKLDLKGHKPALEGTE-----ETL 381 (381) Q Consensus 359 ---Afvv~~~~~~~~~~~~~~~~-----~~~ 381 (381) +|.. ...+.......+-+| .++ T Consensus 289 ~~~~~~~-~~~~~~~~~~v~v~~v~~~~~~~ 318 (392) T protein:vir:99 289 NGVGFVR-ARKIHLIPGSIEVAPEAGANATI 318 (392) T ss_pred cccceee-eeeeeeecceeeeeeeeccccee Confidence 1100 001111111111111 111 No 171 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=92.28 E-value=0.012 Score=31.11 Aligned_cols=293 Identities=12% Similarity=0.018 Sum_probs=138.0 Q ss_pred cCHHHHHHHHHHhhc-------cCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEeec Q lcl|NC_019921. 65 LSANQRSFFMDINKN-------VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKI 135 (381) Q Consensus 65 lt~~e~~~~~~~~~~-------~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~a~wv~e 135 (381) |+.+-|..|+.+... ...+..+.|-+.+...+...+.+.|-+++.++++++.- +-++....+++-++=..- T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt 80 (339) T protein:vir:79 1 MRNDTRRLFAAYKAAIAKLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVSGPVASTTDT 80 (339) T ss_pred CChHHHHHHHHHHHHHHHHhCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccCcceeecccC Confidence 444444444332211 12345678889999999999999999999999988752 123333333332222111 Q ss_pred -ccccccccCcceeeEeecceeEEEeeeccHHhhhc--CHHHHHHHHHHHHHHHHHHHHhhheeeccCCC-------cc- Q lcl|NC_019921. 136 -YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTGKD-------QP- 204 (381) Q Consensus 136 -~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~d--s~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~-------~P- 204 (381) ..++.+..-..++.-.+..++.-.-..|+.++|+. ...|+..-+++.+.++++.-.=.--++|+-.. -| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPl 160 (339) T protein:vir:79 81 TQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDRVANPM 160 (339) T ss_pred CCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeecCCChhhCcC Confidence 12232222124445555555555567899999883 23468888888888888876555556776421 23 Q ss_pred -----eEeeeccccc---cccccccc-cceeee-eeecccccchhHHHHHHHHHHhhhcccccccccc--CceEEEEchh Q lcl|NC_019921. 205 -----IGLNRQVQKG---VSVTEGAY-PEKEEQ-GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNPS 272 (381) Q Consensus 205 -----~Gil~~~~~~---~~~~~~~~-~~~~~~-~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~--~~a~~~mn~~ 272 (381) +|.|..+-.. ..-..+.. ..+..+ |+ ...+..|..++..+... ..+..|+ +..+.+|-+. T Consensus 161 lqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~------ggdy~NLDalV~d~~~~--lId~~~~~d~dLVvivG~d 232 (339) T protein:vir:79 161 LQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGA------GADYGNLDALVYDITNH--LVEPWYAEDPDLVVVCGRN 232 (339) T ss_pred ccccchhHHHHHHhhhhhhhhccceeccceeEeccC------CCCcccHHHHHHHHHhc--cCChHHhcCCCEEEEEchh Confidence 3444221110 00011110 111111 11 11222333333332211 0123344 3578888877 Q ss_pred hHHHHhhhhhccCC-C-----Ccccc--ccCCCceeEecCCCCCCcEEEEeecceEE-EeecceEEEe--ehhh-hhhcC Q lcl|NC_019921. 273 DAFEVQAQYTHLNA-N-----GVYVT--ALPFNLNVIESTVQEAGKVLTYVKGLYDG-YLAGGINVQK--FKET-LALDD 340 (381) Q Consensus 273 t~~~~~~~~~~~~~-~-----G~~~~--~l~~G~pVv~s~~~p~~~i~fgd~~~y~i-~~r~~i~i~~--~~~~-~~~~d 340 (381) -...-..+...+.. + ++-+. ...=|+|.+.-+++|++.+++=-|++.-| ..++..+=.. .++. ++.+. T Consensus 233 Lla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y 312 (339) T protein:vir:79 233 LLSDKYFPLVNRDRDPVQQIAADLIISQKRIGNLPAIRVPYFPANGLLVTRLDNLSIYYQEGGRRRTILDNAKRDRIENY 312 (339) T ss_pred hhhhHhhhHhhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEeccccccccch Confidence 65543322222211 0 11111 12248999999999999998877776333 3333333211 1111 11110 Q ss_pred ceEEEEEEEEcCEEecC-ceEEEEE-EEecCCccc Q lcl|NC_019921. 341 MDLYTAKQFAYGKAKDN-KVAAVWK-LDLKGHKPA 373 (381) Q Consensus 341 ~~~~r~~~r~dGk~~~~-~Afvv~~-~~~~~~~~~ 373 (381) ..|-+|-+|-+ ++++.++ +++ ...| T Consensus 313 ------~s~Ne~YvVEd~~~~a~iEni~~--~~aa 339 (339) T protein:vir:79 313 ------ESSNDAYVIEDLACAAMAENIAL--AAAA 339 (339) T ss_pred ------hhccceeeeeccccEEEeeeeec--ccCC Confidence 01222222222 2222222 222 2222 No 172 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=92.00 E-value=0.013 Score=30.88 Aligned_cols=293 Identities=10% Similarity=-0.009 Sum_probs=140.4 Q ss_pred cCHHHHHHHHHHhh------cc-----CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceE Q lcl|NC_019921. 65 LSANQRSFFMDINK------NV-----NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAV 131 (381) Q Consensus 65 lt~~e~~~~~~~~~------~~-----~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~a~ 131 (381) +..+-|..|+++.. +. +..--+.|-+.+...+...+.+.|-+++.++++++.- +-++....+++-++ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iag 80 (342) T protein:vir:10 1 MKDLTLEKYNAYLARQAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLDSAHTVAS 80 (342) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecccCccccc Confidence 33333333333221 11 1122478889999999999999999999999988752 22333333333332 Q ss_pred Eeec--ccccccccCcceeeEeecceeEEEeeeccHHhhhc--CHHHHHHHHHHHHHHHHHHHHhhheeeccCCC----- Q lcl|NC_019921. 132 WGKI--YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTGKD----- 202 (381) Q Consensus 132 wv~e--~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~d--s~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~----- 202 (381) =..- ..++.+.+-..++.-.+..++.-.-..|+.++|+. ...|+..-+++.+.++++.-.=.--++|+-.. T Consensus 81 rtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~ 160 (342) T protein:vir:10 81 TTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAATSDR 160 (342) T ss_pred ccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCCh Confidence 2211 11233222234455555555555567899999883 33468888888888888876655556776421 Q ss_pred --cc------eEeeeccc---cccccccccccceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc--CceEEEE Q lcl|NC_019921. 203 --QP------IGLNRQVQ---KGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVV 269 (381) Q Consensus 203 --~P------~Gil~~~~---~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~--~~a~~~m 269 (381) -| +|.|..+- .......++...+..+|+. ..+..|..++..+... ..+..|+ +..+.+| T Consensus 161 ~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~~~~~i~iG~~------gdy~NLDalV~D~~~~--lI~~~~~~d~dLVviv 232 (342) T protein:vir:10 161 NSNPLLQDVAKGWLQKMREDAKERVMNGESTDNQVLVGKG------QEYANLDALVMDATEE--LIDEWHRDDTDLVVIT 232 (342) T ss_pred hhCcCccccchHHHHHHHhhhhhhhcccceeccceeecCC------CCcccHHHHHHHHHhc--cCChHHhcCCCEEEEE Confidence 23 35443221 1111111221112222221 1222333333322111 0133344 3578888 Q ss_pred chhhHHHHhhhhhccC-CCC-----ccc--cccCCCceeEecCCCCCCcEEEEeecceEE-EeecceEEEe--ehhh-hh Q lcl|NC_019921. 270 NPSDAFEVQAQYTHLN-ANG-----VYV--TALPFNLNVIESTVQEAGKVLTYVKGLYDG-YLAGGINVQK--FKET-LA 337 (381) Q Consensus 270 n~~t~~~~~~~~~~~~-~~G-----~~~--~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i-~~r~~i~i~~--~~~~-~~ 337 (381) .+.-...-..+...+. .+. +-+ ....=|+|.+.-+++|++.+++=-|++.-| ..++..+=.. .++. ++ T Consensus 233 G~dLladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~ri 312 (342) T protein:vir:10 233 GRKLLADKYFPIVNQQNAPTEELAADIVISQKRIGGLKAVRVPFFPANAILITKLENLAIYVQEGTTRKHIENVPKKDRI 312 (342) T ss_pred chhhhHHHHHHHHhcCCChHHHHHHHHHHhhhhhcCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccc Confidence 8776654333322221 111 111 112248999999999999998877766332 3333333211 1111 01 Q ss_pred hcCceEEEEEEEEcCEEecCc-eEEEE-EEEecCCc Q lcl|NC_019921. 338 LDDMDLYTAKQFAYGKAKDNK-VAAVW-KLDLKGHK 371 (381) Q Consensus 338 ~~d~~~~r~~~r~dGk~~~~~-Afvv~-~~~~~~~~ 371 (381) .+. ..|-+|-+|-+. +++++ .++++.++ T Consensus 313 e~y------~s~Ne~YvVEd~~~~a~iE~i~i~~~~ 342 (342) T protein:vir:10 313 ETY------ESENIDYVVEDYGCAALIENITLKDKE 342 (342) T ss_pred cch------hhhccceeeeccccEEEeecceecCCC Confidence 100 112222222221 22221 23443333 No 173 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=90.74 E-value=0.019 Score=29.98 Aligned_cols=298 Identities=12% Similarity=0.047 Sum_probs=141.9 Q ss_pred cCHHHHHHHHHHhh------ccC---CCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEe Q lcl|NC_019921. 65 LSANQRSFFMDINK------NVN---YKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWG 133 (381) Q Consensus 65 lt~~e~~~~~~~~~------~~~---~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~a~wv 133 (381) |+.+-|+.|+.+.. +.. ....+.|-+.+...+.+.+.+.|-+++.++++++.- +-++-...+++-++=+ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrt 80 (355) T protein:vir:98 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccCccccccc Confidence 44444444433211 111 234577888899999999999999999999988752 1233333333333221 Q ss_pred ec--ccccccccCcceeeEeecceeEEEeeeccHHhhhc--CHHHHHHHHHHHHHHHHHHHHhhheeeccCC---C---- Q lcl|NC_019921. 134 KI--YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTGK---D---- 202 (381) Q Consensus 134 ~e--~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~d--s~~~~e~~l~~~la~~~~~~~~~a~i~G~G~---~---- 202 (381) .- ..++.+.....++.-.+..++.-.-..|+.+.|+. ...|+..-+++.+.++++.-.=.--++|+-. . T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~ 160 (355) T protein:vir:98 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) T ss_pred cCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhh Confidence 11 11222222233444555555555667899999883 2346888888999998887666666777651 1 Q ss_pred cc------eEeeecccccc---cccccc------ccceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc--Cce Q lcl|NC_019921. 203 QP------IGLNRQVQKGV---SVTEGA------YPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNV 265 (381) Q Consensus 203 ~P------~Gil~~~~~~~---~~~~~~------~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~--~~a 265 (381) -| +|.|..+-... .-..++ ......+|. ...+..|..++..+... ..+..|+ +.. T Consensus 161 nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~------~gdy~NLDAlV~D~~~~--lI~~~~~~d~dL 232 (355) T protein:vir:98 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGK------NGDYENIDALVMDATNN--LIDEVYQDDPNL 232 (355) T ss_pred CcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCC------CCCcccHHHHHHHHHhc--cCChHHhcCCCE Confidence 23 34442221100 001010 011111111 11222333333322211 0123333 357 Q ss_pred EEEEchhhHHHHhhhhhccC-CC-----Ccccc--ccCCCceeEecCCCCCCcEEEEeecceEE-EeecceEEE--eehh Q lcl|NC_019921. 266 TMVVNPSDAFEVQAQYTHLN-AN-----GVYVT--ALPFNLNVIESTVQEAGKVLTYVKGLYDG-YLAGGINVQ--KFKE 334 (381) Q Consensus 266 ~~~mn~~t~~~~~~~~~~~~-~~-----G~~~~--~l~~G~pVv~s~~~p~~~i~fgd~~~y~i-~~r~~i~i~--~~~~ 334 (381) +.+|.+.-.+.-..+...+. .+ ++-+. ...-|+|.+.-+++|++.+++=-|++.-| ..++..+=. -.++ T Consensus 233 VvivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~ 312 (355) T protein:vir:98 233 VAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPK 312 (355) T ss_pred EEEEchhhhHHHhhhHhhccCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccc Confidence 88888765553322222221 11 11111 12348999999999999998877776433 233333211 1111 Q ss_pred h----hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCc---cccccC Q lcl|NC_019921. 335 T----LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK---PALEGT 377 (381) Q Consensus 335 ~----~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~---~~~~~~ 377 (381) . -+..-..+|..--+.-+-.++ .++++... ....++ T Consensus 313 r~rie~y~s~Ne~YvVEd~~~~a~ie-------nI~~~~~~~~~~~~~~a 355 (355) T protein:vir:98 313 KDRVENYESMNIDYVVEVYAAGCLLE-------NITLGDFTAPAAPESGA 355 (355) T ss_pred cccccchhhhcceeeeeccccEEEee-------ceeeeCCCCCcccccCC Confidence 1 111222344433333333332 23333222 112222 No 174 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=90.71 E-value=0.019 Score=29.97 Aligned_cols=307 Identities=9% Similarity=-0.016 Sum_probs=145.7 Q ss_pred cccccCHHHHHHHHHHhh------cc---CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcc Q lcl|NC_019921. 61 SAQSLSANQRSFFMDINK------NV---NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGV 129 (381) Q Consensus 61 ~~~~lt~~e~~~~~~~~~------~~---~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~ 129 (381) ....|+.+-|..|+.+.. +. ..+..+.|.+.+...+.+.+.+.|-+++.++++++.- +-++....+++- T Consensus 1 m~~~M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~i 80 (358) T protein:vir:78 1 MSQTLTVQAEQRLNKYCDALAKAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVGVGQLY 80 (358) T ss_pred CcccccHHHHHHHHHHHHHHHHHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeecCCccc Confidence 111234444444443211 11 2245688999999999999999999999999988752 122333333333 Q ss_pred eEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcC-----HHHHHHHHHHHHHHHHHHHHhhheeeccCC--- Q lcl|NC_019921. 130 AVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFG-----PAWIERFVRVQIEEAFAVALETAFLKGTGK--- 201 (381) Q Consensus 130 a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds-----~~~~e~~l~~~la~~~~~~~~~a~i~G~G~--- 201 (381) ++=.. .+.......++.-.+..++.-.-..|+.++|+.= ..|+..-+++.+.++++.-.=.--++|+-. T Consensus 81 agrt~---tr~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~ 157 (358) T protein:vir:78 81 TGRKK---GGRFKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSAADD 157 (358) T ss_pred ceecC---CCccccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecccceeeccC Confidence 32211 1222233344555555555556678999999842 226899999999999887665555677642 Q ss_pred ----Ccc------eEeeeccc---cccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccC--ceE Q lcl|NC_019921. 202 ----DQP------IGLNRQVQ---KGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKG--NVT 266 (381) Q Consensus 202 ----~~P------~Gil~~~~---~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~--~a~ 266 (381) .-| +|.|..+- .......++......++... ...+..|..++..+.. ...+..|+. ..+ T Consensus 158 Td~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~ig~g~----~Gdy~NLDalV~D~~~--~lI~~~~~~d~dLV 231 (358) T protein:vir:78 158 TDPTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGEKIYFDPDG----KGEYKTLDEMASDLIN--TTIDPLFQQDPRLV 231 (358) T ss_pred CChhhCcCccccchHHHHHHHhhchhhhhccccccCceeecCCC----CCccccHHHHHHHHHh--ccCChHHhcCCCEE Confidence 123 34443221 11111111111111222211 1122233333333211 011333443 578 Q ss_pred EEEchhhHHHHhhhhhccCC-C-----CccccccCCCceeEecCCCCCCcEEEEeecceEE-EeecceEEEe--ehhh-- Q lcl|NC_019921. 267 MVVNPSDAFEVQAQYTHLNA-N-----GVYVTALPFNLNVIESTVQEAGKVLTYVKGLYDG-YLAGGINVQK--FKET-- 335 (381) Q Consensus 267 ~~mn~~t~~~~~~~~~~~~~-~-----G~~~~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i-~~r~~i~i~~--~~~~-- 335 (381) .+|.+.-...-..+...+.. + ++-+....=|+|.+.-+++|++.+++=-|++.-| ..++..+=.. .++. T Consensus 232 vivG~dLla~k~~~l~n~~~~pTE~~Aa~~i~k~iGGlpa~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~r 311 (358) T protein:vir:78 232 VLVGTDLVAAAQAKLYSEATKPSEQIAAQQLAKSIAGRKAYIPPFFPGKRMVVTTLDNLHCYTQRGTRKRKADDNQDSKS 311 (358) T ss_pred EEEchhhhhHHhhhHhhcCCCcHHHHHHHHHHHHhCCCeEEEccccCCCceEEeeccccEEEEecCcEEEEEEecccccc Confidence 88887765543322222211 1 1111122248999999999999998877766333 3333333211 1111 Q ss_pred --hhhcCceEEEEEEEEcCEEecCceEEEEEEEecC--CccccccCcccC Q lcl|NC_019921. 336 --LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG--HKPALEGTEETL 381 (381) Q Consensus 336 --~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~--~~~~~~~~~~~~ 381 (381) -+..-..+|..--+.-+-.++.-. +++.+ ..++-+..|..= T Consensus 312 iE~y~s~Ne~YvVEd~~~~a~iE~i~-----v~~~~~pa~~~~~~~~~~~ 356 (358) T protein:vir:78 312 FDNQYWRMEGYALGEHKAYGGFEEAD-----IEIGADPAVLAVEAAAQAG 356 (358) T ss_pred ccchhhhcceeeeeccccEEEEeeee-----eeeCCCCCccccCCccccC Confidence 111112334333222222232222 22211 111111222222 No 175 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=90.48 E-value=0.011 Score=31.22 Aligned_cols=192 Identities=14% Similarity=0.115 Sum_probs=89.8 Q ss_pred EEeeeccHHhhh-----cCHHHHHHHHHHHHHHHHHHHHhhheee----ccCCCcceEeeeccccccccccccccceeee Q lcl|NC_019921. 158 TAFVVLPKDLND-----FGPAWIERFVRVQIEEAFAVALETAFLK----GTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) Q Consensus 158 ~~~~~iS~ell~-----ds~~~~e~~l~~~la~~~~~~~~~a~i~----G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~ 228 (381) ---.-+|.-+++ ++..|+.+...++++++++...|..++. +..+..|..- .+.++.. .. T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~--------~~~g~~~----~~ 68 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTG--------QDGGFSV----NI 68 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccc--------cccCcce----ec Confidence 111234444444 5778899999999999999999988752 3222222100 0000000 00 Q ss_pred eeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhh-------ccCCCCccccc----cCCC Q lcl|NC_019921. 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT-------HLNANGVYVTA----LPFN 297 (381) Q Consensus 229 ~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~-------~~~~~G~~~~~----l~~G 297 (381) ......++..+++.+.++...| +.+..+.. +-+++++|..++.+++... ..+++|....+ ...| T Consensus 69 ~a~~t~~~~~l~dai~~a~~~L----dekdVP~~-gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G 143 (221) T protein:vir:17 69 GAGNTNNAQAIVDGFFEAAAVL----DERSAPMD-GRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAG 143 (221) T ss_pred cccccCCHHHHHHHHHHHHHHH----hhcCCCCC-CCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecC Confidence 0001122334455555544443 33333322 3345679988888765311 11223322221 2359 Q ss_pred ceeEecCCCCC--CcEEEEeecceEEEeecceEEEeehhhhhhcCceEEEE-EEEEcCEEecCceEEEEEEEecCCcccc Q lcl|NC_019921. 298 LNVIESTVQEA--GKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTA-KQFAYGKAKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 298 ~pVv~s~~~p~--~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~-~~r~dGk~~~~~Afvv~~~~~~~~~~~~ 374 (381) .+|+.|+++|. +.-+..+...+. ... .....||+ +.-.=|.+..++|...++|=-.++.|.+ T Consensus 144 ~~V~~SnnlP~~~gt~~~~~ag~~~--------~~~-------~~~~~yr~~fs~~~glv~~~~Avgtvkl~~~~~~~~~ 208 (221) T protein:vir:17 144 IRIYKSNVLASLYGTNLVTDPGDAT--------TSG-------ENNGSYRPAITDRAGLVFHKEAADTVEVLLPPSRPPL 208 (221) T ss_pred cEEEEeccCCcccccccccCCcccc--------ccc-------cccccccccccceEEEEEcchheeeeeeecCCCCCce Confidence 99999999996 221111111111 000 00111111 1112288899999555554444444544 Q ss_pred ccC------cccC Q lcl|NC_019921. 375 EGT------EETL 381 (381) Q Consensus 375 ~~~------~~~~ 381 (381) ... ||.- T Consensus 209 ~~~~~~~~~~~~~ 221 (221) T protein:vir:17 209 VISMFSIRRPDRR 221 (221) T ss_pred eeeeeeccCCCCC Confidence 433 2222 No 176 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=89.95 E-value=0.023 Score=29.51 Aligned_cols=290 Identities=11% Similarity=0.016 Sum_probs=138.8 Q ss_pred cCHHHHHHHHHHhhc-------cCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEeec Q lcl|NC_019921. 65 LSANQRSFFMDINKN-------VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKI 135 (381) Q Consensus 65 lt~~e~~~~~~~~~~-------~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~a~wv~e 135 (381) |+.+-|..|+.+... ......+.|-+.....+...+.+.+-+++.++++++.- +-++....+++-++=..- T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t 80 (337) T protein:vir:10 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecC Confidence 444444444332211 12234567888899999999999999999999988752 122333333332222111 Q ss_pred c-cccccccCcceeeEeecceeEEEeeeccHHhhhc--CHHHHHHHHHHHHHHHHHHHHhhheeeccCC-------Ccc- Q lcl|NC_019921. 136 Y-GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTGK-------DQP- 204 (381) Q Consensus 136 ~-~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~d--s~~~~e~~l~~~la~~~~~~~~~a~i~G~G~-------~~P- 204 (381) + .++.+.+-..++.-.+..++.-.-..|+.+.|+. ...|+..-+++.+.++++.-.=.--++|+-. .-| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPl 160 (337) T protein:vir:10 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcC Confidence 1 1222222223444555555555567899999983 2347888899999999887666666777651 123 Q ss_pred -----eEeeecccc---cccccccccc-ceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc--CceEEEEchhh Q lcl|NC_019921. 205 -----IGLNRQVQK---GVSVTEGAYP-EKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNPSD 273 (381) Q Consensus 205 -----~Gil~~~~~---~~~~~~~~~~-~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~--~~a~~~mn~~t 273 (381) +|.|..+-. ....+.++.. .+..+|+. ..+..|..++..+... ..+..|+ +..+.+|.+.- T Consensus 161 lqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~------gdy~nLDalV~D~~~~--lI~~~~~~d~~LVvivG~dL 232 (337) T protein:vir:10 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKA------GDYENLDALVMDIVSS--MIDPWFQEDTGLVVICGREL 232 (337) T ss_pred ccccchhHHHHHHhcchhhhhccccccCcceeecCC------CCcccHHHHHHHHHhc--cCChHHhcCCCEEEEEchhh Confidence 344432211 0000111110 11111211 1122233333322110 0123344 35788888766 Q ss_pred HHHHhhhhhccCCCCcc--------c--cccCCCceeEecCCCCCCcEEEEeecceEE-EeecceEEEe--ehhh-hhhc Q lcl|NC_019921. 274 AFEVQAQYTHLNANGVY--------V--TALPFNLNVIESTVQEAGKVLTYVKGLYDG-YLAGGINVQK--FKET-LALD 339 (381) Q Consensus 274 ~~~~~~~~~~~~~~G~~--------~--~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i-~~r~~i~i~~--~~~~-~~~~ 339 (381) ...-..+...+ ...+ + ....-|+|.+.-+++|++.+++=-|++.-| ..++..+=.. .++. ++.+ T Consensus 233 ladk~~~l~n~--~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ 310 (337) T protein:vir:10 233 LHDKYFPIVNA--TQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIEN 310 (337) T ss_pred hhHHhhHHhcc--CCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccccc Confidence 55333222222 1111 1 112248999999999999998877777433 3333333211 1111 1111 Q ss_pred CceEEEEEEEEcCEEecCc-eEEEE-EEEecCC Q lcl|NC_019921. 340 DMDLYTAKQFAYGKAKDNK-VAAVW-KLDLKGH 370 (381) Q Consensus 340 d~~~~r~~~r~dGk~~~~~-Afvv~-~~~~~~~ 370 (381) . ..|-+|-+|-+. +++++ .|+++.. T Consensus 311 y------~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:10 311 Y------ESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred h------hhccceeeeeccccEEEEeceeecCC Confidence 0 012222222221 22221 2333333 No 177 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=89.79 E-value=0.024 Score=29.43 Aligned_cols=299 Identities=12% Similarity=0.044 Sum_probs=143.2 Q ss_pred cCHHHHHHHHHHhh------cc---CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEe Q lcl|NC_019921. 65 LSANQRSFFMDINK------NV---NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWG 133 (381) Q Consensus 65 lt~~e~~~~~~~~~------~~---~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~a~wv 133 (381) |+.+-|..|+.+.. +. .....+.|-+.+...+.+.+.+.|-+++.++++++.- +-++....+++-++=+ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrt 80 (355) T protein:vir:18 1 MRQETRFKFNAYLTQLAKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGVTGTIASTT 80 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeeccCcceeecc Confidence 33444444433211 11 1234577888899999999999999999999998752 1233333333333222 Q ss_pred ec--ccccccccCcceeeEeecceeEEEeeeccHHhhhc--CHHHHHHHHHHHHHHHHHHHHhhheeeccCC---C---- Q lcl|NC_019921. 134 KI--YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTGK---D---- 202 (381) Q Consensus 134 ~e--~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~d--s~~~~e~~l~~~la~~~~~~~~~a~i~G~G~---~---- 202 (381) .- ..++.......++.-.+..++.-.-..|+.+.|+. ...|+..-+++.+.++++.-.=.--++|+-. . T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~ 160 (355) T protein:vir:18 81 DTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTTRADTSDRVK 160 (355) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhh Confidence 11 11222222233444555555555667899999883 2346888888899998887666666677651 1 Q ss_pred cc------eEeeecccccc---ccccc------cccceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc--Cce Q lcl|NC_019921. 203 QP------IGLNRQVQKGV---SVTEG------AYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNV 265 (381) Q Consensus 203 ~P------~Gil~~~~~~~---~~~~~------~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~--~~a 265 (381) -| +|.|..+-... .-..+ .......+|+ ...+..|..++..+... ..+..|+ +.. T Consensus 161 nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~------~gdy~NLDAlV~d~~~~--lI~~~~~~d~dL 232 (355) T protein:vir:18 161 NPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGK------NGDYENLDALVMDGTNT--LIDEIYQDDPKL 232 (355) T ss_pred CcCccccchhHHHHHHhcchhhhhccccccccccccceeeecC------CCCcccHHHHHHHHHhc--cCChHHhcCCCE Confidence 23 34442221100 00001 0111111111 11222333333332211 0123333 357 Q ss_pred EEEEchhhHHHHhhhhhccC-CC-----Cccc--cccCCCceeEecCCCCCCcEEEEeecceEE-EeecceEEEe--ehh Q lcl|NC_019921. 266 TMVVNPSDAFEVQAQYTHLN-AN-----GVYV--TALPFNLNVIESTVQEAGKVLTYVKGLYDG-YLAGGINVQK--FKE 334 (381) Q Consensus 266 ~~~mn~~t~~~~~~~~~~~~-~~-----G~~~--~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i-~~r~~i~i~~--~~~ 334 (381) +.+|.+.-.+.-..+...+. .+ ++-+ ....-|+|.+.-+++|++.+++=-|++.-| ..++..+=.. .++ T Consensus 233 VvivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~ 312 (355) T protein:vir:18 233 VAIVGRKLLADKYFPLVNKQQENTESLAADIIISQKRIGNLPAVRVPYFPANAVFVTTLENLSIYFMDESHRRSIDENPK 312 (355) T ss_pred EEEEchhhhHHHHhHHhhccCChHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccc Confidence 88888765543322222221 11 1111 112348999999999999998877777433 2333333111 111 Q ss_pred h----hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCC--ccccccCc Q lcl|NC_019921. 335 T----LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH--KPALEGTE 378 (381) Q Consensus 335 ~----~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~--~~~~~~~~ 378 (381) . -+..-..+|..--+.-+-.++ .++++.. .++.++.. T Consensus 313 r~rie~y~s~Ne~YvVEd~~~~a~ie-------ni~~~~~~~~~~~~~g~ 355 (355) T protein:vir:18 313 KDRVENYESMNIDYVVEAYAAGCLLE-------NITLGDFTAPAAPEGGE 355 (355) T ss_pred cccccchhhhcceeeeeccccEEEEe-------eeeecCCCCcccccCCC Confidence 1 111122344333322222222 2333322 23333333 No 178 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=89.42 E-value=0.026 Score=29.23 Aligned_cols=290 Identities=11% Similarity=0.012 Sum_probs=138.5 Q ss_pred cCHHHHHHHHHHhhc-------cCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEeec Q lcl|NC_019921. 65 LSANQRSFFMDINKN-------VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKI 135 (381) Q Consensus 65 lt~~e~~~~~~~~~~-------~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~a~wv~e 135 (381) |+.+-|..|+.+... ....-.+.|-+.....+...+.+.+-+++.++++++.- +-++-...+++-++=..- T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t 80 (337) T protein:vir:79 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecC Confidence 444444444432211 12233467878899999999999999999999988752 122333333332222111 Q ss_pred c-cccccccCcceeeEeecceeEEEeeeccHHhhhc--CHHHHHHHHHHHHHHHHHHHHhhheeeccCC-------Ccc- Q lcl|NC_019921. 136 Y-GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTGK-------DQP- 204 (381) Q Consensus 136 ~-~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~d--s~~~~e~~l~~~la~~~~~~~~~a~i~G~G~-------~~P- 204 (381) + .++.+.+-..++.-.+..++.-.-..|+.+.|+. ...|+..-+++.+.++++.-.=.--++|+-. .-| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPl 160 (337) T protein:vir:79 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcC Confidence 1 1222222223444555555555567899999983 2347888899999999887666666777651 123 Q ss_pred -----eEeeecccc---cccccccccc-ceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc--CceEEEEchhh Q lcl|NC_019921. 205 -----IGLNRQVQK---GVSVTEGAYP-EKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNPSD 273 (381) Q Consensus 205 -----~Gil~~~~~---~~~~~~~~~~-~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~--~~a~~~mn~~t 273 (381) +|.|..+-. ....+.++.. .+..+|+. ..+..|..++..+... ..+..|+ +..+.+|.+.- T Consensus 161 lqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~------gdy~nLDalV~D~~~~--lI~~~~~~d~~LVvivG~dL 232 (337) T protein:vir:79 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKA------GDYENLDALVMDIVSS--MIDPWFQEDTGLVAICGREL 232 (337) T ss_pred ccccchhHHHHHHhcchhhhhccccccCcceeecCC------CCcccHHHHHHHHHhc--cCChHHhcCCCEEEEEchhh Confidence 344432211 0000111100 11111211 1222233333322110 0123344 35788888766 Q ss_pred HHHHhhhhhccCCCCcc--------c--cccCCCceeEecCCCCCCcEEEEeecceEE-EeecceEEEe--ehhh-hhhc Q lcl|NC_019921. 274 AFEVQAQYTHLNANGVY--------V--TALPFNLNVIESTVQEAGKVLTYVKGLYDG-YLAGGINVQK--FKET-LALD 339 (381) Q Consensus 274 ~~~~~~~~~~~~~~G~~--------~--~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i-~~r~~i~i~~--~~~~-~~~~ 339 (381) ...-..+...+ ...+ + ....-|+|.+.-+++|++.+++=-|++.-| ..++..+=.. .++. ++.+ T Consensus 233 ladk~~~l~n~--~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ 310 (337) T protein:vir:79 233 LHDKYFPIVNA--TQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIEN 310 (337) T ss_pred hhHHhhHHhcc--CCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccccc Confidence 55333222222 1111 1 112248999999999999998877777433 3333333211 1111 1111 Q ss_pred CceEEEEEEEEcCEEecC-ceEEEE-EEEecCC Q lcl|NC_019921. 340 DMDLYTAKQFAYGKAKDN-KVAAVW-KLDLKGH 370 (381) Q Consensus 340 d~~~~r~~~r~dGk~~~~-~Afvv~-~~~~~~~ 370 (381) . ..|-+|-+|-+ ++++++ .|+++.. T Consensus 311 y------~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 311 Y------ESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred h------hhccceeeeeccccEEEEeceeecCC Confidence 0 01222222222 122221 2333333 No 179 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=89.03 E-value=0.029 Score=29.04 Aligned_cols=262 Identities=14% Similarity=0.066 Sum_probs=84.9 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhce---eE----ecCCceEE-EEe-cCCcce-EEeecccccccccCc Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLG---IK----NAGLRLKF-LKS-ETSGVA-VWGKIYGEIKGQLDA 145 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~---v~----~~~g~~~~-p~~-~~~~~a-~wv~e~~~~~~~~~~ 145 (381) |.+..-+|= .+--+.+....+|.+.+.-.++..+. +. +..|.+.. +.- ...... .-+...+...+..-. T Consensus 1 ~~~t~~sdl-~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~kit 79 (315) T protein:vir:96 1 MATTVNSDL-VIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKIA 79 (315) T ss_pred Cceeeecce-eeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceecc Confidence 222222220 11123444444555444333332211 11 12232211 100 000000 000000111110000 Q ss_pred ceeeEeecceeEE-Ee--eeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheee-ccCCCcceEeeecccccccccccc Q lcl|NC_019921. 146 AFSEETAIQNKLT-AF--VVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK-GTGKDQPIGLNRQVQKGVSVTEGA 221 (381) Q Consensus 146 ~f~~v~l~~~kl~-~~--~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~-G~G~~~P~Gil~~~~~~~~~~~~~ 221 (381) +...+.. +++ +. +..+...+....-|.+.++. .+...+..+.-..+++ +-+ |++..+....... T Consensus 80 ~~~dvaV---k~~~~~~~~~~~~~~~a~~g~dp~~~~~-~i~~~~~~~~l~~~l~~~l~-----~~~aai~~~t~~~--- 147 (315) T protein:vir:96 80 ADEMVSV---KVPWKYGPYETTEEAFKRRARSPEEFSM-LIGQDMADATMAGWIGYALN-----ALQGAIGSNAGMN--- 147 (315) T ss_pred cccceeE---EEeecCCchhccHHHHHHhhcCHHHHHH-HHHHHHHHHHHHHHHHHHHh-----hhhhhhccccccc--- Confidence 1111111 222 22 23344445544455555544 2333333333222221 100 1111111110000 Q ss_pred ccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhh---cc-CCCCccccccC-- Q lcl|NC_019921. 222 YPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HL-NANGVYVTALP-- 295 (381) Q Consensus 222 ~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~---~~-~~~G~~~~~l~-- 295 (381) . +...+......+.+..+.+ ......=..|+||..++..+++... .. +.++. +...+ T Consensus 148 -----~----~~~~a~~~~~~l~dA~~kl-------GD~~~~l~~~vMHS~v~~~L~~q~L~~~~~~~~~~~-~~~~~~~ 210 (315) T protein:vir:96 148 -----V----SGELATEGKKVLTKGLRTM-------GDKASSIAIWVMDSTSYFDIVDEAIDNKLYEEAGVV-VYGGTPG 210 (315) T ss_pred -----c----cccccccCHHHHHHHHHHh-------cccccCeeEEEEchHHHHHHHHhhhhhhccccccee-EecCcCc Confidence 0 0111122233333333332 1112223469999999999876321 11 22332 22111 Q ss_pred -CCceeEecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhh----cCceEEEEEEEEcC-EEecCceEEEEEEEecC Q lcl|NC_019921. 296 -FNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLAL----DDMDLYTAKQFAYG-KAKDNKVAAVWKLDLKG 369 (381) Q Consensus 296 -~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~----~d~~~~r~~~r~dG-k~~~~~Afvv~~~~~~~ 369 (381) +|++|++++.||...++.. -.+.+.+.......+. .++.-.....|.++ -.+.+..|.. +.++ T Consensus 211 ~lGkrViVdD~~P~~~~~gl--------~~GAi~~~~~~~~~~~~~~~~g~e~l~~~~r~e~tf~l~p~G~sw---~~~~ 279 (315) T protein:vir:96 211 TLGKPVLVTDQCPATKIFGL--------VAGAVMITESQAPGMRSYQIDDQENLAIGFRAEGTANVEVLGYKW---KTKT 279 (315) T ss_pred ccccEEEEECCCCcceeeee--------ecceeeecCCCccccccccCCCcceeEEEEeeeeEeeeeeeeEEe---ecCC Confidence 5999999999997543221 1122222222221111 22222222233332 3455555443 1111 Q ss_pred CccccccCcccC Q lcl|NC_019921. 370 HKPALEGTEETL 381 (381) Q Consensus 370 ~~~~~~~~~~~~ 381 (381) . .+.|-..| T Consensus 280 ~---~sPt~aeL 288 (315) T protein:vir:96 280 N---VNPASATL 288 (315) T ss_pred C---cCCChHHh Confidence 1 01111112 No 180 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=88.33 E-value=0.033 Score=28.71 Aligned_cols=271 Identities=11% Similarity=-0.029 Sum_probs=89.1 Q ss_pred cCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhc-----ee--EecCCc-eEEEEecCC-c---ceEE Q lcl|NC_019921. 65 LSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADL-----GI--KNAGLR-LKFLKSETS-G---VAVW 132 (381) Q Consensus 65 lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~-----~v--~~~~g~-~~~p~~~~~-~---~a~w 132 (381) ++--....|| +.+....+|.+.+.......+ -. .+..|. +.+|.-.+. + ...- T Consensus 1 m~lsD~~vfN---------------~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~ 65 (325) T protein:vir:95 1 MALSDLAVYS---------------EYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRN 65 (325) T ss_pred Cchhhhhhhh---------------hhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeecccccccccccccccc Confidence 1111222222 222222333333222111111 11 122344 345543321 1 1001 Q ss_pred eecccccccccCcceeeEeecceeEEEeeeccHHhhh---cCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeee Q lcl|NC_019921. 133 GKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLND---FGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNR 209 (381) Q Consensus 133 v~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~---ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~ 209 (381) ..+.+...+..-.++.++....+.-.+......+.+. +....+...|.+.+++..-+.+-..++.+-. |.++ T Consensus 66 ~~~~~~vt~~kitt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~-----~a~~ 140 (325) T protein:vir:95 66 AYGSGTVAEKVLKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVY-----SALS 140 (325) T ss_pred CCCCceeccceeccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----Hhhc Confidence 1112222222222333433333333332222222211 2222233333333333322221111211110 1111 Q ss_pred ccccccccccccccceeeeee-ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhc----- Q lcl|NC_019921. 210 QVQKGVSVTEGAYPEKEEQGT-LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH----- 283 (381) Q Consensus 210 ~~~~~~~~~~~~~~~~~~~~~-~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~----- 283 (381) ..+ ..+. .+.+ ....+.......+.+..+.+ . .....=..|+||+.++..+++..+. T Consensus 141 ~~~------~~v~----dis~~~~~~~~~~s~~~l~~A~~kl-G------D~~~~l~~~~MHS~v~~~L~~~~L~~~~~~ 203 (325) T protein:vir:95 141 QVS------DVVY----DATANTDAADKLPTWNNLNNGQAKF-G------DQSSQIAAWIMHSTPMHKLYGSNLTNGERL 203 (325) T ss_pred ccc------ccee----eeecccCcccccccHHHHHHHHHHh-c------ccccceeEEEEchHHHHHHHHhhccccccc Confidence 000 0000 0000 01111111233344433332 1 1112224699999999999765443 Q ss_pred cCCCCccccccCCCceeEecCCCCCCcE---------EEEeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEcCEE Q lcl|NC_019921. 284 LNANGVYVTALPFNLNVIESTVQEAGKV---------LTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKA 354 (381) Q Consensus 284 ~~~~G~~~~~l~~G~pVv~s~~~p~~~i---------~fgd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~dGk~ 354 (381) .+..|......++|++|++++.||.... +||. -...+....++.........-.+...+||+... -. T Consensus 204 ~~~~g~~~i~t~~G~~VIVdD~~p~~~~g~~~~ytty~lg~-GAi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t---f~ 279 (325) T protein:vir:95 204 FTYGTVNVVRDPFGKLLVMTDSPNLFAAGTPNVYHILGLVP-GGVLIGQNNDFDANEETKNGDENIIRTYQAEWS---YN 279 (325) T ss_pred cccCCcccccccCCcEEEEeCCCCCCCccCceeEEEEEEec-CeEEecCCCCccccccccCcccceeeeeeeeee---EE Confidence 2344544344557999999999995331 1111 111122222222221111122233344443221 34 Q ss_pred ecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 355 KDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 355 ~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) +.|..+.. +.+... .+.|-..| T Consensus 280 lhp~G~sw---~~s~~g--~sPt~aeL 301 (325) T protein:vir:95 280 IGVKGFAW---DKANGG--KSPTDAAL 301 (325) T ss_pred eecceeee---eccccc--CCcChHhh Confidence 55666543 111111 11111123 No 181 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=87.90 E-value=0.036 Score=28.52 Aligned_cols=300 Identities=12% Similarity=0.036 Sum_probs=144.8 Q ss_pred cCHHHHHHHHHHhh------cc---CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEe Q lcl|NC_019921. 65 LSANQRSFFMDINK------NV---NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWG 133 (381) Q Consensus 65 lt~~e~~~~~~~~~------~~---~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~a~wv 133 (381) |+.+-|..|+.+.. +. +....+.|-+.+...+...+.+.|-+++.++++++.- +-++....+++-++=. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:56 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 44444444433211 11 1234578888999999999999999999999988752 1233333333333221 Q ss_pred ec--ccccccccCcceeeEeecceeEEEeeeccHHhhhc--CHHHHHHHHHHHHHHHHHHHHhhheeeccCC-------C Q lcl|NC_019921. 134 KI--YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTGK-------D 202 (381) Q Consensus 134 ~e--~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~d--s~~~~e~~l~~~la~~~~~~~~~a~i~G~G~-------~ 202 (381) .- +.++.+..-..++.-.+..++.-.-..|+.++|+. ...|+..-+++.+.++++.-.=.--++|+-. . T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:56 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETSDRSS 160 (357) T ss_pred cCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 11 11222221124444455555555557788888873 2246888888888888877655555677642 1 Q ss_pred cc------eEeeecccccc--------ccccc-cccceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc--Cce Q lcl|NC_019921. 203 QP------IGLNRQVQKGV--------SVTEG-AYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNV 265 (381) Q Consensus 203 ~P------~Gil~~~~~~~--------~~~~~-~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~--~~a 265 (381) -| +|.|..+-... ...+| +..+...+|. ...+..|..++..+... ..+..|+ +.. T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~------~gdy~NLDalV~D~~~~--lI~~~~~~d~dL 232 (357) T protein:vir:56 161 NPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGK------GGDYASLDALVMDATNN--LIEPWYQEDPDL 232 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecC------CCCcccHHHHHHHHHhc--cCChHHhcCCCE Confidence 23 34442221100 00011 1111111121 11222333333322111 0133344 357 Q ss_pred EEEEchhhHHHHhhhhhccCCCCcc--------c--cccCCCceeEecCCCCCCcEEEEeecceEE-EeecceEEEe--e Q lcl|NC_019921. 266 TMVVNPSDAFEVQAQYTHLNANGVY--------V--TALPFNLNVIESTVQEAGKVLTYVKGLYDG-YLAGGINVQK--F 332 (381) Q Consensus 266 ~~~mn~~t~~~~~~~~~~~~~~G~~--------~--~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i-~~r~~i~i~~--~ 332 (381) +.+|-+.-...-..+...+ .+.+ + ....=|+|.+.-+++|.+.+++=-|++.-| ..++..+=.. . T Consensus 233 VvivG~dLla~k~~~l~n~--~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~ 310 (357) T protein:vir:56 233 VVIVGRQLLADKYFPIVNK--EQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEEN 310 (357) T ss_pred EEEEchhhhhhhhhhHhhc--cCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEec Confidence 8888876655433222222 1111 1 112248999999999999998877766332 3333333211 1 Q ss_pred hhh----hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 333 KET----LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 333 ~~~----~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) ++. -+..-..+|..--+.-+-.++ .++++...+..+.+|+.- T Consensus 311 p~r~riE~y~s~Ne~YvVEd~~~~a~iE-------~i~i~~~~~~~~~~~~~~ 356 (357) T protein:vir:56 311 PKLDRVENYESMNIDYVVEDYAAGCLVE-------KIKVGDFSTPAKATEEPG 356 (357) T ss_pred cccccccchhhhcceeeeeccccEEEee-------eeeeccCCCCcccCCCCC Confidence 111 111122334333222222222 344555554555555544 No 182 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=87.44 E-value=0.039 Score=28.32 Aligned_cols=292 Identities=11% Similarity=0.038 Sum_probs=135.9 Q ss_pred cCHHHHHHHHHHhh------cc-----CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC---CceEEEEecCCcce Q lcl|NC_019921. 65 LSANQRSFFMDINK------NV-----NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG---LRLKFLKSETSGVA 130 (381) Q Consensus 65 lt~~e~~~~~~~~~------~~-----~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~---g~~~~p~~~~~~~a 130 (381) |+.+-|..|+.+.. +. ..+.-+.|.+.+...+.+.+.+.|-+++.++++++. |.+ .....++..+ T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v-~~~~~sg~~t 79 (343) T protein:vir:98 1 MNKTAQELFYSLIGDAAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAI-DLRSNRKRHY 79 (343) T ss_pred CChHHHHHHHHHHHHHHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceE-EEeecCcccc Confidence 44444444443321 11 122348899999999999999999999999998863 322 2222222211 Q ss_pred EEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcC--HHH-HHHHHHHHHHHHHHHHHhhheeeccC----CCc Q lcl|NC_019921. 131 VWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFG--PAW-IERFVRVQIEEAFAVALETAFLKGTG----KDQ 203 (381) Q Consensus 131 ~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds--~~~-~e~~l~~~la~~~~~~~~~a~i~G~G----~~~ 203 (381) +-....+...... .-+.-.+..++.-.-..|+.++|+.= ..| +..-+++.+.++++.-.=.--++|+- +.. T Consensus 80 ~r~~t~~~~~~~~--~~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T~n 157 (343) T protein:vir:98 80 GAHDRRTPIQQRW--TRQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDTSD 157 (343) T ss_pred CccccCCCccccc--cCCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeeccCCCC Confidence 1111101000000 00111244444444577899988842 245 77788888888877655555566664 224 Q ss_pred c------eEeeeccccc---ccccccccc-ceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc--CceEEEEch Q lcl|NC_019921. 204 P------IGLNRQVQKG---VSVTEGAYP-EKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNP 271 (381) Q Consensus 204 P------~Gil~~~~~~---~~~~~~~~~-~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~--~~a~~~mn~ 271 (381) | +|.|..+-.. ..-+.+... ....+|+. ..+..|..++..+.. ..+..|+ +..+.+|.+ T Consensus 158 PllqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~g------gdy~NLDalV~D~~~---~I~~~~~~d~dLVvivG~ 228 (343) T protein:vir:98 158 PNLADVNKGWIQFVRENKATQILTQGATSGEIRLFGEG------ADYVNLDELAYDLKQ---GLDARHRDAGDLVFLVGA 228 (343) T ss_pred cchhhcchHHHHHHHhcchhhhhccceeccceeEecCC------CCcccHHHHHHHHHh---cCchHHhcCCCEEEEEch Confidence 4 3444322111 111111111 11111221 122333333333321 1234444 357888887 Q ss_pred hhHHHHhhhhhccCCCCccc-----------cccCCCceeEecCCCCCCcEEEEeecceEE-EeecceEEEe--ehhh-- Q lcl|NC_019921. 272 SDAFEVQAQYTHLNANGVYV-----------TALPFNLNVIESTVQEAGKVLTYVKGLYDG-YLAGGINVQK--FKET-- 335 (381) Q Consensus 272 ~t~~~~~~~~~~~~~~G~~~-----------~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i-~~r~~i~i~~--~~~~-- 335 (381) .-.+.-..+.. +..++.. ....=|+|.+.-+++|++.+++=-|++.-| ..++..+=.. .++. T Consensus 229 dLla~~~~~l~--n~~~~~ptEk~Aa~~~~~~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~r 306 (343) T protein:vir:98 229 DLVAKEASLVY--KGNGLIATEKAALNTHDLMKSFGGMPAMIVPNMPPRAAIVTSLSNLSIYTQEGSMRRGMKDDDDKKA 306 (343) T ss_pred hhhhhhhhhhh--hhcCCChHHHHHHHHHHHHHhhCCCeeEEccccCCCceEEeeccccEEEEecCcEEEEEEecccccc Confidence 65544333222 2223211 112248999999999999998877776333 3333333211 1111 Q ss_pred --hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccc Q lcl|NC_019921. 336 --LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALE 375 (381) Q Consensus 336 --~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~ 375 (381) -+..-..+|..--+.-+-.++.- ++++.+.+.+=+ T Consensus 307 ie~y~s~Ne~YvVEd~~~~a~iE~i-----~v~~~~~~g~w~ 343 (343) T protein:vir:98 307 VRDSYYRNEAYAVEDCGKFMAVDFT-----KVKLSSGKGTWK 343 (343) T ss_pred ccchhhhcceeeeeccccEEEeeee-----eeeecCCCCCCC Confidence 11112234433333333333322 234433333222 No 183 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=85.14 E-value=0.055 Score=27.49 Aligned_cols=300 Identities=11% Similarity=0.032 Sum_probs=142.2 Q ss_pred cCHHHHHHHHHHhh------cc---CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEe Q lcl|NC_019921. 65 LSANQRSFFMDINK------NV---NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWG 133 (381) Q Consensus 65 lt~~e~~~~~~~~~------~~---~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~a~wv 133 (381) |+.+-|..|+.+.. +. +....+.|-+.+...+...+.+.|-+++.++++++.- +-++....+++-++=. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:20 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 44444444433211 11 1234578888999999999999999999999988752 1233333333333221 Q ss_pred ec--ccccccccCcceeeEeecceeEEEeeeccHHhhhc--CHHHHHHHHHHHHHHHHHHHHhhheeeccCC-------C Q lcl|NC_019921. 134 KI--YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTGK-------D 202 (381) Q Consensus 134 ~e--~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~d--s~~~~e~~l~~~la~~~~~~~~~a~i~G~G~-------~ 202 (381) .- +.++.+..-..++.-.+..++.-.-..|+.++|+. ...|+..-+++.+.++++.-.=.--++|+-. . T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:20 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETSDRSS 160 (357) T ss_pred cCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 11 11222221124444455555555557788888873 2246888888888888877655555677642 1 Q ss_pred cc------eEeeecccccc--------ccccc-cccceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc--Cce Q lcl|NC_019921. 203 QP------IGLNRQVQKGV--------SVTEG-AYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNV 265 (381) Q Consensus 203 ~P------~Gil~~~~~~~--------~~~~~-~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~--~~a 265 (381) -| +|.|..+-... ....| +..+...+|. ...+..|..++..+... ..+..|+ +.. T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~------~gdy~NLDalV~D~~~~--lI~~~~~~d~dL 232 (357) T protein:vir:20 161 NPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGK------GGDYASLDALVMDATNN--LIEPWYQEDPDL 232 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccccccccceeeecC------CCCcccHHHHHHHHHhc--cCChHHhcCCCE Confidence 23 34442211100 00011 1111111221 11222333333322111 0133344 357 Q ss_pred EEEEchhhHHHHhhhhhccCCCCcc--------c--cccCCCceeEecCCCCCCcEEEEeecceEE-EeecceEEEe--e Q lcl|NC_019921. 266 TMVVNPSDAFEVQAQYTHLNANGVY--------V--TALPFNLNVIESTVQEAGKVLTYVKGLYDG-YLAGGINVQK--F 332 (381) Q Consensus 266 ~~~mn~~t~~~~~~~~~~~~~~G~~--------~--~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i-~~r~~i~i~~--~ 332 (381) +.+|-+.-...-..+...+ .+.+ + ....=|+|.+.-+++|.+.+++=-|++.-| ..++..+=.. . T Consensus 233 VvivG~dLla~k~~~l~n~--~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~ 310 (357) T protein:vir:20 233 VVIVGRQLLADKYFPIVNK--EQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEEN 310 (357) T ss_pred EEEEchhhhhhhhhhHhhc--cCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEec Confidence 8888876655433222222 1111 1 112248999999999999998877766332 3333333211 1 Q ss_pred hhh----hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 333 KET----LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 333 ~~~----~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) ++. -+..-..+|..--+.-+-.++ .++++........+++.- T Consensus 311 p~r~riE~y~s~Ne~YvVEd~~~~a~iE-------~i~~~~~~~p~~~~~~~~ 356 (357) T protein:vir:20 311 PKLDRVENYESMNIDYVVEDYAAGCLVE-------KIKVGDFSTPAKATAEPG 356 (357) T ss_pred cccccccchhhhcceeeeeccccEEEee-------eeeeccccCCccCCCCCC Confidence 111 111112333332222222222 234444333333333333 No 184 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=84.80 E-value=0.058 Score=27.38 Aligned_cols=300 Identities=11% Similarity=0.028 Sum_probs=141.9 Q ss_pred cCHHHHHHHHHHhh------cc---CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEe Q lcl|NC_019921. 65 LSANQRSFFMDINK------NV---NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWG 133 (381) Q Consensus 65 lt~~e~~~~~~~~~------~~---~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~a~wv 133 (381) |+.+-|..|+.+.. +. +....+.|-+.+...+...+.+.|-+++.++++++.- +-++....+++-++=. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:60 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 44444444433211 11 1234578888999999999999999999999988752 1233333333333221 Q ss_pred ec--ccccccccCcceeeEeecceeEEEeeeccHHhhhc--CHHHHHHHHHHHHHHHHHHHHhhheeeccCC-------C Q lcl|NC_019921. 134 KI--YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTGK-------D 202 (381) Q Consensus 134 ~e--~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~d--s~~~~e~~l~~~la~~~~~~~~~a~i~G~G~-------~ 202 (381) .- +.++.+..-..++.-.+..++.-.-..|+.++|+. ...|+..-+++.+.++++.-.=.--++|+-. . T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:60 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETSDRSS 160 (357) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 11 12222222124444455555555557889988883 2246888888888888877655555677642 1 Q ss_pred cc------eEeeecccccc--------ccccc-cccceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc--Cce Q lcl|NC_019921. 203 QP------IGLNRQVQKGV--------SVTEG-AYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNV 265 (381) Q Consensus 203 ~P------~Gil~~~~~~~--------~~~~~-~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~--~~a 265 (381) -| +|.|..+-... ...+| +..+...+|. ...+..|..++..+... ..+..|+ +.. T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~------~gdy~NLDalV~D~~~~--lI~~~~~~d~dL 232 (357) T protein:vir:60 161 NQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGK------GGDYASLDALVMDATNN--LIEPWYQEDPDL 232 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecC------CCCcccHHHHHHHHHhc--cCChHHhcCCCE Confidence 23 34442221100 00011 1111111111 11222333333322211 0133344 357 Q ss_pred EEEEchhhHHHHhhhhhccCCCCcc--------c--cccCCCceeEecCCCCCCcEEEEeecceEE-EeecceEEEe--e Q lcl|NC_019921. 266 TMVVNPSDAFEVQAQYTHLNANGVY--------V--TALPFNLNVIESTVQEAGKVLTYVKGLYDG-YLAGGINVQK--F 332 (381) Q Consensus 266 ~~~mn~~t~~~~~~~~~~~~~~G~~--------~--~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i-~~r~~i~i~~--~ 332 (381) +.+|.+.-...-..+... ..+.+ + ....=|+|.+.-+++|.+.+++=-|++.-| ..++..+=.. . T Consensus 233 VvivG~dLla~k~~~l~n--~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~ 310 (357) T protein:vir:60 233 VVIVGRQLLADKYFPIVN--REQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEEN 310 (357) T ss_pred EEEEchhhhhHHhhhHhh--cCCChHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeeccccEEEEecCcEEEEEEec Confidence 888887765543322222 22222 1 112248999999999999998877766332 3333333211 1 Q ss_pred hhh----hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 333 KET----LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 333 ~~~----~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) ++. -+..-..+|..--+.-+-.++ .++++......+.+++.- T Consensus 311 p~r~riE~y~s~Ne~YvVEd~~~~a~iE-------~i~~~~~~~pa~~~~~~~ 356 (357) T protein:vir:60 311 PKLDRVENYESMNIDYVVEDYAAGCLVE-------KIKVGDFSTPAKATAEPG 356 (357) T ss_pred cccccccchhhhcceeeeeccccEEEee-------eeeeccCcccccCCCCCC Confidence 111 111112333333222222222 233433332222222222 No 185 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=84.61 E-value=0.059 Score=27.32 Aligned_cols=288 Identities=11% Similarity=-0.031 Sum_probs=116.4 Q ss_pred ccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeE---------ecCCc-eEEEEecCC-cc- Q lcl|NC_019921. 62 AQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK---------NAGLR-LKFLKSETS-GV- 129 (381) Q Consensus 62 ~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~---------~~~g~-~~~p~~~~~-~~- 129 (381) +.. |++..+- ....+|+.|..-+.+...+.+.|++-.-+. ..+|. +.+|.-..- +. T Consensus 1 M~~--------~~~~T~l----~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~ 68 (367) T protein:vir:80 1 MPD--------FNNQVRL----VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLE 68 (367) T ss_pred Ccc--------hhhhhhh----hhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCc Confidence 000 0100000 114678877776666666666655432222 22343 678875332 21 Q ss_pred eEEeeccc--ccccccCcceee--EeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcce Q lcl|NC_019921. 130 AVWGKIYG--EIKGQLDAAFSE--ETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPI 205 (381) Q Consensus 130 a~wv~e~~--~~~~~~~~~f~~--v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~ 205 (381) -.|.+... +.....--+..+ +.+...|-.+.-.++..|- .-|...-|.+.+++--.+.....+|. -.+ T Consensus 69 ~n~~~d~~~~~~t~~kittg~~~a~v~~r~kaw~~~Dla~~ls---G~dpm~~Ia~qva~yW~r~~q~~Lla-----~L~ 140 (367) T protein:vir:80 69 PNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELA---GSNPMTRIRNRFGVYWTRQWQRRIIA-----MAV 140 (367) T ss_pred cccCCCCCcccccccccccchheeeeehhcccchhhhHHHHhh---CchHHHHHHHHHHHHhhhhhHHHHHH-----HHH Confidence 12212111 111111112222 2222223333344555553 34666777777765444443333322 001 Q ss_pred Eeeecccccc-----------ccccccccceeeeeeec--ccccchhHHHHHHHHHHhhhccccccccccCceEEEEchh Q lcl|NC_019921. 206 GLNRQVQKGV-----------SVTEGAYPEKEEQGTLT--FANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) Q Consensus 206 Gil~~~~~~~-----------~~~~~~~~~~~~~~~~t--~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~ 272 (381) |++..-.... ....+...+.+...+.. ..+....++.+.+..+.+ . .....=..++||+. T Consensus 141 Gvf~~~~a~~~~~~~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~l-G------D~~~~l~~i~mHS~ 213 (367) T protein:vir:80 141 GVYKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTM-G------DHVGSIAAIAVHSM 213 (367) T ss_pred HhhccccccchhhhhhhhccccccccccCceeeeeeccCCCccceecHHHHHHHHHHh-c------cccccccEEEEchH Confidence 1221100000 00000000011111100 111223334444443322 1 11222346899999 Q ss_pred hHHHHhhhhh---ccCCCCccccccCCCceeEecCCCCC-----Cc----EEEEe--ecceEEEeecceEEEeehhhhhh Q lcl|NC_019921. 273 DAFEVQAQYT---HLNANGVYVTALPFNLNVIESTVQEA-----GK----VLTYV--KGLYDGYLAGGINVQKFKETLAL 338 (381) Q Consensus 273 t~~~~~~~~~---~~~~~G~~~~~l~~G~pVv~s~~~p~-----~~----i~fgd--~~~y~i~~r~~i~i~~~~~~~~~ 338 (381) .+..++++.. .++++|..--..-.|++|++++.||- ++ .+||. |.+.-..-... ++..++.... T Consensus 214 V~~~L~~~~li~~i~~sd~~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GAi~~~~~~~~~~--~E~~Rd~~~~ 291 (367) T protein:vir:80 214 VYKRMTNNDEIEFIPDSKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVP--VAVGRRELRG 291 (367) T ss_pred HHHHHHhccccccccCCCCccccceecceeEEEeCCCcccccCCCceEEEEEEecceeeecccCCccc--eecccchhhh Confidence 9998876543 35566642212224999999999994 21 34543 22222222222 3444444332 Q ss_pred --cCceEEEEEEEEcCEEecCceEEEEEEEecCCccc----------cccCcccC Q lcl|NC_019921. 339 --DDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPA----------LEGTEETL 381 (381) Q Consensus 339 --~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~----------~~~~~~~~ 381 (381) .++....-+.| .++.|..|..-.-.++.+.-. ..-|..-| T Consensus 292 ~~gG~d~L~~Rr~---~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt~~eL 343 (367) T protein:vir:80 292 NGSGLEYILERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANL 343 (367) T ss_pred cCCceEEEEeeee---EEeecceeeecccccccccccccccccccccCCCChHHh Confidence 34444444444 566777666544333322211 11222223 No 186 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=81.57 E-value=0.085 Score=26.47 Aligned_cols=287 Identities=13% Similarity=0.064 Sum_probs=127.5 Q ss_pred cCHHH-HHHHHHHhh--cc-----CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEee Q lcl|NC_019921. 65 LSANQ-RSFFMDINK--NV-----NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGK 134 (381) Q Consensus 65 lt~~e-~~~~~~~~~--~~-----~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~a~wv~ 134 (381) ++.+- ..++..+.+ +. ..+.-+.|.+.+...+.+.+.+.|-+++.++++++.- +-++-...+++-++=.. T Consensus 1 mtr~~~~~y~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtd 80 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGATEKGVTGRKQ 80 (336) T ss_pred CcHHHHHHHHHHHHHHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeeccCcccccccC Confidence 11110 011111111 11 1123488999999999999999999999999988752 12333333333322111 Q ss_pred cccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHH--Hhhh--eeeccC----CCcce- Q lcl|NC_019921. 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVA--LETA--FLKGTG----KDQPI- 205 (381) Q Consensus 135 e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~--~~~a--~i~G~G----~~~P~- 205 (381) - .+. ..+...+.-.+..++.-.-..|+.+.|+. ...+..|..+.+...+.+. +|.- -++|+- +..|. T Consensus 81 t--~R~-~~~~~l~~~~Y~c~qTn~dt~i~y~~LD~-WA~~~df~~~~~~~~~~r~iALD~i~IGfnG~s~A~~TdnPll 156 (336) T protein:vir:37 81 T--GRN-LANLDHTQNGFELAETDSGIIVPWALFDS-FAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVADNTTKADL 156 (336) T ss_pred C--Ccc-ccccCcCCcccEEEEeeeeeeecHHHHHH-HhcChhHHHHHHHHHHHHHHhhchhhhcccceeeccCCCCCcc Confidence 1 122 22334555555555555667899999984 3334555544444444432 3333 345653 23453 Q ss_pred -----Eeeeccccc---cccccccccc-e-eeeeeecccccchhHHHHHHHHHHhhhccccccccccC--ceEEEEchhh Q lcl|NC_019921. 206 -----GLNRQVQKG---VSVTEGAYPE-K-EEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKG--NVTMVVNPSD 273 (381) Q Consensus 206 -----Gil~~~~~~---~~~~~~~~~~-~-~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~--~a~~~mn~~t 273 (381) |.|..+-.. ..-+.++... + ...|+. ..+..|..++..+.. ..+..|+. ..+.+|.+.- T Consensus 157 qDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~------gdy~NLDalV~D~~~---~I~~~~~~d~dLVvivG~dL 227 (336) T protein:vir:37 157 SDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDN------ADYANLDDLAFDLKQ---GLDFRHQNRNDLVFLVGADL 227 (336) T ss_pred cccchhHHHHHHhccchhhcccccccCCceEEecCC------CCcccHHHHHHHHHh---cCchHHhcCCCeEEEEchhh Confidence 444322111 1111111100 1 111211 112223333333221 12334443 6788887755 Q ss_pred HHHHhhhhhccCCCCc-c----------ccccCCCceeEecCCCCCCcEEEEeecceEE-EeecceEEEe--ehhh---- Q lcl|NC_019921. 274 AFEVQAQYTHLNANGV-Y----------VTALPFNLNVIESTVQEAGKVLTYVKGLYDG-YLAGGINVQK--FKET---- 335 (381) Q Consensus 274 ~~~~~~~~~~~~~~G~-~----------~~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i-~~r~~i~i~~--~~~~---- 335 (381) .+.-..+.. +.+|. | +....-|+|.+.-+++|++.+++=-|++.-| ..++..+=.. .++. T Consensus 228 la~~~~~l~--~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie 305 (336) T protein:vir:37 228 VSKETKLIQ--QKHGLTPTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRNDEDKKGLV 305 (336) T ss_pred hhhhhhhhh--hhcCCCHHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEcccccccc Confidence 443322222 22221 1 1122348999999999999998877777433 3333333211 1111 Q ss_pred hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCcccc Q lcl|NC_019921. 336 LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPAL 374 (381) Q Consensus 336 ~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~ 374 (381) -+..-..+|..--+.-+-.++.-. +++ +.. + T Consensus 306 ~y~s~Ne~YvVEd~~~~a~iE~i~-----v~~--~~e-~ 336 (336) T protein:vir:37 306 TSYYRQEGYVVEDLGLMTAIDHTK-----VKL--NGE-V 336 (336) T ss_pred chhhhcceeeeeccccEEEeeeee-----eee--cCc-C Confidence 011112333332222222222111 111 111 1 No 187 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=81.57 E-value=0.085 Score=26.47 Aligned_cols=290 Identities=11% Similarity=0.016 Sum_probs=137.5 Q ss_pred cCHHHHHHHHHHhhc-------cCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEeec Q lcl|NC_019921. 65 LSANQRSFFMDINKN-------VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKI 135 (381) Q Consensus 65 lt~~e~~~~~~~~~~-------~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~a~wv~e 135 (381) |+.+-|..|+.+... ......+.|-+.+...+...+.+.|-+++.++++++.- +-++....+++-++=..- T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt 80 (337) T protein:vir:78 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcceeeeecC Confidence 344444444332111 12234577888999999999999999999999988752 122333333332221111 Q ss_pred c-cccccccCcceeeEeecceeEEEeeeccHHhhhc--CHHHHHHHHHHHHHHHHHHHHhhheeeccCC-------Ccc- Q lcl|NC_019921. 136 Y-GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTGK-------DQP- 204 (381) Q Consensus 136 ~-~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~d--s~~~~e~~l~~~la~~~~~~~~~a~i~G~G~-------~~P- 204 (381) + .++.+.+-..++.-.+..++.-.-..|+.++|+. ...|+..-+++.+.++++.-.=.--++|+-. .-| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPl 160 (337) T protein:vir:78 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcC Confidence 1 1222222223444444455544556789998883 3346888888888888887665555677642 123 Q ss_pred -----eEeeeccccc---ccccccccc-ceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc--CceEEEEchhh Q lcl|NC_019921. 205 -----IGLNRQVQKG---VSVTEGAYP-EKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNPSD 273 (381) Q Consensus 205 -----~Gil~~~~~~---~~~~~~~~~-~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~--~~a~~~mn~~t 273 (381) +|.|..+-.. ...+.++.. .+..+|+. ..+..|..++..+... ..+..++ +..+.+|.+.- T Consensus 161 lqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~------gdy~NLDalV~d~~~~--lI~~~~~~d~dLVvivG~dL 232 (337) T protein:vir:78 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIGKA------GDYENLDALVMDIVSS--MIDPWFQEDTGLVVICGREL 232 (337) T ss_pred ccccchHHHHHHHhcchhhhhccccccCCceeecCC------CCcccHHHHHHHHHhc--cCChHHhcCCCEEEEEchhh Confidence 3544322110 000111111 11111211 1222333333332211 0133344 35788888776 Q ss_pred HHHHhhhhhccCCCCcc--------c--cccCCCceeEecCCCCCCcEEEEeecceEE-EeecceEEEe--ehhh-hhhc Q lcl|NC_019921. 274 AFEVQAQYTHLNANGVY--------V--TALPFNLNVIESTVQEAGKVLTYVKGLYDG-YLAGGINVQK--FKET-LALD 339 (381) Q Consensus 274 ~~~~~~~~~~~~~~G~~--------~--~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i-~~r~~i~i~~--~~~~-~~~~ 339 (381) ...-..+...+ ...+ + ....=|+|.+.-+++|++.+++=-|++.-| ..++..+=.. .++. ++.+ T Consensus 233 ladk~~~l~n~--~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ 310 (337) T protein:vir:78 233 LHDKYFPIVNA--TQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIEN 310 (337) T ss_pred hHHHHHHHHhc--CCCcHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeechhcEEEEecCcEEEEEEeccccccccc Confidence 65433332222 1111 1 112248999999999999998877776333 3333333211 1111 1111 Q ss_pred CceEEEEEEEEcCEEecCc-eEEEE-EEEecCC Q lcl|NC_019921. 340 DMDLYTAKQFAYGKAKDNK-VAAVW-KLDLKGH 370 (381) Q Consensus 340 d~~~~r~~~r~dGk~~~~~-Afvv~-~~~~~~~ 370 (381) . ..|-+|-+|-+. +++++ .|+++.. T Consensus 311 y------~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 311 Y------ESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred h------hhccceeeeeccccEEEEeceeecCC Confidence 0 012222222221 22221 2333333 No 188 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=80.58 E-value=0.094 Score=26.22 Aligned_cols=333 Identities=15% Similarity=0.165 Sum_probs=122.3 Q ss_pred Cc----hhHHHHHHHHH-------------------------HHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHH----- Q lcl|NC_019921. 1 MT----INLSETFANAK-------------------------NEFINAVNNGEPQERQNELYGDMINQLFEETKL----- 46 (381) Q Consensus 1 mt----~el~~~~~~~~-------------------------~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~----- 46 (381) |. ||..+.+.+.+ +++.+.+.+...+.++ .++..+...+.... T Consensus 1 mnkpdliekqnrlaelkennvslksqisgfevknaiedl~K~~ELe~TlSe~~iEI~k---~en~LN~~eE~~KGK~kMt 77 (393) T protein:vir:16 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIK---IENELNAQEEKPKGKDKMT 77 (393) T ss_pred CCCcchhhhhhhhhhhhhcccchhhhccchhhhhhhhhchhHHHHHHhHhhcchhhhh---hhhhhhhhhhcchhhHHHH Confidence 21 22222222221 1222222111111000 00111110000000 Q ss_pred ------HHHHHHHHHHHhhccccccCHHHHHHHHH-HhhccC--CCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC Q lcl|NC_019921. 47 ------QAKAEAERVSSLPKSAQSLSANQRSFFMD-INKNVN--YKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL 117 (381) Q Consensus 47 ------~~~~~~~~~~~~~~~~~~lt~~e~~~~~~-~~~~~~--~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g 117 (381) ++..++.+.. ..+..+++-+++..+ +.+.+- ++--+.+|+.+...|-..+..+.|++....|...+. T Consensus 78 ~~iesq~A~~eF~~vL----~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~ 153 (393) T protein:vir:16 78 NFIESQNAVTEFFDVL----KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA 153 (393) T ss_pred HHHhhHHHHHHHHHHH----hccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchh Confidence 1111111111 112233444444432 223222 455678999999999999999999999877777664 Q ss_pred c-eEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHh---hhcCHHHHHHHHHHHHHHHHH-HHHh Q lcl|NC_019921. 118 R-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL---NDFGPAWIERFVRVQIEEAFA-VALE 192 (381) Q Consensus 118 ~-~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~el---l~ds~~~~e~~l~~~la~~~~-~~~~ 192 (381) - ++.-.++.. .|.-... |..+.+....|..-++.+--+|..-.+ -++ +.+|-..+-+|+..+++.+|. +..+ T Consensus 154 ~~V~~s~~s~~-eAq~Hkd-GqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd 230 (393) T protein:vir:16 154 LLVSRSFDSAN-EAQVHKD-GQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVD 230 (393) T ss_pred hhHHhhhhhhh-hhhhhcc-CCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 2 222222221 2322222 333333345555555554332221111 222 345666678999999999999 8999 Q ss_pred hheeeccCCCcceEeee--ccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEc Q lcl|NC_019921. 193 TAFLKGTGKDQPIGLNR--QVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVN 270 (381) Q Consensus 193 ~a~i~G~G~~~P~Gil~--~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn 270 (381) .|++-|||++...-+=+ .+... ...+.. ....|...+++ .+.+..+. -++..|.-.+++. T Consensus 231 ~AlV~GDG~N~f~~~DK~advK~I---~k~Ttk-aksagktpfad---aieeavdf-----------vrptagrrylivk 292 (393) T protein:vir:16 231 LALVEGDGTNGFKSIDKEADVKKI---KKITTK-AKSAGKTPFAD---AIEEAVDF-----------VRPTAGRRYLIVK 292 (393) T ss_pred hhhheecCCCCccchhhHHHHHHH---HHHhhh-hhhcCCCchhH---HHHHHHhh-----------hccCCCceEEEEe Confidence 99999999876544311 11000 000000 01112222221 12222222 1234444556665 Q ss_pred hhhHHHHhhhhhccCCCCcc-c------cccCCCce---eEecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhhcC Q lcl|NC_019921. 271 PSDAFEVQAQYTHLNANGVY-V------TALPFNLN---VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDD 340 (381) Q Consensus 271 ~~t~~~~~~~~~~~~~~G~~-~------~~l~~G~p---Vv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~~d 340 (381) ..+.-.++...-...+|.+. + -..-.|.. |+.....- ..-++.|.+ |.|-+. +++ .-+-.-+..+ T Consensus 293 tedrkalldelrqatananvriknddteiasevgvdeiivytgskal-kptvlvdqk-yhidmq-dlt--kvdafewktn 367 (393) T protein:vir:16 293 TEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKAL-KPTVLVDQK-YHIDMQ-DLT--KVDAFEWKTN 367 (393) T ss_pred ccchHHHHHHHHhhhccCceeeeccchhhhhhcCcceeeeeeccccc-cceeeeccc-cccchh-hhh--hhhhheeccC Confidence 44322222111011111110 0 00001110 11100000 000111111 221111 010 0000111111 Q ss_pred ceEEEEEEEEcCEEecCceEEEEEEE Q lcl|NC_019921. 341 MDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) Q Consensus 341 ~~~~r~~~r~dGk~~~~~Afvv~~~~ 366 (381) .--+..-...-|-+-.-+|=+|.++. T Consensus 368 snmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 368 SNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred CceEEEeecccCcceeeccceeEeeC Confidence 11111112223333333333344433 No 189 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=79.06 E-value=0.11 Score=25.88 Aligned_cols=287 Identities=13% Similarity=0.056 Sum_probs=125.4 Q ss_pred cCHHH-HHHHHHHhh--cc-----CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEee Q lcl|NC_019921. 65 LSANQ-RSFFMDINK--NV-----NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGK 134 (381) Q Consensus 65 lt~~e-~~~~~~~~~--~~-----~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g--~~~~p~~~~~~~a~wv~ 134 (381) ++.+- ..++..+.+ +. ..+.-+.|.+.+...+.+.+.+.|-+++.++++++.- +-++-...+++-++=.. T Consensus 1 mtr~~~~~y~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtd 80 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATEKGVTGRKQ 80 (336) T ss_pred CcHHHHHHHHHHHHHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccCcccccccC Confidence 11110 011111111 11 1123588999999999999999999999999988752 12333333333332111 Q ss_pred cccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHH--HHhhh--eeeccC----CCcce- Q lcl|NC_019921. 135 IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAV--ALETA--FLKGTG----KDQPI- 205 (381) Q Consensus 135 e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~--~~~~a--~i~G~G----~~~P~- 205 (381) -+..+. ....+.-.+..++.-.-..|+.+.|+. ...+..|..+.+...+.+ ++|.- -++|+- ++.|. T Consensus 81 t~r~r~---~~~l~~~~Y~c~qTn~dt~i~y~~LD~-WA~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~TdnPll 156 (336) T protein:vir:37 81 TGRNLA---TLDHSQNGYELSETDSGILVNWSLFDS-FAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVATNTTKTDL 156 (336) T ss_pred CCCCcc---ccCCCCCccEEEEeeeeeeccHHHHHH-HhcChhHHHHHHHHHHHHHHhcchhhhcccceeeccCCCCccc Confidence 111111 122334444455555567899999984 334555555554444443 24433 345553 23453 Q ss_pred -----Eeeeccccc---cccccccccc-e-eeeeeecccccchhHHHHHHHHHHhhhcccccccccc--CceEEEEchhh Q lcl|NC_019921. 206 -----GLNRQVQKG---VSVTEGAYPE-K-EEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK--GNVTMVVNPSD 273 (381) Q Consensus 206 -----Gil~~~~~~---~~~~~~~~~~-~-~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~--~~a~~~mn~~t 273 (381) |.|..+-.. ..-+.++... + ...|+. ..+..|..++..+.. ..+..|+ +..+.+|.+.- T Consensus 157 qDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~------gdy~NLDalV~D~~~---~I~~~~~~d~dLVvivG~dL 227 (336) T protein:vir:37 157 SDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDN------ADYANLDDLAFDLKQ---GLDFRHQNRNDLVFLVGADL 227 (336) T ss_pred cccchhHHHHHHhccchhhcccccccCCceEEecCC------CCcccHHHHHHHHHh---ccchHHhcCCCeEEEEchhh Confidence 444322111 1111111000 1 111211 112223333333221 1233444 36788887755 Q ss_pred HHHHhhhhhccCCCCc-c----------ccccCCCceeEecCCCCCCcEEEEeecceEE-EeecceEEEe--ehhh---- Q lcl|NC_019921. 274 AFEVQAQYTHLNANGV-Y----------VTALPFNLNVIESTVQEAGKVLTYVKGLYDG-YLAGGINVQK--FKET---- 335 (381) Q Consensus 274 ~~~~~~~~~~~~~~G~-~----------~~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i-~~r~~i~i~~--~~~~---- 335 (381) .+.-..+.. +.+|. | +....-|+|.+.-+++|++.+++=-|++.-| ..++..+=.. .++. T Consensus 228 la~~~~~l~--~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie 305 (336) T protein:vir:37 228 VSKETKLIQ--QKHGLTPTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRNDEDKKGLV 305 (336) T ss_pred hhhhhhhhh--hhcCCCHHHHHHHHHHHHHHhhCCceEEEccccCCCceEEeeccccEEEEecCcEEEEEEEcccccccc Confidence 443322222 22221 1 1112348999999999999998877777433 3333333211 1111 Q ss_pred hhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccc Q lcl|NC_019921. 336 LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPA 373 (381) Q Consensus 336 ~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~ 373 (381) -+..-..+|..--+.-+-.++.-.+ ++ +... T Consensus 306 ~y~s~Ne~YvVEd~~~~a~iE~i~v-----~~--~~e~ 336 (336) T protein:vir:37 306 TSYYRQEGYVVEDLGLMTAIDHTKV-----KL--NGEV 336 (336) T ss_pred chhhhcceeeeeccccEEEeeeeee-----ec--cccC Confidence 0111122333322222222221111 11 1111 No 190 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=78.10 E-value=0.12 Score=25.67 Aligned_cols=338 Identities=13% Similarity=0.127 Sum_probs=121.5 Q ss_pred CchhHHH----HHHHHHH------HHHHHHhhhh-------------hHHHHHH----------HHHHHHHHHHHHHHHH Q lcl|NC_019921. 1 MTINLSE----TFANAKN------EFINAVNNGE-------------PQERQNE----------LYGDMINQLFEETKLQ 47 (381) Q Consensus 1 mt~el~~----~~~~~~~------~~~~~~k~~~-------------~~~~~~~----------~~~~~~~~~~~~~~~~ 47 (381) |+|-.++ .++++.+ +..-.+|... +..+... ..++..+.+.+..... T Consensus 1 mriS~~~~~K~~l~EK~~~~a~~~E~~~~LKS~~~G~evknaiedl~K~~EL~~TlS~~~iEI~~~en~LNa~~E~~KGK 80 (400) T protein:vir:93 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGK 80 (400) T ss_pred CcccccccccchHHHHHHHHhhhhhhhhhhhhhhhcchhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhhhhhhhhhhh Confidence 5443221 1111111 1111111100 0000000 0011111111111111 Q ss_pred HHH-HHH---HH---HHhhccccccCHHHHHHHHH-HhhccC--CCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC Q lcl|NC_019921. 48 AKA-EAE---RV---SSLPKSAQSLSANQRSFFMD-INKNVN--YKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL 117 (381) Q Consensus 48 ~~~-~~~---~~---~~~~~~~~~lt~~e~~~~~~-~~~~~~--~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~~~g 117 (381) .++ ++- ++ +......+..+++-+++..+ +.+.+- ++--+.+|+.+...|-..+..+.|++....|...+. T Consensus 81 ~kMt~~i~sq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~ 160 (400) T protein:vir:93 81 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA 160 (400) T ss_pred HHHHHHHhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchh Confidence 111 100 00 00011112233344444432 222222 455678899999999999999999999887777664 Q ss_pred c-eEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHh---hhcCHHHHHHHHHHHHHHHHH-HHHh Q lcl|NC_019921. 118 R-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL---NDFGPAWIERFVRVQIEEAFA-VALE 192 (381) Q Consensus 118 ~-~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~el---l~ds~~~~e~~l~~~la~~~~-~~~~ 192 (381) - ++.-.++.. .|.-... |..+.+....|..-++.+--+|..-.+ -++ +.+|-..+-+|+..+++.+|. +..+ T Consensus 161 ~~V~~s~~s~~-~Aq~Hkd-GqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd 237 (400) T protein:vir:93 161 LLVSRSFDSAN-EAQVHKD-GQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVD 237 (400) T ss_pred hhHHhhhhhhh-hhhhhcc-CCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 2 222222221 2322222 333333445555555554333221111 222 345666678999999999999 8999 Q ss_pred hheeeccCCCcceEeee--ccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEc Q lcl|NC_019921. 193 TAFLKGTGKDQPIGLNR--QVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVN 270 (381) Q Consensus 193 ~a~i~G~G~~~P~Gil~--~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn 270 (381) .|++-|||++...-+=+ .+.... ..+.. ....|...+++ .+.+..+. -++..|.-.+++. T Consensus 238 ~AlV~GDG~N~f~~~DK~advK~I~---~~Ttk-aksagktpfad---aieeavdf-----------vrptagrrylivk 299 (400) T protein:vir:93 238 LALVEGDGTNGFKSIDKEADVKKIK---KITTK-AKSAGKTPFAD---AIEEAVDF-----------VRPTAGRRYLIVK 299 (400) T ss_pred hhhheecCCCCccchhhHHHHHHHH---HHhhh-hhhcCCCchhH---HHHHHHhh-----------hccCCCceEEEEe Confidence 99999999886544311 110000 00000 01112222221 12222222 1234444556665 Q ss_pred hhhHHHHhh-hhh--------ccCCCCccccccCCCce---eEecCCCCCCcEEEEeecceEEEeecceEEEeehhhhhh Q lcl|NC_019921. 271 PSDAFEVQA-QYT--------HLNANGVYVTALPFNLN---VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLAL 338 (381) Q Consensus 271 ~~t~~~~~~-~~~--------~~~~~G~~~~~l~~G~p---Vv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~~ 338 (381) ..+.-.++. ++. ++|.+...... .|.. |+.....-. .-++.|.+ |.|-+. +++ .-+-.-+. T Consensus 300 tedrkalldelrqatanahvriknddaeiase--vgvdeiivytgskalk-ptvlvdqk-yhidmq-dlt--kvdafewk 372 (400) T protein:vir:93 300 TEDRKALLDELRQATANAHVRIKNDDAEIASE--VGVDEIIVYTGSKALK-PTVLVDQK-YHIDMQ-DLT--KVDAFEWK 372 (400) T ss_pred ccchHHHHHHHHhhccccceEeecchhhhhhh--cCcceeeeeecccccc-ceeeeccc-cccchh-hhh--hhhhheec Confidence 443222221 111 11111100000 1110 111000000 00111111 221111 110 00001111 Q ss_pred cCceEEEEEEEEcCEEecCceEEEEEEE Q lcl|NC_019921. 339 DDMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) Q Consensus 339 ~d~~~~r~~~r~dGk~~~~~Afvv~~~~ 366 (381) .+.--+..-...-|-+-.-+|=+|.++. T Consensus 373 tnsnmilvetltsghvetynagavitvs 400 (400) T protein:vir:93 373 TNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred cCCceEEEeecccCcceeeccceeEeeC Confidence 1111111112223333333333344433 No 191 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=71.76 E-value=0.19 Score=24.52 Aligned_cols=286 Identities=12% Similarity=0.011 Sum_probs=126.4 Q ss_pred cccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhh----ceeEecCCc--eEEEEec-CCcceEEeec Q lcl|NC_019921. 63 QSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLAD----LGIKNAGLR--LKFLKSE-TSGVAVWGKI 135 (381) Q Consensus 63 ~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~----~~v~~~~g~--~~~p~~~-~~~~a~wv~e 135 (381) -+. +. +.++.+.+ =...+.++.+.+...+||+.+ .++.+.+|+ +..|..- ...++.|-.- T Consensus 1 mp~-~~----lsel~t~t--------l~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l~y~~~s~~~wy~G 67 (321) T protein:vir:34 1 MPF-PN----ISDIITTT--------IESRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEELSFSGNSNGGWYSG 67 (321) T ss_pred CCC-ch----HHHHHHHH--------HHhhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEEeeccCcceeEEEe Confidence 000 00 11111111 012223344455555666544 334555654 5556543 3677888665 Q ss_pred ccccccccCcceeeEeecceeEEEeeeccH-HhhhcCH----HHHHHHHHHHHHHHHHHHHhhheee-ccC--CCcceEe Q lcl|NC_019921. 136 YGEIKGQLDAAFSEETAIQNKLTAFVVLPK-DLNDFGP----AWIERFVRVQIEEAFAVALETAFLK-GTG--KDQPIGL 207 (381) Q Consensus 136 ~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~-ell~ds~----~~~e~~l~~~la~~~~~~~~~a~i~-G~G--~~~P~Gi 207 (381) .+.......-.|.+-++..+.++..+.||- |+|..+. +|+...=.+...+.|...++..+.. |+| ..+..|+ T Consensus 68 yd~l~~~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~sdGTa~g~~~i~GL 147 (321) T protein:vir:34 68 YDVLPTAPQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYGDGTAFGGRAINGL 147 (321) T ss_pred eeeeccchhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhccccccccchhhhh Confidence 455544445578999999999998888874 4555443 3333333344456677777777765 665 4466666 Q ss_pred ee--ccccccccccccccc-----eeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhh Q lcl|NC_019921. 208 NR--QVQKGVSVTEGAYPE-----KEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ 280 (381) Q Consensus 208 l~--~~~~~~~~~~~~~~~-----~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~ 280 (381) =. .+..++.+.+|..-+ ...+..........++..++..++.- .. +.-..--.|++...-+..+... T Consensus 148 ~~lv~~~p~tGtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~--~~----Rg~~~PDlii~~~~~y~~y~~s 221 (321) T protein:vir:34 148 DGAVPVDPTVGTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSR--CV----RGADMPDLIMSGNDAWTTYSNS 221 (321) T ss_pred hhhcccCCCCceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHh--hc----cCCCCccEEEechHHHHHHHHh Confidence 22 222222223332111 11111111111122222233222111 01 1111112355554332222221 Q ss_pred hh----ccCCC---CccccccC-CCceeEecC----CCCCCcEEEEeecceEEEeecceEEEeehhhhh-hcCceEEEEE Q lcl|NC_019921. 281 YT----HLNAN---GVYVTALP-FNLNVIEST----VQEAGKVLTYVKGLYDGYLAGGINVQKFKETLA-LDDMDLYTAK 347 (381) Q Consensus 281 ~~----~~~~~---G~~~~~l~-~G~pVv~s~----~~p~~~i~fgd~~~y~i~~r~~i~i~~~~~~~~-~~d~~~~r~~ 347 (381) .. ..+.. ..+ ..|- .|..|+.+. .||++..+|=+-++..++...+-.+.......+ --+|.+..-. T Consensus 222 ~q~~qR~~~~~~a~~Gf-~~Lky~~~div~D~~~g~~~pan~~yfiNT~yl~~r~h~~~~~~pi~p~r~~~~NqdA~~q~ 300 (321) T protein:vir:34 222 LQVLQRFTSAEEANLGF-RSLKFLSTDVVLDGGIGGFAGANTMYFLNTKYLHFRPHKDRNMVPLSPSRRAAFNQDAEAQI 300 (321) T ss_pred hheeeeecccccccccc-eeeeeeeEEEEEeCCCCCCccccceeeeecceEEEEEcCCCceeecCcccccccchhHHhhh Confidence 11 11111 122 1121 366778776 689998888887775555443333332222222 1233333333 Q ss_pred EEEcCEEecCceEEEEEEEecCC Q lcl|NC_019921. 348 QFAYGKAKDNKVAAVWKLDLKGH 370 (381) Q Consensus 348 ~r~dGk~~~~~Afvv~~~~~~~~ 370 (381) .-.-|.++-.++..-.-|. +. T Consensus 301 I~~~GnL~~sn~~~~~vL~--~~ 321 (321) T protein:vir:34 301 LAWAGNLTCSGAQFQGRLI--AE 321 (321) T ss_pred hhhhheeeeecccceeEEe--eC Confidence 3444555555553322222 22 No 192 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=70.58 E-value=0.21 Score=24.33 Aligned_cols=345 Identities=10% Similarity=0.070 Sum_probs=125.4 Q ss_pred CchhHHHHHHHHHHHHHHHH--hhhhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHH---HH Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAV--NNGEPQERQ--NELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRS---FF 73 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~--k~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~---~~ 73 (381) |+|+-++++.++=+-+++.- .+....++. .+.++...+.+++.. ++++.......+.-|++.+.. .. T Consensus 1 ~~~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~a~~~enq~~~~~~~~------~~~~~~~~~~~~~~l~e~~~~~~~~~ 74 (521) T protein:vir:10 1 MTIKTKAELLNKWKPLLEGEGLPEIANSKQAIIAKIFENQEKDFQTAP------EYKDEKIAQAFGSFLTEAEIGGDHGY 74 (521) T ss_pred CCcchhHHHHHhhhhhhccCCCCccccchhhhhhhhhhhhhhhhhhcc------ccchhHHHHHHhhhhhhhcccCcccc Confidence 99999999988888777651 111111100 111222211221111 111111111111111110000 00 Q ss_pred --HHHhhccCCCCceeccHHHHHHHHHHHHhh---hhhhhhceeEecCCce------EEEEecCC--------------c Q lcl|NC_019921. 74 --MDINKNVNYKEEKLLPEETIDRIFEDLTTN---HPLLADLGIKNAGLRL------KFLKSETS--------------G 128 (381) Q Consensus 74 --~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~---~~l~~~~~v~~~~g~~------~~p~~~~~--------------~ 128 (381) ..+.+++.+ |... .+.-.++...|+. =.-.+++.|+|++|.. +--..... + T Consensus 75 ~~~~i~es~~t-~~v~---~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~a 150 (521) T protein:vir:10 75 NATNIAAGQTS-GAVT---QIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGP 150 (521) T ss_pred ccccccccccc-cccc---cCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccccccccchhccc Confidence 001111111 1110 1111122222211 1123556666665421 11000000 0 Q ss_pred ceEEeeccc----------------------------------------------------------------------- Q lcl|NC_019921. 129 VAVWGKIYG----------------------------------------------------------------------- 137 (381) Q Consensus 129 ~a~wv~e~~----------------------------------------------------------------------- 137 (381) .+.|.+..+ T Consensus 151 da~fSG~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsT 230 (521) T protein:vir:10 151 DAMFSGQGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMAT 230 (521) T ss_pred cccccccccccccccccccccccccccccccccccccceecccccccCCCcccccccccccccccccccceeecccccch Confidence 000100000 Q ss_pred ---c----cccccCcceeeEeecceeEEEe-------eeccHHhhhc----CHHHHHHHHHHHHHHHHHHHHhhheeecc Q lcl|NC_019921. 138 ---E----IKGQLDAAFSEETAIQNKLTAF-------VVLPKDLNDF----GPAWIERFVRVQIEEAFAVALETAFLKGT 199 (381) Q Consensus 138 ---~----~~~~~~~~f~~v~l~~~kl~~~-------~~iS~ell~d----s~~~~e~~l~~~la~~~~~~~~~a~i~G~ 199 (381) + .-..+...|.++.|...|.+.- ...|-||.+| -..|.|+.|.+-|+..|..-++..||.=- T Consensus 231 a~aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i 310 (521) T protein:vir:10 231 SIAELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWI 310 (521) T ss_pred hhHhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhh Confidence 0 0011223466777766666533 3467777766 46789999999999999999999988310 Q ss_pred CCC---cceEeeeccccccccccccccceeeeeeeccccc------chhHHHHHHHHHHhhhccccccccc-cCceEE-E Q lcl|NC_019921. 200 GKD---QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANP------RATVNELTQVFKYHSTNEKGKSVAV-KGNVTM-V 268 (381) Q Consensus 200 G~~---~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~------~~~~~~l~~l~~~l~~~~~~~~~~~-~~~a~~-~ 268 (381) =.. +-.|+-+.. ....|.+.+.++ .-....+..|+..+-...+...+.. |+.+.| + T Consensus 311 ~~sa~~~~~g~t~~~-------------~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i 377 (521) T protein:vir:10 311 NYSAQVGKSGMTLTP-------------GSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFII 377 (521) T ss_pred hheeeeeeeeeeecc-------------CccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEE Confidence 000 011221100 001112211111 1122222223333222222222222 244544 4 Q ss_pred EchhhHHHHhhhh---h-----------ccCCCCc-cccccCCCceeEecCCCCCCcEEEEeecceEEEeecceE----- Q lcl|NC_019921. 269 VNPSDAFEVQAQY---T-----------HLNANGV-YVTALPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGIN----- 328 (381) Q Consensus 269 mn~~t~~~~~~~~---~-----------~~~~~G~-~~~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~----- 328 (381) +++.-. .++... + ..+..+. +...+.-+++|+++.++|.+-++.|.--..-+ ..|+- T Consensus 378 ~S~~Va-~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~--~~glfyaPYv 454 (521) T protein:vir:10 378 ASRNVV-NVLASVDTGISYAAQGLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEM--DAGIYYAPYV 454 (521) T ss_pred EchHHH-HHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCccc--ccceeecccc Confidence 555433 332210 0 0111111 12234447889999988876666553211000 01111 Q ss_pred ----EEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccc---cccCcccC Q lcl|NC_019921. 329 ----VQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPA---LEGTEETL 381 (381) Q Consensus 329 ----i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~---~~~~~~~~ 381 (381) +...|...|. -.++|+.+ + |-.++| |+. .+...++ ..+.|+-+ T Consensus 455 ~l~~~~~~dp~sfq-P~~g~~tR--Y-~l~~NP--~~~----~~~~~~~~~i~~~~~~~~ 504 (521) T protein:vir:10 455 ALTPLRGSDPKNFQ-PVMGFKTR--Y-GIGINP--FAE----SAAQAPASRIQSGMPSIL 504 (521) T ss_pred ccccccccCCcccc-ceeeeeee--e-ceeecC--ccc----ccCCccceeecccchhhh Confidence 1122222221 12233222 2 333444 221 1122222 33344444 No 193 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=60.60 E-value=0.37 Score=22.96 Aligned_cols=286 Identities=6% Similarity=-0.075 Sum_probs=111.7 Q ss_pred HhhccCCCCceeccH--HHHHHHHHHHHhhhhhhhhceeE---------ecCCc-eEEEEecC-Ccc---eEEeecc-cc Q lcl|NC_019921. 76 INKNVNYKEEKLLPE--ETIDRIFEDLTTNHPLLADLGIK---------NAGLR-LKFLKSET-SGV---AVWGKIY-GE 138 (381) Q Consensus 76 ~~~~~~~~gg~lvP~--~~~~~I~~~l~~~~~l~~~~~v~---------~~~g~-~~~p~~~~-~~~---a~wv~e~-~~ 138 (381) |.... =....||+ .|..-+.+.-.+.+.|++-.-+. ..+|+ +.+|.-.. .+. -.|..-. +. T Consensus 1 Ma~T~--l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~ 78 (349) T protein:vir:94 1 MAITT--IGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CCceE--EeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 11111 12245676 35555555555555555422121 12344 67886433 122 1232210 11 Q ss_pred cccccCcceeeEeecceeEE--EeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccc Q lcl|NC_019921. 139 IKGQLDAAFSEETAIQNKLT--AFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVS 216 (381) Q Consensus 139 ~~~~~~~~f~~v~l~~~kl~--~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~ 216 (381) .+...--++.++-...+.-. ..-.++..|- .-|..+-|.+++++-..+.....+|. -.+|++..-..+.. T Consensus 79 ~t~~kit~~~~~a~~~~r~kaw~~~Dla~~ls---G~dpm~~Ia~~va~yW~r~~q~~Lia-----~L~Gvf~~~~~~~~ 150 (349) T protein:vir:94 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELT---SQNPLQSVASRLDNFWQRQAQRRLIA-----TALGLYNDNVSATD 150 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhh---CchHHHHHHHHHHHHHhhHHHHHHHH-----HHHhhhcccccccc Confidence 21111122333333333322 2334555553 33667777777777666665544432 11234422111111 Q ss_pred cccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhh---ccCCCCccccc Q lcl|NC_019921. 217 VTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HLNANGVYVTA 293 (381) Q Consensus 217 ~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~---~~~~~G~~~~~ 293 (381) .......-...+......++....+....+.... ... .-..=..++||+..+..++++.. +++++|..--. T Consensus 151 ~~~~~~~~~~d~~~~a~~~~~~~~~A~~~~Gdaa-~Gd-----~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~~~~i~ 224 (349) T protein:vir:94 151 AYHEQNDMVVDVSATSGFDAGAFIDATQTMGDAL-MGN-----GGEVLGAIAMHSFVYAQARKAQLIDFIRDAENNTMFA 224 (349) T ss_pred cccccCceeEEecccCCCChhhHHHHHHHHHHHh-ccc-----cccceeEEEEchHHHHHHHhcchhhhccCcccCcccc Confidence 0000000000111111122222222222111110 000 00111358999999998876433 45565543222 Q ss_pred cCCCceeEecCCCCCC------c---EEEEeecceEEEeec-ceEEEeehhhhh--hcCceEEEEEEEEcCEEecCceEE Q lcl|NC_019921. 294 LPFNLNVIESTVQEAG------K---VLTYVKGLYDGYLAG-GINVQKFKETLA--LDDMDLYTAKQFAYGKAKDNKVAA 361 (381) Q Consensus 294 l~~G~pVv~s~~~p~~------~---i~fgd~~~y~i~~r~-~i~i~~~~~~~~--~~d~~~~r~~~r~dGk~~~~~Afv 361 (381) .-.|++|++++.||-. + .+||.-. +.+.+.+ .+.++..++... ..++..+..+.|+ .+.|..+. T Consensus 225 ty~G~~VivDD~~Pv~~~g~~~~yttylfg~GA-i~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~~---~~hp~G~s 300 (349) T protein:vir:94 225 TYQGYRVIVDDSMTVVGQDTSRKFISIIFGQGA-IGYGEGNPEMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYS 300 (349) T ss_pred eecCcEEEEeCCCccccCCCCceEEEEEeecce-EEeecCCCCcceeeecccccCCcceeEEEEEeeEE---Eeeeeeee Confidence 2259999999999931 1 2444211 2222222 123444443332 2345555555443 34444444 Q ss_pred EEEEEecC-Ccc--ccccCcccC Q lcl|NC_019921. 362 VWKLDLKG-HKP--ALEGTEETL 381 (381) Q Consensus 362 v~~~~~~~-~~~--~~~~~~~~~ 381 (381) .-.-.++. +.. +.+.|..-| T Consensus 301 ~~~a~v~~~~~~~~~~sPt~aeL 323 (349) T protein:vir:94 301 FTSAVITGNGTETIARSASWQDL 323 (349) T ss_pred ecccccCCCccccccCCCChHHh Confidence 32211111 111 112222233 No 194 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=57.84 E-value=0.43 Score=22.62 Aligned_cols=272 Identities=11% Similarity=0.041 Sum_probs=122.1 Q ss_pred CCCCceeccH--HHHHHHHHHHHhhhhhhhhceeEecC---C-ceEEEEecCCcceE--EeecccccccccCcceeeEee Q lcl|NC_019921. 81 NYKEEKLLPE--ETIDRIFEDLTTNHPLLADLGIKNAG---L-RLKFLKSETSGVAV--WGKIYGEIKGQLDAAFSEETA 152 (381) Q Consensus 81 ~~~gg~lvP~--~~~~~I~~~l~~~~~l~~~~~v~~~~---g-~~~~p~~~~~~~a~--wv~e~~~~~~~~~~~f~~v~l 152 (381) -+...+++.+ -+..+|.+.....-..+.++.+.+.. . .......+..+.+. |.+..+..-+..+..+++-.. T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 1122233321 22233333211111122232222211 1 13334445555666 877654433445667777777 Q ss_pred cceeEEEeeeccHHhhhcC---HHHHHHHHHHHHHHHHHHHHhhheeeccCC-CcceEeeeccccccccccccccceeee Q lcl|NC_019921. 153 IQNKLTAFVVLPKDLNDFG---PAWIERFVRVQIEEAFAVALETAFLKGTGK-DQPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) Q Consensus 153 ~~~kl~~~~~iS~ell~ds---~~~~e~~l~~~la~~~~~~~~~a~i~G~G~-~~P~Gil~~~~~~~~~~~~~~~~~~~~ 228 (381) ..+..+.-+..|.+=|.-+ ..++.+-=.....+++...++.-.+.|+=. ..-.|+|+.+........+... . T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a----~ 156 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQ----N 156 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCcc----C Confidence 7777777677665544433 345666666777788888899999999743 3478999877654322211110 0 Q ss_pred eeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCcccc----c-cCC--Cce-- Q lcl|NC_019921. 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVT----A-LPF--NLN-- 299 (381) Q Consensus 229 ~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~----~-l~~--G~p-- 299 (381) .....++++-.++++..++..+.....+ ....+ .++|.+.-+..+.. +..+..|.-++ . .++ |.| T Consensus 157 ~~w~~~T~~eI~~di~~~~~~i~~~s~~---~~~p~-tl~Lpp~~~~~l~~--~~~~~~~~Tvl~~l~~n~~~~~g~~l~ 230 (304) T protein:vir:52 157 TKVQAMDFDKAVAFFKEIFLKGMEKTKR---IEAPN-TFAIDSLDLAHLAL--VQRANTDTTALEFLTKHLSAAAGRQVA 230 (304) T ss_pred CccccCCHHHHHHHHHHHHHHHHhccCc---eecCc-eEEeCHHHHHHHhh--ccCCCCCchHHHHHHHhcccccCCcce Confidence 1112223334444444444444322221 12222 46666665544421 11222221111 0 111 222 Q ss_pred eE--ecCCCCC---C--cEEEEeecc-eEEEeecceEEEeehhhhhhcCceEE-E-EEEEEcC-EEecCceEEEEEE Q lcl|NC_019921. 300 VI--ESTVQEA---G--KVLTYVKGL-YDGYLAGGINVQKFKETLALDDMDLY-T-AKQFAYG-KAKDNKVAAVWKL 365 (381) Q Consensus 300 Vv--~s~~~p~---~--~i~fgd~~~-y~i~~r~~i~i~~~~~~~~~~d~~~~-r-~~~r~dG-k~~~~~Afvv~~~ 365 (381) |. .+..... | .+++.+.+. |+ ...-.+.+.++.. .. .+...| . +..|++| .++.|+|++.++- T Consensus 231 I~~v~~~~~~~g~~g~~r~vvY~~d~~~~-~~~vP~p~~~l~~-q~-~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 231 IKALPSNYGTRVTDGKTRAMVYVNSKEHV-IFDVPMSPTVLDA-QP-KGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred EEEecccccccCCCCceEEEEEecChhhe-EEecCccccccch-hh-cCCceEEecceeeeeeEEEEccceeeeecC Confidence 22 1222211 1 144444433 32 2222333333221 11 133233 2 4666666 4667899998775 No 195 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=57.49 E-value=0.43 Score=22.58 Aligned_cols=286 Identities=10% Similarity=0.023 Sum_probs=110.3 Q ss_pred HHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhh--hce--eEecCC-ceEEEEecC Q lcl|NC_019921. 52 AERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLA--DLG--IKNAGL-RLKFLKSET 126 (381) Q Consensus 52 ~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~--~~~--v~~~~g-~~~~p~~~~ 126 (381) ....+....|.-.|.- +.|- .+... .+-..+-+-++..+ +.+.....+-+ .++ ..-.+| .++||+-+. T Consensus 1 ~~~~~~~~~~~~~~~~--~~~~---~~~~~-~nt~~l~~k~~~~L-D~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~ 73 (319) T protein:vir:94 1 MNKTIKNATGMLKLNL--QHFA---NKSVE-PGQTLLKNKHVGIL-ERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDT 73 (319) T ss_pred CCcccccccceeEeeh--hhhh---ccCCC-cchHHHHHHHHHHH-HHHHHHhhhhhhcccCcceEeccCcEEEEeeecc Confidence 0000001111111111 0110 11111 11122223333333 33322222111 111 233444 488998766 Q ss_pred CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHH-HHH-HHHHHHHHHHHHhhhee----eccC Q lcl|NC_019921. 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIE-RFV-RVQIEEAFAVALETAFL----KGTG 200 (381) Q Consensus 127 ~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e-~~l-~~~la~~~~~~~~~a~i----~G~G 200 (381) .+-..+-.-++-..+..+.+...++|...+...+. |..-=.+.+..++. +++ .+...+.++-.+|.-.+ .+.| T Consensus 74 ~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~-VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~ 152 (319) T protein:vir:94 74 TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRF-VDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA 152 (319) T ss_pred cccccccCCCCcccCCcccceeEEEeecccccccc-cchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcc Confidence 44333321111112233445566666666655442 11111223333331 122 22233333333333221 1111 Q ss_pred CCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhh Q lcl|NC_019921. 201 KDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ 280 (381) Q Consensus 201 ~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~ 280 (381) +. .+ ....+...++.+.++...+.. ...+ .+.+.+|+|..+..+... T Consensus 153 ~~--------------~~-------------~~~t~~n~y~~i~~a~~~Lde----~~VP--~~Rvl~Vtp~~~~~L~~~ 199 (319) T protein:vir:94 153 KH--------------LT-------------VGTGSDAQYDAVLDVSVELDE----IKAP--ENRVLFVSPTFYKGIKKF 199 (319) T ss_pred cc--------------cc-------------cccCHHHHHHHHHHHHHHHHh----cCCC--CCcEEEeCHHHHHHHHhh Confidence 10 00 011223445555555544422 1222 244567898887666431 Q ss_pred h-hccCC----CCcccc--ccCCCceeEec--CCCCCCcEEEEeecceEEEe-ecceEEEeehhhhhhcCceEEEEEEEE Q lcl|NC_019921. 281 Y-THLNA----NGVYVT--ALPFNLNVIES--TVQEAGKVLTYVKGLYDGYL-AGGINVQKFKETLALDDMDLYTAKQFA 350 (381) Q Consensus 281 ~-~~~~~----~G~~~~--~l~~G~pVv~s--~~~p~~~i~fgd~~~y~i~~-r~~i~i~~~~~~~~~~d~~~~r~~~r~ 350 (381) . ..++. .+..-. +...|.+|+.. ..+.+-.+++|..+...-.. -..+++....+-. +...|++..+. T Consensus 200 ~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~~~~~~~p~~~~---~a~~v~gr~y~ 276 (319) T protein:vir:94 200 VIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGM---FGTLAEQLLYT 276 (319) T ss_pred hhhhccccccccceeeeeceeecCeEEEEecccccccceEEEEcCCeeeeeeeeeeeeccCCCccc---cceeeeeeeee Confidence 1 11111 111000 12258888764 33444445666544322111 1123332211222 23689999999 Q ss_pred cCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 351 YGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 351 dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) |.+++++++..++...-+++.....+...+- T Consensus 277 d~~V~~~k~~~Iy~~~~~~~~~~~~~~~~~~ 307 (319) T protein:vir:94 277 GAFVPEHLQKYIFTIGGTEVATKRDGVDAHA 307 (319) T ss_pred eeEEeccccceEEEeecCCcccCCCcccccc Confidence 9999999988776544433332222211111 No 196 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=57.49 E-value=0.43 Score=22.58 Aligned_cols=286 Identities=10% Similarity=0.023 Sum_probs=110.3 Q ss_pred HHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhh--hce--eEecCC-ceEEEEecC Q lcl|NC_019921. 52 AERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLA--DLG--IKNAGL-RLKFLKSET 126 (381) Q Consensus 52 ~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~--~~~--v~~~~g-~~~~p~~~~ 126 (381) ....+....|.-.|.- +.|- .+... .+-..+-+-++..+ +.+.....+-+ .++ ..-.+| .++||+-+. T Consensus 1 ~~~~~~~~~~~~~~~~--~~~~---~~~~~-~nt~~l~~k~~~~L-D~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~ 73 (319) T protein:vir:97 1 MNKTIKNATGMLKLNL--QHFA---NKSVE-PGQTLLKNKHVGIL-ERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDT 73 (319) T ss_pred CCcccccccceeEeeh--hhhh---ccCCC-cchHHHHHHHHHHH-HHHHHHhhhhhhcccCcceEeccCcEEEEeeecc Confidence 0000001111111111 0110 11111 11122223333333 33322222111 111 233444 488998766 Q ss_pred CcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHH-HHH-HHHHHHHHHHHHhhhee----eccC Q lcl|NC_019921. 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIE-RFV-RVQIEEAFAVALETAFL----KGTG 200 (381) Q Consensus 127 ~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e-~~l-~~~la~~~~~~~~~a~i----~G~G 200 (381) .+-..+-.-++-..+..+.+...++|...+...+. |..-=.+.+..++. +++ .+...+.++-.+|.-.+ .+.| T Consensus 74 ~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~-VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~ 152 (319) T protein:vir:97 74 TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRF-VDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA 152 (319) T ss_pred cccccccCCCCcccCCcccceeEEEeecccccccc-cchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcc Confidence 44333321111112233445566666666655442 11111223333331 122 22233333333333221 1111 Q ss_pred CCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhh Q lcl|NC_019921. 201 KDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ 280 (381) Q Consensus 201 ~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~ 280 (381) +. .+ ....+...++.+.++...+.. ...+ .+.+.+|+|..+..+... T Consensus 153 ~~--------------~~-------------~~~t~~n~y~~i~~a~~~Lde----~~VP--~~Rvl~Vtp~~~~~L~~~ 199 (319) T protein:vir:97 153 KH--------------LT-------------VGTGSDAQYDAVLDVSVELDE----IKAP--ENRVLFVSPTFYKGIKKF 199 (319) T ss_pred cc--------------cc-------------cccCHHHHHHHHHHHHHHHHh----cCCC--CCcEEEeCHHHHHHHHhh Confidence 10 00 011223445555555544422 1222 244567898887666431 Q ss_pred h-hccCC----CCcccc--ccCCCceeEec--CCCCCCcEEEEeecceEEEe-ecceEEEeehhhhhhcCceEEEEEEEE Q lcl|NC_019921. 281 Y-THLNA----NGVYVT--ALPFNLNVIES--TVQEAGKVLTYVKGLYDGYL-AGGINVQKFKETLALDDMDLYTAKQFA 350 (381) Q Consensus 281 ~-~~~~~----~G~~~~--~l~~G~pVv~s--~~~p~~~i~fgd~~~y~i~~-r~~i~i~~~~~~~~~~d~~~~r~~~r~ 350 (381) . ..++. .+..-. +...|.+|+.. ..+.+-.+++|..+...-.. -..+++....+-. +...|++..+. T Consensus 200 ~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~~~~~~~p~~~~---~a~~v~gr~y~ 276 (319) T protein:vir:97 200 VIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGM---FGTLAEQLLYT 276 (319) T ss_pred hhhhccccccccceeeeeceeecCeEEEEecccccccceEEEEcCCeeeeeeeeeeeeccCCCccc---cceeeeeeeee Confidence 1 11111 111000 12258888764 33444445666544322111 1123332211222 23689999999 Q ss_pred cCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 351 YGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 351 dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) |.+++++++..++...-+++.....+...+- T Consensus 277 d~~V~~~k~~~Iy~~~~~~~~~~~~~~~~~~ 307 (319) T protein:vir:97 277 GAFVPEHLQKYIFTIGGTEVATKRDGVDAHA 307 (319) T ss_pred eeEEeccccceEEEeecCCcccCCCcccccc Confidence 9999999988776544433332222211111 No 197 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=53.10 E-value=0.54 Score=22.06 Aligned_cols=286 Identities=6% Similarity=-0.074 Sum_probs=110.8 Q ss_pred HhhccCCCCceeccH--HHHHHHHHHHHhhhhhhhhceeE---------ecCCc-eEEEEecCC-c--ce-EEeec-ccc Q lcl|NC_019921. 76 INKNVNYKEEKLLPE--ETIDRIFEDLTTNHPLLADLGIK---------NAGLR-LKFLKSETS-G--VA-VWGKI-YGE 138 (381) Q Consensus 76 ~~~~~~~~gg~lvP~--~~~~~I~~~l~~~~~l~~~~~v~---------~~~g~-~~~p~~~~~-~--~a-~wv~e-~~~ 138 (381) |.... =....||+ .|..-+.+...+.+.|++-.-+. ..+|. +.+|.-..- + .. .|..- .+. T Consensus 1 Ma~T~--l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~ 78 (349) T protein:vir:78 1 MAITT--IGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CCceE--EeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 11111 12246676 35555555555555555421111 22343 678865432 2 11 23221 111 Q ss_pred cccccCcceeeEeecceeEEE--eeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccc Q lcl|NC_019921. 139 IKGQLDAAFSEETAIQNKLTA--FVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVS 216 (381) Q Consensus 139 ~~~~~~~~f~~v~l~~~kl~~--~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~ 216 (381) ..+..--++.++-...+.-.+ .-.++..|- .-|..+-|.+++++-..+.....+|. ..+|++..-..+.. T Consensus 79 ~t~~kitt~~~~a~~~~r~kaw~~~Dla~~ls---G~dpm~~Ia~~va~yW~r~~q~~Lia-----~L~Gvf~~~~~a~~ 150 (349) T protein:vir:78 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELT---SQNPLQSVASRLDNFWQRQAQRRLIA-----TALGLYNDNVSATD 150 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhh---CchHHHHHHHHHHHHHhhHHHHHHHH-----HHHHhhcccccccc Confidence 122222233344333333333 233455553 33667777777776655554443332 01123321111000 Q ss_pred cccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhh---ccCCCCccccc Q lcl|NC_019921. 217 VTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---HLNANGVYVTA 293 (381) Q Consensus 217 ~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~---~~~~~G~~~~~ 293 (381) ..... .+.+... ...+....+.+.+....+-....+ .....=..++||+..+..++.+.. +++++|..--. T Consensus 151 ~~~~~-~~~t~d~---s~~a~~~~~~~~dA~~~lgda~~G--d~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~~~~i~ 224 (349) T protein:vir:78 151 AYHEQ-NDMVVDV---SATLGFDAGAFIDATQTMGDALMG--NGGEVLGAIAMHSFVYAQARKAQLIDFIRDAENNTMFA 224 (349) T ss_pred hhhhc-ccceeee---ccccCCChhhhhhhHHHHHHHhcc--ccccceeEEEEchHHHHHHHhhhhhhhccCcccCcccc Confidence 00000 0000000 011111222222222111000000 000111358999999998876433 34555543222 Q ss_pred cCCCceeEecCCCCCC------c---EEEEeecceEEEeecc-eEEEeehhhhh--hcCceEEEEEEEEcCEEecCceEE Q lcl|NC_019921. 294 LPFNLNVIESTVQEAG------K---VLTYVKGLYDGYLAGG-INVQKFKETLA--LDDMDLYTAKQFAYGKAKDNKVAA 361 (381) Q Consensus 294 l~~G~pVv~s~~~p~~------~---i~fgd~~~y~i~~r~~-i~i~~~~~~~~--~~d~~~~r~~~r~dGk~~~~~Afv 361 (381) .-.|++|++++.||-. + .+||.-. +.+.+.+. +.++..++... ..++..+..+.|+ .+.|..+. T Consensus 225 ty~G~~VivDD~~Pv~~~g~~~~yttylfg~GA-i~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~---~~hp~G~s 300 (349) T protein:vir:78 225 TYQGYRVIVDDSMTVVGQGAQRKFISIIFGQGA-IGYGEGNPVMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYR 300 (349) T ss_pred eecCeEEEEeCCCccccCCCCceEEEEEeecce-EEEccCCCccceeeecccccCCcceeEEEEEeeEE---Eeeeeeee Confidence 2259999999999942 1 2444221 11222221 22444343332 2345555555554 34444443 Q ss_pred EEEEEecCC-ccccccCc--ccC Q lcl|NC_019921. 362 VWKLDLKGH-KPALEGTE--ETL 381 (381) Q Consensus 362 v~~~~~~~~-~~~~~~~~--~~~ 381 (381) .-.-.++.+ ....+..| .-| T Consensus 301 ~~~a~v~~~~~~~~~~sPt~aeL 323 (349) T protein:vir:78 301 FTSAVITGNGTETIARSASWQDL 323 (349) T ss_pred eccccccCCccccccCCCChHHh Confidence 322112111 11111222 223 No 198 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=47.77 E-value=0.69 Score=21.46 Aligned_cols=298 Identities=15% Similarity=0.172 Sum_probs=115.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHHHHHH-HhhccC--CCCceeccHHHHHHHHHHHHhhhhhhhh Q lcl|NC_019921. 33 YGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMD-INKNVN--YKEEKLLPEETIDRIFEDLTTNHPLLAD 109 (381) Q Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~~~~~-~~~~~~--~~gg~lvP~~~~~~I~~~l~~~~~l~~~ 109 (381) +...++ ..++..++-+..+. +..+++-+++.++ +.+.+- ++--+.+|+.+...|-..+.++.|++.. T Consensus 1 mtn~ie------sq~A~~eF~~vL~~----N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~v 70 (318) T protein:vir:86 1 MTNFIE------SQNAVTEFFDVLKK----NSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKV 70 (318) T ss_pred Ccchhh------hhHHHHHHHHHHhc----cCCchhhhhhhhhhhhhcCceeeccchhccHHHHHHHHHhhhccCcceee Confidence 111100 01111122221111 1122333444432 222222 4555789999999999999999999998 Q ss_pred ceeEecCCc-eEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHh---hhcCHHHHHHHHHHHHHH Q lcl|NC_019921. 110 LGIKNAGLR-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL---NDFGPAWIERFVRVQIEE 185 (381) Q Consensus 110 ~~v~~~~g~-~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~el---l~ds~~~~e~~l~~~la~ 185 (381) ..|...+.- ++.-.++. ..|.-.. .+..+.+....|..-++++--+|..-.+ -++ +.+|-..+-+|+..+++. T Consensus 71 fHVT~~~~~~V~~s~~s~-AeAq~Hk-dGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ 147 (318) T protein:vir:86 71 FHVTNVGALLVSRSFDSS-AEAQVHK-DGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQ 147 (318) T ss_pred eeeccchhhhhhhhhhhh-hhhhhhc-cCCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHH Confidence 777776642 22222222 3333222 2333333445555555554333221111 222 345666678999999999 Q ss_pred HHH-HHHhhheeeccCCCcceEeee--ccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhcccccccccc Q lcl|NC_019921. 186 AFA-VALETAFLKGTGKDQPIGLNR--QVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVK 262 (381) Q Consensus 186 ~~~-~~~~~a~i~G~G~~~P~Gil~--~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~ 262 (381) .|. +..+.|++-|||++...-+=+ .+.... ..+ ......|+..++. .+.+..+.+ ++.. T Consensus 148 ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~---k~T-tkaksagttpfan---aieeavdfv-----------rpta 209 (318) T protein:vir:86 148 AIVNKIVDLALVEGDGSNGFKSIDKEADVKKIK---KIT-TKAKSAGTTPFAN---AIEEAVDFV-----------RPTA 209 (318) T ss_pred HHHHHHHHhhheeecCCCCccchhhHHHHHHHH---HHh-hhhhccCCCchhh---HHHHHHhhh-----------ccCC Confidence 999 899999999999876544311 110000 000 0001112222221 122222221 2334 Q ss_pred CceEEEEchhhHHHHhhhhhccCCCCcc-c------cccCCCce---eEecCCCCCCcEEEEeecceEEEeecceEEEee Q lcl|NC_019921. 263 GNVTMVVNPSDAFEVQAQYTHLNANGVY-V------TALPFNLN---VIESTVQEAGKVLTYVKGLYDGYLAGGINVQKF 332 (381) Q Consensus 263 ~~a~~~mn~~t~~~~~~~~~~~~~~G~~-~------~~l~~G~p---Vv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~~ 332 (381) |.-.+++...+.-.++...-...+|... + -..-.|.. |+.....-. .-++.|.+ |.|-+. +++ .- T Consensus 210 grrylivkaedrkalldelrqatanahvriknddteiasevgvdeiivytgskalk-ptvlvdqk-yhidmq-dlt--kv 284 (318) T protein:vir:86 210 GRRYLIVKAEDRKALLDELRQATANAHVRIKNDDTEIASEVGVDEIIVYTGSKALK-PTVLVDQK-YHIDMQ-DLT--KV 284 (318) T ss_pred CceEEEEeecchHHHHHHHHhhcccceeEEeccchhhhhhcCcceeeeeecccccc-ceeeeccc-eecchh-hhh--hh Confidence 4445565544322222110001111100 0 00001110 111000000 00111111 222111 110 00 Q ss_pred hhhhhhcCceEEEEEEEEcCEEecCceEEEEEEE Q lcl|NC_019921. 333 KETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) Q Consensus 333 ~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~ 366 (381) +-.-+..+.--+..-...-|-+-.-+|=+|.++. T Consensus 285 dafewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:86 285 DAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred hcceeccCCceEEEeecccCcceeecCceeEEeC Confidence 0000111111111111222333333333343433 No 199 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=45.11 E-value=0.78 Score=21.17 Aligned_cols=266 Identities=9% Similarity=-0.046 Sum_probs=118.2 Q ss_pred ccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe------cCCceEEEEecCCcceEEeecccccccccCcceee--E Q lcl|NC_019921. 79 NVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN------AGLRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSE--E 150 (381) Q Consensus 79 ~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~------~~g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~--v 150 (381) -..-++-++-|+-+..++++.+++..++.++|+.-. .+..++||+....... .+..+..+ +..=.. + T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~v~----dg~~~~~~-~~te~~v~l 75 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVKSA----SGRTLVKQ-PMVDQTIPF 75 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCceeec----ccCCcccc-ccccceEEE Confidence 122233456699999999999999999888776521 2235778763322211 11111111 122233 4 Q ss_pred eecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeee Q lcl|NC_019921. 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) Q Consensus 151 ~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 230 (381) +|.-+|.. -+.++.+=+..+..++..-+.+....+++..+|..+.. +++... . ..++ T Consensus 76 ~id~~k~~-~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~---------l~~~a~---~-~~gt--------- 132 (418) T protein:vir:10 76 KIAYQEHV-GLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLAL---------TLKKAF---H-SSGT--------- 132 (418) T ss_pred EEeccccc-ceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHH---------HHhhcc---c-cccc--------- Confidence 55555544 45666666666777888888888999999999987653 111110 0 0000 Q ss_pred ecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhhh-ccCCCCc---ccc---ccCCCceeEec Q lcl|NC_019921. 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT-HLNANGV---YVT---ALPFNLNVIES 303 (381) Q Consensus 231 ~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~-~~~~~G~---~~~---~l~~G~pVv~s 303 (381) .......++.+.++...|.. ...+-.++-..+++|..++.+.+-.. ..+..|. +-. +..+|..|+.| T Consensus 133 --~gt~~~~~~~i~~a~~~Ld~----~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G~IG~i~GF~V~~S 206 (418) T protein:vir:10 133 --PGVRPGAFIDFANAGAKQTT----YAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMGYRGNVAAYEVYES 206 (418) T ss_pred --CCcCcchHHHHHHHHHHHHh----cCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhheeeeeeeeceEEEEe Confidence 00111235556665555432 12222233456789988776643211 1121111 111 23479999999 Q ss_pred CCCCCCc--------EEEEeecc-eEEEeecceEEEeeh-hhhhhcCceEEEEEE---EEcCEEe-cCceEEEE------ Q lcl|NC_019921. 304 TVQEAGK--------VLTYVKGL-YDGYLAGGINVQKFK-ETLALDDMDLYTAKQ---FAYGKAK-DNKVAAVW------ 363 (381) Q Consensus 304 ~~~p~~~--------i~fgd~~~-y~i~~r~~i~i~~~~-~~~~~~d~~~~r~~~---r~dGk~~-~~~Afvv~------ 363 (381) .++|... .+.|-... ..+...++ ..+. -..-..|...|-+.. .+.+.+. +..=|+|. T Consensus 207 ~nip~~tag~~~~t~~v~ga~~~~~~~~~~~~---t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~ 283 (418) T protein:vir:10 207 QNLPKHTVGDHGGTPLVNGTVVNGDTVGFDGG---TASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTD 283 (418) T ss_pred cCCCcccccccccceeeecccccceeEEEeec---ceeeccceeeccEEEECceeecccccccccccceEEEEEeecccc Confidence 9999532 12222111 11111111 0000 000111222222211 0000100 11223332 Q ss_pred -----EEEecCCc-cccccCcc------------cC Q lcl|NC_019921. 364 -----KLDLKGHK-PALEGTEE------------TL 381 (381) Q Consensus 364 -----~~~~~~~~-~~~~~~~~------------~~ 381 (381) +|+|.+.- +...+.+. ++ T Consensus 284 ~~~~~tv~i~p~~~~~~~~~~~~~~~~~~~~~~~~v 319 (418) T protein:vir:10 284 AGGAGSIKISPSLNDGTATINNENGDPVSLTAYQNV 319 (418) T ss_pred ccCcceeEeccccccccccccccccccccccCCCcc Confidence 23332211 00111111 11 No 200 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=40.37 E-value=0.97 Score=20.65 Aligned_cols=341 Identities=9% Similarity=-0.048 Sum_probs=145.0 Q ss_pred Cchh--------------------H---HHHHHHHHHHHHHHHhhhhhHH-----------HHHHHHHHHHHHHHHHHH- Q lcl|NC_019921. 1 MTIN--------------------L---SETFANAKNEFINAVNNGEPQE-----------RQNELYGDMINQLFEETK- 45 (381) Q Consensus 1 mt~e--------------------l---~~~~~~~~~~~~~~~k~~~~~~-----------~~~~~~~~~~~~~~~~~~- 45 (381) .++. + .-..++.+..+++.+....... ..+..-+.+.+++..... T Consensus 240 aRi~~I~~l~a~Fggr~~~l~~~~l~d~~~s~e~ar~~il~~l~~~~~p~~~~~~~~~~~~~g~~~~d~~~~aL~~R~g~ 319 (652) T protein:vir:79 240 ARVNGINDLFAMFGGRYQTLQAQCLADPECSLEQAREKLLNEMGRESTPSNKNTPAHIYAGNGNFVGDGIRQALMARAGF 319 (652) T ss_pred HHHHHHHHHHHhhccccchHHHHHhhccCCCHHHHHHHHHHHHHhhcCCCCCCcceeEeeccchhhHHHHHHHHHhhcCC Confidence 1111 0 0012233333333331100000 000000111111111100 Q ss_pred ------HH----HHHHHHHHHHhhccccccCHHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhh-----hhhhhhc Q lcl|NC_019921. 46 ------LQ----AKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTN-----HPLLADL 110 (381) Q Consensus 46 ------~~----~~~~~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~-----~~l~~~~ 110 (381) ++ .-.+..+..+..+|.........+.....-+.++++ .|.-+.+-+.+.|.+. ...+..| T Consensus 320 ~~~~~~~~~~g~~L~elAr~~L~~~G~~~~~~~~~~~v~~A~~hsTsD----Fp~IL~~~~nk~l~~~y~~a~~t~~~~~ 395 (652) T protein:vir:79 320 EKTERDNVYNGMTLREYARMSLTERGIGVSSYNPMQMVGAAFTHSTSD----FGNILLDVANKAILQGWEDAPETYEQWT 395 (652) T ss_pred cccccCccccCccHHHHHHHHHHhhccCCCCCCHHHHHHHHhhcCcch----HHHHHHHHHHHHHHHHHhhhHHHHHHHh Confidence 00 001122223333333222112222222211223332 3443333333333222 1344555 Q ss_pred eeEecCC--ceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHH Q lcl|NC_019921. 111 GIKNAGL--RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFA 188 (381) Q Consensus 111 ~v~~~~g--~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~ 188 (381) +..+++- ..+...-.+-+.-.-|.|+++.+.-+-.. +.-++...+++.++.||++.+-.-..+.-.-|-..++++.+ T Consensus 396 ~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e-~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~ 474 (652) T protein:vir:79 396 RKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYVTTGD-KQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAK 474 (652) T ss_pred ccCCCccccccceeecCCCCCccccCCCCccceeeecC-ccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHH Confidence 5544432 12233345556666788888876532222 56678889999999999999887778888888899999999 Q ss_pred HHHhhhe---eeccCCC--cceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccC Q lcl|NC_019921. 189 VALETAF---LKGTGKD--QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKG 263 (381) Q Consensus 189 ~~~~~a~---i~G~G~~--~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~ 263 (381) +.++..+ |.++.+- --+.+|....-++ ..+. +...+..+......+.....+...-... T Consensus 475 ~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~N------------l~~~----aa~~~~~l~~ar~aM~~Qk~g~~~l~i~ 538 (652) T protein:vir:79 475 STIADLVYAILTSNPKISTDNVSLFDKAKHAN------------VLES----AAMDVASLDKARQLMRVQKEGERHLNIR 538 (652) T ss_pred HHHHHHHHHHHhcCcccccCCceeeccccccc------------cccc----ccCCHHHHHHHHHHHHHhccCCcccccc Confidence 8888543 4444321 1123441111000 0000 0111222222222222211221111122 Q ss_pred ceEEEEchhhHHHHhhh---hhccCCC--CccccccCCC-ceeEecCCCCCCc---EEEEeec-c--e---EEEeecceE Q lcl|NC_019921. 264 NVTMVVNPSDAFEVQAQ---YTHLNAN--GVYVTALPFN-LNVIESTVQEAGK---VLTYVKG-L--Y---DGYLAGGIN 328 (381) Q Consensus 264 ~a~~~mn~~t~~~~~~~---~~~~~~~--G~~~~~l~~G-~pVv~s~~~p~~~---i~fgd~~-~--y---~i~~r~~i~ 328 (381) ...|++.+.-......+ ....+++ .+.++.+ .| ..|+.++.+.++. .++++-. . + ++.-..+.. T Consensus 539 P~~llvp~~le~~a~~ll~s~~v~~a~~~~~~~Np~-~~~~~~i~eprL~~~s~~~wylaa~~~~dtiev~yL~G~~~P~ 617 (652) T protein:vir:79 539 PAFVLVPTAMESVANQVIRSSSVKGADINAGIINPV-KDFATVIAEPRLDDNSQTTFYLAASKGSDTIEVAYLNGVDTPY 617 (652) T ss_pred ccEEEecchhHHHHHHHhccCCCccccccccccccc-ccccccccccccCCCCcccEEEecCCCCCeEEEEEecCCCCCe Confidence 34577766543322221 1112111 1111111 12 2566666664432 2223211 1 1 122234555 Q ss_pred EEeehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecC Q lcl|NC_019921. 329 VQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) Q Consensus 329 i~~~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~ 369 (381) |+.- ..|..|-+-||++.-++.+++|=-.++- -++ T Consensus 618 ie~~--~gf~~dG~~~kvrlD~G~~~iD~RG~~k----~t~ 652 (652) T protein:vir:79 618 IDQM--EGFSVDGVTTKVRIDAGVAPVDHRGLVK----CTA 652 (652) T ss_pred eeec--CCCCcceEEEEEEEeccCceeeccceee----ecC Confidence 5543 3599999999999999999999998763 223 No 201 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=39.91 E-value=1 Score=20.59 Aligned_cols=291 Identities=8% Similarity=-0.028 Sum_probs=113.4 Q ss_pred HHHHHhhccccccCHHHHHH-------HHHHhhccCCCCceeccHHHHHHHHHHHHhhh---hhhhhceeEecCC-ceEE Q lcl|NC_019921. 53 ERVSSLPKSAQSLSANQRSF-------FMDINKNVNYKEEKLLPEETIDRIFEDLTTNH---PLLADLGIKNAGL-RLKF 121 (381) Q Consensus 53 ~~~~~~~~~~~~lt~~e~~~-------~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~~---~l~~~~~v~~~~g-~~~~ 121 (381) ...+ .-.+.+.+..+-+.+ +....-..-.-+-...-+-++..+-+.+...+ ++.---.....+| .++| T Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkI 79 (329) T protein:vir:10 1 MDGI-FITGVKTMNKEIKNATGKLKLNLQHFANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTV 79 (329) T ss_pred CCce-EEechhhhhhhhhcccceeEEehhhhcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEE Confidence 0000 000111111110000 00000000000111222333333333333221 2111111233344 4889 Q ss_pred EEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhhhcCHHHH--HHHHHHHHHHHHHHHHhhheee-- Q lcl|NC_019921. 122 LKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWI--ERFVRVQIEEAFAVALETAFLK-- 197 (381) Q Consensus 122 p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~--e~~l~~~la~~~~~~~~~a~i~-- 197 (381) |+-+..+-..+---++-..+..+.++..++|...+...+.- ..-=.+.+...+ ..-+.+.....++-.+|...+. T Consensus 80 p~i~~~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~V-D~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skl 158 (329) T protein:vir:10 80 IKGDVTELKDYKRNATNEFDHPQIQETTYFLDQEKYWGRFV-DALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATL 158 (329) T ss_pred eeecccccccccCCCCccccccccceeEEEeecccceeeec-chhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHH Confidence 98765443333211121222334456667777766655432 111122222222 2222333333443344432221 Q ss_pred --ccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHH Q lcl|NC_019921. 198 --GTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAF 275 (381) Q Consensus 198 --G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~ 275 (381) +.|+. .. +...+...++.+.++...+... .. + .+-+.+|+|..+. T Consensus 159 a~~a~~~--------------~~-------------~~~t~~nay~~i~~a~~~Lde~--~v--p--~~Rvl~VtP~~~~ 205 (329) T protein:vir:10 159 ARNKAKH--------------LT-------------VGSGADAQYDAVLDVSVELDEI--GA--G--ASRILFVTPKFYK 205 (329) T ss_pred Hhhcccc--------------cc-------------cccCHHHHHHHHHHHHHHHHhc--CC--C--CCcEEEeCHHHHH Confidence 11110 00 0112334455555555554321 11 1 3445678888776 Q ss_pred HHhhhh-hccCC----CCcccc--ccCCCceeEec--CCCCCCcEEEEeecceE-EEeecceEEEeehhhhhhcCceEEE Q lcl|NC_019921. 276 EVQAQY-THLNA----NGVYVT--ALPFNLNVIES--TVQEAGKVLTYVKGLYD-GYLAGGINVQKFKETLALDDMDLYT 345 (381) Q Consensus 276 ~~~~~~-~~~~~----~G~~~~--~l~~G~pVv~s--~~~p~~~i~fgd~~~y~-i~~r~~i~i~~~~~~~~~~d~~~~r 345 (381) .+.... ..... .+..-. ....|.+|+.. ..|+.-.+++|..+... +.....+++....+-. +...|+ T Consensus 206 ~Lk~~~~f~~~~~~~~~~~~~g~Vg~idG~~Ii~vps~~~k~in~ii~~~~A~~~~~K~~~~~~~~p~~~~---~a~~v~ 282 (329) T protein:vir:10 206 GIKKFVIELPQGDNRQQVLGKGVQGELDGFTIVKVPSKMLQGVEAMAVIGEVMASPIQANEAKLNSNVPGM---FGTLAE 282 (329) T ss_pred HHHhhhhhhccccccccceeeeeeeeecCeEEEEecCCcccceeEEEEcCCceeeeeeeeeeeeeCCCCcc---chheee Confidence 654311 11111 111100 12358888864 34444345666554432 2222234443322222 346899 Q ss_pred EEEEEcCEEecCceEEEEEEEe-cCCccccccCcccC Q lcl|NC_019921. 346 AKQFAYGKAKDNKVAAVWKLDL-KGHKPALEGTEETL 381 (381) Q Consensus 346 ~~~r~dGk~~~~~Afvv~~~~~-~~~~~~~~~~~~~~ 381 (381) +..+.|.+++++++..++...- +.......+-|.|+ T Consensus 283 gr~yyd~~V~~~k~~~I~~~~~~a~~~~~~~~~~~~~ 319 (329) T protein:vir:10 283 QMLYTGAFVPEHLQKYIFTIGGKEVETNRDGVDAHAD 319 (329) T ss_pred eeeeeeeEEEccccCEEEEecccCcccCCCCCCcccc Confidence 9999999999999766655333 22222222233344 No 202 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=34.48 E-value=1.3 Score=19.98 Aligned_cols=269 Identities=10% Similarity=0.053 Sum_probs=117.1 Q ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe----c----CCceEEEEecCCcceEEeec-ccccc-cc-cC Q lcl|NC_019921. 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKN----A----GLRLKFLKSETSGVAVWGKI-YGEIK-GQ-LD 144 (381) Q Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~~~l~~~~~v~~----~----~g~~~~p~~~~~~~a~wv~e-~~~~~-~~-~~ 144 (381) |. ... --.||+-+..++++.+++...+-++++.-- . +-.++|++............ ...+. ++ .+ T Consensus 1 MA--N~l--lT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e 76 (423) T protein:vir:35 1 MA--NNL--ESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFS 76 (423) T ss_pred Cc--cch--hhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCcccccccc Confidence 11 000 013799999999999999998888776521 1 22466765433221111111 01111 11 11 Q ss_pred cceeeEeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccc Q lcl|NC_019921. 145 AAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPE 224 (381) Q Consensus 145 ~~f~~v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~ 224 (381) .+ -.+.|..+|+.++ +++.+=+..+..+++.++...+ .+++..++..++.--=.+-|. ..| T Consensus 77 ~~-v~l~id~~k~~a~-~v~d~e~~l~i~~~~~~l~~a~-~ala~~vd~~l~~~l~~~a~~------------~vg---- 137 (423) T protein:vir:35 77 AK-ATGKVGKYITVAV-EWTQIEEALKLNQLDQILSPIH-ERMVTDLETELAHFMMNNGAL------------SLG---- 137 (423) T ss_pred ce-eeEEeccceeccc-eeCHHHHHhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhcccc------------ccc---- Confidence 12 2466666666655 4444444446778888777664 678888887775410000010 000 Q ss_pred eeeeeeecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh--hccCCCC---cc----ccccC Q lcl|NC_019921. 225 KEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY--THLNANG---VY----VTALP 295 (381) Q Consensus 225 ~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~--~~~~~~G---~~----~~~l~ 295 (381) + ...+...++.+.++...|.. ...|. ++-..+++|..+..+++.. ......+ .+ +.+.. T Consensus 138 -----t--~~t~~~~~~~i~~a~~~Ld~--~~vP~---~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i 205 (423) T protein:vir:35 138 -----S--PNTAIKKWADVAQTASFIKD--IGIKT---GENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGNF 205 (423) T ss_pred -----c--ccCCcchHHHHHHHHHHHHH--hcCCc---CCCEEEeCHHHHHHHhccccceeccccchhHHHhhccceeee Confidence 0 00111234556666555532 12222 3344588998887765321 1111111 11 22234 Q ss_pred CCceeEecCCCCCCc-------EEE-----------EeecceEEEeecceEEEeehhhhhhcCceEEEEEEEEc---CEE Q lcl|NC_019921. 296 FNLNVIESTVQEAGK-------VLT-----------YVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAY---GKA 354 (381) Q Consensus 296 ~G~pVv~s~~~p~~~-------i~f-----------gd~~~y~i~~r~~i~i~~~~~~~~~~d~~~~r~~~r~d---Gk~ 354 (381) +|..|+.|.++|..+ +.. .+.+++.+...+ ..+... ...-..|...|=|..-++ +.+ T Consensus 206 ~GFdv~~Snnvp~~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~~~~~-~~~~~~-g~l~~GD~~t~aGv~~v~~~t~~~ 283 (423) T protein:vir:35 206 GGIRALMSNGLASRKQGDFDGAITVKTAPNVDYLSVKDSYQFTVALTG-ATPSKT-GFLKAGDQLKFTSTHWLNQQSKQT 283 (423) T ss_pred cceEEEEcCCCccccccccccceeeccccccccccccccccceeeeee-eeeccC-CcEEecceEEeeeeeeccccccce Confidence 799999999999532 110 011121111111 111111 112223444343332221 111 Q ss_pred e------cCceEEEEEE--EecCCccccccCcccC Q lcl|NC_019921. 355 K------DNKVAAVWKL--DLKGHKPALEGTEETL 381 (381) Q Consensus 355 ~------~~~Afvv~~~--~~~~~~~~~~~~~~~~ 381 (381) + ...=|+|..- ..++....+...|--+ T Consensus 284 ~~~~~t~~~~~~~V~~~~~~~a~g~~~v~i~p~~~ 318 (423) T protein:vir:35 284 LYNGSTAMSFTATVLEETNSTASGDVTVKLSGVPI 318 (423) T ss_pred eecccCCceeEEEEeccccccccCceeEEcccccc Confidence 1 1122333211 1122333344444322 No 203 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=30.75 E-value=1.6 Score=19.54 Aligned_cols=347 Identities=10% Similarity=0.054 Sum_probs=122.4 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhhhHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHH---HH Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNGEPQERQNE----LYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRS---FF 73 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~---~~ 73 (381) |+|+-++++.++=+-+++.-.--+....++. .++...+.+++.. ++++.........-|++.+.. -. T Consensus 1 ~~~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~a~~~enq~~~~~~~~------~~~~~~~~~~~~~~l~e~~~~~~~~~ 74 (521) T protein:vir:72 1 MTIKTKAELLNKWKPLLEGEGLPEIANSKQAIIAKIFENQEKDFQTAP------EYKDEKIAQAFGSFLTEAEIGGDHGY 74 (521) T ss_pred CCcchhHHHHHhhhhhhccCCCCccccchhhhhhhhhhhhhhhhhhcc------cccchHHHHHHhhhhhhhcccCcccc Confidence 9999999998888877765110011111111 1222111111111 111110000000001100000 00 Q ss_pred --HHHhhccCCCCceeccHHHHHHHHHHHHhh---hhhhhhceeEecCCce------EEEEecCC--------------c Q lcl|NC_019921. 74 --MDINKNVNYKEEKLLPEETIDRIFEDLTTN---HPLLADLGIKNAGLRL------KFLKSETS--------------G 128 (381) Q Consensus 74 --~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~---~~l~~~~~v~~~~g~~------~~p~~~~~--------------~ 128 (381) ..+.+++.+ |... .+.-.++...|+. =.-.+++.|+|++|.. +--..... + T Consensus 75 ~~~~iaes~~t-~~v~---~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~ea~~~e~~~ 150 (521) T protein:vir:72 75 NATNIAAGQTS-GAVT---QIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPVAAGAKEAFHPMYGP 150 (521) T ss_pred Ccccccccccc-cccc---cCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCcccccccchhccc Confidence 000111111 1110 1111122222211 1123456666654321 10000000 0 Q ss_pred ceEEee---------------------------------------------------------------------c---- Q lcl|NC_019921. 129 VAVWGK---------------------------------------------------------------------I---- 135 (381) Q Consensus 129 ~a~wv~---------------------------------------------------------------------e---- 135 (381) .+.|.+ . T Consensus 151 da~fSG~~~~~~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~T 230 (521) T protein:vir:72 151 DAMFSGQGAAKKFPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMAT 230 (521) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCceeeeecccch Confidence 000000 0 Q ss_pred -ccc----cccccCcceeeEeecceeEEEe-------eeccHHhhhc----CHHHHHHHHHHHHHHHHHHHHhhheeecc Q lcl|NC_019921. 136 -YGE----IKGQLDAAFSEETAIQNKLTAF-------VVLPKDLNDF----GPAWIERFVRVQIEEAFAVALETAFLKGT 199 (381) Q Consensus 136 -~~~----~~~~~~~~f~~v~l~~~kl~~~-------~~iS~ell~d----s~~~~e~~l~~~la~~~~~~~~~a~i~G~ 199 (381) .++ .-..++..|.++.|...|.+.- ...|-||.+| -.+|.|+.|.+-|+..|..-++..||. . T Consensus 231 a~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~-~ 309 (521) T protein:vir:72 231 SIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVD-W 309 (521) T ss_pred hhhhhhcccCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhh-h Confidence 000 0011223466777766665532 3467677665 467899999999999999999999883 2 Q ss_pred C-CC---cceEeeeccccccccccccccceeeeeeeccccc------chhHHHHHHHHHHhhhccccccccc-cCceEE- Q lcl|NC_019921. 200 G-KD---QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANP------RATVNELTQVFKYHSTNEKGKSVAV-KGNVTM- 267 (381) Q Consensus 200 G-~~---~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~------~~~~~~l~~l~~~l~~~~~~~~~~~-~~~a~~- 267 (381) = .. +-.|+-+. .....|.+.+.++ .-....+..|+..+-...+...+.. |+.+.| T Consensus 310 i~~sa~~g~~g~t~~-------------~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~ 376 (521) T protein:vir:72 310 INYSAQVGKSGMTLT-------------PGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFI 376 (521) T ss_pred hhheeeeeeeeeeec-------------cCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEE Confidence 0 00 01122100 0001112211111 1122222223333222222222222 244544 Q ss_pred EEchhhHHHHhhhh---hc---cC-CCC-------c-cccccCCCceeEecCCCCCCcEEEEeecceEEEeecceEEEe- Q lcl|NC_019921. 268 VVNPSDAFEVQAQY---TH---LN-ANG-------V-YVTALPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQK- 331 (381) Q Consensus 268 ~mn~~t~~~~~~~~---~~---~~-~~G-------~-~~~~l~~G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~i~i~~- 331 (381) ++++.-. .++... +. ++ +.| . +...+.-+++|+++.++|.+-++.|.--..-+ ..|+-... T Consensus 377 i~S~~Va-~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~--~~glfyaPY 453 (521) T protein:vir:72 377 IASRNVV-NVLASVDTGISYAAQGLATGFSTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEM--DAGIYYAPY 453 (521) T ss_pred EEchHHH-HHHhhcccccccccccccccccccCCCceEEEEccCceEEEecCCCCcceEEEEEeCCccc--ccceeeccc Confidence 4555433 332210 00 11 111 1 12234457889999888876666664311100 01111111 Q ss_pred --------ehhhhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 332 --------FKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 332 --------~~~~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) .|...|. -.++|+.+ + |-.++|-+ . ..+..+..--..+.|+-+ T Consensus 454 v~l~~~~~~dp~sfq-P~~g~~tR--Y-~l~~NP~~--~-~~~~~~a~~i~~~~~~~~ 504 (521) T protein:vir:72 454 VALTPLRGSDPKNFQ-PVMGFKTR--Y-GIGINPFA--E-SAAQAPASRIQSGMPSIL 504 (521) T ss_pred cccccccccCCcccc-ceeeeeee--e-ceeecCcc--c-ccCcccceeecCcChhhh Confidence 1111111 11222221 1 22233322 1 112222222333344443 No 204 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=29.83 E-value=1.6 Score=19.43 Aligned_cols=309 Identities=12% Similarity=0.056 Sum_probs=131.1 Q ss_pred HhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCHHHH-HHHHHHhhcc--CCCCceeccHHHHHHH Q lcl|NC_019921. 20 VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQR-SFFMDINKNV--NYKEEKLLPEETIDRI 96 (381) Q Consensus 20 ~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~-~~~~~~~~~~--~~~gg~lvP~~~~~~I 96 (381) +++.+. -++.+.++ +-.......++.+-. .++.+...+. .+.+.--||..+.+-| T Consensus 1 ~~~~~~-~~~l~~~g---------------------i~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~i 58 (336) T protein:vir:78 1 MRDAQR-IQNLARAG---------------------VILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYV 58 (336) T ss_pred CchHHH-HHHHhccC---------------------eecchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHHHhc Confidence 000000 00000000 000000000111100 0111111110 1111112455444433 Q ss_pred H-HHHHhhhhhhhhceeEe---cC----CceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeeccHHhh Q lcl|NC_019921. 97 F-EDLTTNHPLLADLGIKN---AG----LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLN 168 (381) Q Consensus 97 ~-~~l~~~~~l~~~~~v~~---~~----g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell 168 (381) - +.++...+-+....+++ .+ ..+.++..+..+.+.+.+-.... +..+..-.+.+-..+.+..-+.++..=+ T Consensus 59 ~p~~~~~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~-P~vd~~~~~~~~~v~~~~~g~~yg~~El 137 (336) T protein:vir:78 59 DPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSD-GDSGTNINYPQRQSYFFQTWTRWGEREL 137 (336) T ss_pred ccceeeehhhhhhhhhhcccccCCCccccEEEEeeeecceeeEEeecccCC-CeeecceeeEEEEEEEEEeeeeecHHHH Confidence 1 22222223333333333 32 23456677777778777765555 4556666666677777777778884444 Q ss_pred hc---CHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHH Q lcl|NC_019921. 169 DF---GPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQ 245 (381) Q Consensus 169 ~d---s~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~ 245 (381) .- ...++.+--+...++++.+.++.-.+.|++..+-.|+|+++......+..+. . ....+++..++++.. T Consensus 138 ~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~-~------w~~~T~~~I~~Di~~ 210 (336) T protein:vir:78 138 EMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTP-W------SGSPAVEAVVNEVVT 210 (336) T ss_pred HHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeccccceEEEEeCCCCCcccccCcC-c------ccccCHHHHHHHHHH Confidence 43 3566888888888999999999999999998899999997654322211110 0 111234455566666 Q ss_pred HHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccccC---C-CceeEecCCCCCCcEEEEeecceEE Q lcl|NC_019921. 246 VFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP---F-NLNVIESTVQEAGKVLTYVKGLYDG 321 (381) Q Consensus 246 l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l~---~-G~pVv~s~~~p~~~i~fgd~~~y~i 321 (381) ++..+.....+.... .-..+++|.+.-+..+- ..++.|.-+.... | ++.|+..+...+ .-|+..+.+. T Consensus 211 ~~~~l~~qt~g~~~~-~~~~tL~Lp~~~~~~L~----~~n~~g~tv~~~lk~n~Pnl~i~t~pel~~---Agg~~~~~~~ 282 (336) T protein:vir:78 211 LFQVLQTQSQGIITQ-EAVLHMGLPPTAMSDLS----KTNQYGLSAAAKLKEIFPKLEFVTIPEYDT---ASGRLVQLWA 282 (336) T ss_pred HHHHHHHhcCCeeee-ccceEEEechHHHHhcc----CCCccCccHHHHHHHhcCccEEEEcccccc---cCcceEEEEE Confidence 666554433221111 11224556554433331 1233333222110 1 233444332211 1133222222 Q ss_pred Eeec---ceEEEeehhhhh---h--cCceEEEEEEEEcCEEe-cCceEEEEEEEecCC Q lcl|NC_019921. 322 YLAG---GINVQKFKETLA---L--DDMDLYTAKQFAYGKAK-DNKVAAVWKLDLKGH 370 (381) Q Consensus 322 ~~r~---~i~i~~~~~~~~---~--~d~~~~r~~~r~dGk~~-~~~Afvv~~~~~~~~ 370 (381) .+.. -..+.......+ . .-....-+..|..|-.+ .|-||+..+ +- T Consensus 283 ~~~~~~~t~~~~~p~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~----GI 336 (336) T protein:vir:78 283 PRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMI----GV 336 (336) T ss_pred eeccCCcceeeecchhhhccceeecCceeEeccccceeeeeeeccchheeec----cC Confidence 2211 122222111111 1 12223345566666654 455655322 22 No 205 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=29.01 E-value=1.7 Score=19.33 Aligned_cols=303 Identities=12% Similarity=0.046 Sum_probs=125.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccc------ccCHHHH-HHHHHHhhc--cCCCCceeccHHHHHHHH-HHHHhhhhh Q lcl|NC_019921. 37 INQLFEETKLQAKAEAERVSSLPKSAQ------SLSANQR-SFFMDINKN--VNYKEEKLLPEETIDRIF-EDLTTNHPL 106 (381) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~------~lt~~e~-~~~~~~~~~--~~~~gg~lvP~~~~~~I~-~~l~~~~~l 106 (381) ++..+. -.-+.+-|.. .++.+-. -.+.++..+ ..+.+..-||..+.+-|- ..++...+- T Consensus 1 ~~~~~~-----------~~~l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~ 69 (336) T protein:vir:36 1 MRDAQR-----------IQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAP 69 (336) T ss_pred CchHHH-----------HHHHhhcCeeecchhhhhhhHHHHhhhhhhhccCccccCCCcchHHHHHHhhccceEeeecch Confidence 000000 0000000100 0111100 011111111 011111235655554221 112222222 Q ss_pred hhhceeEe---cC----CceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeecc-HHhhh--cCHHHHH Q lcl|NC_019921. 107 LADLGIKN---AG----LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLP-KDLND--FGPAWIE 176 (381) Q Consensus 107 ~~~~~v~~---~~----g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS-~ell~--ds~~~~e 176 (381) +....+++ .+ ..+.++..+..+.+.+.+-.... +..+..-...+-..+.+..-+.++ .|+-. ....|+. T Consensus 70 ~~~~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~-P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~ 148 (336) T protein:vir:36 70 MKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSD-GDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLA 148 (336) T ss_pred hhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCC-ceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcH Confidence 33333333 22 13455666666777777765555 344544444445566777677777 44443 2345677 Q ss_pred HHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHHHHHHhhhcccc Q lcl|NC_019921. 177 RFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKG 256 (381) Q Consensus 177 ~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~l~~~l~~~~~~ 256 (381) +--+...++++.+.+|.-.+.|++..+-.|+|+++.-....+..+. .....++...++++..++..+....++ T Consensus 149 ~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t~-------~~~~~t~~ei~~Di~~~~~~l~~qt~G 221 (336) T protein:vir:36 149 SELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTP-------WSGSPAVEAVVNEVVALFQVLQTQSQG 221 (336) T ss_pred HHHHHHHHHHHHHhhCcEEEEeccccceEEEEecCCCccccccCCC-------cccccCHHHHHHHHHHHHHHHHHhcCC Confidence 7788888899999999989999998899999997654322211110 011123344556666666655543322 Q ss_pred ccccccCceEEEEchhhHHHHhhhhhccCCCCccccccC---C-CceeEecCCCCCCcEEEEeecceEEEeecc---eEE Q lcl|NC_019921. 257 KSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP---F-NLNVIESTVQEAGKVLTYVKGLYDGYLAGG---INV 329 (381) Q Consensus 257 ~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l~---~-G~pVv~s~~~p~~~i~fgd~~~y~i~~r~~---i~i 329 (381) .-.. ....+++|.+.-+..+ ...++.|.-++... | ++.++..+.... .-|+..++++....+ ..+ T Consensus 222 ~i~~-~~~~tL~LP~~~~~~L----s~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~---a~g~~~~l~~~~~~~~~t~~~ 293 (336) T protein:vir:36 222 IITQ-EDVLRMGLPPTAMSDL----SKTNQYGLAAAAKLKDIFPKLEFVTIPEYDT---ASGRLVQLWAPRVEGKDTATC 293 (336) T ss_pred eeee-ccccEEEechHHHHhc----cCCCccCccHHHHHHHhcCccEEEEcccccc---CCCceEEEEEEecCCCcceee Confidence 1111 1123455655433222 12233343222110 1 233433322211 112222222111111 222 Q ss_pred Eeehhhhh---hc--CceEEEEEEEEcCEEe-cCceEEEEEEEecCC Q lcl|NC_019921. 330 QKFKETLA---LD--DMDLYTAKQFAYGKAK-DNKVAAVWKLDLKGH 370 (381) Q Consensus 330 ~~~~~~~~---~~--d~~~~r~~~r~dGk~~-~~~Afvv~~~~~~~~ 370 (381) .......+ .. -....-+..|..|-.+ .|.||+..+ +- T Consensus 294 ~~p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~----GI 336 (336) T protein:vir:36 294 GFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMI----GV 336 (336) T ss_pred ecchhhhccceeecCceeEeccccceeeeeeeccchheeee----cC Confidence 21111111 11 1123345556666544 566666322 22 No 206 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=26.13 E-value=2 Score=18.96 Aligned_cols=347 Identities=12% Similarity=0.046 Sum_probs=110.5 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHhhccccccCHHHHHHH----- Q lcl|NC_019921. 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKA--EAERVSSLPKSAQSLSANQRSFF----- 73 (381) Q Consensus 1 mt~el~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~lt~~e~~~~----- 73 (381) |+|+-. ++.++=.-+++.--. .+-...........+.+...+..+. .++...+.....+-|.+.+..-. T Consensus 1 ~~~~~~-~l~~kw~p~l~~~~~---~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~ 76 (529) T protein:vir:10 1 MSLKTK-EILNKWTPLLEGEGL---PEIAGKNKQALVAQILEAQEKDSKTDPVYRDDKLIEAFGQSLMEAEVAGDHGYDP 76 (529) T ss_pred CccchH-HHHHHhhHhhcCCcc---chhcchhhhhhhhhhhhhHHHHhhcccccchhhhhhhhhhccchhhccccccccc Confidence 888755 344444444333110 1100011111111111111111111 11111111111111221111000 Q ss_pred HHHhhccCCCCceeccHHHHHHHHHHHHh---hhhhhhhceeEecCCce------E--EEEecC---------------- Q lcl|NC_019921. 74 MDINKNVNYKEEKLLPEETIDRIFEDLTT---NHPLLADLGIKNAGLRL------K--FLKSET---------------- 126 (381) Q Consensus 74 ~~~~~~~~~~gg~lvP~~~~~~I~~~l~~---~~~l~~~~~v~~~~g~~------~--~p~~~~---------------- 126 (381) ..+.+++. .|... .+.-.++...|+ .=.-.+++.|+|++|.. + .+.... T Consensus 77 ~~ia~s~~-t~~v~---~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~g~eaf~~~~e~dt 152 (529) T protein:vir:10 77 TNIAAGQS-SGAIT---NIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDA 152 (529) T ss_pred cccccccc-ccccc---cccchhhhhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCcccccccccccccc Confidence 00001111 11110 001111111111 00112334444443211 0 000000 Q ss_pred -------------------------------CcceEEeec---------------------------------------- Q lcl|NC_019921. 127 -------------------------------SGVAVWGKI---------------------------------------- 135 (381) Q Consensus 127 -------------------------------~~~a~wv~e---------------------------------------- 135 (381) .+...|..| T Consensus 153 ~~SG~~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~ 232 (529) T protein:vir:10 153 WHSGLAAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGELAE 232 (529) T ss_pred cccccccccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccccccccccccccc Confidence 000000000 Q ss_pred --------cccc----ccccCcceeeEeecceeEEEe-------eeccHHhhhc----CHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019921. 136 --------YGEI----KGQLDAAFSEETAIQNKLTAF-------VVLPKDLNDF----GPAWIERFVRVQIEEAFAVALE 192 (381) Q Consensus 136 --------~~~~----~~~~~~~f~~v~l~~~kl~~~-------~~iS~ell~d----s~~~~e~~l~~~la~~~~~~~~ 192 (381) .++. -..+.-.|.++.|...|.+.- ...|-||.+| -..|.|+.|.+-|+..|..-++ T Consensus 233 ~~~gmsTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEIN 312 (529) T protein:vir:10 233 IAEGMATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEIN 312 (529) T ss_pred cccccchhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhh Confidence 0000 011223466666666665432 3466677665 4678999999999999999999 Q ss_pred hheeeccC-C-Cc--ceEeeeccccccccccccccceeeeeeeccccc------chhHHHHHHHHHHhhhccccccccc- Q lcl|NC_019921. 193 TAFLKGTG-K-DQ--PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANP------RATVNELTQVFKYHSTNEKGKSVAV- 261 (381) Q Consensus 193 ~a~i~G~G-~-~~--P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~------~~~~~~l~~l~~~l~~~~~~~~~~~- 261 (381) ..||. .= . .+ -.|+-.. +. ...|.+++.++ .-....+..|+..+-...+...+.. T Consensus 313 Reii~-~i~~~a~~~~~g~~~~----~~---------~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~ 378 (529) T protein:vir:10 313 REVID-WINYTAQVGKSGWTQT----VG---------SAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTG 378 (529) T ss_pred HHHHH-Hhhhhceeeeeeeecc----cc---------ccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhc Confidence 99887 10 0 01 1122100 00 00111111111 0122222222222222222222222 Q ss_pred cCceEE-EEchhhHHHHhhhh----h---ccCCCC--------ccccccCCCceeEecCCCCCCcEEEEeecc--eEEEe Q lcl|NC_019921. 262 KGNVTM-VVNPSDAFEVQAQY----T---HLNANG--------VYVTALPFNLNVIESTVQEAGKVLTYVKGL--YDGYL 323 (381) Q Consensus 262 ~~~a~~-~mn~~t~~~~~~~~----~---~~~~~G--------~~~~~l~~G~pVv~s~~~p~~~i~fgd~~~--y~i~~ 323 (381) ++.+.+ ++++.-.. ++... + ...+.| .+...+.-+++|+++.+.|.+-++.|.-.. |. T Consensus 379 rg~~n~vi~S~~Va~-~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~--- 454 (529) T protein:vir:10 379 RGAGNFIIASRNVVS-ALALVDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGYRGANNLD--- 454 (529) T ss_pred cccceEEEEchHHHH-HHhhhccccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccc--- Confidence 234444 45554333 32210 0 011111 222234457889988888876666664311 11 Q ss_pred ecceEEEeehh---------hhhhcCceEEEEEEEEcCEEecCceEEEEEEEecCCccccccCcccC Q lcl|NC_019921. 324 AGGINVQKFKE---------TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) Q Consensus 324 r~~i~i~~~~~---------~~~~~d~~~~r~~~r~dGk~~~~~Afvv~~~~~~~~~~~~~~~~~~~ 381 (381) .|+-...+-+ ..|. -.++|+. |+ |-.++| |+.. ++-++.-.-+-++|-.. T Consensus 455 -~glfy~PYv~l~~~~~~dp~sfq-P~~g~~t--RY-~l~~NP--~~~~-~~~~~~~r~~~g~~~~~ 513 (529) T protein:vir:10 455 -AGIYYCPYVALTPLRGSDPKNFQ-PVMGFKT--RY-AIGVNP--FAES-RTQAPTSRISNGMPGAH 513 (529) T ss_pred -cceeeccccccccccccCCCccc-ceeeeee--ee-ceeecC--cccc-ccccccccccCCcchhh Confidence 1221111111 1111 1122222 11 222333 1110 00000111122333222 No 207 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=21.93 E-value=2.5 Score=18.38 Aligned_cols=309 Identities=14% Similarity=0.096 Sum_probs=125.0 Q ss_pred HhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCHHHHH-HHHH--HhhccCCCCceeccHHHHHHH Q lcl|NC_019921. 20 VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRS-FFMD--INKNVNYKEEKLLPEETIDRI 96 (381) Q Consensus 20 ~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~e~~-~~~~--~~~~~~~~gg~lvP~~~~~~I 96 (381) +++.+. -++.+.++ +...+....++.+-.. ++.+ ........+...+|.-+.+-| T Consensus 1 ~~~~~~-~~~l~~~g---------------------i~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~i~~~l~~~i 58 (336) T protein:vir:10 1 MRDAQR-IQNLARAG---------------------VILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYV 58 (336) T ss_pred CchHHH-HHHHhhcC---------------------eeecchhhhhhhhHHHhhhhhhhccCccccCCCchhHHHHHhhc Confidence 000000 00000000 0000000001110000 0001 011111111223555444333 Q ss_pred ----HHHHHhhhhhhhhceeEecC----CceEEEEecCCcceEEeecccccccccCcceeeEeecceeEEEeeecc-HHh Q lcl|NC_019921. 97 ----FEDLTTNHPLLADLGIKNAG----LRLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLP-KDL 167 (381) Q Consensus 97 ----~~~l~~~~~l~~~~~v~~~~----g~~~~p~~~~~~~a~wv~e~~~~~~~~~~~f~~v~l~~~kl~~~~~iS-~el 167 (381) ++.+..---...+.-+.+.+ ..+.++..+..+.+.+.+-.... +..+..-...+-..+.+..-+.++ .|+ T Consensus 59 ~p~~~~~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~-P~~d~~~~~~~~~v~~~~~g~~yg~~El 137 (336) T protein:vir:10 59 DPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSD-GDSGANINYPQRQSYFFQTWTRWGEREL 137 (336) T ss_pred ccceeeehhhhhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCC-ceeecccceeeeeEEEEEeeeeeCHHHH Confidence 22222211122223233322 13455666667777777765555 344544444445566777777788 444 Q ss_pred hh--cCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeeeeecccccchhHHHHHH Q lcl|NC_019921. 168 ND--FGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQ 245 (381) Q Consensus 168 l~--ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~ 245 (381) -. ....|+.+--+...++++.+.+|.-.+.|++..+-.|+|+++......+..+. .....+++..++++.. T Consensus 138 ~~A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~-------~~~~~t~eei~~Di~~ 210 (336) T protein:vir:10 138 EMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTP-------WSGSPAVEAVVNEVVA 210 (336) T ss_pred HHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEeCCCCccccccCCC-------cccccCHHHHHHHHHH Confidence 33 33456778888888999999999999999998899999997654322211110 0111233445556666 Q ss_pred HHHHhhhccccccccccCceEEEEchhhHHHHhhhhhccCCCCccccccC---C-CceeEecCCCCCCcEEEEeecceEE Q lcl|NC_019921. 246 VFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP---F-NLNVIESTVQEAGKVLTYVKGLYDG 321 (381) Q Consensus 246 l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l~---~-G~pVv~s~~~p~~~i~fgd~~~y~i 321 (381) ++..+.....+.-.. ....+++|.+.-+..+ ...++.|.-++... | ++.++..+... -.-|...++++ T Consensus 211 ~~~~l~~qs~G~i~~-~~~~tL~LP~~~~~~L----s~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~---~a~G~~~~l~~ 282 (336) T protein:vir:10 211 LFQVLQTQSQGIITQ-EDVLRMGLPPTAMSDL----SKTNQYGLAAAAKLKDIFPKLEFVTIPEYD---TASGRLVQLWA 282 (336) T ss_pred HHHHHHHhcCCeecc-cCcceEEecHHHHHhc----cCCCccCccHHHHHHHhcCccEEEEccccc---cCCCceEEEEE Confidence 665554433221011 1123455655433222 12233343222110 1 23343332221 11122222221 Q ss_pred Eeecc---eEEEeehhhhh---hc--CceEEEEEEEEcCEEe-cCceEEEEEEEecCC Q lcl|NC_019921. 322 YLAGG---INVQKFKETLA---LD--DMDLYTAKQFAYGKAK-DNKVAAVWKLDLKGH 370 (381) Q Consensus 322 ~~r~~---i~i~~~~~~~~---~~--d~~~~r~~~r~dGk~~-~~~Afvv~~~~~~~~ 370 (381) ....+ ..+.......+ .. -....-+..|..|-.+ .|.||+..+ +- T Consensus 283 ~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~----GI 336 (336) T protein:vir:10 283 PRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMI----GV 336 (336) T ss_pred EecCCCcceeeecchhhhccceeecCceeEeccccceeeeeeeccchheeee----cC Confidence 11111 22221111111 11 1123345556666544 566666322 22 No 208 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=21.03 E-value=2.7 Score=18.25 Aligned_cols=270 Identities=11% Similarity=-0.001 Sum_probs=120.3 Q ss_pred CCCce--eccHHHHHHHHHHHHhhhhhhhhceeEe--------cCCceEEEEecCCcceEEee-ccccc-ccccCcceee Q lcl|NC_019921. 82 YKEEK--LLPEETIDRIFEDLTTNHPLLADLGIKN--------AGLRLKFLKSETSGVAVWGK-IYGEI-KGQLDAAFSE 149 (381) Q Consensus 82 ~~gg~--lvP~~~~~~I~~~l~~~~~l~~~~~v~~--------~~g~~~~p~~~~~~~a~wv~-e~~~~-~~~~~~~f~~ 149 (381) =..-+ .+|+-+..++++.+++..++.++++.-- .+-.++|++-.......... ....+ .+...-.=-. T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e~~v~ 80 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCccccceeE Confidence 01111 2699999999999999998888776521 12246666543322211111 10111 1111111125 Q ss_pred EeecceeEEEeeeccHHhhhcCHHHHHHHHHHHHHHHHHHHHhhheeeccCCCcceEeeeccccccccccccccceeeee Q lcl|NC_019921. 150 ETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQG 229 (381) Q Consensus 150 v~l~~~kl~~~~~iS~ell~ds~~~~e~~l~~~la~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~ 229 (381) +.|..+|..++--=..|+. .+.-+++.++... .++++..+|..++.-- .+.+ . . ..+. T Consensus 81 l~id~~k~va~~v~d~E~~-~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~-~~~~--------~--~-~~gt-------- 138 (423) T protein:vir:10 81 GRVGNYITVAVEYQQLEEA-IKLNQLEEILAPV-RQRIVTDLETELAHFM-MNNG--------A--L-SLGS-------- 138 (423) T ss_pred EEeeceeeeeeeechHHHh-cChhhHHHHHHHH-HHHHHHHHHHHHHHHH-hhcc--------c--c-cccc-------- Confidence 7777777777655555654 5667788877655 5889999998776410 1100 0 0 0000 Q ss_pred eecccccchhHHHHHHHHHHhhhccccccccccCceEEEEchhhHHHHhhhh--hccCCCC---cc----ccccCCCcee Q lcl|NC_019921. 230 TLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY--THLNANG---VY----VTALPFNLNV 300 (381) Q Consensus 230 ~~t~~~~~~~~~~l~~l~~~l~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~--~~~~~~G---~~----~~~l~~G~pV 300 (381) ...+...++.+.++...|... ..|. ++-..+++|..+..+++.. ...+..+ .+ +.+..+|..| T Consensus 139 ---~~t~~~a~~~i~~a~~~Ld~~--~vP~---~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv 210 (423) T protein:vir:10 139 ---PNTPITKWSDVAQTASFLKDL--GVNE---GENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRA 210 (423) T ss_pred ---CCcccchHHHHHHHHHHHHhc--cCCc---CCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEE Confidence 000112245555555554321 2222 3344688998877765321 1121111 12 2123469999 Q ss_pred EecCCCCCCcE-EEE----eecceEE-----EeecceEEEee----hhhhh--hcCceEEEEE---EEEcCEEe------ Q lcl|NC_019921. 301 IESTVQEAGKV-LTY----VKGLYDG-----YLAGGINVQKF----KETLA--LDDMDLYTAK---QFAYGKAK------ 355 (381) Q Consensus 301 v~s~~~p~~~i-~fg----d~~~y~i-----~~r~~i~i~~~----~~~~~--~~d~~~~r~~---~r~dGk~~------ 355 (381) +.|..+|.... .++ .-..+.+ .+.....+... ..+.+ ..|...|=|. .+..+.++ T Consensus 211 ~~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~ 290 (423) T protein:vir:10 211 LMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATP 290 (423) T ss_pred EEeCCCccccccccccceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccC Confidence 99999996431 111 0001111 11111122111 11111 1333333332 22233322 Q ss_pred cCceEEEEEEEec--CCccccccCcccC Q lcl|NC_019921. 356 DNKVAAVWKLDLK--GHKPALEGTEETL 381 (381) Q Consensus 356 ~~~Afvv~~~~~~--~~~~~~~~~~~~~ 381 (381) ...-|+|..-..+ +....+...|--+ T Consensus 291 ~~~~~~v~a~~~~~~~g~~tv~i~p~~i 318 (423) T protein:vir:10 291 ISFTATVTADANSDSGGDVTVTLSGVPI 318 (423) T ss_pred cceEEEEEeeeeeccCCceeeeccCccc Confidence 2234444331111 1222344444333 Done!